WorldWideScience

Sample records for incorporating sequence quality

  1. Formatt: Correcting protein multiple structural alignments by incorporating sequence alignment

    Directory of Open Access Journals (Sweden)

    Daniels Noah M

    2012-10-01

    Full Text Available Abstract Background The quality of multiple protein structure alignments are usually computed and assessed based on geometric functions of the coordinates of the backbone atoms from the protein chains. These purely geometric methods do not utilize directly protein sequence similarity, and in fact, determining the proper way to incorporate sequence similarity measures into the construction and assessment of protein multiple structure alignments has proved surprisingly difficult. Results We present Formatt, a multiple structure alignment based on the Matt purely geometric multiple structure alignment program, that also takes into account sequence similarity when constructing alignments. We show that Formatt outperforms Matt and other popular structure alignment programs on the popular HOMSTRAD benchmark. For the SABMark twilight zone benchmark set that captures more remote homology, Formatt and Matt outperform other programs; depending on choice of embedded sequence aligner, Formatt produces either better sequence and structural alignments with a smaller core size than Matt, or similarly sized alignments with better sequence similarity, for a small cost in average RMSD. Conclusions Considering sequence information as well as purely geometric information seems to improve quality of multiple structure alignments, though defining what constitutes the best alignment when sequence and structural measures would suggest different alignments remains a difficult open question.

  2. Reduction of excess sludge production in sequencing batch reactor through incorporation of chlorine dioxide oxidation

    International Nuclear Information System (INIS)

    Wang Guanghua; Sui Jun; Shen Huishan; Liang Shukun; He Xiangming; Zhang Minju; Xie Yizhong; Li Lingyun; Hu Yongyou

    2011-01-01

    In this study, chlorine dioxide (ClO 2 ) instead of chlorine (Cl 2 ) was proposed to minimize the formation of chlorine-based by-products and was incorporated into a sequencing batch reactor (SBR) for excess sludge reduction. The results showed that the sludge disintegrability of ClO 2 was excellent. The waste activated sludge at an initial concentration of 15 g MLSS/L was rapidly reduced by 36% using ClO 2 doses of 10 mg ClO 2 /g dry sludge which was much lower than that obtained using Cl 2 based on similar sludge reduction efficiency. Maximum sludge disintegration was achieved at 10 mg ClO 2 /g dry sludge for 40 min. ClO 2 oxidation can be successfully incorporated into a SBR for excess sludge reduction without significantly harming the bioreactor performance. The incorporation of ClO 2 oxidation resulted in a 58% reduction in excess sludge production, and the quality of the effluent was not significantly affected.

  3. Reduction of excess sludge production in sequencing batch reactor through incorporation of chlorine dioxide oxidation

    Energy Technology Data Exchange (ETDEWEB)

    Wang Guanghua [Ministry of Education Key Laboratory of Pollution Control and Ecological Remediation for Industrial Agglomeration area, College of Environmental Science and Engineering, South China University of Technology, Guangzhou, 510006 (China); Guangzhou municipal engineering design and research institute, Guangzhou, 510060 (China); Sui Jun [Guangzhou municipal engineering design and research institute, Guangzhou, 510060 (China); Shen Huishan; Liang Shukun [Ministry of Education Key Laboratory of Pollution Control and Ecological Remediation for Industrial Agglomeration area, College of Environmental Science and Engineering, South China University of Technology, Guangzhou, 510006 (China); He Xiangming; Zhang Minju; Xie Yizhong; Li Lingyun [Nanhai Limited Liability Development Company, Foshan, 528200 (China); Hu Yongyou, E-mail: ppyyhu@scut.edu.cn [Ministry of Education Key Laboratory of Pollution Control and Ecological Remediation for Industrial Agglomeration area, College of Environmental Science and Engineering, South China University of Technology, Guangzhou, 510006 (China) and State Key Lab of Pulp and Paper Engineering, College of Light Industry and Food Science, South China University of Technology; Guangzhou, 510640 (China)

    2011-08-15

    In this study, chlorine dioxide (ClO{sub 2}) instead of chlorine (Cl{sub 2}) was proposed to minimize the formation of chlorine-based by-products and was incorporated into a sequencing batch reactor (SBR) for excess sludge reduction. The results showed that the sludge disintegrability of ClO{sub 2} was excellent. The waste activated sludge at an initial concentration of 15 g MLSS/L was rapidly reduced by 36% using ClO{sub 2} doses of 10 mg ClO{sub 2}/g dry sludge which was much lower than that obtained using Cl{sub 2} based on similar sludge reduction efficiency. Maximum sludge disintegration was achieved at 10 mg ClO{sub 2}/g dry sludge for 40 min. ClO{sub 2} oxidation can be successfully incorporated into a SBR for excess sludge reduction without significantly harming the bioreactor performance. The incorporation of ClO{sub 2} oxidation resulted in a 58% reduction in excess sludge production, and the quality of the effluent was not significantly affected.

  4. Analysis of quality raw data of second generation sequencers with Quality Assessment Software.

    Science.gov (United States)

    Ramos, Rommel Tj; Carneiro, Adriana R; Baumbach, Jan; Azevedo, Vasco; Schneider, Maria Pc; Silva, Artur

    2011-04-18

    Second generation technologies have advantages over Sanger; however, they have resulted in new challenges for the genome construction process, especially because of the small size of the reads, despite the high degree of coverage. Independent of the program chosen for the construction process, DNA sequences are superimposed, based on identity, to extend the reads, generating contigs; mismatches indicate a lack of homology and are not included. This process improves our confidence in the sequences that are generated. We developed Quality Assessment Software, with which one can review graphs showing the distribution of quality values from the sequencing reads. This software allow us to adopt more stringent quality standards for sequence data, based on quality-graph analysis and estimated coverage after applying the quality filter, providing acceptable sequence coverage for genome construction from short reads. Quality filtering is a fundamental step in the process of constructing genomes, as it reduces the frequency of incorrect alignments that are caused by measuring errors, which can occur during the construction process due to the size of the reads, provoking misassemblies. Application of quality filters to sequence data, using the software Quality Assessment, along with graphing analyses, provided greater precision in the definition of cutoff parameters, which increased the accuracy of genome construction.

  5. Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

    Science.gov (United States)

    Hong, Jungeui; Gresham, David

    2017-11-01

    Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.

  6. Standardization and quality management in next-generation sequencing.

    Science.gov (United States)

    Endrullat, Christoph; Glökler, Jörn; Franke, Philipp; Frohme, Marcus

    2016-09-01

    DNA sequencing continues to evolve quickly even after > 30 years. Many new platforms suddenly appeared and former established systems have vanished in almost the same manner. Since establishment of next-generation sequencing devices, this progress gains momentum due to the continually growing demand for higher throughput, lower costs and better quality of data. In consequence of this rapid development, standardized procedures and data formats as well as comprehensive quality management considerations are still scarce. Here, we listed and summarized current standardization efforts and quality management initiatives from companies, organizations and societies in form of published studies and ongoing projects. These comprise on the one hand quality documentation issues like technical notes, accreditation checklists and guidelines for validation of sequencing workflows. On the other hand, general standard proposals and quality metrics are developed and applied to the sequencing workflow steps with the main focus on upstream processes. Finally, certain standard developments for downstream pipeline data handling, processing and storage are discussed in brief. These standardization approaches represent a first basis for continuing work in order to prospectively implement next-generation sequencing in important areas such as clinical diagnostics, where reliable results and fast processing is crucial. Additionally, these efforts will exert a decisive influence on traceability and reproducibility of sequence data.

  7. Tidying up international nucleotide sequence databases: ecological, geographical and sequence quality annotation of its sequences of mycorrhizal fungi.

    Science.gov (United States)

    Tedersoo, Leho; Abarenkov, Kessy; Nilsson, R Henrik; Schüssler, Arthur; Grelet, Gwen-Aëlle; Kohout, Petr; Oja, Jane; Bonito, Gregory M; Veldre, Vilmar; Jairus, Teele; Ryberg, Martin; Larsson, Karl-Henrik; Kõljalg, Urmas

    2011-01-01

    Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi.

  8. PuLSE: Quality control and quantification of peptide sequences explored by phage display libraries.

    Science.gov (United States)

    Shave, Steven; Mann, Stefan; Koszela, Joanna; Kerr, Alastair; Auer, Manfred

    2018-01-01

    The design of highly diverse phage display libraries is based on assumption that DNA bases are incorporated at similar rates within the randomized sequence. As library complexity increases and expected copy numbers of unique sequences decrease, the exploration of library space becomes sparser and the presence of truly random sequences becomes critical. We present the program PuLSE (Phage Library Sequence Evaluation) as a tool for assessing randomness and therefore diversity of phage display libraries. PuLSE runs on a collection of sequence reads in the fastq file format and generates tables profiling the library in terms of unique DNA sequence counts and positions, translated peptide sequences, and normalized 'expected' occurrences from base to residue codon frequencies. The output allows at-a-glance quantitative quality control of a phage library in terms of sequence coverage both at the DNA base and translated protein residue level, which has been missing from toolsets and literature. The open source program PuLSE is available in two formats, a C++ source code package for compilation and integration into existing bioinformatics pipelines and precompiled binaries for ease of use.

  9. cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Budding yeast cDNA sequencing project cDNA sequence quality data Data detail Data name cDNA sequence quality... data DOI 10.18908/lsdba.nbdc00838-003 Description of data contents Phred's quality score. P...tion Download License Update History of This Database Site Policy | Contact Us cDNA sequence quality

  10. Incorporating Contaminant Bioavailability into Sediment Quality Assessment Frameworks

    Science.gov (United States)

    The recently adopted sediment quality assessment framework for evaluating bay and estuarine sediments in the State of California incorporates bulk sediment chemistry as a key line of evidence(LOE) but does not address the bioavailability of measured contaminants. Thus, the chemis...

  11. A Next-Generation Sequencing Approach Uncovers Viral Transcripts Incorporated in Poxvirus Virions

    Directory of Open Access Journals (Sweden)

    Marica Grossegesse

    2017-10-01

    Full Text Available Transcripts are known to be incorporated in particles of DNA viruses belonging to the families of Herpesviridae and Mimiviridae, but the presence of transcripts in other DNA viruses, such as poxviruses, has not been analyzed yet. Therefore, we first established a next-generation-sequencing (NGS-based protocol, enabling the unbiased identification of transcripts in virus particles. Subsequently, we applied our protocol to analyze RNA in an emerging zoonotic member of the Poxviridae family, namely Cowpox virus. Our results revealed the incorporation of 19 viral transcripts, while host identifications were restricted to ribosomal and mitochondrial RNA. Most viral transcripts had an unknown and immunomodulatory function, suggesting that transcript incorporation may be beneficial for poxvirus immune evasion. Notably, the most abundant transcript originated from the D5L/I1R gene that encodes a viral inhibitor of the host cytoplasmic DNA sensing machinery.

  12. Sequence Quality Analysis Tool for HIV Type 1 Protease and Reverse Transcriptase

    OpenAIRE

    DeLong, Allison K.; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W.; Kantor, Rami

    2012-01-01

    Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802...

  13. Quack: A quality assurance tool for high throughput sequence data.

    Science.gov (United States)

    Thrash, Adam; Arick, Mark; Peterson, Daniel G

    2018-05-01

    The quality of data generated by high-throughput DNA sequencing tools must be rapidly assessed in order to determine how useful the data may be in making biological discoveries; higher quality data leads to more confident results and conclusions. Due to the ever-increasing size of data sets and the importance of rapid quality assessment, tools that analyze sequencing data should quickly produce easily interpretable graphics. Quack addresses these issues by generating information-dense visualizations from FASTQ files at a speed far surpassing other publicly available quality assurance tools in a manner independent of sequencing technology. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Exome sequencing generates high quality data in non-target regions

    Directory of Open Access Journals (Sweden)

    Guo Yan

    2012-05-01

    Full Text Available Abstract Background Exome sequencing using next-generation sequencing technologies is a cost efficient approach to selectively sequencing coding regions of human genome for detection of disease variants. A significant amount of DNA fragments from the capture process fall outside target regions, and sequence data for positions outside target regions have been mostly ignored after alignment. Result We performed whole exome sequencing on 22 subjects using Agilent SureSelect capture reagent and 6 subjects using Illumina TrueSeq capture reagent. We also downloaded sequencing data for 6 subjects from the 1000 Genomes Project Pilot 3 study. Using these data, we examined the quality of SNPs detected outside target regions by computing consistency rate with genotypes obtained from SNP chips or the Hapmap database, transition-transversion (Ti/Tv ratio, and percentage of SNPs inside dbSNP. For all three platforms, we obtained high-quality SNPs outside target regions, and some far from target regions. In our Agilent SureSelect data, we obtained 84,049 high-quality SNPs outside target regions compared to 65,231 SNPs inside target regions (a 129% increase. For our Illumina TrueSeq data, we obtained 222,171 high-quality SNPs outside target regions compared to 95,818 SNPs inside target regions (a 232% increase. For the data from the 1000 Genomes Project, we obtained 7,139 high-quality SNPs outside target regions compared to 1,548 SNPs inside target regions (a 461% increase. Conclusions These results demonstrate that a significant amount of high quality genotypes outside target regions can be obtained from exome sequencing data. These data should not be ignored in genetic epidemiology studies.

  15. Speed Control of Multiphase Cage Induction Motors Incorporating Supply Sequence

    Directory of Open Access Journals (Sweden)

    Drozdowski Piotr

    2014-12-01

    Full Text Available The subject of this paper is the control possibility of the multiphase cage induction motors having number of phases greater than 3. These motors have additional properties for speed control that distinguish them from the standard 3 phase motors: operation at various sequences of supplying voltages due to the inverter control and possible operation with few open-circuited phases. For each supply sequence different no load speeds at the same frequency can be obtained. This feature extends the motor application for miscellaneous drive demands including vector or scalar control. This depends mainly on the type of the stator winding for a given number of phases, since the principle of motor operation is based on co-operation of higher harmonics of magnetic field. Examples of operation are presented for a 9-phase motor, though general approach has been discussed. This motor was fed by a voltage source inverter at field oriented control with forced currents. The mathematical model of the motor was reduced to the form incorporating all most important physical features and appropriate for the control law formulation. The operation was illustrated for various supply sequences for “healthy” motor and for the motor operating at one phase broken. The obtained results have shown that parasitic influence of harmonic fields interaction has negligible influence on motor operation with respect to the useful coupling for properly designed stator winding.

  16. A Systematic Approach to Quality Oriented Product Sequencing for Multistage Manufacturing Systems

    OpenAIRE

    Zhang, Faping; Butt, Shahid Ikramullah

    2016-01-01

    Product sequencing is one way to reduce cost and improve product quality for multistage manufacturing systems (MMS). However, systematically evaluating the influence of product sequence on quality performance for MMS is still a challenge. By considering the rate of incoming conforming product, manufacturing system quality transition between batch to batch, and quality propagation along stages, this paper investigates the appropriate batch policies and product sequencing for MMS so that satisf...

  17. Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

    Science.gov (United States)

    Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

    1997-01-01

    SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROTs. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.

  18. Incorporating Quality Control Information in the Sensor Web

    Science.gov (United States)

    Devaraju, Anusuriya; Kunkel, Ralf; Bogena, Heye

    2013-04-01

    The rapid development of sensing technologies had led to the creation of large amounts of heterogeneous environmental observations. The Sensor Web provides a wider access to sensors and observations via common protocols and specifications. Observations typically go through several levels of quality control, and aggregation before they are made available to end-users. Raw data are usually inspected, and related quality flags are assigned. Data are gap-filled, and errors are removed. New data series may also be derived from one or more corrected data sets. Until now, it is unclear how these kinds of information can be captured in the Sensor Web Enablement (SWE) framework. Apart from the quality measures (e.g., accuracy, precision, tolerance, or confidence), the levels of observational series, the changes applied, and the methods involved must be specified. It is important that this kind of quality control information is well described and communicated to end-users to allow for a better usage and interpretation of data products. In this paper, we describe how quality control information can be incorporated into the SWE framework. Concerning this, first, we introduce the TERENO (TERrestrial ENvironmental Observatories), an initiative funded by the large research infrastructure program of the Helmholtz Association in Germany. The main goal of the initiative is to facilitate the study of long-term effects of climate and land use changes. The TERENO Online Data RepOsitORry (TEODOOR) is a software infrastructure that supports acquisition, provision, and management of observations within TERENO via SWE specifications and several other OGC web services. Next, we specify changes made to the existing observational data model to incorporate quality control information. Here, we describe the underlying TERENO data policy in terms of provision and maintenance issues. We present data levels, and their implementation within TEODOOR. The data levels are adapted from those used by

  19. The Incorporation of Quality Attributes into Online Course Design in Higher Education

    Science.gov (United States)

    Lenert, Kathleen Anne; Diane P. Janes

    2017-01-01

    A survey was designed incorporating questions on 28 attributes (compiled through a literature review) and considered to be quality features in online academic courses in higher education. This study sought to investigate the ongoing practice of instructional designers and instructors in the United States with respect to their incorporation of…

  20. Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

    Science.gov (United States)

    Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

    2012-08-01

    Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or 15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.

  1. [Incorporation of Information and Communication Technologies and quality of primary healthcare in Brazil].

    Science.gov (United States)

    Santos, Alaneir de Fátima Dos; Fonseca, Délcio; Araujo, Lucas Lobato; Procópio, Cristiane da Silva Diniz; Lopes, Érica Araújo Silva; Lima, Angela Maria de Lourdes Dayrell de; Reis, Clarice Magalhães Rodrigues Dos; Abreu, Daisy Maria Xavier de; Jorge, Alzira Oliveira; Matta-Machado, Antonio Thomaz

    2017-06-05

    Information and Communication Technologies (ICTs) are means to handle information, streamline communication, and contribute to patient care. This article describes the incorporation of Information and Communication Technologies in primary care and its association with quality, based on the Brazilian National Program for the Improvement of Access and Quality in Primary Care (PMAQ in portuguese). This was a cross-sectional study with 17,053 teams. An Index of Incorporation of ICTs was created, which included: information infrastructure, systems, and utilization. Regression analysis was used to assess associations. Only 13.5% of the teams had a high degree of ICTs. The strongest association was seen in the utilization of information. ICTs can contribute to improving quality of primary care.

  2. Incorporating Handling Qualities Analysis into Rotorcraft Conceptual Design

    Science.gov (United States)

    Lawrence, Ben

    2014-01-01

    This paper describes the initial development of a framework to incorporate handling qualities analyses into a rotorcraft conceptual design process. In particular, the paper describes how rotorcraft conceptual design level data can be used to generate flight dynamics models for handling qualities analyses. Also, methods are described that couple a basic stability augmentation system to the rotorcraft flight dynamics model to extend analysis to beyond that of the bare airframe. A methodology for calculating the handling qualities characteristics of the flight dynamics models and for comparing the results to ADS-33E criteria is described. Preliminary results from the application of the handling qualities analysis for variations in key rotorcraft design parameters of main rotor radius, blade chord, hub stiffness and flap moment of inertia are shown. Varying relationships, with counteracting trends for different handling qualities criteria and different flight speeds are exhibited, with the action of the control system playing a complex part in the outcomes. Overall, the paper demonstrates how a broad array of technical issues across flight dynamics stability and control, simulation and modeling, control law design and handling qualities testing and evaluation had to be confronted to implement even a moderately comprehensive handling qualities analysis of relatively low fidelity models. A key outstanding issue is to how to 'close the loop' with an overall design process, and options for the exploration of how to feedback handling qualities results to a conceptual design process are proposed for future work.

  3. Incorporating health care quality into health antitrust law

    Science.gov (United States)

    2008-01-01

    Background Antitrust authorities treat price as a proxy for hospital quality since health care quality is difficult to observe. As the ability to measure quality improved, more research became necessary to investigate the relationship between hospital market power and patient outcomes. This paper examines the impact of hospital competition on the quality of care as measured by the risk-adjusted mortality rates with the hospital as the unit of analysis. The study separately examines the effect of competition on non-profit hospitals. Methods We use California Office of Statewide Health Planning and Development (OSHPD) data from 1997 through 2002. Empirical model is a cross-sectional study of 373 hospitals. Regression analysis is used to estimate the relationship between Coronary Artery Bypass Graft (CABG) risk-adjusted mortality rates and hospital competition. Results Regression results show lower risk-adjusted mortality rates in the presence of a more competitive environment. This result holds for all alternative hospital market definitions. Non-profit hospitals do not have better patient outcomes than investor-owned hospitals. However, they tend to provide better quality in less competitive environments. CABG volume did not have a significant effect on patient outcomes. Conclusion Quality should be incorporated into the antitrust analysis. When mergers lead to higher prices and lower quality, thus lower social welfare, the antitrust challenge of hospital mergers is warranted. The impact of lower hospital competition on quality of care delivered by non-profit hospitals is ambiguous. PMID:18430219

  4. Incorporating health care quality into health antitrust law

    Directory of Open Access Journals (Sweden)

    Schneider Helen

    2008-04-01

    Full Text Available Abstract Background Antitrust authorities treat price as a proxy for hospital quality since health care quality is difficult to observe. As the ability to measure quality improved, more research became necessary to investigate the relationship between hospital market power and patient outcomes. This paper examines the impact of hospital competition on the quality of care as measured by the risk-adjusted mortality rates with the hospital as the unit of analysis. The study separately examines the effect of competition on non-profit hospitals. Methods We use California Office of Statewide Health Planning and Development (OSHPD data from 1997 through 2002. Empirical model is a cross-sectional study of 373 hospitals. Regression analysis is used to estimate the relationship between Coronary Artery Bypass Graft (CABG risk-adjusted mortality rates and hospital competition. Results Regression results show lower risk-adjusted mortality rates in the presence of a more competitive environment. This result holds for all alternative hospital market definitions. Non-profit hospitals do not have better patient outcomes than investor-owned hospitals. However, they tend to provide better quality in less competitive environments. CABG volume did not have a significant effect on patient outcomes. Conclusion Quality should be incorporated into the antitrust analysis. When mergers lead to higher prices and lower quality, thus lower social welfare, the antitrust challenge of hospital mergers is warranted. The impact of lower hospital competition on quality of care delivered by non-profit hospitals is ambiguous.

  5. Rapid evaluation and quality control of next generation sequencing data with FaQCs.

    Science.gov (United States)

    Lo, Chien-Chi; Chain, Patrick S G

    2014-11-19

    Next generation sequencing (NGS) technologies that parallelize the sequencing process and produce thousands to millions, or even hundreds of millions of sequences in a single sequencing run, have revolutionized genomic and genetic research. Because of the vagaries of any platform's sequencing chemistry, the experimental processing, machine failure, and so on, the quality of sequencing reads is never perfect, and often declines as the read is extended. These errors invariably affect downstream analysis/application and should therefore be identified early on to mitigate any unforeseen effects. Here we present a novel FastQ Quality Control Software (FaQCs) that can rapidly process large volumes of data, and which improves upon previous solutions to monitor the quality and remove poor quality data from sequencing runs. Both the speed of processing and the memory footprint of storing all required information have been optimized via algorithmic and parallel processing solutions. The trimmed output compared side-by-side with the original data is part of the automated PDF output. We show how this tool can help data analysis by providing a few examples, including an increased percentage of reads recruited to references, improved single nucleotide polymorphism identification as well as de novo sequence assembly metrics. FaQCs combines several features of currently available applications into a single, user-friendly process, and includes additional unique capabilities such as filtering the PhiX control sequences, conversion of FASTQ formats, and multi-threading. The original data and trimmed summaries are reported within a variety of graphics and reports, providing a simple way to do data quality control and assurance.

  6. Transcriptome sequencing of the Microarray Quality Control (MAQC RNA reference samples using next generation sequencing

    Directory of Open Access Journals (Sweden)

    Thierry-Mieg Danielle

    2009-06-01

    Full Text Available Abstract Background Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC reference RNA samples using Roche's 454 Genome Sequencer FLX. Results We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.

  7. Model-based quality assessment and base-calling for second-generation sequencing data.

    Science.gov (United States)

    Bravo, Héctor Corrada; Irizarry, Rafael A

    2010-09-01

    Second-generation sequencing (sec-gen) technology can sequence millions of short fragments of DNA in parallel, making it capable of assembling complex genomes for a small fraction of the price and time of previous technologies. In fact, a recently formed international consortium, the 1000 Genomes Project, plans to fully sequence the genomes of approximately 1200 people. The prospect of comparative analysis at the sequence level of a large number of samples across multiple populations may be achieved within the next five years. These data present unprecedented challenges in statistical analysis. For instance, analysis operates on millions of short nucleotide sequences, or reads-strings of A,C,G, or T's, between 30 and 100 characters long-which are the result of complex processing of noisy continuous fluorescence intensity measurements known as base-calling. The complexity of the base-calling discretization process results in reads of widely varying quality within and across sequence samples. This variation in processing quality results in infrequent but systematic errors that we have found to mislead downstream analysis of the discretized sequence read data. For instance, a central goal of the 1000 Genomes Project is to quantify across-sample variation at the single nucleotide level. At this resolution, small error rates in sequencing prove significant, especially for rare variants. Sec-gen sequencing is a relatively new technology for which potential biases and sources of obscuring variation are not yet fully understood. Therefore, modeling and quantifying the uncertainty inherent in the generation of sequence reads is of utmost importance. In this article, we present a simple model to capture uncertainty arising in the base-calling procedure of the Illumina/Solexa GA platform. Model parameters have a straightforward interpretation in terms of the chemistry of base-calling allowing for informative and easily interpretable metrics that capture the variability in

  8. ReQON: a Bioconductor package for recalibrating quality scores from next-generation sequencing data

    Directory of Open Access Journals (Sweden)

    Cabanski Christopher R

    2012-09-01

    Full Text Available Abstract Background Next-generation sequencing technologies have become important tools for genome-wide studies. However, the quality scores that are assigned to each base have been shown to be inaccurate. If the quality scores are used in downstream analyses, these inaccuracies can have a significant impact on the results. Results Here we present ReQON, a tool that recalibrates the base quality scores from an input BAM file of aligned sequencing data using logistic regression. ReQON also generates diagnostic plots showing the effectiveness of the recalibration. We show that ReQON produces quality scores that are both more accurate, in the sense that they more closely correspond to the probability of a sequencing error, and do a better job of discriminating between sequencing errors and non-errors than the original quality scores. We also compare ReQON to other available recalibration tools and show that ReQON is less biased and performs favorably in terms of quality score accuracy. Conclusion ReQON is an open source software package, written in R and available through Bioconductor, for recalibrating base quality scores for next-generation sequencing data. ReQON produces a new BAM file with more accurate quality scores, which can improve the results of downstream analysis, and produces several diagnostic plots showing the effectiveness of the recalibration.

  9. Incorporating multi-leaf collimator leaf sequencing into iterative IMRT optimization

    International Nuclear Information System (INIS)

    Siebers, Jeffrey V.; Lauterbach, Marc; Keall, Paul J.; Mohan, Radhe

    2002-01-01

    Intensity modulated radiation therapy (IMRT) treatment planning typically considers beam optimization and beam delivery as separate tasks. Following optimization, a multi-leaf collimator (MLC) or other beam delivery device is used to generate fluence patterns for patient treatment delivery. Due to limitations and characteristics of the MLC, the deliverable intensity distributions often differ from those produced by the optimizer, leading to differences between the delivered and the optimized doses. Objective function parameters are then adjusted empirically, and the plan is reoptimized to achieve a desired deliverable dose distribution. The resulting plan, though usually acceptable, may not be the best achievable. A method has been developed to incorporate the MLC restrictions into the optimization process. Our in-house IMRT system has been modified to include the calculation of the deliverable intensity into the optimizer. In this process, prior to dose calculation, the MLC leaf sequencer is used to convert intensities to dynamic MLC sequences, from which the deliverable intensities are then determined. All other optimization steps remain the same. To evaluate the effectiveness of deliverable-based optimization, 17 patient cases have been studied. Compared with standard optimization plus conversion to deliverable beams, deliverable-based optimization results show improved isodose coverage and a reduced dose to critical structures. Deliverable-based optimization results are close to the original nondeliverable optimization results, suggesting that IMRT can overcome the MLC limitations by adjusting individual beamlets. The use of deliverable-based optimization may reduce the need for empirical adjustment of objective function parameters and reoptimization of a plan to achieve desired results

  10. Incorporating Servqual-QFD with Taguchi Design for optimizing service quality design

    Science.gov (United States)

    Arbi Hadiyat, M.

    2018-03-01

    Deploying good service design in service companies has been updated issue in improving customer satisfaction, especially based on the level of service quality measured by Parasuraman’s SERVQUAL. Many researchers have been proposing methods in designing the service, and some of them are based on engineering viewpoint, especially by implementing the QFD method or even using robust Taguchi method. The QFD method would found the qualitative solution by generating the “how’s”, while Taguchi method gives more quantitative calculation in optimizing best solution. However, incorporating both QFD and Taguchi has been done in this paper and yields better design process. The purposes of this research is to evaluate the incorporated methods by implemented it to a case study, then analyze the result and see the robustness of those methods to customer perception of service quality. Started by measuring service attributes using SERVQUAL and find the improvement with QFD, the deployment of QFD solution then generated by defining Taguchi factors levels and calculating the Signal-to-noise ratio in its orthogonal array, and optimized Taguchi response then found. A case study was given for designing service in local bank. Afterward, the service design obtained from previous analysis was then evaluated and shows that it was still meet the customer satisfaction. Incorporating QFD and Taguchi has performed well and can be adopted and developed for another research for evaluating the robustness of result.

  11. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data.

    Directory of Open Access Journals (Sweden)

    Ravi K Patel

    Full Text Available Next generation sequencing (NGS technologies provide a high-throughput means to generate large amount of sequence data. However, quality control (QC of sequence data generated from these technologies is extremely important for meaningful downstream analysis. Further, highly efficient and fast processing tools are required to handle the large volume of datasets. Here, we have developed an application, NGS QC Toolkit, for quality check and filtering of high-quality data. This toolkit is a standalone and open source application freely available at http://www.nipgr.res.in/ngsqctoolkit.html. All the tools in the application have been implemented in Perl programming language. The toolkit is comprised of user-friendly tools for QC of sequencing data generated using Roche 454 and Illumina platforms, and additional tools to aid QC (sequence format converter and trimming tools and analysis (statistics tools. A variety of options have been provided to facilitate the QC at user-defined parameters. The toolkit is expected to be very useful for the QC of NGS data to facilitate better downstream analysis.

  12. Amplification of oncogenes and integrated SV40 sequences in mammalian cells by the decay of incorporated iodine-125

    International Nuclear Information System (INIS)

    Ehrfeld, A.; Planas-Bohne, F.; Luecke-Huhle, C.

    1986-01-01

    Iodine-125, in the form of 5-[ 125 I]iododeoxyuridine (I-UdR), was incorporated into the DNA of SV40 transformed Chinese hamster embryo cells. Disintegration of the 125 I led to increased cell killing with increasing dose as measured by the colony-forming ability of single cells. The D37 (the dose at which 37% of the cells survive) amounts to 95 decays per cell, corresponding to 0.66 Gy. Variations in the copy number of specific DNA sequences was measured by using dispersed cell blotting with sensitive DNA hybridizations. A 13-fold amplification of the viral DNA sequences (SV40) and a twofold amplification of two cellular oncogenes of the ras-family (Ki-ras and Ha-ras) were found. Other cellular genes, like the alpha-actin gene, were not amplified, and no variation in gene copy number was detected after incubation of cells with cold I-UdR. We suggest the observed gene amplifications are induced by the densely ionizing radiation emitted by the decay of the incorporated 125 I atoms

  13. Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics

    Directory of Open Access Journals (Sweden)

    Richard Mark Leggett

    2013-12-01

    Full Text Available The processes of quality assessment and control are an active area of research at The Genome Analysis Centre (TGAC. Unlike other sequencing centres that often concentrate on a certain species or technology, TGAC applies expertise in genomics and bioinformatics to a wide range of projects, often requiring bespoke wet lab and in silico workflows. TGAC is fortunate to have access to a diverse range of sequencing and analysis platforms, and we are at the forefront of investigations into library quality and sequence data assessment. We have developed and implemented a number of algorithms, tools, pipelines and packages to ascertain, store, and expose quality metrics across a number of next-generation sequencing platforms, allowing rapid and in-depth cross-platform QC bioinformatics. In this review, we describe these tools as a vehicle for data-driven informatics, offering the potential to provide richer context for downstream analysis and to inform experimental design.

  14. Accurate estimation of short read mapping quality for next-generation genome sequencing

    Science.gov (United States)

    Ruffalo, Matthew; Koyutürk, Mehmet; Ray, Soumya; LaFramboise, Thomas

    2012-01-01

    Motivation: Several software tools specialize in the alignment of short next-generation sequencing reads to a reference sequence. Some of these tools report a mapping quality score for each alignment—in principle, this quality score tells researchers the likelihood that the alignment is correct. However, the reported mapping quality often correlates weakly with actual accuracy and the qualities of many mappings are underestimated, encouraging the researchers to discard correct mappings. Further, these low-quality mappings tend to correlate with variations in the genome (both single nucleotide and structural), and such mappings are important in accurately identifying genomic variants. Approach: We develop a machine learning tool, LoQuM (LOgistic regression tool for calibrating the Quality of short read mappings, to assign reliable mapping quality scores to mappings of Illumina reads returned by any alignment tool. LoQuM uses statistics on the read (base quality scores reported by the sequencer) and the alignment (number of matches, mismatches and deletions, mapping quality score returned by the alignment tool, if available, and number of mappings) as features for classification and uses simulated reads to learn a logistic regression model that relates these features to actual mapping quality. Results: We test the predictions of LoQuM on an independent dataset generated by the ART short read simulation software and observe that LoQuM can ‘resurrect’ many mappings that are assigned zero quality scores by the alignment tools and are therefore likely to be discarded by researchers. We also observe that the recalibration of mapping quality scores greatly enhances the precision of called single nucleotide polymorphisms. Availability: LoQuM is available as open source at http://compbio.case.edu/loqum/. Contact: matthew.ruffalo@case.edu. PMID:22962451

  15. Characterization of chemical vapour deposited diamond films: correlation between hydrogen incorporation and film morphology and quality

    International Nuclear Information System (INIS)

    Tang, C J; Neves, A J; Carmo, M C

    2005-01-01

    In order to tailor diamond synthesized through chemical vapour deposition (CVD) for different applications, many diamond films of different colours and variable quality were deposited by a 5 kW microwave plasma CVD reactor under different growth conditions. The morphology, quality and hydrogen incorporation of these films were characterized using scanning electron microscopy (SEM), Raman and Fourier-transform infrared (FTIR) spectroscopy, respectively. From this study, a general trend between hydrogen incorporation and film colour, morphology and quality was found. That is, as the films sorted by colour gradually become darker, ranging from white through grey to black, high magnification SEM images illustrate that the smoothness of the well defined crystalline facet gradually decreases and second nucleation starts to appear on it, indicating gradual degradation of the crystalline quality. Correspondingly, Raman spectra evidence that the diamond Raman peak at 1332 cm -1 becomes broader and the non-diamond carbon band around 1500 cm -1 starts to appear and becomes stronger, confirming increase of the non-diamond component and decrease of the phase purity of the film, while FTIR spectra show that the CH stretching band and the two CVD diamond specific peaks around 2830 cm -1 rise rapidly, and this indicates that the total amount of hydrogen incorporated into the film increases significantly

  16. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    Science.gov (United States)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.

  17. [Study on correlation between ITS sequence of Arctium lappa and quality of Fructus Arctii].

    Science.gov (United States)

    Xu, Liang; Dou, Deqiang; Wang, Bing; Yang, Yanyun; Kang, Tingguo

    2011-07-01

    To study the correlation between ITS sequence of Arctium lappa and Fructus Arctii quality of different origin. The samples of Fructu arctii materials were collected from 26 different producing areas. Their ITS sequence were determined after polymerase chain reaction (PCR) and quality were evaluated through the determination of arctiin content by HPLC. Genetic diversity, genotype and correlation were analyzed by ClustalX (1.81), Mage 4.0, SPSS 13.0 statistical software. ITS sequence of A. was obtained from 26 samples, and was registered in the GenBank. Corresponding arctiin content of Fructus arctii and 1000-grain weight were determined. A. lappa genotype correlated with Fructus arctii quality by statistical analysis. The research provided a foundation for revealing the molecular mechanism of Fructus arctii geoherbs.

  18. RNA-Pareto: interactive analysis of Pareto-optimal RNA sequence-structure alignments.

    Science.gov (United States)

    Schnattinger, Thomas; Schöning, Uwe; Marchfelder, Anita; Kestler, Hans A

    2013-12-01

    Incorporating secondary structure information into the alignment process improves the quality of RNA sequence alignments. Instead of using fixed weighting parameters, sequence and structure components can be treated as different objectives and optimized simultaneously. The result is not a single, but a Pareto-set of equally optimal solutions, which all represent different possible weighting parameters. We now provide the interactive graphical software tool RNA-Pareto, which allows a direct inspection of all feasible results to the pairwise RNA sequence-structure alignment problem and greatly facilitates the exploration of the optimal solution set.

  19. Extracting a Good Quality Frontal Face Image from a Low-Resolution Video Sequence

    DEFF Research Database (Denmark)

    Nasrollahi, Kamal; Moeslund, Thomas B.

    2011-01-01

    Feeding low-resolution and low-quality images, from inexpensive surveillance cameras, to systems like, e.g., face recognition, produces erroneous and unstable results. Therefore, there is a need for a mechanism to bridge the gap between on one hand low-resolution and low-quality images......, we use a learning-based super-resolution algorithm applied to the result of the reconstruction-based part to improve the quality by another factor of two. This results in an improvement factor of four for the entire system. The proposed system has been tested on 122 low-resolution sequences from two...... different databases. The experimental results show that the proposed system can indeed produce a high-resolution and good quality frontal face image from low-resolution video sequences....

  20. Effectiveness of incorporating citric acid in cassava starch edible coatings to preserve quality of Martha tomatoes

    Science.gov (United States)

    Ambarsari, I.; Oktaningrum, G. N.; Endrasari, R.

    2018-01-01

    Tomato as an agricultural product is extremely perishable. Coatings of tomatoes with edible starch extend quality and storage life of the fruits. Incorporation of citric acid as antimicrobial agent in the edible starch coatings is expected to preserve the quality of tomatoes during storage. The aim of this study was to verify the effectiveness of citric acid incorporated in cassava starch coating to preserve quality of tomatoes. The edible coatings formula consisted of cassava starch solutions (1; 2; 3%), citric acid (0.5; 1.0%) and glycerol (10%). Tomatoes were dipped to the coating solution for 10 seconds, then air-dried and stored at room temperature during 18 days. All the treatments were carried out in triplicates. Experimental data were analyzed using One Way ANOVA. The results showed that coating treatments did not affect the weight loss, moisture content, color characteristic, carotene and vitamin C content on Martha tomatoes. The low concentration of starch coating on Martha tomatoes are indicated to be the reason why there was no significant difference between coated and coated tomatoes for some parameters. However, incorporating citric acid in cassava starch-based coatings could prevent tomato fruits from firmness reduction and spoilage during storage.

  1. Masking as an effective quality control method for next-generation sequencing data analysis.

    Science.gov (United States)

    Yun, Sajung; Yun, Sijung

    2014-12-13

    Next generation sequencing produces base calls with low quality scores that can affect the accuracy of identifying simple nucleotide variation calls, including single nucleotide polymorphisms and small insertions and deletions. Here we compare the effectiveness of two data preprocessing methods, masking and trimming, and the accuracy of simple nucleotide variation calls on whole-genome sequence data from Caenorhabditis elegans. Masking substitutes low quality base calls with 'N's (undetermined bases), whereas trimming removes low quality bases that results in a shorter read lengths. We demonstrate that masking is more effective than trimming in reducing the false-positive rate in single nucleotide polymorphism (SNP) calling. However, both of the preprocessing methods did not affect the false-negative rate in SNP calling with statistical significance compared to the data analysis without preprocessing. False-positive rate and false-negative rate for small insertions and deletions did not show differences between masking and trimming. We recommend masking over trimming as a more effective preprocessing method for next generation sequencing data analysis since masking reduces the false-positive rate in SNP calling without sacrificing the false-negative rate although trimming is more commonly used currently in the field. The perl script for masking is available at http://code.google.com/p/subn/. The sequencing data used in the study were deposited in the Sequence Read Archive (SRX450968 and SRX451773).

  2. Coval: improving alignment quality and variant calling accuracy for next-generation sequencing data.

    Directory of Open Access Journals (Sweden)

    Shunichi Kosugi

    Full Text Available Accurate identification of DNA polymorphisms using next-generation sequencing technology is challenging because of a high rate of sequencing error and incorrect mapping of reads to reference genomes. Currently available short read aligners and DNA variant callers suffer from these problems. We developed the Coval software to improve the quality of short read alignments. Coval is designed to minimize the incidence of spurious alignment of short reads, by filtering mismatched reads that remained in alignments after local realignment and error correction of mismatched reads. The error correction is executed based on the base quality and allele frequency at the non-reference positions for an individual or pooled sample. We demonstrated the utility of Coval by applying it to simulated genomes and experimentally obtained short-read data of rice, nematode, and mouse. Moreover, we found an unexpectedly large number of incorrectly mapped reads in 'targeted' alignments, where the whole genome sequencing reads had been aligned to a local genomic segment, and showed that Coval effectively eliminated such spurious alignments. We conclude that Coval significantly improves the quality of short-read sequence alignments, thereby increasing the calling accuracy of currently available tools for SNP and indel identification. Coval is available at http://sourceforge.net/projects/coval105/.

  3. Evaluating Quality of Aged Archival Formalin-Fixed Paraffin-Embedded Samples for RNA-Sequencing

    Science.gov (United States)

    Archival formalin-fixed paraffin-embedded (FFPE) samples offer a vast, untapped source of genomic data for biomarker discovery. However, the quality of FFPE samples is often highly variable, and conventional methods to assess RNA quality for RNA-sequencing (RNA-seq) are not infor...

  4. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

    Science.gov (United States)

    Chin, Chen-Shan; Alexander, David H; Marks, Patrick; Klammer, Aaron A; Drake, James; Heiner, Cheryl; Clum, Alicia; Copeland, Alex; Huddleston, John; Eichler, Evan E; Turner, Stephen W; Korlach, Jonas

    2013-06-01

    We present a hierarchical genome-assembly process (HGAP) for high-quality de novo microbial genome assemblies using only a single, long-insert shotgun DNA library in conjunction with Single Molecule, Real-Time (SMRT) DNA sequencing. Our method uses the longest reads as seeds to recruit all other reads for construction of highly accurate preassembled reads through a directed acyclic graph-based consensus procedure, which we follow with assembly using off-the-shelf long-read assemblers. In contrast to hybrid approaches, HGAP does not require highly accurate raw reads for error correction. We demonstrate efficient genome assembly for several microorganisms using as few as three SMRT Cell zero-mode waveguide arrays of sequencing and for BACs using just one SMRT Cell. Long repeat regions can be successfully resolved with this workflow. We also describe a consensus algorithm that incorporates SMRT sequencing primary quality values to produce de novo genome sequence exceeding 99.999% accuracy.

  5. Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

    Directory of Open Access Journals (Sweden)

    Pupko Tal

    2008-10-01

    Full Text Available Abstract Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.

  6. Next-generation sequencing (NGS) for assessment of microbial water quality: current progress, challenges, and future opportunities

    OpenAIRE

    BoonFei eTan; Charmaine Marie Ng; Jean Pierre Nshimyimana; Jean Pierre Nshimyimana; Lay-Leng eLoh; Lay-Leng eLoh; Karina Yew-Hoong Gin; Janelle Renee Thompson; Janelle Renee Thompson

    2015-01-01

    Water quality is an emergent property of a complex system comprised of interacting microbial populations and introduced microbial and chemical contaminants. Studies leveraging next-generation sequencing (NGS) technologies are providing new insights into the ecology of microbially mediated processes that influence fresh water quality such as algal blooms, contaminant biodegradation, and pathogen dissemination. In addition, sequencing methods targeting small subunit (SSU) rRNA hypervariable reg...

  7. An Evaluation Framework for Lossy Compression of Genome Sequencing Quality Values

    OpenAIRE

    Alberti, Claudio; Daniels, Noah; Hernaez, Mikel; Voges, Jan; Goldfeder, Rachel L.; Hernandez-Lopez, Ana A.; Mattavelli, Marco; Berger, Bonnie

    2016-01-01

    This paper provides the specification and an initial validation of an evaluation framework for the comparison of lossy compressors of genome sequencing quality values. The goal is to define reference data, test sets, tools and metrics that shall be used to evaluate the impact of lossy compression of quality values on human genome variant calling. The functionality of the framework is validated referring to two state-of-the-art genomic compressors. This work has been spurred by the current act...

  8. The need for high-quality whole-genome sequence databases in microbial forensics.

    Science.gov (United States)

    Sjödin, Andreas; Broman, Tina; Melefors, Öjar; Andersson, Gunnar; Rasmusson, Birgitta; Knutsson, Rickard; Forsman, Mats

    2013-09-01

    Microbial forensics is an important part of a strengthened capability to respond to biocrime and bioterrorism incidents to aid in the complex task of distinguishing between natural outbreaks and deliberate acts. The goal of a microbial forensic investigation is to identify and criminally prosecute those responsible for a biological attack, and it involves a detailed analysis of the weapon--that is, the pathogen. The recent development of next-generation sequencing (NGS) technologies has greatly increased the resolution that can be achieved in microbial forensic analyses. It is now possible to identify, quickly and in an unbiased manner, previously undetectable genome differences between closely related isolates. This development is particularly relevant for the most deadly bacterial diseases that are caused by bacterial lineages with extremely low levels of genetic diversity. Whole-genome analysis of pathogens is envisaged to be increasingly essential for this purpose. In a microbial forensic context, whole-genome sequence analysis is the ultimate method for strain comparisons as it is informative during identification, characterization, and attribution--all 3 major stages of the investigation--and at all levels of microbial strain identity resolution (ie, it resolves the full spectrum from family to isolate). Given these capabilities, one bottleneck in microbial forensics investigations is the availability of high-quality reference databases of bacterial whole-genome sequences. To be of high quality, databases need to be curated and accurate in terms of sequences, metadata, and genetic diversity coverage. The development of whole-genome sequence databases will be instrumental in successfully tracing pathogens in the future.

  9. Sequence Coding and Search System Backfit Quality Assurance Program Plan

    International Nuclear Information System (INIS)

    Lovell, C.J.; Stepina, P.L.

    1985-03-01

    The Sequence Coding and Search System is a computer-based encoding system for events described in Licensee Event Reports. This data system contains LERs from 1981 to present. Backfit of the data system to include LERs prior to 1981 is required. This report documents the Quality Assurance Program Plan that EG and G Idaho, Inc. will follow while encoding 1980 LERs

  10. Construction sequence scale model: an aid to productivity and quality assurance

    International Nuclear Information System (INIS)

    Clothier, W.A. Sr.

    1978-01-01

    The natural tendencies of an engineering scale model to promote a high level of quality by error prevention during design and construction stages of a project are studied. A brief section on the basic history of engineering modeling is used to describe TVA's usage of the model. The basic design model is explored in an overview touching the highlights of that form of modeling. A detailed look at the construction sequence model, a relatively new form of model, is presented to demonstrate quality and productivity awareness

  11. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Directory of Open Access Journals (Sweden)

    Yu-Chih Tsai

    2016-02-01

    Full Text Available Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation.

  12. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Science.gov (United States)

    Tsai, Yu-Chih; Deming, Clayton; Segre, Julia A.; Kong, Heidi H.; Korlach, Jonas

    2016-01-01

    ABSTRACT Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. PMID:26861018

  13. A machine learning model to determine the accuracy of variant calls in capture-based next generation sequencing.

    Science.gov (United States)

    van den Akker, Jeroen; Mishne, Gilad; Zimmer, Anjali D; Zhou, Alicia Y

    2018-04-17

    Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to

  14. Whole mitochondrial genome sequencing of domestic horses reveals incorporation of extensive wild horse diversity during domestication

    Directory of Open Access Journals (Sweden)

    Lippold Sebastian

    2011-11-01

    Full Text Available Abstract Background DNA target enrichment by micro-array capture combined with high throughput sequencing technologies provides the possibility to obtain large amounts of sequence data (e.g. whole mitochondrial DNA genomes from multiple individuals at relatively low costs. Previously, whole mitochondrial genome data for domestic horses (Equus caballus were limited to only a few specimens and only short parts of the mtDNA genome (especially the hypervariable region were investigated for larger sample sets. Results In this study we investigated whole mitochondrial genomes of 59 domestic horses from 44 breeds and a single Przewalski horse (Equus przewalski using a recently described multiplex micro-array capture approach. We found 473 variable positions within the domestic horses, 292 of which are parsimony-informative, providing a well resolved phylogenetic tree. Our divergence time estimate suggests that the mitochondrial genomes of modern horse breeds shared a common ancestor around 93,000 years ago and no later than 38,000 years ago. A Bayesian skyline plot (BSP reveals a significant population expansion beginning 6,000-8,000 years ago with an ongoing exponential growth until the present, similar to other domestic animal species. Our data further suggest that a large sample of wild horse diversity was incorporated into the domestic population; specifically, at least 46 of the mtDNA lineages observed in domestic horses (73% already existed before the beginning of domestication about 5,000 years ago. Conclusions Our study provides a window into the maternal origins of extant domestic horses and confirms that modern domestic breeds present a wide sample of the mtDNA diversity found in ancestral, now extinct, wild horse populations. The data obtained allow us to detect a population expansion event coinciding with the beginning of domestication and to estimate both the minimum number of female horses incorporated into the domestic gene pool and the

  15. A new approach to incorporate operator actions in the simulation of accident sequences

    International Nuclear Information System (INIS)

    Antonio Exposito; Juan Antonio Quiroga; Javier Hortal; John-Einar Hulsund

    2006-01-01

    Full text of publication follows: Nowadays, simulation-based human reliability analysis (HRA) methods seem to provide a new direction for the development of advanced methodologies to study operator actions effect during accident sequences. Due to this, the Spanish Nuclear Safety Council (CSN) started a working group which has, among other objectives, to develop such simulation-based HRA methodology. As a result of its activities, a new methodology, named Integrated Safety Assessment (ISA), has been developed and is currently being incorporated into licensing activities at CSN. One of the key aspects of this approach is the incorporation of the capability to simulate operator actions, expanding the ISA methodology scopes to make HRA studies. For this reason, CSN is involved in several activities oriented to develop a new tool, which must be able to incorporate operator actions in conventional thermohydraulic (TH) simulations. One of them is the collaboration project between CSN, Halden Reactor Project (HRP) and the Department of Energy Systems (DSE) of the Polytechnic University of Madrid that started in 2003. The basic aim of the project is to develop a software tool that consists of a closed-loop plant/operator simulator, a thermal hydraulic (TH) code for simulating the plant transient and the procedures processor to give the information related with operator actions to the TH code, both coupled by a data communication system which allows the information exchange. For the plant simulation we have a plant transient simulator code (TRETA/TIZONA for PWR/BWR NPPs respectively), developed by the CSN, with PWR/BWR full scope models. The functionality of these thermalhydraulic codes has been expanded, allowing control the overall information flow between coupled codes, simulating the TH transient and determining when the operator actions must be considered. In the other hand, we have the COPMA-III code, a computerized procedure system able to manage XML operational

  16. An Evaluation Framework for Lossy Compression of Genome Sequencing Quality Values.

    Science.gov (United States)

    Alberti, Claudio; Daniels, Noah; Hernaez, Mikel; Voges, Jan; Goldfeder, Rachel L; Hernandez-Lopez, Ana A; Mattavelli, Marco; Berger, Bonnie

    2016-01-01

    This paper provides the specification and an initial validation of an evaluation framework for the comparison of lossy compressors of genome sequencing quality values. The goal is to define reference data, test sets, tools and metrics that shall be used to evaluate the impact of lossy compression of quality values on human genome variant calling. The functionality of the framework is validated referring to two state-of-the-art genomic compressors. This work has been spurred by the current activity within the ISO/IEC SC29/WG11 technical committee (a.k.a. MPEG), which is investigating the possibility of starting a standardization activity for genomic information representation.

  17. Technical Considerations for Reduced Representation Bisulfite Sequencing with Multiplexed Libraries

    Science.gov (United States)

    Chatterjee, Aniruddha; Rodger, Euan J.; Stockwell, Peter A.; Weeks, Robert J.; Morison, Ian M.

    2012-01-01

    Reduced representation bisulfite sequencing (RRBS), which couples bisulfite conversion and next generation sequencing, is an innovative method that specifically enriches genomic regions with a high density of potential methylation sites and enables investigation of DNA methylation at single-nucleotide resolution. Recent advances in the Illumina DNA sample preparation protocol and sequencing technology have vastly improved sequencing throughput capacity. Although the new Illumina technology is now widely used, the unique challenges associated with multiplexed RRBS libraries on this platform have not been previously described. We have made modifications to the RRBS library preparation protocol to sequence multiplexed libraries on a single flow cell lane of the Illumina HiSeq 2000. Furthermore, our analysis incorporates a bioinformatics pipeline specifically designed to process bisulfite-converted sequencing reads and evaluate the output and quality of the sequencing data generated from the multiplexed libraries. We obtained an average of 42 million paired-end reads per sample for each flow-cell lane, with a high unique mapping efficiency to the reference human genome. Here we provide a roadmap of modifications, strategies, and trouble shooting approaches we implemented to optimize sequencing of multiplexed libraries on an a RRBS background. PMID:23193365

  18. Next-generation sequencing (NGS) for assessment of microbial water quality: current progress, challenges, and future opportunities.

    Science.gov (United States)

    Tan, BoonFei; Ng, Charmaine; Nshimyimana, Jean Pierre; Loh, Lay Leng; Gin, Karina Y-H; Thompson, Janelle R

    2015-01-01

    Water quality is an emergent property of a complex system comprised of interacting microbial populations and introduced microbial and chemical contaminants. Studies leveraging next-generation sequencing (NGS) technologies are providing new insights into the ecology of microbially mediated processes that influence fresh water quality such as algal blooms, contaminant biodegradation, and pathogen dissemination. In addition, sequencing methods targeting small subunit (SSU) rRNA hypervariable regions have allowed identification of signature microbial species that serve as bioindicators for sewage contamination in these environments. Beyond amplicon sequencing, metagenomic and metatranscriptomic analyses of microbial communities in fresh water environments reveal the genetic capabilities and interplay of waterborne microorganisms, shedding light on the mechanisms for production and biodegradation of toxins and other contaminants. This review discusses the challenges and benefits of applying NGS-based methods to water quality research and assessment. We will consider the suitability and biases inherent in the application of NGS as a screening tool for assessment of biological risks and discuss the potential and limitations for direct quantitative interpretation of NGS data. Secondly, we will examine case studies from recent literature where NGS based methods have been applied to topics in water quality assessment, including development of bioindicators for sewage pollution and microbial source tracking, characterizing the distribution of toxin and antibiotic resistance genes in water samples, and investigating mechanisms of biodegradation of harmful pollutants that threaten water quality. Finally, we provide a short review of emerging NGS platforms and their potential applications to the next generation of water quality assessment tools.

  19. Next-generation sequencing (NGS for assessment of microbial water quality: current progress, challenges, and future opportunities

    Directory of Open Access Journals (Sweden)

    BoonFei eTan

    2015-09-01

    Full Text Available Water quality is an emergent property of a complex system comprised of interacting microbial populations and introduced microbial and chemical contaminants. Studies leveraging next-generation sequencing (NGS technologies are providing new insights into the ecology of microbially mediated processes that influence fresh water quality such as algal blooms, contaminant biodegradation, and pathogen dissemination. In addition, sequencing methods targeting small subunit (SSU rRNA hypervariable regions have allowed identification of signature microbial species that serve as bioindicators for sewage contamination in these environments. Beyond amplicon sequencing, metagenomic and metatranscriptomic analyses of microbial communities in fresh water environments reveal the genetic capabilities and interplay of waterborne microorganisms, shedding light on the mechanisms for production and biodegradation of toxins and other contaminants. This review discusses the challenges and benefits of applying NGS-based methods to water quality research and assessment. We will consider the suitability and biases inherent in the application of NGS as a screening tool for assessment of biological risks and discuss the potential and limitations for direct quantitative interpretation of NGS data. Secondly, we will examine case studies from recent literature where NGS based methods have been applied to topics in water quality assessment, including development of bioindicators for sewage pollution and microbial source tracking, characterizing the distribution of toxin and antibiotic resistance genes in water samples, and investigating mechanisms of biodegradation of harmful pollutants that threaten water quality. Finally, we provide a short review of emerging NGS platforms and their potential applications to the next generation of water quality assessment tools.

  20. Tips and tricks for the assembly of a Corynebacterium pseudotuberculosis genome using a semiconductor sequencer

    DEFF Research Database (Denmark)

    Ramos, Rommel Thiago Jucá; Carneiro, Adriana Ribeiro; Soares, Siomar de Castro

    2013-01-01

    that enable reference-based assembly, such as the one used in the present study, Corynebacterium pseudotuberculosis biovar equi, which causes high economic losses in the US equine industry. The quality treatment strategy incorporated into the assembly pipeline enabled a 16-fold greater use of the sequencing...... was validated by comparative genomics with other species of the genus Corynebacterium. The present study presents a modus operandi that enables a greater and better use of data obtained from semiconductor sequencing for obtaining the complete genome from a prokaryotic microorganism, C. pseudotuberculosis, which...

  1. Incorporating wind availability into land use regression modelling of air quality in mountainous high-density urban environment.

    Science.gov (United States)

    Shi, Yuan; Lau, Kevin Ka-Lun; Ng, Edward

    2017-08-01

    Urban air quality serves as an important function of the quality of urban life. Land use regression (LUR) modelling of air quality is essential for conducting health impacts assessment but more challenging in mountainous high-density urban scenario due to the complexities of the urban environment. In this study, a total of 21 LUR models are developed for seven kinds of air pollutants (gaseous air pollutants CO, NO 2 , NO x , O 3 , SO 2 and particulate air pollutants PM 2.5 , PM 10 ) with reference to three different time periods (summertime, wintertime and annual average of 5-year long-term hourly monitoring data from local air quality monitoring network) in Hong Kong. Under the mountainous high-density urban scenario, we improved the traditional LUR modelling method by incorporating wind availability information into LUR modelling based on surface geomorphometrical analysis. As a result, 269 independent variables were examined to develop the LUR models by using the "ADDRESS" independent variable selection method and stepwise multiple linear regression (MLR). Cross validation has been performed for each resultant model. The results show that wind-related variables are included in most of the resultant models as statistically significant independent variables. Compared with the traditional method, a maximum increase of 20% was achieved in the prediction performance of annual averaged NO 2 concentration level by incorporating wind-related variables into LUR model development. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Comparative Genomics in Switchgrass Using 61,585 High-Quality Expressed Sequence Tags

    Directory of Open Access Journals (Sweden)

    Christian M. Tobias

    2008-11-01

    Full Text Available The development of genomic resources for switchgrass ( L., a perennial NAD-malic enzyme type C grass, is required to enable molecular breeding and biotechnological approaches for improving its value as a forage and bioenergy crop. Expressed sequence tag (EST sequencing is one method that can quickly sample gene inventories and produce data suitable for marker development or analysis of tissue-specific patterns of expression. Toward this goal, three cDNA libraries from callus, crown, and seedling tissues of ‘Kanlow’ switchgrass were end-sequenced to generate a total of 61,585 high-quality ESTs from 36,565 separate clones. Seventy-three percent of the assembled consensus sequences could be aligned with the sorghum [ (L. Moench] genome at a -value of <1 × 10, indicating a high degree of similarity. Sixty-five percent of the ESTs matched with gene ontology molecular terms, and 3.3% of the sequences were matched with genes that play potential roles in cell-wall biogenesis. The representation in the three libraries of gene families known to be associated with C photosynthesis, cellulose and β-glucan synthesis, phenylpropanoid biosynthesis, and peroxidase activity indicated likely roles for individual family members. Pairwise comparisons of synonymous codon substitutions were used to assess genome sequence diversity and indicated an overall similarity between the two genome copies present in the tetraploid. Identification of EST–simple sequence repeat markers and amplification on two individual parents of a mapping population yielded an average of 2.18 amplicons per individual, and 35% of the markers produced fragment length polymorphisms.

  3. Use of plan quality degradation to evaluate tradeoffs in delivery efficiency and clinical plan metrics arising from IMRT optimizer and sequencer compromises

    Science.gov (United States)

    Wilkie, Joel R.; Matuszak, Martha M.; Feng, Mary; Moran, Jean M.; Fraass, Benedick A.

    2013-01-01

    Purpose: Plan degradation resulting from compromises made to enhance delivery efficiency is an important consideration for intensity modulated radiation therapy (IMRT) treatment plans. IMRT optimization and/or multileaf collimator (MLC) sequencing schemes can be modified to generate more efficient treatment delivery, but the effect those modifications have on plan quality is often difficult to quantify. In this work, the authors present a method for quantitative assessment of overall plan quality degradation due to tradeoffs between delivery efficiency and treatment plan quality, illustrated using comparisons between plans developed allowing different numbers of intensity levels in IMRT optimization and/or MLC sequencing for static segmental MLC IMRT plans. Methods: A plan quality degradation method to evaluate delivery efficiency and plan quality tradeoffs was developed and used to assess planning for 14 prostate and 12 head and neck patients treated with static IMRT. Plan quality was evaluated using a physician's predetermined “quality degradation” factors for relevant clinical plan metrics associated with the plan optimization strategy. Delivery efficiency and plan quality were assessed for a range of optimization and sequencing limitations. The “optimal” (baseline) plan for each case was derived using a clinical cost function with an unlimited number of intensity levels. These plans were sequenced with a clinical MLC leaf sequencer which uses >100 segments, assuring delivered intensities to be within 1% of the optimized intensity pattern. Each patient's optimal plan was also sequenced limiting the number of intensity levels (20, 10, and 5), and then separately optimized with these same numbers of intensity levels. Delivery time was measured for all plans, and direct evaluation of the tradeoffs between delivery time and plan degradation was performed. Results: When considering tradeoffs, the optimal number of intensity levels depends on the treatment

  4. When less is more: 'slicing' sequencing data improves read decoding accuracy and de novo assembly quality.

    Science.gov (United States)

    Lonardi, Stefano; Mirebrahim, Hamid; Wanamaker, Steve; Alpert, Matthew; Ciardo, Gianfranco; Duma, Denisa; Close, Timothy J

    2015-09-15

    As the invention of DNA sequencing in the 70s, computational biologists have had to deal with the problem of de novo genome assembly with limited (or insufficient) depth of sequencing. In this work, we investigate the opposite problem, that is, the challenge of dealing with excessive depth of sequencing. We explore the effect of ultra-deep sequencing data in two domains: (i) the problem of decoding reads to bacterial artificial chromosome (BAC) clones (in the context of the combinatorial pooling design we have recently proposed), and (ii) the problem of de novo assembly of BAC clones. Using real ultra-deep sequencing data, we show that when the depth of sequencing increases over a certain threshold, sequencing errors make these two problems harder and harder (instead of easier, as one would expect with error-free data), and as a consequence the quality of the solution degrades with more and more data. For the first problem, we propose an effective solution based on 'divide and conquer': we 'slice' a large dataset into smaller samples of optimal size, decode each slice independently, and then merge the results. Experimental results on over 15 000 barley BACs and over 4000 cowpea BACs demonstrate a significant improvement in the quality of the decoding and the final assembly. For the second problem, we show for the first time that modern de novo assemblers cannot take advantage of ultra-deep sequencing data. Python scripts to process slices and resolve decoding conflicts are available from http://goo.gl/YXgdHT; software Hashfilter can be downloaded from http://goo.gl/MIyZHs stelo@cs.ucr.edu or timothy.close@ucr.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Correlation between protein sequence similarity and x-ray diffraction quality in the protein data bank.

    Science.gov (United States)

    Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng

    2009-01-01

    As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.

  6. Quality changes of sea bass slices wrapped with gelatin film incorporated with lemongrass essential oil.

    Science.gov (United States)

    Ahmad, Mehraj; Benjakul, Soottawat; Sumpavapol, Punnanee; Nirmal, Nilesh Prakash

    2012-04-16

    Microbiological, chemical and physical changes of sea bass slices wrapped with gelatin film incorporated with 25% (w/w) lemongrass essential oil (LEO) during storage of 12 days at 4 °C were investigated. Sea bass slices wrapped with LEO film had the retarded growth of lactic acid bacteria (LAB), psychrophilic bacteria and spoilage microorganisms including H₂S-producing bacteria and Enterobacteriaceae throughout storage of 12 days in comparison with the control and those wrapped with gelatin film without LEO (G film) (P<0.05). Lowered changes of colour, K value, total volatile base nitrogen (TVB) and TBARS value were also found in LEO film wrapped samples, compared with those wrapped with G film and control, respectively. Therefore, the incorporation of LEO into gelatin film could enhance the antimicrobial and antioxidative properties of the film, thereby maintaining the qualities and extending the shelf-life of the sea bass slices stored at refrigerated temperature. Copyright © 2012 Elsevier B.V. All rights reserved.

  7. Staphylococcus aureus Sortase A-Mediated Incorporation of Peptides: Effect of Peptide Modification on Incorporation.

    Directory of Open Access Journals (Sweden)

    Silvie Hansenová Maňásková

    Full Text Available The endogenous Staphylococcus aureus sortase A (SrtA transpeptidase covalently anchors cell wall-anchored (CWA proteins equipped with a specific recognition motif (LPXTG into the peptidoglycan layer of the staphylococcal cell wall. Previous in situ experiments have shown that SrtA is also able to incorporate exogenous, fluorescently labelled, synthetic substrates equipped with the LPXTG motif (K(FITCLPETG-amide into the bacterial cell wall, albeit at high concentrations of 500 μM to 1 mM. In the present study, we have evaluated the effect of substrate modification on the incorporation efficiency. This revealed that (i by elongation of LPETG-amide with a sequence of positively charged amino acids, derived from the C-terminal domain of physiological SrtA substrates, the incorporation efficiency was increased by 20-fold at 10 μM, 100 μM and 250 μM; (ii Substituting aspartic acid (E for methionine increased the incorporation of the resulting K(FITCLPMTG-amide approximately three times at all concentrations tested; (iii conjugation of the lipid II binding antibiotic vancomycin to K(FITCLPMTG-amide resulted in the same incorporation levels as K(FITCLPETG-amide, but much more efficient at an impressive 500-fold lower substrate concentration. These newly developed synthetic substrates can potentially find broad applications in for example the in situ imaging of bacteria; the incorporation of antibody recruiting moieties; the targeted delivery and covalent incorporation of antimicrobial compounds into the bacterial cell wall.

  8. Developmental validation of a Nextera XT mitogenome Illumina MiSeq sequencing method for high-quality samples.

    Science.gov (United States)

    Peck, Michelle A; Sturk-Andreaggi, Kimberly; Thomas, Jacqueline T; Oliver, Robert S; Barritt-Ross, Suzanne; Marshall, Charla

    2018-05-01

    Generating mitochondrial genome (mitogenome) data from reference samples in a rapid and efficient manner is critical to harnessing the greater power of discrimination of the entire mitochondrial DNA (mtDNA) marker. The method of long-range target enrichment, Nextera XT library preparation, and Illumina sequencing on the MiSeq is a well-established technique for generating mitogenome data from high-quality samples. To this end, a validation was conducted for this mitogenome method processing up to 24 samples simultaneously along with analysis in the CLC Genomics Workbench and utilizing the AQME (AFDIL-QIAGEN mtDNA Expert) tool to generate forensic profiles. This validation followed the Federal Bureau of Investigation's Quality Assurance Standards (QAS) for forensic DNA testing laboratories and the Scientific Working Group on DNA Analysis Methods (SWGDAM) validation guidelines. The evaluation of control DNA, non-probative samples, blank controls, mixtures, and nonhuman samples demonstrated the validity of this method. Specifically, the sensitivity was established at ≥25 pg of nuclear DNA input for accurate mitogenome profile generation. Unreproducible low-level variants were observed in samples with low amplicon yields. Further, variant quality was shown to be a useful metric for identifying sequencing error and crosstalk. Success of this method was demonstrated with a variety of reference sample substrates and extract types. These studies further demonstrate the advantages of using NGS techniques by highlighting the quantitative nature of heteroplasmy detection. The results presented herein from more than 175 samples processed in ten sequencing runs, show this mitogenome sequencing method and analysis strategy to be valid for the generation of reference data. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Reference-quality genome sequence of Aegilops tauschii, the source of wheat D genome, shows that recombination shapes genome structure and evolution

    Science.gov (United States)

    Aegilops tauschii is the diploid progenitor of the D genome of hexaploid wheat and an important genetic resource for wheat. A reference-quality sequence for the Ae. tauschii genome was produced with a combination of ordered-clone sequencing, whole-genome shotgun sequencing, and BioNano optical geno...

  10. Short read sequence typing (SRST: multi-locus sequence types from short reads

    Directory of Open Access Journals (Sweden)

    Inouye Michael

    2012-07-01

    Full Text Available Abstract Background Multi-locus sequence typing (MLST has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. Results We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. Conclusions SRST is a novel

  11. Robustness of ancestral sequence reconstruction to phylogenetic uncertainty.

    Science.gov (United States)

    Hanson-Smith, Victor; Kolaczkowski, Bryan; Thornton, Joseph W

    2010-09-01

    Ancestral sequence reconstruction (ASR) is widely used to formulate and test hypotheses about the sequences, functions, and structures of ancient genes. Ancestral sequences are usually inferred from an alignment of extant sequences using a maximum likelihood (ML) phylogenetic algorithm, which calculates the most likely ancestral sequence assuming a probabilistic model of sequence evolution and a specific phylogeny--typically the tree with the ML. The true phylogeny is seldom known with certainty, however. ML methods ignore this uncertainty, whereas Bayesian methods incorporate it by integrating the likelihood of each ancestral state over a distribution of possible trees. It is not known whether Bayesian approaches to phylogenetic uncertainty improve the accuracy of inferred ancestral sequences. Here, we use simulation-based experiments under both simplified and empirically derived conditions to compare the accuracy of ASR carried out using ML and Bayesian approaches. We show that incorporating phylogenetic uncertainty by integrating over topologies very rarely changes the inferred ancestral state and does not improve the accuracy of the reconstructed ancestral sequence. Ancestral state reconstructions are robust to uncertainty about the underlying tree because the conditions that produce phylogenetic uncertainty also make the ancestral state identical across plausible trees; conversely, the conditions under which different phylogenies yield different inferred ancestral states produce little or no ambiguity about the true phylogeny. Our results suggest that ML can produce accurate ASRs, even in the face of phylogenetic uncertainty. Using Bayesian integration to incorporate this uncertainty is neither necessary nor beneficial.

  12. Complete Genome Sequence of the Quality Control Strain Staphylococcus aureus subsp. aureus ATCC 25923.

    Science.gov (United States)

    Treangen, Todd J; Maybank, Rosslyn A; Enke, Sana; Friss, Mary Beth; Diviak, Lynn F; Karaolis, David K R; Koren, Sergey; Ondov, Brian; Phillippy, Adam M; Bergman, Nicholas H; Rosovitz, M J

    2014-11-06

    Staphylococcus aureus subsp. aureus ATCC 25923 is commonly used as a control strain for susceptibility testing to antibiotics and as a quality control strain for commercial products. We present the completed genome sequence for the strain, consisting of the chromosome and a 27.5-kb plasmid. Copyright © 2014 Treangen et al.

  13. WebPrInSeS: automated full-length clone sequence identification and verification using high-throughput sequencing data.

    Science.gov (United States)

    Massouras, Andreas; Decouttere, Frederik; Hens, Korneel; Deplancke, Bart

    2010-07-01

    High-throughput sequencing (HTS) is revolutionizing our ability to obtain cheap, fast and reliable sequence information. Many experimental approaches are expected to benefit from the incorporation of such sequencing features in their pipeline. Consequently, software tools that facilitate such an incorporation should be of great interest. In this context, we developed WebPrInSeS, a web server tool allowing automated full-length clone sequence identification and verification using HTS data. WebPrInSeS encompasses two separate software applications. The first is WebPrInSeS-C which performs automated sequence verification of user-defined open-reading frame (ORF) clone libraries. The second is WebPrInSeS-E, which identifies positive hits in cDNA or ORF-based library screening experiments such as yeast one- or two-hybrid assays. Both tools perform de novo assembly using HTS data from any of the three major sequencing platforms. Thus, WebPrInSeS provides a highly integrated, cost-effective and efficient way to sequence-verify or identify clones of interest. WebPrInSeS is available at http://webprinses.epfl.ch/ and is open to all users.

  14. PIRPLE: a penalized-likelihood framework for incorporation of prior images in CT reconstruction

    International Nuclear Information System (INIS)

    Stayman, J Webster; Dang, Hao; Ding, Yifu; Siewerdsen, Jeffrey H

    2013-01-01

    Over the course of diagnosis and treatment, it is common for a number of imaging studies to be acquired. Such imaging sequences can provide substantial patient-specific prior knowledge about the anatomy that can be incorporated into a prior-image-based tomographic reconstruction for improved image quality and better dose utilization. We present a general methodology using a model-based reconstruction approach including formulations of the measurement noise that also integrates prior images. This penalized-likelihood technique adopts a sparsity enforcing penalty that incorporates prior information yet allows for change between the current reconstruction and the prior image. Moreover, since prior images are generally not registered with the current image volume, we present a modified model-based approach that seeks a joint registration of the prior image in addition to the reconstruction of projection data. We demonstrate that the combined prior-image- and model-based technique outperforms methods that ignore the prior data or lack a noise model. Moreover, we demonstrate the importance of registration for prior-image-based reconstruction methods and show that the prior-image-registered penalized-likelihood estimation (PIRPLE) approach can maintain a high level of image quality in the presence of noisy and undersampled projection data. (paper)

  15. Long sequence correlation coprocessor

    Science.gov (United States)

    Gage, Douglas W.

    1994-09-01

    A long sequence correlation coprocessor (LSCC) accelerates the bitwise correlation of arbitrarily long digital sequences by calculating in parallel the correlation score for 16, for example, adjacent bit alignments between two binary sequences. The LSCC integrated circuit is incorporated into a computer system with memory storage buffers and a separate general purpose computer processor which serves as its controller. Each of the LSCC's set of sequential counters simultaneously tallies a separate correlation coefficient. During each LSCC clock cycle, computer enable logic associated with each counter compares one bit of a first sequence with one bit of a second sequence to increment the counter if the bits are the same. A shift register assures that the same bit of the first sequence is simultaneously compared to different bits of the second sequence to simultaneously calculate the correlation coefficient by the different counters to represent different alignments of the two sequences.

  16. Efficient error correction for next-generation sequencing of viral amplicons.

    Science.gov (United States)

    Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury

    2012-06-25

    Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.

  17. Centrically reordered inversion recovery half-Fourier single-shot turbo spin-echo sequence: improvement of the image quality of oxygen-enhanced MRI

    International Nuclear Information System (INIS)

    Ohno, Yoshiharu; Hatabu, Hiroto; Higashino, Takanori; Kawamitsu, Hideaki; Watanabe, Hirokazu; Takenaka, Daisuke; Cauteren, Marc van; Sugimura, Kazuro

    2004-01-01

    Purpose: The purpose of the study presented here was to determine the improvement in image quality of oxygen-enhanced magnetic resonance (MR) subtraction imaging obtained with a centrically reordered inversion recovery half-Fourier single-shot turbo spin-echo (c-IR-HASTE) sequence compared with that obtained with a conventional sequentially reordered inversion recovery single-shot HASTE (s-IR-HASTE) sequence for pulmonary imaging. Materials and methods: Oxygen-enhanced MR imaging using a 1.5 T whole body scanner was performed on 12 healthy, non-smoking volunteers. Oxygen-enhanced MR images were obtained with the coronal two-dimensional (2D) c-IR-HASTE sequence and 2D s-IR-HASTE sequence combined with respiratory triggering. For a 256x256 matrix, 132 phase-encoding steps were acquired including four steps for phase correction. Inter-echo spacing for each sequence was 4.0 ms. The effective echo time (TE) for c-IR-HASTE was 4.0 ms, and 16 ms for s-IR-HASTE. The inversion time (TI) was 900 ms. To determine the improvement in oxygen-enhanced MR subtraction imaging by c-IR-HASTE, CNRs of subtraction image, overall image quality, and image degradation of the c-IR-HASTE and s-IR-HASTE techniques were statistically compared. Results: CNR, overall image quality, and image degradation of c-IR-HASTE images showed significant improvement compared to those s-IR-HASTE images (P<0.05). Conclusion: Centrically reordered inversion recovery half-Fourier single-shot turbo spin-echo (c-IR-HASTE) sequence enhanced the signal from the lung and improved the image quality of oxygen-enhanced MR subtraction imaging

  18. Quality standards for DNA sequence variation databases to improve clinical management under development in Australia

    Directory of Open Access Journals (Sweden)

    B. Bennetts

    2014-09-01

    Full Text Available Despite the routine nature of comparing sequence variations identified during clinical testing to database records, few databases meet quality requirements for clinical diagnostics. To address this issue, The Royal College of Pathologists of Australasia (RCPA in collaboration with the Human Genetics Society of Australasia (HGSA, and the Human Variome Project (HVP is developing standards for DNA sequence variation databases intended for use in the Australian clinical environment. The outputs of this project will be promoted to other health systems and accreditation bodies by the Human Variome Project to support the development of similar frameworks in other jurisdictions.

  19. Effect of cassava starch-based edible coating incorporated with lemongrass essential oil on the quality of papaya MJ9

    Science.gov (United States)

    Praseptiangga, D.; Utami, R.; Khasanah, L. U.; Evirananda, I. P.; Kawiji

    2017-02-01

    Edible films and coatings have emerged as an alternative packaging in food applications and have received much attention due to their advantages. The incorporation of essential oils in film matrices to give antimicrobial properties had been observed recently, and could be used as promising preservation technology. In this study, cassava starch-based edible coating incorporated with lemongrass essential oil (1%) was applied by spraying and dipping methods to preserve papaya MJ9 during storage at room temperature. The quality of papaya MJ9 was analyzed based on its physicochemical and microbiological properties. The addition of lemongrass essential oil (1%) significantly inhibited the microbial growth on papaya MJ9 by reducing the value of total yeast and mold as compared to the control. This study also showed that for parameters of weight loss, total soluble solid, vitamin C, and total titratable acid, papaya MJ9 with cassava starch-based edible coating incorporated with lemongrass essential oil (1%) had the lower values than control, however, they had the higher value than control on firmness parameter. These results indicate that cassava starch-based edible coating incorporated with lemongrass essential oil (1%) can be used as an alternative preservation for papaya MJ9.

  20. Get your high-quality low-cost genome sequence

    NARCIS (Netherlands)

    Faino, L.; Thomma, B.P.H.J.

    2014-01-01

    The study of whole-genome sequences has become essential for almost all branches of biological research. Next-generation sequencing (NGS) has revolutionized the scalability, speed, and resolution of sequencing and brought genomic science within reach of academic laboratories that study non-model

  1. VPA: an R tool for analyzing sequencing variants with user-specified frequency pattern

    Directory of Open Access Journals (Sweden)

    Hu Qiang

    2012-01-01

    Full Text Available Abstract Background The massive amounts of genetic variant generated by the next generation sequencing systems demand the development of effective computational tools for variant prioritization. Findings VPA (Variant Pattern Analyzer is an R tool for prioritizing variants with specified frequency pattern from multiple study subjects in next-generation sequencing study. The tool starts from individual files of variant and sequence calls and extract variants with user-specified frequency pattern across the study subjects of interest. Several position level quality criteria can be incorporated into the variant extraction. It can be used in studies with matched pair design as well as studies with multiple groups of subjects. Conclusions VPA can be used as an automatic pipeline to prioritize variants for further functional exploration and hypothesis generation. The package is implemented in the R language and is freely available from http://vpa.r-forge.r-project.org.

  2. Assuring the Quality of Next-Generation Sequencing in Clinical Microbiology and Public Health Laboratories.

    Science.gov (United States)

    Gargis, Amy S; Kalman, Lisa; Lubin, Ira M

    2016-12-01

    Clinical microbiology and public health laboratories are beginning to utilize next-generation sequencing (NGS) for a range of applications. This technology has the potential to transform the field by providing approaches that will complement, or even replace, many conventional laboratory tests. While the benefits of NGS are significant, the complexities of these assays require an evolving set of standards to ensure testing quality. Regulatory and accreditation requirements, professional guidelines, and best practices that help ensure the quality of NGS-based tests are emerging. This review highlights currently available standards and guidelines for the implementation of NGS in the clinical and public health laboratory setting, and it includes considerations for NGS test validation, quality control procedures, proficiency testing, and reference materials. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  3. Isolation and analysis of high quality nuclear DNA with reduced organellar DNA for plant genome sequencing and resequencing

    Directory of Open Access Journals (Sweden)

    Zdepski Anna

    2011-05-01

    Full Text Available Abstract Background High throughput sequencing (HTS technologies have revolutionized the field of genomics by drastically reducing the cost of sequencing, making it feasible for individual labs to sequence or resequence plant genomes. Obtaining high quality, high molecular weight DNA from plants poses significant challenges due to the high copy number of chloroplast and mitochondrial DNA, as well as high levels of phenolic compounds and polysaccharides. Multiple methods have been used to isolate DNA from plants; the CTAB method is commonly used to isolate total cellular DNA from plants that contain nuclear DNA, as well as chloroplast and mitochondrial DNA. Alternatively, DNA can be isolated from nuclei to minimize chloroplast and mitochondrial DNA contamination. Results We describe optimized protocols for isolation of nuclear DNA from eight different plant species encompassing both monocot and eudicot species. These protocols use nuclei isolation to minimize chloroplast and mitochondrial DNA contamination. We also developed a protocol to determine the number of chloroplast and mitochondrial DNA copies relative to the nuclear DNA using quantitative real time PCR (qPCR. We compared DNA isolated from nuclei to total cellular DNA isolated with the CTAB method. As expected, DNA isolated from nuclei consistently yielded nuclear DNA with fewer chloroplast and mitochondrial DNA copies, as compared to the total cellular DNA prepared with the CTAB method. This protocol will allow for analysis of the quality and quantity of nuclear DNA before starting a plant whole genome sequencing or resequencing experiment. Conclusions Extracting high quality, high molecular weight nuclear DNA in plants has the potential to be a bottleneck in the era of whole genome sequencing and resequencing. The methods that are described here provide a framework for researchers to extract and quantify nuclear DNA in multiple types of plants.

  4. Clinical roundtable monograph: effective management of quality of life in metastatic breast cancer.

    Science.gov (United States)

    Christopher, Twelves; Gradishar, William J; O'Shaughnessy, Joyce A; Bramsen, Betsy; Lurie, Robert H

    2014-02-01

    Quality of life is accepted as an important consideration in the management of patients with metastatic breast cancer, which remains incurable. Recent clinical trials of newer agents, such as eribulin and trastuzumab emtansine, have incorporated quality of life analyses. Quality of life is impacted by multiple patient-related, disease-related, and treatment-related factors. Therapies most beneficial for maintaining or improving quality of life include those that can effectively reduce tumor burden and tumor-related symptoms, but have toxicity profiles that are well tolerated and easily managed. Overall outcomes of patients with metastatic breast cancer improve when therapy is focused not only on the disease itself, but also on the goals of minimizing diseaserelated and treatment-related symptoms. A paradigm shift now reflected in major guidelines is the incorporation of palliative care strategies earlier in the course of metastatic disease management. The selection and sequence of treatments should be made in cooperation with the patient and after consideration of her particular priorities.

  5. Health care in small prisons: incorporating high-quality standards.

    Science.gov (United States)

    Rieder, Jean-Pierre; Casillas, Alejandra; Mary, Gérard; Secretan, Anne-Dominique; Gaspoz, Jean-Michel; Wolff, Hans

    2013-01-01

    In the past, health management in Geneva's six post-trial prisons had been variable and inconsistent. In 2008, the unit of penitentiary medicine of the Geneva University Hospitals was mandated to re-organize and provide health care at all six prison facilities. The specific aim of this paper is to outline the example as a practical solution to some of the common challenges in unifying the structure and process of health services across multiple small facilities, while meeting European prison health and local quality standards. Geneva's post-trial prisons are small and close to one another in geographical proximity - ideal conditions for the construction of a health mobile team (HMT). This multidisciplinary mobile team operated like a community ambulatory care model; it was progressively launched in all prison facilities in Geneva. The authors incorporated an implementation strategy where health providers partnered with prison and community stakeholders in the health delivery model's development and adaption process. The model's strategic initiatives are described along the following areas, in light of other international prison health activity and prior care models: access to a health care professional, equivalence of care, patient consent, confidentiality, humanitarian interventions, and professional competence and independence. From the perspective of the HMT members, the authors provide the "lessons learned" through this experience, especially to providers who are working on prison health services reform and coordination improvement. The paper particularly stresses the importance of partnering with community health stakeholders and prison staff, a key component to the approach.

  6. Whey protein isolate edible films with essential oils incorporated to improve the microbial quality of poultry.

    Science.gov (United States)

    Fernández-Pan, Idoya; Mendoza, Mauricio; Maté, Juan I

    2013-09-01

    Whey protein isolate edible films with oregano or clove essential oils (EOs) incorporated as natural antimicrobials have been developed, with the aim of enhancing the microbial quality of poultry. The effectiveness of the films was determined against both the whole and selected microbiota developed during different periods of cold storage on the surface of skinless chicken breast. Tests were conducted by using both turbidimetric and agar disc diffusion methods. The antimicrobial edible films developed showed high effectiveness against the main spoilers developed on the surface of skinless chicken breasts cold-stored for 8 days. The films based on oregano EO showed greater effectiveness than those based on clove EO. Still, clove EO could be part of an effective antimicrobial edible film. Enterobacteriaceae was the most susceptible to the effect of the films when lower concentrations of EO were incorporated. The largest inhibition surfaces obtained were provoked by films with the highest concentration of oregano EO incorporated against lactic acid bacteria. The antimicrobial edible films developed in this study inhibited the growth of the microbial populations that developed through storage of the chicken breast and caused its spoilage. The results of this research have direct application in the food industry to enhance the control of the development of spoilers such as Pseudomonas spp. or lactic acid bacteria. © 2013 Society of Chemical Industry.

  7. Next generation sequencing (NGS)technologies and applications

    Energy Technology Data Exchange (ETDEWEB)

    Vuyisich, Momchilo [Los Alamos National Laboratory

    2012-09-11

    NGS technology overview: (1) NGS library preparation - Nucleic acids extraction, Sample quality control, RNA conversion to cDNA, Addition of sequencing adapters, Quality control of library; (2) Sequencing - Clonal amplification of library fragments, (except PacBio), Sequencing by synthesis, Data output (reads and quality); and (3) Data analysis - Read mapping, Genome assembly, Gene expression, Operon structure, sRNA discovery, and Epigenetic analyses.

  8. Protocol optimization of sacroiliac joint MR Imaging at 3 Tesla: Impact of coil design and motion resistant sequences on image quality.

    Science.gov (United States)

    Gondim Teixeira, P A; Bravetti, M; Hossu, G; Lecocq, S; Petit, D; Loeuille, D; Blum, A

    2017-12-01

    To evaluate the impact of coil design and motion-resistant sequences on the quality of sacroiliac magnetic resonance imaging (MRI) examination in patients with spondyloarthropathy. One hundred and twenty-one patients with suspected sacroiliitis and referred for MRI of the sacroiliac joints were retrospectively evaluated with MRI at 3-Tesla. There were 78 women and 43 men with a mean age of 36.7±11.5 (SD) years (range: 15.8-78.4 years). Conventional and motion-resistant fat-saturated fast-spin echo T2-weighted sequences were performed with two different coils. Image quality was subjectively evaluated by two independent readers (R1 and R2) using a four-point scale. Confidence in the identification of bone marrow edema pattern (BMEP) was also evaluated subjectively using a three-point scale. Phased array body coil yielded improved image quality compared to surface coil (14.1 to 30.4% for R1 and 14.6 to 25.7% for R2; Pcoil with motion-resistant T2-weighted sequence (kappa 0.990). The smallest number of indeterminate BMEP zones was seen on MRI set acquired with the phased-array body coil and motion-resistant T2-weighted sequence. Phased array body coil and motion-resistant T2-weighted sequences perform better than surface coil and conventional T2-weighted sequences for the evaluation of sacroiliac joints, increasing confidence in the identification of BMEP. Copyright © 2017 Editions françaises de radiologie. Published by Elsevier Masson SAS. All rights reserved.

  9. Artificial Intelligence Controls Tape-Recording Sequence

    Science.gov (United States)

    Schwuttke, Ursula M.; Otamura, Roy M.; Zottarelli, Lawrence J.

    1989-01-01

    Developmental expert-system computer program intended to schedule recording of large amounts of data on limited amount of magnetic tape. Schedules recording using two sets of rules. First set incorporates knowledge of locations for recording of new data. Second set incorporates knowledge about issuing commands to recorder. Designed primarily for use on Voyager Spacecraft, also applicable to planning and sequencing in industry.

  10. Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH

    International Nuclear Information System (INIS)

    Volk, Jochen; Herrmann, Torsten; Wuethrich, Kurt

    2008-01-01

    MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness

  11. No Reference Prediction of Quality Metrics for H.264 Compressed Infrared Image Sequences for UAV Applications

    DEFF Research Database (Denmark)

    Hossain, Kabir; Mantel, Claire; Forchhammer, Søren

    2018-01-01

    The framework for this research work is the acquisition of Infrared (IR) images from Unmanned Aerial Vehicles (UAV). In this paper we consider the No-Reference (NR) prediction of Full Reference Quality Metrics for Infrared (IR) video sequences which are compressed and thus distorted by an H.264...

  12. SEQUenCE: a service user-centred quality of care instrument for mental health services.

    Science.gov (United States)

    Hester, Lorraine; O'Doherty, Lorna Jane; Schnittger, Rebecca; Skelly, Niamh; O'Donnell, Muireann; Butterly, Lisa; Browne, Robert; Frorath, Charlotte; Morgan, Craig; McLoughlin, Declan M; Fearon, Paul

    2015-08-01

    To develop a quality of care instrument that is grounded in the service user perspective and validate it in a mental health service. The instrument (SEQUenCE (SErvice user QUality of CarE)) was developed through analysis of focus group data and clinical practice guidelines, and refined through field-testing and psychometric analyses. All participants were attending an independent mental health service in Ireland. Participants had a diagnosis of bipolar affective disorder (BPAD) or a psychotic disorder. Twenty-nine service users participated in six focus group interviews. Seventy-one service users participated in field-testing: 10 judged the face validity of an initial 61-item instrument; 28 completed a revised 52-item instrument from which 12 items were removed following test-retest and convergent validity analyses; 33 completed the resulting 40-item instrument. Test-retest reliability, internal consistency and convergent validity of the instrument. The final instrument showed acceptable test-retest reliability at 5-7 days (r = 0.65; P Service Satisfaction Scale (r = 0.84, P internal consistency (Cronbach's alpha = 0.87). SEQUenCE is a valid, reliable scale that is grounded in the service user perspective and suitable for routine use. It may serve as a useful tool in individual care planning, service evaluation and research. The instrument was developed and validated with service users with a diagnosis of either BPAD or a psychotic disorder; it does not yet have established external validity for other diagnostic groups. © The Author 2015. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.

  13. Combinatorial delivery of small interfering RNAs reduces RNAi efficacy by selective incorporation into RISC

    Science.gov (United States)

    Castanotto, Daniela; Sakurai, Kumi; Lingeman, Robert; Li, Haitang; Shively, Louise; Aagaard, Lars; Soifer, Harris; Gatignol, Anne; Riggs, Arthur; Rossi, John J.

    2007-01-01

    Despite the great potential of RNAi, ectopic expression of shRNA or siRNAs holds the inherent risk of competition for critical RNAi components, thus altering the regulatory functions of some cellular microRNAs. In addition, specific siRNA sequences can potentially hinder incorporation of other siRNAs when used in a combinatorial approach. We show that both synthetic siRNAs and expressed shRNAs compete against each other and with the endogenous microRNAs for transport and for incorporation into the RNA induced silencing complex (RISC). The same siRNA sequences do not display competition when expressed from a microRNA backbone. We also show that TAR RNA binding protein (TRBP) is one of the sensors for selection and incorporation of the guide sequence of interfering RNAs. These findings reveal that combinatorial siRNA approaches can be problematic and have important implications for the methodology of expression and use of therapeutic interfering RNAs. PMID:17660190

  14. ChronQC: a quality control monitoring system for clinical next generation sequencing.

    Science.gov (United States)

    Tawari, Nilesh R; Seow, Justine Jia Wen; Perumal, Dharuman; Ow, Jack L; Ang, Shimin; Devasia, Arun George; Ng, Pauline C

    2018-05-15

    ChronQC is a quality control (QC) tracking system for clinical implementation of next-generation sequencing (NGS). ChronQC generates time series plots for various QC metrics to allow comparison of current runs to historical runs. ChronQC has multiple features for tracking QC data including Westgard rules for clinical validity, laboratory-defined thresholds and historical observations within a specified time period. Users can record their notes and corrective actions directly onto the plots for long-term recordkeeping. ChronQC facilitates regular monitoring of clinical NGS to enable adherence to high quality clinical standards. ChronQC is freely available on GitHub (https://github.com/nilesh-tawari/ChronQC), Docker (https://hub.docker.com/r/nileshtawari/chronqc/) and the Python Package Index. ChronQC is implemented in Python and runs on all common operating systems (Windows, Linux and Mac OS X). tawari.nilesh@gmail.com or pauline.c.ng@gmail.com. Supplementary data are available at Bioinformatics online.

  15. Opera: reconstructing optimal genomic scaffolds with high-throughput paired-end sequences.

    Science.gov (United States)

    Gao, Song; Sung, Wing-Kin; Nagarajan, Niranjan

    2011-11-01

    Scaffolding, the problem of ordering and orienting contigs, typically using paired-end reads, is a crucial step in the assembly of high-quality draft genomes. Even as sequencing technologies and mate-pair protocols have improved significantly, scaffolding programs still rely on heuristics, with no guarantees on the quality of the solution. In this work, we explored the feasibility of an exact solution for scaffolding and present a first tractable solution for this problem (Opera). We also describe a graph contraction procedure that allows the solution to scale to large scaffolding problems and demonstrate this by scaffolding several large real and synthetic datasets. In comparisons with existing scaffolders, Opera simultaneously produced longer and more accurate scaffolds demonstrating the utility of an exact approach. Opera also incorporates an exact quadratic programming formulation to precisely compute gap sizes (Availability: http://sourceforge.net/projects/operasf/ ).

  16. Quality control of next-generation sequencing library through an integrative digital microfluidic platform.

    Science.gov (United States)

    Thaitrong, Numrin; Kim, Hanyoup; Renzi, Ronald F; Bartsch, Michael S; Meagher, Robert J; Patel, Kamlesh D

    2012-12-01

    We have developed an automated quality control (QC) platform for next-generation sequencing (NGS) library characterization by integrating a droplet-based digital microfluidic (DMF) system with a capillary-based reagent delivery unit and a quantitative CE module. Using an in-plane capillary-DMF interface, a prepared sample droplet was actuated into position between the ground electrode and the inlet of the separation capillary to complete the circuit for an electrokinetic injection. Using a DNA ladder as an internal standard, the CE module with a compact LIF detector was capable of detecting dsDNA in the range of 5-100 pg/μL, suitable for the amount of DNA required by the Illumina Genome Analyzer sequencing platform. This DMF-CE platform consumes tenfold less sample volume than the current Agilent BioAnalyzer QC technique, preserving precious sample while providing necessary sensitivity and accuracy for optimal sequencing performance. The ability of this microfluidic system to validate NGS library preparation was demonstrated by examining the effects of limited-cycle PCR amplification on the size distribution and the yield of Illumina-compatible libraries, demonstrating that as few as ten cycles of PCR bias the size distribution of the library toward undesirable larger fragments. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Quality Control of the Traditional Patent Medicine Yimu Wan Based on SMRT Sequencing and DNA Barcoding

    Science.gov (United States)

    Jia, Jing; Xu, Zhichao; Xin, Tianyi; Shi, Linchun; Song, Jingyuan

    2017-01-01

    Substandard traditional patent medicines may lead to global safety-related issues. Protecting consumers from the health risks associated with the integrity and authenticity of herbal preparations is of great concern. Of particular concern is quality control for traditional patent medicines. Here, we establish an effective approach for verifying the biological composition of traditional patent medicines based on single-molecule real-time (SMRT) sequencing and DNA barcoding. Yimu Wan (YMW), a classical herbal prescription recorded in the Chinese Pharmacopoeia, was chosen to test the method. Two reference YMW samples were used to establish a standard method for analysis, which was then applied to three different batches of commercial YMW samples. A total of 3703 and 4810 circular-consensus sequencing (CCS) reads from two reference and three commercial YMW samples were mapped to the ITS2 and psbA-trnH regions, respectively. Moreover, comparison of intraspecific genetic distances based on SMRT sequencing data with reference data from Sanger sequencing revealed an ITS2 and psbA-trnH intergenic spacer that exhibited high intraspecific divergence, with the sites of variation showing significant differences within species. Using the CCS strategy for SMRT sequencing analysis was adequate to guarantee the accuracy of identification. This study demonstrates the application of SMRT sequencing to detect the biological ingredients of herbal preparations. SMRT sequencing provides an affordable way to monitor the legality and safety of traditional patent medicines. PMID:28620408

  18. Quality Control Test for Sequence-Phenotype Assignments

    Science.gov (United States)

    Ortiz, Maria Teresa Lara; Rosario, Pablo Benjamín Leon; Luna-Nevarez, Pablo; Gamez, Alba Savin; Martínez-del Campo, Ana; Del Rio, Gabriel

    2015-01-01

    Relating a gene mutation to a phenotype is a common task in different disciplines such as protein biochemistry. In this endeavour, it is common to find false relationships arising from mutations introduced by cells that may be depurated using a phenotypic assay; yet, such phenotypic assays may introduce additional false relationships arising from experimental errors. Here we introduce the use of high-throughput DNA sequencers and statistical analysis aimed to identify incorrect DNA sequence-phenotype assignments and observed that 10–20% of these false assignments are expected in large screenings aimed to identify critical residues for protein function. We further show that this level of incorrect DNA sequence-phenotype assignments may significantly alter our understanding about the structure-function relationship of proteins. We have made available an implementation of our method at http://bis.ifc.unam.mx/en/software/chispas. PMID:25700273

  19. Incorporating individualized quality of life measures in the evaluation of pharmacy services: the IN*COMPASS framework.

    Science.gov (United States)

    Funderburk, F R; Pathak, D S; Pleil, A M

    1998-01-01

    Quality of life is a fascinating field to researchers and practitioners alike. To some researchers, quality of life is of interest because it offers untold challenges in constructing instruments and capturing data necessary to answer key questions about health, disease, and treatment. For such researchers, quality of life is about statistical relationships among questions and about using questions to define the physical, social, and emotional domains of health. To other researchers, this field is about finding practical applications in policy and treatment decision making for the information provided by quality of life assessments. To these researchers, the focus of quality of life is on ways to apply knowledge of quality of life differences between groups with and without specific diseases or ways to use knowledge about how treatments affect the quality of life of various patient populations. To practitioners, quality of life is about treatment outcomes that impact individual patients' daily lives. It is the practitioner that Funderburk, Pleil, and Pathak are considering in their paper in this issue of Pharmacy Practice Management Quarterly. These authors give several important messages to practitioners seeking to serve their patients by incorporating quality of life into their practices. The key message in the paper is that to better understand and determine the impact of treatment on a patient's quality of life, it is critical to start with a baseline or reference point relevant to that patient. From that baseline or reference point, treatment decisions can be made and progress, in quality of life terms, can be evaluated. Critical questions in their framework, which is called the IN*COMPASS (Individualized Client Oriented Method for Preferred Alleviation of Sickness States) Approach, are "How are you now?" and "How would you like to be?" The authors do not endorse particular quality of life tools in their approach; rather they prescribe certain critical

  20. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    Science.gov (United States)

    Kelly, Steven; Maini, Philip K

    2013-01-01

    The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  1. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    Directory of Open Access Journals (Sweden)

    Steven Kelly

    Full Text Available The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  2. A Framework Incorporating Community Preferences in Use ...

    Science.gov (United States)

    The report is intended to assist water quality officials, watershed managers, members of stakeholder groups, and other interested individuals in fully evaluating ecological and socioeconomic objectives and the gains and losses that often are involved in use attainment decisions. In addition, this report enables local, state, and tribal managers to better understand the benefits, as well as the costs, of attaining high water quality, and to incorporate community preferences in decision-making. Specific objectives are (1) to provide an introduction to the CWA and WQS regulation and analyses related to setting or changing designated uses; (2) create a basis for understanding the relationship between use-attainment decisions and the effects on ecosystems, ecosystem services, and ecological benefits; (3) serve as reference for methods that elicit or infer preferences for benefits and costs related to attaining uses and (4) present process for incorporating new approaches in water quality decisions.

  3. Time-resolved echo-shared parallel MRA of the lung: observer preference study of image quality in comparison with non-echo-shared sequences

    International Nuclear Information System (INIS)

    Fink, C.; Puderbach, M.; Zaporozhan, J.; Plathow, C.; Kauczor, H.-U.; Ley, S.

    2005-01-01

    The aim of this study was to evaluate the image quality of time-resolved echo-shared parallel MRA of the lung. The pulmonary vasculature of nine patients (seven females, two males; median age: 44 years) with pulmonary disease was examined using a time-resolved MRA sequence combining echo sharing with parallel imaging (time-resolved echo-shared angiography technique, or TREAT). The sharpness of the vessel borders, conspicuousness of peripheral lung vessels, artifact level, and overall image quality of TREAT was assessed independently by four readers in a side-by-side comparison with non-echo-shared time-resolved parallel MRA data (pMRA) previously acquired in the same patients. Furthermore, the SNR of pulmonary arteries (PA) and veins (PV) achieved with both pulse sequences was compared. The mean voxel size of TREAT MRA was decreased by 24% compared with the non-echo-shared MRA. Regarding the sharpness of the vessel borders, conspicuousness of peripheral lung vessels, and overall image quality the TREAT sequence was rated superior in 75-76% of all cases. If the TREAT images were preferred over the pMRA images, the advantage was rated as major in 61-71% of all cases. The level of artifacts was not increased with the TREAT sequence. The mean interobserver agreement for all categories ranged between fair (artifact level) and good (overall image quality). The maximum SNR of TREAT did not differ from non-echo-shared parallel MRA (PA: TREAT: 273±45; pMRA: 280±71; PV: TREAT: 273±33; pMRA: 258±62). TREAT achieves a higher spatial resolution than non-echo-shared parallel MRA which is also perceived as an improved image quality. (orig.)

  4. MatrixPlot: visualizing sequence constraints

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Stærfeldt, Hans Henrik; Lund, Ole

    1999-01-01

    MatrixPlot: visualizing sequence constraints. Sub-title Abstract Summary : MatrixPlot is a program for making high-quality matrix plots, such as mutual information plots of sequence alignments and distance matrices of sequences with known three-dimensional coordinates. The user can add information...

  5. Metagenomic sequencing reveals the relationship between microbiota composition and quality of Chinese Rice Wine.

    Science.gov (United States)

    Hong, Xutao; Chen, Jing; Liu, Lin; Wu, Huan; Tan, Haiqin; Xie, Guangfa; Xu, Qian; Zou, Huijun; Yu, Wenjing; Wang, Lan; Qin, Nan

    2016-05-31

    Chinese Rice Wine (CRW) is a common alcoholic beverage in China. To investigate the influence of microbial composition on the quality of CRW, high throughput sequencing was performed for 110 wine samples on bacterial 16S rRNA gene and fungal Internal Transcribed Spacer II (ITS2). Bioinformatic analyses demonstrated that the quality of yeast starter and final wine correlated with microbial taxonomic composition, which was exemplified by our finding that wine spoilage resulted from a high proportion of genus Lactobacillus. Subsequently, based on Lactobacillus abundance of an early stage, a model was constructed to predict final wine quality. In addition, three batches of 20 representative wine samples selected from a pool of 110 samples were further analyzed in metagenomics. The results revealed that wine spoilage was due to rapid growth of Lactobacillus brevis at the early stage of fermentation. Gene functional analysis indicated the importance of some pathways such as synthesis of biotin, malolactic fermentation and production of short-chain fatty acid. These results led to a conclusion that metabolisms of microbes influence the wine quality. Thus, nurturing of beneficial microbes and inhibition of undesired ones are both important for the mechanized brewery.

  6. SU-D-209-02: Percent Depth Dose Curves for Fluoroscopic X-Ray Beam Qualities Incorporating Copper Filtration

    Energy Technology Data Exchange (ETDEWEB)

    Wunderle, K [Cleveland Clinic Foundation, Cleveland, OH (United States); Wayne State University School of Medicine, Detroit, MI (United States); Godley, A; Shen, Z; Dong, F [Cleveland Clinic Foundation, Cleveland, OH (United States); Rakowski, J [Wayne State University School of Medicine, Detroit, MI (United States)

    2016-06-15

    Purpose: The purpose of this investigation was to quantify percent depth dose (PDD) curves for fluoroscopic x-ray beam qualities incorporating added copper filtration. Methods: A PTW (Freiburg, Germany) MP3 water tank was used with a Standard Imaging (Middleton, WI) Exradin Model 11 Spokas Chamber to measure PDD curves for 60, 80, 100 and 120 kVp x-ray beams with copper filtration ranging from 0.0–0.9 mm at 22cm and 42cm fields of view from 0 to 150 mm of water. A free-in-air monitor chamber was used to normalize the water tank data to fluctuations in output from the fluoroscope. The measurements were acquired on a Siemens (Erlangen, Germany) Artis ZeeGo fluoroscope. The fluoroscope was inverted from the typical orientation providing an x-ray beam originating from above the water tank. The water tank was positioned so that the water level was located at 60cm from the focal spot; which also represents the focal spot to interventional reference plane distance for that fluoroscope. Results: PDDs for 60, 80, 100, and 120 kVp with 0 mm of copper filtration compared well to previously published data by Fetterly et al. [Med Phys, 28, 205 (2001)] for those beam qualities given differences in fluoroscopes, geometric orientation, type of ionization chamber, and the water tank used for data collection. PDDs for 60, 80, 100, and 120 kVp with copper filtration were obtained and are presented, which have not been previously investigated and published. Conclusion: The equipment and processes used to acquire the reported data were sound and compared well with previously published data for PDDs without copper filtration. PDD data for the fluoroscopic x-ray beams incorporating copper filtration can be used as reference data for estimating organ or soft tissue dose at depth involving similar beam qualities or for comparison with mathematical models.

  7. Experimental design-based functional mining and characterization of high-throughput sequencing data in the sequence read archive.

    Directory of Open Access Journals (Sweden)

    Takeru Nakazato

    Full Text Available High-throughput sequencing technology, also called next-generation sequencing (NGS, has the potential to revolutionize the whole process of genome sequencing, transcriptomics, and epigenetics. Sequencing data is captured in a public primary data archive, the Sequence Read Archive (SRA. As of January 2013, data from more than 14,000 projects have been submitted to SRA, which is double that of the previous year. Researchers can download raw sequence data from SRA website to perform further analyses and to compare with their own data. However, it is extremely difficult to search entries and download raw sequences of interests with SRA because the data structure is complicated, and experimental conditions along with raw sequences are partly described in natural language. Additionally, some sequences are of inconsistent quality because anyone can submit sequencing data to SRA with no quality check. Therefore, as a criterion of data quality, we focused on SRA entries that were cited in journal articles. We extracted SRA IDs and PubMed IDs (PMIDs from SRA and full-text versions of journal articles and retrieved 2748 SRA ID-PMID pairs. We constructed a publication list referring to SRA entries. Since, one of the main themes of -omics analyses is clarification of disease mechanisms, we also characterized SRA entries by disease keywords, according to the Medical Subject Headings (MeSH extracted from articles assigned to each SRA entry. We obtained 989 SRA ID-MeSH disease term pairs, and constructed a disease list referring to SRA data. We previously developed feature profiles of diseases in a system called "Gendoo". We generated hyperlinks between diseases extracted from SRA and the feature profiles of it. The developed project, publication and disease lists resulting from this study are available at our web service, called "DBCLS SRA" (http://sra.dbcls.jp/. This service will improve accessibility to high-quality data from SRA.

  8. Effect of incorporation of fermented bamboo shoot on physicochemical and microbial quality of pork pickle.

    Science.gov (United States)

    Chavhan, D M; Hazarika, M; Brahma, M L; Hazarika, R A; Rahman, Z

    2015-02-01

    Replacement of commercial chemical preservative (Vinegar) by incorporating fermented bamboo shoot (FBS) products partially or completely and their effect on physicochemical, microbial and shelf life qualities on pork pickle products was studied. Different FBS products such as FBS extract, paste and powder were incorporated in the pork pickle products at the level of 50 to 100 % with or without vinegar and stored at room temperature for 90 days. Highest pH values and lowest titrable acidity was recorded in products with 50 and 100 % FBS powder. No significant differences were observed with respect to proximate composition i.e. percent moisture, protein, fat and ash contents among the products except the product with 100 % FBS powder which had significantly (p acid (TBA) values were found to be increasing as the storage periods were advancing. Except the product with 100 % FBS powder which could be stored for 30 days only, other products could be stored upto 90 days without any physicochemical and microbial problems. It can be concluded from this study that natural and organic FBS extract and paste can be used successfully replacing the conventional chemical preservative (Vinegar) for preparation of pork pickle products and preserved more than 90 days at room temperature. However, FBS powder can be used for preservation of the pickle products for a short period (30 days).

  9. RANDNA: a random DNA sequence generator.

    Science.gov (United States)

    Piva, Francesco; Principato, Giovanni

    2006-01-01

    Monte Carlo simulations are useful to verify the significance of data. Genomic regularities, such as the nucleotide correlations or the not uniform distribution of the motifs throughout genomic or mature mRNA sequences, exist and their significance can be checked by means of the Monte Carlo test. The test needs good quality random sequences in order to work, moreover they should have the same nucleotide distribution as the sequences in which the regularities have been found. Random DNA sequences are also useful to estimate the background score of an alignment, that is a threshold below which the resulting score is merely due to chance. We have developed RANDNA, a free software which allows to produce random DNA or RNA sequences setting both their length and the percentage of nucleotide composition. Sequences having the same nucleotide distribution of exonic, intronic or intergenic sequences can be generated. Its graphic interface makes it possible to easily set the parameters that characterize the sequences being produced and saved in a text format file. The pseudo-random number generator function of Borland Delphi 6 is used, since it guarantees a good randomness, a long cycle length and a high speed. We have checked the quality of sequences generated by the software, by means of well-known tests, both by themselves and versus genuine random sequences. We show the good quality of the generated sequences. The software, complete with examples and documentation, is freely available to users from: http://www.introni.it/en/software.

  10. Control of the Free Convective Flow around the Human Body for Enhanced Inhaled Air Quality: Application to a Seat-Incorporated Personalized Ventilation Unit

    DEFF Research Database (Denmark)

    Bolashikov, Zhecho Dimitrov; Melikov, Arsen Krikor; Krenek, M.

    2010-01-01

    This paper reports on methods for control of the free convective flow around the human body, with the aim of improving inhaled air quality. The methods were studied with sea-incorporated personalized ventilation (PV)-two PV nozzles placed sideways at the head level of a seated occupant supplied...

  11. Determining RNA quality for NextGen sequencing: some exceptions to the gold standard rule of 23S to 16S rRNA ratio

    Science.gov (United States)

    Using next-generation-sequencing technology to assess entire transcriptomes requires high quality starting RNA. Currently, RNA quality is routinely judged using automated microfluidic gel electrophoresis platforms and associated algorithms. Here we report that such automated methods generate false-n...

  12. The Effect of Uncertain End-of-Life Product Quality and Consumer Incentives on Partial Disassembly Sequencing in Value Recovery Operations

    OpenAIRE

    Rickli, Jeremy Lewis

    2013-01-01

    This dissertation addresses gaps in the interaction between End-of-Life (EoL) product acquisition systems and disassembly sequencing. The research focuses on two remanufacturing research problems; 1) modeling uncertain EoL product quality, quantity, and timing in regards to EoL product acquisition and disassembly sequencing and 2) designing EoL product acquisition schemes considering EoL product uncertainty. The main research objectives within these areas are; analyzing, predicting, and contr...

  13. A Comprehensive Quality Evaluation System for Complex Herbal Medicine Using PacBio Sequencing, PCR-Denaturing Gradient Gel Electrophoresis, and Several Chemical Approaches

    Directory of Open Access Journals (Sweden)

    Xiasheng Zheng

    2017-09-01

    Full Text Available Herbal medicine is a major component of complementary and alternative medicine, contributing significantly to the health of many people and communities. Quality control of herbal medicine is crucial to ensure that it is safe and sound for use. Here, we investigated a comprehensive quality evaluation system for a classic herbal medicine, Danggui Buxue Formula, by applying genetic-based and analytical chemistry approaches to authenticate and evaluate the quality of its samples. For authenticity, we successfully applied two novel technologies, third-generation sequencing and PCR-DGGE (denaturing gradient gel electrophoresis, to analyze the ingredient composition of the tested samples. For quality evaluation, we used high performance liquid chromatography assays to determine the content of chemical markers to help estimate the dosage relationship between its two raw materials, plant roots of Huangqi and Danggui. A series of surveys were then conducted against several exogenous contaminations, aiming to further access the efficacy and safety of the samples. In conclusion, the quality evaluation system demonstrated here can potentially address the authenticity, quality, and safety of herbal medicines, thus providing novel insight for enhancing their overall quality control.Highlight: We established a comprehensive quality evaluation system for herbal medicine, by combining two genetic-based approaches third-generation sequencing and DGGE (denaturing gradient gel electrophoresis with analytical chemistry approaches to achieve the authentication and quality connotation of the samples.

  14. A Comprehensive Quality Evaluation System for Complex Herbal Medicine Using PacBio Sequencing, PCR-Denaturing Gradient Gel Electrophoresis, and Several Chemical Approaches.

    Science.gov (United States)

    Zheng, Xiasheng; Zhang, Peng; Liao, Baosheng; Li, Jing; Liu, Xingyun; Shi, Yuhua; Cheng, Jinle; Lai, Zhitian; Xu, Jiang; Chen, Shilin

    2017-01-01

    Herbal medicine is a major component of complementary and alternative medicine, contributing significantly to the health of many people and communities. Quality control of herbal medicine is crucial to ensure that it is safe and sound for use. Here, we investigated a comprehensive quality evaluation system for a classic herbal medicine, Danggui Buxue Formula, by applying genetic-based and analytical chemistry approaches to authenticate and evaluate the quality of its samples. For authenticity, we successfully applied two novel technologies, third-generation sequencing and PCR-DGGE (denaturing gradient gel electrophoresis), to analyze the ingredient composition of the tested samples. For quality evaluation, we used high performance liquid chromatography assays to determine the content of chemical markers to help estimate the dosage relationship between its two raw materials, plant roots of Huangqi and Danggui. A series of surveys were then conducted against several exogenous contaminations, aiming to further access the efficacy and safety of the samples. In conclusion, the quality evaluation system demonstrated here can potentially address the authenticity, quality, and safety of herbal medicines, thus providing novel insight for enhancing their overall quality control. Highlight : We established a comprehensive quality evaluation system for herbal medicine, by combining two genetic-based approaches third-generation sequencing and DGGE (denaturing gradient gel electrophoresis) with analytical chemistry approaches to achieve the authentication and quality connotation of the samples.

  15. A Comprehensive Quality Evaluation System for Complex Herbal Medicine Using PacBio Sequencing, PCR-Denaturing Gradient Gel Electrophoresis, and Several Chemical Approaches

    Science.gov (United States)

    Zheng, Xiasheng; Zhang, Peng; Liao, Baosheng; Li, Jing; Liu, Xingyun; Shi, Yuhua; Cheng, Jinle; Lai, Zhitian; Xu, Jiang; Chen, Shilin

    2017-01-01

    Herbal medicine is a major component of complementary and alternative medicine, contributing significantly to the health of many people and communities. Quality control of herbal medicine is crucial to ensure that it is safe and sound for use. Here, we investigated a comprehensive quality evaluation system for a classic herbal medicine, Danggui Buxue Formula, by applying genetic-based and analytical chemistry approaches to authenticate and evaluate the quality of its samples. For authenticity, we successfully applied two novel technologies, third-generation sequencing and PCR-DGGE (denaturing gradient gel electrophoresis), to analyze the ingredient composition of the tested samples. For quality evaluation, we used high performance liquid chromatography assays to determine the content of chemical markers to help estimate the dosage relationship between its two raw materials, plant roots of Huangqi and Danggui. A series of surveys were then conducted against several exogenous contaminations, aiming to further access the efficacy and safety of the samples. In conclusion, the quality evaluation system demonstrated here can potentially address the authenticity, quality, and safety of herbal medicines, thus providing novel insight for enhancing their overall quality control. Highlight: We established a comprehensive quality evaluation system for herbal medicine, by combining two genetic-based approaches third-generation sequencing and DGGE (denaturing gradient gel electrophoresis) with analytical chemistry approaches to achieve the authentication and quality connotation of the samples. PMID:28955365

  16. DNA Polymerases Drive DNA Sequencing-by-Synthesis Technologies: Both Past and Present

    Directory of Open Access Journals (Sweden)

    Cheng-Yao eChen

    2014-06-01

    Full Text Available Next-generation sequencing (NGS technologies have revolutionized modern biological and biomedical research. The engines responsible for this innovation are DNA polymerases; they catalyze the biochemical reaction for deriving template sequence information. In fact, DNA polymerase has been a cornerstone of DNA sequencing from the very beginning. E. coli DNA polymerase I proteolytic (Klenow fragment was originally utilized in Sanger's dideoxy chain terminating DNA sequencing chemistry. From these humble beginnings followed an explosion of organism-specific, genome sequence information accessible via public database. Family A/B DNA polymerases from mesophilic/thermophilic bacteria/archaea were modified and tested in today's standard capillary electrophoresis (CE and NGS sequencing platforms. These enzymes were selected for their efficient incorporation of bulky dye-terminator and reversible dye-terminator nucleotides respectively. Third generation, real-time single molecule sequencing platform requires slightly different enzyme properties. Enterobacterial phage ⱷ29 DNA polymerase copies long stretches of DNA and possesses a unique capability to efficiently incorporate terminal phosphate-labeled nucleoside polyphosphates. Furthermore, ⱷ29 enzyme has also been utilized in emerging DNA sequencing technologies including nanopore-, and protein-transistor-based sequencing. DNA polymerase is, and will continue to be, a crucial component of sequencing technologies.

  17. Quality improvement education incorporated as an integral part of critical care fellows training at the Mayo Clinic.

    Science.gov (United States)

    Kashani, Kianoush B; Ramar, Kannan; Farmer, J Christopher; Lim, Kaiser G; Moreno-Franco, Pablo; Morgenthaler, Timothy I; Dankbar, Gene C; Hale, Curt W

    2014-10-01

    The Accreditation Council for Graduate Medical Education emphasizes quality improvement (QI) education in residency/fellowship training programs. The Mayo Clinic Combined Critical Care Fellowship (CCF) program conducted a pilot QI education program to incorporate QI training as a required curriculum for the 2010-2011 academic year. CCF collaborated with the Mayo Quality Academy to customize and teach the existing Mayo Quality Fellows curriculum to the CCF fellows with the help of two quality coaches over five months starting July 2010. All fellows were to achieve Bronze and Silver certification prior to graduation. Silver required passing four written exams and submitting a health care QI project. Five projects were selected on the basis of the Impact-Effort Prioritization matrix, and DMAIC (Define, Measure, Analyze, Improve, and Control) methodology was used to complete the projects. The primary outcome was to assess learners' satisfaction, knowledge, and skill transfer. All 20 fellows were Bronze certified, and 14 (70%) were Silver certified by the time of graduation. All five QI projects were completed and showed positive impacts on patient safety and care. Surveys showed improved learner satisfaction. Graduates felt the QI training improved their QI skills and employment and career advancement. The QI curriculum had appropriate content and teaching pace and did not significantly displace other important clinical core curriculum topics. The pilot was successfully implemented in the CCF program and now is in the fourth academic year as an established and integral part of the fellowship core curriculum.

  18. Incorporating Writing in an Integrated Calculus, Linear Algebra, and Differential Equations Sequence.

    Science.gov (United States)

    Kelly, Susan E.; LeDocq, Rebecca Lewin

    2001-01-01

    Describes the specific courses in a sequence along with how the writing has been implemented in each course. Provides ideas for how to efficiently handle the additional paper load so students receive the necessary feedback while keeping the grading time reasonable. (Author/ASK)

  19. Comparative analysis of sequences from PT 2013

    DEFF Research Database (Denmark)

    Mikkelsen, Susie Sommer

    Sheatfish and not EHNV. Generally, mistakes occurred at the ends of the sequences. This can be due to several factors. One is that the sequence has not been trimmed of the sequence primer sites. Another is the lack of quality control of the chromatogram. Finally, sequencing in just one direction can result...... diseases in Europe. As part of the EURL proficiency test for fish diseases it is required to sequence any RANA virus isolates found in any of the samples. It is also highly recommended to sequence the ISA virus to determine whether it be HPRΔ or HPR0. Furthermore, it is recommended that any VHSV and IHNV...... isolates be genotyped. As part of the evaluation of the proficiency results it was decided this year to look into the quality and similarity of the sequence results for selected viruses. Ampoule III in the proficiency test 2013 contained an EHNV isolate. The EURL received 43 sequences from 41 laboratories...

  20. The effects of alignment quality, distance calculation method, sequence filtering, and region on the analysis of 16S rRNA gene-based studies.

    Directory of Open Access Journals (Sweden)

    Patrick D Schloss

    Full Text Available Pyrosequencing of PCR-amplified fragments that target variable regions within the 16S rRNA gene has quickly become a powerful method for analyzing the membership and structure of microbial communities. This approach has revealed and introduced questions that were not fully appreciated by those carrying out traditional Sanger sequencing-based methods. These include the effects of alignment quality, the best method of calculating pairwise genetic distances for 16S rRNA genes, whether it is appropriate to filter variable regions, and how the choice of variable region relates to the genetic diversity observed in full-length sequences. I used a diverse collection of 13,501 high-quality full-length sequences to assess each of these questions. First, alignment quality had a significant impact on distance values and downstream analyses. Specifically, the greengenes alignment, which does a poor job of aligning variable regions, predicted higher genetic diversity, richness, and phylogenetic diversity than the SILVA and RDP-based alignments. Second, the effect of different gap treatments in determining pairwise genetic distances was strongly affected by the variation in sequence length for a region; however, the effect of different calculation methods was subtle when determining the sample's richness or phylogenetic diversity for a region. Third, applying a sequence mask to remove variable positions had a profound impact on genetic distances by muting the observed richness and phylogenetic diversity. Finally, the genetic distances calculated for each of the variable regions did a poor job of correlating with the full-length gene. Thus, while it is tempting to apply traditional cutoff levels derived for full-length sequences to these shorter sequences, it is not advisable. Analysis of beta-diversity metrics showed that each of these factors can have a significant impact on the comparison of community membership and structure. Taken together, these results

  1. Effect of surfactant-coated iron oxide nanoparticles on the effluent water quality from a simulated sequencing batch reactor treating domestic wastewater

    International Nuclear Information System (INIS)

    Hwang, Sangchul; Martinez, Diana; Perez, Priscilla; Rinaldi, Carlos

    2011-01-01

    This study was conducted to evaluate the effect of commercially available engineered iron oxide nanoparticles coated with a surfactant (ENP Fe-surf ) on effluent water quality from a lab-scale sequencing batch reactor as a model secondary biological wastewater treatment. Results showed that ∼8.7% of ENP Fe-surf applied were present in the effluent stream. The stable presence of ENP Fe-surf was confirmed by analyzing the mean particle diameter and iron concentration in the effluent. Consequently, aqueous ENP Fe-surf deteriorated the effluent water quality at a statistically significant level (p Fe-surf would be introduced into environmental receptors through the treated effluent and could potentially impact them. - Highlights: → Surfactant-coated engineered iron oxide nanoparticles (ENP Fe-surf ) were assessed. → Effluent quality was analyzed from a sequencing batch reactor with ENP Fe-surf . → ∼8.7% of ENP Fe-surf applied was present in the effluent. → ENP Fe-surf significantly (p Fe-surf will be introduced into environmental receptors. - Stable presence of surfactant-coated engineered iron oxides nanoparticles deteriorated the effluent water quality at a statistically significant level (p < 0.05).

  2. The genome sequence of the outbreeding globe artichoke constructed de novo incorporating a phase-aware low-pass sequencing strategy of F1 progeny

    Science.gov (United States)

    Scaglione, Davide; Reyes-Chin-Wo, Sebastian; Acquadro, Alberto; Froenicke, Lutz; Portis, Ezio; Beitel, Christopher; Tirone, Matteo; Mauro, Rosario; Lo Monaco, Antonino; Mauromicale, Giovanni; Faccioli, Primetta; Cattivelli, Luigi; Rieseberg, Loren; Michelmore, Richard; Lanteri, Sergio

    2016-01-01

    Globe artichoke (Cynara cardunculus var. scolymus) is an out-crossing, perennial, multi-use crop species that is grown worldwide and belongs to the Compositae, one of the most successful Angiosperm families. We describe the first genome sequence of globe artichoke. The assembly, comprising of 13,588 scaffolds covering 725 of the 1,084 Mb genome, was generated using ~133-fold Illumina sequencing data and encodes 26,889 predicted genes. Re-sequencing (30×) of globe artichoke and cultivated cardoon (C. cardunculus var. altilis) parental genotypes and low-coverage (0.5 to 1×) genotyping-by-sequencing of 163 F1 individuals resulted in 73% of the assembled genome being anchored in 2,178 genetic bins ordered along 17 chromosomal pseudomolecules. This was achieved using a novel pipeline, SOILoCo (Scaffold Ordering by Imputation with Low Coverage), to detect heterozygous regions and assign parental haplotypes with low sequencing read depth and of unknown phase. SOILoCo provides a powerful tool for de novo genome analysis of outcrossing species. Our data will enable genome-scale analyses of evolutionary processes among crops, weeds, and wild species within and beyond the Compositae, and will facilitate the identification of economically important genes from related species. PMID:26786968

  3. SUGAR: graphical user interface-based data refiner for high-throughput DNA sequencing.

    Science.gov (United States)

    Sato, Yukuto; Kojima, Kaname; Nariai, Naoki; Yamaguchi-Kabata, Yumi; Kawai, Yosuke; Takahashi, Mamoru; Mimori, Takahiro; Nagasaki, Masao

    2014-08-08

    Next-generation sequencers (NGSs) have become one of the main tools for current biology. To obtain useful insights from the NGS data, it is essential to control low-quality portions of the data affected by technical errors such as air bubbles in sequencing fluidics. We develop a software SUGAR (subtile-based GUI-assisted refiner) which can handle ultra-high-throughput data with user-friendly graphical user interface (GUI) and interactive analysis capability. The SUGAR generates high-resolution quality heatmaps of the flowcell, enabling users to find possible signals of technical errors during the sequencing. The sequencing data generated from the error-affected regions of a flowcell can be selectively removed by automated analysis or GUI-assisted operations implemented in the SUGAR. The automated data-cleaning function based on sequence read quality (Phred) scores was applied to a public whole human genome sequencing data and we proved the overall mapping quality was improved. The detailed data evaluation and cleaning enabled by SUGAR would reduce technical problems in sequence read mapping, improving subsequent variant analysis that require high-quality sequence data and mapping results. Therefore, the software will be especially useful to control the quality of variant calls to the low population cells, e.g., cancers, in a sample with technical errors of sequencing procedures.

  4. Site-Specific Incorporation of Functional Components into RNA by an Unnatural Base Pair Transcription System

    Directory of Open Access Journals (Sweden)

    Rie Kawai

    2012-03-01

    Full Text Available Toward the expansion of the genetic alphabet, an unnatural base pair between 7-(2-thienylimidazo[4,5-b]pyridine (Ds and pyrrole-2-carbaldehyde (Pa functions as a third base pair in replication and transcription, and provides a useful tool for the site-specific, enzymatic incorporation of functional components into nucleic acids. We have synthesized several modified-Pa substrates, such as alkylamino-, biotin-, TAMRA-, FAM-, and digoxigenin-linked PaTPs, and examined their transcription by T7 RNA polymerase using Ds-containing DNA templates with various sequences. The Pa substrates modified with relatively small functional groups, such as alkylamino and biotin, were efficiently incorporated into RNA transcripts at the internal positions, except for those less than 10 bases from the 3′-terminus. We found that the efficient incorporation into a position close to the 3′-terminus of a transcript depended on the natural base contexts neighboring the unnatural base, and that pyrimidine-Ds-pyrimidine sequences in templates were generally favorable, relative to purine-Ds-purine sequences. The unnatural base pair transcription system provides a method for the site-specific functionalization of large RNA molecules.

  5. Stability of recursive out-of-sequence measurement filters: an open problem

    Science.gov (United States)

    Chen, Lingji; Moshtagh, Nima; Mehra, Raman K.

    2011-06-01

    In many applications where communication delays are present, measurements with earlier time stamps can arrive out-of-sequence, i.e., after state estimates have been obtained for the current time instant. To incorporate such an Out-Of-Sequence Measurement (OOSM), many algorithms have been proposed in the literature to obtain or approximate the optimal estimate that would have been obtained if the OOSM had arrived in-sequence. When OOSM occurs repeatedly, approximate estimations as a result of incorporating one OOSM have to serve as the basis for incorporating yet another OOSM. The question of whether the "approximation of approximation" is well behaved, i.e., whether approximation errors accumulate in a recursive setting, has not been adequately addressed in the literature. This paper draws attention to the stability question of recursive OOSM processing filters, formulates the problem in a specific setting, and presents some simulation results that suggest that such filters are indeed well-behaved. Our hope is that more research will be conducted in the future to rigorously establish stability properties of these filters.

  6. Subjective quality of video sequences rendered on LCD with local backlight dimming at different lighting conditions

    Science.gov (United States)

    Mantel, Claire; Korhonen, Jari; Pedersen, Jesper M.; Bech, Søren; Andersen, Jakob Dahl; Forchhammer, Søren

    2015-01-01

    This paper focuses on the influence of ambient light on the perceived quality of videos displayed on Liquid Crystal Display (LCD) with local backlight dimming. A subjective test assessing the quality of videos with two backlight dimming methods and three lighting conditions, i.e. no light, low light level (5 lux) and higher light level (60 lux) was organized to collect subjective data. Results show that participants prefer the method exploiting local dimming possibilities to the conventional full backlight but that this preference varies depending on the ambient light level. The clear preference for one method at the low light conditions decreases at the high ambient light, confirming that the ambient light significantly attenuates the perception of the leakage defect (light leaking through dark pixels). Results are also highly dependent on the content of the sequence, which can modulate the effect of the ambient light from having an important influence on the quality grades to no influence at all.

  7. Accurate Local-Ancestry Inference in Exome-Sequenced Admixed Individuals via Off-Target Sequence Reads

    Science.gov (United States)

    Hu, Youna; Willer, Cristen; Zhan, Xiaowei; Kang, Hyun Min; Abecasis, Gonçalo R.

    2013-01-01

    Estimates of the ancestry of specific chromosomal regions in admixed individuals are useful for studies of human evolutionary history and for genetic association studies. Previously, this ancestry inference relied on high-quality genotypes from genome-wide association study (GWAS) arrays. These high-quality genotypes are not always available when samples are exome sequenced, and exome sequencing is the strategy of choice for many ongoing genetic studies. Here we show that off-target reads generated during exome-sequencing experiments can be combined with on-target reads to accurately estimate the ancestry of each chromosomal segment in an admixed individual. To reconstruct local ancestry, our method SEQMIX models aligned bases directly instead of relying on hard genotype calls. We evaluate the accuracy of our method through simulations and analysis of samples sequenced by the 1000 Genomes Project and the NHLBI Grand Opportunity Exome Sequencing Project. In African Americans, we show that local-ancestry estimates derived by our method are very similar to those derived with Illumina’s Omni 2.5M genotyping array and much improved in relation to estimates that use only exome genotypes and ignore off-target sequencing reads. Software implementing this method, SEQMIX, can be applied to analysis of human population history or used for genetic association studies in admixed individuals. PMID:24210252

  8. A dimer of the lymphoid protein RAG1 recognizes the recombination signal sequence and the complex stably incorporates the high mobility group protein HMG2.

    Science.gov (United States)

    Rodgers, K K; Villey, I J; Ptaszek, L; Corbett, E; Schatz, D G; Coleman, J E

    1999-07-15

    RAG1 and RAG2 are the two lymphoid-specific proteins required for the cleavage of DNA sequences known as the recombination signal sequences (RSSs) flanking V, D or J regions of the antigen-binding genes. Previous studies have shown that RAG1 alone is capable of binding to the RSS, whereas RAG2 only binds as a RAG1/RAG2 complex. We have expressed recombinant core RAG1 (amino acids 384-1008) in Escherichia coli and demonstrated catalytic activity when combined with RAG2. This protein was then used to determine its oligomeric forms and the dissociation constant of binding to the RSS. Electrophoretic mobility shift assays show that up to three oligomeric complexes of core RAG1 form with a single RSS. Core RAG1 was found to exist as a dimer both when free in solution and as the minimal species bound to the RSS. Competition assays show that RAG1 recognizes both the conserved nonamer and heptamer sequences of the RSS. Zinc analysis shows the core to contain two zinc ions. The purified RAG1 protein overexpressed in E.coli exhibited the expected cleavage activity when combined with RAG2 purified from transfected 293T cells. The high mobility group protein HMG2 is stably incorporated into the recombinant RAG1/RSS complex and can increase the affinity of RAG1 for the RSS in the absence of RAG2.

  9. Sequence modelling and an extensible data model for genomic database

    Energy Technology Data Exchange (ETDEWEB)

    Li, Peter Wei-Der [California Univ., San Francisco, CA (United States); Univ. of California, Berkeley, CA (United States)

    1992-01-01

    The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS`s do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the ``Extensible Object Model``, to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.

  10. Sequence modelling and an extensible data model for genomic database

    Energy Technology Data Exchange (ETDEWEB)

    Li, Peter Wei-Der (California Univ., San Francisco, CA (United States) Lawrence Berkeley Lab., CA (United States))

    1992-01-01

    The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of these information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMS's do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the Extensible Object Model'', to address the need of a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implement the query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.

  11. Sequencing games with Just-in-Time arrival, and related games

    NARCIS (Netherlands)

    Lohmann, E.R.M.A.; Borm, P.E.M.; Slikker, M.

    2014-01-01

    In this paper sequencing situations with Just-in-Time (JiT) arrival are introduced. This new type of one-machine sequencing situations assumes that a job is available to be handled by the machine as soon as its predecessor is finished. A basic predecessor dependent set-up time is incorporated in the

  12. Effects of aging and freezing/thawing sequence on quality attributes of bovine and

    Directory of Open Access Journals (Sweden)

    Hyun-Wook Kim

    2017-02-01

    Full Text Available Objective The effects of aging and freezing/thawing sequence on color, physicochemical, and enzymatic characteristics of two beef muscles (Mm. gluteus medius, GM and biceps femoris, BF were evaluated. Methods Beef muscles at 3 d postmortem were assigned to four different combinations of aging and freezing/thawing sequence as follows; aging at 2°C for 3 wk (A3, never-frozen control, freezing at −28°C for 2 wk then thawing (F2, frozen/thawed-only, aging at 2°C for 3 wk, freezing at −28°C for 2 wk then thawing (A3F2, and freezing at −28°C for 2 wk, thawing then further aging at 2°C for 3 wk (F2A3. Results No significant interactions between different aging/freezing/thawing treatments and muscle type on all measurements were found. Postmortem aging, regardless of aging/freezing/thawing sequence, had no impact on color stability of frozen/thawed beef muscles (p<0.05. F2A3 resulted in higher purge loss than F2 and A3F2 treatments (p<0.05. A3F2 and F2A3 treatments resulted in lower shear force of beef muscles compared to F2 (p<0.05. Although there was no significant difference in glutathione peroxidase (GSH-Px activity, F2A3 had the highest β-N-acetyl glucominidase (BNAG activity in purge, but the lowest BNAG activity in muscle (p<0.05. GM muscle exhibited higher total color changes and purge loss, and lower GSH-Px activity than BF muscle. Conclusion The results from this present study indicate that different combinations of aging/freezing/thawing sequence would result in considerable impacts on meat quality attributes, particularly thaw/purge loss and tenderness. Developing a novel freezing strategy combined with postmortem aging will be beneficial for the food/meat industry to maximize its positive impacts on tenderness, while minimizing thaw/purge loss of frozen/thawed meat.

  13. [Complete genome sequencing and sequence analysis of BCG Tice].

    Science.gov (United States)

    Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

    2012-10-04

    The objective of this study is to obtain the complete genome sequence of Bacillus Calmette-Guerin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software, with many contigs and scaffolds obtained. There are many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the end of contigs and performed PCR amplification in order to link these contigs and scaffolds. With various enzymes to perform PCR amplification, adjustment of PCR reaction conditions, and combined with clone construction to sequence, all the gaps were finished. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank of National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4334064 base pairs in length, with GC content 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are illuminated here, with the hope of affording some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.

  14. Effects of nisin-incorporated films on the microbiological and physicochemical quality of minimally processed mangoes.

    Science.gov (United States)

    Barbosa, Ana Andréa Teixeira; Silva de Araújo, Hyrla Grazielle; Matos, Patrícia Nogueira; Carnelossi, Marcelo Augusto Guitierrez; Almeida de Castro, Alessandra

    2013-06-17

    The aim of this study is to examine the effects of nisin-incorporated cellulose films on the physicochemical and microbiological qualities of minimally processed mangoes. The use of antimicrobial films did not affect the physicochemical characteristics of mangoes and showed antimicrobial activity against Staphylococcus aureus, Listeria monocytogenes, Alicyclobacillus acidoterrestris and Bacillus cereus. The mango slices were inoculated with S. aureus and L. monocytogenes (10(7)CFU/g), and the viable cell numbers remained at 10(5) and 10(6)CFU/g, respectively, after 12days. In samples packed with antimicrobial films, the viable number of L. monocytogenes cells was reduced below the detection level after 4days. After 6days, a reduction of six log units was observed for S. aureus. In conclusion, nisin showed antimicrobial activity in mangoes without interfering with the organoleptic characteristics of the fruit. This result suggests that nisin could potentially be used in active packing to improve the safety of minimally processed mangoes. Copyright © 2013 Elsevier B.V. All rights reserved.

  15. Genome Sequence Databases (Overview): Sequencing and Assembly

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla L.

    2009-01-01

    From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality ({approx}1 error/10,000 bp), validated through a number of computer and laboratory experiments.

  16. Incorporating microbiota data into epidemiologic models: examples from vaginal microbiota research.

    Science.gov (United States)

    van de Wijgert, Janneke H; Jespers, Vicky

    2016-05-01

    Next generation sequencing and quantitative polymerase chain reaction technologies are now widely available, and research incorporating these methods is growing exponentially. In the vaginal microbiota (VMB) field, most research to date has been descriptive. The purpose of this article is to provide an overview of different ways in which next generation sequencing and quantitative polymerase chain reaction data can be used to answer clinical epidemiologic research questions using examples from VMB research. We reviewed relevant methodological literature and VMB articles (published between 2008 and 2015) that incorporated these methodologies. VMB data have been analyzed using ecologic methods, methods that compare the presence or relative abundance of individual taxa or community compositions between different groups of women or sampling time points, and methods that first reduce the complexity of the data into a few variables followed by the incorporation of these variables into traditional biostatistical models. To make future VMB research more clinically relevant (such as studying associations between VMB compositions and clinical outcomes and the effects of interventions on the VMB), it is important that these methods are integrated with rigorous epidemiologic methods (such as appropriate study designs, sampling strategies, and adjustment for confounding). Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.

  17. Sequence embedding for fast construction of guide trees for multiple sequence alignment

    LENUS (Irish Health Repository)

    Blackshields, Gordon

    2010-05-14

    Abstract Background The most widely used multiple sequence alignment methods require sequences to be clustered as an initial step. Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires memory and time proportional to N 2 for N sequences. When N grows larger than 10,000 or so, this becomes increasingly prohibitive and can form a significant barrier to carrying out very large multiple alignments. Results In this paper, we have tested variations on a class of embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. Conclusions We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignment. Source code is available for download from http:\\/\\/www.clustal.org\\/mbed.tgz.

  18. Post-contrast T1-weighted sequences in pediatric abdominal imaging: comparative analysis of three different sequences and imaging approach

    Energy Technology Data Exchange (ETDEWEB)

    Roque, Andreia; Ramalho, Miguel; AlObaidy, Mamdoh; Heredia, Vasco; Burke, Lauren M.; De Campos, Rafael O.P.; Semelka, Richard C. [University of North Carolina at Chapel Hill, Department of Radiology, Chapel Hill, NC (United States)

    2014-10-15

    Post-contrast T1-weighted imaging is an essential component of a comprehensive pediatric abdominopelvic MR examination. However, consistent good image quality is challenging, as respiratory motion in sedated children can substantially degrade the image quality. To compare the image quality of three different post-contrast T1-weighted imaging techniques - standard three-dimensional gradient-echo (3-D-GRE), magnetization-prepared gradient-recall echo (MP-GRE) and 3-D-GRE with radial data sampling (radial 3-D-GRE) - acquired in pediatric patients younger than 5 years of age. Sixty consecutive exams performed in 51 patients (23 females, 28 males; mean age 2.5 ± 1.4 years) constituted the final study population. Thirty-nine scans were performed at 3 T and 21 scans were performed at 1.5 T. Two different reviewers independently and blindly qualitatively evaluated all sequences to determine image quality and extent of artifacts. MP-GRE and radial 3-D-GRE sequences had the least respiratory motion (P < 0.0001). Standard 3-D-GRE sequences displayed the lowest average score ratings in hepatic and pancreatic edge definition, hepatic vessel clarity and overall image quality. Radial 3-D-GRE sequences showed the highest scores ratings in overall image quality. Our preliminary results support the preference of fat-suppressed radial 3-D-GRE as the best post-contrast T1-weighted imaging approach for patients under the age of 5 years, when dynamic imaging is not essential. (orig.)

  19. BrAD-seq: Breath Adapter Directional sequencing: a streamlined, ultra-simple and fast library preparation protocol for strand specific mRNA library construction.

    Directory of Open Access Journals (Sweden)

    Brad Thomas Townsley

    2015-05-01

    Full Text Available Next Generation Sequencing (NGS is driving rapid advancement in biological understanding and RNA-sequencing (RNA-seq has become an indispensable tool for biology and medicine. There is a growing need for access to these technologies although preparation of NGS libraries remains a bottleneck to wider adoption. Here we report a novel method for the production of strand specific RNA-seq libraries utilizing inherent properties of double-stranded cDNA to capture and incorporate a sequencing adapter. Breath Adapter Directional sequencing (BrAD-seq reduces sample handling and requires far fewer enzymatic steps than most available methods to produce high quality strand-specific RNA-seq libraries. The method we present is optimized for 3-prime Digital Gene Expression (DGE libraries and can easily extend to full transcript coverage shotgun (SHO type strand-specific libraries and is modularized to accommodate a diversity of RNA and DNA input materials. BrAD-seq offers a highly streamlined and inexpensive option for RNA-seq libraries.

  20. Effects of PLA Film Incorporated with ZnO Nanoparticle on the Quality Attributes of Fresh-Cut Apple.

    Science.gov (United States)

    Li, Wenhui; Li, Lin; Cao, Yun; Lan, Tianqing; Chen, Haiyan; Qin, Yuyue

    2017-07-31

    A novel nanopackaging film was synthesized by incorporating ZnO nanoparticles into a poly-lactic acid (PLA) matrix, and its effect on the quality of fresh-cut apple during the period of preservation was investigated at 4 ± 1 °C for 14 days. Six wt % cinnamaldehyde was added into the nano-blend film. Scanning electron microscope (SEM) analysis showed a rougher cross-section of the nano-blend films and an X-ray diffraction (XRD) was carried out to determine the structure of the ZnO nanoparticles. Compared to the pure PLA film, the nano-blend film had a higher water vapor permeability (WVP) and lower oxygen permeability. With the increase of the nanoparticles (NPs) in the PLA, the elongation at break (ε) and elastic modulus (EM) increased, while tensile strength (TS) decreased. Thermogravimetric analysis (TGA) presented a relatively good thermostability. Most importantly, the physical and biochemical properties of the fresh-cut apple were also measured, such as weight loss, firmness, polyphenol oxidase (PPO), total phenolic content, browning index (BI), sensory quality, and microbiological level. The results indicated that nano-blend packaging films had the highest weight loss at the end of storage compared to the pure PLA film; however, nanopackaging provided a better retention of firmness, total phenolic countent, color, and sensory quality. It also had a remarkable inhibition on the growth of microorganisms. Therefore, Nano-ZnO active packaging could be used to improve the shelf-life of fresh-cut produce.

  1. "First generation" automated DNA sequencing technology.

    Science.gov (United States)

    Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

    2011-10-01

    Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.

  2. The diploid genome sequence of an Asian individual

    DEFF Research Database (Denmark)

    Wang, Jun; Wang, Wei; Li, Ruiqiang

    2008-01-01

    Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we...... used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP...... identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J...

  3. Special Issue: Next Generation DNA Sequencing

    Directory of Open Access Journals (Sweden)

    Paul Richardson

    2010-10-01

    Full Text Available Next Generation Sequencing (NGS refers to technologies that do not rely on traditional dideoxy-nucleotide (Sanger sequencing where labeled DNA fragments are physically resolved by electrophoresis. These new technologies rely on different strategies, but essentially all of them make use of real-time data collection of a base level incorporation event across a massive number of reactions (on the order of millions versus 96 for capillary electrophoresis for instance. The major commercial NGS platforms available to researchers are the 454 Genome Sequencer (Roche, Illumina (formerly Solexa Genome analyzer, the SOLiD system (Applied Biosystems/Life Technologies and the Heliscope (Helicos Corporation. The techniques and different strategies utilized by these platforms are reviewed in a number of the papers in this special issue. These technologies are enabling new applications that take advantage of the massive data produced by this next generation of sequencing instruments. [...

  4. Subjective quality of video sequences rendered on LCD with local backlight dimming at different lighting conditions

    DEFF Research Database (Denmark)

    Mantel, Claire; Korhonen, Jari; Pedersen, Jesper Mørkhøj

    2015-01-01

    This paper focuses on the influence of ambient light on the perceived quality of videos displayed on Liquid Crystal Display (LCD) with local backlight dimming. A subjective test assessing the quality of videos with two backlight dimming methods and three lighting conditions, i.e. no light, low...... light level (5 lux) and higher light level (60 lux) was organized to collect subjective data. Results show that participants prefer the method exploiting local dimming possibilities to the conventional full backlight but that this preference varies depending on the ambient light level. The clear...... preference for one method at the low light conditions decreases at the high ambient light, confirming that the ambient light significantly attenuates the perception of the leakage defect (light leaking through dark pixels). Results are also highly dependent on the content of the sequence, which can modulate...

  5. MEGGASENSE - The Metagenome/Genome Annotated Sequence Natural Language Search Engine: A Platform for 
the Construction of Sequence Data Warehouses.

    Science.gov (United States)

    Gacesa, Ranko; Zucko, Jurica; Petursdottir, Solveig K; Gudmundsdottir, Elisabet Eik; Fridjonsson, Olafur H; Diminic, Janko; Long, Paul F; Cullum, John; Hranueli, Daslav; Hreggvidsson, Gudmundur O; Starcevic, Antonio

    2017-06-01

    The MEGGASENSE platform constructs relational databases of DNA or protein sequences. The default functional analysis uses 14 106 hidden Markov model (HMM) profiles based on sequences in the KEGG database. The Solr search engine allows sophisticated queries and a BLAST search function is also incorporated. These standard capabilities were used to generate the SCATT database from the predicted proteome of Streptomyces cattleya . The implementation of a specialised metagenome database (AMYLOMICS) for bioprospecting of carbohydrate-modifying enzymes is described. In addition to standard assembly of reads, a novel 'functional' assembly was developed, in which screening of reads with the HMM profiles occurs before the assembly. The AMYLOMICS database incorporates additional HMM profiles for carbohydrate-modifying enzymes and it is illustrated how the combination of HMM and BLAST analyses helps identify interesting genes. A variety of different proteome and metagenome databases have been generated by MEGGASENSE.

  6. repDNA: a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects.

    Science.gov (United States)

    Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen

    2015-04-15

    In order to develop powerful computational predictors for identifying the biological features or attributes of DNAs, one of the most challenging problems is to find a suitable approach to effectively represent the DNA sequences. To facilitate the studies of DNAs and nucleotides, we developed a Python package called representations of DNAs (repDNA) for generating the widely used features reflecting the physicochemical properties and sequence-order effects of DNAs and nucleotides. There are three feature groups composed of 15 features. The first group calculates three nucleic acid composition features describing the local sequence information by means of kmers; the second group calculates six autocorrelation features describing the level of correlation between two oligonucleotides along a DNA sequence in terms of their specific physicochemical properties; the third group calculates six pseudo nucleotide composition features, which can be used to represent a DNA sequence with a discrete model or vector yet still keep considerable sequence-order information via the physicochemical properties of its constituent oligonucleotides. In addition, these features can be easily calculated based on both the built-in and user-defined properties via using repDNA. The repDNA Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repDNA/. bliu@insun.hit.edu.cn or kcchou@gordonlifescience.org Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. Sequencing of BAC pools by different next generation sequencing platforms and strategies

    Directory of Open Access Journals (Sweden)

    Scholz Uwe

    2011-10-01

    Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.

  8. BLEACHING EUCALYPTUS PULPS WITH SHORT SEQUENCES

    Directory of Open Access Journals (Sweden)

    Flaviana Reis Milagres

    2011-03-01

    Full Text Available Eucalyptus spp kraft pulp, due to its high content of hexenuronic acids, is quite easy to bleach. Therefore, investigations have been made attempting to decrease the number of stages in the bleaching process in order to minimize capital costs. This study focused on the evaluation of short ECF (Elemental Chlorine Free and TCF (Totally Chlorine Free sequences for bleaching oxygen delignified Eucalyptus spp kraft pulp to 90% ISO brightness: PMoDP (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, PMoD/P (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, without washing PMoD(PO (Molybdenum catalyzed acid peroxide, chlorine dioxide and pressurized peroxide, D(EPODP (chlorine dioxide, extraction oxidative with oxygen and peroxide, chlorine dioxide and hydrogen peroxide, PMoQ(PO (Molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide, and XPMoQ(PO (Enzyme, molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide. Uncommon pulp treatments, such as molybdenum catalyzed acid peroxide (PMo and xylanase (X bleaching stages, were used. Among the ECF alternatives, the two-stage PMoD/P sequence proved highly cost-effective without affecting pulp quality in relation to the traditional D(EPODP sequence and produced better quality effluent in relation to the reference. However, a four stage sequence, XPMoQ(PO, was required to achieve full brightness using the TCF technology. This sequence was highly cost-effective although it only produced pulp of acceptable quality.

  9. Clustering of reads with alignment-free measures and quality values.

    Science.gov (United States)

    Comin, Matteo; Leoni, Andrea; Schimd, Michele

    2015-01-01

    The data volume generated by Next-Generation Sequencing (NGS) technologies is growing at a pace that is now challenging the storage and data processing capacities of modern computer systems. In this context an important aspect is the reduction of data complexity by collapsing redundant reads in a single cluster to improve the run time, memory requirements, and quality of post-processing steps like assembly and error correction. Several alignment-free measures, based on k-mers counts, have been used to cluster reads. Quality scores produced by NGS platforms are fundamental for various analysis of NGS data like reads mapping and error detection. Moreover future-generation sequencing platforms will produce long reads but with a large number of erroneous bases (up to 15 %). In this scenario it will be fundamental to exploit quality value information within the alignment-free framework. To the best of our knowledge this is the first study that incorporates quality value information and k-mers counts, in the context of alignment-free measures, for the comparison of reads data. Based on this principles, in this paper we present a family of alignment-free measures called D (q) -type. A set of experiments on simulated and real reads data confirms that the new measures are superior to other classical alignment-free statistics, especially when erroneous reads are considered. Also results on de novo assembly and metagenomic reads classification show that the introduction of quality values improves over standard alignment-free measures. These statistics are implemented in a software called QCluster (http://www.dei.unipd.it/~ciompin/main/qcluster.html).

  10. Inhaled air quality with desk incorporated personalized ventilation (PV): parametric study

    DEFF Research Database (Denmark)

    Bolashikov, Zhecho Dimitrov; Nagano, Hideaki; Melikov, Arsen Krikor

    A workstation consisting of a desk with installed personalized ventilation (PV) and a dressed breathing thermal manikin simulating seated occupant was set in a full-scale test room. The room was conditioned by overhead ventilation at 26 oC. The PV consisted of two confluent jets incorporated along...

  11. Studies on the incorporation of spent ion exchange resins from nuclear power plants into bitumen and cement

    International Nuclear Information System (INIS)

    Bonnevie-Svendsen, M.; Tallberg, K.; Aittola, P.; Tollbaeck, H.

    1976-01-01

    The joint Nordic incorporation experiments should provide technical data needed for the assessment of solidification techniques for wastes from nuclear reactors in the Nordic countries. Spent ion exchange resins are a main fraction of such wastes, and more knowledge about their incorporation is wanted. The effects of simulated and real ion exchange wastes on the quality of bitumen and cement incorporation products were studied. Blown and distilled bitumen and three Portland cement qualities were used. Product characterizations were based on properties relevant for safe waste management, storage, transport and disposal. The applicability and relevance of established and suggested tests is discussed. Up to 40-60% dry resin could be incorporated into bitumen without impairing product qualities. Products with higher resin contents were found to swell in contact with water. The products had a high leach resistance. Their form stability was improved by incorporated resins. Product qualities appeared to be less affected by physico-chemical variables than by mechanical process parameters. Pure resin-cement products tend to decompose in water. Product qualities were strongly affected by a variety of physico-chemical process parameters, and integer products were only obtained within narrow tolerance limits. Caesium was rapidly leached out. To attain integer products and improved leach resistance within technically acceptable tolerance limits it was necessary to utilize stabilizing and caesium-retaining additives such as Silix and vermiculite. Under the present conditions the water content of the resins limited the amounts that could be incorporated in 40-50wt% or about 70vol.% water-saturated (containing 20-40% dry) resin. (author)

  12. Improving the quality of tiger shrimp Penaeus monodon through dietary incorporation of algae as a source of natural pigment

    Directory of Open Access Journals (Sweden)

    Kunal Mondal

    2015-01-01

    Full Text Available Tiger shrimp is one of the major candidate species for export oriented aquaculture which dominates the seafood market in regions of European Union, Japan and USA. Carotenoid content in seafood has now become one of the important criteria in determining the quality of edible product. Recent trends in supplementing fish diets with natural pigment source are an alternative to the utilization of expensive synthetic pigments. In this context, green algae Enteromorpha intestinalis was selected as a source of natural pigment for inclusion in the diet of tiger shrimp Penaeus monodon. Astaxanthin being an important category of carotenoid pigment was monitored in shrimp muscle tissue during the feeding trial. Significant variation (p<0.05 was observed between the experimental groups as confirmed through ANOVA thus exhibiting higher astaxanthin content of shrimps (18.70 Å} 4.48 ppm fed with E. intestinalis incorporated diet as compared to control (15.80 Å} 2.33 ppm. The present programme therefore emphasizes on the quality improvement of aquaculture product by dietary inclusion of algae as a natural pigment source.

  13. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    Science.gov (United States)

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-07

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. One-Step PCR Sequencing. Final Technical Progress Report for February 15, 1997 - November 30, 2001

    Energy Technology Data Exchange (ETDEWEB)

    Shaw, B. R.

    2004-04-16

    We investigated new chemistries and alternate approaches for direct gene sequencing and detection based on the properties of boron-substituted nucleotides as chain delimiters in lieu of conventional chain terminators. Chain terminators, such as the widely used Sanger dideoxynucleotide truncators, stop DNA synthesis during replication and hence are incompatible with further PCR amplification. Chain delimiters, on the other hand, are chemically-modified, ''stealth'' nucleotides that act like normal nucleotides in DNA synthesis and PCR amplification, but can be unmasked following chain extension and exponential amplification. Specifically, chain delimiters give rise to an alternative sequencing strategy based on selective degradation of DNA chains generated by PCR amplification with modified nucleotides. The method as originally devised employed template-directed enzymatic, random incorporation of small amounts of boron-modified nucleotides (e.g., 2'-deoxynucleoside 5'-alpha-[P-borano]- triphosphates) during PCR amplification. Rather than incorporation of dideoxy chain terminators, which are less efficiently incorporated in PCR-based amplification than natural deoxynucleotides, our method is based on selective incorporation and exonuclease degradation of DNA chains generated by efficient PCR amplification of chemically-modified ''stealth'' nucleotides. The stealth nucleotides have a boranophosphate group instead of a normal phosphate, yet behave like normal nucleotides during PCR-amplification. The unique feature of our method is that the position of the stealth nucleotide, and hence DNA sequencing fragments, are revealed at the desired, appropriate moment following PCR amplification. During the current grant period, a variety of new boron-modified nucleotides were synthesized, and new chemistries and enzymatic methods and combinations thereof were explored to improve the method and study the effects of borane modified

  15. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang

    2010-06-10

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  16. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang; Ló pez-Girá ldez, Francesc Francisco; Townsend, Jeffrey P.

    2010-01-01

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  17. On the optimal trimming of high-throughput mRNA sequence data

    Directory of Open Access Journals (Sweden)

    Matthew D MacManes

    2014-01-01

    Full Text Available The widespread and rapid adoption of high-throughput sequencing technologies has afforded researchers the opportunity to gain a deep understanding of genome level processes that underlie evolutionary change, and perhaps more importantly, the links between genotype and phenotype. In particular, researchers interested in functional biology and adaptation have used these technologies to sequence mRNA transcriptomes of specific tissues, which in turn are often compared to other tissues, or other individuals with different phenotypes. While these techniques are extremely powerful, careful attention to data quality is required. In particular, because high-throughput sequencing is more error-prone than traditional Sanger sequencing, quality trimming of sequence reads should be an important step in all data processing pipelines. While several software packages for quality trimming exist, no general guidelines for the specifics of trimming have been developed. Here, using empirically derived sequence data, I provide general recommendations regarding the optimal strength of trimming, specifically in mRNA-Seq studies. Although very aggressive quality trimming is common, this study suggests that a more gentle trimming, specifically of those nucleotides whose Phred score < 2 or < 5, is optimal for most studies across a wide variety of metrics.

  18. QUASAR--scoring and ranking of sequence-structure alignments.

    Science.gov (United States)

    Birzele, Fabian; Gewehr, Jan E; Zimmer, Ralf

    2005-12-15

    Sequence-structure alignments are a common means for protein structure prediction in the fields of fold recognition and homology modeling, and there is a broad variety of programs that provide such alignments based on sequence similarity, secondary structure or contact potentials. Nevertheless, finding the best sequence-structure alignment in a pool of alignments remains a difficult problem. QUASAR (quality of sequence-structure alignments ranking) provides a unifying framework for scoring sequence-structure alignments that aids finding well-performing combinations of well-known and custom-made scoring schemes. Those scoring functions can be benchmarked against widely accepted quality scores like MaxSub, TMScore, Touch and APDB, thus enabling users to test their own alignment scores against 'standard-of-truth' structure-based scores. Furthermore, individual score combinations can be optimized with respect to benchmark sets based on known structural relationships using QUASAR's in-built optimization routines.

  19. Exploring evidence of positive selection reveals genetic basis of meat quality traits in Berkshire pigs through whole genome sequencing.

    Science.gov (United States)

    Jeong, Hyeonsoo; Song, Ki-Duk; Seo, Minseok; Caetano-Anollés, Kelsey; Kim, Jaemin; Kwak, Woori; Oh, Jae-Don; Kim, EuiSoo; Jeong, Dong Kee; Cho, Seoae; Kim, Heebal; Lee, Hak-Kyo

    2015-08-20

    Natural and artificial selection following domestication has led to the existence of more than a hundred pig breeds, as well as incredible variation in phenotypic traits. Berkshire pigs are regarded as having superior meat quality compared to other breeds. As the meat production industry seeks selective breeding approaches to improve profitable traits such as meat quality, information about genetic determinants of these traits is in high demand. However, most of the studies have been performed using trained sensory panel analysis without investigating the underlying genetic factors. Here we investigate the relationship between genomic composition and this phenotypic trait by scanning for signatures of positive selection in whole-genome sequencing data. We generated genomes of 10 Berkshire pigs at a total of 100.6 coverage depth, using the Illumina Hiseq2000 platform. Along with the genomes of 11 Landrace and 13 Yorkshire pigs, we identified genomic variants of 18.9 million SNVs and 3.4 million Indels in the mapped regions. We identified several associated genes related to lipid metabolism, intramuscular fatty acid deposition, and muscle fiber type which attribute to pork quality (TG, FABP1, AKIRIN2, GLP2R, TGFBR3, JPH3, ICAM2, and ERN1) by applying between population statistical tests (XP-EHH and XP-CLR). A statistical enrichment test was also conducted to detect breed specific genetic variation. In addition, de novo short sequence read assembly strategy identified several candidate genes (SLC25A14, IGF1, PI4KA, CACNA1A) as also contributing to lipid metabolism. Results revealed several candidate genes involved in Berkshire meat quality; most of these genes are involved in lipid metabolism and intramuscular fat deposition. These results can provide a basis for future research on the genomic characteristics of Berkshire pigs.

  20. Incorporating information on predicted solvent accessibility to the co-evolution-based study of protein interactions.

    Science.gov (United States)

    Ochoa, David; García-Gutiérrez, Ponciano; Juan, David; Valencia, Alfonso; Pazos, Florencio

    2013-01-27

    A widespread family of methods for studying and predicting protein interactions using sequence information is based on co-evolution, quantified as similarity of phylogenetic trees. Part of the co-evolution observed between interacting proteins could be due to co-adaptation caused by inter-protein contacts. In this case, the co-evolution is expected to be more evident when evaluated on the surface of the proteins or the internal layers close to it. In this work we study the effect of incorporating information on predicted solvent accessibility to three methods for predicting protein interactions based on similarity of phylogenetic trees. We evaluate the performance of these methods in predicting different types of protein associations when trees based on positions with different characteristics of predicted accessibility are used as input. We found that predicted accessibility improves the results of two recent versions of the mirrortree methodology in predicting direct binary physical interactions, while it neither improves these methods, nor the original mirrortree method, in predicting other types of interactions. That improvement comes at no cost in terms of applicability since accessibility can be predicted for any sequence. We also found that predictions of protein-protein interactions are improved when multiple sequence alignments with a richer representation of sequences (including paralogs) are incorporated in the accessibility prediction.

  1. High quality methylome-wide investigations through next-generation sequencing of DNA from a single archived dry blood spot.

    Science.gov (United States)

    Aberg, Karolina A; Xie, Lin Y; Nerella, Srilaxmi; Copeland, William E; Costello, E Jane; van den Oord, Edwin J C G

    2013-05-01

    The potential importance of DNA methylation in the etiology of complex diseases has led to interest in the development of methylome-wide association studies (MWAS) aimed at interrogating all methylation sites in the human genome. When using blood as biomaterial for a MWAS the DNA is typically extracted directly from fresh or frozen whole blood that was collected via venous puncture. However, DNA extracted from dry blood spots may also be an alternative starting material. In the present study, we apply a methyl-CpG binding domain (MBD) protein enrichment-based technique in combination with next generation sequencing (MBD-seq) to assess the methylation status of the ~27 million CpGs in the human autosomal reference genome. We investigate eight methylomes using DNA from blood spots. This data are compared with 1,500 methylomes previously assayed with the same MBD-seq approach using DNA from whole blood. When investigating the sequence quality and the enrichment profile across biological features, we find that DNA extracted from blood spots gives comparable results with DNA extracted from whole blood. Only if the amount of starting material is ≤ 0.5µg DNA we observe a slight decrease in the assay performance. In conclusion, we show that high quality methylome-wide investigations using MBD-seq can be conducted in DNA extracted from archived dry blood spots without sacrificing quality and without bias in enrichment profile as long as the amount of starting material is sufficient. In general, the amount of DNA extracted from a single blood spot is sufficient for methylome-wide investigations with the MBD-seq approach.

  2. Quantitative phenotyping via deep barcode sequencing.

    Science.gov (United States)

    Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

    2009-10-01

    Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.

  3. Nursing Student Perceptions Regarding Simulation Experience Sequencing.

    Science.gov (United States)

    Woda, Aimee A; Gruenke, Theresa; Alt-Gehrman, Penny; Hansen, Jamie

    2016-09-01

    The use of simulated learning experiences (SLEs) have increased within nursing curricula with positive learning outcomes for nursing students. The purpose of this study is to explore nursing students' perceptions of their clinical decision making (CDM) related to the block sequencing of different patient care experiences, SLEs versus hospital-based learning experiences (HLEs). A qualitative descriptive design used open-ended survey questions to generate information about the block sequencing of SLEs and its impact on nursing students' perceived CDM. Three themes emerged from the data: Preexperience Anxiety, Real-Time Decision Making, and Increased Patient Care Experiences. Nursing students identified that having SLEs prior to HLEs provided several benefits. Even when students preferred SLEs prior to HLEs, the sequence did not impact their CDM. This suggests that alternating block sequencing can be used without impacting the students' perceptions of their ability to make decisions. [J Nurs Educ. 2016;55(9):528-532.]. Copyright 2016, SLACK Incorporated.

  4. Quality-Aware Estimation of Facial Landmarks in Video Sequences

    DEFF Research Database (Denmark)

    Haque, Mohammad Ahsanul; Nasrollahi, Kamal; Moeslund, Thomas B.

    2015-01-01

    Face alignment in video is a primitive step for facial image analysis. The accuracy of the alignment greatly depends on the quality of the face image in the video frames and low quality faces are proven to cause erroneous alignment. Thus, this paper proposes a system for quality aware face...... for facial landmark detection. If the face quality is low the proposed system corrects the facial landmarks that are detected by SDM. Depending upon the face velocity in consecutive video frames and face quality measure, two algorithms are proposed for correction of landmarks in low quality faces by using...

  5. Integrated evaluation of soil quality after the incorporation of organic matter and microorganisms

    Directory of Open Access Journals (Sweden)

    Valarini Pedro J.

    2002-01-01

    Full Text Available The soil quality was evaluated following the addition of organic matter and microorganisms to a clay loam soil collected in Aranjuez (Madrid under controlled conditions of temperature and moisture, and over a period of three months. The following treatments were carried out: soil (control; soil + 50 t/ha of animal manure (E50; soil + 50 t/ha of animal manure + 30l/ha of effective microorganisms (E50EM; soil + 30 t/ha of combination of various green crop residues and weeds (RC30 and soil + 30 t/ha of combination of various green crop residues and weeds + 30l/ha of effective microorganisms (RC30EM. The soil samples were taken before and after the incubation and analysed using physical, chemical and microbiological parameters. A significant increase in the production of polysaccharides and alkaline phosphatase and esterase enzymes in the treatments E50EM and RC30EM was observed, being in direct correlation with the humification of the organic matter, with the water retention at field capacity, and with the cationic exchange capacity (CEC. It can be concluded that the incorporation of microorganisms EM potentialized the soil biological activity and improved physico-chemical soil properties, contributing to a quick humification of fresh organic matter. Those findings were proved by microbiological activities of exopolysaccharides by alcaline phosphatase and esterase enzymes, which can be used as earlier and integral soil health indicators.

  6. The Vigna Genome Server, 'VigGS': A Genomic Knowledge Base of the Genus Vigna Based on High-Quality, Annotated Genome Sequence of the Azuki Bean, Vigna angularis (Willd.) Ohwi & Ohashi.

    Science.gov (United States)

    Sakai, Hiroaki; Naito, Ken; Takahashi, Yu; Sato, Toshiyuki; Yamamoto, Toshiya; Muto, Isamu; Itoh, Takeshi; Tomooka, Norihiko

    2016-01-01

    The genus Vigna includes legume crops such as cowpea, mungbean and azuki bean, as well as >100 wild species. A number of the wild species are highly tolerant to severe environmental conditions including high-salinity, acid or alkaline soil; drought; flooding; and pests and diseases. These features of the genus Vigna make it a good target for investigation of genetic diversity in adaptation to stressful environments; however, a lack of genomic information has hindered such research in this genus. Here, we present a genome database of the genus Vigna, Vigna Genome Server ('VigGS', http://viggs.dna.affrc.go.jp), based on the recently sequenced azuki bean genome, which incorporates annotated exon-intron structures, along with evidence for transcripts and proteins, visualized in GBrowse. VigGS also facilitates user construction of multiple alignments between azuki bean genes and those of six related dicot species. In addition, the database displays sequence polymorphisms between azuki bean and its wild relatives and enables users to design primer sequences targeting any variant site. VigGS offers a simple keyword search in addition to sequence similarity searches using BLAST and BLAT. To incorporate up to date genomic information, VigGS automatically receives newly deposited mRNA sequences of pre-set species from the public database once a week. Users can refer to not only gene structures mapped on the azuki bean genome on GBrowse but also relevant literature of the genes. VigGS will contribute to genomic research into plant biotic and abiotic stresses and to the future development of new stress-tolerant crops. © The Author 2015. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  7. Shared Investment Projects and Forecasting Errors: Setting Framework Conditions for Coordination and Sequencing Data Quality Activities

    Science.gov (United States)

    Leitner, Stephan; Brauneis, Alexander; Rausch, Alexandra

    2015-01-01

    In this paper, we investigate the impact of inaccurate forecasting on the coordination of distributed investment decisions. In particular, by setting up a computational multi-agent model of a stylized firm, we investigate the case of investment opportunities that are mutually carried out by organizational departments. The forecasts of concern pertain to the initial amount of money necessary to launch and operate an investment opportunity, to the expected intertemporal distribution of cash flows, and the departments’ efficiency in operating the investment opportunity at hand. We propose a budget allocation mechanism for coordinating such distributed decisions The paper provides guidance on how to set framework conditions, in terms of the number of investment opportunities considered in one round of funding and the number of departments operating one investment opportunity, so that the coordination mechanism is highly robust to forecasting errors. Furthermore, we show that—in some setups—a certain extent of misforecasting is desirable from the firm’s point of view as it supports the achievement of the corporate objective of value maximization. We then address the question of how to improve forecasting quality in the best possible way, and provide policy advice on how to sequence activities for improving forecasting quality so that the robustness of the coordination mechanism to errors increases in the best possible way. At the same time, we show that wrong decisions regarding the sequencing can lead to a decrease in robustness. Finally, we conduct a comprehensive sensitivity analysis and prove that—in particular for relatively good forecasters—most of our results are robust to changes in setting the parameters of our multi-agent simulation model. PMID:25803736

  8. Shared investment projects and forecasting errors: setting framework conditions for coordination and sequencing data quality activities.

    Science.gov (United States)

    Leitner, Stephan; Brauneis, Alexander; Rausch, Alexandra

    2015-01-01

    In this paper, we investigate the impact of inaccurate forecasting on the coordination of distributed investment decisions. In particular, by setting up a computational multi-agent model of a stylized firm, we investigate the case of investment opportunities that are mutually carried out by organizational departments. The forecasts of concern pertain to the initial amount of money necessary to launch and operate an investment opportunity, to the expected intertemporal distribution of cash flows, and the departments' efficiency in operating the investment opportunity at hand. We propose a budget allocation mechanism for coordinating such distributed decisions The paper provides guidance on how to set framework conditions, in terms of the number of investment opportunities considered in one round of funding and the number of departments operating one investment opportunity, so that the coordination mechanism is highly robust to forecasting errors. Furthermore, we show that-in some setups-a certain extent of misforecasting is desirable from the firm's point of view as it supports the achievement of the corporate objective of value maximization. We then address the question of how to improve forecasting quality in the best possible way, and provide policy advice on how to sequence activities for improving forecasting quality so that the robustness of the coordination mechanism to errors increases in the best possible way. At the same time, we show that wrong decisions regarding the sequencing can lead to a decrease in robustness. Finally, we conduct a comprehensive sensitivity analysis and prove that-in particular for relatively good forecasters-most of our results are robust to changes in setting the parameters of our multi-agent simulation model.

  9. Shared investment projects and forecasting errors: setting framework conditions for coordination and sequencing data quality activities.

    Directory of Open Access Journals (Sweden)

    Stephan Leitner

    Full Text Available In this paper, we investigate the impact of inaccurate forecasting on the coordination of distributed investment decisions. In particular, by setting up a computational multi-agent model of a stylized firm, we investigate the case of investment opportunities that are mutually carried out by organizational departments. The forecasts of concern pertain to the initial amount of money necessary to launch and operate an investment opportunity, to the expected intertemporal distribution of cash flows, and the departments' efficiency in operating the investment opportunity at hand. We propose a budget allocation mechanism for coordinating such distributed decisions The paper provides guidance on how to set framework conditions, in terms of the number of investment opportunities considered in one round of funding and the number of departments operating one investment opportunity, so that the coordination mechanism is highly robust to forecasting errors. Furthermore, we show that-in some setups-a certain extent of misforecasting is desirable from the firm's point of view as it supports the achievement of the corporate objective of value maximization. We then address the question of how to improve forecasting quality in the best possible way, and provide policy advice on how to sequence activities for improving forecasting quality so that the robustness of the coordination mechanism to errors increases in the best possible way. At the same time, we show that wrong decisions regarding the sequencing can lead to a decrease in robustness. Finally, we conduct a comprehensive sensitivity analysis and prove that-in particular for relatively good forecasters-most of our results are robust to changes in setting the parameters of our multi-agent simulation model.

  10. European external quality control study on the competence of laboratories to recognize rare sequence variants resulting in unusual genotyping results.

    Science.gov (United States)

    Márki-Zay, János; Klein, Christoph L; Gancberg, David; Schimmel, Heinz G; Dux, László

    2009-04-01

    Depending on the method used, rare sequence variants adjacent to the single nucleotide polymorphism (SNP) of interest may cause unusual or erroneous genotyping results. Because such rare variants are known for many genes commonly tested in diagnostic laboratories, we organized a proficiency study to assess their influence on the accuracy of reported laboratory results. Four external quality control materials were processed and sent to 283 laboratories through 3 EQA organizers for analysis of the prothrombin 20210G>A mutation. Two of these quality control materials contained sequence variants introduced by site-directed mutagenesis. One hundred eighty-nine laboratories participated in the study. When samples gave a usual result with the method applied, the error rate was 5.1%. Detailed analysis showed that more than 70% of the failures were reported from only 9 laboratories. Allele-specific amplification-based PCR had a much higher error rate than other methods (18.3% vs 2.9%). The variants 20209C>T and [20175T>G; 20179_20180delAC] resulted in unusual genotyping results in 67 and 85 laboratories, respectively. Eighty-three (54.6%) of these unusual results were not recognized, 32 (21.1%) were attributed to technical issues, and only 37 (24.3%) were recognized as another sequence variant. Our findings revealed that some of the participating laboratories were not able to recognize and correctly interpret unusual genotyping results caused by rare SNPs. Our study indicates that the majority of the failures could be avoided by improved training and careful selection and validation of the methods applied.

  11. Genotyping by sequencing for SNP-based linkage analysis and identification of QTLs linked to fruit quality traits in Japanese plum (Prunus salicina Lindl.)

    Science.gov (United States)

    Marker-assisted selection (MAS) in stone fruit (Prunus species) breeding is currently difficult to achieve due to the polygenic nature of themost relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for finemapp...

  12. Solid-Phase Synthesis of Smac Peptidomimetics Incorporating Triazoloprolines and Biarylalanines

    DEFF Research Database (Denmark)

    Le Quement, Sebastian T.; Ishoey, Mette; Petersen, Mette T.

    2011-01-01

    by deactivating proteolytic caspases. The Smac protein has an antagonistic effect on IAPs, thus providing structural clues for the synthesis of new pro-apoptotic compounds. Herein, we report a solid-phase approach for the synthesis of Smac-derived tetrapeptide libraries. On the basis of a common (N......-Me)AVPF sequence, peptides incorporating triazoloprolines and biarylalanines were synthesized by means of Cu(I)-catalyzed azide–alkyne cycloaddition and Pd-catalyzed Suzuki cross-coupling reactions. Solid-phase procedures were optimized to high efficiency, thus accessing all products in excellent crude purities...... and yields (both typically above 90%). The peptides were subjected to biological evaluation in a live/dead cellular assay which revealed that structural decorations on the AVPF sequence indeed are highly important for cytotoxicity toward HeLa cells....

  13. TrueFisp versus HASTE sequences in 3T cine MRI: Evaluation of image quality during phonation in patients with velopharyngeal insufficiency

    International Nuclear Information System (INIS)

    Kulinna-Cosentini, Christiane; Czerny, Christian; Weber, Michael; Baumann, Arnulf; Sinko, Klaus

    2016-01-01

    To evaluate the image quality of two fast dynamic magnetic resonance imaging (MRI) sequences: True fast imaging with steady state precession (TrueFisp) was compared with half-Fourier acquired single turbo-spin-echo (HASTE) sequence for the characterization of velopharyngeal insufficiency (VPI) in repaired cleft palate patients. Twenty-two patients (10 female and 12 male; mean age, 17.7 ± 10.6 years; range, 9-31) with suspected VPI underwent 3-T MRI using TrueFisp and HASTE sequences. Imaging was performed in the sagittal plane at rest and during phonation of ''ee'' and ''k'' to assess the velum, tongue, posterior pharyngeal wall and a potential VP closure. The results were analysed independently by one radiologist and one orthodontist. HASTE performed better than TrueFisp for all evaluated items, except the tongue evaluation by the orthodontist during phonation of ''k'' and ''ee''. A statistically significant difference in favour of HASTE was observed in assessing the velum at rest and during phonation of ''k'' and ''ee'', and also in assessing VP closure in both raters (p < 0.05). TrueFisp imaging was twice as fast as HASTE (0.36 vs. 0.75 s/image). Dynamic HASTE images were of superior quality to those obtained with TrueFisp, although TrueFisp imaging was twice as fast. (orig.)

  14. Applying Ancestry and Sex Computation as a Quality Control Tool in Targeted Next-Generation Sequencing.

    Science.gov (United States)

    Mathias, Patrick C; Turner, Emily H; Scroggins, Sheena M; Salipante, Stephen J; Hoffman, Noah G; Pritchard, Colin C; Shirts, Brian H

    2016-03-01

    To apply techniques for ancestry and sex computation from next-generation sequencing (NGS) data as an approach to confirm sample identity and detect sample processing errors. We combined a principal component analysis method with k-nearest neighbors classification to compute the ancestry of patients undergoing NGS testing. By combining this calculation with X chromosome copy number data, we determined the sex and ancestry of patients for comparison with self-report. We also modeled the sensitivity of this technique in detecting sample processing errors. We applied this technique to 859 patient samples with reliable self-report data. Our k-nearest neighbors ancestry screen had an accuracy of 98.7% for patients reporting a single ancestry. Visual inspection of principal component plots was consistent with self-report in 99.6% of single-ancestry and mixed-ancestry patients. Our model demonstrates that approximately two-thirds of potential sample swaps could be detected in our patient population using this technique. Patient ancestry can be estimated from NGS data incidentally sequenced in targeted panels, enabling an inexpensive quality control method when coupled with patient self-report. © American Society for Clinical Pathology, 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  15. Effect of alginate/carboxyl methyl cellulose composite coating incorporated with clove essential oil on the quality of silver carp fillet and Escherichia coli O157:H7 inhibition during refrigerated storage

    OpenAIRE

    Jalali, Nastaran; Ariiai, Peiman; Fattahi, Esmaeil

    2015-01-01

    The effects of alginate/carboxyl methylcellulose composite coating incorporated with clove essential oil on quality of silver carp fillet chilled storage (4 + 1 °C) were examined over a period of 16 days. The control samples (c), alginate/carboxyl methylcellulose coating (C-A), alginate/carboxyl methylcellulose composite coating incorporated with clove essential oil (with different concentration 1 and 1.5 %) (C-A + CEO1 % and C-A + CEO 15 % respectively) were analyzed by bacteriological (tota...

  16. Genomic multiple sequence alignments: refinement using a genetic algorithm

    Directory of Open Access Journals (Sweden)

    Lefkowitz Elliot J

    2005-08-01

    Full Text Available Abstract Background Genomic sequence data cannot be fully appreciated in isolation. Comparative genomics – the practice of comparing genomic sequences from different species – plays an increasingly important role in understanding the genotypic differences between species that result in phenotypic differences as well as in revealing patterns of evolutionary relationships. One of the major challenges in comparative genomics is producing a high-quality alignment between two or more related genomic sequences. In recent years, a number of tools have been developed for aligning large genomic sequences. Most utilize heuristic strategies to identify a series of strong sequence similarities, which are then used as anchors to align the regions between the anchor points. The resulting alignment is globally correct, but in many cases is suboptimal locally. We describe a new program, GenAlignRefine, which improves the overall quality of global multiple alignments by using a genetic algorithm to improve local regions of alignment. Regions of low quality are identified, realigned using the program T-Coffee, and then refined using a genetic algorithm. Because a better COFFEE (Consistency based Objective Function For alignmEnt Evaluation score generally reflects greater alignment quality, the algorithm searches for an alignment that yields a better COFFEE score. To improve the intrinsic slowness of the genetic algorithm, GenAlignRefine was implemented as a parallel, cluster-based program. Results We tested the GenAlignRefine algorithm by running it on a Linux cluster to refine sequences from a simulation, as well as refine a multiple alignment of 15 Orthopoxvirus genomic sequences approximately 260,000 nucleotides in length that initially had been aligned by Multi-LAGAN. It took approximately 150 minutes for a 40-processor Linux cluster to optimize some 200 fuzzy (poorly aligned regions of the orthopoxvirus alignment. Overall sequence identity increased only

  17. The effects of oat β-glucan incorporation on the quality, structure, consumer acceptance and glycaemic response of steamed bread.

    Science.gov (United States)

    Wang, Lei; Ye, Fayin; Feng, Liyuan; Wei, Fubin; Zhao, Guohua

    2017-12-01

    The objective of this study was to evaluate the effects of the incorporation of oat β-glucan (OβG) on the sensory and nutritional quality of steamed bread. Wheat flour was substituted with OβG at levels varying from 0 g/100 g to 5 g/100 g. The results showed that the products containing 1 g/100 g and 3 g/100 g OβG produced a comparable overall consumer acceptance while a significantly lower score was given to the product with 5 g/100 g. Nutritionally, the presence of OβG impeded in vitro starch hydrolysis by amylolytic enzymes. More importantly, the addition of OβG up to 5 g/100 g did bring about a lower in vitro predicted glycaemic index to steamed bread, but it generated insignificant effects on the in vivo glycaemic response. The current work first demonstrated the feasibility of fabricating OβG-enriched steamed bread and its nutritional superiority compared to the corresponding normal product. Modern methods of food elaboration and processing frequently reduce the content of dietary fiber (DF). Despite the well-known health benefits of DF consumption, average intake levels still fall far below recommended ones. Oat β-glucan (OβG) is a kind of indigestible polysaccharide with diverse bioactivity. This article evaluated the effects of OβG incorporation on quality, structure, consumer acceptance, and glycaemic response of steamed bread. The addition of less than 3 g/100 g of OβG had negligible effects on the consumer acceptance of steamed bread. The enrichment at 5 g/100 g indeed deteriorated the consumer acceptance. Moreover, OβG highly lowered the glycaemic response of steamed bread. The current work first demonstrated the feasibility of fabricating OβG-enriched steamed bread and its nutritional superiority compared to the corresponding normal product. OβG enriched steamed bread may offer an alternative to improve DF intake of residents. © 2017 Wiley Periodicals, Inc.

  18. What can we learn about lyssavirus genomes using 454 sequencing?

    Science.gov (United States)

    Höper, Dirk; Finke, Stefan; Freuling, Conrad M; Hoffmann, Bernd; Beer, Martin

    2012-01-01

    The main task of the individual project number four"Whole genome sequencing, virus-host adaptation, and molecular epidemiological analyses of lyssaviruses "within the network" Lyssaviruses--a potential re-emerging public health threat" is to provide high quality complete genome sequences from lyssaviruses. These sequences are analysed in-depth with regard to the diversity of the viral populations as to both quasi-species and so-called defective interfering RNAs. Moreover, the sequence data will facilitate further epidemiological analyses, will provide insight into the evolution of lyssaviruses and will be the basis for the design of novel nucleic acid based diagnostics. The first results presented here indicate that not only high quality full-length lyssavirus genome sequences can be generated, but indeed efficient analysis of the viral population gets feasible.

  19. Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.

    Science.gov (United States)

    Spielman, Stephanie J; Wilke, Claus O

    2015-01-01

    We introduce Pyvolve, a flexible Python module for simulating genetic data along a phylogeny using continuous-time Markov models of sequence evolution. Easily incorporated into Python bioinformatics pipelines, Pyvolve can simulate sequences according to most standard models of nucleotide, amino-acid, and codon sequence evolution. All model parameters are fully customizable. Users can additionally specify custom evolutionary models, with custom rate matrices and/or states to evolve. This flexibility makes Pyvolve a convenient framework not only for simulating sequences under a wide variety of conditions, but also for developing and testing new evolutionary models. Pyvolve is an open-source project under a FreeBSD license, and it is available for download, along with a detailed user-manual and example scripts, from http://github.com/sjspielman/pyvolve.

  20. Magnetic Resonance Cholangiopancreatography: Image Quality, Ductal Morphology, and Value of Additional T2- and T1-weighted Sequences for the Assessment of Suspected Pancreatic Cancer

    International Nuclear Information System (INIS)

    Lopez Haenninen, E.; Ricke, H.; Amthauer, H.; Roettgen, R.; Boehmig, M.; Langrehr, J.; Pech, M.; Denecke, T.; Rosewicz, S.; Felix, R.

    2005-01-01

    Purpose: To assess image quality and duct morphology on magnetic resonance cholangiopancreatography (MRCP) and also the value of additional T2- and T1-weighted sequences for differentiation of benignity and malignancy in patients with suspected pancreatic tumors. Material and Methods: One-hundred-and-fourteen patients received MRCP and unenhanced and contrast material-enhanced MR imaging. MR results were analyzed independently by two blinded readers, and subsequently correlated with the results from surgery, biopsy, and follow-up. Assessment included the evaluation of image quality, duct visualization and morphology, and the differentiation of pancreatic lesion status (benign versus malignant).Results: Overall, 49 patients had benign final diagnoses, while 65 had a malignant diagnosis. Image quality of single-shot thick-slab MRCP was rated significantly better than the MIP images of multisection MRCP. With MRCP alone, the two readers' accuracy in the assessment of pancreatic lesion status was 72% (95% CI, 64% to 83%) and 69% (95% CI, 56% to 77%), respectively; with the addition of T2- and T1-weighted images the accuracy significantly improved to 89% (95% CI, 82% to 95%) and 84% (95% CI, 77% to 92%) for readers 1 and 2, respectively. Conclusion: Single-shot thick-slab MRCP and multisection MRCP provide complementary results; however, single-shot MRCP had superior image quality. Moreover, assessment of ductal morphology with MRCP alone facilitated the diagnosis of different pathologic conditions of the pancreatobiliary system in the majority of patients. However, with the addition of T2- and T1-weighted sequences the overall diagnostic accuracy was significantly improved and thus we consider that a comprehensive MR approach should comprise both MRCP techniques and parenchymal sequences

  1. Genome sequence of the thermotolerant foodborne pathogen Salmonella enterica serovar Senftenberg ATCC 43845 and phylogenetic analysis of Loci encoding increased protein quality control mechanisms

    Science.gov (United States)

    Salmonella enterica subsp. enterica bacteria are important foodborne pathogens with major economic impact. Some isolates exhibit increased heat tolerance, a concern for food safety. Analysis of a finished-quality genome sequence of an isolate commonly used in heat resistance studies, S. enterica sub...

  2. Topological disposition of the sequences -QRKIVE- and -KETYY in native (Na+ + K+)-ATPase

    International Nuclear Information System (INIS)

    Bayer, R.

    1990-01-01

    The dispositions with respect to the plane of the membrane of lysine-905 in the internal sequence -EQRKIVE- and of lysine-1012 in the carboxy-terminal sequence -RRPGGWVEKETYY of the α-polypeptide of sodium and potassium ion activated adenosinetriphosphatase have been determined. These lysines are found in peptides released from the intact α-polypeptide by the extracellular protease from Staphylococcus aureus strain V8 and by trypsin, respectively. Synthetic peptides containing terminal sequences of these were used to prepare polyclonal antibodies, which were then used to prepare immunoadsorbents directed against the respective peptides. Sealed, right-side-out membrane vesicles containing native (Na + + K + )-ATPase were labeled with pyridoxal phosphate and sodium [ 3 H]borohydride in the absence or presence of saponin. The labeled α-polypeptide was isolated from these vesicles and digested with appropriate proteases. The incorporation of radioactivity into the peptides binding to the immunoadsorbent directed against the sequence pyrERXIVE increased 3-fold int the presence of saponin as a result of the increased accessibility of this portion of the protein to the reagent when the vesicles were breached by saponin; hence, this sequence is located on the cytoplasmic face of the membrane. It was inferred that the carboxy-terminal sequence -KETYY is on the extracytoplasmic face since the incorporation of radioactivity into peptides binding to the immunoadsorbent directed against the sequence -ETYY did not change when the vesicles were breached with saponin

  3. Next-Generation Sequencing Workflow for NSCLC Critical Samples Using a Targeted Sequencing Approach by Ion Torrent PGM™ Platform.

    Science.gov (United States)

    Vanni, Irene; Coco, Simona; Truini, Anna; Rusmini, Marta; Dal Bello, Maria Giovanna; Alama, Angela; Banelli, Barbara; Mora, Marco; Rijavec, Erika; Barletta, Giulia; Genova, Carlo; Biello, Federica; Maggioni, Claudia; Grossi, Francesco

    2015-12-03

    Next-generation sequencing (NGS) is a cost-effective technology capable of screening several genes simultaneously; however, its application in a clinical context requires an established workflow to acquire reliable sequencing results. Here, we report an optimized NGS workflow analyzing 22 lung cancer-related genes to sequence critical samples such as DNA from formalin-fixed paraffin-embedded (FFPE) blocks and circulating free DNA (cfDNA). Snap frozen and matched FFPE gDNA from 12 non-small cell lung cancer (NSCLC) patients, whose gDNA fragmentation status was previously evaluated using a multiplex PCR-based quality control, were successfully sequenced with Ion Torrent PGM™. The robust bioinformatic pipeline allowed us to correctly call both Single Nucleotide Variants (SNVs) and indels with a detection limit of 5%, achieving 100% specificity and 96% sensitivity. This workflow was also validated in 13 FFPE NSCLC biopsies. Furthermore, a specific protocol for low input gDNA capable of producing good sequencing data with high coverage, high uniformity, and a low error rate was also optimized. In conclusion, we demonstrate the feasibility of obtaining gDNA from FFPE samples suitable for NGS by performing appropriate quality controls. The optimized workflow, capable of screening low input gDNA, highlights NGS as a potential tool in the detection, disease monitoring, and treatment of NSCLC.

  4. Levenshtein error-correcting barcodes for multiplexed DNA sequencing

    NARCIS (Netherlands)

    Buschmann, Tilo; Bystrykh, Leonid V.

    2013-01-01

    Background: High-throughput sequencing technologies are improving in quality, capacity and costs, providing versatile applications in DNA and RNA research. For small genomes or fraction of larger genomes, DNA samples can be mixed and loaded together on the same sequencing track. This so-called

  5. Incorporating personalized gene sequence variants, molecular genetics knowledge, and health knowledge into an EHR prototype based on the Continuity of Care Record standard

    Science.gov (United States)

    Jing, Xia; Kay, Stephen; Marley, Tom; Hardiker, Nicholas R.; Cimino, James J.

    2011-01-01

    Summary Objectives The current volume and complexity of genetic tests, and the molecular genetics knowledge and health knowledge related to interpretation of the results of those tests, are rapidly outstripping the ability of individual clinicians to recall, understand and convey to their patients information relevant to their care. The tailoring of molecular genetics knowledge and health knowledge in clinical settings is important both for the provision of personalized medicine and to reduce clinician information overload. In this paper we describe the incorporation, customization and demonstration of molecular genetic data (mainly sequence variants), molecular genetics knowledge and health knowledge into a standards-based electronic health record (EHR) prototype developed specifically for this study. Methods We extended the CCR (Continuity of Care Record), an existing EHR standard for representing clinical data, to include molecular genetic data. An EHR prototype was built based on the extended CCR and designed to display relevant molecular genetics knowledge and health knowledge from an existing knowledge base for cystic fibrosis (OntoKBCF). We reconstructed test records from published case reports and represented them in the CCR schema. We then used the EHR to dynamically filter molecular genetics knowledge and health knowledge from OntoKBCF using molecular genetic data and clinical data from the test cases. Results The molecular genetic data were successfully incorporated in the CCR by creating a category of laboratory results called “Molecular Genetics ” and specifying a particular class of test (“Gene Mutation Test”) in this category. Unlike other laboratory tests reported in the CCR, results of tests in this class required additional attributes (“Molecular Structure” and “Molecular Position”) to support interpretation by clinicians. These results, along with clinical data (age, sex, ethnicity, diagnostic procedures, and therapies) were used

  6. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  7. Infrared absorption study of hydrogen incorporation in thick nanocrystalline diamond films

    International Nuclear Information System (INIS)

    Tang, C.J.; Neves, A.J.; Carmo, M.C.

    2005-01-01

    We present an infrared (IR) optical absorbance study of hydrogen incorporation in nanocrystalline diamond films. The thick nanocrystalline diamond films were synthesized by microwave plasma-assisted chemical vapor deposition and a high growth rate about 3.0 μm/h was achieved. The morphology, phase quality, and hydrogen incorporation were assessed by means of scanning electron microscopy, Raman spectroscopy, and Fourier-transform infrared spectroscopy (FTIR). Large amount of hydrogen bonded to nanocrystalline diamond is clearly evidenced by the huge CH stretching band in the FTIR spectrum. The mechanism of hydrogen incorporation is discussed in light of the growth mechanism of nanocrystalline diamond. This suggests the potential of nanocrystalline diamond for IR electro-optical device applications

  8. Protecting genomic sequence anonymity with generalization lattices.

    Science.gov (United States)

    Malin, B A

    2005-01-01

    Current genomic privacy technologies assume the identity of genomic sequence data is protected if personal information, such as demographics, are obscured, removed, or encrypted. While demographic features can directly compromise an individual's identity, recent research demonstrates such protections are insufficient because sequence data itself is susceptible to re-identification. To counteract this problem, we introduce an algorithm for anonymizing a collection of person-specific DNA sequences. The technique is termed DNA lattice anonymization (DNALA), and is based upon the formal privacy protection schema of k -anonymity. Under this model, it is impossible to observe or learn features that distinguish one genetic sequence from k-1 other entries in a collection. To maximize information retained in protected sequences, we incorporate a concept generalization lattice to learn the distance between two residues in a single nucleotide region. The lattice provides the most similar generalized concept for two residues (e.g. adenine and guanine are both purines). The method is tested and evaluated with several publicly available human population datasets ranging in size from 30 to 400 sequences. Our findings imply the anonymization schema is feasible for the protection of sequences privacy. The DNALA method is the first computational disclosure control technique for general DNA sequences. Given the computational nature of the method, guarantees of anonymity can be formally proven. There is room for improvement and validation, though this research provides the groundwork from which future researchers can construct genomics anonymization schemas tailored to specific datasharing scenarios.

  9. Database-driven primary analysis of raw sequencing data

    DEFF Research Database (Denmark)

    2014-01-01

    The present invention relates to methods for identifying the source of a biological sequence containing sample from raw sequencing reads. The method may be used to identify the source of unknown DNA and can be used for diagnostic, biodefense, food safety and quality, and hygiene applications...

  10. Golden Ratio Versus Pi as Random Sequence Sources for Monte Carlo Integration

    Science.gov (United States)

    Sen, S. K.; Agarwal, Ravi P.; Shaykhian, Gholam Ali

    2007-01-01

    We discuss here the relative merits of these numbers as possible random sequence sources. The quality of these sequences is not judged directly based on the outcome of all known tests for the randomness of a sequence. Instead, it is determined implicitly by the accuracy of the Monte Carlo integration in a statistical sense. Since our main motive of using a random sequence is to solve real world problems, it is more desirable if we compare the quality of the sequences based on their performances for these problems in terms of quality/accuracy of the output. We also compare these sources against those generated by a popular pseudo-random generator, viz., the Matlab rand and the quasi-random generator ha/ton both in terms of error and time complexity. Our study demonstrates that consecutive blocks of digits of each of these numbers produce a good random sequence source. It is observed that randomly chosen blocks of digits do not have any remarkable advantage over consecutive blocks for the accuracy of the Monte Carlo integration. Also, it reveals that pi is a better source of a random sequence than theta when the accuracy of the integration is concerned.

  11. Perceived Quality of Full HD Video - Subjective Quality Assessment

    Directory of Open Access Journals (Sweden)

    Juraj Bienik

    2016-01-01

    Full Text Available In recent years, an interest in multimedia services has become a global trend and this trend is still rising. The video quality is a very significant part from the bundle of multimedia services, which leads to a requirement for quality assessment in the video domain. Video quality of a streamed video across IP networks is generally influenced by two factors “transmission link imperfection and efficiency of compression standards. This paper deals with subjective video quality assessment and the impact of the compression standards H.264, H.265 and VP9 on perceived video quality of these compression standards. The evaluation is done for four full HD sequences, the difference of scenes is in the content“ distinction is based on Spatial (SI and Temporal (TI Index of test sequences. Finally, experimental results follow up to 30% bitrate reducing of H.265 and VP9 compared with the reference H.264.

  12. Effectiveness of reinforcement incorporation and mechanical properties of composites produced from rheocast slurries

    Energy Technology Data Exchange (ETDEWEB)

    Robert, M.H.; Galvao da Silveira Mussi, R. [Faculty of Mechanical Engineering, State Univ. of Campinas, Cidade Univ. (Brazil)

    2002-07-01

    The influence of compocasting variables (SiC size and %vol, additions of grain refiner and extra Mg) in the quality of A356+SiCp MMCs is investigated. In all conditions microstructures show Al-{alpha} with globular morphology with eutectic and SiC in interglobular regions. Results show tendency of agglomeration of thinner particles, jeopardising their incorporation and dispersion in the matrix; high quantities of SiC are not incorporated into rheocast slurries with refined globules, grain refinement decreases incorporation of bigger particles. Extra Mg does not influence incorporation. Best results are obtained for 15% vol of 30 {mu}m SiC added to grain refined matrix. Mechanical properties are poor when agglomeration of particles is present. (orig.)

  13. Overview of BWR Severe Accident Sequence Analyses at Oak Ridge National Laboratory

    International Nuclear Information System (INIS)

    Hodge, S.A.

    1983-01-01

    Since its inception in October 1980, the Severe Accident Sequence Analysis (SASA) program at Oak Ridge National Laboratory (ORNL) has completed four studies including Station Blackout, Scram Discharge Volume Break, Loss of Decay Heat Removal, and Loss of Injection accident sequences for the Browns Ferry Nuclear Plant. The accident analyses incorporated in a SASA study provide much greater detail than that practically achievable in a Probabilistic Risk Assessment (PRA). When applied to the candidate dominant accident sequences identified by a PRA, the detailed SASA results determine if factors neglected by the PRA would have a significant effect on the order of dominant sequences. Ongoing SASA work at ORNL involves the analysis of Anticipated Transients Without Scram (ATWS) sequences for Browns Ferry

  14. Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.

    Directory of Open Access Journals (Sweden)

    Stephanie J Spielman

    Full Text Available We introduce Pyvolve, a flexible Python module for simulating genetic data along a phylogeny using continuous-time Markov models of sequence evolution. Easily incorporated into Python bioinformatics pipelines, Pyvolve can simulate sequences according to most standard models of nucleotide, amino-acid, and codon sequence evolution. All model parameters are fully customizable. Users can additionally specify custom evolutionary models, with custom rate matrices and/or states to evolve. This flexibility makes Pyvolve a convenient framework not only for simulating sequences under a wide variety of conditions, but also for developing and testing new evolutionary models. Pyvolve is an open-source project under a FreeBSD license, and it is available for download, along with a detailed user-manual and example scripts, from http://github.com/sjspielman/pyvolve.

  15. 75 FR 65572 - Approval and Promulgation of Air Quality Implementation Plans; Ohio; Ohio Ambient Air Quality...

    Science.gov (United States)

    2010-10-26

    ... Promulgation of Air Quality Implementation Plans; Ohio; Ohio Ambient Air Quality Standards AGENCY... Ohio Administrative Code (OAC) relating to the consolidation of Ohio's Ambient Air Quality Standards... apply to Ohio's SIP. Incorporating the air quality standards into Ohio's SIP helps assure that...

  16. Automatic knowledge extraction in sequencing analysis with multiagent system and grid computing.

    Science.gov (United States)

    González, Roberto; Zato, Carolina; Benito, Rocío; Bajo, Javier; Hernández, Jesús M; De Paz, Juan F; Vera, Vicente; Corchado, Juan M

    2012-12-01

    Advances in bioinformatics have contributed towards a significant increase in available information. Information analysis requires the use of distributed computing systems to best engage the process of data analysis. This study proposes a multiagent system that incorporates grid technology to facilitate distributed data analysis by dynamically incorporating the roles associated to each specific case study. The system was applied to genetic sequencing data to extract relevant information about insertions, deletions or polymorphisms.

  17. Automatic knowledge extraction in sequencing analysis with multiagent system and grid computing

    Directory of Open Access Journals (Sweden)

    González Roberto

    2012-12-01

    Full Text Available Advances in bioinformatics have contributed towards a significant increase in available information. Information analysis requires the use of distributed computing systems to best engage the process of data analysis. This study proposes a multiagent system that incorporates grid technology to facilitate distributed data analysis by dynamically incorporating the roles associated to each specific case study. The system was applied to genetic sequencing data to extract relevant information about insertions, deletions or polymorphisms.

  18. Analysis of the cement clinker produced with incorporation of petroleum sludge

    Science.gov (United States)

    Benlamoudi, A.; Kadir, A. Abdul; Khodja, M.; Nuruddin, M. F.

    2018-04-01

    Very limited researches have been conducted on the incorporation of petroleum sludge waste into cement clinker production even though this waste may contain similar components to those of clinker raw materials. In this research, petroleum sludge was integrated into cement plant as raw material to produce the cement clinker. As results, incorporation of 5% of this waste was able to produce an acceptable quality of cement. Despite the use of petroleum sludge has decreased the properties of the produced clinker, but it still fit the requirements.

  19. Evaluating Methods for Isolating Total RNA and Predicting the Success of Sequencing Phylogenetically Diverse Plant Transcriptomes

    Science.gov (United States)

    Bruskiewich, Richard; Burris, Jason N.; Carrigan, Charlotte T.; Chase, Mark W.; Clarke, Neil D.; Covshoff, Sarah; dePamphilis, Claude W.; Edger, Patrick P.; Goh, Falicia; Graham, Sean; Greiner, Stephan; Hibberd, Julian M.; Jordon-Thaden, Ingrid; Kutchan, Toni M.; Leebens-Mack, James; Melkonian, Michael; Miles, Nicholas; Myburg, Henrietta; Patterson, Jordan; Pires, J. Chris; Ralph, Paula; Rolf, Megan; Sage, Rowan F.; Soltis, Douglas; Soltis, Pamela; Stevenson, Dennis; Stewart, C. Neal; Surek, Barbara; Thomsen, Christina J. M.; Villarreal, Juan Carlos; Wu, Xiaolei; Zhang, Yong; Deyholos, Michael K.; Wong, Gane Ka-Shu

    2012-01-01

    Next-generation sequencing plays a central role in the characterization and quantification of transcriptomes. Although numerous metrics are purported to quantify the quality of RNA, there have been no large-scale empirical evaluations of the major determinants of sequencing success. We used a combination of existing and newly developed methods to isolate total RNA from 1115 samples from 695 plant species in 324 families, which represents >900 million years of phylogenetic diversity from green algae through flowering plants, including many plants of economic importance. We then sequenced 629 of these samples on Illumina GAIIx and HiSeq platforms and performed a large comparative analysis to identify predictors of RNA quality and the diversity of putative genes (scaffolds) expressed within samples. Tissue types (e.g., leaf vs. flower) varied in RNA quality, sequencing depth and the number of scaffolds. Tissue age also influenced RNA quality but not the number of scaffolds ≥1000 bp. Overall, 36% of the variation in the number of scaffolds was explained by metrics of RNA integrity (RIN score), RNA purity (OD 260/230), sequencing platform (GAIIx vs HiSeq) and the amount of total RNA used for sequencing. However, our results show that the most commonly used measures of RNA quality (e.g., RIN) are weak predictors of the number of scaffolds because Illumina sequencing is robust to variation in RNA quality. These results provide novel insight into the methods that are most important in isolating high quality RNA for sequencing and assembling plant transcriptomes. The methods and recommendations provided here could increase the efficiency and decrease the cost of RNA sequencing for individual labs and genome centers. PMID:23185583

  20. Oxford Nanopore MinION Sequencing and Genome Assembly

    Directory of Open Access Journals (Sweden)

    Hengyun Lu

    2016-10-01

    Full Text Available The revolution of genome sequencing is continuing after the successful second-generation sequencing (SGS technology. The third-generation sequencing (TGS technology, led by Pacific Biosciences (PacBio, is progressing rapidly, moving from a technology once only capable of providing data for small genome analysis, or for performing targeted screening, to one that promises high quality de novo assembly and structural variation detection for human-sized genomes. In 2014, the MinION, the first commercial sequencer using nanopore technology, was released by Oxford Nanopore Technologies (ONT. MinION identifies DNA bases by measuring the changes in electrical conductivity generated as DNA strands pass through a biological pore. Its portability, affordability, and speed in data production makes it suitable for real-time applications, the release of the long read sequencer MinION has thus generated much excitement and interest in the genomics community. While de novo genome assemblies can be cheaply produced from SGS data, assembly continuity is often relatively poor, due to the limited ability of short reads to handle long repeats. Assembly quality can be greatly improved by using TGS long reads, since repetitive regions can be easily expanded into using longer sequencing lengths, despite having higher error rates at the base level. The potential of nanopore sequencing has been demonstrated by various studies in genome surveillance at locations where rapid and reliable sequencing is needed, but where resources are limited.

  1. Synthesis, Characterizations, and Applications of Metal-Ions Incorporated High Quality MCM-41 Catalysts

    International Nuclear Information System (INIS)

    Lim, Steven S.; Haller, Gary L.

    2013-01-01

    Various metal ions (transition and base metals) incorporated MCM-41 catalysts can be synthesized using colloidal and soluble silica with non-sodium involved process. Transition metal ion-typically V 5+ , Co 2+ , and Ni 2+ -incorporated MCM-41 catalysts were synthesized by isomorphous substitution of Si ions in the framework. Each incorporated metal ion created a single species in the silica framework, single-site solid catalyst, showing a substantial stability in reduction and catalytic activity. Radius of pore curvature effect was investigated with Co-MCM-41 by temperature programmed reduction (TPR). The size of metallic Co clusters, sub-nanometer, could be controlled by a proper reduction treatment of Co-MCM-41 having different pore size and the initial pH adjustment of the Co-MCM-41 synthesis solution. These small metallic clusters showed a high stability under a harsh reaction condition without serious migration, resulting from a direct anchoring of small metallic clusters to the partially or unreduced metal ions on the surface. After a complete reduction, partial occlusion of the metallic cluster surface by amorphous silica stabilized the particles against aggregations. As a probe reaction of particle size sensitivity, carbon single wall nanotubes (SWNT) were synthesized using Co-MCM-41. A metallic cluster stability test was performed by CO methanation using Co- and Ni-MCM-41. Methanol and methane partial oxidations were carried out with V-MCM-41, and the radius of pore curvature effect on the catalytic activity was investigated

  2. Quality indicators for all dimensions of infertility care quality: consensus between professionals and patients

    NARCIS (Netherlands)

    Dancet, E.A.; D'Hooghe, T.M.; Spiessens, C.; Sermeus, W.; Neubourg, D. De; Karel, N.; Kremer, J.A.M.; Nelen, W.L.D.M.

    2013-01-01

    STUDY QUESTION: What is the relative importance of the six dimensions of quality of care according to different stakeholders and can a quality indicator set address all six quality dimensions and incorporate the views from professionals working in different disciplines and from patients? SUMMARY

  3. Tub-Tag Labeling; Chemoenzymatic Incorporation of Unnatural Amino Acids.

    Science.gov (United States)

    Helma, Jonas; Leonhardt, Heinrich; Hackenberger, Christian P R; Schumacher, Dominik

    2018-01-01

    Tub-tag labeling is a chemoenzymatic method that enables the site-specific labeling of proteins. Here, the natural enzyme tubulin tyrosine ligase incorporates noncanonical tyrosine derivatives to the terminal carboxylic acid of proteins containing a 14-amino acid recognition sequence called Tub-tag. The tyrosine derivative carries a unique chemical reporter allowing for a subsequent bioorthogonal modification of proteins with a great variety of probes. Here, we describe the Tub-tag protein modification protocol in detail and explain its utilization to generate labeled proteins for advanced applications in cell biology, imaging, and diagnostics.

  4. Scheduling a Single Mobile Robot Incorporated into Production Environment

    DEFF Research Database (Denmark)

    Dang, Vinh Quang; Nielsen, Izabela Ewa; Steger-Jensen, Kenn

    2013-01-01

    to the challenges of issues such as energy conservation and pollution preventions. Facing the central tension between manufacturing and environmental drivers is difficult, but critical to develop new technologies, particularly mobile robots, that can be incorporated into production to achieve holistic solutions....... This chapter deals with the problem of finding optimal operating sequence in a manufacturing cell of a mobile robot with manipulation arm that feeds materials to feeders. The “Bartender Concept” is discussed to show the cooperation between the mobile robot and industrial environment. The performance criterion...

  5. On the relationship between perceptual impact of source and channel distortions in video sequences

    DEFF Research Database (Denmark)

    Korhonen, Jari; Reiter, Ulrich; You, Junyong

    2010-01-01

    It is known that peak signal-to-noise ratio (PSNR) can be used for assessing the relative qualities of distorted video sequences meaningfully only if the compared sequences contain similar types of distortions. In this paper, we propose a model for rough assessment of the bias in PSNR results, when...... video sequences with both channel and source distortion are compared against video sequences with source distortion only. The proposed method can be used to compare the relative perceptual quality levels of video sequences with different distortion types more reliably than using plain PSNR....

  6. PhyloSim - Monte Carlo simulation of sequence evolution in the R statistical computing environment

    Directory of Open Access Journals (Sweden)

    Massingham Tim

    2011-04-01

    Full Text Available Abstract Background The Monte Carlo simulation of sequence evolution is routinely used to assess the performance of phylogenetic inference methods and sequence alignment algorithms. Progress in the field of molecular evolution fuels the need for more realistic and hence more complex simulations, adapted to particular situations, yet current software makes unreasonable assumptions such as homogeneous substitution dynamics or a uniform distribution of indels across the simulated sequences. This calls for an extensible simulation framework written in a high-level functional language, offering new functionality and making it easy to incorporate further complexity. Results PhyloSim is an extensible framework for the Monte Carlo simulation of sequence evolution, written in R, using the Gillespie algorithm to integrate the actions of many concurrent processes such as substitutions, insertions and deletions. Uniquely among sequence simulation tools, PhyloSim can simulate arbitrarily complex patterns of rate variation and multiple indel processes, and allows for the incorporation of selective constraints on indel events. User-defined complex patterns of mutation and selection can be easily integrated into simulations, allowing PhyloSim to be adapted to specific needs. Conclusions Close integration with R and the wide range of features implemented offer unmatched flexibility, making it possible to simulate sequence evolution under a wide range of realistic settings. We believe that PhyloSim will be useful to future studies involving simulated alignments.

  7. dBBQs: dataBase of Bacterial Quality scores

    OpenAIRE

    Wanchai, Visanu; Patumcharoenpol, Preecha; Nookaew, Intawat; Ussery, David

    2017-01-01

    Background: It is well-known that genome sequencing technologies are becoming significantly cheaper and faster. As a result of this, the exponential growth in sequencing data in public databases allows us to explore ever growing large collections of genome sequences. However, it is less known that the majority of available sequenced genome sequences in public databases are not complete, drafts of varying qualities. We have calculated quality scores for around 100,000 bacterial genomes from al...

  8. Progressive multiple sequence alignments from triplets

    Directory of Open Access Journals (Sweden)

    Stadler Peter F

    2007-07-01

    Full Text Available Abstract Background The quality of progressive sequence alignments strongly depends on the accuracy of the individual pairwise alignment steps since gaps that are introduced at one step cannot be removed at later aggregation steps. Adjacent insertions and deletions necessarily appear in arbitrary order in pairwise alignments and hence form an unavoidable source of errors. Research Here we present a modified variant of progressive sequence alignments that addresses both issues. Instead of pairwise alignments we use exact dynamic programming to align sequence or profile triples. This avoids a large fractions of the ambiguities arising in pairwise alignments. In the subsequent aggregation steps we follow the logic of the Neighbor-Net algorithm, which constructs a phylogenetic network by step-wisely replacing triples by pairs instead of combining pairs to singletons. To this end the three-way alignments are subdivided into two partial alignments, at which stage all-gap columns are naturally removed. This alleviates the "once a gap, always a gap" problem of progressive alignment procedures. Conclusion The three-way Neighbor-Net based alignment program aln3nn is shown to compare favorably on both protein sequences and nucleic acids sequences to other progressive alignment tools. In the latter case one easily can include scoring terms that consider secondary structure features. Overall, the quality of resulting alignments in general exceeds that of clustalw or other multiple alignments tools even though our software does not included heuristics for context dependent (mismatch scores.

  9. The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.

    Science.gov (United States)

    Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C

    2012-01-01

    The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases assembly results although on average are of high quality, need to be viewed critically and consider sources of errors in them prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).

  10. Using expected sequence features to improve basecalling accuracy of amplicon pyrosequencing data

    DEFF Research Database (Denmark)

    Rask, Thomas Salhøj; Petersen, Bent; Chen, Donald S.

    2016-01-01

    . The new basecalling method described here, named Multipass, implements a probabilistic framework for working with the raw flowgrams obtained by pyrosequencing. For each sequence variant Multipass calculates the likelihood and nucleotide sequence of several most likely sequences given the flowgram data....... This probabilistic approach enables integration of basecalling into a larger model where other parameters can be incorporated, such as the likelihood for observing a full-length open reading frame at the targeted region. We apply the method to 454 amplicon pyrosequencing data obtained from a malaria virulence gene...... family, where Multipass generates 20 % more error-free sequences than current state of the art methods, and provides sequence characteristics that allow generation of a set of high confidence error-free sequences. This novel method can be used to increase accuracy of existing and future amplicon...

  11. Innovative physico-chemical treatment of wastewater incorporating Moringa oleifera seed coagulant.

    Science.gov (United States)

    Bhuptawat, Hitendra; Folkard, G K; Chaudhari, Sanjeev

    2007-04-02

    Moringa oleifera is a pan tropical, multipurpose tree whose seeds contain a high quality edible oil (up to 40% by weight) and water soluble proteins that act as effective coagulants for water and wastewater treatment. The use of this natural coagulant material has not yet realised its potential. A water extract of M. oleifera seed was applied to a wastewater treatment sequence comprising coagulation-flocculation-sedimentation-sand filtration. The study was laboratory based using an actual wastewater. Overall COD removals of 50% were achieved at both 50 and 100mg/l M. oleifera doses. When 50 and 100mg/l seed doses were applied in combination with 10mg/l of alum, COD removal increased to 58 and 64%, respectively. The majority of COD removal occurred during the filtration process. In the tests incorporating alum, sludge generation and filter head loss increased by factors of 3 and 2, respectively. These encouraging treatment results indicate that this may be the first treatment application that can move to large scale adoption. The simple water extract may be obtained at minimal cost from the presscake residue remaining after oil extraction from the seed. The regulatory compliance issues of adopting 'new materials' for wastewater treatment are significantly less stringent than those applying to the production of potable water.

  12. 77 FR 12482 - Approval and Promulgation of Air Quality Implementation Plans; Indiana; Lead Ambient Air Quality...

    Science.gov (United States)

    2012-03-01

    ... Promulgation of Air Quality Implementation Plans; Indiana; Lead Ambient Air Quality Standards AGENCY... incorporates the National Ambient Air Quality Standards (NAAQS) for Pb promulgated by EPA in 2008. DATES: This... FR 66964) and codified at 40 CFR 50.16, ``National primary and secondary ambient air quality...

  13. High-quality draft genome sequence of Ensifer meliloti Mlalz-1, a microsymbiont of Medicago laciniata (L.) miller collected in Lanzarote, Canary Islands, Spain.

    Science.gov (United States)

    Osman, Wan Adnawani Meor; van Berkum, Peter; León-Barrios, Milagros; Velázquez, Encarna; Elia, Patrick; Tian, Rui; Ardley, Julie; Gollagher, Margaret; Seshadri, Rekha; Reddy, T B K; Ivanova, Natalia; Woyke, Tanja; Pati, Amrita; Markowitz, Victor; Baeshen, Mohamed N; Baeshen, Naseebh Nabeeh; Kyrpides, Nikos; Reeve, Wayne

    2017-01-01

    10.1601/nm.1335 Mlalz-1 (INSDC = ATZD00000000) is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen-fixing nodule of Medicago laciniata (L.) Miller from a soil sample collected near the town of Guatiza on the island of Lanzarote, the Canary Islands, Spain. This strain nodulates and forms an effective symbiosis with the highly specific host M. laciniata . This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) sequencing project. Here the features of 10.1601/nm.1335 Mlalz-1 are described, together with high-quality permanent draft genome sequence information and annotation. The 6,664,116 bp high-quality draft genome is arranged in 99 scaffolds of 100 contigs, containing 6314 protein-coding genes and 74 RNA-only encoding genes. Strain Mlalz-1 is closely related to 10.1601/nm.1335 10.1601/strainfinder?urlappend=%3Fid%3DIAM+12611 T , 10.1601/nm.1334 A 321 T and 10.1601/nm.17831 10.1601/strainfinder?urlappend=%3Fid%3DORS+1407 T , based on 16S rRNA gene sequences. gANI values of ≥98.1% support the classification of strain Mlalz-1 as 10.1601/nm.1335. Nodulation of M. laciniata requires a specific nodC allele, and the nodC gene of strain Mlalz-1 shares ≥98% sequence identity with nodC of M. laciniata -nodulating 10.1601/nm.1328 strains, but ≤93% with nodC of 10.1601/nm.1328 strains that nodulate other Medicago species. Strain Mlalz-1 is unique among sequenced 10.1601/nm.1335 strains in possessing genes encoding components of a T2SS and in having two versions of the adaptive acid tolerance response lpiA-acvB operon. In 10.1601/nm.1334 strain 10.1601/strainfinder?urlappend=%3Fid%3DWSM+419, lpiA is essential for enhancing survival in lethal acid conditions. The second copy of the lpiA-acvB operon of strain Mlalz-1 has highest sequence identity (> 96%) with that of 10.1601/nm.1334 strains, which suggests genetic

  14. dBBQs: dataBase of Bacterial Quality scores.

    Science.gov (United States)

    Wanchai, Visanu; Patumcharoenpol, Preecha; Nookaew, Intawat; Ussery, David

    2017-12-28

    It is well-known that genome sequencing technologies are becoming significantly cheaper and faster. As a result of this, the exponential growth in sequencing data in public databases allows us to explore ever growing large collections of genome sequences. However, it is less known that the majority of available sequenced genome sequences in public databases are not complete, drafts of varying qualities. We have calculated quality scores for around 100,000 bacterial genomes from all major genome repositories and put them in a fast and easy-to-use database. Prokaryotic genomic data from all sources were collected and combined to make a non-redundant set of bacterial genomes. The genome quality score for each was calculated by four different measurements: assembly quality, number of rRNA and tRNA genes, and the occurrence of conserved functional domains. The dataBase of Bacterial Quality scores (dBBQs) was designed to store and retrieve quality scores. It offers fast searching and download features which the result can be used for further analysis. In addition, the search results are shown in interactive JavaScript chart framework using DC.js. The analysis of quality scores across major public genome databases find that around 68% of the genomes are of acceptable quality for many uses. dBBQs (available at http://arc-gem.uams.edu/dbbqs ) provides genome quality scores for all available prokaryotic genome sequences with a user-friendly Web-interface. These scores can be used as cut-offs to get a high-quality set of genomes for testing bioinformatics tools or improving the analysis. Moreover, all data of the four measurements that were combined to make the quality score for each genome, which can potentially be used for further analysis. dBBQs will be updated regularly and is freely use for non-commercial purpose.

  15. Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes

    DEFF Research Database (Denmark)

    Albertsen, Mads; Hugenholtz, Philip; Skarshewski, Adam

    2013-01-01

    Reference genomes are required to understand the diverse roles of microorganisms in ecology, evolution, human and animal health, but most species remain uncultured. Here we present a sequence composition–independent approach to recover high-quality microbial genomes from deeply sequenced metageno......Reference genomes are required to understand the diverse roles of microorganisms in ecology, evolution, human and animal health, but most species remain uncultured. Here we present a sequence composition–independent approach to recover high-quality microbial genomes from deeply sequenced...

  16. Leachability of fired clay brick incorporating with sewage sludge waste

    Science.gov (United States)

    Kadir, Aeslina Abdul; Salim, Nurul Salhana Abdul; Sarani, Noor Amira; Rahmat, Nur Aqma Izurin; Abdullah, Mohd Mustafa Al Bakri

    2017-09-01

    Sewage sludge is sewerage from wastewater treatment plants that generates millions tons of sludge ever year. Regarding this activity, it causes lack management of waste which is harmful to the surrounding conditions. Therefore, this study is focuses on the incorporation of sewage sludge waste into fired clay brick to provide an option of disposal method, producing adequate quality of brick as well as limiting the heavy metal leachability to the environment. Sewage sludge brick (SSB) mixtures were incorporated with 0%, 1%, 5%, 10%, 20% and 30% of sewage sludge waste (SSW). Heavy metals of crushed SSB were determined by using Toxicity Characteristic Leaching Procedure (TCLP) according to Method 1311 of United State Environment Protection Agency (USEPA) standard. From the results obtained, up to 20% of SSW could be incorporated into fired clay brick and comply with the USEPA standard. Therefore, this study revealed that by incorporating SSW into fired clay brick it could be an alternative method to dispose the SSW and also could act as a replacement material for brick manufacturing with appropriate mix and design.

  17. Higher quality of molecular testing, an unfulfilled priority: Results from external quality assessment for KRAS mutation testing in colorectal cancer.

    Science.gov (United States)

    Tembuyser, Lien; Ligtenberg, Marjolijn J L; Normanno, Nicola; Delen, Sofie; van Krieken, J Han; Dequeker, Elisabeth M C

    2014-05-01

    Precision medicine is now a key element in clinical oncology. RAS mutational status is a crucial predictor of responsiveness to anti-epidermal growth factor receptor agents in metastatic colorectal cancer. In an effort to guarantee high-quality testing services in molecular pathology, the European Society of Pathology has been organizing an annual KRAS external quality assessment program since 2009. In 2012, 10 formalin-fixed, paraffin-embedded samples, of which 8 from invasive metastatic colorectal cancer tissue and 2 artificial samples of cell line material, were sent to more than 100 laboratories from 26 countries with a request for routine KRAS testing. Both genotyping and clinical reports were assessed independently. Twenty-seven percent of the participants genotyped at least 1 of 10 samples incorrectly. In total, less than 5% of the distributed specimens were genotyped incorrectly. Genotyping errors consisted of false negatives, false positives, and incorrectly genotyped mutations. Twenty percent of the laboratories reported a technical error for one or more samples. A review of the written reports showed that several essential elements were missing, most notably a clinical interpretation of the test result, the method sensitivity, and the use of a reference sequence. External quality assessment serves as a valuable educational tool in assessing and improving molecular testing quality and is an important asset for monitoring quality assurance upon incorporation of new biomarkers in diagnostic services. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  18. Does integration of HIV and sexual and reproductive health services improve technical efficiency in Kenya and Swaziland? An application of a two-stage semi parametric approach incorporating quality measures.

    Science.gov (United States)

    Obure, Carol Dayo; Jacobs, Rowena; Guinness, Lorna; Mayhew, Susannah; Vassall, Anna

    2016-02-01

    Theoretically, integration of vertically organized services is seen as an important approach to improving the efficiency of health service delivery. However, there is a dearth of evidence on the effect of integration on the technical efficiency of health service delivery. Furthermore, where technical efficiency has been assessed, there have been few attempts to incorporate quality measures within efficiency measurement models particularly in sub-Saharan African settings. This paper investigates the technical efficiency and the determinants of technical efficiency of integrated HIV and sexual and reproductive health (SRH) services using data collected from 40 health facilities in Kenya and Swaziland for 2008/2009 and 2010/2011. Incorporating a measure of quality, we estimate the technical efficiency of health facilities and explore the effect of integration and other environmental factors on technical efficiency using a two-stage semi-parametric double bootstrap approach. The empirical results reveal a high degree of inefficiency in the health facilities studied. The mean bias corrected technical efficiency scores taking quality into consideration varied between 22% and 65% depending on the data envelopment analysis (DEA) model specification. The number of additional HIV services in the maternal and child health unit, public ownership and facility type, have a positive and significant effect on technical efficiency. However, number of additional HIV and STI services provided in the same clinical room, proportion of clinical staff to overall staff, proportion of HIV services provided, and rural location had a negative and significant effect on technical efficiency. The low estimates of technical efficiency and mixed effects of the measures of integration on efficiency challenge the notion that integration of HIV and SRH services may substantially improve the technical efficiency of health facilities. The analysis of quality and efficiency as separate dimensions of

  19. Quality scores for 32,000 genomes

    DEFF Research Database (Denmark)

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran

    2014-01-01

    Background More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major...... public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Results Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes....... Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read...

  20. A Snapshot of the Emerging Tomato Genome Sequence

    Directory of Open Access Journals (Sweden)

    Lukas A. Mueller

    2009-03-01

    Full Text Available The genome of tomato ( L. is being sequenced by an international consortium of 10 countries (Korea, China, the United Kingdom, India, the Netherlands, France, Japan, Spain, Italy, and the United States as part of the larger “International Solanaceae Genome Project (SOL: Systems Approach to Diversity and Adaptation” initiative. The tomato genome sequencing project uses an ordered bacterial artificial chromosome (BAC approach to generate a high-quality tomato euchromatic genome sequence for use as a reference genome for the Solanaceae and euasterids. Sequence is deposited at GenBank and at the SOL Genomics Network (SGN. Currently, there are around 1000 BACs finished or in progress, representing more than a third of the projected euchromatic portion of the genome. An annotation effort is also underway by the International Tomato Annotation Group. The expected number of genes in the euchromatin is ∼40,000, based on an estimate from a preliminary annotation of 11% of finished sequence. Here, we present this first snapshot of the emerging tomato genome and its annotation, a short comparison with potato ( L. sequence data, and the tools available for the researchers to exploit this new resource are also presented. In the future, whole-genome shotgun techniques will be combined with the BAC-by-BAC approach to cover the entire tomato genome. The high-quality reference euchromatic tomato sequence is expected to be near completion by 2010.

  1. Droplet Digital™ PCR Next-Generation Sequencing Library QC Assay.

    Science.gov (United States)

    Heredia, Nicholas J

    2018-01-01

    Digital PCR is a valuable tool to quantify next-generation sequencing (NGS) libraries precisely and accurately. Accurately quantifying NGS libraries enable accurate loading of the libraries on to the sequencer and thus improve sequencing performance by reducing under and overloading error. Accurate quantification also benefits users by enabling uniform loading of indexed/barcoded libraries which in turn greatly improves sequencing uniformity of the indexed/barcoded samples. The advantages gained by employing the Droplet Digital PCR (ddPCR™) library QC assay includes the precise and accurate quantification in addition to size quality assessment, enabling users to QC their sequencing libraries with confidence.

  2. GapMis: a tool for pairwise sequence alignment with a single gap.

    Science.gov (United States)

    Flouri, Tomás; Frousios, Kimon; Iliopoulos, Costas S; Park, Kunsoo; Pissis, Solon P; Tischler, German

    2013-08-01

    Pairwise sequence alignment has received a new motivation due to the advent of recent patents in next-generation sequencing technologies, particularly so for the application of re-sequencing---the assembly of a genome directed by a reference sequence. After the fast alignment between a factor of the reference sequence and a high-quality fragment of a short read by a short-read alignment programme, an important problem is to find the alignment between a relatively short succeeding factor of the reference sequence and the remaining low-quality part of the read allowing a number of mismatches and the insertion of a single gap in the alignment. We present GapMis, a tool for pairwise sequence alignment with a single gap. It is based on a simple algorithm, which computes a different version of the traditional dynamic programming matrix. The presented experimental results demonstrate that GapMis is more suitable and efficient than most popular tools for this task.

  3. Variation of Se, Zn, Co, Fe and Rb distribution in rats upon sequence of injection with SeO2 and glutathione

    International Nuclear Information System (INIS)

    Czauderna, M.; Samochocka, K.; Kwiathkowska, J.

    1984-01-01

    The contents of Se, Zn, Co, Fe and Rb in several organs of Wistar rats were determined by instrumental neutron activation analysis (INAA) after injections of SeO 2 and glutathione (GSH). Se was incorporated in all the examined organs, and the efficiency of incorporation does not depend upon the sequence of injection with SeO 2 and GSH. The sequence of these injections affects the contents of the other elements in all the examined organs. (author)

  4. Topological disposition of the sequences -QRKIVE- and -KETYY in native (Na sup + + K sup + )-ATPase

    Energy Technology Data Exchange (ETDEWEB)

    Bayer, R. (Univ. of California, San Diego, La Jolla (USA))

    1990-03-06

    The dispositions with respect to the plane of the membrane of lysine-905 in the internal sequence -EQRKIVE- and of lysine-1012 in the carboxy-terminal sequence -RRPGGWVEKETYY of the {alpha}-polypeptide of sodium and potassium ion activated adenosinetriphosphatase have been determined. These lysines are found in peptides released from the intact {alpha}-polypeptide by the extracellular protease from Staphylococcus aureus strain V8 and by trypsin, respectively. Synthetic peptides containing terminal sequences of these were used to prepare polyclonal antibodies, which were then used to prepare immunoadsorbents directed against the respective peptides. Sealed, right-side-out membrane vesicles containing native (Na{sup +} + K{sup +})-ATPase were labeled with pyridoxal phosphate and sodium ({sup 3}H)borohydride in the absence or presence of saponin. The labeled {alpha}-polypeptide was isolated from these vesicles and digested with appropriate proteases. The incorporation of radioactivity into the peptides binding to the immunoadsorbent directed against the sequence pyrERXIVE increased 3-fold int the presence of saponin as a result of the increased accessibility of this portion of the protein to the reagent when the vesicles were breached by saponin; hence, this sequence is located on the cytoplasmic face of the membrane. It was inferred that the carboxy-terminal sequence -KETYY is on the extracytoplasmic face since the incorporation of radioactivity into peptides binding to the immunoadsorbent directed against the sequence -ETYY did not change when the vesicles were breached with saponin.

  5. Incorporating elements of social franchising in government health services improves the quality of infant and young child feeding counselling services at commune health centres in Vietnam.

    Science.gov (United States)

    Nguyen, Phuong H; Kim, Sunny S; Keithly, Sarah C; Hajeebhoy, Nemat; Tran, Lan M; Ruel, Marie T; Rawat, Rahul; Menon, Purnima

    2014-12-01

    Although social franchising has been shown to enhance the quality of reproductive health services in developing countries, its effect on nutrition services remains unexamined. This study assessed the effects of incorporating elements of social franchising on shaping the quality of infant and young child feeding (IYCF) counselling facilities and services in Vietnam. Process-related data collected 12 months after the launch of the first franchises were used to compare randomly assigned Alive & Thrive-supported health facilities (AT-F, n = 20) with standard facilities (SF, n = 12) across three dimensions of service quality: 'structure', 'process' and 'outcome' that capture the quality of facilities, service delivery, and client perceptions and use, respectively. Data collection included facility assessments (n = 32), staff surveys (n = 96), counselling observations (n = 137), client exit interviews (n = 137) and in-depth interviews with mothers (n = 48). Structure: AT-F were more likely to have an unshared, well-equipped room for nutrition counselling than SF (65.0% vs 10.0%). Compared with SF providers, AT-F staff had better IYCF knowledge (mean score 9.9 vs 8.8, range 0-11 for breastfeeding; mean score 3.6 vs 3.2, range 0-4 for complementary feeding). AT-F providers also demonstrated significantly better interpersonal communication skills (score 9.6 vs 5.1, range 0-13) and offered more comprehensive counselling sessions. Overall utilization of franchises was low (10%). A higher proportion of pregnant women utilized franchise services (48.9%), compared with mothers with children 6-23.9 months (1.4%). There was no quantitative difference in client satisfaction with counselling services between AT-F and SF, but franchise users praised the AT-F for problem solving related to child feeding. Incorporating elements of social franchising significantly enhances the quality of IYCF counselling services within government primary healthcare facilities, particularly their

  6. ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS

    Directory of Open Access Journals (Sweden)

    Alves-Ferreira Marcelo

    2008-09-01

    Full Text Available Abstract Background Genome survey sequences (GSS offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454-reads of an Escheria coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties

  7. Finding and Improving the Key-Frames of Long Video Sequences for Face Recognition

    DEFF Research Database (Denmark)

    Nasrollahi, Kamal; Moeslund, Thomas B.

    2010-01-01

    Face recognition systems are very sensitive to the quality and resolution of their input face images. This makes such systems unreliable when working with long surveillance video sequences without employing some selection and enhancement algorithms. On the other hand, processing all the frames...... of such video sequences by any enhancement or even face recognition algorithm is demanding. Thus, there is a need for a mechanism to summarize the input video sequence to a set of key-frames and then applying an enhancement algorithm to this subset. This paper presents a system doing exactly this. The system...... uses face quality assessment to select the key-frames and a hybrid super-resolution to enhance the face image quality. The suggested system that employs a linear associator face recognizer to evaluate the enhanced results has been tested on real surveillance video sequences and the experimental results...

  8. Subjective and objective image qualities: a comparison of sagittal T2 weighted spin-echo and turbo-spin-eco sequences in magnetic resonance imaging of the spine by use of a subjective ranking system

    Energy Technology Data Exchange (ETDEWEB)

    Goerres, G. [Institut fuer diagnostische Radiologie, Departement Radiologie, Universitaetskliniken, Kantonsspital Basel (Switzerland); Mader, I. [Radiologische Gemeinschaftspraxis Dres. Siems, Grossmann, Bayreuth (Germany); Proske, M. [Klinikum Rosenheim (Germany). Inst. fuer Diagnostische Radiologie

    1998-12-31

    We evaluated the subjective image impression of two different magnetic resonance (MR) sequences by using a subjective ranking system. This ranking system was based on 20 criteria describing several tissue characteristics such as the signal intensity of normal anatomical structures and the changes of signal intensities and shape of lesions as well as artefacts. MR of the vertebral spine was performed in 48 female and 52 male patients (mean age 44.8 years) referred consecutively for investigation of a back problem. Ninety-six pathologies were found in 82 patients. Sagittal and axial T1 weighted spin-echo before and after administration of Gadolinium (Gd-DOTA), and sagittal T2 weighted spin-echo (T2wSE) and Turbo-spin-echo (TSE) sequences were performed by means of surface coils. Using the subjective ranking system the sagittal T2wSE and sagittal TSE were compared. Both sequences were suitable for identification of normal anatomy and pathologic changes and there was no trend for increased detection of disease by one imaging sequence over the other. We found that sagittal TSE sequences can replace sagittal T2wSE sequences in spinal MR and that artefacts at the cervical and lumbar spine are less frequent using TSE, thus confirming previous studies. In this study, our ranking system reveiled, that there are differences between the subjective judgement of image qualities and objective measurement of SNR. However, this approach may not be helpful to compare two different MR sequences as it is limited to the anatomical area investigated and is time consuming. The subjective image impression, i.e. the quality of images, may not always be represented by physical parameters such as a signal-to-noise ratio (SNR), radiologists should try to define influences of image quality also by subjective parameters. (orig.)

  9. 77 FR 12524 - Approval and Promulgation of Air Quality Implementation Plans; Indiana; Lead Ambient Air Quality...

    Science.gov (United States)

    2012-03-01

    ... Promulgation of Air Quality Implementation Plans; Indiana; Lead Ambient Air Quality Standards AGENCY... Indiana State Implementation Plan (SIP) for lead (Pb) under the Clean Air Act (CAA). This submittal incorporates the National Ambient Air Quality Standards (NAAQS) for Pb promulgated by EPA in 2008. DATES...

  10. SMITH: a LIMS for handling next-generation sequencing workflows

    OpenAIRE

    Venco, Francesco; Vaskin, Yuriy; Ceol, Arnaud; Muller, Heiko

    2014-01-01

    Background Life-science laboratories make increasing use of Next Generation Sequencing (NGS) for studying bio-macromolecules and their interactions. Array-based methods for measuring gene expression or protein-DNA interactions are being replaced by RNA-Seq and ChIP-Seq. Sequencing is generally performed by specialized facilities that have to keep track of sequencing requests, trace samples, ensure quality and make data available according to predefined privileges. An integrated tool helps to ...

  11. NeSSM: a Next-generation Sequencing Simulator for Metagenomics.

    Directory of Open Access Journals (Sweden)

    Ben Jia

    Full Text Available BACKGROUND: Metagenomics can reveal the vast majority of microbes that have been missed by traditional cultivation-based methods. Due to its extremely wide range of application areas, fast metagenome sequencing simulation systems with high fidelity are in great demand to facilitate the development and comparison of metagenomics analysis tools. RESULTS: We present here a customizable metagenome simulation system: NeSSM (Next-generation Sequencing Simulator for Metagenomics. Combining complete genomes currently available, a community composition table, and sequencing parameters, it can simulate metagenome sequencing better than existing systems. Sequencing error models based on the explicit distribution of errors at each base and sequencing coverage bias are incorporated in the simulation. In order to improve the fidelity of simulation, tools are provided by NeSSM to estimate the sequencing error models, sequencing coverage bias and the community composition directly from existing metagenome sequencing data. Currently, NeSSM supports single-end and pair-end sequencing for both 454 and Illumina platforms. In addition, a GPU (graphics processing units version of NeSSM is also developed to accelerate the simulation. By comparing the simulated sequencing data from NeSSM with experimental metagenome sequencing data, we have demonstrated that NeSSM performs better in many aspects than existing popular metagenome simulators, such as MetaSim, GemSIM and Grinder. The GPU version of NeSSM is more than one-order of magnitude faster than MetaSim. CONCLUSIONS: NeSSM is a fast simulation system for high-throughput metagenome sequencing. It can be helpful to develop tools and evaluate strategies for metagenomics analysis and it's freely available for academic users at http://cbb.sjtu.edu.cn/~ccwei/pub/software/NeSSM.php.

  12. Non-Functional Requirements Elicitation and Incorporation into Class Diagrams

    OpenAIRE

    Song , Xiaoyu; Duan , Zhenhua; Tian , Cong

    2010-01-01

    International audience; Top-quality software architecture should consider both functional and non-functional aspects of systems and their association. In the the existing literature, considerable efforts have been directed at functional requirement analysis and design, regardless of the non-functional aspects. This disassociation makes architecture comprehension and evolution hard. This paper proposes a strategy on how to elicit non-functional requirements and incorporate them into the design...

  13. Metaheuristic approaches to order sequencing on a unidirectional ...

    African Journals Online (AJOL)

    is that tabu search (TS) incorporates “intelligence. ..... location that will benefit the solution quality by considering implications that may arise ..... [12] Glover F, 1986, Future paths for integer programming and links to artificial intelligence, ...

  14. High quality draft genome sequence of Staphylococcus cohnii subsp. cohnii strain hu-01.

    Science.gov (United States)

    Hu, XinJun; Li, Ang; Lv, LongXian; Yuan, Chunhui; Guo, Lihua; Jiang, Xiawei; Jiang, Haiyin; Qian, GuiRong; Zheng, BeiWen; Guo, Jing; Li, LanJuan

    2014-06-15

    Staphylococcus cohnii subsp. cohnii belongs to the family Staphylococcaceae in the order Bacillales, class Bacilli and phylum Firmicutes. The increasing relevance of S. cohnii to human health prompted us to determine the genomic sequence of Staphylococcus cohnii subsp. cohnii strain hu-01, a multidrug-resistant isolate from a hospital in China. Here we describe the features of S. cohnii subsp. cohnii strain hu-01, together with the genome sequence and its annotation. This is the first genome sequence of the species Staphylococcus cohnii.

  15. The Application of Active Paper Incorporated with Oleoresin of Cinnamon Leaf (Cinnamomum burmanii) Distillation Residues on Maintaining Dragon Fruits (Hylocereus costaricensis) Quality during Storage

    Science.gov (United States)

    Aziz, M. S. H.; Manuhara, G. J.; Utami, R.; Khasanah, L. U.

    2018-03-01

    The purpose of this study was to determine the effect of active paper placement methods on super red dragon fruits quality during storage at ambient temperature. The active papers were incorporated with oleoresin of cinnamon leaf distillation residues. Various active paper placement methods were applied such as wrapping, placed on the cardboard wall, placed cardboard pad, and scrap of paper on the sidelines. Weight loss, peel color, surface and flesh hardness, total titratable acid, soluble solid total, pH flesh fruit, and total plate count (TPC) of super red dragon fruits samples were investigated during 9 days storage. The result shows that active paper placement methods significantly affected the weight loss, surface firmness and color peel change of super red dragon fruits samples. However, active paper placement methods insignificantly affected the titrable acid total, soluble solid total, pH, flesh firmness and microbial spoilage of super red dragon fruits samples. The best method to maintain the super red dragon fruits quality was wrapping method.

  16. Complete Genome Sequence of Bacillus subtilis Strain DKU_NT_03, Isolated from a Traditional Korean Food Using Soybean (Chung-gook-jang) for High-Quality Nattokinase Activity.

    Science.gov (United States)

    Jeong, Hee-Won; Bang, Man-Seok; Lee, Yea-Jin; Lee, Su Ji; Lee, Sang-Cheol; Shin, Jang-In; Oh, Chung-Hun

    2018-06-21

    We present here the complete genome sequence of Bacillus subtilis strain DKU_NT_03 isolated from the traditional Korean food chung-gook-jang, which is made from soybeans. This strain was chosen to identify genetic factors with high-quality nattokinase activity. Copyright © 2018 Jeong et al.

  17. Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.

    Science.gov (United States)

    Harvey, Michael G; Smith, Brian Tilston; Glenn, Travis C; Faircloth, Brant C; Brumfield, Robb T

    2016-09-01

    Sequence capture and restriction site associated DNA sequencing (RAD-Seq) are two genomic enrichment strategies for applying next-generation sequencing technologies to systematics studies. At shallow timescales, such as within species, RAD-Seq has been widely adopted among researchers, although there has been little discussion of the potential limitations and benefits of RAD-Seq and sequence capture. We discuss a series of issues that may impact the utility of sequence capture and RAD-Seq data for shallow systematics in non-model species. We review prior studies that used both methods, and investigate differences between the methods by re-analyzing existing RAD-Seq and sequence capture data sets from a Neotropical bird (Xenops minutus). We suggest that the strengths of RAD-Seq data sets for shallow systematics are the wide dispersion of markers across the genome, the relative ease and cost of laboratory work, the deep coverage and read overlap at recovered loci, and the high overall information that results. Sequence capture's benefits include flexibility and repeatability in the genomic regions targeted, success using low-quality samples, more straightforward read orthology assessment, and higher per-locus information content. The utility of a method in systematics, however, rests not only on its performance within a study, but on the comparability of data sets and inferences with those of prior work. In RAD-Seq data sets, comparability is compromised by low overlap of orthologous markers across species and the sensitivity of genetic diversity in a data set to an interaction between the level of natural heterozygosity in the samples examined and the parameters used for orthology assessment. In contrast, sequence capture of conserved genomic regions permits interrogation of the same loci across divergent species, which is preferable for maintaining comparability among data sets and studies for the purpose of drawing general conclusions about the impact of

  18. Incorporation of quality updates for JPSS CGS Products

    Science.gov (United States)

    Cochran, S.; Grant, K. D.; Ibrahim, W.; Brueske, K. F.; Smit, P.

    2016-12-01

    NOAA's next-generation environmental satellite, the Joint Polar Satellite System (JPSS) replaces the current Polar-orbiting Operational Environmental Satellites (POES). JPSS satellites carry sensors which collect meteorological, oceanographic, climatological, and solar-geophysical observations of the earth, atmosphere, and space. The first JPSS satellite was launched in 2011 and is currently NOAA's primary operational polar satellite. The JPSS ground system is the Common Ground System (CGS), and provides command, control, and communications (C3) and data processing (DP). A multi-mission system, CGS provides combinations of C3/DP for numerous NASA, NOAA, DoD, and international missions. In preparation for the next JPSS satellite, CGS improved its multi-mission capabilities to enhance mission operations for larger constellations of earth observing satellites with the added benefit of streamlining mission operations for other NOAA missions. This paper will discuss both the theoretical basis and the actual practices used to date to identify, test and incorporate algorithm updates into the CGS processing baseline. To provide a basis for this support, Raytheon developed a theoretical analysis framework, and the application of derived engineering processes, for the maintenance of consistency and integrity of remote sensing operational algorithm outputs. The framework is an abstraction of the operationalization of the science-grade algorithm (Sci2Ops) process used throughout the JPSS program. By combining software and systems engineering controls, manufacturing disciplines to detect and reduce defects, and a standard process to control analysis, an environment to maintain operational algorithm maturity is achieved. Results of the use of this approach to implement algorithm changes into operations will also be detailed.

  19. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

    Directory of Open Access Journals (Sweden)

    Stephan Pabinger

    Full Text Available Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM. Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage

  20. Next-generation phylogeography: a targeted approach for multilocus sequencing of non-model organisms.

    Directory of Open Access Journals (Sweden)

    Jonathan B Puritz

    Full Text Available The field of phylogeography has long since realized the need and utility of incorporating nuclear DNA (nDNA sequences into analyses. However, the use of nDNA sequence data, at the population level, has been hindered by technical laboratory difficulty, sequencing costs, and problematic analytical methods dealing with genotypic sequence data, especially in non-model organisms. Here, we present a method utilizing the 454 GS-FLX Titanium pyrosequencing platform with the capacity to simultaneously sequence two species of sea star (Meridiastra calcar and Parvulastra exigua at five different nDNA loci across 16 different populations of 20 individuals each per species. We compare results from 3 populations with traditional Sanger sequencing based methods, and demonstrate that this next-generation sequencing platform is more time and cost effective and more sensitive to rare variants than Sanger based sequencing. A crucial advantage is that the high coverage of clonally amplified sequences simplifies haplotype determination, even in highly polymorphic species. This targeted next-generation approach can greatly increase the use of nDNA sequence loci in phylogeographic and population genetic studies by mitigating many of the time, cost, and analytical issues associated with highly polymorphic, diploid sequence markers.

  1. Scalable synthesis of sequence-defined, unimolecular macromolecules by Flow-IEG

    Science.gov (United States)

    Leibfarth, Frank A.; Johnson, Jeremiah A.; Jamison, Timothy F.

    2015-01-01

    We report a semiautomated synthesis of sequence and architecturally defined, unimolecular macromolecules through a marriage of multistep flow synthesis and iterative exponential growth (Flow-IEG). The Flow-IEG system performs three reactions and an in-line purification in a total residence time of under 10 min, effectively doubling the molecular weight of an oligomeric species in an uninterrupted reaction sequence. Further iterations using the Flow-IEG system enable an exponential increase in molecular weight. Incorporating a variety of monomer structures and branching units provides control over polymer sequence and architecture. The synthesis of a uniform macromolecule with a molecular weight of 4,023 g/mol is demonstrated. The user-friendly nature, scalability, and modularity of Flow-IEG provide a general strategy for the automated synthesis of sequence-defined, unimolecular macromolecules. Flow-IEG is thus an enabling tool for theory validation, structure–property studies, and advanced applications in biotechnology and materials science. PMID:26269573

  2. Toward a Better Compression for DNA Sequences Using Huffman Encoding.

    Science.gov (United States)

    Al-Okaily, Anas; Almarri, Badar; Al Yami, Sultan; Huang, Chun-Hsi

    2017-04-01

    Due to the significant amount of DNA data that are being generated by next-generation sequencing machines for genomes of lengths ranging from megabases to gigabases, there is an increasing need to compress such data to a less space and a faster transmission. Different implementations of Huffman encoding incorporating the characteristics of DNA sequences prove to better compress DNA data. These implementations center on the concepts of selecting frequent repeats so as to force a skewed Huffman tree, as well as the construction of multiple Huffman trees when encoding. The implementations demonstrate improvements on the compression ratios for five genomes with lengths ranging from 5 to 50 Mbp, compared with the standard Huffman tree algorithm. The research hence suggests an improvement on all such DNA sequence compression algorithms that use the conventional Huffman encoding. The research suggests an improvement on all DNA sequence compression algorithms that use the conventional Huffman encoding. Accompanying software is publicly available (AL-Okaily, 2016 ).

  3. Improving the quality of cardiopulmonary resuscitation by training dedicated cardiac arrest teams incorporating a mechanical load-distributing device at the emergency department.

    Science.gov (United States)

    Ong, Marcus Eng Hock; Quah, Joy Li Juan; Annathurai, Annitha; Noor, Noorkiah Mohamed; Koh, Zhi Xiong; Tan, Kenneth Boon Kiat; Pothiawala, Sohil; Poh, Ah Ho; Loy, Chye Khiaw; Fook-Chong, Stephanie

    2013-04-01

    Determine if implementing cardiac arrest teams trained with a 'pit-crew' protocol incorporating a load-distributing band mechanical CPR device (Autopulse™ ZOLL) improves the quality of CPR, as determined by no-flow ratio (NFR) in the first 10min of resuscitation. A phased, prospective, non-randomized, before-after cohort evaluation. Data collection was from April 2008 to February 2011. There were 100 before and 148 after cases. Continuous video and chest compression data of all study subjects were analyzed. All non-traumatic, collapsed patients aged 18 years and above presenting to the emergency department were eligible. Primary outcome was NFR. Secondary outcomes were return of spontaneous circulation (ROSC), survival to hospital admission and neurological outcome at discharge. After implementation, mean total NFR for the first 5min decreased from 0.42 to 0.27 (decrease=0.15, 95% CI 0.10-0.19, pCPR ratio increased from 46.4% to 88.4% (increase=41.9%, 95% CI 36.9-46.9, pTraining cardiac arrest teams in a 'pit-crew' protocol may improve the quality of CPR at the ED. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  4. Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads

    Directory of Open Access Journals (Sweden)

    Chengxi Ye

    2016-06-01

    Full Text Available Motivation. The third generation sequencing (3GS technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15–40%, significantly higher than those of the prevalent next generation sequencing (NGS technologies (less than 1%. Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality sequences that need to be extracted from these long but erroneous 3GS sequences. Results. We describe a versatile and efficient linear complexity consensus algorithm Sparc to facilitate de novo genome assembly. Sparc builds a sparse k-mer graph using a collection of sequences from a targeted genomic region. The heaviest path which approximates the most likely genome sequence is searched through a sparsity-induced reweighted graph as the consensus sequence. Sparc supports using NGS and 3GS data together, which leads to significant improvements in both cost efficiency and computational efficiency. Experiments with Sparc show that our algorithm can efficiently provide high-quality consensus sequences using both PacBio and Oxford Nanopore sequencing technologies. With only 30× PacBio data, Sparc can reach a consensus with error rate <0.5%. With the more challenging Oxford Nanopore data, Sparc can also achieve similar error rate when combined with NGS data. Compared with the existing approaches, Sparc calculates the consensus with higher accuracy, and uses approximately 80% less memory and time. Availability. The source code is available for download at https://github.com/yechengxi/Sparc.

  5. Massively parallel whole genome amplification for single-cell sequencing using droplet microfluidics.

    Science.gov (United States)

    Hosokawa, Masahito; Nishikawa, Yohei; Kogawa, Masato; Takeyama, Haruko

    2017-07-12

    Massively parallel single-cell genome sequencing is required to further understand genetic diversities in complex biological systems. Whole genome amplification (WGA) is the first step for single-cell sequencing, but its throughput and accuracy are insufficient in conventional reaction platforms. Here, we introduce single droplet multiple displacement amplification (sd-MDA), a method that enables massively parallel amplification of single cell genomes while maintaining sequence accuracy and specificity. Tens of thousands of single cells are compartmentalized in millions of picoliter droplets and then subjected to lysis and WGA by passive droplet fusion in microfluidic channels. Because single cells are isolated in compartments, their genomes are amplified to saturation without contamination. This enables the high-throughput acquisition of contamination-free and cell specific sequence reads from single cells (21,000 single-cells/h), resulting in enhancement of the sequence data quality compared to conventional methods. This method allowed WGA of both single bacterial cells and human cancer cells. The obtained sequencing coverage rivals those of conventional techniques with superior sequence quality. In addition, we also demonstrate de novo assembly of uncultured soil bacteria and obtain draft genomes from single cell sequencing. This sd-MDA is promising for flexible and scalable use in single-cell sequencing.

  6. Quality assurance in the fuel fabrication

    International Nuclear Information System (INIS)

    Darmayan, P.

    1995-01-01

    The paper concentrates on the evolutions that FBFC Franco Belge de Fabrication de Combustible has initiated in order to make a further step in improving quality: 1. Improving each personnel's involment and responsability towards quality. 2. Incorporating quality assurance in a total quality management policy, involving both the fabrication teams of FBFC and the design teams of Framatome in order to improve quality. (orig./HP)

  7. Comparative performance of the BGISEQ-500 versus Illumina HiSeq2500 sequencing platforms for palaeogenomic sequencing

    DEFF Research Database (Denmark)

    Mak, Sarah Siu Tze Mak; Gopalakrishnan, Shyam Sunder; Carøe, Christian

    2017-01-01

    on degraded DNA, then directly compared the sequencing performance and data quality of the BGISEQ-500 to the Illumina HiSeq2500 platform, on DNA extracted from eight historic and ancient dog and wolf samples. Results: The data generated was largely comparable between sequencing platforms...... difference was also observed in the mitochondrial DNA percentages recovered (p = 0.018), although we believe this is likely a stochastic effect relating to the extremely low levels of mitochondria that were sequenced from three of the samples with overall very low levels of endogenous DNA. Conclusions......: Although we acknowledge our analyses were limited to animal material, our observations suggest that the BGISEQ-500 holds the potential to represent valid and potentially valuable alternative platform for palaeogenomic data generation, that is worthy of future exploration by those interested...

  8. MytiBase: a knowledgebase of mussel (M. galloprovincialis transcribed sequences

    Directory of Open Access Journals (Sweden)

    Roch Philippe

    2009-02-01

    Full Text Available Abstract Background Although Bivalves are among the most studied marine organisms due to their ecological role, economic importance and use in pollution biomonitoring, very little information is available on the genome sequences of mussels. This study reports the functional analysis of a large-scale Expressed Sequence Tag (EST sequencing from different tissues of Mytilus galloprovincialis (the Mediterranean mussel challenged with toxic pollutants, temperature and potentially pathogenic bacteria. Results We have constructed and sequenced seventeen cDNA libraries from different Mediterranean mussel tissues: gills, digestive gland, foot, anterior and posterior adductor muscle, mantle and haemocytes. A total of 24,939 clones were sequenced from these libraries generating 18,788 high-quality ESTs which were assembled into 2,446 overlapping clusters and 4,666 singletons resulting in a total of 7,112 non-redundant sequences. In particular, a high-quality normalized cDNA library (Nor01 was constructed as determined by the high rate of gene discovery (65.6%. Bioinformatic screening of the non-redundant M. galloprovincialis sequences identified 159 microsatellite-containing ESTs. Clusters, consensuses, related similarities and gene ontology searches have been organized in a dedicated, searchable database http://mussel.cribi.unipd.it. Conclusion We defined the first species-specific catalogue of M. galloprovincialis ESTs including 7,112 unique transcribed sequences. Putative microsatellite markers were identified. This annotated catalogue represents a valuable platform for expression studies, marker validation and genetic linkage analysis for investigations in the biology of Mediterranean mussels.

  9. Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses.

    Directory of Open Access Journals (Sweden)

    Arthur W Pightling

    Full Text Available The wide availability of whole-genome sequencing (WGS and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i depth of sequencing coverage, ii choice of reference-guided short-read sequence assembler, iii choice of reference genome, and iv whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT, using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming. We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers

  10. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

    Science.gov (United States)

    Schirmer, Melanie; Ijaz, Umer Z; D'Amore, Rosalinda; Hall, Neil; Sloan, William T; Quince, Christopher

    2015-03-31

    With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Haematobia irritans dataset of raw sequence reads from Illumina and Pac Bio sequencing of genomic DNA

    Science.gov (United States)

    The genome of the horn fly, Haematobia irritans, was sequenced using Illumina- and Pac Bio-based protocols. Following quality filtering, the raw reads have been deposited at NCBI under the BioProject and BioSample accession numbers PRJNA30967 and SAMN07830356, respectively. The Illumina reads are un...

  12. Math and Movement: Practical Ways to Incorporate Math into Physical Education

    Science.gov (United States)

    Wade, Marcia

    2016-01-01

    Each year, physical educators are asked to incorporate even more math, language arts, science and social studies into their curriculum. The challenge is how to do this without sacrificing the essential health and life skills provided by a quality physical education program. One program, Math & Movement, is a great aid for physical educators to…

  13. Sequence variants in the bovine silent information regulator 6, their linkage and their associations with body measurements and carcass quality traits in Qinchuan cattle.

    Science.gov (United States)

    Gui, Linsheng; Jiang, Bijie; Zhang, Yaran; Zan, Linsen

    2015-03-15

    Silent information regulator 6 (SIRT6) belongs to the family of class III nicotinamide adenine dinucleotide (NAD)-dependent deacetylase and plays an essential role in DNA repair and metabolism. This study was conducted to detect potential polymorphisms of the bovine SIRT6 gene and explore their relationships with body measurement and carcass quality in Qinchuan cattle. Four sequence variants (SVs) were identified in intron 6, exon 7, exon 9, and 3' UTR, via sequencing technology conducted in 468 individual Qinchuan cattle. Eleven different haplotypes were identified, of which two major haplotypes had a frequency of 45.7% (-CACT-) and 14.8% (-CGTC-). Three SVs (SV2, SV3 and SV4) were significantly associated with some of the body measurements and carcass quality traits (P<0.05 or P<0.01), and the H2H7 (CC-GA-TT-TC) diplotype had better performance than other combinations. Our results suggest that some polymorphisms in SIRT6 are associated with production traits and may be used as candidates for marker-assisted selection (MAS) and management in beef cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. A single multilocus sequence typing (MLST) scheme for seven pathogenic Leptospira species

    NARCIS (Netherlands)

    Boonsilp, Siriphan; Thaipadungpanit, Janjira; Amornchai, Premjit; Wuthiekanun, Vanaporn; Bailey, Mark S.; Holden, Matthew T. G.; Zhang, Cuicai; Jiang, Xiugao; Koizumi, Nobuo; Taylor, Kyle; Galloway, Renee; Hoffmaster, Alex R.; Craig, Scott; Smythe, Lee D.; Hartskeerl, Rudy A.; Day, Nicholas P.; Chantratita, Narisara; Feil, Edward J.; Aanensen, David M.; Spratt, Brian G.; Peacock, Sharon J.

    2013-01-01

    The available Leptospira multilocus sequence typing (MLST) scheme supported by a MLST website is limited to L. interrogans and L. kirschneri. Our aim was to broaden the utility of this scheme to incorporate a total of seven pathogenic species. We modified the existing scheme by replacing one of the

  15. Differentiating Research, Quality Improvement, and Case Studies to Ethically Incorporate Pregnant Women.

    Science.gov (United States)

    Phillippi, Julia C; Hartmann, Katherine E

    2018-01-01

    Pregnant women have been called therapeutic orphans because data supporting common interventions, medications, health teaching, and models of care are meager. The generation of quality evidence benefits from proactive approaches that ensure ethical standards are met to protect participants. The purpose of this article is to differentiate among health care, quality improvement, and research and to discuss ethical involvement of women who are pregnant and potentially childbearing in these initiatives. Health care is provided to protect and improve individual health. Quality improvement aims to enhance delivery of care for all those receiving care in particular settings. Research, whether retrospective or prospective, is designed to contribute to generalizable knowledge. This review includes vignettes to distinguish between research, quality improvement, and case study dissemination and to highlight the value of publication of information with applicability beyond a single site. As a community, perinatal care providers will be able to contribute more evidence to guide care if they err on the side of seeking institutional review board approval for activities that examine the care and outcomes of pregnant women and the fetus. Traditional research activities, including clinical trials, remain crucial. However, to fill gaps in knowledge, we must expedite our ability to report informative cases, examine clinical data, share lessons learned during quality improvement campaigns, and publish and disseminate these findings. Accelerating improvements in care demands expansion of the evidence base. © 2017 by the American College of Nurse-Midwives.

  16. Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.

    Science.gov (United States)

    Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

    2015-01-01

    Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer's, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how

  17. Strategies for achieving high sequencing accuracy for low diversity samples and avoiding sample bleeding using illumina platform.

    Directory of Open Access Journals (Sweden)

    Abhishek Mitra

    Full Text Available Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding. Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants. Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer's, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively

  18. The role of RNase H2 in processing ribonucleotides incorporated during DNA replication.

    Science.gov (United States)

    Williams, Jessica S; Gehle, Daniel B; Kunkel, Thomas A

    2017-05-01

    Saccharomyces cerevisiae RNase H2 resolves RNA-DNA hybrids formed during transcription and it incises DNA at single ribonucleotides incorporated during nuclear DNA replication. To distinguish between the roles of these two activities in maintenance of genome stability, here we investigate the phenotypes of a mutant of yeast RNase H2 (rnh201-RED; ribonucleotide excision defective) that retains activity on RNA-DNA hybrids but is unable to cleave single ribonucleotides that are stably incorporated into the genome. The rnh201-RED mutant was expressed in wild type yeast or in a strain that also encodes a mutant allele of DNA polymerase ε (pol2-M644G) that enhances ribonucleotide incorporation during DNA replication. Similar to a strain that completely lacks RNase H2 (rnh201Δ), the pol2-M644G rnh201-RED strain exhibits replication stress and checkpoint activation. Moreover, like its null mutant counterpart, the double mutant pol2-M644G rnh201-RED strain and the single mutant rnh201-RED strain delete 2-5 base pairs in repetitive sequences at a high rate that is topoisomerase 1-dependent. The results highlight an important role for RNase H2 in maintaining genome integrity by removing single ribonucleotides incorporated during DNA replication. Published by Elsevier B.V.

  19. Understanding arsenic incorporation in CdTe with atom probe tomography

    Energy Technology Data Exchange (ETDEWEB)

    Burton, G. L.; Diercks, D. R.; Ogedengbe, O. S.; Jayathilaka, P. A. R. D.; Edirisooriya, M.; Myers, T. H.; Zaunbrecher, K. N.; Moseley, J.; Barnes, T. M.; Gorman, B. P.

    2018-08-01

    Overcoming the open circuit voltage deficiency in Cadmium Telluride (CdTe) photovoltaics may be achieved by increasing p-type doping while maintaining or increasing minority carrier lifetimes. Here, routes to higher doping efficiency using arsenic are explored through an atomic scale understanding of dopant incorporation limits and activation in molecular beam epitaxy grown CdTe layers. Atom probe tomography reveals spatial segregation into nanometer scale clusters containing > 60 at% As for samples with arsenic incorporation levels greater than 7-8 x 10^17 cm-3. The presence of arsenic clusters was accompanied by crystal quality degradation, particularly the introduction of arsenic-enriched extended defects. Post-growth annealing treatments are shown to increase the size of the As precipitates and the amount of As within the precipitates.

  20. Oxygen plasma etching of silver-incorporated diamond-like carbon films

    International Nuclear Information System (INIS)

    Marciano, F.R.; Bonetti, L.F.; Pessoa, R.S.; Massi, M.; Santos, L.V.; Trava-Airoldi, V.J.

    2009-01-01

    Diamond-like carbon (DLC) film as a solid lubricant coating represents an important area of investigation related to space devices. The environment for such devices involves high vacuum and high concentration of atomic oxygen. The purpose of this paper is to study the behavior of silver-incorporated DLC thin films against oxygen plasma etching. Silver nanoparticles were produced through an electrochemical process and incorporated into DLC bulk during the deposition process using plasma enhanced chemical vapor deposition technique. The presence of silver does not affect significantly DLC quality and reduces by more than 50% the oxygen plasma etching. Our results demonstrated that silver nanoparticles protect DLC films against etching process, which may increase their lifetime in low earth orbit environment.

  1. Oxygen plasma etching of silver-incorporated diamond-like carbon films

    Energy Technology Data Exchange (ETDEWEB)

    Marciano, F.R., E-mail: fernanda@las.inpe.b [Instituto Nacional de Pesquisas Espaciais (INPE), Laboratorio Associado de Sensores e Materiais (LAS), Av. dos Astronautas 1758, Sao Jose dos Campos, 12227-010, SP (Brazil); Instituto Tecnologico de Aeronautica (ITA), Centro Tecnico Aeroespacial (CTA), Pca. Marechal Eduardo Gomes, 50-Sao Jose dos Campos, 12228-900, SP (Brazil); Bonetti, L.F. [Clorovale Diamantes Industria e Comercio Ltda, Estr. do Torrao de Ouro, 500-Sao Jose dos Campos, 12229-390, SP (Brazil); Pessoa, R.S.; Massi, M. [Instituto Tecnologico de Aeronautica (ITA), Centro Tecnico Aeroespacial (CTA), Pca. Marechal Eduardo Gomes, 50-Sao Jose dos Campos, 12228-900, SP (Brazil); Santos, L.V.; Trava-Airoldi, V.J. [Instituto Nacional de Pesquisas Espaciais (INPE), Laboratorio Associado de Sensores e Materiais (LAS), Av. dos Astronautas 1758, Sao Jose dos Campos, 12227-010, SP (Brazil)

    2009-08-03

    Diamond-like carbon (DLC) film as a solid lubricant coating represents an important area of investigation related to space devices. The environment for such devices involves high vacuum and high concentration of atomic oxygen. The purpose of this paper is to study the behavior of silver-incorporated DLC thin films against oxygen plasma etching. Silver nanoparticles were produced through an electrochemical process and incorporated into DLC bulk during the deposition process using plasma enhanced chemical vapor deposition technique. The presence of silver does not affect significantly DLC quality and reduces by more than 50% the oxygen plasma etching. Our results demonstrated that silver nanoparticles protect DLC films against etching process, which may increase their lifetime in low earth orbit environment.

  2. Digital image sequence processing, compression, and analysis

    CERN Document Server

    Reed, Todd R

    2004-01-01

    IntroductionTodd R. ReedCONTENT-BASED IMAGE SEQUENCE REPRESENTATIONPedro M. Q. Aguiar, Radu S. Jasinschi, José M. F. Moura, andCharnchai PluempitiwiriyawejTHE COMPUTATION OF MOTIONChristoph Stiller, Sören Kammel, Jan Horn, and Thao DangMOTION ANALYSIS AND DISPLACEMENT ESTIMATION IN THE FREQUENCY DOMAINLuca Lucchese and Guido Maria CortelazzoQUALITY OF SERVICE ASSESSMENT IN NEW GENERATION WIRELESS VIDEO COMMUNICATIONSGaetano GiuntaERROR CONCEALMENT IN DIGITAL VIDEOFrancesco G.B. De NataleIMAGE SEQUENCE RESTORATION: A WIDER PERSPECTIVEAnil KokaramVIDEO SUMMARIZATIONCuneyt M. Taskiran and Edward

  3. Heuristics for multiobjective multiple sequence alignment.

    Science.gov (United States)

    Abbasi, Maryam; Paquete, Luís; Pereira, Francisco B

    2016-07-15

    Aligning multiple sequences arises in many tasks in Bioinformatics. However, the alignments produced by the current software packages are highly dependent on the parameters setting, such as the relative importance of opening gaps with respect to the increase of similarity. Choosing only one parameter setting may provide an undesirable bias in further steps of the analysis and give too simplistic interpretations. In this work, we reformulate multiple sequence alignment from a multiobjective point of view. The goal is to generate several sequence alignments that represent a trade-off between maximizing the substitution score and minimizing the number of indels/gaps in the sum-of-pairs score function. This trade-off gives to the practitioner further information about the similarity of the sequences, from which she could analyse and choose the most plausible alignment. We introduce several heuristic approaches, based on local search procedures, that compute a set of sequence alignments, which are representative of the trade-off between the two objectives (substitution score and indels). Several algorithm design options are discussed and analysed, with particular emphasis on the influence of the starting alignment and neighborhood search definitions on the overall performance. A perturbation technique is proposed to improve the local search, which provides a wide range of high-quality alignments. The proposed approach is tested experimentally on a wide range of instances. We performed several experiments with sequences obtained from the benchmark database BAliBASE 3.0. To evaluate the quality of the results, we calculate the hypervolume indicator of the set of score vectors returned by the algorithms. The results obtained allow us to identify reasonably good choices of parameters for our approach. Further, we compared our method in terms of correctly aligned pairs ratio and columns correctly aligned ratio with respect to reference alignments. Experimental results show

  4. Lossy compression of quality scores in genomic data.

    Science.gov (United States)

    Cánovas, Rodrigo; Moffat, Alistair; Turpin, Andrew

    2014-08-01

    Next-generation sequencing technologies are revolutionizing medicine. Data from sequencing technologies are typically represented as a string of bases, an associated sequence of per-base quality scores and other metadata, and in aggregate can require a large amount of space. The quality scores show how accurate the bases are with respect to the sequencing process, that is, how confident the sequencer is of having called them correctly, and are the largest component in datasets in which they are retained. Previous research has examined how to store sequences of bases effectively; here we add to that knowledge by examining methods for compressing quality scores. The quality values originate in a continuous domain, and so if a fidelity criterion is introduced, it is possible to introduce flexibility in the way these values are represented, allowing lossy compression over the quality score data. We present existing compression options for quality score data, and then introduce two new lossy techniques. Experiments measuring the trade-off between compression ratio and information loss are reported, including quantifying the effect of lossy representations on a downstream application that carries out single nucleotide polymorphism and insert/deletion detection. The new methods are demonstrably superior to other techniques when assessed against the spectrum of possible trade-offs between storage required and fidelity of representation. An implementation of the methods described here is available at https://github.com/rcanovas/libCSAM. rcanovas@student.unimelb.edu.au Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. The Release 6 reference sequence of the Drosophila melanogaster genome.

    Science.gov (United States)

    Hoskins, Roger A; Carlson, Joseph W; Wan, Kenneth H; Park, Soo; Mendez, Ivonne; Galle, Samuel E; Booth, Benjamin W; Pfeiffer, Barret D; George, Reed A; Svirskas, Robert; Krzywinski, Martin; Schein, Jacqueline; Accardo, Maria Carmela; Damia, Elisabetta; Messina, Giovanni; Méndez-Lago, María; de Pablos, Beatriz; Demakova, Olga V; Andreyeva, Evgeniya N; Boldyreva, Lidiya V; Marra, Marco; Carvalho, A Bernardo; Dimitri, Patrizio; Villasante, Alfredo; Zhimulev, Igor F; Rubin, Gerald M; Karpen, Gary H; Celniker, Susan E

    2015-03-01

    Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads. © 2015 Hoskins et al.; Published by Cold Spring Harbor Laboratory Press.

  6. Draft Genome Sequence and Gene Annotation of the Entomopathogenic Fungus Verticillium hemipterigenum

    OpenAIRE

    Horn, Fabian; Habel, Andreas; Scharf, Daniel H.; Dworschak, Jan; Brakhage, Axel A.; Guthke, Reinhard; Hertweck, Christian; Linde, J?rg

    2015-01-01

    Verticillium hemipterigenum (anamorph Torrubiella hemipterigena) is an entomopathogenic fungus and produces a broad range of secondary metabolites. Here, we present the draft genome sequence of the fungus, including gene structure and functional annotation. Genes were predicted incorporating RNA-Seq data and functionally annotated to provide the basis for further genome studies.

  7. Quantum Chemical Examination of the Sequential Halogen Incorporation Scheme for the Modeling of Speciation of I/Br/Cl-Containing Trihalomethanes.

    Science.gov (United States)

    Zhang, Chenyang; Li, Maodong; Han, Xuze; Yan, Mingquan

    2018-02-20

    The recently developed three-step ternary halogenation model interprets the incorporation of chlorine, bromine, and iodine ions into natural organic matter (NOM) and formation of iodine-, bromine-, and chlorine-containing trihalomethanes (THMs) based on the competition of iodine, bromine, and chlorine species at each node of the halogenation sequence. This competition is accounted for using the dimensionless ratios (denoted as γ) of kinetic rates of reactions of the initial attack sites or halogenated intermediates with chlorine, bromine, and iodine ions. However, correlations between the model predictions made and mechanistic aspects of the incorporation of halogen species need to be ascertained in more detail. In this study, quantum chemistry calculations were first used to probe the formation mechanism of 10 species of Cl-/Br-/I- THMs. The HOMO energy (E HOMO ) of each mono-, bi-, or trihalomethanes were calculated by B3LYP method in Gaussian 09 software. Linear correlations were found to exist between the logarithms of experimentally determined kinetic preference coefficients γ reported in prior research and, on the other hand, differences of E HOMO values between brominated/iodinated and chlorinated halomethanes. One notable exception from this trend was that observed for the incorporation of iodine into mono- and di-iodinated intermediates. These observations confirm the three-step halogen incorporation sequence and the factor γ in the statistical model. The combined use of quantum chemistry calculations and the ternary sequential halogenation model provides a new insight into the microscopic nature of NOM-halogen interactions and the trends seen in the behavior of γ factors incorporated in the THM speciation models.

  8. Estimating the annotation error rate of curated GO database sequence annotations

    Directory of Open Access Journals (Sweden)

    Brown Alfred L

    2007-05-01

    Full Text Available Abstract Background Annotations that describe the function of sequences are enormously important to researchers during laboratory investigations and when making computational inferences. However, there has been little investigation into the data quality of sequence function annotations. Here we have developed a new method of estimating the error rate of curated sequence annotations, and applied this to the Gene Ontology (GO sequence database (GOSeqLite. This method involved artificially adding errors to sequence annotations at known rates, and used regression to model the impact on the precision of annotations based on BLAST matched sequences. Results We estimated the error rate of curated GO sequence annotations in the GOSeqLite database (March 2006 at between 28% and 30%. Annotations made without use of sequence similarity based methods (non-ISS had an estimated error rate of between 13% and 18%. Annotations made with the use of sequence similarity methodology (ISS had an estimated error rate of 49%. Conclusion While the overall error rate is reasonably low, it would be prudent to treat all ISS annotations with caution. Electronic annotators that use ISS annotations as the basis of predictions are likely to have higher false prediction rates, and for this reason designers of these systems should consider avoiding ISS annotations where possible. Electronic annotators that use ISS annotations to make predictions should be viewed sceptically. We recommend that curators thoroughly review ISS annotations before accepting them as valid. Overall, users of curated sequence annotations from the GO database should feel assured that they are using a comparatively high quality source of information.

  9. Quality Assessment of Domesticated Animal Genome Assemblies

    DEFF Research Database (Denmark)

    Seemann, Stefan E; Anthon, Christian; Palasca, Oana

    2015-01-01

    affected by the lack of genomic sequence. Herein, we quantify the quality of the genome assemblies of 20 domesticated animals and related species by assessing a range of measurable parameters, and we show that there is a positive correlation between the fraction of mappable reads from RNAseq data...... domesticated animal genomes still need to be sequenced deeper in order to produce high-quality assemblies. In the meanwhile, ironically, the extent to which RNAseq and other next-generation data is produced frequently far exceeds that of the genomic sequence. Furthermore, basic comparative analysis is often...

  10. On generalized fixed sequence procedures for controlling the FWER.

    Science.gov (United States)

    Qiu, Zhiying; Guo, Wenge; Lynch, Gavin

    2015-12-30

    Testing a sequence of pre-ordered hypotheses to decide which of these can be rejected or accepted while controlling the familywise error rate (FWER) is of importance in many scientific studies such as clinical trials. In this paper, we first introduce a generalized fixed sequence procedure whose critical values are defined by using a function of the numbers of rejections and acceptances, and which allows follow-up hypotheses to be tested even if some earlier hypotheses are not rejected. We then construct the least favorable configuration for this generalized fixed sequence procedure and present a sufficient condition for the FWER control under arbitrary dependence. Based on the condition, we develop three new generalized fixed sequence procedures controlling the FWER under arbitrary dependence. We also prove that each generalized fixed sequence procedure can be described as a specific closed testing procedure. Through simulation studies and a clinical trial example, we compare the power performance of these proposed procedures with those of the existing FWER controlling procedures. Finally, when the pairwise joint distributions of the true null p-values are known, we further improve these procedures by incorporating pairwise correlation information while maintaining the control of the FWER. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.

  11. Management of High-Throughput DNA Sequencing Projects: Alpheus.

    Science.gov (United States)

    Miller, Neil A; Kingsmore, Stephen F; Farmer, Andrew; Langley, Raymond J; Mudge, Joann; Crow, John A; Gonzalez, Alvaro J; Schilkey, Faye D; Kim, Ryan J; van Velkinburgh, Jennifer; May, Gregory D; Black, C Forrest; Myers, M Kathy; Utsey, John P; Frost, Nicholas S; Sugarbaker, David J; Bueno, Raphael; Gullans, Stephen R; Baxter, Susan M; Day, Steve W; Retzel, Ernest F

    2008-12-26

    High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multi-gigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.

  12. AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.

    Directory of Open Access Journals (Sweden)

    Qi Zheng

    2016-10-01

    Full Text Available Accurate mapping of next-generation sequencing (NGS reads to reference genomes is crucial for almost all NGS applications and downstream analyses. Various repetitive elements in human and other higher eukaryotic genomes contribute in large part to ambiguously (non-uniquely mapped reads. Most available NGS aligners attempt to address this by either removing all non-uniquely mapping reads, or reporting one random or "best" hit based on simple heuristics. Accurate estimation of the mapping quality of NGS reads is therefore critical albeit completely lacking at present. Here we developed a generalized software toolkit "AlignerBoost", which utilizes a Bayesian-based framework to accurately estimate mapping quality of ambiguously mapped NGS reads. We tested AlignerBoost with both simulated and real DNA-seq and RNA-seq datasets at various thresholds. In most cases, but especially for reads falling within repetitive regions, AlignerBoost dramatically increases the mapping precision of modern NGS aligners without significantly compromising the sensitivity even without mapping quality filters. When using higher mapping quality cutoffs, AlignerBoost achieves a much lower false mapping rate while exhibiting comparable or higher sensitivity compared to the aligner default modes, therefore significantly boosting the detection power of NGS aligners even using extreme thresholds. AlignerBoost is also SNP-aware, and higher quality alignments can be achieved if provided with known SNPs. AlignerBoost's algorithm is computationally efficient, and can process one million alignments within 30 seconds on a typical desktop computer. AlignerBoost is implemented as a uniform Java application and is freely available at https://github.com/Grice-Lab/AlignerBoost.

  13. AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.

    Science.gov (United States)

    Zheng, Qi; Grice, Elizabeth A

    2016-10-01

    Accurate mapping of next-generation sequencing (NGS) reads to reference genomes is crucial for almost all NGS applications and downstream analyses. Various repetitive elements in human and other higher eukaryotic genomes contribute in large part to ambiguously (non-uniquely) mapped reads. Most available NGS aligners attempt to address this by either removing all non-uniquely mapping reads, or reporting one random or "best" hit based on simple heuristics. Accurate estimation of the mapping quality of NGS reads is therefore critical albeit completely lacking at present. Here we developed a generalized software toolkit "AlignerBoost", which utilizes a Bayesian-based framework to accurately estimate mapping quality of ambiguously mapped NGS reads. We tested AlignerBoost with both simulated and real DNA-seq and RNA-seq datasets at various thresholds. In most cases, but especially for reads falling within repetitive regions, AlignerBoost dramatically increases the mapping precision of modern NGS aligners without significantly compromising the sensitivity even without mapping quality filters. When using higher mapping quality cutoffs, AlignerBoost achieves a much lower false mapping rate while exhibiting comparable or higher sensitivity compared to the aligner default modes, therefore significantly boosting the detection power of NGS aligners even using extreme thresholds. AlignerBoost is also SNP-aware, and higher quality alignments can be achieved if provided with known SNPs. AlignerBoost's algorithm is computationally efficient, and can process one million alignments within 30 seconds on a typical desktop computer. AlignerBoost is implemented as a uniform Java application and is freely available at https://github.com/Grice-Lab/AlignerBoost.

  14. Air quality indices : a review

    International Nuclear Information System (INIS)

    Hewings, J.

    2001-10-01

    Pollution Probe presents some background information that will help in the development of a national Air Quality Index (AQI) in Canada. This report examines the issues that should be addressed in revising the national Index of the Quality of Air (IQUA) or creating a new national Air Quality Index. The IQUA was devised in 1976 and provides Canadians with real-time information on the state of community air quality by including major pollutants and their synergies. It is currently being used for air quality management plans and air quality alert systems. At the same time that the IQUA was devised, the United States Environmental Protection Agency (US EPA) produced a parallel air quality index known as the Pollution Standard Index (PSI) which incorporated 5 criteria pollutants (particulate matter, sulphur dioxide, carbon monoxide, nitrogen oxide and ground level ozone) for which national health-based standards were devised. In 1999, the US EPA renamed their index the Air Quality Index (AQI) and made revisions to the primary health-based national ambient air quality standards for ground-level ozone and particulate matter. Separate values for PM2.5 and PM10 were incorporated and mandatory reporting was required for metropolitan areas with populations of 350,000 or more. Similarly, the IQUA has undergone major developments that affect the validity of the index, including: rejection by the Working Group on Air Quality Objectives and Guidelines of the previous maximum desirable and maximum acceptable air quality criteria, recognition that standards for many of the contaminants are outdated, developing more sensitive instrumentation for real-time monitoring of contaminants. This report also describes the use of the national short term Air Quality Index by provincial, territorial and local authorities in Canada. Pollution Probe recommends setting up a mechanism to review and revise IQUA on a regular basis that would incorporate governments, the medical profession, special

  15. QTL MAPPING FOR GRAIN QUALITY TRAITS IN TESTCROSSES OF A MAIZE BIPARENTAL POPULATION USING GENOTYPING-BY-SEQUENCING DATA

    Directory of Open Access Journals (Sweden)

    Mario Franić

    2017-01-01

    Full Text Available We performed QTL mapping in testcrosses of maize population IBMSyn4 for three grain quality traits: oil and protein contents and test weight. 191 phenotyped and genotyped lines were used as a training set while 85 genotyped only lines comprised a validation set used to calculate best linear unbiased predictions (BLUP, making a total of 276 phenotypes for the QTL analysis. 92000 filtered Genotyping-By-Sequencing (GBS SNP markers were used to calculate BLUPs, while a set of 2178 genetically mapped SSRs was used in QTL analysis. By simple QTL scan, we scored several minor effect QTLs: one for oil content (chromosome 1, one for protein content (chromosome 10 and four for test weight (chromosomes 1, 3, 5 and 10. QTLs associated with test weight were found to be additive, and 18.25% of phenotypic variance was explained by their joint effect. Only one QTL for test weight was found to be significant in composite interval mapping and it was mapped on chromosome 5. This QTL accounted for 9.97% of phenotypic variance. QTLs detected in this study represent monitoring of commercially most successful elite maize germplasm for grain quality traits.

  16. Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space

    Science.gov (United States)

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R.; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J.

    2013-01-01

    For the vast majority of species – including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding. PMID:23592960

  17. Combinatorial pooling enables selective sequencing of the barley gene space.

    Directory of Open Access Journals (Sweden)

    Stefano Lonardi

    2013-04-01

    Full Text Available For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  18. Combinatorial pooling enables selective sequencing of the barley gene space.

    Science.gov (United States)

    Lonardi, Stefano; Duma, Denisa; Alpert, Matthew; Cordero, Francesca; Beccuti, Marco; Bhat, Prasanna R; Wu, Yonghui; Ciardo, Gianfranco; Alsaihati, Burair; Ma, Yaqin; Wanamaker, Steve; Resnik, Josh; Bozdag, Serdar; Luo, Ming-Cheng; Close, Timothy J

    2013-04-01

    For the vast majority of species - including many economically or ecologically important organisms, progress in biological research is hampered due to the lack of a reference genome sequence. Despite recent advances in sequencing technologies, several factors still limit the availability of such a critical resource. At the same time, many research groups and international consortia have already produced BAC libraries and physical maps and now are in a position to proceed with the development of whole-genome sequences organized around a physical map anchored to a genetic map. We propose a BAC-by-BAC sequencing protocol that combines combinatorial pooling design and second-generation sequencing technology to efficiently approach denovo selective genome sequencing. We show that combinatorial pooling is a cost-effective and practical alternative to exhaustive DNA barcoding when preparing sequencing libraries for hundreds or thousands of DNA samples, such as in this case gene-bearing minimum-tiling-path BAC clones. The novelty of the protocol hinges on the computational ability to efficiently compare hundred millions of short reads and assign them to the correct BAC clones (deconvolution) so that the assembly can be carried out clone-by-clone. Experimental results on simulated data for the rice genome show that the deconvolution is very accurate, and the resulting BAC assemblies have high quality. Results on real data for a gene-rich subset of the barley genome confirm that the deconvolution is accurate and the BAC assemblies have good quality. While our method cannot provide the level of completeness that one would achieve with a comprehensive whole-genome sequencing project, we show that it is quite successful in reconstructing the gene sequences within BACs. In the case of plants such as barley, this level of sequence knowledge is sufficient to support critical end-point objectives such as map-based cloning and marker-assisted breeding.

  19. SNAD: sequence name annotation-based designer

    Directory of Open Access Journals (Sweden)

    Gorbalenya Alexander E

    2009-08-01

    Full Text Available Abstract Background A growing diversity of biological data is tagged with unique identifiers (UIDs associated with polynucleotides and proteins to ensure efficient computer-mediated data storage, maintenance, and processing. These identifiers, which are not informative for most people, are often substituted by biologically meaningful names in various presentations to facilitate utilization and dissemination of sequence-based knowledge. This substitution is commonly done manually that may be a tedious exercise prone to mistakes and omissions. Results Here we introduce SNAD (Sequence Name Annotation-based Designer that mediates automatic conversion of sequence UIDs (associated with multiple alignment or phylogenetic tree, or supplied as plain text list into biologically meaningful names and acronyms. This conversion is directed by precompiled or user-defined templates that exploit wealth of annotation available in cognate entries of external databases. Using examples, we demonstrate how this tool can be used to generate names for practical purposes, particularly in virology. Conclusion A tool for controllable annotation-based conversion of sequence UIDs into biologically meaningful names and acronyms has been developed and placed into service, fostering links between quality of sequence annotation, and efficiency of communication and knowledge dissemination among researchers.

  20. Tools for integrated sequence-structure analysis with UCSF Chimera

    Directory of Open Access Journals (Sweden)

    Huang Conrad C

    2006-07-01

    Full Text Available Abstract Background Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit; (c can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals. Results The molecular graphics program UCSF Chimera includes a suite of tools for interactive analyses of sequences and structures. Structures automatically associate with sequences in imported alignments, allowing many kinds of crosstalk. A novel method is provided to superimpose structures in the absence of a pre-existing sequence alignment. The method uses both sequence and secondary structure, and can match even structures with very low sequence identity. Another tool constructs structure-based sequence alignments from superpositions of two or more proteins. Chimera is designed to be extensible, and mechanisms for incorporating user-specific data without Chimera code development are also provided. Conclusion The tools described here apply to many problems involving comparison and analysis of protein structures and their sequences. Chimera includes complete documentation and is intended for use by a wide range of scientists, not just those in the computational disciplines. UCSF Chimera is free for non-commercial use and is

  1. Optical flow estimation on image sequences with differently exposed frames

    Science.gov (United States)

    Bengtsson, Tomas; McKelvey, Tomas; Lindström, Konstantin

    2015-09-01

    Optical flow (OF) methods are used to estimate dense motion information between consecutive frames in image sequences. In addition to the specific OF estimation method itself, the quality of the input image sequence is of crucial importance to the quality of the resulting flow estimates. For instance, lack of texture in image frames caused by saturation of the camera sensor during exposure can significantly deteriorate the performance. An approach to avoid this negative effect is to use different camera settings when capturing the individual frames. We provide a framework for OF estimation on such sequences that contain differently exposed frames. Information from multiple frames are combined into a total cost functional such that the lack of an active data term for saturated image areas is avoided. Experimental results demonstrate that using alternate camera settings to capture the full dynamic range of an underlying scene can clearly improve the quality of flow estimates. When saturation of image data is significant, the proposed methods show superior performance in terms of lower endpoint errors of the flow vectors compared to a set of baseline methods. Furthermore, we provide some qualitative examples of how and when our method should be used.

  2. MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.

    Science.gov (United States)

    Brown, Bonnie L; Watson, Mick; Minot, Samuel S; Rivera, Maria C; Franklin, Rima B

    2017-03-01

    Environmental metagenomic analysis is typically accomplished by assigning taxonomy and/or function from whole genome sequencing or 16S amplicon sequences. Both of these approaches are limited, however, by read length, among other technical and biological factors. A nanopore-based sequencing platform, MinION™, produces reads that are ≥1 × 104 bp in length, potentially providing for more precise assignment, thereby alleviating some of the limitations inherent in determining metagenome composition from short reads. We tested the ability of sequence data produced by MinION (R7.3 flow cells) to correctly assign taxonomy in single bacterial species runs and in three types of low-complexity synthetic communities: a mixture of DNA using equal mass from four species, a community with one relatively rare (1%) and three abundant (33% each) components, and a mixture of genomic DNA from 20 bacterial strains of staggered representation. Taxonomic composition of the low-complexity communities was assessed by analyzing the MinION sequence data with three different bioinformatic approaches: Kraken, MG-RAST, and One Codex. Results: Long read sequences generated from libraries prepared from single strains using the version 5 kit and chemistry, run on the original MinION device, yielded as few as 224 to as many as 3497 bidirectional high-quality (2D) reads with an average overall study length of 6000 bp. For the single-strain analyses, assignment of reads to the correct genus by different methods ranged from 53.1% to 99.5%, assignment to the correct species ranged from 23.9% to 99.5%, and the majority of misassigned reads were to closely related organisms. A synthetic metagenome sequenced with the same setup yielded 714 high quality 2D reads of approximately 5500 bp that were up to 98% correctly assigned to the species level. Synthetic metagenome MinION libraries generated using version 6 kit and chemistry yielded from 899 to 3497 2D reads with lengths averaging 5700 bp with up

  3. Value of a newly sequenced bacterial genome

    DEFF Research Database (Denmark)

    Barbosa, Eudes; Aburjaile, Flavia F; Ramos, Rommel Tj

    2014-01-01

    and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the "scientific value" of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses...... heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting...

  4. High-efficiency/CRI/color stability warm white organic light-emitting diodes by incorporating ultrathin phosphorescence layers in a blue fluorescence layer

    Directory of Open Access Journals (Sweden)

    Miao Yanqin

    2018-01-01

    Full Text Available By incorporating ultrathin (80, low correlated color temperature of <3600 K, and high color stability at a wide voltage range of 5 V–9 V. These hybrid WOLEDs also reveal high forward-viewing external quantum efficiencies (EQE of 17.82%–19.34%, which are close to the theoretical value of 20%, indicating an almost complete exciton harvesting. In addition, the electroluminescence spectra of the hybrid WOLEDs can be easily improved by only changing the incorporating sequence of the ultrathin phosphorescence layers without device efficiency loss. For example, the hybrid WOLED with an incorporation sequence of ultrathin red/yellow/green phosphorescence layers exhibits an ultra-high CRI of 96 and a high EQE of 19.34%. To the best of our knowledge, this is the first WOLED with good tradeoff among device efficiency, CRI, and color stability. The introduction of ultrathin (<0.1 nm phosphorescence layers can also greatly reduce the consumption of phosphorescent emitters as well as simplify device structures and fabrication process, thus leading to low cost. Such a finding is very meaningful for the potential commercialization of hybrid WOLEDs.

  5. A comparative analysis of double inversion recovery TFE and TSE sequences on carotid artery wall imaging

    International Nuclear Information System (INIS)

    Chen Jun; Di Yujin; Bu Chunqing; Zhang Yanfeng; Li Shuhua

    2012-01-01

    Objective: To analyze the characteristics of double inversion recovery (DIR) turbo field echo (TFE) and turbo spin echo (TSE) sequences and explore the value of double inversion recovery TFE sequence on carotid artery wall imaging. Patients and methods: 56 patients (32 males and 24 females, aged 31–76 years with a mean age of 53 years) were performed with DIR TFE and DIR TSE T1 weighted imaging (T1WI) sequences on carotid artery bifurcations. Image quality acquired by different techniques were evaluated and scored by two physicians. Whether there is significant difference is determined by SPSS 11.0 software. Paired-samples t test was used for statistics. Results: There was no significant difference in the image quality scores between two sequences (t = 0.880, P = 0.383 > 0.05). Conclusions: DIR TFE sequence has short scanning time and high spatial resolution. DIR TFE sequence can be used as the preferred sequence for screening carotid atherosclerotic plaque compared with DIR TSE sequence.

  6. ConDeTri--a content dependent read trimmer for Illumina data.

    Science.gov (United States)

    Smeds, Linnéa; Künstner, Axel

    2011-01-01

    During the last few years, DNA and RNA sequencing have started to play an increasingly important role in biological and medical applications, especially due to the greater amount of sequencing data yielded from the new sequencing machines and the enormous decrease in sequencing costs. Particularly, Illumina/Solexa sequencing has had an increasing impact on gathering data from model and non-model organisms. However, accurate and easy to use tools for quality filtering have not yet been established. We present ConDeTri, a method for content dependent read trimming for next generation sequencing data using quality scores of each individual base. The main focus of the method is to remove sequencing errors from reads so that sequencing reads can be standardized. Another aspect of the method is to incorporate read trimming in next-generation sequencing data processing and analysis pipelines. It can process single-end and paired-end sequence data of arbitrary length and it is independent from sequencing coverage and user interaction. ConDeTri is able to trim and remove reads with low quality scores to save computational time and memory usage during de novo assemblies. Low coverage or large genome sequencing projects will especially gain from trimming reads. The method can easily be incorporated into preprocessing and analysis pipelines for Illumina data. Freely available on the web at http://code.google.com/p/condetri.

  7. Application of a Software tool for Evaluating Human Factors in Accident Sequences

    International Nuclear Information System (INIS)

    Queral, Cesar; Exposito, Antonio; Gonzalez, Isaac; Quiroga, Juan Antonio; Ibarra, Aitor; Hortal, Javier; Hulsund, John-Einar; Nilsen, Svein

    2006-01-01

    The Probabilistic Safety Assessment (PSA) includes the actions of the operator like elements in the set of the considered protection performances during accident sequences. Nevertheless, its impact throughout a sequence is not analyzed in a dynamic way. In this sense, it is convenient to make more detailed studies about its importance in the dynamics of the sequences, letting make studies of sensitivity respect to the human reliability and the response times. For this reason, the CSN is involved in several activities oriented to develop a new safety analysis methodology, the Integrated Safety Assessment (ISA), which must be able to incorporate operator actions in conventional thermo-hydraulic (TH) simulations. One of them is the collaboration project between CSN, HRP and the DSE-UPM that started in 2003. In the framework of this project, a software tool has been developed to incorporate operator actions in TH simulations. As a part of the ISA, this tool permits to quantify human error probabilities (HEP) and to evaluate its impact in the final state of the plant. Independently, it can be used for evaluating the impact of the execution by operators of procedures and guidelines in the final state of the plant and the evaluation of the allowable response times for the manual actions of the operator. The results obtained in the first pilot case are included in this paper. (authors)

  8. The Douglas-fir genome sequence reveals specialization of the photosynthetic apparatus in Pinaceae

    Science.gov (United States)

    David B. Neale; Patrick E. McGuire; Nicholas C. Wheeler; Kristian A. Stevens; Marc W. Crepeau; Charis Cardeno; Aleksey V. Zimin; Daniela Puiu; Geo M. Pertea; U. Uzay Sezen; Claudio Casola; Tomasz E. Koralewski; Robin Paul; Daniel Gonzalez-Ibeas; Sumaira Zaman; Richard Cronn; Mark Yandell; Carson Holt; Charles H. Langley; James A. Yorke; Steven L. Salzberg; Jill L. Wegrzyn

    2017-01-01

    A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb.) Franco (Coastal Douglas-fir) is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50...

  9. Quality assurance for the incorporation surveillance in Germany; Qualitaetssicherung fuer die Inkorporationsueberwachung in Deutschland

    Energy Technology Data Exchange (ETDEWEB)

    Buchholz, Werner; Gerstmann, Udo; Hartmann, Martina; Loescher, Simone; Meisenberg, Oliver; Woidy, Patrick [Bundesamt fuer Strahlenschutz, Oberschleissheim (Germany). FB Strahlenschutz und Gesundheit

    2017-08-01

    The coordination center of the Bundesamt fuer Strahlenschutz is performing annual ring experiments for in-vivo and in-vitro measuring points. This procedure allows checking the measuring data and the competence of the personnel. In the last years these tests included also radionuclides that are important in radiation emergency protection. Case studies for dose determination based on incorporation scenarios are included.

  10. Genome sequence of the Brown Norway rat yields insights into mammalian evolution

    DEFF Research Database (Denmark)

    Gibbs, Richard A; Weinstock, George M; Metzker, Michael L

    2004-01-01

    The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering ove...

  11. A novel constraint for thermodynamically designing DNA sequences.

    Directory of Open Access Journals (Sweden)

    Qiang Zhang

    Full Text Available Biotechnological and biomolecular advances have introduced novel uses for DNA such as DNA computing, storage, and encryption. For these applications, DNA sequence design requires maximal desired (and minimal undesired hybridizations, which are the product of a single new DNA strand from 2 single DNA strands. Here, we propose a novel constraint to design DNA sequences based on thermodynamic properties. Existing constraints for DNA design are based on the Hamming distance, a constraint that does not address the thermodynamic properties of the DNA sequence. Using a unique, improved genetic algorithm, we designed DNA sequence sets which satisfy different distance constraints and employ a free energy gap based on a minimum free energy (MFE to gauge DNA sequences based on set thermodynamic properties. When compared to the best constraints of the Hamming distance, our method yielded better thermodynamic qualities. We then used our improved genetic algorithm to obtain lower-bound DNA sequence sets. Here, we discuss the effects of novel constraint parameters on the free energy gap.

  12. Development of a mission-based funding model for undergraduate medical education: incorporation of quality.

    Science.gov (United States)

    Stagnaro-Green, Alex; Roe, David; Soto-Greene, Maria; Joffe, Russell

    2008-01-01

    Increasing financial pressures, along with a desire to realign resources with institutional priorities, has resulted in the adoption of mission-based funding (MBF) at many medical schools. The lack of inclusion of quality and the time and expense in developing and implementing mission based funding are major deficiencies in the models reported to date. In academic year 2002-2003 New Jersey Medical School developed a model that included both quantity and quality in the education metric and that was departmentally based. Eighty percent of the undergraduate medical education allocation was based on the quantity of undergraduate medical education taught by the department ($7.35 million), and 20% ($1.89 million) was allocated based on the quality of the education delivered. Quality determinations were made by the educational leadership based on student evaluations and departmental compliance with educational administrative requirements. Evolution of the model has included the development of a faculty oversight committee and the integration of peer evaluation in the determination of educational quality. Six departments had a documented increase in quality over time, and one department had a transient decrease in quality. The MBF model has been well accepted by chairs, educational leaders, and faculty and has been instrumental in enhancing the stature of education at our institution.

  13. DNAApp: a mobile application for sequencing data analysis.

    Science.gov (United States)

    Nguyen, Phi-Vu; Verma, Chandra Shekhar; Gan, Samuel Ken-En

    2014-11-15

    There have been numerous applications developed for decoding and visualization of ab1 DNA sequencing files for Windows and MAC platforms, yet none exists for the increasingly popular smartphone operating systems. The ability to decode sequencing files cannot easily be carried out using browser accessed Web tools. To overcome this hurdle, we have developed a new native app called DNAApp that can decode and display ab1 sequencing file on Android and iOS. In addition to in-built analysis tools such as reverse complementation, protein translation and searching for specific sequences, we have incorporated convenient functions that would facilitate the harnessing of online Web tools for a full range of analysis. Given the high usage of Android/iOS tablets and smartphones, such bioinformatics apps would raise productivity and facilitate the high demand for analyzing sequencing data in biomedical research. The Android version of DNAApp is available in Google Play Store as 'DNAApp', and the iOS version is available in the App Store. More details on the app can be found at www.facebook.com/APDLab; www.bii.a-star.edu.sg/research/trd/apd.php The DNAApp user guide is available at http://tinyurl.com/DNAAppuser, and a video tutorial is available on Google Play Store and App Store, as well as on the Facebook page. samuelg@bii.a-star.edu.sg. © The Author 2014. Published by Oxford University Press.

  14. DNAApp: a mobile application for sequencing data analysis

    Science.gov (United States)

    Nguyen, Phi-Vu; Verma, Chandra Shekhar; Gan, Samuel Ken-En

    2014-01-01

    Summary: There have been numerous applications developed for decoding and visualization of ab1 DNA sequencing files for Windows and MAC platforms, yet none exists for the increasingly popular smartphone operating systems. The ability to decode sequencing files cannot easily be carried out using browser accessed Web tools. To overcome this hurdle, we have developed a new native app called DNAApp that can decode and display ab1 sequencing file on Android and iOS. In addition to in-built analysis tools such as reverse complementation, protein translation and searching for specific sequences, we have incorporated convenient functions that would facilitate the harnessing of online Web tools for a full range of analysis. Given the high usage of Android/iOS tablets and smartphones, such bioinformatics apps would raise productivity and facilitate the high demand for analyzing sequencing data in biomedical research. Availability and implementation: The Android version of DNAApp is available in Google Play Store as ‘DNAApp’, and the iOS version is available in the App Store. More details on the app can be found at www.facebook.com/APDLab; www.bii.a-star.edu.sg/research/trd/apd.php The DNAApp user guide is available at http://tinyurl.com/DNAAppuser, and a video tutorial is available on Google Play Store and App Store, as well as on the Facebook page. Contact: samuelg@bii.a-star.edu.sg PMID:25095882

  15. Single Machine Multi-product Capacitated Lotsizing with Sequence-dependent Setups

    OpenAIRE

    Almada-Lobo , Bernardo; Klabjan , Diego; Carravilla , Maria Antónia; Oliveira , Jose Fernando

    2007-01-01

    Abstract In production planning in the glass container industry, machine dependent setup times and costs are incurred for switchovers from one product to another. The resulting multi-item capacitated lot sizing problem has sequence-dependent setup times and costs. We present two novel linear mixed integer programming formulations for this problem, incorporating all the necessary features of setup carryovers. The compact formulation has polynomially many constraints, while, on the o...

  16. Simultaneous digital quantification and fluorescence-based size characterization of massively parallel sequencing libraries.

    Science.gov (United States)

    Laurie, Matthew T; Bertout, Jessica A; Taylor, Sean D; Burton, Joshua N; Shendure, Jay A; Bielas, Jason H

    2013-08-01

    Due to the high cost of failed runs and suboptimal data yields, quantification and determination of fragment size range are crucial steps in the library preparation process for massively parallel sequencing (or next-generation sequencing). Current library quality control methods commonly involve quantification using real-time quantitative PCR and size determination using gel or capillary electrophoresis. These methods are laborious and subject to a number of significant limitations that can make library calibration unreliable. Herein, we propose and test an alternative method for quality control of sequencing libraries using droplet digital PCR (ddPCR). By exploiting a correlation we have discovered between droplet fluorescence and amplicon size, we achieve the joint quantification and size determination of target DNA with a single ddPCR assay. We demonstrate the accuracy and precision of applying this method to the preparation of sequencing libraries.

  17. Quality Rating and Improvement System State Evaluations and Research

    Science.gov (United States)

    Ferguson, Daniel

    2016-01-01

    A quality rating and improvement system (QRIS) is a method used by states and local jurisdictions to assess the level of quality of child care and early education programs, improve quality, and convey quality ratings to parents and other consumers. A typical QRIS incorporates the following components: quality standards for participating providers;…

  18. Sequence-based heuristics for faster annotation of non-coding RNA families.

    Science.gov (United States)

    Weinberg, Zasha; Ruzzo, Walter L

    2006-01-01

    Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. The source code is available under GNU Public License at the supplementary web site.

  19. Bringing Next-Generation Sequencing into the Classroom through a Comparison of Molecular Biology Techniques

    Science.gov (United States)

    Bowling, Bethany; Zimmer, Erin; Pyatt, Robert E.

    2014-01-01

    Although the development of next-generation (NextGen) sequencing technologies has revolutionized genomic research and medicine, the incorporation of these topics into the classroom is challenging, given an implied high degree of technical complexity. We developed an easy-to-implement, interactive classroom activity investigating the similarities…

  20. Image correlation method for DNA sequence alignment.

    Science.gov (United States)

    Curilem Saldías, Millaray; Villarroel Sassarini, Felipe; Muñoz Poblete, Carlos; Vargas Vásquez, Asticio; Maureira Butler, Iván

    2012-01-01

    The complexity of searches and the volume of genomic data make sequence alignment one of bioinformatics most active research areas. New alignment approaches have incorporated digital signal processing techniques. Among these, correlation methods are highly sensitive. This paper proposes a novel sequence alignment method based on 2-dimensional images, where each nucleic acid base is represented as a fixed gray intensity pixel. Query and known database sequences are coded to their pixel representation and sequence alignment is handled as object recognition in a scene problem. Query and database become object and scene, respectively. An image correlation process is carried out in order to search for the best match between them. Given that this procedure can be implemented in an optical correlator, the correlation could eventually be accomplished at light speed. This paper shows an initial research stage where results were "digitally" obtained by simulating an optical correlation of DNA sequences represented as images. A total of 303 queries (variable lengths from 50 to 4500 base pairs) and 100 scenes represented by 100 x 100 images each (in total, one million base pair database) were considered for the image correlation analysis. The results showed that correlations reached very high sensitivity (99.01%), specificity (98.99%) and outperformed BLAST when mutation numbers increased. However, digital correlation processes were hundred times slower than BLAST. We are currently starting an initiative to evaluate the correlation speed process of a real experimental optical correlator. By doing this, we expect to fully exploit optical correlation light properties. As the optical correlator works jointly with the computer, digital algorithms should also be optimized. The results presented in this paper are encouraging and support the study of image correlation methods on sequence alignment.

  1. KaKs_Calculator 2.0: A Toolkit Incorporating Gamma-Series Methods and Sliding Window Strategies

    KAUST Repository

    Wang, Dapeng

    2010-05-05

    We present an integrated stand-alone software package named KaKs_Calculator 2.0 as an updated version. It incorporates 17 methods for the calculation of nonsynonymous and synonymous substitution rates; among them, we added our modified versions of several widely used methods as the gamma series including γ-NG, γ-LWL, γ-MLWL, γ-LPB, γ-MLPB, γ-YN and γ-MYN, which have been demonstrated to perform better under certain conditions than their original forms and are not implemented in the previous version. The package is readily used for the identification of positively selected sites based on a sliding window across the sequences of interests in 5\\' to 3\\' direction of protein-coding sequences, and have improved the overall performance on sequence analysis for evolution studies. A toolbox, including C++ and Java source code and executable files on both Windows and Linux platforms together with a user instruction, is downloadable from the website for academic purpose at https://sourceforge.net/projects/kakscalculator2/.

  2. KaKs_Calculator 2.0: A Toolkit Incorporating Gamma-Series Methods and Sliding Window Strategies

    KAUST Repository

    Wang, Dapeng; Zhang, Yubin; Zhang, Zhang; Zhu, Jiang; Yu, Jun

    2010-01-01

    We present an integrated stand-alone software package named KaKs_Calculator 2.0 as an updated version. It incorporates 17 methods for the calculation of nonsynonymous and synonymous substitution rates; among them, we added our modified versions of several widely used methods as the gamma series including γ-NG, γ-LWL, γ-MLWL, γ-LPB, γ-MLPB, γ-YN and γ-MYN, which have been demonstrated to perform better under certain conditions than their original forms and are not implemented in the previous version. The package is readily used for the identification of positively selected sites based on a sliding window across the sequences of interests in 5' to 3' direction of protein-coding sequences, and have improved the overall performance on sequence analysis for evolution studies. A toolbox, including C++ and Java source code and executable files on both Windows and Linux platforms together with a user instruction, is downloadable from the website for academic purpose at https://sourceforge.net/projects/kakscalculator2/.

  3. Noninvasive estimation of global activation sequence using the extended Kalman filter.

    Science.gov (United States)

    Liu, Chenguang; He, Bin

    2011-03-01

    A new algorithm for 3-D imaging of the activation sequence from noninvasive body surface potentials is proposed. After formulating the nonlinear relationship between the 3-D activation sequence and the body surface recordings during activation, the extended Kalman filter (EKF) is utilized to estimate the activation sequence in a recursive way. The state vector containing the activation sequence is optimized during iteration by updating the error variance/covariance matrix. A new regularization scheme is incorporated into the "predict" procedure of EKF to tackle the ill-posedness of the inverse problem. The EKF-based algorithm shows good performance in simulation under single-site pacing. Between the estimated activation sequences and true values, the average correlation coefficient (CC) is 0.95, and the relative error (RE) is 0.13. The average localization error (LE) when localizing the pacing site is 3.0 mm. Good results are also obtained under dual-site pacing (CC = 0.93, RE = 0.16, and LE = 4.3 mm). Furthermore, the algorithm shows robustness to noise. The present promising results demonstrate that the proposed EKF-based inverse approach can noninvasively estimate the 3-D activation sequence with good accuracy and the new algorithm shows good features due to the application of EKF.

  4. Low-Bandwidth and Non-Compute Intensive Remote Identification of Microbes from Raw Sequencing Reads

    DEFF Research Database (Denmark)

    Gautier, Laurent; Lund, Ole

    2013-01-01

    , allowing a fully automated processing of sequencing data and routine instant quality check of sequencing runs from desktop sequencers. A web access is available at http://tapir.cbs.dtu.dk. The source code for a python command-line client, a server, and supplementary data are available at http://bit.ly/1aURxkc....

  5. Targeted amplicon sequencing (TAS): a scalable next-gen approach to multilocus, multitaxa phylogenetics.

    Science.gov (United States)

    Bybee, Seth M; Bracken-Grissom, Heather; Haynes, Benjamin D; Hermansen, Russell A; Byers, Robert L; Clement, Mark J; Udall, Joshua A; Wilcox, Edward R; Crandall, Keith A

    2011-01-01

    Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time-nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach.

  6. mtDNA sequence diversity of Hazara ethnic group from Pakistan.

    Science.gov (United States)

    Rakha, Allah; Fatima; Peng, Min-Sheng; Adan, Atif; Bi, Rui; Yasmin, Memona; Yao, Yong-Gang

    2017-09-01

    The present study was undertaken to investigate mitochondrial DNA (mtDNA) control region sequences of Hazaras from Pakistan, so as to generate mtDNA reference database for forensic casework in Pakistan and to analyze phylogenetic relationship of this particular ethnic group with geographically proximal populations. Complete mtDNA control region (nt 16024-576) sequences were generated through Sanger Sequencing for 319 Hazara individuals from Quetta, Baluchistan. The population sample set showed a total of 189 distinct haplotypes, belonging mainly to West Eurasian (51.72%), East & Southeast Asian (29.78%) and South Asian (18.50%) haplogroups. Compared with other populations from Pakistan, the Hazara population had a relatively high haplotype diversity (0.9945) and a lower random match probability (0.0085). The dataset has been incorporated into EMPOP database under accession number EMP00680. The data herein comprises the largest, and likely most thoroughly examined, control region mtDNA dataset from Hazaras of Pakistan. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Ankara National Textiles [Turkish] Incorporated Company (1916-1930

    Directory of Open Access Journals (Sweden)

    M. Bülent Varlık

    2014-06-01

    Full Text Available This study aims to provide brief information on the foundation, activities and closing down of Ankara National Textiles Incorporated Company, which was founded by a group of local tradesmen in Ankara within the scope of the “National Economy Policy” implemented by the Union and Progress Association. The aforementioned local tradesmen undertook the first steps to establish a factory in mid-1913, and as a result of these efforts, the aforementioned incorporated Company was founded in 1916. This Company, probably due to a low quality production, was closed down, possibly around the years 1921-1922. In 1925, a reconstruction process was initiated with the intervention of the politics and bureaucracy of Ankara, and the capital of the Company was increased. However, these attempts failed and finally, the Company was closed down in 1928. The factory was acquired by Türkiye İş Bankası in 1930, and after the endeavors, it continued its activities as a new company under the name Yün-İş.

  8. High quality draft genome sequence of Staphylococcus cohnii subsp. cohnii strain hu-01

    OpenAIRE

    Hu, XinJun; Li, Ang; Lv, LongXian; Yuan, Chunhui; Guo, Lihua; Jiang, Xiawei; Jiang, Haiyin; Qian, GuiRong; Zheng, BeiWen; Guo, Jing; Li, LanJuan

    2014-01-01

    Staphylococcus cohnii subsp. cohnii belongs to the family Staphylococcaceae in the order Bacillales , class Bacilli and phylum Firmicutes . The increasing relevance of S. cohnii to human health prompted us to determine the genomic sequence of Staphylococcus cohnii subsp. cohnii strain hu-01, a multidrug-resistant isolate from a hospital in China. Here we describe the features of S. cohnii subsp. cohnii strain hu-01, together with the genome sequence and its annotation. This is the first genom...

  9. The sequence coding and search system: an approach for constructing and analyzing event sequences at commercial nuclear power plants

    International Nuclear Information System (INIS)

    Mays, G.T.

    1990-01-01

    The U.S. Nuclear Regulatory Commission (NRC) has recognized the importance of the collection, assessment, and feedback of operating experience data from commercial nuclear power plants and has centralized these activities in the Office for Analysis and Evaluation of Operational Data (AEOD). Such data is essential for performing safety and reliability analyses, especially analyses of trends and patterns to identify undesirable changes in plant performance at the earliest opportunity to implement corrective measures to preclude the occurrence of a more serious event. One of NRC's principal tools for collecting and evaluating operating experience data is the Sequence Coding and Search System (SCSS). The SCSS consists of a methodology for structuring event sequences and the requisite computer system to store and search the data. The source information for SCSS is the Licensee Event Report (LER), which is a legally required document. This paper describes the objectives of SCSS, the information it contains, and the format and approach for constructing SCSS event sequences. Examples are presented demonstrating the use of SCSS to support the analysis of LER data. The SCSS contains over 30,000 LERs describing events from 1980 through the present. Insights gained from working with a complex data system from the initial developmental stage to the point of a mature operating system are highlighted. Considerable experience has been gained in the areas of evolving and changing data requirements, staffing requirements, and quality control and quality assurance procedures for addressing consistency, software/hardware considerations for developing and maintaining a complex system, documentation requirements, and end-user needs. Two other approaches for constructing and evaluating event sequences are examined including the Accident Precursor Program (ASP) where sequences having the potential for core damage are identified and analyzed, and the Significant Event Compilation Tree

  10. Incorporating DNA sequencing into current prenatal screening practice for Down's syndrome.

    Directory of Open Access Journals (Sweden)

    Nicholas J Wald

    Full Text Available BACKGROUND: Prenatal screening for Down's syndrome is performed using biochemical and ultrasound markers measured in early pregnancy such as the Integrated test using first and second trimester markers. Recently, DNA sequencing methods have been introduced on free DNA in maternal plasma, yielding a high screening performance. These methods are expensive and there is a test failure rate. We determined the screening performance of merging the Integrated test with the newer DNA techniques in a protocol that substantially reduces the cost compared with universal DNA testing and still achieves high screening performance with no test failures. METHODS: Published data were used to model screening performance of a protocol in which all women receive the first stage of the Integrated test at about 11 weeks of pregnancy. On the basis of this higher risk women have reflex DNA testing and lower risk women as well as those with a failed DNA test complete the Integrated test at about 15 weeks. RESULTS: The overall detection rate was 95% with a 0.1% false-positive rate if 20% of women were selected to receive DNA testing. If all women had DNA testing the detection rate would be 3 to 4 percentage points higher with a false-positive rate 30 times greater if women with failed tests were treated as positive and offered a diagnostic amniocentesis, or 3 times greater if they had a second trimester screening test (Quadruple test and treated as positive only if this were positive. The cost per women screened would be about one-fifth, compared with universal DNA testing, if the DNA test were 20 times the cost of the Integrated test. CONCLUSION: The proposed screening protocol achieves a high screening performance without programme test failures and at a substantially lower cost than offering all women DNA testing.

  11. Comparing an accelerated 3D fast spin-echo sequence (CS-SPACE) for knee 3-T magnetic resonance imaging with traditional 3D fast spin-echo (SPACE) and routine 2D sequences

    Energy Technology Data Exchange (ETDEWEB)

    Altahawi, Faysal F.; Blount, Kevin J.; Omar, Imran M. [Northwestern University Feinberg School of Medicine, Department of Radiology, Chicago, IL (United States); Morley, Nicholas P. [Marshfield Clinic, Department of Radiology, Marshfield, WI (United States); Raithel, Esther [Siemens Healthcare GmbH, Erlangen (Germany)

    2017-01-15

    To compare a faster, new, high-resolution accelerated 3D-fast-spin-echo (3D-FSE) acquisition sequence (CS-SPACE) to traditional 2D and high-resolution 3D sequences for knee 3-T magnetic resonance imaging (MRI). Twenty patients received knee MRIs that included routine 2D (T1, PD ± FS, T2-FS; 0.5 x 0.5 x 3 mm{sup 3}; ∝10 min), traditional 3D FSE (SPACE-PD-FS; 0.5 x 0.5 x 0.5 mm{sup 3}; ∝7.5 min), and accelerated 3D-FSE prototype (CS-SPACE-PD-FS; 0.5 x 0.5 x 0.5 mm{sup 3}; ∝5 min) acquisitions on a 3-T MRI system (Siemens MAGNETOM Skyra). Three musculoskeletal radiologists (MSKRs) prospectively and independently reviewed the studies with graded surveys comparing image and diagnostic quality. Tissue-specific signal-to-noise ratios (SNR) and contrast-to-noise ratios (CNR) were also compared. MSKR-perceived diagnostic quality of cartilage was significantly higher for CS-SPACE than for SPACE and 2D sequences (p < 0.001). Assessment of diagnostic quality of menisci and synovial fluid was higher for CS-SPACE than for SPACE (p < 0.001). CS-SPACE was not significantly different from SPACE but had lower assessments than 2D sequences for evaluation of bones, ligaments, muscles, and fat (p ≤ 0.004). 3D sequences had higher spatial resolution, but lower overall assessed contrast (p < 0.001). Overall image quality from CS-SPACE was assessed as higher than SPACE (p = 0.007), but lower than 2D sequences (p < 0.001). Compared to SPACE, CS-SPACE had higher fluid SNR and CNR against all other tissues (all p < 0.001). The CS-SPACE prototype allows for faster isotropic acquisitions of knee MRIs over currently used protocols. High fluid-to-cartilage CNR and higher spatial resolution over routine 2D sequences may present a valuable role for CS-SPACE in the evaluation of cartilage and menisci. (orig.)

  12. Improvement of remote monitoring on water quality in a subtropical reservoir by incorporating grammatical evolution with parallel genetic algorithms into satellite imagery.

    Science.gov (United States)

    Chen, Li; Tan, Chih-Hung; Kao, Shuh-Ji; Wang, Tai-Sheng

    2008-01-01

    Parallel GEGA was constructed by incorporating grammatical evolution (GE) into the parallel genetic algorithm (GA) to improve reservoir water quality monitoring based on remote sensing images. A cruise was conducted to ground-truth chlorophyll-a (Chl-a) concentration longitudinally along the Feitsui Reservoir, the primary water supply for Taipei City in Taiwan. Empirical functions with multiple spectral parameters from the Landsat 7 Enhanced Thematic Mapper (ETM+) data were constructed. The GE, an evolutionary automatic programming type system, automatically discovers complex nonlinear mathematical relationships among observed Chl-a concentrations and remote-sensed imageries. A GA was used afterward with GE to optimize the appropriate function type. Various parallel subpopulations were processed to enhance search efficiency during the optimization procedure with GA. Compared with a traditional linear multiple regression (LMR), the performance of parallel GEGA was found to be better than that of the traditional LMR model with lower estimating errors.

  13. Evaluation of finger millet incorporated noodles for nutritive value and glycemic index.

    Science.gov (United States)

    Shukla, Kamini; Srivastava, Sarita

    2014-03-01

    The present study was undertaken to develop finger millet incorporated noodles for diabetic patients. Finger millet variety VL-149 was taken. The finger millet flour and refined wheat flour (RWF) were evaluated for nutrient composition. The finger millet flour (FMF) was blended in various proportions (30 to 50%) in refined wheat flour and used for the preparation of noodles. Control consisted of RWF noodles. Sensory quality and nutrient composition of finger millet noodles was evaluated. The 30% finger millet incorporated noodles were selected best on the basis of sensory evaluation. Noodles in that proportion along with control were evaluated for glycemic response. Nutrient composition of noodles showed that 50% finger millet incorporated noodles contained highest amount of crude fat (1.15%), total ash (1.40%), crude fiber (1.28%), carbohydrate (78.54%), physiological energy (351.36 kcal), insoluble dietary fiber (5.45%), soluble dietary fiber (3.71%), iron (5.58%) and calcium (88.39%), respectively. However, control RWF noodles contained highest amount of starch (63.02%), amylose (8.72%) and amylopectin (54.29%). The glycemic index (GI) of 30% finger millet incorporated noodles (best selected by sensory evaluation) was observed significantly lower (45.13) than control noodles (62.59). It was found that finger millet flour incorporated noodles were found nutritious and showed hypoglycemic effect.

  14. Sequence and comparative analysis of Leuconostoc dairy bacteriophages

    DEFF Research Database (Denmark)

    Kot, Witold; Hansen, Lars Henrik; Neve, Horst

    2014-01-01

    Bacteriophages attacking Leuconostoc species may significantly influence the quality of the final product. There is however limited knowledge of this group of phages in the literature. We have determined the complete genome sequences of nine Leuconostoc bacteriophages virulent to either Leuconostoc...

  15. Functional role of a highly repetitive DNA sequence in anchorage of the mouse genome.

    Science.gov (United States)

    Neuer-Nitsche, B; Lu, X N; Werner, D

    1988-09-12

    The major portion of the eukaryotic genome consists of various categories of repetitive DNA sequences which have been studied with respect to their base compositions, organizations, copy numbers, transcription and species specificities; their biological roles, however, are still unclear. A novel quality of a highly repetitive mouse DNA sequence is described which points to a functional role: All copies (approximately 50,000 per haploid genome) of this DNA sequence reside on genomic Alu I DNA fragments each associated with nuclear polypeptides that are not released from DNA by proteinase K, SDS and phenol extraction. By this quality the repetitive DNA sequence is classified as a member of the sub-set of DNA sequences involved in tight DNA-polypeptide complexes which have been previously shown to be components of the subnuclear structure termed 'nuclear matrix'. From these results it has to be concluded that the repetitive DNA sequence characterized in this report represents or comprises a signal for a large number of site specific attachment points of the mouse genome in the nuclear matrix.

  16. Detecting false positive sequence homology: a machine learning approach.

    Science.gov (United States)

    Fujimoto, M Stanley; Suvorov, Anton; Jensen, Nicholas O; Clement, Mark J; Bybee, Seth M

    2016-02-24

    Accurate detection of homologous relationships of biological sequences (DNA or amino acid) amongst organisms is an important and often difficult task that is essential to various evolutionary studies, ranging from building phylogenies to predicting functional gene annotations. There are many existing heuristic tools, most commonly based on bidirectional BLAST searches that are used to identify homologous genes and combine them into two fundamentally distinct classes: orthologs and paralogs. Due to only using heuristic filtering based on significance score cutoffs and having no cluster post-processing tools available, these methods can often produce multiple clusters constituting unrelated (non-homologous) sequences. Therefore sequencing data extracted from incomplete genome/transcriptome assemblies originated from low coverage sequencing or produced by de novo processes without a reference genome are susceptible to high false positive rates of homology detection. In this paper we develop biologically informative features that can be extracted from multiple sequence alignments of putative homologous genes (orthologs and paralogs) and further utilized in context of guided experimentation to verify false positive outcomes. We demonstrate that our machine learning method trained on both known homology clusters obtained from OrthoDB and randomly generated sequence alignments (non-homologs), successfully determines apparent false positives inferred by heuristic algorithms especially among proteomes recovered from low-coverage RNA-seq data. Almost ~42 % and ~25 % of predicted putative homologies by InParanoid and HaMStR respectively were classified as false positives on experimental data set. Our process increases the quality of output from other clustering algorithms by providing a novel post-processing method that is both fast and efficient at removing low quality clusters of putative homologous genes recovered by heuristic-based approaches.

  17. Distributed representations of action sequences in anterior cingulate cortex: A recurrent neural network approach.

    Science.gov (United States)

    Shahnazian, Danesh; Holroyd, Clay B

    2018-02-01

    Anterior cingulate cortex (ACC) has been the subject of intense debate over the past 2 decades, but its specific computational function remains controversial. Here we present a simple computational model of ACC that incorporates distributed representations across a network of interconnected processing units. Based on the proposal that ACC is concerned with the execution of extended, goal-directed action sequences, we trained a recurrent neural network to predict each successive step of several sequences associated with multiple tasks. In keeping with neurophysiological observations from nonhuman animals, the network yields distributed patterns of activity across ACC neurons that track the progression of each sequence, and in keeping with human neuroimaging data, the network produces discrepancy signals when any step of the sequence deviates from the predicted step. These simulations illustrate a novel approach for investigating ACC function.

  18. Evaluation of various physico-chemical properties of Hibiscus sabdariffa and L. casei incorporated probiotic yoghurt.

    Science.gov (United States)

    Rasdhari, M; Parekh, T; Dave, N; Patel, V; Subhash, R

    2008-09-01

    The present investigation was carried out to examine the effect of Hibiscus sabdariffa Calyx extract on the physico-chemical properties, sensory attributes, texture and microbial analysis of L. casei incorporated in probiotic yoghurt after manufacture and during storage. Incorporation of Hibiscus sabdariffa Calyx extract into the probiotic yoghurt resulted into decrease in coagulation time by 25 min. The pH ranged from 4.39 to 4.59, TA 0.81 to 1.14%, moisture 3.05 to 3.37 g%, syneresis 18.85 to 24.90 mL/50 g of sample, % inhibition 12.32 to 59.43, TS 21.27 to 24.90 g% and beta-galactosidase activity 1.041 to 3.277. The protein content ranged between 4.11 and 4.14 g% while the fat content ranged between 3.43 and 3.49 g%. No major changes in sensory evaluation were observed on the day of manufacture and during storage for 7 days. Sabdariffa added yoghurt showed a higher score in almost all sensory attributes. Microbial analysis showed a total plate count ranging from 1.8 x 10(4) to 1.85 x 10(7) cfu mL(-1). Yeast and mold counts were negligible in the Sabdariffa yoghurts. Thus the study concludes that incorporation of Hibiscus sabdariffa extract in yoghurt improved the total antioxidant property, organoleptic qualities and decreased the exudation of whey proteins (Syneresis). Thus, Hibiscus sabdariffa Calyces has beneficial influence on the quality of L. casei incorporated probiotic yoghurt.

  19. DNA Sequencing as a Tool to Monitor Marine Ecological Status

    Directory of Open Access Journals (Sweden)

    Kelly D. Goodwin

    2017-05-01

    Full Text Available Many ocean policies mandate integrated, ecosystem-based approaches to marine monitoring, driving a global need for efficient, low-cost bioindicators of marine ecological quality. Most traditional methods to assess biological quality rely on specialized expertise to provide visual identification of a limited set of specific taxonomic groups, a time-consuming process that can provide a narrow view of ecological status. In addition, microbial assemblages drive food webs but are not amenable to visual inspection and thus are largely excluded from detailed inventory. Molecular-based assessments of biodiversity and ecosystem function offer advantages over traditional methods and are increasingly being generated for a suite of taxa using a “microbes to mammals” or “barcodes to biomes” approach. Progress in these efforts coupled with continued improvements in high-throughput sequencing and bioinformatics pave the way for sequence data to be employed in formal integrated ecosystem evaluation, including food web assessments, as called for in the European Union Marine Strategy Framework Directive. DNA sequencing of bioindicators, both traditional (e.g., benthic macroinvertebrates, ichthyoplankton and emerging (e.g., microbial assemblages, fish via eDNA, promises to improve assessment of marine biological quality by increasing the breadth, depth, and throughput of information and by reducing costs and reliance on specialized taxonomic expertise.

  20. Multispacer sequence typing relapsing fever Borreliae in Africa.

    Directory of Open Access Journals (Sweden)

    Haitham Elbir

    Full Text Available BACKGROUND: In Africa, relapsing fevers are neglected arthropod-borne infections caused by closely related Borrelia species. They cause mild to deadly undifferentiated fever particularly severe in pregnant women. Lack of a tool to genotype these Borrelia organisms limits knowledge regarding their reservoirs and their epidemiology. METHODOLOGY/PRINCIPAL FINDINGS: Genome sequence analysis of Borrelia crocidurae, Borrelia duttonii and Borrelia recurrentis yielded 5 intergenic spacers scattered between 10 chromosomal genes that were incorporated into a multispacer sequence typing (MST approach. Sequencing these spacers directly from human blood specimens previously found to be infected by B. recurrentis (30 specimens, B. duttonii (17 specimens and B. crocidurae (13 specimens resolved these 60 strains and the 3 type strains into 13 species-specific spacer types in the presence of negative controls. B. crocidurae comprised of 8 spacer types, B. duttonii of 3 spacer types and B. recurrentis of 2 spacer types. CONCLUSIONS/SIGNIFICANCE: Phylogenetic analyses of MST data suggested that B. duttonii, B. crocidurae and B. recurrentis are variants of a unique ancestral Borrelia species. MST proved to be a suitable approach for identifying and genotyping relapsing fever borreliae in Africa. It could be applied to both vectors and clinical specimens.

  1. MR imaging of the knee: Improvement of signal and contrast efficiency of T1-weighted turbo spin echo sequences by applying a driven equilibrium (DRIVE) pulse

    Energy Technology Data Exchange (ETDEWEB)

    Radlbauer, Rudolf, E-mail: rudolf.radlbauer@stpoelten.lknoe.a [MR Physics Group, Department of Radiology, Landesklinikum St. Poelten, Propst Fuehrer Strasse 4, 3100 St. Poelten (Austria); Lomoschitz, Friedrich, E-mail: friedrich.lomoschitz@stpoelten.lknoe.a [MR Physics Group, Department of Radiology, Landesklinikum St. Poelten, Propst Fuehrer Strasse 4, 3100 St. Poelten (Austria); Salomonowitz, Erich, E-mail: erich.salomonowitz@stpoelten.lknoe.a [MR Physics Group, Department of Radiology, Landesklinikum St. Poelten, Propst Fuehrer Strasse 4, 3100 St. Poelten (Austria); Eberhardt, Knut E., E-mail: info@mrt-kompetenzzentrum.d [MRT Competence Center Schloss Werneck, Balthasar-Neumann-Platz 2, 97440 Werneck (Germany); Stadlbauer, Andreas, E-mail: andi@nmr.a [MR Physics Group, Department of Radiology, Landesklinikum St. Poelten, Propst Fuehrer Strasse 4, 3100 St. Poelten (Austria); Department of Neurosurgery, University of Erlangen-Nuremberg, Schwabachanlage 6, 91054 Erlangen (Germany)

    2010-08-15

    The purpose of this study was to assess the effect of a driven equilibrium (DRIVE) pulse incorporated in a standard T1-weighted turbo spin echo (TSE) sequence as used in our routine MRI protocol for examination of pathologies of the knee. Sixteen consecutive patients with knee disorders were examined using the routine MRI protocol, including T1-weighted TSE-sequences with and without a DRIVE pulse. Signal-to-noise ratios (SNRs) and contrast-to-noise ratio (CNR) of anatomical structures and pathologies were calculated and compared for both sequences. The differences in diagnostic value of the T1-weighted images with and without DRIVE pulse were assessed. SNR was significantly higher on images acquired with DRIVE pulse for fluid, effusion, cartilage and bone. Differences in the SNR of meniscus and muscle between the two sequences were not statistically significant. CNR was significantly increased between muscle and effusion, fluid and cartilage, fluid and meniscus, cartilage and meniscus, bone and cartilage on images acquired using the DRIVE pulse. Diagnostic value of the T1-weighted images was found to be improved for delineation of anatomic structures and for diagnosing a variety of pathologies when a DRIVE pulse is incorporated in the sequence. Incorporation of a DRIVE pulse into a standard T1-weighted TSE-sequence leads to significant increase of SNR and CNR of both, anatomical structures and pathologies, and consequently to an increase in diagnostic value within the same acquisition time.

  2. MR imaging of the knee: Improvement of signal and contrast efficiency of T1-weighted turbo spin echo sequences by applying a driven equilibrium (DRIVE) pulse

    International Nuclear Information System (INIS)

    Radlbauer, Rudolf; Lomoschitz, Friedrich; Salomonowitz, Erich; Eberhardt, Knut E.; Stadlbauer, Andreas

    2010-01-01

    The purpose of this study was to assess the effect of a driven equilibrium (DRIVE) pulse incorporated in a standard T1-weighted turbo spin echo (TSE) sequence as used in our routine MRI protocol for examination of pathologies of the knee. Sixteen consecutive patients with knee disorders were examined using the routine MRI protocol, including T1-weighted TSE-sequences with and without a DRIVE pulse. Signal-to-noise ratios (SNRs) and contrast-to-noise ratio (CNR) of anatomical structures and pathologies were calculated and compared for both sequences. The differences in diagnostic value of the T1-weighted images with and without DRIVE pulse were assessed. SNR was significantly higher on images acquired with DRIVE pulse for fluid, effusion, cartilage and bone. Differences in the SNR of meniscus and muscle between the two sequences were not statistically significant. CNR was significantly increased between muscle and effusion, fluid and cartilage, fluid and meniscus, cartilage and meniscus, bone and cartilage on images acquired using the DRIVE pulse. Diagnostic value of the T1-weighted images was found to be improved for delineation of anatomic structures and for diagnosing a variety of pathologies when a DRIVE pulse is incorporated in the sequence. Incorporation of a DRIVE pulse into a standard T1-weighted TSE-sequence leads to significant increase of SNR and CNR of both, anatomical structures and pathologies, and consequently to an increase in diagnostic value within the same acquisition time.

  3. Measuring quality of care: considering conceptual approaches to quality indicator development and evaluation.

    Science.gov (United States)

    Stelfox, Henry T; Straus, Sharon E

    2013-12-01

    In this article, we describe one approach for developing and evaluating quality indicators. We focus on describing different conceptual approaches to quality indicator development, review one approach for developing quality indicators, outline how to evaluate quality indicators once developed, and discuss quality indicator maintenance. The key steps for developing quality indicators include specifying a clear goal for the indicators; using methodologies to incorporate evidence, expertise, and patient perspectives; and considering contextual factors and logistics of implementation. The Strategic Framework Board and the National Quality Measure Clearinghouse have developed criteria for evaluating quality indicators that complement traditional psychometric evaluations. Optimal strategies for quality indicator maintenance and dissemination have not been determined, but experiences with clinical guideline maintenance may be informative. For quality indicators to effectively guide quality improvement efforts, they must be developed, evaluated, maintained, and implemented using rigorous evidence-informed practices. Copyright © 2013 Elsevier Inc. All rights reserved.

  4. SMITH: a LIMS for handling next-generation sequencing workflows.

    Science.gov (United States)

    Venco, Francesco; Vaskin, Yuriy; Ceol, Arnaud; Muller, Heiko

    2014-01-01

    Life-science laboratories make increasing use of Next Generation Sequencing (NGS) for studying bio-macromolecules and their interactions. Array-based methods for measuring gene expression or protein-DNA interactions are being replaced by RNA-Seq and ChIP-Seq. Sequencing is generally performed by specialized facilities that have to keep track of sequencing requests, trace samples, ensure quality and make data available according to predefined privileges. An integrated tool helps to troubleshoot problems, to maintain a high quality standard, to reduce time and costs. Commercial and non-commercial tools called LIMS (Laboratory Information Management Systems) are available for this purpose. However, they often come at prohibitive cost and/or lack the flexibility and scalability needed to adjust seamlessly to the frequently changing protocols employed. In order to manage the flow of sequencing data produced at the Genomic Unit of the Italian Institute of Technology (IIT), we developed SMITH (Sequencing Machine Information Tracking and Handling). SMITH is a web application with a MySQL server at the backend. Wet-lab scientists of the Centre for Genomic Science and database experts from the Politecnico of Milan in the context of a Genomic Data Model Project developed SMITH. The data base schema stores all the information of an NGS experiment, including the descriptions of all protocols and algorithms used in the process. Notably, an attribute-value table allows associating an unconstrained textual description to each sample and all the data produced afterwards. This method permits the creation of metadata that can be used to search the database for specific files as well as for statistical analyses. SMITH runs automatically and limits direct human interaction mainly to administrative tasks. SMITH data-delivery procedures were standardized making it easier for biologists and analysts to navigate the data. Automation also helps saving time. The workflows are available

  5. SMITH: a LIMS for handling next-generation sequencing workflows

    Science.gov (United States)

    2014-01-01

    Background Life-science laboratories make increasing use of Next Generation Sequencing (NGS) for studying bio-macromolecules and their interactions. Array-based methods for measuring gene expression or protein-DNA interactions are being replaced by RNA-Seq and ChIP-Seq. Sequencing is generally performed by specialized facilities that have to keep track of sequencing requests, trace samples, ensure quality and make data available according to predefined privileges. An integrated tool helps to troubleshoot problems, to maintain a high quality standard, to reduce time and costs. Commercial and non-commercial tools called LIMS (Laboratory Information Management Systems) are available for this purpose. However, they often come at prohibitive cost and/or lack the flexibility and scalability needed to adjust seamlessly to the frequently changing protocols employed. In order to manage the flow of sequencing data produced at the Genomic Unit of the Italian Institute of Technology (IIT), we developed SMITH (Sequencing Machine Information Tracking and Handling). Methods SMITH is a web application with a MySQL server at the backend. Wet-lab scientists of the Centre for Genomic Science and database experts from the Politecnico of Milan in the context of a Genomic Data Model Project developed SMITH. The data base schema stores all the information of an NGS experiment, including the descriptions of all protocols and algorithms used in the process. Notably, an attribute-value table allows associating an unconstrained textual description to each sample and all the data produced afterwards. This method permits the creation of metadata that can be used to search the database for specific files as well as for statistical analyses. Results SMITH runs automatically and limits direct human interaction mainly to administrative tasks. SMITH data-delivery procedures were standardized making it easier for biologists and analysts to navigate the data. Automation also helps saving time. The

  6. [Study on quality evaluation of sequence and SSR information in transcriptome of Astragalus membranacus].

    Science.gov (United States)

    Chang, Yue; Yang, Song; Liu, Zhen-Peng; Ren, Wei-Chao; Liu, Jie; Ma, Wei

    2016-04-01

    In this study, 454/Roche GS FLX sequencing technology was used to obtain the data of the Astragalus membranaceus. Four hundred and fifty-four Sequencing System Software was applied to carry out the transcription of the group from scratch. Using MISA tools, 9 893 unigenes were selected for the sequence of the genome of A. membranaceus, and the information of SSR locus was analyzed. According to the result, the average length of reads was 413 bp, about 86% of the reads was involved in the splicing, the length of the N50 was 1 205 bp, the number of unigenes was measured by the whole transcript. 1 729 SSR loci in the A. membranaceus transcriptome were searched, the occurrence frequency of SSR was 9.24%, the frequency of SSR in the whole transcriptome was 13.42%, the average length of SSR was 7.97 kb. One hundred and twenty-seven kinds of core repeat sequences were found, the dominant type was TG/AC type of dinucleotide, it appeared to account for 4.25% of the total SSR locus. The results of the sequence of the transcription of the A. membranaceus transcriptome revealed the overall expression, and a large number of unigenessequence was obtained, and the SSR locus in the genome of the A. membranaceus is high, and the type is diverse, and the polymorphism of the gene is high. Copyright© by the Chinese Pharmaceutical Association.

  7. Parallel Monte Carlo Particle Transport and the Quality of Random Number Generators: How Good is Good Enough?

    International Nuclear Information System (INIS)

    Procassini, R J; Beck, B R

    2004-01-01

    It might be assumed that use of a ''high-quality'' random number generator (RNG), producing a sequence of ''pseudo random'' numbers with a ''long'' repetition period, is crucial for producing unbiased results in Monte Carlo particle transport simulations. While several theoretical and empirical tests have been devised to check the quality (randomness and period) of an RNG, for many applications it is not clear what level of RNG quality is required to produce unbiased results. This paper explores the issue of RNG quality in the context of parallel, Monte Carlo transport simulations in order to determine how ''good'' is ''good enough''. This study employs the MERCURY Monte Carlo code, which incorporates the CNPRNG library for the generation of pseudo-random numbers via linear congruential generator (LCG) algorithms. The paper outlines the usage of random numbers during parallel MERCURY simulations, and then describes the source and criticality transport simulations which comprise the empirical basis of this study. A series of calculations for each test problem in which the quality of the RNG (period of the LCG) is varied provides the empirical basis for determining the minimum repetition period which may be employed without producing a bias in the mean integrated results

  8. Peptide de novo sequencing of mixture tandem mass spectra

    DEFF Research Database (Denmark)

    Gorshkov, Vladimir; Hotta, Stéphanie Yuki Kolbeck; Braga, Thiago Verano

    2016-01-01

    they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co-isolation and thus prone to false identifications. The deconvolution approach matched...... complementary b-, y-ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co-isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced...... peptides. The improvement was in the range of 20–35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight...

  9. Reference genome sequence of the model plant Setaria.

    Science.gov (United States)

    Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M

    2012-05-13

    We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).

  10. Reference genome sequence of the model plant Setaria

    Energy Technology Data Exchange (ETDEWEB)

    Bennetzen, Jeffrey L [ORNL; Schmutz, Jeremy [Hudson Alpha Institute of Biotechnology; Wang, Hao [University of Georgia, Athens, GA; Percifield, Ryan [University of Georgia, Athens, GA; Hawkins, Jennifer [University of Georgia, Athens, GA; Pontaroli, Ana C. [University of Georgia, Athens, GA; Estep, Matt [University of Georgia, Athens, GA; Feng, Liang [University of Georgia, Athens, GA; Vaughn, Justin N [ORNL; Grimwood, Jane [Hudson Alpha Institute of Biotechnology; Jenkins, Jerry [Hudson Alpha Institute of Biotechnology; Barry, Kerrie [U.S. Department of Energy, Joint Genome Institute; Lindquist, Erika [U.S. Department of Energy, Joint Genome Institute; Hellsten, Uffe [U.S. Department of Energy, Joint Genome Institute; Deshpande, Shweta [U.S. Department of Energy, Joint Genome Institute; Wang, Xuewen [University of Georgia, Athens, GA; Wu, Xiaomei [University of Georgia, Athens, GA; Mitros, Therese [University of California, Berkeley; Triplett, Jimmy [University of Missouri, St. Louis; Yang, Xiaohan [ORNL; Ye, Chuyu [ORNL; Mauro-Herrera, Margarita [Oklahoma State University; Wang, Lin [Cornell University; Li, Pinghua [Cornell University; Sharma, Manoj [University of California, Davis; Sharma, Rita [University of California, Davis; Ronald, Pamela [University of California, Davis; Panaud, Olivier [Universite de Perpignan, Perpignan, France; Kellogg, Elizabeth A. [University of Missouri, St. Louis; Brutnell, Thomas P. [Cornell University; Doust, Andrew N. [Oklahoma State University; Tuskan, Gerald A [ORNL; Rokhsar, Daniel [U.S. Department of Energy, Joint Genome Institute; Devos, Katrien M [ORNL

    2012-01-01

    We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).

  11. Reference genome sequence of the model plant Setaria

    Energy Technology Data Exchange (ETDEWEB)

    Bennetzen, Jeffrey L [ORNL; Yang, Xiaohan [ORNL; Ye, Chuyu [ORNL; Tuskan, Gerald A [ORNL

    2012-01-01

    We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The {approx}400-Mb assembly covers {approx}80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).

  12. Estimating mutation parameters, population history and genealogy simultaneously from temporally spaced sequence data.

    Science.gov (United States)

    Drummond, Alexei J; Nicholls, Geoff K; Rodrigo, Allen G; Solomon, Wiremu

    2002-07-01

    Molecular sequences obtained at different sampling times from populations of rapidly evolving pathogens and from ancient subfossil and fossil sources are increasingly available with modern sequencing technology. Here, we present a Bayesian statistical inference approach to the joint estimation of mutation rate and population size that incorporates the uncertainty in the genealogy of such temporally spaced sequences by using Markov chain Monte Carlo (MCMC) integration. The Kingman coalescent model is used to describe the time structure of the ancestral tree. We recover information about the unknown true ancestral coalescent tree, population size, and the overall mutation rate from temporally spaced data, that is, from nucleotide sequences gathered at different times, from different individuals, in an evolving haploid population. We briefly discuss the methodological implications and show what can be inferred, in various practically relevant states of prior knowledge. We develop extensions for exponentially growing population size and joint estimation of substitution model parameters. We illustrate some of the important features of this approach on a genealogy of HIV-1 envelope (env) partial sequences.

  13. From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

    Science.gov (United States)

    Kwok, Hin; Chiang, Alan Kwok Shing

    2016-02-24

    Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, increasing number of EBV genomes has been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.

  14. From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes

    Directory of Open Access Journals (Sweden)

    Hin Kwok

    2016-02-01

    Full Text Available Genomic sequences of Epstein–Barr virus (EBV have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS and target enrichment strategies, increasing number of EBV genomes has been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.

  15. Development and evaluation of a panel of filovirus sequence capture probes for pathogen detection by next-generation sequencing.

    Directory of Open Access Journals (Sweden)

    Jeffrey W Koehler

    Full Text Available A detailed understanding of the circulating pathogens in a particular geographic location aids in effectively utilizing targeted, rapid diagnostic assays, thus allowing for appropriate therapeutic and containment procedures. This is especially important in regions prevalent for highly pathogenic viruses co-circulating with other endemic pathogens such as the malaria parasite. The importance of biosurveillance is highlighted by the ongoing Ebola virus disease outbreak in West Africa. For example, a more comprehensive assessment of the regional pathogens could have identified the risk of a filovirus disease outbreak earlier and led to an improved diagnostic and response capacity in the region. In this context, being able to rapidly screen a single sample for multiple pathogens in a single tube reaction could improve both diagnostics as well as pathogen surveillance. Here, probes were designed to capture identifying filovirus sequence for the ebolaviruses Sudan, Ebola, Reston, Taï Forest, and Bundibugyo and the Marburg virus variants Musoke, Ci67, and Angola. These probes were combined into a single probe panel, and the captured filovirus sequence was successfully identified using the MiSeq next-generation sequencing platform. This panel was then used to identify the specific filovirus from nonhuman primates experimentally infected with Ebola virus as well as Bundibugyo virus in human sera samples from the Democratic Republic of the Congo, thus demonstrating the utility for pathogen detection using clinical samples. While not as sensitive and rapid as real-time PCR, this panel, along with incorporating additional sequence capture probe panels, could be used for broad pathogen screening and biosurveillance.

  16. Incorporation of conjugated linoleic acid (CLA and α-linolenic acid (LNA in pacu fillets

    Directory of Open Access Journals (Sweden)

    Deoclécio José Barilli

    2014-03-01

    Full Text Available The objective of this study was to evaluate the incorporation of conjugated linoleic acid and α-linolenic acid in fillets of pacu fish raised in net cages and fed diets enriched with these acids. The fish were fed for 49 days, and at the end of this period the fatty acid content in the fillets was determined by gas chromatography. Concentrations of α-linolenic acid, eicosapentaenoic acid, and the total omega-3 (n-3 fatty acid in the fillets increased, improving the n-6/n-3 ratio. In addition, the incorporation of conjugated linoleic acid in the fish fillets proved well established. This study showed that the use of diets enriched with conjugated linoleic acid and α-linolenic acid results in the incorporation of these acids in the of pacu fish fillets, improving their nutritional quality.

  17. ConDeTri--a content dependent read trimmer for Illumina data.

    Directory of Open Access Journals (Sweden)

    Linnéa Smeds

    Full Text Available UNLABELLED: During the last few years, DNA and RNA sequencing have started to play an increasingly important role in biological and medical applications, especially due to the greater amount of sequencing data yielded from the new sequencing machines and the enormous decrease in sequencing costs. Particularly, Illumina/Solexa sequencing has had an increasing impact on gathering data from model and non-model organisms. However, accurate and easy to use tools for quality filtering have not yet been established. We present ConDeTri, a method for content dependent read trimming for next generation sequencing data using quality scores of each individual base. The main focus of the method is to remove sequencing errors from reads so that sequencing reads can be standardized. Another aspect of the method is to incorporate read trimming in next-generation sequencing data processing and analysis pipelines. It can process single-end and paired-end sequence data of arbitrary length and it is independent from sequencing coverage and user interaction. ConDeTri is able to trim and remove reads with low quality scores to save computational time and memory usage during de novo assemblies. Low coverage or large genome sequencing projects will especially gain from trimming reads. The method can easily be incorporated into preprocessing and analysis pipelines for Illumina data. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://code.google.com/p/condetri.

  18. Foundations for a syntatic pattern recognition system for genomic DNA sequences

    Energy Technology Data Exchange (ETDEWEB)

    Searles, D.B.

    1993-03-01

    The goal of the proposed work is the creation of a software system that will perform sophisticated pattern recognition and related functions at a level of abstraction and with expressive power beyond current general-purpose pattern-matching systems for biological sequences; and with a more uniform language, environment, and graphical user interface, and with greater flexibility, extensibility, embeddability, and ability to incorporate other algorithms, than current special-purpose analytic software.

  19. [Incorporation monitoring of employees of a radioiodine therapy ward. Is incorporation monitoring required for routine?].

    Science.gov (United States)

    Happel, C; Kratzel, U; Selkinski, I; Bockisch, B; Etzel, M; Korkusuz, H; Sauter, B; Staudt, J; von Müller, F; Grünwald, F; Kranert, W T

    2013-01-01

    Aim of the study was to determine the annual incorporation of staff on a radioiodine therapy ward and the resulting annual effective dose (aed). Following the German incorporation guideline (gig), incorporation monitoring is not necessary for potential aed below 0.5 mSv/a. For aed > 0.5 mSv/a adherence to the 1 mSv dose limit must be verified. For doses > 1 mSv/a incorporation has to be monitored by the authority. Furthermore, the (131)I incorporation factor from the gig should be verified. To determine the actual work related incorporation, the (131)I activity concentration in urine samples (collection over 24 h) of 14 employees of different professions were examined over a period of 27 months. Measured activity concentrations were related to the individual time of exposure. A constant activity supply for at least three days was assumed. The mean annual effective doses were 2.4 · 10⁻¹ mSv/a (nursing staff; n = 3), 5.6 · 10⁻² mSv/a (cleaning staff; n = 2), 2.8 · 10⁻³ mSv/a (technical staff; n = 2) and 5.2 · 10⁻³ mSv/a (physicians; n = 7). All aed were below the dose limits of the gig. The calculated mean incorporation factors ranged from 3.0 · 10⁻⁸ for the nursing staff to 3.6 · 10⁻¹⁰ for the technical staff (cleaning staff: 7 · 10⁻⁹; physicians: 6.5 · 10⁻¹⁰) and were therefore well below the (131)I incorporation factor defined by the gig. To estimate the aed caused by incorporation of (131)I it has to be subdivided for the different requirements in the diverse fields of activity of the employees. Regarding those who spend most of their time nearby the patient an incorporation monitoring by the authority might be required. The (131)I incorporation factor from the guideline (10⁻⁶) can be reduced by a factor of 10. For (99m)Tc and (18)F an incorporation factor of 10⁻⁷ is accepted.

  20. Revised sequence components power system models for unbalanced power system studies

    Energy Technology Data Exchange (ETDEWEB)

    Abdel-Akher, M. [Tunku Abdul Rahman Univ., Kuala Lumpur (Malaysia); Nor, K.-M. [Univ. of Technology Malaysia, Johor (Malaysia); Rashid, A.H.A. [Univ. of Malaya, Kuala Lumpur (Malaysia)

    2007-07-01

    The principle method of analysis using positive, negative, and zero-sequence networks has been used to examine the balanced power system under both balanced and unbalanced loading conditions. The significant advantage of the sequence networks is that the sequence networks become entirely uncoupled in the case of balanced three-phase power systems. The uncoupled sequence networks then can be solved in independent way such as in fault calculation programs. However, the hypothesis of balanced power systems cannot be considered in many cases due to untransposed transmission lines; multiphase line segments in a distribution power system; or transformer phase shifts which cannot be incorporated in the existing models. A revised sequence decoupled power system models for analyzing unbalanced power systems based on symmetrical networks was presented in this paper. These models included synchronous machines, transformers, transmission lines, and voltage regulators. The models were derived from their counterpart's models in phase coordinates frame of reference. In these models, the three sequence networks were fully decoupled with a three-phase coordinates features such as transformer phase shifts and transmission line coupling. The proposed models were used to develop an unbalanced power-flow program for analyzing both balanced and unbalanced networks. The power flow solution was identical to results obtained from a full phase coordinate three-phase power-flow program. 11 refs., 3 tabs.

  1. Draft genome sequences of seven isolates of Phytophthora ramorum EU2 from Northern Ireland

    Directory of Open Access Journals (Sweden)

    Lourdes de la Mata Saez

    2015-12-01

    Full Text Available Here we present draft-quality genome sequence assemblies for the oomycete Phytophthora ramorum genetic lineage EU2. We sequenced genomes of seven isolates collected in Northern Ireland between 2010 and 2012. Multiple genome sequences from P. ramorum EU2 will be valuable for identifying genetic variation within the clonal lineage that can be useful for tracking its spread.

  2. 32 CFR 507.10 - Incorporation of designs or likenesses of approved designs in commercial articles.

    Science.gov (United States)

    2010-07-01

    ... DECORATIONS, MEDALS, BADGES, INSIGNIA, COMMERCIAL USE OF HERALDIC DESIGNS AND HERALDIC QUALITY CONTROL PROGRAM... organizational insignia may be incorporated in articles manufactured for sale provided that permission has been... the coat of arms, crest, seal and organizational emblems. Such permission will be in writing...

  3. Distribution of genotype network sizes in sequence-to-structure genotype-phenotype maps.

    Science.gov (United States)

    Manrubia, Susanna; Cuesta, José A

    2017-04-01

    An essential quantity to ensure evolvability of populations is the navigability of the genotype space. Navigability, understood as the ease with which alternative phenotypes are reached, relies on the existence of sufficiently large and mutually attainable genotype networks. The size of genotype networks (e.g. the number of RNA sequences folding into a particular secondary structure or the number of DNA sequences coding for the same protein structure) is astronomically large in all functional molecules investigated: an exhaustive experimental or computational study of all RNA folds or all protein structures becomes impossible even for moderately long sequences. Here, we analytically derive the distribution of genotype network sizes for a hierarchy of models which successively incorporate features of increasingly realistic sequence-to-structure genotype-phenotype maps. The main feature of these models relies on the characterization of each phenotype through a prototypical sequence whose sites admit a variable fraction of letters of the alphabet. Our models interpolate between two limit distributions: a power-law distribution, when the ordering of sites in the prototypical sequence is strongly constrained, and a lognormal distribution, as suggested for RNA, when different orderings of the same set of sites yield different phenotypes. Our main result is the qualitative and quantitative identification of those features of sequence-to-structure maps that lead to different distributions of genotype network sizes. © 2017 The Author(s).

  4. Optimizing the magnetization-prepared rapid gradient-echo (MP-RAGE sequence.

    Directory of Open Access Journals (Sweden)

    Jinghua Wang

    Full Text Available The three-dimension (3D magnetization-prepared rapid gradient-echo (MP-RAGE sequence is one of the most popular sequences for structural brain imaging in clinical and research settings. The sequence captures high tissue contrast and provides high spatial resolution with whole brain coverage in a short scan time. In this paper, we first computed the optimal k-space sampling by optimizing the contrast of simulated images acquired with the MP-RAGE sequence at 3.0 Tesla using computer simulations. Because the software of our scanner has only limited settings for k-space sampling, we then determined the optimal k-space sampling for settings that can be realized on our scanner. Subsequently we optimized several major imaging parameters to maximize normal brain tissue contrasts under the optimal k-space sampling. The optimal parameters are flip angle of 12°, effective inversion time within 900 to 1100 ms, and delay time of 0 ms. In vivo experiments showed that the quality of images acquired with our optimal protocol was significantly higher than that of images obtained using recommended protocols in prior publications. The optimization of k-spacing sampling and imaging parameters significantly improved the quality and detection sensitivity of brain images acquired with MP-RAGE.

  5. Mining dynamic noteworthy functions in software execution sequences.

    Science.gov (United States)

    Zhang, Bing; Huang, Guoyan; Wang, Yuqian; He, Haitao; Ren, Jiadong

    2017-01-01

    As the quality of crucial entities can directly affect that of software, their identification and protection become an important premise for effective software development, management, maintenance and testing, which thus contribute to improving the software quality and its attack-defending ability. Most analysis and evaluation on important entities like codes-based static structure analysis are on the destruction of the actual software running. In this paper, from the perspective of software execution process, we proposed an approach to mine dynamic noteworthy functions (DNFM)in software execution sequences. First, according to software decompiling and tracking stack changes, the execution traces composed of a series of function addresses were acquired. Then these traces were modeled as execution sequences and then simplified so as to get simplified sequences (SFS), followed by the extraction of patterns through pattern extraction (PE) algorithm from SFS. After that, evaluating indicators inner-importance and inter-importance were designed to measure the noteworthiness of functions in DNFM algorithm. Finally, these functions were sorted by their noteworthiness. Comparison and contrast were conducted on the experiment results from two traditional complex network-based node mining methods, namely PageRank and DegreeRank. The results show that the DNFM method can mine noteworthy functions in software effectively and precisely.

  6. Single-base resolution and long-coverage sequencing based on single-molecule nanomanipulation

    International Nuclear Information System (INIS)

    An Hongjie; Huang Jiehuan; Lue Ming; Li Xueling; Lue Junhong; Li Haikuo; Zhang Yi; Li Minqian; Hu Jun

    2007-01-01

    We show new approaches towards a novel single-molecule sequencing strategy which consists of high-resolution positioning isolation of overlapping DNA fragments with atomic force microscopy (AFM), subsequent single-molecule PCR amplification and conventional Sanger sequencing. In this study, a DNA labelling technique was used to guarantee the accuracy in positioning the target DNA. Single-molecule multiplex PCR was carried out to test the contamination. The results showed that the two overlapping DNA fragments isolated by AFM could be successfully sequenced with high quality and perfect contiguity, indicating that single-base resolution and long-coverage sequencing have been achieved simultaneously

  7. ViCTree: An automated framework for taxonomic classification from protein sequences.

    Science.gov (United States)

    Modha, Sejal; Thanki, Anil; Cotmore, Susan F; Davison, Andrew J; Hughes, Joseph

    2018-02-20

    The increasing rate of submission of genetic sequences into public databases is providing a growing resource for classifying the organisms that these sequences represent. To aid viral classification, we have developed ViCTree, which automatically integrates the relevant sets of sequences in NCBI GenBank and transforms them into an interactive maximum likelihood phylogenetic tree that can be updated automatically. ViCTree incorporates ViCTreeView, which is a JavaScript-based visualisation tool that enables the tree to be explored interactively in the context of pairwise distance data. To demonstrate utility, ViCTree was applied to subfamily Densovirinae of family Parvoviridae. This led to the identification of six new species of insect virus. ViCTree is open-source and can be run on any Linux- or Unix-based computer or cluster. A tutorial, the documentation and the source code are available under a GPL3 license, and can be accessed at http://bioinformatics.cvr.ac.uk/victree_web/. sejal.modha@glasgow.ac.uk.

  8. [Methodological quality and reporting quality evaluation of randomized controlled trials published in China Journal of Chinese Materia Medica].

    Science.gov (United States)

    Yu, Dan-Dan; Xie, Yan-Ming; Liao, Xing; Zhi, Ying-Jie; Jiang, Jun-Jie; Chen, Wei

    2018-02-01

    To evaluate the methodological quality and reporting quality of randomized controlled trials(RCTs) published in China Journal of Chinese Materia Medica, we searched CNKI and China Journal of Chinese Materia webpage to collect RCTs since the establishment of the magazine. The Cochrane risk of bias assessment tool was used to evaluate the methodological quality of RCTs. The CONSORT 2010 list was adopted as reporting quality evaluating tool. Finally, 184 RCTs were included and evaluated methodologically, of which 97 RCTs were evaluated with reporting quality. For the methodological evaluating, 62 trials(33.70%) reported the random sequence generation; 9(4.89%) trials reported the allocation concealment; 25(13.59%) trials adopted the method of blinding; 30(16.30%) trials reported the number of patients withdrawing, dropping out and those lost to follow-up;2 trials (1.09%) reported trial registration and none of the trial reported the trial protocol; only 8(4.35%) trials reported the sample size estimation in details. For reporting quality appraising, 3 reporting items of 25 items were evaluated with high-quality,including: abstract, participants qualified criteria, and statistical methods; 4 reporting items with medium-quality, including purpose, intervention, random sequence method, and data collection of sites and locations; 9 items with low-quality reporting items including title, backgrounds, random sequence types, allocation concealment, blindness, recruitment of subjects, baseline data, harms, and funding;the rest of items were of extremely low quality(the compliance rate of reporting item<10%). On the whole, the methodological and reporting quality of RCTs published in the magazine are generally low. Further improvement in both methodological and reporting quality for RCTs of traditional Chinese medicine are warranted. It is recommended that the international standards and procedures for RCT design should be strictly followed to conduct high-quality trials

  9. Rumen microbial growth estimation using in vitro radiophosphorous incorporation technique

    International Nuclear Information System (INIS)

    Bueno, Ives Claudio da Silva; Machado, Mariana de Carvalho; Cabral Filho, Sergio Lucio Salomon; Gobbo, Sarita Priscila; Vitti, Dorinha Miriam Silber Schmidt; Abdalla, Adibe Luiz

    2002-01-01

    Rumen microorganisms are able to transform low biological value nitrogen of feed stuff into high quality protein. To determine how much microbial protein that process forms, radiomarkers can be used. Radiophosphorous has been used to mark microbial protein, as element P is present in all rumen microorganisms (as phospholipids) and the P:N ratio of rumen biomass is quite constant. The aim of this work was to estimate microbial synthesis from feedstuff commonly used in ruminant nutrition in Brazil. Tested feeds were fresh alfalfa, raw sugarcane bagasse, rice hulls, rice meal, soybean meal, wheat meal, Tifton hay, leucaena, dehydrated citrus pulp, wet brewers' grains and cottonseed meal. 32 P-labelled phosphate solution was used as marker for microbial protein. Results showed the diversity of feeds by distinct quantities of nitrogen incorporated into microbial mass. Low nutrient availability feeds (sugarcane bagasse and rice hulls) promoted the lowest values of incorporated nitrogen. Nitrogen incorporation showed positive relationship (r=0.56; P=0.06) with the rate of degradation and negative relationship (r=-0.59; P<0.05) with fiber content of feeds. The results highlight that easier fermentable feeds (higher rates of degradation) and/or with lower fiber contents promote a more efficient microbial growth and better performance for the host animal. (author)

  10. Rumen microbial growth estimation using in vitro radiophosphorous incorporation technique

    Energy Technology Data Exchange (ETDEWEB)

    Bueno, Ives Claudio da Silva; Machado, Mariana de Carvalho; Cabral Filho, Sergio Lucio Salomon; Gobbo, Sarita Priscila; Vitti, Dorinha Miriam Silber Schmidt; Abdalla, Adibe Luiz [Centro de Energia Nuclear na Agricultura (CENA), Piracicaba, SP (Brazil)

    2002-07-01

    Rumen microorganisms are able to transform low biological value nitrogen of feed stuff into high quality protein. To determine how much microbial protein that process forms, radiomarkers can be used. Radiophosphorous has been used to mark microbial protein, as element P is present in all rumen microorganisms (as phospholipids) and the P:N ratio of rumen biomass is quite constant. The aim of this work was to estimate microbial synthesis from feedstuff commonly used in ruminant nutrition in Brazil. Tested feeds were fresh alfalfa, raw sugarcane bagasse, rice hulls, rice meal, soybean meal, wheat meal, Tifton hay, leucaena, dehydrated citrus pulp, wet brewers' grains and cottonseed meal. {sup 32} P-labelled phosphate solution was used as marker for microbial protein. Results showed the diversity of feeds by distinct quantities of nitrogen incorporated into microbial mass. Low nutrient availability feeds (sugarcane bagasse and rice hulls) promoted the lowest values of incorporated nitrogen. Nitrogen incorporation showed positive relationship (r=0.56; P=0.06) with the rate of degradation and negative relationship (r=-0.59; P<0.05) with fiber content of feeds. The results highlight that easier fermentable feeds (higher rates of degradation) and/or with lower fiber contents promote a more efficient microbial growth and better performance for the host animal. (author)

  11. PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET sequence data

    Directory of Open Access Journals (Sweden)

    Wei Chia-Lin

    2006-08-01

    Full Text Available Abstract Background We recently developed the Paired End diTag (PET strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. Results We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the ProjectManager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. Conclusion The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.

  12. Galactic chemical evolution with main-sequence mass loss and the distribution of F and G dwarfs

    International Nuclear Information System (INIS)

    Guzik, J.A.; Struck-Marcell, C.

    1988-01-01

    Simple closed galactic chemical-evolution models incorporating early main-sequence stellar mass loss have been developed for disk ages of 5, 10, and 15 Gyr. Relative to models without stellar mass loss, the models are shown to produce a 30-60 percent increase in the present mass ratio of dwarfs to dwarfs plus remnants, and a 200-250 percent increase in the total mass of late F dwarfs remaining on the main sequence at the current disk age. For present disk ages 5 and 10 Gyr, the total mass of mid-F dwarfs remaining on the main sequence is also shown to increase by 90-120 percent. It is concluded that models with main-sequence mass loss have a slightly reduced gas metallicity and slightly increased gas fraction midway through the evolution. 30 references

  13. Genome survey sequencing and genetic background characterization of Gracilariopsis lemaneiformis (Rhodophyta) based on next-generation sequencing.

    Science.gov (United States)

    Zhou, Wei; Hu, Yiyi; Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

    2013-01-01

    Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon.

  14. Genome Survey Sequencing and Genetic Background Characterization of Gracilariopsis lemaneiformis (Rhodophyta) Based on Next-Generation Sequencing

    Science.gov (United States)

    Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

    2013-01-01

    Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon. PMID:23875008

  15. A code reviewer assignment model incorporating the competence differences and participant preferences

    Directory of Open Access Journals (Sweden)

    Wang Yanqing

    2016-03-01

    Full Text Available A good assignment of code reviewers can effectively utilize the intellectual resources, assure code quality and improve programmers’ skills in software development. However, little research on reviewer assignment of code review has been found. In this study, a code reviewer assignment model is created based on participants’ preference to reviewing assignment. With a constraint of the smallest size of a review group, the model is optimized to maximize review outcomes and avoid the negative impact of “mutual admiration society”. This study shows that the reviewer assignment strategies incorporating either the reviewers’ preferences or the authors’ preferences get much improvement than a random assignment. The strategy incorporating authors’ preference makes higher improvement than that incorporating reviewers’ preference. However, when the reviewers’ and authors’ preference matrixes are merged, the improvement becomes moderate. The study indicates that the majority of the participants have a strong wish to work with reviewers and authors having highest competence. If we want to satisfy the preference of both reviewers and authors at the same time, the overall improvement of learning outcomes may be not the best.

  16. Prediction of accident sequence probabilities in a nuclear power plant due to earthquake events

    International Nuclear Information System (INIS)

    Hudson, J.M.; Collins, J.D.

    1980-01-01

    This paper presents a methodology to predict accident probabilities in nuclear power plants subject to earthquakes. The resulting computer program accesses response data to compute component failure probabilities using fragility functions. Using logical failure definitions for systems, and the calculated component failure probabilities, initiating event and safety system failure probabilities are synthesized. The incorporation of accident sequence expressions allows the calculation of terminal event probabilities. Accident sequences, with their occurrence probabilities, are finally coupled to a specific release category. A unique aspect of the methodology is an analytical procedure for calculating top event probabilities based on the correlated failure of primary events

  17. Methods improvements incorporated into the SAPHIRE ASP models

    International Nuclear Information System (INIS)

    Sattison, M.B.; Blackman, H.S.; Novack, S.D.; Smith, C.L.; Rasmuson, D.M.

    1994-01-01

    The Office for Analysis and Evaluation of Operational Data (AEOD) has sought the assistance of the Idaho National Engineering Laboratory (INEL) to make some significant enhancements to the SAPHIRE-based Accident Sequence Precursor (ASP) models recently developed by the INEL. The challenge of this project is to provide the features of a full-scale PRA within the framework of the simplified ASP models. Some of these features include: (1) uncertainty analysis addressing the standard PRA uncertainties and the uncertainties unique to the ASP models and methodology, (2) incorporation and proper quantification of individual human actions and the interaction among human actions, (3) enhanced treatment of common cause failures, and (4) extension of the ASP models to more closely mimic full-scale PRAs (inclusion of more initiators, explicitly modeling support system failures, etc.). This paper provides an overview of the methods being used to make the above improvements

  18. Methods improvements incorporated into the SAPHIRE ASP models

    International Nuclear Information System (INIS)

    Sattison, M.B.; Blackman, H.S.; Novack, S.D.

    1995-01-01

    The Office for Analysis and Evaluation of Operational Data (AEOD) has sought the assistance of the Idaho National Engineering Laboratory (INEL) to make some significant enhancements to the SAPHIRE-based Accident Sequence Precursor (ASP) models recently developed by the INEL. The challenge of this project is to provide the features of a full-scale PRA within the framework of the simplified ASP models. Some of these features include: (1) uncertainty analysis addressing the standard PRA uncertainties and the uncertainties unique to the ASP models and methods, (2) incorporation and proper quantification of individual human actions and the interaction among human actions, (3) enhanced treatment of common cause failures, and (4) extension of the ASP models to more closely mimic full-scale PRAs (inclusion of more initiators, explicitly modeling support system failures, etc.). This paper provides an overview of the methods being used to make the above improvements

  19. Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features.

    Science.gov (United States)

    Iwata, Hiroaki; Gotoh, Osamu

    2012-11-01

    Spliced alignment plays a central role in the precise identification of eukaryotic gene structures. Even though many spliced alignment programs have been developed, recent rapid progress in DNA sequencing technologies demands further improvements in software tools. Benchmarking algorithms under various conditions is an indispensable task for the development of better software; however, there is a dire lack of appropriate datasets usable for benchmarking spliced alignment programs. In this study, we have constructed two types of datasets: simulated sequence datasets and actual cross-species datasets. The datasets are designed to correspond to various real situations, i.e. divergent eukaryotic species, different types of reference sequences, and the wide divergence between query and target sequences. In addition, we have developed an extended version of our program Spaln, which incorporates two additional features to the scoring scheme of the original version, and examined this extended version, Spaln2, together with the original Spaln and other representative aligners based on our benchmark datasets. Although the effects of the modifications are not individually striking, Spaln2 is consistently most accurate and reasonably fast in most practical cases, especially for plants and fungi and for increasingly divergent pairs of target and query sequences.

  20. Effect of different fibers on dough properties and biscuit quality.

    Science.gov (United States)

    Blanco Canalis, María S; Steffolani, María E; León, Alberto E; Ribotta, Pablo D

    2017-03-01

    This study forms part of a broader project aimed at understanding the role of fibers from different sources in high-fat, high-sugar biscuits and at selecting the best fibers for biscuit quality. The main purpose of this work was to understand the rheological and structural properties involved in fiber-enriched biscuit dough. High-amylose corn starch (RSII), chemically modified starch (RSIV), oat fiber (OF) and inulin (IN) were used at two different levels of incorporation (6 and 12 g) in dough formulation. The influence of fiber on the properties of biscuit dough was studied via dynamic rheological tests, confocal microscopy and spreading behavior. Biscuit quality was assessed by width/thickness factor, texture and surface characteristics, total dietary fiber and sensory evaluation. Main results indicated that IN incorporation increased the capacity of dough spreading during baking and thus improved biscuit quality. OF reduced dough spreading during baking and strongly increased its resistance to deformation. RSII and RSIV slightly affected the quality of the biscuits. Sensory evaluation revealed that the panel liked IN-incorporated biscuits as much as control biscuits. The increase in total dietary fiber modified dough behavior and biscuit properties, and the extent of these effects depended on the type of fiber incorporated. © 2016 Society of Chemical Industry. © 2016 Society of Chemical Industry.

  1. Maximum-likelihood model averaging to profile clustering of site types across discrete linear sequences.

    Directory of Open Access Journals (Sweden)

    Zhang Zhang

    2009-06-01

    Full Text Available A major analytical challenge in computational biology is the detection and description of clusters of specified site types, such as polymorphic or substituted sites within DNA or protein sequences. Progress has been stymied by a lack of suitable methods to detect clusters and to estimate the extent of clustering in discrete linear sequences, particularly when there is no a priori specification of cluster size or cluster count. Here we derive and demonstrate a maximum likelihood method of hierarchical clustering. Our method incorporates a tripartite divide-and-conquer strategy that models sequence heterogeneity, delineates clusters, and yields a profile of the level of clustering associated with each site. The clustering model may be evaluated via model selection using the Akaike Information Criterion, the corrected Akaike Information Criterion, and the Bayesian Information Criterion. Furthermore, model averaging using weighted model likelihoods may be applied to incorporate model uncertainty into the profile of heterogeneity across sites. We evaluated our method by examining its performance on a number of simulated datasets as well as on empirical polymorphism data from diverse natural alleles of the Drosophila alcohol dehydrogenase gene. Our method yielded greater power for the detection of clustered sites across a breadth of parameter ranges, and achieved better accuracy and precision of estimation of clusters, than did the existing empirical cumulative distribution function statistics.

  2. 40 CFR 52.1116 - Significant deterioration of air quality.

    Science.gov (United States)

    2010-07-01

    ... quality. 52.1116 Section 52.1116 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Significant deterioration of air quality. (a) The requirements of sections 160 through 165 of the Clean Air... deterioration of air quality. (b) The following provisions of 40 CFR 52.21 are hereby incorporated and made a...

  3. The Douglas-Fir Genome Sequence Reveals Specialization of the Photosynthetic Apparatus in Pinaceae

    Directory of Open Access Journals (Sweden)

    David B. Neale

    2017-09-01

    Full Text Available A reference genome sequence for Pseudotsuga menziesii var. menziesii (Mirb. Franco (Coastal Douglas-fir is reported, thus providing a reference sequence for a third genus of the family Pinaceae. The contiguity and quality of the genome assembly far exceeds that of other conifer reference genome sequences (contig N50 = 44,136 bp and scaffold N50 = 340,704 bp. Incremental improvements in sequencing and assembly technologies are in part responsible for the higher quality reference genome, but it may also be due to a slightly lower exact repeat content in Douglas-fir vs. pine and spruce. Comparative genome annotation with angiosperm species reveals gene-family expansion and contraction in Douglas-fir and other conifers which may account for some of the major morphological and physiological differences between the two major plant groups. Notable differences in the size of the NDH-complex gene family and genes underlying the functional basis of shade tolerance/intolerance were observed. This reference genome sequence not only provides an important resource for Douglas-fir breeders and geneticists but also sheds additional light on the evolutionary processes that have led to the divergence of modern angiosperms from the more ancient gymnosperms.

  4. Effect of pumpkin powder incorporation on cooking and sensory parameters of pasta

    Directory of Open Access Journals (Sweden)

    Lucia Minarovičová

    2017-01-01

    Full Text Available Pasta is commonly consumed and low glycaemic cereal-based food with long shelf life, which is suitable food matrix for substitution with functional ingredients. Pumpkin powder can be considered as suitable component in pasta making. The effect of addition of pumpkin powder at different levels (5, 7.5, and 10% on wheat dough rheology, cooking quality of pasta as well as on sensory properties was evaluated. Pumpkin powder included: 6.1% moisture, 8.2%, protein, 0.7% crude fat, 2.3% ash, 27.4% total dietary fiber which is included soluble (10.2% and insoluble (17.2% dietary fiber. Farinograph properties of pumpkin powder incorporated dough showed increase in water absorption and dough development time while dough stability and mixing tolerance indes were decreased. It was also observed that addition of pumpkin powder significantly altered cooking quality of pasta. Generally, pasta incorporated with pumpkin powder had shorter optimal cooking time. The shortest optimal time (5.9 min was after addition of 10% of pumpkin powder compared with control pasta (7.0 min. From the results also concluded that addition of pumpkin powder significantly increased cooking loss. The highest cooking loss (6.6% was after addition 10% pumpkin powder. Furthermore it could be stated that incorporation of pumpkin powder in pasta increased water absorption of pasta from 181.0% (control to 211.2% (10% of pumpkin powder. From sensory evaluation resulted that pumpkin powder incorporated pasta were characterized by lower colour, flavor and grain taste. On the other hand vegetable taste and granular structure of pasta increased with higher addition level of pumpkin powder. Moreover, it was concluded that pasta with addition level 10% were the most acceptable for accesors. Normal 0 21 false false false EN-GB X-NONE X-NONE Normal 0 21 false false false EN-GB X-NONE X-NONE

  5. A Novel Ant Colony Algorithm for the Single-Machine Total Weighted Tardiness Problem with Sequence Dependent Setup Times

    Directory of Open Access Journals (Sweden)

    Fardin Ahmadizar

    2011-08-01

    Full Text Available This paper deals with the NP-hard single-machine total weighted tardiness problem with sequence dependent setup times. Incorporating fuzzy sets and genetic operators, a novel ant colony optimization algorithm is developed for the problem. In the proposed algorithm, artificial ants construct solutions as orders of jobs based on the heuristic information as well as pheromone trails. To calculate the heuristic information, three well-known priority rules are adopted as fuzzy sets and then aggregated. When all artificial ants have terminated their constructions, genetic operators such as crossover and mutation are applied to generate new regions of the solution space. A local search is then performed to improve the performance quality of some of the solutions found. Moreover, at run-time the pheromone trails are locally as well as globally updated, and limited between lower and upper bounds. The proposed algorithm is experimented on a set of benchmark problems from the literature and compared with other metaheuristics.

  6. Effect of alginate/carboxyl methyl cellulose composite coating incorporated with clove essential oil on the quality of silver carp fillet and Escherichia coli O157:H7 inhibition during refrigerated storage.

    Science.gov (United States)

    Jalali, Nastaran; Ariiai, Peiman; Fattahi, Esmaeil

    2016-01-01

    The effects of alginate/carboxyl methylcellulose composite coating incorporated with clove essential oil on quality of silver carp fillet chilled storage (4 + 1 °C) were examined over a period of 16 days. The control samples (c), alginate/carboxyl methylcellulose coating (C-A), alginate/carboxyl methylcellulose composite coating incorporated with clove essential oil (with different concentration 1 and 1.5 %) (C-A + CEO1 % and C-A + CEO 15 % respectively) were analyzed by bacteriological (total viable counts (TVC) and total psychrotrophic counts (TPC)), biochemical (Peroxide value (PV), free fatty acid (FFA), total volatile base nitrogen (TVB-N), and pH) and sensory characteristics. Also, the efficacy of these treatments was investigated in control of the population of Eschershia coli O157:H7 inoculated in silver carp fillet. According to the obtained results, C-A + CEO 1.5 % showed lowest (p clove essential oil might be recommended as a preservative in the meat products.

  7. False discovery rate control incorporating phylogenetic tree increases detection power in microbiome-wide multiple testing.

    Science.gov (United States)

    Xiao, Jian; Cao, Hongyuan; Chen, Jun

    2017-09-15

    Next generation sequencing technologies have enabled the study of the human microbiome through direct sequencing of microbial DNA, resulting in an enormous amount of microbiome sequencing data. One unique characteristic of microbiome data is the phylogenetic tree that relates all the bacterial species. Closely related bacterial species have a tendency to exhibit a similar relationship with the environment or disease. Thus, incorporating the phylogenetic tree information can potentially improve the detection power for microbiome-wide association studies, where hundreds or thousands of tests are conducted simultaneously to identify bacterial species associated with a phenotype of interest. Despite much progress in multiple testing procedures such as false discovery rate (FDR) control, methods that take into account the phylogenetic tree are largely limited. We propose a new FDR control procedure that incorporates the prior structure information and apply it to microbiome data. The proposed procedure is based on a hierarchical model, where a structure-based prior distribution is designed to utilize the phylogenetic tree. By borrowing information from neighboring bacterial species, we are able to improve the statistical power of detecting associated bacterial species while controlling the FDR at desired levels. When the phylogenetic tree is mis-specified or non-informative, our procedure achieves a similar power as traditional procedures that do not take into account the tree structure. We demonstrate the performance of our method through extensive simulations and real microbiome datasets. We identified far more alcohol-drinking associated bacterial species than traditional methods. R package StructFDR is available from CRAN. chen.jun2@mayo.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  8. Incorporation monitoring, above all for naturally occuring radionuclides; Inkorporationsueberwachung, vor allem bei natuerlichen radioaktiven Stoffen

    Energy Technology Data Exchange (ETDEWEB)

    Breustedt, Bastian [Karlsruher Institut fuer Technologie (KIT), Eggenstein-Leopoldshafen (Germany). Inst. fuer Strahlenforschung; Buchholz, Werner [Bundesamt fuer Strahlenschutz, Oberschleissheim (Germany). FB Strahlenschutz und Gesundheit; Burow, Mechthild [Forschungszentrum Juelich GmbH (Germany). Geschaeftsbereich ' ' Sicherheit und Strahlenschutz' ' ; and others

    2017-08-01

    The contribution of the main topic deals with further development of ICRP-models and standardization of regulations in the field of incorporation monitoring. Natural radionuclides, medically used radionuclides and quality assurance, will be discussed. The planning and realization of a shielding chamber for a whole body counter is presented.

  9. Effect of sodium alginate coating incorporated with nisin, Cinnamomum zeylanicum, and rosemary essential oils on microbial quality of chicken meat and fate of Listeria monocytogenes during refrigeration.

    Science.gov (United States)

    Raeisi, Mojtaba; Tabaraei, Alijan; Hashemi, Mohammad; Behnampour, Nasser

    2016-12-05

    The present study was conducted to preserve the microbial quality of chicken meat fillets during storage time by using sodium alginate active coating solutions incorporated with different natural antimicrobials including nisin, Cinnamomum zeylanicum (cinnamon), and rosemary essential oils (EOs) which were added individually and in combination. The samples were stored in refrigeration condition for 15days and were analyzed for total viable count, Enterobacteriaceae count, lactic acid bacteria count, Pseudomonas spp. count, psychrotrophic count, and yeast and mold count, as well as fate of inoculated Listeria monocytogenes at 3-day intervals. Results indicated that values of tested microbial indicators in all samples increased during storage. Antimicrobial agents, when used in combination, had stronger effect in preserving the microbial quality of chicken meat samples rather than their individual use and the strongest effect was observed in samples coated with alginate solution containing both cinnamon and rosemary EOs (CEO+REO). However, all treatments significantly inhibited microbial growth when compared to the control (Ppreservatives is recommended in meat products especially in chicken meats. Copyright © 2016. Published by Elsevier B.V.

  10. Preliminary hazard analysis using sequence tree method

    International Nuclear Information System (INIS)

    Huang Huiwen; Shih Chunkuan; Hung Hungchih; Chen Minghuei; Yih Swu; Lin Jiinming

    2007-01-01

    A system level PHA using sequence tree method was developed to perform Safety Related digital I and C system SSA. The conventional PHA is a brainstorming session among experts on various portions of the system to identify hazards through discussions. However, this conventional PHA is not a systematic technique, the analysis results strongly depend on the experts' subjective opinions. The analysis quality cannot be appropriately controlled. Thereby, this research developed a system level sequence tree based PHA, which can clarify the relationship among the major digital I and C systems. Two major phases are included in this sequence tree based technique. The first phase uses a table to analyze each event in SAR Chapter 15 for a specific safety related I and C system, such as RPS. The second phase uses sequence tree to recognize what I and C systems are involved in the event, how the safety related systems work, and how the backup systems can be activated to mitigate the consequence if the primary safety systems fail. In the sequence tree, the defense-in-depth echelons, including Control echelon, Reactor trip echelon, ESFAS echelon, and Indication and display echelon, are arranged to construct the sequence tree structure. All the related I and C systems, include digital system and the analog back-up systems are allocated in their specific echelon. By this system centric sequence tree based analysis, not only preliminary hazard can be identified systematically, the vulnerability of the nuclear power plant can also be recognized. Therefore, an effective simplified D3 evaluation can be performed as well. (author)

  11. Assembly of the Complete Sitka Spruce Chloroplast Genome Using 10X Genomics' GemCode Sequencing Data.

    Directory of Open Access Journals (Sweden)

    Lauren Coombe

    Full Text Available The linked read sequencing library preparation platform by 10X Genomics produces barcoded sequencing libraries, which are subsequently sequenced using the Illumina short read sequencing technology. In this new approach, long fragments of DNA are partitioned into separate micro-reactions, where the same index sequence is incorporated into each of the sequencing fragment inserts derived from a given long fragment. In this study, we exploited this property by using reads from index sequences associated with a large number of reads, to assemble the chloroplast genome of the Sitka spruce tree (Picea sitchensis. Here we report on the first Sitka spruce chloroplast genome assembled exclusively from P. sitchensis genomic libraries prepared using the 10X Genomics protocol. We show that the resulting 124,049 base pair long genome shares high sequence similarity with the related white spruce and Norway spruce chloroplast genomes, but diverges substantially from a previously published P. sitchensis- P. thunbergii chimeric genome. The use of reads from high-frequency indices enabled separation of the nuclear genome reads from that of the chloroplast, which resulted in the simplification of the de Bruijn graphs used at the various stages of assembly.

  12. Analysis of 16S rRNA amplicon sequencing options on the Roche/454 next-generation titanium sequencing platform.

    Directory of Open Access Journals (Sweden)

    Hideyuki Tamaki

    Full Text Available BACKGROUND: 16S rRNA gene pyrosequencing approach has revolutionized studies in microbial ecology. While primer selection and short read length can affect the resulting microbial community profile, little is known about the influence of pyrosequencing methods on the sequencing throughput and the outcome of microbial community analyses. The aim of this study is to compare differences in output, ease, and cost among three different amplicon pyrosequencing methods for the Roche/454 Titanium platform METHODOLOGY/PRINCIPAL FINDINGS: The following three pyrosequencing methods for 16S rRNA genes were selected in this study: Method-1 (standard method is the recommended method for bi-directional sequencing using the LIB-A kit; Method-2 is a new option designed in this study for unidirectional sequencing with the LIB-A kit; and Method-3 uses the LIB-L kit for unidirectional sequencing. In our comparison among these three methods using 10 different environmental samples, Method-2 and Method-3 produced 1.5-1.6 times more useable reads than the standard method (Method-1, after quality-based trimming, and did not compromise the outcome of microbial community analyses. Specifically, Method-3 is the most cost-effective unidirectional amplicon sequencing method as it provided the most reads and required the least effort in consumables management. CONCLUSIONS: Our findings clearly demonstrated that alternative pyrosequencing methods for 16S rRNA genes could drastically affect sequencing output (e.g. number of reads before and after trimming but have little effect on the outcomes of microbial community analysis. This finding is important for both researchers and sequencing facilities utilizing 16S rRNA gene pyrosequencing for microbial ecological studies.

  13. Optimal Quality Strategy and Matching Service on Crowdfunding Platforms

    Directory of Open Access Journals (Sweden)

    Wenqing Wu

    2018-04-01

    Full Text Available This paper develops a crowdfunding platform model incorporating quality and a matching service from the perspective of a two-sided market. It aims to explore the impact of different factors on the optimal quality threshold and matching service in a context of crowdfunding from the perspective of a two-sided market. We discuss the impact of different factors on the optimal quality threshold and matching service. Two important influential factors are under consideration, simultaneously. One is the quality threshold of admission and the other is the matching efficiency on crowdfunding platforms. This paper develops a two-sided market model incorporating quality, a matching service, and the characters of crowdfunding campaigns. After attempting to solve the model by derivative method, this paper identifies the mechanism of how the parameters influence the optimal quality threshold and matching service. Additionally, it compares the platform profits in scenarios with and without an exclusion policy. The results demonstrate that excluding low-quality projects is profitable when funder preference for project quality is substantial enough. Crowdfunding platform managers would be unwise to admit the quality threshold of the crowdfunding project and charge entrance fees when the parameter of funder preference for project quality is small.

  14. Rapid development of microsatellite markers for Callosobruchus chinensis using Illumina paired-end sequencing.

    Directory of Open Access Journals (Sweden)

    Can-Xing Duan

    Full Text Available BACKGROUND: The adzuki bean weevil, Callosobruchus chinensis L., is one of the most destructive pests of stored legume seeds such as mungbean, cowpea, and adzuki bean, which usually cause considerable loss in the quantity and quality of stored seeds during transportation and storage. However, a lack of genetic information of this pest results in a series of genetic questions remain largely unknown, including population genetic structure, kinship, biotype abundance, and so on. Co-dominant microsatellite markers offer a great resolving power to determine these events. Here, we report rapid microsatellite isolation from C. chinensis via high-throughput sequencing. PRINCIPAL FINDINGS: In this study, 94,560,852 quality-filtered and trimmed reads were obtained for the assembly of genome using Illumina paired-end sequencing technology. In total, the genome with total length of 497,124,785 bp, comprising 403,113 high quality contigs was generated with de novo assembly. More than 6800 SSR loci were detected and a suit of 6303 primer pair sequences were designed and 500 of them were randomly selected for validation. Of these, 196 pair of primers, i.e. 39.2%, produced reproducible amplicons that were polymorphic among 8 C. chinensis genotypes collected from different geographical regions. Twenty out of 196 polymorphic SSR markers were used to analyze the genetic diversity of 18 C. chinensis populations. The results showed the twenty SSR loci were highly polymorphic among these populations. CONCLUSIONS: This study presents a first report of genome sequencing and de novo assembly for C. chinensis and demonstrates the feasibility of generating a large scale of sequence information and SSR loci isolation by Illumina paired-end sequencing. Our results provide a valuable resource for C. chinensis research. These novel markers are valuable for future genetic mapping, trait association, genetic structure and kinship among C. chinensis.

  15. MISTICA: Minimum Spanning Tree-Based Coarse Image Alignment for Microscopy Image Sequences.

    Science.gov (United States)

    Ray, Nilanjan; McArdle, Sara; Ley, Klaus; Acton, Scott T

    2016-11-01

    Registration of an in vivo microscopy image sequence is necessary in many significant studies, including studies of atherosclerosis in large arteries and the heart. Significant cardiac and respiratory motion of the living subject, occasional spells of focal plane changes, drift in the field of view, and long image sequences are the principal roadblocks. The first step in such a registration process is the removal of translational and rotational motion. Next, a deformable registration can be performed. The focus of our study here is to remove the translation and/or rigid body motion that we refer to here as coarse alignment. The existing techniques for coarse alignment are unable to accommodate long sequences often consisting of periods of poor quality images (as quantified by a suitable perceptual measure). Many existing methods require the user to select an anchor image to which other images are registered. We propose a novel method for coarse image sequence alignment based on minimum weighted spanning trees (MISTICA) that overcomes these difficulties. The principal idea behind MISTICA is to reorder the images in shorter sequences, to demote nonconforming or poor quality images in the registration process, and to mitigate the error propagation. The anchor image is selected automatically making MISTICA completely automated. MISTICA is computationally efficient. It has a single tuning parameter that determines graph width, which can also be eliminated by the way of additional computation. MISTICA outperforms existing alignment methods when applied to microscopy image sequences of mouse arteries.

  16. Nonparametric Inference for Periodic Sequences

    KAUST Repository

    Sun, Ying

    2012-02-01

    This article proposes a nonparametric method for estimating the period and values of a periodic sequence when the data are evenly spaced in time. The period is estimated by a "leave-out-one-cycle" version of cross-validation (CV) and complements the periodogram, a widely used tool for period estimation. The CV method is computationally simple and implicitly penalizes multiples of the smallest period, leading to a "virtually" consistent estimator of integer periods. This estimator is investigated both theoretically and by simulation.We also propose a nonparametric test of the null hypothesis that the data have constantmean against the alternative that the sequence of means is periodic. Finally, our methodology is demonstrated on three well-known time series: the sunspots and lynx trapping data, and the El Niño series of sea surface temperatures. © 2012 American Statistical Association and the American Society for Quality.

  17. High quality maize centromere 10 sequence reveals evidence of frequent recombination events

    Directory of Open Access Journals (Sweden)

    Thomas Kai Wolfgruber

    2016-03-01

    Full Text Available The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR have presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 x 10-6 and 5 x 10-5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb of the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length centromeric retrotransposons from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. This repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to facilitate the repair of frequent DSBs in centromeres.

  18. Validation of Genotyping-By-Sequencing Analysis in Populations of Tetraploid Alfalfa by 454 Sequencing

    Science.gov (United States)

    Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François

    2015-01-01

    Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486

  19. A-Cell equipment removal quality process plan

    International Nuclear Information System (INIS)

    TAKASUMI, D.S.

    1999-01-01

    This document establishes the quality assuring activities used to manage the 324 building A-Cell equipment removal activity. This activity will package, remove, transport and dispose of the equipment in A-Cell. This document is provided to ensure that appropriate and effective quality assuring activities have been incorporated into the work controlling documentation and procedures

  20. SSR_pipeline--computer software for the identification of microsatellite sequences from paired-end Illumina high-throughput DNA sequence data

    Science.gov (United States)

    Miller, Mark P.; Knaus, Brian J.; Mullins, Thomas D.; Haig, Susan M.

    2013-01-01

    SSR_pipeline is a flexible set of programs designed to efficiently identify simple sequence repeats (SSRs; for example, microsatellites) from paired-end high-throughput Illumina DNA sequencing data. The program suite contains three analysis modules along with a fourth control module that can be used to automate analyses of large volumes of data. The modules are used to (1) identify the subset of paired-end sequences that pass quality standards, (2) align paired-end reads into a single composite DNA sequence, and (3) identify sequences that possess microsatellites conforming to user specified parameters. Each of the three separate analysis modules also can be used independently to provide greater flexibility or to work with FASTQ or FASTA files generated from other sequencing platforms (Roche 454, Ion Torrent, etc). All modules are implemented in the Python programming language and can therefore be used from nearly any computer operating system (Linux, Macintosh, Windows). The program suite relies on a compiled Python extension module to perform paired-end alignments. Instructions for compiling the extension from source code are provided in the documentation. Users who do not have Python installed on their computers or who do not have the ability to compile software also may choose to download packaged executable files. These files include all Python scripts, a copy of the compiled extension module, and a minimal installation of Python in a single binary executable. See program documentation for more information.

  1. Thousands of primer-free, high-quality, full-length SSU rRNA sequences from all domains of life

    DEFF Research Database (Denmark)

    Karst, Soeren M; Dueholm, Morten S; McIlroy, Simon J

    2016-01-01

    Ribosomal RNA (rRNA) genes are the consensus marker for determination of microbial diversity on the planet, invaluable in studies of evolution and, for the past decade, high-throughput sequencing of variable regions of ribosomal RNA genes has become the backbone of most microbial ecology studies...... (SSU) rRNA genes and synthetic long read sequencing by molecular tagging, to generate primer-free, full-length SSU rRNA gene sequences from all domains of life, with a median raw error rate of 0.17%. We generated thousands of full-length SSU rRNA sequences from five well-studied ecosystems (soil, human...... gut, fresh water, anaerobic digestion, and activated sludge) and obtained sequences covering all domains of life and the majority of all described phyla. Interestingly, 30% of all bacterial operational taxonomic units were novel, compared to the SILVA database (less than 97% similarity...

  2. Quality characteristics of egg-reduced pound cakes following WPI and emulsifier incorporation

    NARCIS (Netherlands)

    Paraskevopoulou, A.; Donsouzi, S.; Nikiforidis, C.V.; Kiosseoglou, V.

    2015-01-01

    The effect of partial (50 wt%) or total liquid egg replacement by whey proteins in combination with emulsifiers, i.e. hydroxypropylmethylcellulose (HPMC) and sodium stearoyl-2-lactylate (SSL), on the quality of pound cakes was investigated. Cakes containing whey protein isolate (WPI) solutions of

  3. Genome Sequence of the Freshwater Yangtze Finless Porpoise.

    Science.gov (United States)

    Yuan, Yuan; Zhang, Peijun; Wang, Kun; Liu, Mingzhong; Li, Jing; Zheng, Jingsong; Wang, Ding; Xu, Wenjie; Lin, Mingli; Dong, Lijun; Zhu, Chenglong; Qiu, Qiang; Li, Songhai

    2018-04-16

    The Yangtze finless porpoise ( Neophocaena asiaeorientalis ssp. asiaeorientalis ) is a subspecies of the narrow-ridged finless porpoise ( N. asiaeorientalis ). In total, 714.28 gigabases (Gb) of raw reads were generated by whole-genome sequencing of the Yangtze finless porpoise, using an Illumina HiSeq 2000 platform. After filtering the low-quality and duplicated reads, we assembled a draft genome of 2.22 Gb, with contig N50 and scaffold N50 values of 46.69 kilobases (kb) and 1.71 megabases (Mb), respectively. We identified 887.63 Mb of repetitive sequences and predicted 18,479 protein-coding genes in the assembled genome. The phylogenetic tree showed a relationship between the Yangtze finless porpoise and the Yangtze River dolphin, which diverged approximately 20.84 million years ago. In comparisons with the genomes of 10 other mammals, we detected 44 species-specific gene families, 164 expanded gene families, and 313 positively selected genes in the Yangtze finless porpoise genome. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information under BioProject accession number PRJNA433603.

  4. Long-read sequencing data analysis for yeasts.

    Science.gov (United States)

    Yue, Jia-Xing; Liti, Gianni

    2018-06-01

    Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ∼41 h to generate a complete and well-annotated genome from ∼100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.

  5. DNA Extraction Protocols for Whole-Genome Sequencing in Marine Organisms.

    Science.gov (United States)

    Panova, Marina; Aronsson, Henrik; Cameron, R Andrew; Dahl, Peter; Godhe, Anna; Lind, Ulrika; Ortega-Martinez, Olga; Pereyra, Ricardo; Tesson, Sylvie V M; Wrange, Anna-Lisa; Blomberg, Anders; Johannesson, Kerstin

    2016-01-01

    The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the earths' different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.

  6. Fast T1-weighted imaging using GRASE sequence for the female pelvis

    International Nuclear Information System (INIS)

    Dohke, Masako; Watanabe, Yuji; Kumashiro, Masayuki; Amoh, Yoshiki; Ishimori, Takayoshi; Oda, Kazushige; Okumura, Akira; Koike, Shinji; Dodo, Yoshihiro

    1998-01-01

    GRASE sequence, a combination of TSE and gradient echo, has been developed as a fast T 2 -weighted imaging technique. We have modified the GRASE sequence to be used for fast T 1 -weighted imaging of the female pelvis. In this article, we compared image quality and incidence of artifacts between T 1 -weighted GRASE images and conventional T 1 -weighted SE images. In a phantom study, signal-to-noise ratio was inferior in the GRASE images relative to corresponding on SE images. Susceptibility and chemical shift artifacts seen in GRASE images were seen with almost equal incidence in SE and TSE images. In a clinical study, we compared GRASE images with SE images in six patients with endometrial cysts and four patients with dermoid cysts. The overall image quality obtained with GRASE sequence was satisfactory in all patients and was almost identical with that obtained with SE sequence. GRASE images demonstrated endometrial cysts and dermoid cysts as clearly as did SE images. T 1 -weighted GRASE imaging, however, has a relatively long TE (35 ms) for T 1 -weighted images, which makes the signal intensity of urine and uterine endometrium with long T 2 values higher than in SE images. In conclusion, GRASE sequence can be used for fast T 1 -weighted imaging of the female pelvis because of short imaging time. (author)

  7. Bromine isotopic signature facilitates de novo sequencing of peptides in free-radical-initiated peptide sequencing (FRIPS) mass spectrometry.

    Science.gov (United States)

    Nam, Jungjoo; Kwon, Hyuksu; Jang, Inae; Jeon, Aeran; Moon, Jingyu; Lee, Sun Young; Kang, Dukjin; Han, Sang Yun; Moon, Bongjin; Oh, Han Bin

    2015-02-01

    We recently showed that free-radical-initiated peptide sequencing mass spectrometry (FRIPS MS) assisted by the remarkable thermochemical stability of (2,2,6,6-tetramethyl-piperidin-1-yl)oxyl (TEMPO) is another attractive radical-driven peptide fragmentation MS tool. Facile homolytic cleavage of the bond between the benzylic carbon and the oxygen of the TEMPO moiety in o-TEMPO-Bz-C(O)-peptide and the high reactivity of the benzylic radical species generated in •Bz-C(O)-peptide are key elements leading to extensive radical-driven peptide backbone fragmentation. In the present study, we demonstrate that the incorporation of bromine into the benzene ring, i.e. o-TEMPO-Bz(Br)-C(O)-peptide, allows unambiguous distinction of the N-terminal peptide fragments from the C-terminal fragments through the unique bromine doublet isotopic signature. Furthermore, bromine substitution does not alter the overall radical-driven peptide backbone dissociation pathways of o-TEMPO-Bz-C(O)-peptide. From a practical perspective, the presence of the bromine isotopic signature in the N-terminal peptide fragments in TEMPO-assisted FRIPS MS represents a useful and cost-effective opportunity for de novo peptide sequencing. Copyright © 2015 John Wiley & Sons, Ltd.

  8. Engaging clinical nurses in quality and performance improvement activities.

    Science.gov (United States)

    Albanese, Madeline P; Evans, Dietra A; Schantz, Cathy A; Bowen, Margaret; Disbot, Maureen; Moffa, Joseph S; Piesieski, Patricia; Polomano, Rosemary C

    2010-01-01

    Nursing performance measures are an integral part of quality initiatives in acute care; however, organizations face numerous challenges in developing infrastructures to support quality improvement processes and timely dissemination of outcomes data. At the Hospital of the University of Pennsylvania, a Magnet-designated organization, extensive work has been conducted to incorporate nursing-related outcomes in the organization's quality plan and to integrate roles for clinical nurses into the Department of Nursing and organization's core performance-based programs. Content and strategies that promote active involvement of nurses and prepare them to be competent and confident stakeholders in quality initiatives are presented. Engaging clinical nurses in the work of quality and performance improvement is essential to achieving excellence in clinical care. It is important to have structures and processes in place to bring meaningful data to the bedside; however, it is equally important to incorporate outcomes into practice. When nurses are educated about performance and quality measures, are engaged in identifying outcomes and collecting meaningful data, are active participants in disseminating quality reports, and are able to recognize the value of these activities, data become one with practice.

  9. Markov model plus k-word distributions: a synergy that produces novel statistical measures for sequence comparison.

    Science.gov (United States)

    Dai, Qi; Yang, Yanchun; Wang, Tianming

    2008-10-15

    Many proposed statistical measures can efficiently compare biological sequences to further infer their structures, functions and evolutionary information. They are related in spirit because all the ideas for sequence comparison try to use the information on the k-word distributions, Markov model or both. Motivated by adding k-word distributions to Markov model directly, we investigated two novel statistical measures for sequence comparison, called wre.k.r and S2.k.r. The proposed measures were tested by similarity search, evaluation on functionally related regulatory sequences and phylogenetic analysis. This offers the systematic and quantitative experimental assessment of our measures. Moreover, we compared our achievements with these based on alignment or alignment-free. We grouped our experiments into two sets. The first one, performed via ROC (receiver operating curve) analysis, aims at assessing the intrinsic ability of our statistical measures to search for similar sequences from a database and discriminate functionally related regulatory sequences from unrelated sequences. The second one aims at assessing how well our statistical measure is used for phylogenetic analysis. The experimental assessment demonstrates that our similarity measures intending to incorporate k-word distributions into Markov model are more efficient.

  10. TOUR CONSTRUCTION HEURISTICS FOR AN ORDER SEQUENCING PROBLEM

    Directory of Open Access Journals (Sweden)

    De Villiers, A. P.

    2012-11-01

    Full Text Available An order picking system that requires pickers to move in a clockwise direction around a picking line with fixed locations is considered. The problem is divided into three tiers. The tier in which orders must be sequenced is addressed. Eight tour construction heuristics are developed and implemented for an order picking system operating in unidirectional picking lines. Two classes of tour construction heuristics the tour construction starting position ( and the tour construction ending position ( are developed to sequence orders in a picking line. All algorithms are tested and compared using real life data sets. The best solution quality was obtained by a heuristic with adaptations.

  11. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers

    Directory of Open Access Journals (Sweden)

    Quail Michael A

    2012-07-01

    Full Text Available Abstract Background Next generation sequencing (NGS technology has revolutionized genomic and genetic research. The pace of change in this area is rapid with three major new sequencing platforms having been released in 2011: Ion Torrent’s PGM, Pacific Biosciences’ RS and the Illumina MiSeq. Here we compare the results obtained with those platforms to the performance of the Illumina HiSeq, the current market leader. In order to compare these platforms, and get sufficient coverage depth to allow meaningful analysis, we have sequenced a set of 4 microbial genomes with mean GC content ranging from 19.3 to 67.7%. Together, these represent a comprehensive range of genome content. Here we report our analysis of that sequence data in terms of coverage distribution, bias, GC distribution, variant detection and accuracy. Results Sequence generated by Ion Torrent, MiSeq and Pacific Biosciences technologies displays near perfect coverage behaviour on GC-rich, neutral and moderately AT-rich genomes, but a profound bias was observed upon sequencing the extremely AT-rich genome of Plasmodium falciparum on the PGM, resulting in no coverage for approximately 30% of the genome. We analysed the ability to call variants from each platform and found that we could call slightly more variants from Ion Torrent data compared to MiSeq data, but at the expense of a higher false positive rate. Variant calling from Pacific Biosciences data was possible but higher coverage depth was required. Context specific errors were observed in both PGM and MiSeq data, but not in that from the Pacific Biosciences platform. Conclusions All three fast turnaround sequencers evaluated here were able to generate usable sequence. However there are key differences between the quality of that data and the applications it will support.

  12. [Drinking water quality and safety].

    Science.gov (United States)

    Gómez-Gutiérrez, Anna; Miralles, Maria Josepa; Corbella, Irene; García, Soledad; Navarro, Sonia; Llebaria, Xavier

    2016-11-01

    The purpose of drinking water legislation is to guarantee the quality and safety of water intended for human consumption. In the European Union, Directive 98/83/EC updated the essential and binding quality criteria and standards, incorporated into Spanish national legislation by Royal Decree 140/2003. This article reviews the main characteristics of the aforementioned drinking water legislation and its impact on the improvement of water quality against empirical data from Catalonia. Analytical data reported in the Spanish national information system (SINAC) indicate that water quality in Catalonia has improved in recent years (from 88% of analytical reports in 2004 finding drinking water to be suitable for human consumption, compared to 95% in 2014). The improvement is fundamentally attributed to parameters concerning the organoleptic characteristics of water and parameters related to the monitoring of the drinking water treatment process. Two management experiences concerning compliance with quality standards for trihalomethanes and lead in Barcelona's water supply are also discussed. Finally, this paper presents some challenges that, in the opinion of the authors, still need to be incorporated into drinking water legislation. It is necessary to update Annex I of Directive 98/83/EC to integrate current scientific knowledge, as well as to improve consumer access to water quality data. Furthermore, a need to define common criteria for some non-resolved topics, such as products and materials in contact with drinking water and domestic conditioning equipment, has also been identified. Copyright © 2016 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.

  13. Open-pit coal mine production sequencing incorporating grade blending and stockpiling options: An application from an Indian mine

    Science.gov (United States)

    Kumar, Ashish; Chatterjee, Snehamoy

    2017-05-01

    Production scheduling is a crucial aspect of the mining industry. An optimal and efficient production schedule can increase the profits manifold and reduce the amount of waste to be handled. Production scheduling for coal mines is necessary to maintain consistency in the quality and quantity parameters of coal supplied to power plants. Irregularity in the quality parameters of the coal can lead to heavy losses in coal-fired power plants. Moreover, the stockpiling of coal poses environmental and fire problems owing to low incubation periods. This article proposes a production scheduling formulation for open-pit coal mines including stockpiling and blending opportunities, which play a major role in maintaining the quality and quantity of supplied coal. The proposed formulation was applied to a large open-pit coal mine in India. This contribution provides an efficient production scheduling formulation for coal mines after utilizing the stockpile coal within the incubation periods with the maximization of discounted cash flows. At the same time, consistency is maintained in the quality and quantity of coal to power plants through blending and stockpiling options to ensure smooth functioning.

  14. Gene amplification in Chinese hamster embryo cells by the decay of incorporated iodine-125

    International Nuclear Information System (INIS)

    Luecke-Huhle, Christine; Ehrfeld, Angelika; Rau, Waltraud

    1988-01-01

    Simian Virus 40-transformed Chinese hamster embryo cells (Co631) contain 5 viral copies integrated per cell genome. These SV40 sequences were used as an endogenous indicator gene to study response of mammalian cells to radiation at gene level. Cells were internally irradiated by Auger electrons emitted by Iodine-125 which was incorporated in cell DNA in form of 5-[ 125 I] iododeoxyuridine ( 125 IdU). An increase in gene copy number was measured using dispersed cell blotting and Southern analysis in combination with highly sensitive DNA hybridization. A 13-fold amplification of the SV40 sequences and a 2-fold amplification of two cellular oncogenes of the ras family were found. Other cellular genes, like the α-actin gene, are not amplified and no variation in gene copy number was observed after incubation of cells with cold IdU. Thus, specific gene amplification seems to be the consequence of radiation-induced DNA damage and the resulting cell cycle arrest. (author)

  15. Subjective Quality Assessment of H.264/AVC Video Streaming with Packet Losses

    Directory of Open Access Journals (Sweden)

    Naccari Matteo

    2011-01-01

    Full Text Available Research in the field of video quality assessment relies on the availability of subjective scores, collected by means of experiments in which groups of people are asked to rate the quality of video sequences. The availability of subjective scores is fundamental to enable validation and comparative benchmarking of the objective algorithms that try to predict human perception of video quality by automatically analyzing the video sequences, in a way to support reproducible and reliable research results. In this paper, a publicly available database of subjective quality scores and corrupted video sequences is described. The scores refer to 156 sequences at CIF and 4CIF spatial resolutions, encoded with H.264/AVC and corrupted by simulating the transmission over an error-prone network. The subjective evaluation has been performed by 40 subjects at the premises of two academic institutions, in standard-compliant controlled environments. In order to support reproducible research in the field of full-reference, reduced-reference, and no-reference video quality assessment algorithms, both the uncompressed files and the H.264/AVC bitstreams, as well as the packet loss patterns, have been made available to the research community.

  16. [Compatibility of different quality control systems].

    Science.gov (United States)

    Invernizzi, Enrico

    2002-01-01

    Management of the good laboratory practice (GLP) quality system presupposes its linking to a basic recognized and approved quality system, from which it can draw on management procedures common to all quality systems, such as the ISO 9000 set of norms. A quality system organized in this way can also be integrated with other dedicated quality systems, or parts of them, to obtain principles or management procedures for specific topics. The aim of this organization is to set up a reliable, recognized quality system compatible with the principles of GLP and other quality management systems, which provides users with a simplified set of easily accessible management tools and answers. The organization of this quality system is set out in the quality assurance programme, which is actually the document in which the test facility incorporates the GLP principles into its own quality organization.

  17. Nutritional and sensory quality evaluation of sponge cake prepared by incorporation of high dietary fiber containing mango (Mangifera indica var. Chokanan) pulp and peel flours.

    Science.gov (United States)

    Aziah, A A Noor; Min, W Lee; Bhat, Rajeev

    2011-09-01

    Sponge cake prepared by partial substitution of wheat flour with mango pulp and mango peel flours (MPuF and MPeF, respectively) at different concentrations (control, 5%, 10%, 20% or 30%) were investigated for the physico-chemical, nutritional and organoleptic characteristics. Results showed sponge cake incorporated with MPuF and MPeF to have high dietary fiber with low fat, calorie, hydrolysis and predicted glycemic index compared with the control. Increasing the levels of MPuF and MPeF in sponge cake had significant impact on the volume, firmness and color. Sensory evaluation showed sponge cake formulated with 10% MPuF and 10% MPeF to be the most acceptable. MPeF and MPuF have high potential as fiber-rich ingredients and can be utilized in the preparation of cake and other bakery products to improve the nutritional qualities.

  18. Motion Estimation Using the Firefly Algorithm in Ultrasonic Image Sequence of Soft Tissue

    Directory of Open Access Journals (Sweden)

    Chih-Feng Chao

    2015-01-01

    Full Text Available Ultrasonic image sequence of the soft tissue is widely used in disease diagnosis; however, the speckle noises usually influenced the image quality. These images usually have a low signal-to-noise ratio presentation. The phenomenon gives rise to traditional motion estimation algorithms that are not suitable to measure the motion vectors. In this paper, a new motion estimation algorithm is developed for assessing the velocity field of soft tissue in a sequence of ultrasonic B-mode images. The proposed iterative firefly algorithm (IFA searches for few candidate points to obtain the optimal motion vector, and then compares it to the traditional iterative full search algorithm (IFSA via a series of experiments of in vivo ultrasonic image sequences. The experimental results show that the IFA can assess the vector with better efficiency and almost equal estimation quality compared to the traditional IFSA method.

  19. DNA copy number, including telomeres and mitochondria, assayed using next-generation sequencing

    Directory of Open Access Journals (Sweden)

    Jackson Stuart

    2010-04-01

    Full Text Available Abstract Background DNA copy number variations occur within populations and aberrations can cause disease. We sought to develop an improved lab-automatable, cost-efficient, accurate platform to profile DNA copy number. Results We developed a sequencing-based assay of nuclear, mitochondrial, and telomeric DNA copy number that draws on the unbiased nature of next-generation sequencing and incorporates techniques developed for RNA expression profiling. To demonstrate this platform, we assayed UMC-11 cells using 5 million 33 nt reads and found tremendous copy number variation, including regions of single and homogeneous deletions and amplifications to 29 copies; 5 times more mitochondria and 4 times less telomeric sequence than a pool of non-diseased, blood-derived DNA; and that UMC-11 was derived from a male individual. Conclusion The described assay outputs absolute copy number, outputs an error estimate (p-value, and is more accurate than array-based platforms at high copy number. The platform enables profiling of mitochondrial levels and telomeric length. The assay is lab-automatable and has a genomic resolution and cost that are tunable based on the number of sequence reads.

  20. Implementation of Targeted Next Generation Sequencing in Clinical Diagnostics

    DEFF Research Database (Denmark)

    Larsen, Martin Jakob; Burton, Mark; Thomassen, Mads

    Accurate mutation detection is essential in clinical genetic diagnostics of monogenic hereditary diseases. Targeted next generation sequencing (NGS) provides a promising and cost-effective alternative to Sanger sequencing and MLPA analysis currently used in most diagnostic laboratories. One...... of mutation positive controls previously characterized by Sanger/MLPA analysis. Agilent SureSelect Target-Enrichment kits were used for capturing a set of genes associated with hereditary breast and ovarian cancer syndrome and a compilation of genes involved in multiple rare single gene disorders......, respectively. For diagnostics, the sequencing coverage is essential, wherefore a minimum coverage of 30x per nucleotide in the coding regions was used as our primary quality criterion. For the majority of the included genes, we obtained adequate gene coverage, in which we were able to detect 100% of the known...

  1. APPLICABILITY ANALYSIS OF THE PHASE CORRELATION ALGORITHM FOR STABILIZATION OF VIDEO FRAMES SEQUENCES FOR CAPILLARY BLOOD FLOW

    Directory of Open Access Journals (Sweden)

    K. A. Karimov

    2016-05-01

    Full Text Available Videocapillaroscopy is a convenient and non-invasive method of blood flow parameters recovery in the capillaries. Capillaries position can vary at recorded video sequences due to the registration features of capillary blood flow. Stabilization algorithm of video capillary blood flow based on phase correlation is proposed and researched. This algorithm is compared to the known algorithms of video frames stabilization with full-frame superposition and with key points. Programs, based on discussed algorithms, are compared under processing the experimentally recorded video sequences of human capillaries and under processing of computer-simulated sequences of video frames with the specified offset. The full-frame superposition algorithm provides high quality of stabilization; however, the program based on this algorithm requires significant computational resources. Software implementation of the algorithm based on the detection of the key points is characterized by good performance, but provides low quality of stabilization for video sequences capillary blood flow. Algorithm based on phase correlation method provides high quality of stabilization and program realization of this algorithm requires minimal computational resources. It is shown that the phase correlation algorithm is the most useful for stabilization of video sequences for capillaries blood flow. Obtained findings can be used in the software for biomedical diagnostics.

  2. Considerations on Integrating Risk and Quality Management

    Directory of Open Access Journals (Sweden)

    Maria POPESCU

    2011-03-01

    Full Text Available This paper aims to highlight the links between risk management and quality management and to study the possibility of their integrated approach. The study reviews the evolution of risk approach within organizations and stresses the need to increase the effectiveness of this approach by incorporating risk management methodology in the quality management system. Starting from this idea, the authors present the current state of risk approach into quality management, basic rules of integrated quality-risk management and major difficulties which may arise in the implementation of integrated quality–risk systems.

  3. Parsing a perceptual decision into a sequence of moments of thought

    Directory of Open Access Journals (Sweden)

    Martin eGraziano

    2011-09-01

    Full Text Available Theoretical, computational and experimental studies have converged to a model of decision-making in which sensory evidence is stochastically integrated to a threshold, implementing a shift from an analog to a discrete form of computation. Understanding how this process can be chained and sequenced - as virtually all real-life tasks involve a sequence of decisions - remains an open question in neuroscience. We reasoned that incorporating a virtual continuum of possible behavioral outcomes in a simple decision task- a fundamental ingredient of real-life decision making – should result in a progressive sequential approximation to the correct response. We used real-time tracking of motor action in a decision task, as a measure of cognitive states reflecting an internal decision process. We found that response trajectories were spontaneously segmented into a discrete sequence of explorations separated by brief stops (about 200 ms – which remained unconscious to the participants. The characteristics of these stops were indicative of a decision process - a moment of thought: their duration correlated with the difficulty of the decision and with the efficiency of the subsequent exploration. Our findings suggest that simple navigation in an abstract space involves a discrete sequence of explorations and stops and, moreover, that these stops reveal a fingerprint of moments of thought.

  4. Social economy of quality food

    NARCIS (Netherlands)

    Otsuki, K.

    2014-01-01

    Purpose – This paper aims to examine the implications of the efforts to promote a quality-oriented economy that incorporates a vision of environmental sustainability and equitable social development. Design/methodology/approach – The analysis builds on a case study of food procurement in Brazil,

  5. Evaluation of finger millet incorporated noodles for nutritive value and glycemic index

    OpenAIRE

    Shukla, Kamini; Srivastava, Sarita

    2011-01-01

    The present study was undertaken to develop finger millet incorporated noodles for diabetic patients. Finger millet variety VL-149 was taken. The finger millet flour and refined wheat flour (RWF) were evaluated for nutrient composition. The finger millet flour (FMF) was blended in various proportions (30 to 50%) in refined wheat flour and used for the preparation of noodles. Control consisted of RWF noodles. Sensory quality and nutrient composition of finger millet noodles was evaluated. The ...

  6. Whole-Genome Sequencing of Lactobacillus salivarius Strains BCRC 14759 and BCRC 12574.

    Science.gov (United States)

    Chiu, Shih-Hau; Chen, Chien-Chi; Wang, Li-Ting; Huang, Lina

    2017-11-22

    Lactobacillus salivarius BCRC 14759 has been identified as a high-exopolysaccharide-producing strain with potential as a probiotic or fermented dairy product. Here, we report the genome sequences of L. salivarius BCRC 14759 and the comparable strain BCRC 12574, isolated from human saliva. The PacBio RSII sequencing platform was used to obtain high-quality assemblies for characterization of this probiotic candidate. Copyright © 2017 Chiu et al.

  7. Genome Sequences of 19 Novel Erwinia amylovora Bacteriophages.

    Science.gov (United States)

    Esplin, Ian N D; Berg, Jordan A; Sharma, Ruchira; Allen, Robert C; Arens, Daniel K; Ashcroft, Cody R; Bairett, Shannon R; Beatty, Nolan J; Bickmore, Madeline; Bloomfield, Travis J; Brady, T Scott; Bybee, Rachel N; Carter, John L; Choi, Minsey C; Duncan, Steven; Fajardo, Christopher P; Foy, Brayden B; Fuhriman, David A; Gibby, Paul D; Grossarth, Savannah E; Harbaugh, Kala; Harris, Natalie; Hilton, Jared A; Hurst, Emily; Hyde, Jonathan R; Ingersoll, Kayleigh; Jacobson, Caitlin M; James, Brady D; Jarvis, Todd M; Jaen-Anieves, Daniella; Jensen, Garrett L; Knabe, Bradley K; Kruger, Jared L; Merrill, Bryan D; Pape, Jenny A; Payne Anderson, Ashley M; Payne, David E; Peck, Malia D; Pollock, Samuel V; Putnam, Micah J; Ransom, Ethan K; Ririe, Devin B; Robinson, David M; Rogers, Spencer L; Russell, Kerri A; Schoenhals, Jonathan E; Shurtleff, Christopher A; Simister, Austin R; Smith, Hunter G; Stephenson, Michael B; Staley, Lyndsay A; Stettler, Jason M; Stratton, Mallorie L; Tateoka, Olivia B; Tatlow, P J; Taylor, Alexander S; Thompson, Suzanne E; Townsend, Michelle H; Thurgood, Trever L; Usher, Brittian K; Whitley, Kiara V; Ward, Andrew T; Ward, Megan E H; Webb, Charles J; Wienclaw, Trevor M; Williamson, Taryn L; Wells, Michael J; Wright, Cole K; Breakwell, Donald P; Hope, Sandra; Grose, Julianne H

    2017-11-16

    Erwinia amylovora is the causal agent of fire blight, a devastating disease affecting some plants of the Rosaceae family. We isolated bacteriophages from samples collected from infected apple and pear trees along the Wasatch Front in Utah. We announce 19 high-quality complete genome sequences of E. amylovora bacteriophages. Copyright © 2017 Esplin et al.

  8. Lipid Oxidation, Color Changes, and Microbiological Quality of Frozen Beef Burgers Incorporated with Shirazi Thyme, Cinnamon, and Rosemary Extracts

    Directory of Open Access Journals (Sweden)

    Hadi Hashemi Gahruie

    2017-01-01

    Full Text Available In this study, the oxidative stability of beef burgers incorporated with Shirazi thyme, cinnamon, and rosemary extracts was compared with that of BHT-incorporated and antioxidant-free samples. The chemical composition, TBARS, metmyoglobin, pH, color, and microbial and sensory characteristics were evaluated during storage at −18°C for 2 months. The results indicated that Shirazi thyme and cinnamon extracts did not change the colorimetric properties significantly (P BHT > Shirazi thyme > rosemary > control. Finally, the results showed that these plant extracts can be utilized as an alternative to synthetic antioxidants in formulation of burgers.

  9. Universal sequence map (USM of arbitrary discrete sequences

    Directory of Open Access Journals (Sweden)

    Almeida Jonas S

    2002-02-01

    Full Text Available Abstract Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM, is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR. The latter enables the representation of 4 unit type sequences (like DNA as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules.

  10. Retrospective comparison of three-dimensional imaging sequences in the visualization of posterior fossa cranial nerves.

    Science.gov (United States)

    Ors, Suna; Inci, Ercan; Turkay, Rustu; Kokurcan, Atilla; Hocaoglu, Elif

    2017-12-01

    To compare efficancy of three-dimentional SPACE (sampling perfection with application-optimized contrasts using different flip-angle evolutions) and CISS (constructive interference in steady state) sequences in the imaging of the cisternal segments of cranial nerves V-XII. Temporal MRI scans from 50 patients (F:M ratio, 27:23; mean age, 44.5±15.9 years) admitted to our hospital with vertigo, tinnitus, and hearing loss were retrospectively analyzed. All patients had both CISS and SPACE sequences. Quantitative analysis of SPACE and CISS sequences was performed by measuring the ventricle-to-parenchyma contrast-to-noise ratio (CNR). Qualitative analysis of differences in visualization capability, image quality, and severity of artifacts was also conducted. A score ranging 'no artefact' to 'severe artefacts and unreadable' was used for the assessment of artifacts and from 'not visualized' to 'completely visualized' for the assesment of image quality, respectively. The distribution of variables was controlled by the Kolmogorov-Smirnov test. Samples t-test and McNemar's test were used to determine statistical significance. Rates of visualization of posterior fossa cranial nerves in cases of complete visualization were as follows: nerve V (100% for both sequences), nerve VI (94% in SPACE, 86% in CISS sequences), nerves VII-VIII (100% for both sequences), IX-XI nerve complex (96%, 88%); nerve XII (58%, 46%) (p<0.05). SPACE sequences showed fewer artifacts than CISS sequences (p<0.002). Copyright © 2017 Elsevier B.V. All rights reserved.

  11. Incorporation of local dependent reliability information into the Prior Image Constrained Compressed Sensing (PICCS) reconstruction algorithm

    International Nuclear Information System (INIS)

    Vaegler, Sven; Sauer, Otto; Stsepankou, Dzmitry; Hesser, Juergen

    2015-01-01

    The reduction of dose in cone beam computer tomography (CBCT) arises from the decrease of the tube current for each projection as well as from the reduction of the number of projections. In order to maintain good image quality, sophisticated image reconstruction techniques are required. The Prior Image Constrained Compressed Sensing (PICCS) incorporates prior images into the reconstruction algorithm and outperforms the widespread used Feldkamp-Davis-Kress-algorithm (FDK) when the number of projections is reduced. However, prior images that contain major variations are not appropriately considered so far in PICCS. We therefore propose the partial-PICCS (pPICCS) algorithm. This framework is a problem-specific extension of PICCS and enables the incorporation of the reliability of the prior images additionally. We assumed that the prior images are composed of areas with large and small deviations. Accordingly, a weighting matrix considered the assigned areas in the objective function. We applied our algorithm to the problem of image reconstruction from few views by simulations with a computer phantom as well as on clinical CBCT projections from a head-and-neck case. All prior images contained large local variations. The reconstructed images were compared to the reconstruction results by the FDK-algorithm, by Compressed Sensing (CS) and by PICCS. To show the gain of image quality we compared image details with the reference image and used quantitative metrics (root-mean-square error (RMSE), contrast-to-noise-ratio (CNR)). The pPICCS reconstruction framework yield images with substantially improved quality even when the number of projections was very small. The images contained less streaking, blurring and inaccurately reconstructed structures compared to the images reconstructed by FDK, CS and conventional PICCS. The increased image quality is also reflected in large RMSE differences. We proposed a modification of the original PICCS algorithm. The pPICCS algorithm

  12. Quality assurance in ionizing radiation application

    International Nuclear Information System (INIS)

    Rastkhah; Nasser.

    1995-01-01

    Quality assurance is a mean for controlling all the activities within an organization which affect the quality of the product or service. A series of international standards have been prepared which incorporate the accumulated knowledge and provide guidance on what activities within an organization should be controlled. A proposal on a quality assurance system to be implemented in ionizing radiation application centers is the primary concern of Atomic Energy Organization of Iran is represented. The Objectives were identification of quality related problems ;Comply with national and international requirements ;Controlling all activities within an organization which affects the quality and assurance of maintaining the quality within organization. In performing protection measures, risk, cost, benefit consideration, cause of problems and the classic solution are summarized in four chapters

  13. Numeral Incorporation in Japanese Sign Language

    Science.gov (United States)

    Ktejik, Mish

    2013-01-01

    This article explores the morphological process of numeral incorporation in Japanese Sign Language. Numeral incorporation is defined and the available research on numeral incorporation in signed language is discussed. The numeral signs in Japanese Sign Language are then introduced and followed by an explanation of the numeral morphemes which are…

  14. Automated monitoring of recovered water quality

    Science.gov (United States)

    Misselhorn, J. E.; Hartung, W. H.; Witz, S. W.

    1974-01-01

    Laboratory prototype water quality monitoring system provides automatic system for online monitoring of chemical, physical, and bacteriological properties of recovered water and for signaling malfunction in water recovery system. Monitor incorporates whenever possible commercially available sensors suitably modified.

  15. PETRA, MSVAT-SPACE and SEMAC sequences for metal artefact reduction in dental MR imaging

    International Nuclear Information System (INIS)

    Hilgenfeld, Tim; Heil, Alexander; Bendszus, Martin; Prager, Marcel; Heiland, Sabine; Schwindling, Franz Sebastian; Rammelsberg, Peter; Nittka, Mathias; Grodzki, David

    2017-01-01

    Dental MRI is often impaired by artefacts due to metallic dental materials. Several sequences were developed to reduce susceptibility artefacts. Here, we evaluated a set of sequences for artefact reduction for dental MRI for the first time. Artefact volume, signal-to-noise ratio (SNR) and image quality were assessed on a 3-T MRI for pointwise encoding time reduction with radial acquisition (PETRA), multiple-slab acquisition with view angle tilting gradient, based on a sampling perfection with application-optimised contrasts using different flip angle evolution (SPACE) sequence (MSVAT-SPACE), slice-encoding for metal-artefact correction (SEMAC) and compared to a standard SPACE and a standard turbo-spin-echo (TSE) sequence. Field-of-view and acquisition times were chosen to enable in vivo application. Two implant-supported prostheses were tested (porcelain fused to metal non-precious alloy and monolithic zirconia). Smallest artefact was measured for TSE sequences with no difference between the standard TSE and the SEMAC. MSVAT-SPACE reduced artefacts about 56% compared to the standard SPACE. Effect of the PETRA was dependent on sample used. Image quality and SNR were comparable for all sequences except PETRA, which yielded poor results. There is no benefit in terms of artefact reduction for SEMAC compared to standard TSE. Usage of MSVAT-SPACE is advantageous since artefacts are reduced and higher resolution is achieved. (orig.)

  16. PETRA, MSVAT-SPACE and SEMAC sequences for metal artefact reduction in dental MR imaging

    Energy Technology Data Exchange (ETDEWEB)

    Hilgenfeld, Tim; Heil, Alexander; Bendszus, Martin [Heidelberg University Hospital, Department of Neuroradiology, Heidelberg (Germany); Prager, Marcel; Heiland, Sabine [Heidelberg University Hospital, Department of Neuroradiology, Heidelberg (Germany); Heidelberg University Hospital, Section of Experimental Radiology, Heidelberg (Germany); Schwindling, Franz Sebastian; Rammelsberg, Peter [Heidelberg University Hospital, Department of Prosthodontics, Heidelberg (Germany); Nittka, Mathias; Grodzki, David [Siemens Healthcare GmbH, Erlangen (Germany)

    2017-12-15

    Dental MRI is often impaired by artefacts due to metallic dental materials. Several sequences were developed to reduce susceptibility artefacts. Here, we evaluated a set of sequences for artefact reduction for dental MRI for the first time. Artefact volume, signal-to-noise ratio (SNR) and image quality were assessed on a 3-T MRI for pointwise encoding time reduction with radial acquisition (PETRA), multiple-slab acquisition with view angle tilting gradient, based on a sampling perfection with application-optimised contrasts using different flip angle evolution (SPACE) sequence (MSVAT-SPACE), slice-encoding for metal-artefact correction (SEMAC) and compared to a standard SPACE and a standard turbo-spin-echo (TSE) sequence. Field-of-view and acquisition times were chosen to enable in vivo application. Two implant-supported prostheses were tested (porcelain fused to metal non-precious alloy and monolithic zirconia). Smallest artefact was measured for TSE sequences with no difference between the standard TSE and the SEMAC. MSVAT-SPACE reduced artefacts about 56% compared to the standard SPACE. Effect of the PETRA was dependent on sample used. Image quality and SNR were comparable for all sequences except PETRA, which yielded poor results. There is no benefit in terms of artefact reduction for SEMAC compared to standard TSE. Usage of MSVAT-SPACE is advantageous since artefacts are reduced and higher resolution is achieved. (orig.)

  17. Effect of inulin on dough and biscuit quality produced from different flours

    Directory of Open Access Journals (Sweden)

    Maria S. Blanco Canalis

    2017-04-01

    Full Text Available One of the methods of improving the nutritional profile of baked-goods is the incorporation of dietary fibre (DF to the formulation. However, DF retains more water than wheat flour, which affects dough rheological properties and thus the final product quality. Flour is the main ingredient in biscuits and contributes to the baked texture and shape of biscuits. There are large differences in quality among the flours from wheat cultivars and among non-wheat flours used for biscuit production. In previous works, different dietary fibres were tested and inulin was found to have a positive quality  effect on biscuits made from an all-purpose wheat flour without the need of introducing significant changes in dough properties. To know whether these results are extended to different types of flours that can be used for biscuit production, this paper aimed to analyse the effects of different levels of inulin incorporation on dough and biscuits quality made from three different wheat flours and one triticale flour. Chemical and physic-chemical characterization of the flour samples was performed and compared (moisture, ashes, gluten, solvent retention capacity (SRC and protein content. Inulin was incorporated to biscuits formulation in two levels: 6 and 12% (wheat replacement. Dough and biscuits quality was measured by spread rate during baking and biscuit factor, texture and surface colour. Inulin incorporation increased dough spreading and biscuit factor and decreased biscuit hardness. The enhancing effect of inulin was observed in all flour. The results confirmed the effect of inulin on biscuits obtained with all flour assayed, despite the differences in flour quality that was tested.

  18. Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

    Science.gov (United States)

    Deroost, Natacha; Coomans, Daphné

    2018-02-01

    We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. A survey on the quantitative incorporation organizational factors into PSA

    International Nuclear Information System (INIS)

    Park, S. Z.; Jea, M. S.; Ahn, N. S.

    2002-01-01

    The effects of organizational factors on the human performance and safety in nuclear power plants have been known through the results of research for several years. The organizational factor, which belongs to 11 elements of PSR (Periodic Safety Review), has been an important research area. In this study the state-of-the-art of qualitative and quantitative evaluation methodologies on organizational factors has been surveyed. The results of this study may contribute to developing a quantitative evaluation methodology on organizational factors as well as the basic research for conducting the PSR research, and for incorporating the quality of organization factors into PSA

  20. Contrast-enhanced Magnetic Resonance Imaging of Pelvic Bone Metastases at 3.0 T: Comparison Between 3-dimensional T1-weighted CAIPIRINHA-VIBE Sequence and 2-dimensional T1-weighted Turbo Spin-Echo Sequence.

    Science.gov (United States)

    Yoon, Min A; Hong, Suk-Joo; Lee, Kyu-Chong; Lee, Chang Hee

    2018-06-12

    This study aimed to compare 3-dimensional T1-weighted gradient-echo sequence (CAIPIRINHA-volumetric interpolated breath-hold examination [VIBE]) with 2-dimensional T1-weighted turbo spin-echo sequence for contrast-enhanced magnetic resonance imaging (MRI) of pelvic bone metastases at 3.0 T. Thirty-one contrast-enhanced MRIs of pelvic bone metastases were included. Two contrast-enhanced sequences were evaluated for the following parameters: overall image quality, sharpness of pelvic bone, iliac vessel clarity, artifact severity, and conspicuity and edge sharpness of the smallest metastases. Quantitative analysis was performed by calculating signal-to-noise ratio and contrast-to-noise ratio of the smallest metastases. Significant differences between the 2 sequences were assessed. CAIPIRINHA-VIBE had higher scores for overall image quality, pelvic bone sharpness, iliac vessel clarity, and edge sharpness of the metastatic lesions, and had less artifacts (all P 0.05). Our results suggest that CAIPIRINHA-VIBE may be superior to turbo spin-echo for contrast-enhanced MRI of pelvic bone metastases at 3.0 T.

  1. Centroid based clustering of high throughput sequencing reads based on n-mer counts.

    Science.gov (United States)

    Solovyov, Alexander; Lipkin, W Ian

    2013-09-08

    Many problems in computational biology require alignment-free sequence comparisons. One of the common tasks involving sequence comparison is sequence clustering. Here we apply methods of alignment-free comparison (in particular, comparison using sequence composition) to the challenge of sequence clustering. We study several centroid based algorithms for clustering sequences based on word counts. Study of their performance shows that using k-means algorithm with or without the data whitening is efficient from the computational point of view. A higher clustering accuracy can be achieved using the soft expectation maximization method, whereby each sequence is attributed to each cluster with a specific probability. We implement an open source tool for alignment-free clustering. It is publicly available from github: https://github.com/luscinius/afcluster. We show the utility of alignment-free sequence clustering for high throughput sequencing analysis despite its limitations. In particular, it allows one to perform assembly with reduced resources and a minimal loss of quality. The major factor affecting performance of alignment-free read clustering is the length of the read.

  2. Study on the physico-chemical properties, sensory attributes and shelf life of pineapple powder incorporated biscuits

    Directory of Open Access Journals (Sweden)

    M. Thivani

    2016-12-01

    Full Text Available The bakery industry is growing very fast and the products are becoming increasingly popular among consumers world-wide. Among ready-to-eat snacks, biscuits possess several attractive features including wider consumption base, relatively long shelf-life and preferred eating quality. Pineapples constitute important natural and valuable material in producing functional foods due to the presence of several anti-oxidants and bioactive compounds. Therefore, the aim of the present study was to investigate the nutritional composition and sensory quality of wheat flour biscuits incorporated with pineapple powder at the rates of 3, 5, 10, and 15% (w/w basis. The protein, fat, total sugar, moisture and mineral contents were significantly (p<0.05 changed during storage. Nutritional analysis indicated that biscuits incorporated with 5% pineapple powder had a slow rate of increase in moisture content (4.19 to 5.0% and decreasing trend in protein (22.15 to 22.06%, fat (18.83 to 18.79%, total sugar (58.52 to 58.34% and mineral contents (2.212 to 1.591% compared to other treatments during storage. Among the treatments, the biscuits prepared with 5% pineapple powder had the highest nutritional and sensory quality, having the overall acceptability score of 7.7 in a 9-point hedonic scale. The shelf life evaluation showed that these biscuits could be stored for 6 weeks at the ambient conditions of average temperature at 301ºC and RH at 75-80% with acceptable quality.

  3. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing.

    Directory of Open Access Journals (Sweden)

    Sarah M Hykin

    Full Text Available For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles, attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens-particularly for use in phylogenetic analyses-has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp. We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens

  4. Fixing Formalin: A Method to Recover Genomic-Scale DNA Sequence Data from Formalin-Fixed Museum Specimens Using High-Throughput Sequencing.

    Science.gov (United States)

    Hykin, Sarah M; Bi, Ke; McGuire, Jimmy A

    2015-01-01

    For 150 years or more, specimens were routinely collected and deposited in natural history collections without preserving fresh tissue samples for genetic analysis. In the case of most herpetological specimens (i.e. amphibians and reptiles), attempts to extract and sequence DNA from formalin-fixed, ethanol-preserved specimens-particularly for use in phylogenetic analyses-has been laborious and largely ineffective due to the highly fragmented nature of the DNA. As a result, tens of thousands of specimens in herpetological collections have not been available for sequence-based phylogenetic studies. Massively parallel High-Throughput Sequencing methods and the associated bioinformatics, however, are particularly suited to recovering meaningful genetic markers from severely degraded/fragmented DNA sequences such as DNA damaged by formalin-fixation. In this study, we compared previously published DNA extraction methods on three tissue types subsampled from formalin-fixed specimens of Anolis carolinensis, followed by sequencing. Sufficient quality DNA was recovered from liver tissue, making this technique minimally destructive to museum specimens. Sequencing was only successful for the more recently collected specimen (collected ~30 ybp). We suspect this could be due either to the conditions of preservation and/or the amount of tissue used for extraction purposes. For the successfully sequenced sample, we found a high rate of base misincorporation. After rigorous trimming, we successfully mapped 27.93% of the cleaned reads to the reference genome, were able to reconstruct the complete mitochondrial genome, and recovered an accurate phylogenetic placement for our specimen. We conclude that the amount of DNA available, which can vary depending on specimen age and preservation conditions, will determine if sequencing will be successful. The technique described here will greatly improve the value of museum collections by making many formalin-fixed specimens available for

  5. Leaching of Heavy Metals Using SPLP Method from Fired Clay Brick Incorporating with Sewage Sludge

    Science.gov (United States)

    Kadir, Aeslina Abdul; Salim, Nurul Salhana Abdul; Amira Sarani, Noor; Aqma Izurin Rahmat, Nur

    2017-05-01

    Sewage sludge is a by-product generate from wastewater treatment process. The sewage sludge contains significant trace metal such as Cr, Mn, Ni, Cu, Zn, As, Cd and Pb which are toxic to the environment. Sewage sludge is disposed of by landfilling method. However, this option not suitable because of land restriction and environmental control regulations imposed. Therefore, sewage sludge from wastewater treatment plant was incorporated into fired clay brick to produce good quality of brick as well as reducing heavy metals from sludge itself. Sewage sludge with 0%, 1%, 5%, 10% and 20% of were incorporated into fired clay bricks and fired at 1050°C temperature with heating rates of 1°C/min. The brick sample then crushed and sieved through 9.5 mm sieve for Synthetic Precipitation Leaching Procedure (SPLP). From the results, incorporation up to 20% of sewage sludge has leached less heavy metals and compliance with USEPA standard.

  6. Ultra-fast sequence clustering from similarity networks with SiLiX

    Directory of Open Access Journals (Sweden)

    Duret Laurent

    2011-04-01

    Full Text Available Abstract Background The number of gene sequences that are available for comparative genomics approaches is increasing extremely quickly. A current challenge is to be able to handle this huge amount of sequences in order to build families of homologous sequences in a reasonable time. Results We present the software package SiLiX that implements a novel method which reconsiders single linkage clustering with a graph theoretical approach. A parallel version of the algorithms is also presented. As a demonstration of the ability of our software, we clustered more than 3 millions sequences from about 2 billion BLAST hits in 7 minutes, with a high clustering quality, both in terms of sensitivity and specificity. Conclusions Comparing state-of-the-art software, SiLiX presents the best up-to-date capabilities to face the problem of clustering large collections of sequences. SiLiX is freely available at http://lbbe.univ-lyon1.fr/SiLiX.

  7. MR imaging of the orbit and eye using inversion recovery sequences

    International Nuclear Information System (INIS)

    Smith, F.W.; Parekh, S.; Forrester, J.; Redpath, T.W.

    1986-01-01

    Most centers performing MR imaging use spin-echo sequences to produce images; however, there are many advantages to using short TI inversion-recovery sequences for examination of the orbits. By selecting a TI similar to the relaxation time of any structure, the signal from this can be suppressed, thereby enhancing the signal from other structures. Using a sequence of TR = 1,000 msec and TI of less than 200 msec, the signal from fat is suppressed, improving image quality adjacent to the surface coil and providing better contrast between orbital structures and fat. The use of this short TI sequence for the examination of the eye in patients with opaque lenses is an accurate method of diagnosis since the sequence enhances the signal from both long T1 and T2 lesions. Eighty-five patients with orbital or ocular pathology have been studied, and the results demonstrate the usefulness of this technique for diagnosis

  8. Inhibition of Listeria monocytogenes ATCC 19115 on ham steak by tea bioactive compounds incorporated into chitosan-coated plastic films

    Directory of Open Access Journals (Sweden)

    Vodnar Dan C

    2012-07-01

    Full Text Available Abstract Background The consumer demands for better quality and safety of food products have given rise to the development and implementation of edible films. The use of antimicrobial films can be a promising tool for controlling L. monocytogenes on ready to eat products. The aim of this study was to develop effective antimicrobial films incorporating bioactive compounds from green and black teas into chitosan, for controlling L. monocytogenes ATCC 19115 on vacuum-packaged ham steak. The effectiveness of these antimicrobial films was evaluated at room temperature (20°C for 10 days and at refrigerated temperature (4°C for 8 weeks. Results The HPLC results clearly show that relative concentrations of catechins and caffeine in green tea ranked EGCG>EGC>CAF>ECG>EC>C while in black tea extracts ranked CAF>EGCG>ECG>EGC>EC>C. The chitosan-coated plastic films incorporating green tea and black tea extracts shows specific markers identified by FTIR. Incorporating natural extracts into chitosan showed that the growth of L monocytogenes ATCC 19115 was inhibited. The efficacy of antimicrobial effect of tea extracts incorporated into chitosan-coated plastic film was dose dependent. However, chitosan-coated films without addition of tea extracts did not inhibit the growth of L. monocytogenes ATCC 19115. Chitosan-coated plastic films incorporating 4% Green tea extract was the most effective antimicrobial, reducing the initial counts from 3.2 to 2.65 log CFU/cm2 during room temperature storage and from 3.2 to 1–1.5 log CFU/cm2 during refrigerated storage. Conclusions Incorporation of tea extracts into the chitosan-coated films considerably enhanced their effectiveness against L. monocytogenes ATCC 19115. 4% Green tea incorporated into chitosan-coated plastic film had a better antilisterial effect than 2% green tea or 2% and 4% black tea. Data from this study would provide new formulation options for developing antimicrobial packaging films using tea

  9. Mission Management Computer and Sequencing Hardware for RLV-TD HEX-01 Mission

    Science.gov (United States)

    Gupta, Sukrat; Raj, Remya; Mathew, Asha Mary; Koshy, Anna Priya; Paramasivam, R.; Mookiah, T.

    2017-12-01

    Reusable Launch Vehicle-Technology Demonstrator Hypersonic Experiment (RLV-TD HEX-01) mission posed some unique challenges in the design and development of avionics hardware. This work presents the details of mission critical avionics hardware mainly Mission Management Computer (MMC) and sequencing hardware. The Navigation, Guidance and Control (NGC) chain for RLV-TD is dual redundant with cross-strapped Remote Terminals (RTs) interfaced through MIL-STD-1553B bus. MMC is Bus Controller on the 1553 bus, which does the function of GPS aided navigation, guidance, digital autopilot and sequencing for the RLV-TD launch vehicle in different periodicities (10, 20, 500 ms). Digital autopilot execution in MMC with a periodicity of 10 ms (in ascent phase) is introduced for the first time and successfully demonstrated in the flight. MMC is built around Intel i960 processor and has inbuilt fault tolerance features like ECC for memories. Fault Detection and Isolation schemes are implemented to isolate the failed MMC. The sequencing hardware comprises Stage Processing System (SPS) and Command Execution Module (CEM). SPS is `RT' on the 1553 bus which receives the sequencing and control related commands from MMCs and posts to downstream modules after proper error handling for final execution. SPS is designed as a high reliability system by incorporating various fault tolerance and fault detection features. CEM is a relay based module for sequence command execution.

  10. Correction of echo shift in reconstruction processing for ultra-short TE pulse sequence

    International Nuclear Information System (INIS)

    Takizawa, Masahiro; Ootsuka, Takehiro; Abe, Takayuki; Takahashi, Tetsuhiko

    2010-01-01

    An ultra-short echo time (TE) pulse sequence is composed of a radial sampling that acquires echo signals radially in the K-space and a half-echo acquisition that acquires only half of the echo signal. The shift in the position of the echo signal (echo shift) caused by the timing errors in the gradient magnetic field pulses affects the image quality in the radial sampling with the half-echo acquisition. To improve image quality, we have developed a signal correction algorithm that detects and eliminates this echo shift during reconstruction by performing a pre-scan within 10 seconds. The results showed that image quality is improved under oblique and/or off-centering conditions that frequently cause image distortion due to hardware error. In conclusion, we have developed a robust ultra-short TE pulse sequence that allows wide latitude in the scan parameters, including oblique and off-centering conditions. (author)

  11. MBGD update 2015: microbial genome database for flexible ortholog analysis utilizing a diverse set of genomic data.

    Science.gov (United States)

    Uchiyama, Ikuo; Mihara, Motohiro; Nishide, Hiroyo; Chiba, Hirokazu

    2015-01-01

    The microbial genome database for comparative analysis (MBGD) (available at http://mbgd.genome.ad.jp/) is a comprehensive ortholog database for flexible comparative analysis of microbial genomes, where the users are allowed to create an ortholog table among any specified set of organisms. Because of the rapid increase in microbial genome data owing to the next-generation sequencing technology, it becomes increasingly challenging to maintain high-quality orthology relationships while allowing the users to incorporate the latest genomic data available into an analysis. Because many of the recently accumulating genomic data are draft genome sequences for which some complete genome sequences of the same or closely related species are available, MBGD now stores draft genome data and allows the users to incorporate them into a user-specific ortholog database using the MyMBGD functionality. In this function, draft genome data are incorporated into an existing ortholog table created only from the complete genome data in an incremental manner to prevent low-quality draft data from affecting clustering results. In addition, to provide high-quality orthology relationships, the standard ortholog table containing all the representative genomes, which is first created by the rapid classification program DomClust, is now refined using DomRefine, a recently developed program for improving domain-level clustering using multiple sequence alignment information. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing.

    Science.gov (United States)

    Feehan, Joanna M; Scheibel, Katherine E; Bourras, Salim; Underwood, William; Keller, Beat; Somerville, Shauna C

    2017-03-31

    The powdery mildew fungi are a group of economically important fungal plant pathogens. Relatively little is known about the molecular biology and genetics of these pathogens, in part due to a lack of well-developed genetic and genomic resources. These organisms have large, repetitive genomes, which have made genome sequencing and assembly prohibitively difficult. Here, we describe methods for the collection, extraction, purification and quality control assessment of high molecular weight genomic DNA from one powdery mildew species, Golovinomyces cichoracearum. The protocol described includes mechanical disruption of spores followed by an optimized phenol/chloroform genomic DNA extraction. A typical yield was 7 µg DNA per 150 mg conidia. The genomic DNA that is isolated using this procedure is suitable for long-read sequencing (i.e., > 48.5 kbp). Quality control measures to ensure the size, yield, and purity of the genomic DNA are also described in this method. Sequencing of the genomic DNA of the quality described here will allow for the assembly and comparison of multiple powdery mildew genomes, which in turn will lead to a better understanding and improved control of this agricultural pathogen.

  13. 200 area liquid effluent facility quality assurance program plan. Revision 1

    International Nuclear Information System (INIS)

    Sullivan, N.J.

    1995-01-01

    Direct revision of Supporting Document WHC-SD-LEF-QAPP-001, Rev. 0. 200 Area Liquid Effluent Facilities Quality Assurance Program Plan. Incorporates changes to references in tables. Revises test to incorporate WHC-SD-LEF-CSCM-001, Computer Software Configuration Management Plan for 200 East/West Liquid Effluent Facilities

  14. A platform-independent method for detecting errors in metagenomic sequencing data: DRISEE.

    Directory of Open Access Journals (Sweden)

    Kevin P Keegan

    Full Text Available We provide a novel method, DRISEE (duplicate read inferred sequencing error estimation, to assess sequencing quality (alternatively referred to as "noise" or "error" within and/or between sequencing samples. DRISEE provides positional error estimates that can be used to inform read trimming within a sample. It also provides global (whole sample error estimates that can be used to identify samples with high or varying levels of sequencing error that may confound downstream analyses, particularly in the case of studies that utilize data from multiple sequencing samples. For shotgun metagenomic data, we believe that DRISEE provides estimates of sequencing error that are more accurate and less constrained by technical limitations than existing methods that rely on reference genomes or the use of scores (e.g. Phred. Here, DRISEE is applied to (non amplicon data sets from both the 454 and Illumina platforms. The DRISEE error estimate is obtained by analyzing sets of artifactual duplicate reads (ADRs, a known by-product of both sequencing platforms. We present DRISEE as an open-source, platform-independent method to assess sequencing error in shotgun metagenomic data, and utilize it to discover previously uncharacterized error in de novo sequence data from the 454 and Illumina sequencing platforms.

  15. Total quality management in the hospital setting.

    Science.gov (United States)

    Ernst, D F

    1994-01-01

    With the increasing demands on hospitals for improved quality and lower costs, hospitals have been forced to reevaluate their manner of operation and quality assurance (QA) programs. Hospitals have been faced with customer dissatisfaction with services, escalating costs, intense competition, and reduced reimbursement for services. As a result, many hospitals have incorporated total quality management (TQM), also known as continuous quality improvement (CQI) and quality improvement (QI), to improve quality care and decrease costs. This article examines the concept of TQM, its rationale, and how it can be implemented in a hospital. A comparison of TQM and QA is made. Examples of hospital implementation of TQM and problems and issues associated with TQM in the hospital setting are explored.

  16. Facial Video based Detection of Physical Fatigue for Maximal Muscle Activity

    DEFF Research Database (Denmark)

    Haque, Mohammad Ahsanul; Irani, Ramin; Nasrollahi, Kamal

    2016-01-01

    the challenges originates from realistic sce-nario. A face quality assessment system was also incorporated in the proposed system to reduce erroneous results by discarding low quality faces that occurred in a video sequence due to problems in realistic lighting, head motion and pose variation. Experimental...

  17. Quality assurance

    International Nuclear Information System (INIS)

    1996-01-01

    The main efforts of Nuclear Regulatory Authority of the Slovak Republic (NRA SR) was focused on support of quality assurance programmes development at responsible organizations Bohunice V-1 and V-v and Mochovce NPPs and their inspection. Development of the level two documentation of a partial quality assurance programme for NPP operation continued at Mochovce NPP. Most of documentation has been submitted to NRA SR for comments and approval. NRA SR invited a mission of French experts to Mochovce NPP to review preparation and performance of internal audits that would be beneficial for improvement in this kind activities at the NPP. Bohunice NPP continued in development of a partial quality assurance programme for operation. The Quality Assurance Programme submitted to NRA SR for approval. Based on a request of Bohunice NPPs, NRA SR consulted the draft quality assurance programme developed by Siemens for stage of the 'Basic Design' of V-1 NPP upgrading. The programme had not been submitted for approval to NRA SR prior to completion of works by Siemens. Based on an internal audit that had been performed, corrective measures were proposed to meet requirements on review and approval of suppliers quality assurance programmes. Requirements related to the quality assurance at nuclear installations were prepared to be incorporated into principles of a act on peaceful use of nuclear power in Slovak Republic

  18. Technical Report: Algorithm and Implementation for Quasispecies Abundance Inference with Confidence Intervals from Metagenomic Sequence Data

    Energy Technology Data Exchange (ETDEWEB)

    McLoughlin, Kevin [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2016-01-11

    This report describes the design and implementation of an algorithm for estimating relative microbial abundances, together with confidence limits, using data from metagenomic DNA sequencing. For the background behind this project and a detailed discussion of our modeling approach for metagenomic data, we refer the reader to our earlier technical report, dated March 4, 2014. Briefly, we described a fully Bayesian generative model for paired-end sequence read data, incorporating the effects of the relative abundances, the distribution of sequence fragment lengths, fragment position bias, sequencing errors and variations between the sampled genomes and the nearest reference genomes. A distinctive feature of our modeling approach is the use of a Chinese restaurant process (CRP) to describe the selection of genomes to be sampled, and thus the relative abundances. The CRP component is desirable for fitting abundances to reads that may map ambiguously to multiple targets, because it naturally leads to sparse solutions that select the best representative from each set of nearly equivalent genomes.

  19. Between Economic and Legal Analysis of Incorporated Things: a Critical "NO" to Aedilitian Remedies

    Directory of Open Access Journals (Sweden)

    CG Kilian

    2006-01-01

    Full Text Available This article analyses the dictum of the Phame v Paizes 1973 3 397 (A within economic and legal principles to determine whether incorporeal things could possess characteristics of value or quality characteristics as in the case of corporeal things. The author uses practical economic examples to argue for the development of common law. The author identifies relevant Roman law principles which justify the legal nature of incorporeal things. It is demonstrated that the value of incorporeal things depends greatly on future circumstances. It is argued in this article that the courts’ willingness to extend the Aedilitian remedies and the wide interpretation of a dictum et promissum create an open door for any unsatisfied buyer with no entrepreneurial skills to claim a reduced price if the business is unable to achieve similar financial results to those prior to the conclusion of the contract. Currently the seller of a business has no clear or enforceable defense under these circumstances. The author subsequently suggests that relevant Roman law principles should be revisited in the aim to develop an appropriate defense for the seller.

  20. Impact of whey protein coating incorporated with Bifidobacterium and Lactobacillus on sliced ham properties.

    Science.gov (United States)

    Odila Pereira, Joana; Soares, José; J P Monteiro, Maria; Gomes, Ana; Pintado, Manuela

    2018-05-01

    Edible coatings/films with functional ingredients may be a solution to consumers' demands for high-quality food products and an extended shelf-life. The aim of this work was to evaluate the antimicrobial efficiency of edible coatings incorporated with probiotics on sliced ham preservation. Coatings was developed based on whey protein isolates with incorporation of Bifidobacterium animalis Bb-12® or Lactobacillus casei-01. The physicochemical analyses showed that coating decreased water and weight loss on the ham. Furthermore, color analysis showed that coated sliced ham, exhibited no color change, comparatively to uncoated slices. The edible coatings incorporating the probiotic strains inhibited detectable growth of Staphylococcus spp., Pseudomonas spp., Enterobacteriaceae and yeasts/molds, at least, for 45days of storage at 4°C. The sensory evaluation demonstrated that there was a preference for the sliced coated ham. Probiotic bacteria viable cell numbers were maintained at ca. 10 8 CFU/g throughout storage time, enabling the slice of ham to act as a suitable carrier for the beneficial bacteria. Copyright © 2018 Elsevier Ltd. All rights reserved.

  1. Incorporating metacognition into morbidity and mortality rounds: The next frontier in quality improvement.

    Science.gov (United States)

    Katz, David; Detsky, Allan S

    2016-02-01

    This Perspective proposes the introduction of metacognition (thinking about thinking) into the existing format of hospital-based morbidity and mortality rounds. It is placed in the context of historical movements to advance quality improvement by expanding the spectrum of the causes of medical error from systems-based issues to flawed human decision-making capabilities. We suggest that the current approach that focuses on systems-based issues can be improved by exploiting the opportunities to educate physicians about predictable errors committed by reliance on cognitive heuristics. In addition, because the field of educating clinicians about cognitive heuristics has shown mixed results, this proposal represents fertile ground for further research. Educating clinicians about cognitive heuristics may improve metacognition and perhaps be the next frontier in quality improvement. © 2015 Society of Hospital Medicine.

  2. Using quality scores and longer reads improves accuracy of Solexa read mapping

    Directory of Open Access Journals (Sweden)

    Xuan Zhenyu

    2008-02-01

    Full Text Available Abstract Background Second-generation sequencing has the potential to revolutionize genomics and impact all areas of biomedical science. New technologies will make re-sequencing widely available for such applications as identifying genome variations or interrogating the oligonucleotide content of a large sample (e.g. ChIP-sequencing. The increase in speed, sensitivity and availability of sequencing technology brings demand for advances in computational technology to perform associated analysis tasks. The Solexa/Illumina 1G sequencer can produce tens of millions of reads, ranging in length from ~25–50 nt, in a single experiment. Accurately mapping the reads back to a reference genome is a critical task in almost all applications. Two sources of information that are often ignored when mapping reads from the Solexa technology are the 3' ends of longer reads, which contain a much higher frequency of sequencing errors, and the base-call quality scores. Results To investigate whether these sources of information can be used to improve accuracy when mapping reads, we developed the RMAP tool, which can map reads having a wide range of lengths and allows base-call quality scores to determine which positions in each read are more important when mapping. We applied RMAP to analyze data re-sequenced from two human BAC regions for varying read lengths, and varying criteria for use of quality scores. RMAP is freely available for downloading at http://rulai.cshl.edu/rmap/. Conclusion Our results indicate that significant gains in Solexa read mapping performance can be achieved by considering the information in 3' ends of longer reads, and appropriately using the base-call quality scores. The RMAP tool we have developed will enable researchers to effectively exploit this information in targeted re-sequencing projects.

  3. Synthesis of hexahydropyrrolo[2,1-a]isoquinoline compound libraries through a Pictet–Spengler cyclization/metal-catalyzed cross coupling/amidation sequence

    DEFF Research Database (Denmark)

    Petersen, Rico; Cohrt, A. Emil; Petersen, Michael Åxman

    2015-01-01

    incorporating two handles for diversification, were synthesized through an oxidative cleavage/Pictet–Spengler reaction sequence in high overall yields. A subsequent metal-catalyzed cross coupling/amidation protocol was developed and its utility in library synthesis was validated by construction of a 20-membered...

  4. Modeling fraud detection and the incorporation of forensic specialists in the audit process

    DEFF Research Database (Denmark)

    Sakalauskaite, Dominyka

    Financial statement audits are still comparatively poor in fraud detection. Forensic specialists can play a significant role in increasing audit quality. In this paper, based on prior academic research, I develop a model of fraud detection and the incorporation of forensic specialists in the audit...... process. The intention of the model is to identify the reasons why the audit is weak in fraud detection and to provide the analytical framework to assess whether the incorporation of forensic specialists can help to improve it. The results show that such specialists can potentially improve the fraud...... detection in the audit, but might also cause some negative implications. Overall, even though fraud detection is one of the main topics in research there are very few studies done on the subject of how auditors co-operate with forensic specialists. Thus, the paper concludes with suggestions for further...

  5. Draft genome sequence of Phomopsis longicolla isolate MSPL 10-6

    Directory of Open Access Journals (Sweden)

    Shuxian Li

    2015-03-01

    Full Text Available Phomopsis longicolla is the primary cause of Phomopsis seed decay in soybean. This disease severely affects soybean seed quality by reducing seed viability and oil content, altering seed composition, and increasing frequencies of moldy and/or split beans. It is one of the most economically important soybean diseases. Here, we report the de novo assembled draft genome sequence of the P. longicolla isolate MSPL10-6, which was isolated from field-grown soybean seed in Mississippi, USA. This study represents the first reported genome sequence of a seedborne fungal pathogen in the Diaporthe–Phomopsis complex. The P. longicolla genome sequence will enable research into the genetic basis of fungal infection of soybean seed and provide information for the study of soybean–fungal interactions. The genome sequence will also be valuable for molecular genetic marker development, manipulation of pathogenicity-related genes and development of new control strategies for this pathogen.

  6. Integrated Management System Incorporating Quality Management and Management of Environment, Health and Occupational Safety

    International Nuclear Information System (INIS)

    Manchev, B.; Nenkova, B.; Tomov, E.

    2012-01-01

    Risk Engineering Ltd is a Bulgarian private company founded in 1990 to provide engineering and consulting services applicable to each and every field of the energy sector. Since its establishment Risk Engineering Ltd develops, implement and apply a System for quality assurance, certified for the first time by BVQI (now Bureau Veritas Certification) in 1999 for conformity with the standard ISO 9001:1994. Later on, in connection with the revision of the standards of ISO 9000 series and introduction of the standard ISO 9001:2000 a Quality Management System in conformity with the standard ISO 9001:2000 was developed, introduced and certified. At present, Risk Engineering Ltd has got developed, documented, introduced and certified by Lloyd's Register Quality Assurance (LRQA) Quality Management System in compliance with ISO 9001:2008 on the process approach basis. On this basis and including the requirements of the ISO 14001:2004 (regarding the environment) and OHSAS 18001:2007 (regarding the health and occupational safety), Risk Engineering Ltd has developed and introduced Integrated Management System aim at achieving and demonstrating good results regarding protection of the environment, health and occupational safety. The processes under control by the Integrated Management System and applicable at the company are divided in two general types: A) Management processes: Strategic management and Management of the human resources. B) Processes describing the main activities: design/development process; project management; management of industrial projects and technical infrastructure project; construction, installation, repair and operation of power industry facilities; commercial activities and marketing; investigation of energy efficiency of industrial systems and certification of buildings regarding energy efficiency; consulting activity in the field of industry and energy as well as consultant in accordance with the Law of the Spatial Planning; management of the

  7. Extracting flat-field images from scene-based image sequences using phase correlation

    Energy Technology Data Exchange (ETDEWEB)

    Caron, James N., E-mail: Caron@RSImd.com [Research Support Instruments, 4325-B Forbes Boulevard, Lanham, Maryland 20706 (United States); Montes, Marcos J. [Naval Research Laboratory, Code 7231, 4555 Overlook Avenue, SW, Washington, DC 20375 (United States); Obermark, Jerome L. [Naval Research Laboratory, Code 8231, 4555 Overlook Avenue, SW, Washington, DC 20375 (United States)

    2016-06-15

    Flat-field image processing is an essential step in producing high-quality and radiometrically calibrated images. Flat-fielding corrects for variations in the gain of focal plane array electronics and unequal illumination from the system optics. Typically, a flat-field image is captured by imaging a radiometrically uniform surface. The flat-field image is normalized and removed from the images. There are circumstances, such as with remote sensing, where a flat-field image cannot be acquired in this manner. For these cases, we developed a phase-correlation method that allows the extraction of an effective flat-field image from a sequence of scene-based displaced images. The method uses sub-pixel phase correlation image registration to align the sequence to estimate the static scene. The scene is removed from sequence producing a sequence of misaligned flat-field images. An average flat-field image is derived from the realigned flat-field sequence.

  8. Sequence Ready Characterization of the Pericentromeric Region of 19p12

    Energy Technology Data Exchange (ETDEWEB)

    Evan E. Eichler

    2006-08-31

    Current mapping and sequencing strategies have been inadequate within the proximal portion of 19p12 due, in part, to the presence of a recently expanded ZNF (zinc-finger) gene family and the presence of large (25-50 kb) inverted beta-satellite repeat structures which bracket this tandemly duplicated gene family. The virtual of absence of classically defined “unique” sequence within the region has hampered efforts to identify and characterize a suitable minimal tiling path of clones which can be used as templates required for finished sequencing of the region. The goal of this proposal is to develop and implement a novel sequence-anchor strategy to generate a contiguous BAC map of the most proximal portion of chromosome 19p12 for the purpose of complete sequence characterization. The target region will be an estimated 4.5 Mb of DNA extending from STS marker D19S450 (the beginning of the ZNF gene cluster) to the centromeric (alpha-satellite) junction of 19p11. The approach will entail 1) pre-selection of 19p12 BAC and cosmid clones (NIH approved library) utilizing both 19p12 -unique and 19p12-SPECIFIC repeat probes (Eichler et al., 1998); 2) the generation of a BAC/cosmid end-sequence map across the region with a density of one marker every 8kb; 3) the development of a second-generation of STS (sequence tagged sites) which will be used to identify and verify clonal overlap at the level of the sequence; 4) incorporation of these sequence-anchored overlapping clones into existing cosmid/BAC restriction maps developed at Livermore National Laboratory; and 5) validation of the organization of this region utilizing high-resolution FISH techniques (extended chromatin analysis) on monochromosomal 19 somatic cell hybrids and parental cell lines of source material. The data generated will be used in the selection of the most parsimonious tiling path of BAC clones to be sequenced as part of the JGI effort on chromosome 19 and should serve as a model for the sequence

  9. Spike protein assembly into the coronavirion: exploring the limits of its sequence requirements

    International Nuclear Information System (INIS)

    Bosch, Berend Jan; Haan, Cornelis A.M. de; Smits, Saskia L.; Rottier, Peter J.M.

    2005-01-01

    The coronavirus spike (S) protein, required for receptor binding and membrane fusion, is incorporated into the assembling virion by interactions with the viral membrane (M) protein. Earlier we showed that the ectodomain of the S protein is not involved in this process. Here we further defined the requirements of the S protein for virion incorporation. We show that the cytoplasmic domain, not the transmembrane domain, determines the association with the M protein and suffices to effect the incorporation into viral particles of chimeric spikes as well as of foreign viral glycoproteins. The essential sequence was mapped to the membrane-proximal region of the cytoplasmic domain, which is also known to be of critical importance for the fusion function of the S protein. Consistently, only short C-terminal truncations of the S protein were tolerated when introduced into the virus by targeted recombination. The important role of the about 38-residues cytoplasmic domain in the assembly of and membrane fusion by this approximately 1300 amino acids long protein is discussed

  10. Can a single-shot black-blood T2-weighted spin-echo echo-planar imaging sequence with sensitivity encoding replace the respiratory-triggered turbo spin-echo sequence for the liver? An optimization and feasibility study.

    Science.gov (United States)

    Hussain, Shahid M; De Becker, Jan; Hop, Wim C J; Dwarkasing, Soendersing; Wielopolski, Piotr A

    2005-03-01

    To optimize and assess the feasibility of a single-shot black-blood T2-weighted spin-echo echo-planar imaging (SSBB-EPI) sequence for MRI of the liver using sensitivity encoding (SENSE), and compare the results with those obtained with a T2-weighted turbo spin-echo (TSE) sequence. Six volunteers and 16 patients were scanned at 1.5T (Philips Intera). In the volunteer study, we optimized the SSBB-EPI sequence by interactively changing the parameters (i.e., the resolution, echo time (TE), diffusion weighting with low b-values, and polarity of the phase-encoding gradient) with regard to distortion, suppression of the blood signal, and sensitivity to motion. The influence of each change was assessed. The optimized SSBB-EPI sequence was applied in patients (N = 16). A number of items, including the overall image quality (on a scale of 1-5), were used for graded evaluation. In addition, the signal-to-noise ratio (SNR) of the liver was calculated. Statistical analysis was carried out with the use of Wilcoxon's signed rank test for comparison of the SSBB-EPI and TSE sequences, with P = 0.05 considered the limit for significance. The SSBB-EPI sequence was improved by the following steps: 1) less frequency points than phase-encoding steps, 2) a b-factor of 20, and 3) a reversed polarity of the phase-encoding gradient. In patients, the mean overall image quality score for the optimized SSBB-EPI (3.5 (range: 1-4)) and TSE (3.6 (range: 3-4)), and the SNR of the liver on SSBB-EPI (mean +/- SD = 7.6 +/- 4.0) and TSE (8.9 +/- 4.6) were not significantly different (P > .05). Optimized SSBB-EPI with SENSE proved to be feasible in patients, and the overall image quality and SNR of the liver were comparable to those achieved with the standard respiratory-triggered T2-weighted TSE sequence. (c) 2005 Wiley-Liss, Inc.

  11. Efficient alignment of pyrosequencing reads for re-sequencing applications

    Directory of Open Access Journals (Sweden)

    Russo Luis MS

    2011-05-01

    Full Text Available Abstract Background Over the past few years, new massively parallel DNA sequencing technologies have emerged. These platforms generate massive amounts of data per run, greatly reducing the cost of DNA sequencing. However, these techniques also raise important computational difficulties mostly due to the huge volume of data produced, but also because of some of their specific characteristics such as read length and sequencing errors. Among the most critical problems is that of efficiently and accurately mapping reads to a reference genome in the context of re-sequencing projects. Results We present an efficient method for the local alignment of pyrosequencing reads produced by the GS FLX (454 system against a reference sequence. Our approach explores the characteristics of the data in these re-sequencing applications and uses state of the art indexing techniques combined with a flexible seed-based approach, leading to a fast and accurate algorithm which needs very little user parameterization. An evaluation performed using real and simulated data shows that our proposed method outperforms a number of mainstream tools on the quantity and quality of successful alignments, as well as on the execution time. Conclusions The proposed methodology was implemented in a software tool called TAPyR--Tool for the Alignment of Pyrosequencing Reads--which is publicly available from http://www.tapyr.net.

  12. Strain-specific and pooled genome sequences for populations of Drosophila melanogaster from three continents.

    Science.gov (United States)

    Bergman, Casey M; Haddrill, Penelope R

    2015-01-01

    To contribute to our general understanding of the evolutionary forces that shape variation in genome sequences in nature, we have sequenced genomes from 50 isofemale lines and six pooled samples from populations of Drosophila melanogaster on three continents. Analysis of raw and reference-mapped reads indicates the quality of these genomic sequence data is very high. Comparison of the predicted and experimentally-determined Wolbachia infection status of these samples suggests that strain or sample swaps are unlikely to have occurred in the generation of these data. Genome sequences are freely available in the European Nucleotide Archive under accession ERP009059. Isofemale lines can be obtained from the Drosophila Species Stock Center.

  13. Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing.

    Science.gov (United States)

    Zhang, Jin; Ruhlman, Tracey A; Mower, Jeffrey P; Jansen, Robert K

    2013-12-29

    Organelle genomes of Geraniaceae exhibit several unusual evolutionary phenomena compared to other angiosperm families including accelerated nucleotide substitution rates, widespread gene loss, reduced RNA editing, and extensive genomic rearrangements. Since most organelle-encoded proteins function in multi-subunit complexes that also contain nuclear-encoded proteins, it is likely that the atypical organellar phenomena affect the evolution of nuclear genes encoding organellar proteins. To begin to unravel the complex co-evolutionary interplay between organellar and nuclear genomes in this family, we sequenced nuclear transcriptomes of two species, Geranium maderense and Pelargonium x hortorum. Normalized cDNA libraries of G. maderense and P. x hortorum were used for transcriptome sequencing. Five assemblers (MIRA, Newbler, SOAPdenovo, SOAPdenovo-trans [SOAPtrans], Trinity) and two next-generation technologies (454 and Illumina) were compared to determine the optimal transcriptome sequencing approach. Trinity provided the highest quality assembly of Illumina data with the deepest transcriptome coverage. An analysis to determine the amount of sequencing needed for de novo assembly revealed diminishing returns of coverage and quality with data sets larger than sixty million Illumina paired end reads for both species. The G. maderense and P. x hortorum transcriptomes contained fewer transcripts encoding the PLS subclass of PPR proteins relative to other angiosperms, consistent with reduced mitochondrial RNA editing activity in Geraniaceae. In addition, transcripts for all six plastid targeted sigma factors were identified in both transcriptomes, suggesting that one of the highly divergent rpoA-like ORFs in the P. x hortorum plastid genome is functional. The findings support the use of the Illumina platform and assemblers optimized for transcriptome assembly, such as Trinity or SOAPtrans, to generate high-quality de novo transcriptomes with broad coverage. In addition

  14. Frame Rate versus Spatial Quality: Which Video Characteristics Do Matter?

    DEFF Research Database (Denmark)

    Korhonen, Jari; Reiter, Ulrich; Ukhanova, Ann

    2013-01-01

    and temporal quality levels. We also propose simple yet powerful metrics for characterizing spatial and temporal properties of a video sequence, and demonstrate how these metrics can be applied for evaluating the relative impact of spatial and temporal quality on the perceived overall quality....

  15. Sequencing of a QTL-rich region of the Theobroma cacao genome using pooled BACs and the identification of trait specific candidate genes

    Directory of Open Access Journals (Sweden)

    Blackmon Barbara P

    2011-07-01

    Full Text Available Abstract Background BAC-based physical maps provide for sequencing across an entire genome or a selected sub-genomic region of biological interest. Such a region can be approached with next-generation whole-genome sequencing and assembly as if it were an independent small genome. Using the minimum tiling path as a guide, specific BAC clones representing the prioritized genomic interval are selected, pooled, and used to prepare a sequencing library. Results This pooled BAC approach was taken to sequence and assemble a QTL-rich region, of ~3 Mbp and represented by twenty-seven BACs, on linkage group 5 of the Theobroma cacao cv. Matina 1-6 genome. Using various mixtures of read coverages from paired-end and linear 454 libraries, multiple assemblies of varied quality were generated. Quality was assessed by comparing the assembly of 454 reads with a subset of ten BACs individually sequenced and assembled using Sanger reads. A mixture of reads optimal for assembly was identified. We found, furthermore, that a quality assembly suitable for serving as a reference genome template could be obtained even with a reduced depth of sequencing coverage. Annotation of the resulting assembly revealed several genes potentially responsible for three T. cacao traits: black pod disease resistance, bean shape index, and pod weight. Conclusions Our results, as with other pooled BAC sequencing reports, suggest that pooling portions of a minimum tiling path derived from a BAC-based physical map is an effective method to target sub-genomic regions for sequencing. While we focused on a single QTL region, other QTL regions of importance could be similarly sequenced allowing for biological discovery to take place before a high quality whole-genome assembly is completed.

  16. Collembase: a repository for springtail genomics and soil quality assessment

    Directory of Open Access Journals (Sweden)

    Klein-Lankhorst Rene M

    2007-09-01

    Full Text Available Abstract Background Environmental quality assessment is traditionally based on responses of reproduction and survival of indicator organisms. For soil assessment the springtail Folsomia candida (Collembola is an accepted standard test organism. We argue that environmental quality assessment using gene expression profiles of indicator organisms exposed to test substrates is more sensitive, more toxicant specific and significantly faster than current risk assessment methods. To apply this species as a genomic model for soil quality testing we conducted an EST sequencing project and developed an online database. Description Collembase is a web-accessible database comprising springtail (F. candida genomic data. Presently, the database contains information on 8686 ESTs that are assembled into 5952 unique gene objects. Of those gene objects ~40% showed homology to other protein sequences available in GenBank (blastx analysis; non-redundant (nr database; expect-value -5. Software was applied to infer protein sequences. The putative peptides, which had an average length of 115 amino-acids (ranging between 23 and 440 were annotated with Gene Ontology (GO terms. In total 1025 peptides (~17% of the gene objects were assigned at least one GO term (expect-value -25. Within Collembase searches can be conducted based on BLAST and GO annotation, cluster name or using a BLAST server. The system furthermore enables easy sequence retrieval for functional genomic and Quantitative-PCR experiments. Sequences are submitted to GenBank (Accession numbers: EV473060 – EV481745. Conclusion Collembase http://www.collembase.org is a resource of sequence data on the springtail F. candida. The information within the database will be linked to a custom made microarray, based on the Agilent platform, which can be applied for soil quality testing. In addition, Collembase supplies information that is valuable for related scientific disciplines such as molecular ecology

  17. Structural Approaches to Sequence Evolution Molecules, Networks, Populations

    CERN Document Server

    Bastolla, Ugo; Roman, H. Eduardo; Vendruscolo, Michele

    2007-01-01

    Structural requirements constrain the evolution of biological entities at all levels, from macromolecules to their networks, right up to populations of biological organisms. Classical models of molecular evolution, however, are focused at the level of the symbols - the biological sequence - rather than that of their resulting structure. Now recent advances in understanding the thermodynamics of macromolecules, the topological properties of gene networks, the organization and mutation capabilities of genomes, and the structure of populations make it possible to incorporate these key elements into a broader and deeply interdisciplinary view of molecular evolution. This book gives an account of such a new approach, through clear tutorial contributions by leading scientists specializing in the different fields involved.

  18. Short-term incorporation of organic manures and biofertilizers influences biochemical and microbial characteristics of soils under an annual crop [Turmeric (Curcuma longa L.)].

    Science.gov (United States)

    Dinesh, R; Srinivasan, V; Hamza, S; Manjusha, A

    2010-06-01

    The study was conducted to determine whether short-term incorporation of organic manures and biofertilizers influence biochemical and microbial variables reflecting soil quality. For the study, soils were collected from a field experiment conducted on turmeric (Curcuma longa L.) involving organic nutrient management (ONM), chemical nutrient management (CNM) and integrated nutrient management (INM). The findings revealed that application of organic manures and biofertilizers (ONM and INM) positively influenced microbial biomass C, N mineralization, soil respiration and enzymes activities. Contrarily, greater metabolic quotient levels in CNM indicated a stressed soil microbial community. Principal component analysis indicated the strong relationship between microbial activity and the availability of labile and easily mineralizable organic matter. The findings imply that even short-term incorporation of organic manures and biofertilizers promoted soil microbial and enzyme activities and these parameters are sensitive enough to detect changes in soil quality due to short-term incorporation of biological fertilizers. (c) 2010 Elsevier Ltd. All rights reserved.

  19. iFORM: Incorporating Find Occurrence of Regulatory Motifs.

    Science.gov (United States)

    Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

    2016-01-01

    Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.

  20. Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing

    Directory of Open Access Journals (Sweden)

    Wang Nian

    2012-08-01

    Full Text Available Abstract Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison.

  1. MACSIMS : multiple alignment of complete sequences information management system

    Directory of Open Access Journals (Sweden)

    Plewniak Frédéric

    2006-06-01

    Full Text Available Abstract Background In the post-genomic era, systems-level studies are being performed that seek to explain complex biological systems by integrating diverse resources from fields such as genomics, proteomics or transcriptomics. New information management systems are now needed for the collection, validation and analysis of the vast amount of heterogeneous data available. Multiple alignments of complete sequences provide an ideal environment for the integration of this information in the context of the protein family. Results MACSIMS is a multiple alignment-based information management program that combines the advantages of both knowledge-based and ab initio sequence analysis methods. Structural and functional information is retrieved automatically from the public databases. In the multiple alignment, homologous regions are identified and the retrieved data is evaluated and propagated from known to unknown sequences with these reliable regions. In a large-scale evaluation, the specificity of the propagated sequence features is estimated to be >99%, i.e. very few false positive predictions are made. MACSIMS is then used to characterise mutations in a test set of 100 proteins that are known to be involved in human genetic diseases. The number of sequence features associated with these proteins was increased by 60%, compared to the features available in the public databases. An XML format output file allows automatic parsing of the MACSIM results, while a graphical display using the JalView program allows manual analysis. Conclusion MACSIMS is a new information management system that incorporates detailed analyses of protein families at the structural, functional and evolutionary levels. MACSIMS thus provides a unique environment that facilitates knowledge extraction and the presentation of the most pertinent information to the biologist. A web server and the source code are available at http://bips.u-strasbg.fr/MACSIMS/.

  2. Reconceptualising water quality governance to incorporate knowledge and values: Case studies from Australian and Brazilian Indigenous communities

    Directory of Open Access Journals (Sweden)

    Kate A. Berry

    2018-02-01

    Full Text Available This paper examines the significance of knowledge and values for water quality and its governance. Modernist approaches to the governance of water quality in rivers and lakes need to be reconceptualised and overhauled. The problems include: perceiving water only as a physical and chemical liquid, defining quality in narrow terms, rendering water knowledge as invisible, boiling down water values to uses of presumed economic importance and limiting how and by whom objectives are set or actions taken. In addressing the need to reframe water quality governance, and as a counter to the objectification of water quality, we propose a framework that explicitly recognises the significance of knowledge and values relating to water. While our framework could apply to other contexts under the influence of modernist water-management regimes, here we pay particular attention to the relevance of the water knowledge, values and governance of water quality by Indigenous people. In the second half of the paper we address issues related to Indigenous water-quality governance in two countries, Brazil and Australia, showing some of the ways in which, despite enormous obstacles, Indigenous communities re-work governance structures through their engagements with water quality and pay attention to water knowledge and values.

  3. Incorporation Between AHP and N-TSP for Plant Surveillance Routing with Multiple Constraints

    International Nuclear Information System (INIS)

    Djoko Hari Nugroho

    2002-01-01

    This paper observed plant on-line surveillance routing for maintenance management with multiple constraints using TSP (Traveling Salesman Problem). In the research N-TSP (nomadic TSP) type was used. In this case, on-line surveillance could be implemented on moving robot. Route for preventive maintenance management was observed sequentially per stage using multiple constraints (a) distance between components, and (b) failure probability of components using AHP (Analytical Hierarchy Process). Simulation was observed utilizing DURESS as a complex system. The simulation result showed that the route with single constraint distance between components represents the sequence of 1 - 6 - 2 - 4 - 3 - 5. Routing for DURESS with multiple constraints using incorporation of AHP and TSP showed that the first priority in the route is flow sensor FB2 with the value of comparation of 0.1042. The next priority is sequentially FB1, FA2, FA1, FA, FB, VB, VA, VA1, VA2, VB1, VB2, pump B, pump A, FR1, FR2, reservoir 2, and reservoir 1. Numerical experiment obtained that the incorporation between AHP and N-TSP has successfully constructed the surveillance routing with multiple constraints. (author)

  4. Evaluation of turbo spin echo sequences for MRI of focal liver lesions at 0.5 T

    International Nuclear Information System (INIS)

    Kreft, B.; Layer, G.; Steudel, A.; Spiller, L.; Heuck, A.; Mueller, A.; Gieseke, J.; Reiser, M.F.

    1994-01-01

    To determine whether turbo spin echo (TSE) sequences can replace conventional T2-weighted spin echo (SE) sequences in MRI of the liver, 40 patients with focal liver lesions were imaged at 0.5 T. A T2-weighted SE sequence (TR/TE 1800/90 ms, number of signals averaged [NEX] = 2, scan time 7:16 min), a TSE sequence (TR/TE 1800/90 ms, NEX = 4, number of echos per excitation = 13, echo spacing = 12.9 ms, scan time = 4:16 min) and a T1-weighted SE sequence (TR/TE 350/15 ms, NEX = 2, scan time = 4:21 min) were obtained and image quality, lesion detectability and lesion differentiation were evaluated qualitatively by subjective assessment using scores and quantitatively by lesion-liver contrast-to-noise (CNR) and tumour/liver signal intensity (SI) ratios. The image quality of the TSE sequence was substantially better compared with the T2-weighted SE sequence due to a reduction in motion artefacts and better delineation of anatomical details. Of a total of 158 visible lesions the T1-weighted SE, TSE and T2-weighted SE sequences showed 91 %, 81 % and 65 % of the lesions, respectively. Thus the TSE sequence depicted 24 % (P < 0.001) more lesions than the T2-weighted SE sequence. In all types of pathology the lesion-liver CNR of the TSE sequence was significantly (P < 0.001) higher compared to the CNR of the T2-weighted SE sequence ( +55-65 %), indicating superior lesion conspicuity. Lesion characterization was equally good on the two T2-weighted sequences with no difference in the tumour/liver SI ratio. Using a criterion of tumour/liver SI ratio equal to or higher than 2, haemangiomas larger than 1 cm in diameter could be differentiated from other lesions with a sensitivity and specificity of 95 % and 96 %, respectively. Our results indicate that the TSE sequence is suitable for replacing the conventional T2-weighted SE sequence in MRI of focal liver lesions. (orig.)

  5. Viral metagenomics: Analysis of begomoviruses by illumina high-throughput sequencing

    KAUST Repository

    Idris, Ali

    2014-03-12

    Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes) (genus, Begomovirus; family, Geminiviridae) were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA). Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS). CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, make possible deep sequence coverage at a lower cost; the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally-infected with begomoviruses under field conditions. 2014 by the authors; licensee MDPI, Basel, Switzerland.

  6. Viral Metagenomics: Analysis of Begomoviruses by Illumina High-Throughput Sequencing

    Directory of Open Access Journals (Sweden)

    Ali Idris

    2014-03-01

    Full Text Available Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes (genus, Begomovirus; family, Geminiviridae were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA. Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS. CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, make possible deep sequence coverage at a lower cost; the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally-infected with begomoviruses under field conditions.

  7. Impact of culture on the application of quality management system

    OpenAIRE

    2012-01-01

    M.Ing. The management of quality in companies has become more and more strategically important over recent years. The emphasis on quality is vital to managing projects and achieving excellence in today’s global economy. Modern petrochemical construction project management has incorporated quality management principles and initiatives in their activities. The Quality Management System is used to ensure that the project will satisfy the requirements for which it was undertaken. Improving pro...

  8. WildSpan: mining structured motifs from protein sequences

    Directory of Open Access Journals (Sweden)

    Chen Chien-Yu

    2011-03-01

    Full Text Available Abstract Background Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions that incorporates several pruning strategies to largely reduce the mining cost. Results WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode

  9. Quality of synthetic hexaploid wheat containing null alleles at Glu-A1

    Indian Academy of Sciences (India)

    GSs. However, incorporation of HMW-GS from Ae. tauschii in six synthetic hexaploid wheat lines significantly increased most quality related parameters. The potential values of these wheat lines in improving the quality of wheat are discussed.

  10. Combined evidence annotation of transposable elements in genome sequences.

    Directory of Open Access Journals (Sweden)

    Hadi Quesneville

    2005-07-01

    Full Text Available Transposable elements (TEs are mobile, repetitive sequences that make up significant fractions of metazoan genomes. Despite their near ubiquity and importance in genome and chromosome biology, most efforts to annotate TEs in genome sequences rely on the results of a single computational program, RepeatMasker. In contrast, recent advances in gene annotation indicate that high-quality gene models can be produced from combining multiple independent sources of computational evidence. To elevate the quality of TE annotations to a level comparable to that of gene models, we have developed a combined evidence-model TE annotation pipeline, analogous to systems used for gene annotation, by integrating results from multiple homology-based and de novo TE identification methods. As proof of principle, we have annotated "TE models" in Drosophila melanogaster Release 4 genomic sequences using the combined computational evidence derived from RepeatMasker, BLASTER, TBLASTX, all-by-all BLASTN, RECON, TE-HMM and the previous Release 3.1 annotation. Our system is designed for use with the Apollo genome annotation tool, allowing automatic results to be curated manually to produce reliable annotations. The euchromatic TE fraction of D. melanogaster is now estimated at 5.3% (cf. 3.86% in Release 3.1, and we found a substantially higher number of TEs (n = 6,013 than previously identified (n = 1,572. Most of the new TEs derive from small fragments of a few hundred nucleotides long and highly abundant families not previously annotated (e.g., INE-1. We also estimated that 518 TE copies (8.6% are inserted into at least one other TE, forming a nest of elements. The pipeline allows rapid and thorough annotation of even the most complex TE models, including highly deleted and/or nested elements such as those often found in heterochromatic sequences. Our pipeline can be easily adapted to other genome sequences, such as those of the D. melanogaster heterochromatin or other

  11. Subjective Video Quality Assessment in H.264/AVC Video Coding Standard

    Directory of Open Access Journals (Sweden)

    Z. Miličević

    2012-11-01

    Full Text Available This paper seeks to provide an approach for subjective video quality assessment in the H.264/AVC standard. For this purpose a special software program for the subjective assessment of quality of all the tested video sequences is developed. It was developed in accordance with recommendation ITU-T P.910, since it is suitable for the testing of multimedia applications. The obtained results show that in the proposed selective intra prediction and optimized inter prediction algorithm there is a small difference in picture quality (signal-to-noise ratio between decoded original and modified video sequences.

  12. Quantum-Sequencing: Fast electronic single DNA molecule sequencing

    Science.gov (United States)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.

  13. Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data

    KAUST Repository

    Allam, Amin

    2015-07-14

    Motivation: Next-generation sequencing generates large amounts of data affected by errors in the form of substitutions, insertions or deletions of bases. Error correction based on the high-coverage information, typically improves de novo assembly. Most existing tools can correct substitution errors only; some support insertions and deletions, but accuracy in many cases is low. Results: We present Karect, a novel error correction technique based on multiple alignment. Our approach supports substitution, insertion and deletion errors. It can handle non-uniform coverage as well as moderately covered areas of the sequenced genome. Experiments with data from Illumina, 454 FLX and Ion Torrent sequencing machines demonstrate that Karect is more accurate than previous methods, both in terms of correcting individual-bases errors (up to 10% increase in accuracy gain) and post de novo assembly quality (up to 10% increase in NGA50). We also introduce an improved framework for evaluating the quality of error correction.