WorldWideScience

Sample records for segmental genome heterogeneity

  1. Organizational heterogeneity of vertebrate genomes.

    Science.gov (United States)

    Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

    2012-01-01

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  2. Organizational heterogeneity of vertebrate genomes.

    Directory of Open Access Journals (Sweden)

    Svetlana Frenkel

    Full Text Available Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.

  3. Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits

    DEFF Research Database (Denmark)

    Gebreyesus, Grum; Lund, Mogens Sandø; Buitenhuis, Albert Johannes

    2017-01-01

    Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci...... of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we...... developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls...

  4. Exploratory analysis of genomic segmentations with Segtools

    Directory of Open Access Journals (Sweden)

    Buske Orion J

    2011-10-01

    Full Text Available Abstract Background As genome-wide experiments and annotations become more prevalent, researchers increasingly require tools to help interpret data at this scale. Many functional genomics experiments involve partitioning the genome into labeled segments, such that segments sharing the same label exhibit one or more biochemical or functional traits. For example, a collection of ChlP-seq experiments yields a compendium of peaks, each labeled with one or more associated DNA-binding proteins. Similarly, manually or automatically generated annotations of functional genomic elements, including cis-regulatory modules and protein-coding or RNA genes, can also be summarized as genomic segmentations. Results We present a software toolkit called Segtools that simplifies and automates the exploration of genomic segmentations. The software operates as a series of interacting tools, each of which provides one mode of summarization. These various tools can be pipelined and summarized in a single HTML page. We describe the Segtools toolkit and demonstrate its use in interpreting a collection of human histone modification data sets and Plasmodium falciparum local chromatin structure data sets. Conclusions Segtools provides a convenient, powerful means of interpreting a genomic segmentation.

  5. How clonal is clonal? Genome plasticity across multicellular segments of a "Candidatus Marithrix sp." filament from sulfidic, briny seafloor sediments in the Gulf of Mexico

    Directory of Open Access Journals (Sweden)

    Verena Salman-Carvalho

    2016-08-01

    Full Text Available Candidatus Marithrix is a recently described lineage within the group of large sulfur bacteria (Beggiatoaceae, Gammaproteobacteria. This group of bacteria comprises vacuolated, attached-living filaments that inhabit the sediment surface around vent and seep sites in the marine environment. A single filament is ca. 100 µm in diameter, several millimeters long, and consists of hundreds of clonal cells, which are considered highly polyploid. Based on these characteristics, Candidatus Marithrix was used as a model organism for the assessment of genomic plasticity along segments of a single filament using next generation sequencing to possibly identify hotspots of microevolution. Using six consecutive segments of a single filament sampled from a mud volcano in the Gulf of Mexico, we recovered ca. 90% of the Candidatus Marithrix genome in each segment. There was a high level of genome conservation along the filament with average nucleotide identities between 99.98-100%. Different approaches to assemble all reads into a complete consensus genome could not fill the gaps. Each of the six segment datasets encoded merely a few hundred unique nucleotides and 5 or less unique genes - the residual content was redundant in all datasets. Besides the overall high genomic identity, we identified a similar number of single nucleotide polymorphisms (SNPs between the clonal segments, which are comparable to numbers reported for other clonal organisms. An increase of SNPs with greater distance of filament segments was not observed. The polyploidy of the cells was apparent when analyzing the heterogeneity of reads within a segment. Here, a strong increase in single nucleotide variants, or 'intrasegmental sequence heterogeneity' (ISH events, was observed. These sites may represent hotspots for genome plasticity, and possibly microevolution, since two thirds of these variants were not co-localized across the genome copies of the multicellular filament.

  6. Efficient Algorithms for Analyzing Segmental Duplications, Deletions, and Inversions in Genomes

    Science.gov (United States)

    Kahn, Crystal L.; Mozes, Shay; Raphael, Benjamin J.

    Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics consisting of pieces of multiple other segmental duplications. This complex genomic organization complicates analysis of the evolutionary history of these sequences. Earlier, we introduced a genomic distance, called duplication distance, that computes the most parsimonious way to build a target string by repeatedly copying substrings of a source string. We also showed how to use this distance to describe the formation of segmental duplications according to a two-step model that has been proposed to explain human segmental duplications. Here we describe polynomial-time exact algorithms for several extensions of duplication distance including models that allow certain types of substring deletions and inversions. These extensions will permit more biologically realistic analyses of segmental duplications in genomes.

  7. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments.

    Directory of Open Access Journals (Sweden)

    Paul J Wichgers Schreur

    2016-08-01

    Full Text Available The bunyavirus genome comprises a small (S, medium (M, and large (L RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH, the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process.

  8. Assessing the scale of tumor heterogeneity by complete hierarchical segmentation of MRI.

    Science.gov (United States)

    Gensheimer, Michael F; Hawkins, Douglas S; Ermoian, Ralph P; Trister, Andrew D

    2015-02-07

    In many cancers, intratumoral heterogeneity has been found in histology, genetic variation and vascular structure. We developed an algorithm to interrogate different scales of heterogeneity using clinical imaging. We hypothesize that heterogeneity of perfusion at coarse scale may correlate with treatment resistance and propensity for disease recurrence. The algorithm recursively segments the tumor image into increasingly smaller regions. Each dividing line is chosen so as to maximize signal intensity difference between the two regions. This process continues until the tumor has been divided into single voxels, resulting in segments at multiple scales. For each scale, heterogeneity is measured by comparing each segmented region to the adjacent region and calculating the difference in signal intensity histograms. Using digital phantom images, we showed that the algorithm is robust to image artifacts and various tumor shapes. We then measured the primary tumor scales of contrast enhancement heterogeneity in MRI of 18 rhabdomyosarcoma patients. Using Cox proportional hazards regression, we explored the influence of heterogeneity parameters on relapse-free survival. Coarser scale of maximum signal intensity heterogeneity was prognostic of shorter survival (p = 0.05). By contrast, two fractal parameters and three Haralick texture features were not prognostic. In summary, our algorithm produces a biologically motivated segmentation of tumor regions and reports the amount of heterogeneity at various distance scales. If validated on a larger dataset, this prognostic imaging biomarker could be useful to identify patients at higher risk for recurrence and candidates for alternative treatment.

  9. biomvRhsmm: Genomic Segmentation with Hidden Semi-Markov Model

    Directory of Open Access Journals (Sweden)

    Yang Du

    2014-01-01

    Full Text Available High-throughput technologies like tiling array and next-generation sequencing (NGS generate continuous homogeneous segments or signal peaks in the genome that represent transcripts and transcript variants (transcript mapping and quantification, regions of deletion and amplification (copy number variation, or regions characterized by particular common features like chromatin state or DNA methylation ratio (epigenetic modifications. However, the volume and output of data produced by these technologies present challenges in analysis. Here, a hidden semi-Markov model (HSMM is implemented and tailored to handle multiple genomic profile, to better facilitate genome annotation by assisting in the detection of transcripts, regulatory regions, and copy number variation by holistic microarray or NGS. With support for various data distributions, instead of limiting itself to one specific application, the proposed hidden semi-Markov model is designed to allow modeling options to accommodate different types of genomic data and to serve as a general segmentation engine. By incorporating genomic positions into the sojourn distribution of HSMM, with optional prior learning using annotation or previous studies, the modeling output is more biologically sensible. The proposed model has been compared with several other state-of-the-art segmentation models through simulation benchmarking, which shows that our efficient implementation achieves comparable or better sensitivity and specificity in genomic segmentation.

  10. Heterogeneous skills and homogeneous land: segmentation and agglomeration

    OpenAIRE

    Matthias Wrede

    2013-01-01

    This paper analyzes the impact of skill heterogeneity on regional patterns of production and housing in the presence of pecuniary externalities within a general-equilibrium framework assuming monopolistic competition at intermediate good markets. It shows that the interplay of heterogeneous skills and relatively homogeneous land demand triggers skill segmentation and agglomeration. The core region, being more attractive to high skilled workers, has a disproportionately large share of producti...

  11. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments

    NARCIS (Netherlands)

    Wichgers Schreur, Paul J.; Kortekaas, Jeroen

    2016-01-01

    The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging.

  12. Assessing the scale of tumor heterogeneity by complete hierarchical segmentation of MRI

    International Nuclear Information System (INIS)

    Gensheimer, Michael F; Ermoian, Ralph P; Hawkins, Douglas S; Trister, Andrew D

    2015-01-01

    In many cancers, intratumoral heterogeneity has been found in histology, genetic variation and vascular structure. We developed an algorithm to interrogate different scales of heterogeneity using clinical imaging. We hypothesize that heterogeneity of perfusion at coarse scale may correlate with treatment resistance and propensity for disease recurrence. The algorithm recursively segments the tumor image into increasingly smaller regions. Each dividing line is chosen so as to maximize signal intensity difference between the two regions. This process continues until the tumor has been divided into single voxels, resulting in segments at multiple scales. For each scale, heterogeneity is measured by comparing each segmented region to the adjacent region and calculating the difference in signal intensity histograms. Using digital phantom images, we showed that the algorithm is robust to image artifacts and various tumor shapes. We then measured the primary tumor scales of contrast enhancement heterogeneity in MRI of 18 rhabdomyosarcoma patients. Using Cox proportional hazards regression, we explored the influence of heterogeneity parameters on relapse-free survival. Coarser scale of maximum signal intensity heterogeneity was prognostic of shorter survival (p = 0.05). By contrast, two fractal parameters and three Haralick texture features were not prognostic. In summary, our algorithm produces a biologically motivated segmentation of tumor regions and reports the amount of heterogeneity at various distance scales. If validated on a larger dataset, this prognostic imaging biomarker could be useful to identify patients at higher risk for recurrence and candidates for alternative treatment. (paper)

  13. Analysis Of Segmental Duplications In The Pig Genome Based On Next-Generation Sequencing

    DEFF Research Database (Denmark)

    Fadista, João; Bendixen, Christian

    Segmental duplications are >1kb segments of duplicated DNA present in a genome with high sequence identity (>90%). They are associated with genomic rearrangements and provide a significant source of gene and genome evolution within mammalian genomes. Although segmental duplications have been...... extensively studied in other organisms, its analysis in pig has been hampered by the lack of a complete pig genome assembly. By measuring the depth of coverage of Illumina whole-genome shotgun sequencing reads of the Tabasco animal aligned to the latest pig genome assembly (Sus scrofa 10 – based also...... and their associated copy number alterations, focusing on the global organization of these segments and their possible functional significance in porcine phenotypes. This work provides insights into mammalian genome evolution and generates a valuable resource for porcine genomics research...

  14. Structural constraints in the packaging of bluetongue virus genomic segments

    OpenAIRE

    Burkhardt, Christiane; Sung, Po-Yu; Celma, Cristina C.; Roy, Polly

    2014-01-01

    : The mechanism used by bluetongue virus (BTV) to ensure the sorting and packaging of its 10 genomic segments is still poorly understood. In this study, we investigated the packaging constraints for two BTV genomic segments from two different serotypes. Segment 4 (S4) of BTV serotype 9 was mutated sequentially and packaging of mutant ssRNAs was investigated by two newly developed RNA packaging assay systems, one in vivo and the other in vitro. Modelling of the mutated ssRNA followed by bioche...

  15. Segment-specific terminal sequences of Bunyamwera bunyavirus regulate genome replication

    International Nuclear Information System (INIS)

    Barr, John N.; Elliott, Richard M.; Dunn, Ewan F.; Wertz, Gail W.

    2003-01-01

    Bunyamwera virus (BUNV) is the prototype of both the Orthobunyavirus genus and the Bunyaviridae family of segmented negative sense RNA viruses. The tripartite BUNV genome consists of small (S), medium (M), and large (L) segments that are transcribed to give a single mRNA and replicated to generate an antigenome that is the template for synthesis of further genomic RNA strands. We modified an existing cDNA-derived RNA synthesis system to allow identification of BUNV RNA replication and transcription products by direct metabolic labeling. Direct RNA analysis allowed us to distinguish between template activities that affected either RNA replication or mRNA transcription, an ability that was not possible using previous reporter gene expression assays. We generated genome analogs containing the entire nontranslated terminal sequences of the S, M, and L BUNV segments surrounding a common sequence. Analysis of RNAs synthesized from these templates revealed that the relative abilities of BUNV segments to perform RNA replication was M > L > S. Exchange of segment-specific terminal nucleotides identified a 12-nt region located within both the 3' and 5' termini of the M segment that correlated with its high replication ability

  16. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    International Nuclear Information System (INIS)

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-01-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derived from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8

  17. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    Energy Technology Data Exchange (ETDEWEB)

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-05-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derived from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8.

  18. WE-E-17A-06: Assessing the Scale of Tumor Heterogeneity by Complete Hierarchical Segmentation On MRI

    International Nuclear Information System (INIS)

    Gensheimer, M; Trister, A; Ermoian, R; Hawkins, D

    2014-01-01

    Purpose: In many cancers, intratumoral heterogeneity exists in vascular and genetic structure. We developed an algorithm which uses clinical imaging to interrogate different scales of heterogeneity. We hypothesize that heterogeneity of perfusion at large distance scales may correlate with propensity for disease recurrence. We applied the algorithm to initial diagnosis MRI of rhabdomyosarcoma patients to predict recurrence. Methods: The Spatial Heterogeneity Analysis by Recursive Partitioning (SHARP) algorithm recursively segments the tumor image. The tumor is repeatedly subdivided, with each dividing line chosen to maximize signal intensity difference between the two subregions. This process continues to the voxel level, producing segments at multiple scales. Heterogeneity is measured by comparing signal intensity histograms between each segmented region and the adjacent region. We measured the scales of contrast enhancement heterogeneity of the primary tumor in 18 rhabdomyosarcoma patients. Using Cox proportional hazards regression, we explored the influence of heterogeneity parameters on relapse-free survival (RFS). To compare with existing methods, fractal and Haralick texture features were also calculated. Results: The complete segmentation produced by SHARP allows extraction of diverse features, including the amount of heterogeneity at various distance scales, the area of the tumor with the most heterogeneity at each scale, and for a given point in the tumor, the heterogeneity at different scales. 10/18 rhabdomyosarcoma patients suffered disease recurrence. On contrast-enhanced MRI, larger scale of maximum signal intensity heterogeneity, relative to tumor diameter, predicted for shorter RFS (p=0.05). Fractal dimension, fractal fit, and three Haralick features did not predict RFS (p=0.09-0.90). Conclusion: SHARP produces an automatic segmentation of tumor regions and reports the amount of heterogeneity at various distance scales. In rhabdomyosarcoma, RFS was

  19. Structural constraints in the packaging of bluetongue virus genomic segments.

    Science.gov (United States)

    Burkhardt, Christiane; Sung, Po-Yu; Celma, Cristina C; Roy, Polly

    2014-10-01

    The mechanism used by bluetongue virus (BTV) to ensure the sorting and packaging of its 10 genomic segments is still poorly understood. In this study, we investigated the packaging constraints for two BTV genomic segments from two different serotypes. Segment 4 (S4) of BTV serotype 9 was mutated sequentially and packaging of mutant ssRNAs was investigated by two newly developed RNA packaging assay systems, one in vivo and the other in vitro. Modelling of the mutated ssRNA followed by biochemical data analysis suggested that a conformational motif formed by interaction of the 5' and 3' ends of the molecule was necessary and sufficient for packaging. A similar structural signal was also identified in S8 of BTV serotype 1. Furthermore, the same conformational analysis of secondary structures for positive-sense ssRNAs was used to generate a chimeric segment that maintained the putative packaging motif but contained unrelated internal sequences. This chimeric segment was packaged successfully, confirming that the motif identified directs the correct packaging of the segment. © 2014 The Authors.

  20. Construction of carrier state viruses with partial genomes of the segmented dsRNA bacteriophages

    International Nuclear Information System (INIS)

    Sun Yang; Qiao Xueying; Mindich, Leonard

    2004-01-01

    The cystoviridae are bacteriophages with genomes of three segments of dsRNA enclosed within a polyhedral capsid. Two members of this family, PHI6 and PHI8, have been shown to form carrier states in which the virus replicates as a stable episome in the host bacterium while expressing reporter genes such as kanamycin resistance or lacα. The carrier state does not require the activity of all the genes necessary for phage production. It is possible to generate carrier states by infecting cells with virus or by electroporating nonreplicating plasmids containing cDNA copies of the viral genomes into the host cells. We have found that carrier states in both PHI6 and PHI8 can be formed at high frequency with all three genomic segments or with only the large and small segments. The large genomic segment codes for the proteins that constitute the inner core of the virus, which is the structure responsible for the packaging and replication of the genome. In PHI6, a carrier state can be formed with the large and middle segment if mutations occur in the gene for the major structural protein of the inner core. In PHI8, carrier state formation requires the activity of genes 8 and 12 of segment S

  1. Examining the Heterogeneous Genome Content of Multipartite Viruses BMV and CCMV by Native Mass Spectrometry

    Science.gov (United States)

    van de Waterbeemd, Michiel; Snijder, Joost; Tsvetkova, Irina B.; Dragnea, Bogdan G.; Cornelissen, Jeroen J.; Heck, Albert J. R.

    2016-06-01

    Since the concept was first introduced by Brian Chait and co-workers in 1991, mass spectrometry of proteins and protein complexes under non-denaturing conditions (native MS) has strongly developed, through parallel advances in instrumentation, sample preparation, and data analysis tools. However, the success rate of native MS analysis, particularly in heterogeneous mega-Dalton (MDa) protein complexes, still strongly depends on careful instrument modification. Here, we further explore these boundaries in native mass spectrometry, analyzing two related endogenous multipartite viruses: the Brome Mosaic Virus (BMV) and the Cowpea Chlorotic Mottle Virus (CCMV). Both CCMV and BMV are approximately 4.6 megadalton (MDa) in mass, of which approximately 1 MDA originates from the genomic content of the virion. Both viruses are produced as mixtures of three particles carrying different segments of the genome, varying by approximately 0.1 MDA in mass (~2%). This mixture of particles poses a challenging analytical problem for high-resolution native MS analysis, given the large mass scales involved. We attempt to unravel the particle heterogeneity using both Q-TOF and Orbitrap mass spectrometers extensively modified for analysis of very large assemblies. We show that manipulation of the charging behavior can provide assistance in assigning the correct charge states. Despite their challenging size and heterogeneity, we obtained native mass spectra with resolved series of charge states for both BMV and CCMV, demonstrating that native MS of endogenous multipartite virions is feasible.

  2. Microenvironmental Heterogeneity Parallels Breast Cancer Progression: A Histology-Genomic Integration Analysis.

    Directory of Open Access Journals (Sweden)

    Rachael Natrajan

    2016-02-01

    Full Text Available The intra-tumor diversity of cancer cells is under intense investigation; however, little is known about the heterogeneity of the tumor microenvironment that is key to cancer progression and evolution. We aimed to assess the degree of microenvironmental heterogeneity in breast cancer and correlate this with genomic and clinical parameters.We developed a quantitative measure of microenvironmental heterogeneity along three spatial dimensions (3-D in solid tumors, termed the tumor ecosystem diversity index (EDI, using fully automated histology image analysis coupled with statistical measures commonly used in ecology. This measure was compared with disease-specific survival, key mutations, genome-wide copy number, and expression profiling data in a retrospective study of 510 breast cancer patients as a test set and 516 breast cancer patients as an independent validation set. In high-grade (grade 3 breast cancers, we uncovered a striking link between high microenvironmental heterogeneity measured by EDI and a poor prognosis that cannot be explained by tumor size, genomics, or any other data types. However, this association was not observed in low-grade (grade 1 and 2 breast cancers. The prognostic value of EDI was superior to known prognostic factors and was enhanced with the addition of TP53 mutation status (multivariate analysis test set, p = 9 × 10-4, hazard ratio = 1.47, 95% CI 1.17-1.84; validation set, p = 0.0011, hazard ratio = 1.78, 95% CI 1.26-2.52. Integration with genome-wide profiling data identified losses of specific genes on 4p14 and 5q13 that were enriched in grade 3 tumors with high microenvironmental diversity that also substratified patients into poor prognostic groups. Limitations of this study include the number of cell types included in the model, that EDI has prognostic value only in grade 3 tumors, and that our spatial heterogeneity measure was dependent on spatial scale and tumor size.To our knowledge, this is the first

  3. Comparing genomes with rearrangements and segmental duplications.

    Science.gov (United States)

    Shao, Mingfu; Moret, Bernard M E

    2015-06-15

    Large-scale evolutionary events such as genomic rearrange.ments and segmental duplications form an important part of the evolution of genomes and are widely studied from both biological and computational perspectives. A basic computational problem is to infer these events in the evolutionary history for given modern genomes, a task for which many algorithms have been proposed under various constraints. Algorithms that can handle both rearrangements and content-modifying events such as duplications and losses remain few and limited in their applicability. We study the comparison of two genomes under a model including general rearrangements (through double-cut-and-join) and segmental duplications. We formulate the comparison as an optimization problem and describe an exact algorithm to solve it by using an integer linear program. We also devise a sufficient condition and an efficient algorithm to identify optimal substructures, which can simplify the problem while preserving optimality. Using the optimal substructures with the integer linear program (ILP) formulation yields a practical and exact algorithm to solve the problem. We then apply our algorithm to assign in-paralogs and orthologs (a necessary step in handling duplications) and compare its performance with that of the state-of-the-art method MSOAR, using both simulations and real data. On simulated datasets, our method outperforms MSOAR by a significant margin, and on five well-annotated species, MSOAR achieves high accuracy, yet our method performs slightly better on each of the 10 pairwise comparisons. http://lcbb.epfl.ch/softwares/coser. © The Author 2015. Published by Oxford University Press.

  4. Superpixel Segmentation for Polsar Images with Local Iterative Clustering and Heterogeneous Statistical Model

    Science.gov (United States)

    Xiang, D.; Ni, W.; Zhang, H.; Wu, J.; Yan, W.; Su, Y.

    2017-09-01

    Superpixel segmentation has an advantage that can well preserve the target shape and details. In this research, an adaptive polarimetric SLIC (Pol-ASLIC) superpixel segmentation method is proposed. First, the spherically invariant random vector (SIRV) product model is adopted to estimate the normalized covariance matrix and texture for each pixel. A new edge detector is then utilized to extract PolSAR image edges for the initialization of central seeds. In the local iterative clustering, multiple cues including polarimetric, texture, and spatial information are considered to define the similarity measure. Moreover, a polarimetric homogeneity measurement is used to automatically determine the tradeoff factor, which can vary from homogeneous areas to heterogeneous areas. Finally, the SLIC superpixel segmentation scheme is applied to the airborne Experimental SAR and PiSAR L-band PolSAR data to demonstrate the effectiveness of this proposed segmentation approach. This proposed algorithm produces compact superpixels which can well adhere to image boundaries in both natural and urban areas. The detail information in heterogeneous areas can be well preserved.

  5. SUPERPIXEL SEGMENTATION FOR POLSAR IMAGES WITH LOCAL ITERATIVE CLUSTERING AND HETEROGENEOUS STATISTICAL MODEL

    Directory of Open Access Journals (Sweden)

    D. Xiang

    2017-09-01

    Full Text Available Superpixel segmentation has an advantage that can well preserve the target shape and details. In this research, an adaptive polarimetric SLIC (Pol-ASLIC superpixel segmentation method is proposed. First, the spherically invariant random vector (SIRV product model is adopted to estimate the normalized covariance matrix and texture for each pixel. A new edge detector is then utilized to extract PolSAR image edges for the initialization of central seeds. In the local iterative clustering, multiple cues including polarimetric, texture, and spatial information are considered to define the similarity measure. Moreover, a polarimetric homogeneity measurement is used to automatically determine the tradeoff factor, which can vary from homogeneous areas to heterogeneous areas. Finally, the SLIC superpixel segmentation scheme is applied to the airborne Experimental SAR and PiSAR L-band PolSAR data to demonstrate the effectiveness of this proposed segmentation approach. This proposed algorithm produces compact superpixels which can well adhere to image boundaries in both natural and urban areas. The detail information in heterogeneous areas can be well preserved.

  6. Analysis of high-identity segmental duplications in the grapevine genome

    Directory of Open Access Journals (Sweden)

    Carelli Francesco N

    2011-08-01

    Full Text Available Abstract Background Segmental duplications (SDs are blocks of genomic sequence of 1-200 kb that map to different loci in a genome and share a sequence identity > 90%. SDs show at the sequence level the same characteristics as other regions of the human genome: they contain both high-copy repeats and gene sequences. SDs play an important role in genome plasticity by creating new genes and modeling genome structure. Although data is plentiful for mammals, not much was known about the representation of SDs in plant genomes. In this regard, we performed a genome-wide analysis of high-identity SDs on the sequenced grapevine (Vitis vinifera genome (PN40024. Results We demonstrate that recent SDs (> 94% identity and >= 10 kb in size are a relevant component of the grapevine genome (85 Mb, 17% of the genome sequence. We detected mitochondrial and plastid DNA and genes (10% of gene annotation in segmentally duplicated regions of the nuclear genome. In particular, the nine highest copy number genes have a copy in either or both organelle genomes. Further we showed that several duplicated genes take part in the biosynthesis of compounds involved in plant response to environmental stress. Conclusions These data show the great influence of SDs and organelle DNA transfers in modeling the Vitis vinifera nuclear DNA structure as well as the impact of SDs in contributing to the adaptive capacity of grapevine and the nutritional content of grape products through genome variation. This study represents a step forward in the full characterization of duplicated genes important for grapevine cultural needs and human health.

  7. Segmenting the human genome based on states of neutral genetic divergence.

    Science.gov (United States)

    Kuruppumullage Don, Prabhani; Ananda, Guruprasad; Chiaromonte, Francesca; Makova, Kateryna D

    2013-09-03

    Many studies have demonstrated that divergence levels generated by different mutation types vary and covary across the human genome. To improve our still-incomplete understanding of the mechanistic basis of this phenomenon, we analyze several mutation types simultaneously, anchoring their variation to specific regions of the genome. Using hidden Markov models on insertion, deletion, nucleotide substitution, and microsatellite divergence estimates inferred from human-orangutan alignments of neutrally evolving genomic sequences, we segment the human genome into regions corresponding to different divergence states--each uniquely characterized by specific combinations of divergence levels. We then parsed the mutagenic contributions of various biochemical processes associating divergence states with a broad range of genomic landscape features. We find that high divergence states inhabit guanine- and cytosine (GC)-rich, highly recombining subtelomeric regions; low divergence states cover inner parts of autosomes; chromosome X forms its own state with lowest divergence; and a state of elevated microsatellite mutability is interspersed across the genome. These general trends are mirrored in human diversity data from the 1000 Genomes Project, and departures from them highlight the evolutionary history of primate chromosomes. We also find that genes and noncoding functional marks [annotations from the Encyclopedia of DNA Elements (ENCODE)] are concentrated in high divergence states. Our results provide a powerful tool for biomedical data analysis: segmentations can be used to screen personal genome variants--including those associated with cancer and other diseases--and to improve computational predictions of noncoding functional elements.

  8. Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA

    2015-10-24

    Background Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug resistance. In an age where whole genome sequencing is increasingly relied upon for defining the structure of bacterial genomes, it is important to investigate the reliability of next generation sequencing to identify clonal variants present in a minor percentage of the population. This study aimed to define a reliable cut-off for identification of low frequency sequence variants and to subsequently investigate genetic heterogeneity and the evolution of drug resistance in M. tuberculosis. Methods Genomic DNA was isolated from single colonies from 14 rifampicin mono-resistant M. tuberculosis isolates, as well as the primary cultures and follow up MDR cultures from two of these patients. The whole genomes of the M. tuberculosis isolates were sequenced using either the Illumina MiSeq or Illumina HiSeq platforms. Sequences were analysed with an in-house pipeline. Results Using next-generation sequencing in combination with Sanger sequencing and statistical analysis we defined a read frequency cut-off of 30 % to identify low frequency M. tuberculosis variants with high confidence. Using this cut-off we demonstrated a high rate of genetic diversity between single colonies isolated from one population, showing that by using the current sequencing technology, single colonies are not a true reflection of the genetic diversity within a whole population and vice versa. We further showed that numerous heterogeneous variants emerge and then disappear during the evolution of isoniazid resistance within individual patients. Our findings allowed us to formulate a model for the selective bottleneck which occurs during the course of infection, acting as a genomic purification event. Conclusions Our study demonstrated true levels of genetic diversity

  9. Spectral entropy criteria for structural segmentation in genomic DNA sequences

    International Nuclear Information System (INIS)

    Chechetkin, V.R.; Lobzin, V.V.

    2004-01-01

    The spectral entropy is calculated with Fourier structure factors and characterizes the level of structural ordering in a sequence of symbols. It may efficiently be applied to the assessment and reconstruction of the modular structure in genomic DNA sequences. We present the relevant spectral entropy criteria for the local and non-local structural segmentation in DNA sequences. The results are illustrated with the model examples and analysis of intervening exon-intron segments in the protein-coding regions

  10. Detection and correction of false segmental duplications caused by genome mis-assembly

    Science.gov (United States)

    2010-01-01

    Diploid genomes with divergent chromosomes present special problems for assembly software as two copies of especially polymorphic regions may be mistakenly constructed, creating the appearance of a recent segmental duplication. We developed a method for identifying such false duplications and applied it to four vertebrate genomes. For each genome, we corrected mis-assemblies, improved estimates of the amount of duplicated sequence, and recovered polymorphisms between the sequenced chromosomes. PMID:20219098

  11. Molecular Heterogeneity in Primary Breast Carcinomas and Axillary Lymph Node Metastases Assessed by Genomic Fingerprinting Analysis

    Science.gov (United States)

    Ellsworth, Rachel E; Toro, Allyson L; Blackburn, Heather L; Decewicz, Alisha; Deyarmin, Brenda; Mamula, Kimberly A; Costantino, Nicholas S; Hooke, Jeffrey A; Shriver, Craig D; Ellsworth, Darrell L

    2015-01-01

    Molecular heterogeneity within primary breast carcinomas and among axillary lymph node (LN) metastases may impact diagnosis and confound treatment. In this study, we used short tandem repeated sequences to assess genomic heterogeneity and to determine hereditary relationships among primary tumor areas and regional metastases from 30 breast cancer patients. We found that primary carcinomas were genetically heterogeneous and sampling multiple areas was necessary to adequately assess genomic variability. LN metastases appeared to originate at different time periods during disease progression from different sites of the primary tumor and the extent of genomic divergence among regional metastases was associated with a less favorable patient outcome (P = 0.009). In conclusion, metastasis is a complex process influenced by primary tumor heterogeneity and variability in the timing of dissemination. Genomic variation in primary breast tumors and regional metastases may negatively impact clinical diagnostics and contribute to therapeutic resistance. PMID:26279627

  12. Molecular Heterogeneity in Primary Breast Carcinomas and Axillary Lymph Node Metastases Assessed by Genomic Fingerprinting Analysis

    Directory of Open Access Journals (Sweden)

    Rachel E. Ellsworth

    2015-01-01

    Full Text Available Molecular heterogeneity within primary breast carcinomas and among axillary lymph node (LN metastases may impact diagnosis and confound treatment. In this study, we used short tandem repeated sequences to assess genomic heterogeneity and to determine hereditary relationships among primary tumor areas and regional metastases from 30 breast cancer patients. We found that primary carcinomas were genetically heterogeneous and sampling multiple areas was necessary to adequately assess genomic variability. LN metastases appeared to originate at different time periods during disease progression from different sites of the primary tumor and the extent of genomic divergence among regional metastases was associated with a less favorable patient outcome ( P = 0.009. In conclusion, metastasis is a complex process influenced by primary tumor heterogeneity and variability in the timing of dissemination. Genomic variation in primary breast tumors and regional metastases may negatively impact clinical diagnostics and contribute to therapeutic resistance.

  13. Intra-genomic GC heterogeneity in sauropsids: evolutionary insights from cDNA mapping and GC3 profiling in snake

    Science.gov (United States)

    2012-01-01

    Background Extant sauropsids (reptiles and birds) are divided into two major lineages, the lineage of Testudines (turtles) and Archosauria (crocodilians and birds) and the lineage of Lepidosauria (tuatara, lizards, worm lizards and snakes). Karyotypes of these sauropsidan groups generally consist of macrochromosomes and microchromosomes. In chicken, microchromosomes exhibit a higher GC-content than macrochromosomes. To examine the pattern of intra-genomic GC heterogeneity in lepidosaurian genomes, we constructed a cytogenetic map of the Japanese four-striped rat snake (Elaphe quadrivirgata) with 183 cDNA clones by fluorescence in situ hybridization, and examined the correlation between the GC-content of exonic third codon positions (GC3) of the genes and the size of chromosomes on which the genes were localized. Results Although GC3 distribution of snake genes was relatively homogeneous compared with those of the other amniotes, microchromosomal genes showed significantly higher GC3 than macrochromosomal genes as in chicken. Our snake cytogenetic map also identified several conserved segments between the snake macrochromosomes and the chicken microchromosomes. Cross-species comparisons revealed that GC3 of most snake orthologs in such macrochromosomal segments were GC-poor (GC3 < 50%) whereas those of chicken orthologs in microchromosomes were relatively GC-rich (GC3 ≥ 50%). Conclusion Our results suggest that the chromosome size-dependent GC heterogeneity had already occurred before the lepidosaur-archosaur split, 275 million years ago. This character was probably present in the common ancestor of lepidosaurs and but lost in the lineage leading to Anolis during the diversification of lepidosaurs. We also identified several genes whose GC-content might have been influenced by the size of the chromosomes on which they were harbored over the course of sauropsid evolution. PMID:23140509

  14. RNA structural constraints in the evolution of the influenza A virus genome NP segment

    NARCIS (Netherlands)

    A.P. Gultyaev (Alexander); A. Tsyganov-Bodounov (Anton); M.I. Spronken (Monique); S. Van Der Kooij (Sander); R.A.M. Fouchier (Ron); R.C.L. Olsthoorn (René)

    2014-01-01

    textabstractConserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length,

  15. Deciphering the assembly of multi-segment genome complexes in influenza A virus

    OpenAIRE

    Prisner, Simon

    2017-01-01

    Influenza A besitzt ein segmentiertes, achtsträngiges Genom in negativer Orientierung. Die einzelnen Segmente sind in virale Ribonukleoproteinkomplexe (vRNPs) verpackt. Genomische Segmentierung erlaubt es Influenza, zwischen verschiedenen Stämmen Reassortierung zu betreiben, was zur Entstehung von hochgradig virulenten und potentiell pandemischen neuen Stämmen führen kann. Die Existenz eines Packungsmechanismus wird vermutet, der sicherstellt dass exakt ein Segment jeden Typs in neu knospe...

  16. Evaluation of Segmentation Bases for the Heterogeneous Elderly Consumer Population: the Functional Food Market

    NARCIS (Netherlands)

    Zanden, van der L.D.T.; Kleef, van E.; Wijk, de R.A.; Trijp, van J.C.M.

    2014-01-01

    It is beneficial for both the public health community and the food industry to meet nutritional needs of elderly consumers through product formats that they want. The heterogeneity of the elderly market poses a challenge, however, and calls for market segmentation. Although many researchers have

  17. Consumer Product Perceptions and Salmon Consumption Frequency: The Role of Heterogeneity Based on Food Lifestyle Segments

    OpenAIRE

    Yuko Onozaka; Håvard Hansen; Arne Sørvig

    2014-01-01

    Seafood consumers are vastly heterogeneous in terms of their knowledge, confidence, and perceptions about seafood. This article examines the relationship between consumer perceptions (healthiness, value for money, and convenience) and salmon consumption frequencies while modeling unobserved consumer heterogeneity by segmenting consumers based on their food-related lifestyle. We employ latent class analysis (LCA) that embeds the structural equation modeling (SEM) to ensure the latent nature of...

  18. Multi-region and single-cell sequencing reveal variable genomic heterogeneity in rectal cancer.

    Science.gov (United States)

    Liu, Mingshan; Liu, Yang; Di, Jiabo; Su, Zhe; Yang, Hong; Jiang, Beihai; Wang, Zaozao; Zhuang, Meng; Bai, Fan; Su, Xiangqian

    2017-11-23

    Colorectal cancer is a heterogeneous group of malignancies with complex molecular subtypes. While colon cancer has been widely investigated, studies on rectal cancer are very limited. Here, we performed multi-region whole-exome sequencing and single-cell whole-genome sequencing to examine the genomic intratumor heterogeneity (ITH) of rectal tumors. We sequenced nine tumor regions and 88 single cells from two rectal cancer patients with tumors of the same molecular classification and characterized their mutation profiles and somatic copy number alterations (SCNAs) at the multi-region and the single-cell levels. A variable extent of genomic heterogeneity was observed between the two patients, and the degree of ITH increased when analyzed on the single-cell level. We found that major SCNAs were early events in cancer development and inherited steadily. Single-cell sequencing revealed mutations and SCNAs which were hidden in bulk sequencing. In summary, we studied the ITH of rectal cancer at regional and single-cell resolution and demonstrated that variable heterogeneity existed in two patients. The mutational scenarios and SCNA profiles of two patients with treatment naïve from the same molecular subtype are quite different. Our results suggest each tumor possesses its own architecture, which may result in different diagnosis, prognosis, and drug responses. Remarkable ITH exists in the two patients we have studied, providing a preliminary impression of ITH in rectal cancer.

  19. Genomic Heterogeneity as a Barrier to Precision Medicine in Gastroesophageal Adenocarcinoma.

    Science.gov (United States)

    Pectasides, Eirini; Stachler, Matthew D; Derks, Sarah; Liu, Yang; Maron, Steven; Islam, Mirazul; Alpert, Lindsay; Kwak, Heewon; Kindler, Hedy; Polite, Blase; Sharma, Manish R; Allen, Kenisha; O'Day, Emily; Lomnicki, Samantha; Maranto, Melissa; Kanteti, Rajani; Fitzpatrick, Carrie; Weber, Christopher; Setia, Namrata; Xiao, Shu-Yuan; Hart, John; Nagy, Rebecca J; Kim, Kyoung-Mee; Choi, Min-Gew; Min, Byung-Hoon; Nason, Katie S; O'Keefe, Lea; Watanabe, Masayuki; Baba, Hideo; Lanman, Rick; Agoston, Agoston T; Oh, David J; Dunford, Andrew; Thorner, Aaron R; Ducar, Matthew D; Wollison, Bruce M; Coleman, Haley A; Ji, Yuan; Posner, Mitchell C; Roggin, Kevin; Turaga, Kiran; Chang, Paul; Hogarth, Kyle; Siddiqui, Uzma; Gelrud, Andres; Ha, Gavin; Freeman, Samuel S; Rhoades, Justin; Reed, Sarah; Gydush, Greg; Rotem, Denisse; Davison, Jon; Imamura, Yu; Adalsteinsson, Viktor; Lee, Jeeyun; Bass, Adam J; Catenacci, Daniel V

    2018-01-01

    Gastroesophageal adenocarcinoma (GEA) is a lethal disease where targeted therapies, even when guided by genomic biomarkers, have had limited efficacy. A potential reason for the failure of such therapies is that genomic profiling results could commonly differ between the primary and metastatic tumors. To evaluate genomic heterogeneity, we sequenced paired primary GEA and synchronous metastatic lesions across multiple cohorts, finding extensive differences in genomic alterations, including discrepancies in potentially clinically relevant alterations. Multiregion sequencing showed significant discrepancy within the primary tumor (PT) and between the PT and disseminated disease, with oncogene amplification profiles commonly discordant. In addition, a pilot analysis of cell-free DNA (cfDNA) sequencing demonstrated the feasibility of detecting genomic amplifications not detected in PT sampling. Lastly, we profiled paired primary tumors, metastatic tumors, and cfDNA from patients enrolled in the personalized antibodies for GEA (PANGEA) trial of targeted therapies in GEA and found that genomic biomarkers were recurrently discrepant between the PT and untreated metastases. Divergent primary and metastatic tissue profiling led to treatment reassignment in 32% (9/28) of patients. In discordant primary and metastatic lesions, we found 87.5% concordance for targetable alterations in metastatic tissue and cfDNA, suggesting the potential for cfDNA profiling to enhance selection of therapy. Significance: We demonstrate frequent baseline heterogeneity in targetable genomic alterations in GEA, indicating that current tissue sampling practices for biomarker testing do not effectively guide precision medicine in this disease and that routine profiling of metastatic lesions and/or cfDNA should be systematically evaluated. Cancer Discov; 8(1); 37-48. ©2017 AACR. See related commentary by Sundar and Tan, p. 14 See related article by Janjigian et al., p. 49 This article is highlighted

  20. Genomic and epigenomic heterogeneity in chronic lymphocytic leukemia

    OpenAIRE

    Guièze, Romain; Wu, Catherine J.

    2015-01-01

    Defining features of chronic lymphocytic leukemia (CLL) are not only its immunophenotype of CD19+CD5+CD23+sIgdim expressing clonal mature B cells but also its highly variable clinical course. In recent years, advances in massively parallel sequencing technologies have led to rapid progress in our understanding of the CLL genome and epigenome. Overall, these studies have clearly demarcated not only the vast degree of genetic and epigenetic heterogeneity among individuals with CLL but also even...

  1. Supplementary Material for: Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA

    2015-01-01

    Abstract Background Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug resistance. In an age where whole genome sequencing is increasingly relied upon for defining the structure of bacterial genomes, it is important to investigate the reliability of next generation sequencing to identify clonal variants present in a minor percentage of the population. This study aimed to define a reliable cut-off for identification of low frequency sequence variants and to subsequently investigate genetic heterogeneity and the evolution of drug resistance in M. tuberculosis. Methods Genomic DNA was isolated from single colonies from 14 rifampicin mono-resistant M. tuberculosis isolates, as well as the primary cultures and follow up MDR cultures from two of these patients. The whole genomes of the M. tuberculosis isolates were sequenced using either the Illumina MiSeq or Illumina HiSeq platforms. Sequences were analysed with an in-house pipeline. Results Using next-generation sequencing in combination with Sanger sequencing and statistical analysis we defined a read frequency cut-off of 30 % to identify low frequency M. tuberculosis variants with high confidence. Using this cut-off we demonstrated a high rate of genetic diversity between single colonies isolated from one population, showing that by using the current sequencing technology, single colonies are not a true reflection of the genetic diversity within a whole population and vice versa. We further showed that numerous heterogeneous variants emerge and then disappear during the evolution of isoniazid resistance within individual patients. Our findings allowed us to formulate a model for the selective bottleneck which occurs during the course of infection, acting as a genomic purification event. Conclusions Our study demonstrated true levels of genetic

  2. The Consequences of Reconfiguring the Ambisense S Genome Segment of Rift Valley Fever Virus on Viral Replication in Mammalian and Mosquito Cells and for Genome Packaging

    Science.gov (United States)

    Elliott, Richard M.

    2014-01-01

    Rift Valley fever virus (RVFV, family Bunyaviridae) is a mosquito-borne pathogen of both livestock and humans, found primarily in Sub-Saharan Africa and the Arabian Peninsula. The viral genome comprises two negative-sense (L and M segments) and one ambisense (S segment) RNAs that encode seven proteins. The S segment encodes the nucleocapsid (N) protein in the negative-sense and a nonstructural (NSs) protein in the positive-sense, though NSs cannot be translated directly from the S segment but rather from a specific subgenomic mRNA. Using reverse genetics we generated a virus, designated rMP12:S-Swap, in which the N protein is expressed from the NSs locus and NSs from the N locus within the genomic S RNA. In cells infected with rMP12:S-Swap NSs is expressed at higher levels with respect to N than in cells infected with the parental rMP12 virus. Despite NSs being the main interferon antagonist and determinant of virulence, growth of rMP12:S-Swap was attenuated in mammalian cells and gave a small plaque phenotype. The increased abundance of the NSs protein did not lead to faster inhibition of host cell protein synthesis or host cell transcription in infected mammalian cells. In cultured mosquito cells, however, infection with rMP12:S-Swap resulted in cell death rather than establishment of persistence as seen with rMP12. Finally, altering the composition of the S segment led to a differential packaging ratio of genomic to antigenomic RNA into rMP12:S-Swap virions. Our results highlight the plasticity of the RVFV genome and provide a useful experimental tool to investigate further the packaging mechanism of the segmented genome. PMID:24550727

  3. Geometry segmentation of voxelized representations of heterogeneous microstructures using betweenness centrality

    Energy Technology Data Exchange (ETDEWEB)

    Yuan, Rui; Singh, Sudhanshu S.; Chawla, Nikhilesh; Oswald, Jay, E-mail: joswald1@asu.edu

    2016-08-15

    We present a robust method for automating removal of “segregation artifacts” in segmented tomographic images of three-dimensional heterogeneous microstructures. The objective of this method is to accurately identify and separate discrete features in composite materials where limitations in imaging resolution lead to spurious connections near close contacts. The method utilizes betweenness centrality, a measure of the importance of a node in the connectivity of a graph network, to identify voxels that create artificial bridges between otherwise distinct geometric features. To facilitate automation of the algorithm, we develop a relative centrality metric to allow for the selection of a threshold criterion that is not sensitive to inclusion size or shape. As a demonstration of the effectiveness of the algorithm, we report on the segmentation of a 3D reconstruction of a SiC particle reinforced aluminum alloy, imaged by X-ray synchrotron tomography.

  4. The GC-heterogeneity of teleost fishes

    Directory of Open Access Journals (Sweden)

    Gautier Christian

    2008-12-01

    Full Text Available Abstract Background One of the most striking features of mammalian and birds chromosomes is the variation in the guanine-cytosine (GC content that occurs over scales of hundreds of kilobases to megabases; this is known as the "isochore" structure. Among other vertebrates the presence of isochores depends upon the taxon; isochore are clearly present in Crocodiles and turtles but fish genome seems very homogeneous on GC content. This has suggested a unique isochore origin after the divergence between Sarcopterygii and Actinopterygii, but before that between Sauropsida and mammals. However during more than 30 years of analysis, isochore characteristics have been studied and many important biological properties have been associated with the isochore structure of human genomes. For instance, the genes are more compact and their density is highest in GC rich isochores. Results This paper shows in teleost fish genomes the existence of "GC segmentation" sharing some of the characteristics of isochores although teleost fish genomes presenting a particular homogeneity in CG content. The entire genomes of T nigroviridis and D rerio are now available, and this has made it possible to check whether a mosaic structure associated with isochore properties can be found in these fishes. In this study, hidden Markov models were trained on fish genes (T nigroviridis and D rerio which were classified by using the isochore class of their human orthologous. A clear segmentation of these genomes was detected. Conclusion The GC content is an excellent indicator of isochores in heterogeneous genomes as mammals. The segmentation we obtained were well correlated with GC content and other properties associated to GC content such as gene density, the number of exons per gene and the length of introns. Therefore, the GC content is the main property that allows the detection of isochore but more biological properties have to be taken into account. This method allows detecting

  5. Evaluation of a Phylogenetic Marker Based on Genomic Segment B of Infectious Bursal Disease Virus: Facilitating a Feasible Incorporation of this Segment to the Molecular Epidemiology Studies for this Viral Agent.

    Science.gov (United States)

    Alfonso-Morales, Abdulahi; Rios, Liliam; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Ganges, Llilianne; Díaz de Arce, Heidy; Majó, Natàlia; Núñez, José I; Pérez, Lester J

    2015-01-01

    Infectious bursal disease (IBD) is a highly contagious and acute viral disease, which has caused high mortality rates in birds and considerable economic losses in different parts of the world for more than two decades and it still represents a considerable threat to poultry. The current study was designed to rigorously measure the reliability of a phylogenetic marker included into segment B. This marker can facilitate molecular epidemiology studies, incorporating this segment of the viral genome, to better explain the links between emergence, spreading and maintenance of the very virulent IBD virus (vvIBDV) strains worldwide. Sequences of the segment B gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank Database; Cuban sequences were obtained in the current work. A phylogenetic marker named B-marker was assessed by different phylogenetic principles such as saturation of substitution, phylogenetic noise and high consistency. This last parameter is based on the ability of B-marker to reconstruct the same topology as the complete segment B of the viral genome. From the results obtained from B-marker, demographic history for both main lineages of IBDV regarding segment B was performed by Bayesian skyline plot analysis. Phylogenetic analysis for both segments of IBDV genome was also performed, revealing the presence of a natural reassortant strain with segment A from vvIBDV strains and segment B from non-vvIBDV strains within Cuban IBDV population. This study contributes to a better understanding of the emergence of vvIBDV strains, describing molecular epidemiology of IBDV using the state-of-the-art methodology concerning phylogenetic reconstruction. This study also revealed the presence of a novel natural reassorted strain as possible manifest of change in the genetic structure and stability of the vvIBDV strains. Therefore, it highlights the need to obtain information about both genome segments of IBDV for molecular

  6. Detection and analysis of ancient segmental duplications in mammalian genomes.

    Science.gov (United States)

    Pu, Lianrong; Lin, Yu; Pevzner, Pavel A

    2018-05-07

    Although segmental duplications (SDs) represent hotbeds for genomic rearrangements and emergence of new genes, there are still no easy-to-use tools for identifying SDs. Moreover, while most previous studies focused on recently emerged SDs, detection of ancient SDs remains an open problem. We developed an SDquest algorithm for SD finding and applied it to analyzing SDs in human, gorilla, and mouse genomes. Our results demonstrate that previous studies missed many SDs in these genomes and show that SDs account for at least 6.05% of the human genome (version hg19), a 17% increase as compared to the previous estimate. Moreover, SDquest classified 6.42% of the latest GRCh38 version of the human genome as SDs, a large increase as compared to previous studies. We thus propose to re-evaluate evolution of SDs based on their accurate representation across multiple genomes. Toward this goal, we analyzed the complex mosaic structure of SDs and decomposed mosaic SDs into elementary SDs, a prerequisite for follow-up evolutionary analysis. We also introduced the concept of the breakpoint graph of mosaic SDs that revealed SD hotspots and suggested that some SDs may have originated from circular extrachromosomal DNA (ecDNA), not unlike ecDNA that contributes to accelerated evolution in cancer. © 2018 Pu et al.; Published by Cold Spring Harbor Laboratory Press.

  7. Understanding heterogeneity among elderly consumers: an evaluation of segmentation approaches in the functional food market.

    Science.gov (United States)

    van der Zanden, Lotte D T; van Kleef, Ellen; de Wijk, René A; van Trijp, Hans C M

    2014-06-01

    It is beneficial for both the public health community and the food industry to meet nutritional needs of elderly consumers through product formats that they want. The heterogeneity of the elderly market poses a challenge, however, and calls for market segmentation. Although many researchers have proposed ways to segment the elderly consumer population, the elderly food market has received surprisingly little attention in this respect. Therefore, the present paper reviewed eight potential segmentation bases on their appropriateness in the context of functional foods aimed at the elderly: cognitive age, life course, time perspective, demographics, general food beliefs, food choice motives, product attributes and benefits sought, and past purchase. Each of the segmentation bases had strengths as well as weaknesses regarding seven evaluation criteria. Given that both product design and communication are useful tools to increase the appeal of functional foods, we argue that elderly consumers in this market may best be segmented using a preference-based segmentation base that is predictive of behaviour (for example, attributes and benefits sought), combined with a characteristics-based segmentation base that describes consumer characteristics (for example, demographics). In the end, the effectiveness of (combinations of) segmentation bases for elderly consumers in the functional food market remains an empirical matter. We hope that the present review stimulates further empirical research that substantiates the ideas presented in this paper.

  8. Segmental heterogeneity of enzymatic response during compensatory renal growth

    International Nuclear Information System (INIS)

    Hoang, T.; Bergeron, M.

    1985-01-01

    The activities of DNA polymerase α and key enzymes of gluconeogenesis and glycolysis were measured in different segments of the rat nephron at various times (up to 96 hrs) following a unilateral nephrectomy (UNx). Tubule fragments were obtained after collagenase treatment followed by centrifugation on a Percoll gradient. The DNA polymerase α activity in control rats showed moderate and similar values in different segmental extracts as well as in the whole kidney extract (1700-1800 μμmole[ 3 H] dAMP/mg DNA). In Unx rats, activity in proximal tubules (PT) measured at 24, 48, 72 and 96 hrs after nephrectomy represented an increase of 60%, 200%, 420% and 370% respectively over control values. Distal tubule fragments (DT) showed only minor increases. The results demonstrate that the proximal tubule accounts for most of the compensatory renal growth (CRG) in the remaining kidney. The gluconeogenic and glycolytic enzymes were confined to the PT and those of glycolysis to the DT fragments. Following UNx, the specific activities of these enzymes were not modified in the remaining kidney; however, the overall activity of gluconeogenesis was increased as a result of the cell hyperplasia occurring in the PT. The work also illustrates that biochemical studies of CRG on the whole organ may provide misleading information due to the presence of heterogeneous cell populations in the mammalian kidney and to their uneven response in CRG

  9. GenHtr: a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing

    Directory of Open Access Journals (Sweden)

    Yu GongXin

    2010-10-01

    Full Text Available Abstract Background Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. Heterogeneity analysis is thus essential to our understanding of how selective and neutral forces shape bacterial populations over a short period of time. The Solexa Genome Analyzer, a next-generation sequencing platform, allows millions of short sequencing reads to be obtained with great accuracy, allowing for the ability to study the dynamics of the bacterial population at the whole genome level. The tool referred to as GenHtr was developed for genome-wide heterogeneity analysis. Results For particular bacterial strains, GenHtr relies on a set of Solexa short reads on given bacteria pathogens and their isogenic reference genome to identify heterogeneity sites, the chromosomal positions with multiple variants of genes in the bacterial population, and variations that occur in large gene families. GenHtr accomplishes this by building and comparatively analyzing genome-wide heterogeneity genotypes for both the newly sequenced genomes (using massive short-read sequencing and their isogenic reference (using simulated data. As proof of the concept, this approach was applied to SRX007711, the Solexa sequencing data for a newly sequenced Staphylococcus aureus subsp. USA300 cell line, and demonstrated that it could predict such multiple variants. They include multiple variants of genes critical in pathogenesis, e.g. genes encoding a LysR family transcriptional regulator, 23 S ribosomal RNA, and DNA mismatch repair protein MutS. The heterogeneity results in non-synonymous and nonsense mutations, leading to truncated proteins for both LysR and MutS. Conclusion GenHtr was developed for genome-wide heterogeneity analysis. Although it is much more time

  10. Evaluation of a Phylogenetic Marker Based on Genomic Segment B of Infectious Bursal Disease Virus: Facilitating a Feasible Incorporation of this Segment to the Molecular Epidemiology Studies for this Viral Agent.

    Directory of Open Access Journals (Sweden)

    Abdulahi Alfonso-Morales

    Full Text Available Infectious bursal disease (IBD is a highly contagious and acute viral disease, which has caused high mortality rates in birds and considerable economic losses in different parts of the world for more than two decades and it still represents a considerable threat to poultry. The current study was designed to rigorously measure the reliability of a phylogenetic marker included into segment B. This marker can facilitate molecular epidemiology studies, incorporating this segment of the viral genome, to better explain the links between emergence, spreading and maintenance of the very virulent IBD virus (vvIBDV strains worldwide.Sequences of the segment B gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank Database; Cuban sequences were obtained in the current work. A phylogenetic marker named B-marker was assessed by different phylogenetic principles such as saturation of substitution, phylogenetic noise and high consistency. This last parameter is based on the ability of B-marker to reconstruct the same topology as the complete segment B of the viral genome. From the results obtained from B-marker, demographic history for both main lineages of IBDV regarding segment B was performed by Bayesian skyline plot analysis. Phylogenetic analysis for both segments of IBDV genome was also performed, revealing the presence of a natural reassortant strain with segment A from vvIBDV strains and segment B from non-vvIBDV strains within Cuban IBDV population.This study contributes to a better understanding of the emergence of vvIBDV strains, describing molecular epidemiology of IBDV using the state-of-the-art methodology concerning phylogenetic reconstruction. This study also revealed the presence of a novel natural reassorted strain as possible manifest of change in the genetic structure and stability of the vvIBDV strains. Therefore, it highlights the need to obtain information about both genome segments of IBDV for

  11. Modeling and interoperability of heterogeneous genomic big data for integrative processing and querying.

    Science.gov (United States)

    Masseroli, Marco; Kaitoua, Abdulrahman; Pinoli, Pietro; Ceri, Stefano

    2016-12-01

    While a huge amount of (epi)genomic data of multiple types is becoming available by using Next Generation Sequencing (NGS) technologies, the most important emerging problem is the so-called tertiary analysis, concerned with sense making, e.g., discovering how different (epi)genomic regions and their products interact and cooperate with each other. We propose a paradigm shift in tertiary analysis, based on the use of the Genomic Data Model (GDM), a simple data model which links genomic feature data to their associated experimental, biological and clinical metadata. GDM encompasses all the data formats which have been produced for feature extraction from (epi)genomic datasets. We specifically describe the mapping to GDM of SAM (Sequence Alignment/Map), VCF (Variant Call Format), NARROWPEAK (for called peaks produced by NGS ChIP-seq or DNase-seq methods), and BED (Browser Extensible Data) formats, but GDM supports as well all the formats describing experimental datasets (e.g., including copy number variations, DNA somatic mutations, or gene expressions) and annotations (e.g., regarding transcription start sites, genes, enhancers or CpG islands). We downloaded and integrated samples of all the above-mentioned data types and formats from multiple sources. The GDM is able to homogeneously describe semantically heterogeneous data and makes the ground for providing data interoperability, e.g., achieved through the GenoMetric Query Language (GMQL), a high-level, declarative query language for genomic big data. The combined use of the data model and the query language allows comprehensive processing of multiple heterogeneous data, and supports the development of domain-specific data-driven computations and bio-molecular knowledge discovery. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Avian reovirus L2 genome segment sequences and predicted structure/function of the encoded RNA-dependent RNA polymerase protein

    Directory of Open Access Journals (Sweden)

    Xu Wanhong

    2008-12-01

    Full Text Available Abstract Background The orthoreoviruses are infectious agents that possess a genome comprised of 10 double-stranded RNA segments encased in two concentric protein capsids. Like virtually all RNA viruses, an RNA-dependent RNA polymerase (RdRp enzyme is required for viral propagation. RdRp sequences have been determined for the prototype mammalian orthoreoviruses and for several other closely-related reoviruses, including aquareoviruses, but have not yet been reported for any avian orthoreoviruses. Results We determined the L2 genome segment nucleotide sequences, which encode the RdRp proteins, of two different avian reoviruses, strains ARV138 and ARV176 in order to define conserved and variable regions within reovirus RdRp proteins and to better delineate structure/function of this important enzyme. The ARV138 L2 genome segment was 3829 base pairs long, whereas the ARV176 L2 segment was 3830 nucleotides long. Both segments were predicted to encode λB RdRp proteins 1259 amino acids in length. Alignments of these newly-determined ARV genome segments, and their corresponding proteins, were performed with all currently available homologous mammalian reovirus (MRV and aquareovirus (AqRV genome segment and protein sequences. There was ~55% amino acid identity between ARV λB and MRV λ3 proteins, making the RdRp protein the most highly conserved of currently known orthoreovirus proteins, and there was ~28% identity between ARV λB and homologous MRV and AqRV RdRp proteins. Predictive structure/function mapping of identical and conserved residues within the known MRV λ3 atomic structure indicated most identical amino acids and conservative substitutions were located near and within predicted catalytic domains and lining RdRp channels, whereas non-identical amino acids were generally located on the molecule's surfaces. Conclusion The ARV λB and MRV λ3 proteins showed the highest ARV:MRV identity values (~55% amongst all currently known ARV and MRV

  13. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences

    Directory of Open Access Journals (Sweden)

    Alessandra Traini

    2013-01-01

    Full Text Available Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  14. Genome Microscale Heterogeneity among Wild Potatoes Revealed by Diversity Arrays Technology Marker Sequences.

    Science.gov (United States)

    Traini, Alessandra; Iorizzo, Massimo; Mann, Harpartap; Bradeen, James M; Carputo, Domenico; Frusciante, Luigi; Chiusano, Maria Luisa

    2013-01-01

    Tuber-bearing potato species possess several genes that can be exploited to improve the genetic background of the cultivated potato Solanum tuberosum. Among them, S. bulbocastanum and S. commersonii are well known for their strong resistance to environmental stresses. However, scant information is available for these species in terms of genome organization, gene function, and regulatory networks. Consequently, genomic tools to assist breeding are meager, and efficient exploitation of these species has been limited so far. In this paper, we employed the reference genome sequences from cultivated potato and tomato and a collection of sequences of 1,423 potato Diversity Arrays Technology (DArT) markers that show polymorphic representation across the genomes of S. bulbocastanum and/or S. commersonii genotypes. Our results highlighted microscale genome sequence heterogeneity that may play a significant role in functional and structural divergence between related species. Our analytical approach provides knowledge of genome structural and sequence variability that could not be detected by transcriptome and proteome approaches.

  15. Quantitative segmentation of fluorescence microscopy images of heterogeneous tissue: Approach for tuning algorithm parameters

    Science.gov (United States)

    Mueller, Jenna L.; Harmany, Zachary T.; Mito, Jeffrey K.; Kennedy, Stephanie A.; Kim, Yongbaek; Dodd, Leslie; Geradts, Joseph; Kirsch, David G.; Willett, Rebecca M.; Brown, J. Quincy; Ramanujam, Nimmi

    2013-02-01

    The combination of fluorescent contrast agents with microscopy is a powerful technique to obtain real time images of tissue histology without the need for fixing, sectioning, and staining. The potential of this technology lies in the identification of robust methods for image segmentation and quantitation, particularly in heterogeneous tissues. Our solution is to apply sparse decomposition (SD) to monochrome images of fluorescently-stained microanatomy to segment and quantify distinct tissue types. The clinical utility of our approach is demonstrated by imaging excised margins in a cohort of mice after surgical resection of a sarcoma. Representative images of excised margins were used to optimize the formulation of SD and tune parameters associated with the algorithm. Our results demonstrate that SD is a robust solution that can advance vital fluorescence microscopy as a clinically significant technology.

  16. Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology.

    Science.gov (United States)

    Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y

    2015-10-22

    Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.

  17. An R package "VariABEL" for genome-wide searching of potentially interacting loci by testing genotypic variance heterogeneity

    Directory of Open Access Journals (Sweden)

    Struchalin Maksim V

    2012-01-01

    Full Text Available Abstract Background Hundreds of new loci have been discovered by genome-wide association studies of human traits. These studies mostly focused on associations between single locus and a trait. Interactions between genes and between genes and environmental factors are of interest as they can improve our understanding of the genetic background underlying complex traits. Genome-wide testing of complex genetic models is a computationally demanding task. Moreover, testing of such models leads to multiple comparison problems that reduce the probability of new findings. Assuming that the genetic model underlying a complex trait can include hundreds of genes and environmental factors, testing of these models in genome-wide association studies represent substantial difficulties. We and Pare with colleagues (2010 developed a method allowing to overcome such difficulties. The method is based on the fact that loci which are involved in interactions can show genotypic variance heterogeneity of a trait. Genome-wide testing of such heterogeneity can be a fast scanning approach which can point to the interacting genetic variants. Results In this work we present a new method, SVLM, allowing for variance heterogeneity analysis of imputed genetic variation. Type I error and power of this test are investigated and contracted with these of the Levene's test. We also present an R package, VariABEL, implementing existing and newly developed tests. Conclusions Variance heterogeneity analysis is a promising method for detection of potentially interacting loci. New method and software package developed in this work will facilitate such analysis in genome-wide context.

  18. Understanding intratumor heterogeneity by combining genome analysis and mathematical modeling.

    Science.gov (United States)

    Niida, Atsushi; Nagayama, Satoshi; Miyano, Satoru; Mimori, Koshi

    2018-04-01

    Cancer is composed of multiple cell populations with different genomes. This phenomenon called intratumor heterogeneity (ITH) is supposed to be a fundamental cause of therapeutic failure. Therefore, its principle-level understanding is a clinically important issue. To achieve this goal, an interdisciplinary approach combining genome analysis and mathematical modeling is essential. For example, we have recently performed multiregion sequencing to unveil extensive ITH in colorectal cancer. Moreover, by employing mathematical modeling of cancer evolution, we demonstrated that it is possible that this ITH is generated by neutral evolution. In this review, we introduce recent advances in a research field related to ITH and also discuss strategies for exploiting novel findings on ITH in a clinical setting. © 2018 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.

  19. Compositional and mutational rate heterogeneity in mitochondrial genomes and its effect on the phylogenetic inferences of Cimicomorpha (Hemiptera: Heteroptera).

    Science.gov (United States)

    Yang, Huanhuan; Li, Teng; Dang, Kai; Bu, Wenjun

    2018-04-18

    Mitochondrial genome (mt-genome) data can potentially return artefactual relationships in the higher-level phylogenetic inference of insects due to the biases of accelerated substitution rates and compositional heterogeneity. Previous studies based on mt-genome data alone showed a paraphyly of Cimicomorpha (Insecta, Hemiptera) due to the positions of the families Tingidae and Reduviidae rather than the monophyly that was supported based on morphological characters, morphological and molecular combined data and large scale molecular datasets. Various strategies have been proposed to ameliorate the effects of potential mt-genome biases, including dense taxon sampling, removal of third codon positions or purine-pyrimidine coding and the use of site-heterogeneous models. In this study, we sequenced the mt-genomes of five additional Tingidae species and discussed the compositional and mutational rate heterogeneity in mt-genomes and its effect on the phylogenetic inferences of Cimicomorpha by implementing the bias-reduction strategies mentioned above. Heterogeneity in nucleotide composition and mutational biases were found in mt protein-coding genes, and the third codon exhibited high levels of saturation. Dense taxon sampling of Tingidae and Reduviidae and the other common strategies mentioned above were insufficient to recover the monophyly of the well-established group Cimicomorpha. When the sites with weak phylogenetic signals in the dataset were removed, the remaining dataset of mt-genomes can support the monophyly of Cimicomorpha; this support demonstrates that mt-genomes possess strong phylogenetic signals for the inference of higher-level phylogeny of this group. Comparison of the ratio of the removal of amino acids for each PCG showed that ATP8 has the highest ratio while CO1 has the lowest. This pattern is largely congruent with the evolutionary rate of 13 PCGs that ATP8 represents the highest evolutionary rate, whereas CO1 appears to be the lowest. Notably

  20. Heterogeneous Ribosomes Preferentially Translate Distinct Subpools of mRNAs Genome-wide.

    Science.gov (United States)

    Shi, Zhen; Fujii, Kotaro; Kovary, Kyle M; Genuth, Naomi R; Röst, Hannes L; Teruel, Mary N; Barna, Maria

    2017-07-06

    Emerging studies have linked the ribosome to more selective control of gene regulation. However, an outstanding question is whether ribosome heterogeneity at the level of core ribosomal proteins (RPs) exists and enables ribosomes to preferentially translate specific mRNAs genome-wide. Here, we measured the absolute abundance of RPs in translating ribosomes and profiled transcripts that are enriched or depleted from select subsets of ribosomes within embryonic stem cells. We find that heterogeneity in RP composition endows ribosomes with differential selectivity for translating subpools of transcripts, including those controlling metabolism, cell cycle, and development. As an example, mRNAs enriched in binding to RPL10A/uL1-containing ribosomes are shown to require RPL10A/uL1 for their efficient translation. Within several of these transcripts, this level of regulation is mediated, at least in part, by internal ribosome entry sites. Together, these results reveal a critical functional link between ribosome heterogeneity and the post-transcriptional circuitry of gene expression. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Segmentation of time series with long-range fractal correlations

    Science.gov (United States)

    Bernaola-Galván, P.; Oliver, J.L.; Hackenberg, M.; Coronado, A.V.; Ivanov, P.Ch.; Carpena, P.

    2012-01-01

    Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome. PMID:23645997

  2. Segmentation of time series with long-range fractal correlations.

    Science.gov (United States)

    Bernaola-Galván, P; Oliver, J L; Hackenberg, M; Coronado, A V; Ivanov, P Ch; Carpena, P

    2012-06-01

    Segmentation is a standard method of data analysis to identify change-points dividing a nonstationary time series into homogeneous segments. However, for long-range fractal correlated series, most of the segmentation techniques detect spurious change-points which are simply due to the heterogeneities induced by the correlations and not to real nonstationarities. To avoid this oversegmentation, we present a segmentation algorithm which takes as a reference for homogeneity, instead of a random i.i.d. series, a correlated series modeled by a fractional noise with the same degree of correlations as the series to be segmented. We apply our algorithm to artificial series with long-range correlations and show that it systematically detects only the change-points produced by real nonstationarities and not those created by the correlations of the signal. Further, we apply the method to the sequence of the long arm of human chromosome 21, which is known to have long-range fractal correlations. We obtain only three segments that clearly correspond to the three regions of different G + C composition revealed by means of a multi-scale wavelet plot. Similar results have been obtained when segmenting all human chromosome sequences, showing the existence of previously unknown huge compositional superstructures in the human genome.

  3. Assessment of Genetic Heterogeneity in Structured Plant Populations Using Multivariate Whole-Genome Regression Models.

    Science.gov (United States)

    Lehermeier, Christina; Schön, Chris-Carolin; de Los Campos, Gustavo

    2015-09-01

    Plant breeding populations exhibit varying levels of structure and admixture; these features are likely to induce heterogeneity of marker effects across subpopulations. Traditionally, structure has been dealt with as a potential confounder, and various methods exist to "correct" for population stratification. However, these methods induce a mean correction that does not account for heterogeneity of marker effects. The animal breeding literature offers a few recent studies that consider modeling genetic heterogeneity in multibreed data, using multivariate models. However, these methods have received little attention in plant breeding where population structure can have different forms. In this article we address the problem of analyzing data from heterogeneous plant breeding populations, using three approaches: (a) a model that ignores population structure [A-genome-based best linear unbiased prediction (A-GBLUP)], (b) a stratified (i.e., within-group) analysis (W-GBLUP), and (c) a multivariate approach that uses multigroup data and accounts for heterogeneity (MG-GBLUP). The performance of the three models was assessed on three different data sets: a diversity panel of rice (Oryza sativa), a maize (Zea mays L.) half-sib panel, and a wheat (Triticum aestivum L.) data set that originated from plant breeding programs. The estimated genomic correlations between subpopulations varied from null to moderate, depending on the genetic distance between subpopulations and traits. Our assessment of prediction accuracy features cases where ignoring population structure leads to a parsimonious more powerful model as well as others where the multivariate and stratified approaches have higher predictive power. In general, the multivariate approach appeared slightly more robust than either the A- or the W-GBLUP. Copyright © 2015 by the Genetics Society of America.

  4. Non-Random Inversion Landscapes in Prokaryotic Genomes Are Shaped by Heterogeneous Selection Pressures.

    Science.gov (United States)

    Repar, Jelena; Warnecke, Tobias

    2017-08-01

    Inversions are a major contributor to structural genome evolution in prokaryotes. Here, using a novel alignment-based method, we systematically compare 1,651 bacterial and 98 archaeal genomes to show that inversion landscapes are frequently biased toward (symmetric) inversions around the origin-terminus axis. However, symmetric inversion bias is not a universal feature of prokaryotic genome evolution but varies considerably across clades. At the extremes, inversion landscapes in Bacillus-Clostridium and Actinobacteria are dominated by symmetric inversions, while there is little or no systematic bias favoring symmetric rearrangements in archaea with a single origin of replication. Within clades, we find strong but clade-specific relationships between symmetric inversion bias and different features of adaptive genome architecture, including the distance of essential genes to the origin of replication and the preferential localization of genes on the leading strand. We suggest that heterogeneous selection pressures have converged to produce similar patterns of structural genome evolution across prokaryotes. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  5. Non-canonical ribosomal DNA segments in the human genome, and nucleoli functioning.

    Science.gov (United States)

    Kupriyanova, Natalia S; Netchvolodov, Kirill K; Sadova, Anastasia A; Cherepanova, Marina D; Ryskov, Alexei P

    2015-11-10

    Ribosomal DNA (rDNA) in the human genome is represented by tandem repeats of 43 kb nucleotide sequences that form nucleoli organizers (NORs) on each of five pairs of acrocentric chromosomes. RDNA-similar segments of different lengths are also present on (NOR)(-) chromosomes. Many of these segments contain nucleotide substitutions, supplementary microsatellite clusters, and extended deletions. Recently, it was shown that, in addition to ribosome biogenesis, nucleoli exhibit additional functions, such as cell-cycle regulation and response to stresses. In particular, several stress-inducible loci located in the ribosomal intergenic spacer (rIGS) produce stimuli-specific noncoding nucleolus RNAs. By mapping the 5'/3' ends of the rIGS segments scattered throughout (NOR)(-) chromosomes, we discovered that the bonds in the rIGS that were most often susceptible to disruption in the rIGS were adjacent to, or overlapped with stimuli-specific inducible loci. This suggests the interconnection of the two phenomena - nucleoli functioning and the scattering of rDNA-like sequences on (NOR)(-) chromosomes. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Characterization of gene expression on genomic segment 7 of infectious salmon anaemia virus

    Directory of Open Access Journals (Sweden)

    Qian Biao

    2007-03-01

    Full Text Available Abstract Background Infectious salmon anaemia (ISA virus (ISAV, an important pathogen of fish that causes disease accompanied by high mortality in marine-farmed Atlantic salmon, is the only species in the genus Isavirus, one of the five genera of the Orthomyxoviridae family. The Isavirus genome consists of eight single-stranded RNA species, and the virions have two surface glycoproteins; haemagglutinin-esterase (HE protein encoded on segment 6 and fusion (F protein encoded on segment 5. Based on the initial demonstration of two 5'-coterminal mRNA transcripts by RT-PCR, ISAV genomic segment 7 was suggested to share a similar coding strategy with segment 7 of influenza A virus, encoding two proteins. However, there appears to be confusion as to the protein sizes predicted from the two open reading frames (ORFs of ISAV segment 7 which has in turn led to confusion of the predicted protein functions. The primary goal of the present work was to clone and express these two ORFs in order to assess whether the predicted protein sizes match those of the expressed proteins so as to clarify the coding assignments, and thereby identify any additional structural proteins of ISAV. Results In the present study we show that ISAV segment 7 encodes 3 proteins with estimated molecular masses of 32, 18, and 9.5 kDa. The 18-kDa and 9.5-kDa products are based on removal of an intron each from the primary transcript (7-ORF1 so that the translation continues in the +2 and +3 reading frames, respectively. The segment 7-ORF1/3 product is variably truncated in the sequence of ISAV isolates of the European genotype. All three proteins are recognized by rabbit antiserum against the 32-kDa product of the primary transcript, as they all share the N-terminal 22 amino acids. This antiserum detected a single 35-kDa protein in Western blots of purified virus, and immunoprecipitated a 32-kDa protein in ISAV-infected TO cells. Immunofluorescence staining of infected cells with the

  7. O4.04‘FROM THE CORE TO BEYOND THE MARGIN’: A GENOMIC PICTURE OF GLIOBLASTOMA INTRATUMOR HETEROGENEITY

    Science.gov (United States)

    Aubry, M.; de Tayrac, M.; Etcheverry, A.; Clavreul, A.; Saikali, S.; Menei, P.; Mosser, J.

    2014-01-01

    INTRODUCTION: Glioblastoma (GB) is a very agressive brain tumor that almost systematically recurs. This recurrence is linked to the highly invasive behavior of this tumor. Major challenges in therapy of GB are therefore associated with controlling this infiltration sustained by a strong heterogeneous biology of GB tumors. In the past decade, molecular studies of GB have been focused on inter-individual heterogeneity. These studies have identified gene mutations and molecular signatures with putative prognostic or predictive significance. However, accumulating evidence suggests that intratumor heterogeneity is the key to understanding treatment failure. MATERIAL AND METHOD: To explore GB intratumor heterogeneity we developed a surgical multisampling scheme to collect tumor fragments from 10 GB patients in four spatially distinct areas defined on 3D MRI ‘from the core to beyond the margin’: necrotic zone, tumor zone, interface, and peripheral brain zone. These samples were studied genome-wide at three molecular levels: genome, transcriptome and methylome. We used Principal Component Analysis (PCA) to classify the samples and to select genes. We constructed a co-expression network on the basis of the expression data of the most informative genes related to the data structure revealed by PCA. RESULTS AND DISCUSSION: At the genome level, we identified common GB alterations (loss/partial loss of chromosome 10, polysomy of chromosome 7, focal deletion of the CDKN2A/B and focal high-level amplifications of EGFR) and a strong inter-individual molecular heterogeneity. Transcriptome analysis highlighted a pronounced intratumor architecture reflecting the surgical sampling plan of the study and identified gene modules associated with hallmarks of cancer. We provide a master gene signature highly correlated with the intratumor gradient and associated with the tumor infiltrative behavior. In this signature, the percentage of genes presenting an anti-correlation between

  8. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations

    NARCIS (Netherlands)

    Uwimana, B.; Smulders, M.J.M.; Hooftman, D.A.P.; Hartman, Y.; van Tienderen, P.H.; Jansen, J.; McHale, L.K.; Michelmore, R.W.; Visser, R.G.F.; van de Wiel, C.C.M.

    2012-01-01

    Background: After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural

  9. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations

    NARCIS (Netherlands)

    Uwimana, B.; Smulders, M.J.M.; Hooftman, D.A.P.; Hartman, Y.; Tienderen, van P.H.; Jansen, J.; McHale, L.K.; Michelmore, R.; Visser, R.G.F.; Wiel, van de C.C.M.

    2012-01-01

    After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural populations. The

  10. Sequence analysis of the PIP5K locus in Eimeria maxima provides further evidence for eimerian genome plasticity and segmental organization.

    Science.gov (United States)

    Song, B K; Pan, M Z; Lau, Y L; Wan, K L

    2014-07-29

    Commercial flocks infected by Eimeria species parasites, including Eimeria maxima, have an increased risk of developing clinical or subclinical coccidiosis; an intestinal enteritis associated with increased mortality rates in poultry. Currently, infection control is largely based on chemotherapy or live vaccines; however, drug resistance is common and vaccines are relatively expensive. The development of new cost-effective intervention measures will benefit from unraveling the complex genetic mechanisms that underlie host-parasite interactions, including the identification and characterization of genes encoding proteins such as phosphatidylinositol 4-phosphate 5-kinase (PIP5K). We previously identified a PIP5K coding sequence within the E. maxima genome. In this study, we analyzed two bacterial artificial chromosome clones presenting a ~145-kb E. maxima (Weybridge strain) genomic region spanning the PIP5K gene locus. Sequence analysis revealed that ~95% of the simple sequence repeats detected were located within regions comparable to the previously described feature-rich segments of the Eimeria tenella genome. Comparative sequence analysis with the orthologous E. maxima (Houghton strain) region revealed a moderate level of conserved synteny. Unique segmental organizations and telomere-like repeats were also observed in both genomes. A number of incomplete transposable elements were detected and further scrutiny of these elements in both orthologous segments revealed interesting nesting events, which may play a role in facilitating genome plasticity in E. maxima. The current analysis provides more detailed information about the genome organization of E. maxima and may help to reveal genotypic differences that are important for expression of traits related to pathogenicity and virulence.

  11. Deciphering heterogeneity in pig genome assembly Sscrofa9 by isochore and isochore-like region analyses.

    Directory of Open Access Journals (Sweden)

    Wenqian Zhang

    Full Text Available BACKGROUND: The isochore, a large DNA sequence with relatively small GC variance, is one of the most important structures in eukaryotic genomes. Although the isochore has been widely studied in humans and other species, little is known about its distribution in pigs. PRINCIPAL FINDINGS: In this paper, we construct a map of long homogeneous genome regions (LHGRs, i.e., isochores and isochore-like regions, in pigs to provide an intuitive version of GC heterogeneity in each chromosome. The LHGR pattern study not only quantifies heterogeneities, but also reveals some primary characteristics of the chromatin organization, including the followings: (1 the majority of LHGRs belong to GC-poor families and are in long length; (2 a high gene density tends to occur with the appearance of GC-rich LHGRs; and (3 the density of LINE repeats decreases with an increase in the GC content of LHGRs. Furthermore, a portion of LHGRs with particular GC ranges (50%-51% and 54%-55% tend to have abnormally high gene densities, suggesting that biased gene conversion (BGC, as well as time- and energy-saving principles, could be of importance to the formation of genome organization. CONCLUSION: This study significantly improves our knowledge of chromatin organization in the pig genome. Correlations between the different biological features (e.g., gene density and repeat density and GC content of LHGRs provide a unique glimpse of in silico gene and repeats prediction.

  12. Genome-wide signatures of 'rearrangement hotspots' within segmental duplications in humans.

    Directory of Open Access Journals (Sweden)

    Mohammed Uddin

    Full Text Available The primary objective of this study was to create a genome-wide high resolution map (i.e., >100 bp of 'rearrangement hotspots' which can facilitate the identification of regions capable of mediating de novo deletions or duplications in humans. A hierarchical method was employed to fragment segmental duplications (SDs into multiple smaller SD units. Combining an end space free pairwise alignment algorithm with a 'seed and extend' approach, we have exhaustively searched 409 million alignments to detect complex structural rearrangements within the reference-guided assembly of the NA18507 human genome (18× coverage, including the previously identified novel 4.8 Mb sequence from de novo assembly within this genome. We have identified 1,963 rearrangement hotspots within SDs which encompass 166 genes and display an enrichment of duplicated gene nucleotide variants (DNVs. These regions are correlated with increased non-allelic homologous recombination (NAHR event frequency which presumably represents the origin of copy number variations (CNVs and pathogenic duplications/deletions. Analysis revealed that 20% of the detected hotspots are clustered within the proximal and distal SD breakpoints flanked by the pathogenic deletions/duplications that have been mapped for 24 NAHR-mediated genomic disorders. FISH Validation of selected complex regions revealed 94% concordance with in silico localization of the highly homologous derivatives. Other results from this study indicate that intra-chromosomal recombination is enhanced in genic compared with agenic duplicated regions, and that gene desert regions comprising SDs may represent reservoirs for creation of novel genes. The generation of genome-wide signatures of 'rearrangement hotspots', which likely serve as templates for NAHR, may provide a powerful approach towards understanding the underlying mutational mechanism(s for development of constitutional and acquired diseases.

  13. An efficient and high fidelity method for amplification, cloning and sequencing of complete tospovirus genomic RNA segments

    Science.gov (United States)

    Amplification and sequencing of the complete M- and S-RNA segments of Tomato spotted wilt virus and Impatiens necrotic spot virus as a single fragment is useful for whole genome sequencing of tospoviruses co-infecting a single host plant. It avoids issues associated with overlapping amplicon-based ...

  14. Ribosomal DNA sequence heterogeneity reflects intraspecies phylogenies and predicts genome structure in two contrasting yeast species.

    Science.gov (United States)

    West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N

    2014-07-01

    The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of

  15. The use of molecular imaging combined with genomic techniques to understand the heterogeneity in cancer metastasis

    Science.gov (United States)

    Chowdhury, R; Ganeshan, B; Irshad, S; Lawler, K; Eisenblätter, M; Milewicz, H; Rodriguez-Justo, M; Miles, K; Ellis, P; Groves, A; Punwani, S

    2014-01-01

    Tumour heterogeneity has, in recent times, come to play a vital role in how we understand and treat cancers; however, the clinical translation of this has lagged behind advances in research. Although significant advancements in oncological management have been made, personalized care remains an elusive goal. Inter- and intratumour heterogeneity, particularly in the clinical setting, has been difficult to quantify and therefore to treat. The histological quantification of heterogeneity of tumours can be a logistical and clinical challenge. The ability to examine not just the whole tumour but also all the molecular variations of metastatic disease in a patient is obviously difficult with current histological techniques. Advances in imaging techniques and novel applications, alongside our understanding of tumour heterogeneity, have opened up a plethora of non-invasive biomarker potential to examine tumours, their heterogeneity and the clinical translation. This review will focus on how various imaging methods that allow for quantification of metastatic tumour heterogeneity, along with the potential of developing imaging, integrated with other in vitro diagnostic approaches such as genomics and exosome analyses, have the potential role as a non-invasive biomarker for guiding the treatment algorithm. PMID:24597512

  16. The Statistical Segment Length of DNA: Opportunities for Biomechanical Modeling in Polymer Physics and Next-Generation Genomics.

    Science.gov (United States)

    Dorfman, Kevin D

    2018-02-01

    The development of bright bisintercalating dyes for deoxyribonucleic acid (DNA) in the 1990s, most notably YOYO-1, revolutionized the field of polymer physics in the ensuing years. These dyes, in conjunction with modern molecular biology techniques, permit the facile observation of polymer dynamics via fluorescence microscopy and thus direct tests of different theories of polymer dynamics. At the same time, they have played a key role in advancing an emerging next-generation method known as genome mapping in nanochannels. The effect of intercalation on the bending energy of DNA as embodied by a change in its statistical segment length (or, alternatively, its persistence length) has been the subject of significant controversy. The precise value of the statistical segment length is critical for the proper interpretation of polymer physics experiments and controls the phenomena underlying the aforementioned genomics technology. In this perspective, we briefly review the model of DNA as a wormlike chain and a trio of methods (light scattering, optical or magnetic tweezers, and atomic force microscopy (AFM)) that have been used to determine the statistical segment length of DNA. We then outline the disagreement in the literature over the role of bisintercalation on the bending energy of DNA, and how a multiscale biomechanical approach could provide an important model for this scientifically and technologically relevant problem.

  17. The most conserved genome segments for life detection on Earth and other planets.

    Science.gov (United States)

    Isenbarger, Thomas A; Carr, Christopher E; Johnson, Sarah Stewart; Finney, Michael; Church, George M; Gilbert, Walter; Zuber, Maria T; Ruvkun, Gary

    2008-12-01

    On Earth, very simple but powerful methods to detect and classify broad taxa of life by the polymerase chain reaction (PCR) are now standard practice. Using DNA primers corresponding to the 16S ribosomal RNA gene, one can survey a sample from any environment for its microbial inhabitants. Due to massive meteoritic exchange between Earth and Mars (as well as other planets), a reasonable case can be made for life on Mars or other planets to be related to life on Earth. In this case, the supremely sensitive technologies used to study life on Earth, including in extreme environments, can be applied to the search for life on other planets. Though the 16S gene has become the standard for life detection on Earth, no genome comparisons have established that the ribosomal genes are, in fact, the most conserved DNA segments across the kingdoms of life. We present here a computational comparison of full genomes from 13 diverse organisms from the Archaea, Bacteria, and Eucarya to identify genetic sequences conserved across the widest divisions of life. Our results identify the 16S and 23S ribosomal RNA genes as well as other universally conserved nucleotide sequences in genes encoding particular classes of transfer RNAs and within the nucleotide binding domains of ABC transporters as the most conserved DNA sequence segments across phylogeny. This set of sequences defines a core set of DNA regions that have changed the least over billions of years of evolution and provides a means to identify and classify divergent life, including ancestrally related life on other planets.

  18. Molecular characterization of genome segments 1 and 3 encoding two capsid proteins of Antheraea mylitta cytoplasmic polyhedrosis virus

    Directory of Open Access Journals (Sweden)

    Chakrabarti Mrinmay

    2010-08-01

    Full Text Available Abstract Background Antheraea mylitta cytoplasmic polyhedrosis virus (AmCPV, a cypovirus of Reoviridae family, infects Indian non-mulberry silkworm, Antheraea mylitta, and contains 11 segmented double stranded RNA (S1-S11 in its genome. Some of its genome segments (S2 and S6-S11 have been previously characterized but genome segments encoding viral capsid have not been characterized. Results In this study genome segments 1 (S1 and 3 (S3 of AmCPV were converted to cDNA, cloned and sequenced. S1 consisted of 3852 nucleotides, with one long ORF of 3735 nucleotides and could encode a protein of 1245 amino acids with molecular mass of ~141 kDa. Similarly, S3 consisted of 3784 nucleotides having a long ORF of 3630 nucleotides and could encode a protein of 1210 amino acids with molecular mass of ~137 kDa. BLAST analysis showed 20-22% homology of S1 and S3 sequence with spike and capsid proteins, respectively, of other closely related cypoviruses like Bombyx mori CPV (BmCPV, Lymantria dispar CPV (LdCPV, and Dendrolimus punctatus CPV (DpCPV. The ORFs of S1 and S3 were expressed as 141 kDa and 137 kDa insoluble His-tagged fusion proteins, respectively, in Escherichia coli M15 cells via pQE-30 vector, purified through Ni-NTA chromatography and polyclonal antibodies were raised. Immunoblot analysis of purified polyhedra, virion particles and virus infected mid-gut cells with the raised anti-p137 and anti-p141 antibodies showed specific immunoreactive bands and suggest that S1 and S3 may code for viral structural proteins. Expression of S1 and S3 ORFs in insect cells via baculovirus recombinants showed to produce viral like particles (VLPs by transmission electron microscopy. Immunogold staining showed that S3 encoded proteins self assembled to form viral outer capsid and VLPs maintained their stability at different pH in presence of S1 encoded protein. Conclusion Our results of cloning, sequencing and functional analysis of AmCPV S1 and S3 indicate that S3

  19. The impact of selection, gene flow and demographic history on heterogeneous genomic divergence: three-spine sticklebacks in divergent environments.

    Science.gov (United States)

    Ferchaud, Anne-Laure; Hansen, Michael M

    2016-01-01

    Heterogeneous genomic divergence between populations may reflect selection, but should also be seen in conjunction with gene flow and drift, particularly population bottlenecks. Marine and freshwater three-spine stickleback (Gasterosteus aculeatus) populations often exhibit different lateral armour plate morphs. Moreover, strikingly parallel genomic footprints across different marine-freshwater population pairs are interpreted as parallel evolution and gene reuse. Nevertheless, in some geographic regions like the North Sea and Baltic Sea, different patterns are observed. Freshwater populations in coastal regions are often dominated by marine morphs, suggesting that gene flow overwhelms selection, and genomic parallelism may also be less pronounced. We used RAD sequencing for analysing 28 888 SNPs in two marine and seven freshwater populations in Denmark, Europe. Freshwater populations represented a variety of environments: river populations accessible to gene flow from marine sticklebacks and large and small isolated lakes with and without fish predators. Sticklebacks in an accessible river environment showed minimal morphological and genomewide divergence from marine populations, supporting the hypothesis of gene flow overriding selection. Allele frequency spectra suggested bottlenecks in all freshwater populations, and particularly two small lake populations. However, genomic footprints ascribed to selection could nevertheless be identified. No genomic regions were consistent freshwater-marine outliers, and parallelism was much lower than in other comparable studies. Two genomic regions previously described to be under divergent selection in freshwater and marine populations were outliers between different freshwater populations. We ascribe these patterns to stronger environmental heterogeneity among freshwater populations in our study as compared to most other studies, although the demographic history involving bottlenecks should also be considered in the

  20. Life style segmentation in a cross-cultural perspective

    DEFF Research Database (Denmark)

    Poulsen, Carsten Stig; Juhl, Hans Jørn

    1997-01-01

    This paper describes some problemes in doing cross-country, comparative research, involving observed as well as unobserved heterogeneity. The setting is life style segmentation, but the arguments cover a mush broader area.......This paper describes some problemes in doing cross-country, comparative research, involving observed as well as unobserved heterogeneity. The setting is life style segmentation, but the arguments cover a mush broader area....

  1. Joint shape segmentation with linear programming

    KAUST Repository

    Huang, Qixing

    2011-01-01

    We present an approach to segmenting shapes in a heterogenous shape database. Our approach segments the shapes jointly, utilizing features from multiple shapes to improve the segmentation of each. The approach is entirely unsupervised and is based on an integer quadratic programming formulation of the joint segmentation problem. The program optimizes over possible segmentations of individual shapes as well as over possible correspondences between segments from multiple shapes. The integer quadratic program is solved via a linear programming relaxation, using a block coordinate descent procedure that makes the optimization feasible for large databases. We evaluate the presented approach on the Princeton segmentation benchmark and show that joint shape segmentation significantly outperforms single-shape segmentation techniques. © 2011 ACM.

  2. Whole-genome sequencing identifies genomic heterogeneity at a nucleotide and chromosomal level in bladder cancer

    Science.gov (United States)

    Morrison, Carl D.; Liu, Pengyuan; Woloszynska-Read, Anna; Zhang, Jianmin; Luo, Wei; Qin, Maochun; Bshara, Wiam; Conroy, Jeffrey M.; Sabatini, Linda; Vedell, Peter; Xiong, Donghai; Liu, Song; Wang, Jianmin; Shen, He; Li, Yinwei; Omilian, Angela R.; Hill, Annette; Head, Karen; Guru, Khurshid; Kunnev, Dimiter; Leach, Robert; Eng, Kevin H.; Darlak, Christopher; Hoeflich, Christopher; Veeranki, Srividya; Glenn, Sean; You, Ming; Pruitt, Steven C.; Johnson, Candace S.; Trump, Donald L.

    2014-01-01

    Using complete genome analysis, we sequenced five bladder tumors accrued from patients with muscle-invasive transitional cell carcinoma of the urinary bladder (TCC-UB) and identified a spectrum of genomic aberrations. In three tumors, complex genotype changes were noted. All three had tumor protein p53 mutations and a relatively large number of single-nucleotide variants (SNVs; average of 11.2 per megabase), structural variants (SVs; average of 46), or both. This group was best characterized by chromothripsis and the presence of subclonal populations of neoplastic cells or intratumoral mutational heterogeneity. Here, we provide evidence that the process of chromothripsis in TCC-UB is mediated by nonhomologous end-joining using kilobase, rather than megabase, fragments of DNA, which we refer to as “stitchers,” to repair this process. We postulate that a potential unifying theme among tumors with the more complex genotype group is a defective replication–licensing complex. A second group (two bladder tumors) had no chromothripsis, and a simpler genotype, WT tumor protein p53, had relatively few SNVs (average of 5.9 per megabase) and only a single SV. There was no evidence of a subclonal population of neoplastic cells. In this group, we used a preclinical model of bladder carcinoma cell lines to study a unique SV (translocation and amplification) of the gene glutamate receptor ionotropic N-methyl D-aspertate as a potential new therapeutic target in bladder cancer. PMID:24469795

  3. Quantitative image analysis of cellular heterogeneity in breast tumors complements genomic profiling.

    Science.gov (United States)

    Yuan, Yinyin; Failmezger, Henrik; Rueda, Oscar M; Ali, H Raza; Gräf, Stefan; Chin, Suet-Feung; Schwarz, Roland F; Curtis, Christina; Dunning, Mark J; Bardwell, Helen; Johnson, Nicola; Doyle, Sarah; Turashvili, Gulisa; Provenzano, Elena; Aparicio, Sam; Caldas, Carlos; Markowetz, Florian

    2012-10-24

    Solid tumors are heterogeneous tissues composed of a mixture of cancer and normal cells, which complicates the interpretation of their molecular profiles. Furthermore, tissue architecture is generally not reflected in molecular assays, rendering this rich information underused. To address these challenges, we developed a computational approach based on standard hematoxylin and eosin-stained tissue sections and demonstrated its power in a discovery and validation cohort of 323 and 241 breast tumors, respectively. To deconvolute cellular heterogeneity and detect subtle genomic aberrations, we introduced an algorithm based on tumor cellularity to increase the comparability of copy number profiles between samples. We next devised a predictor for survival in estrogen receptor-negative breast cancer that integrated both image-based and gene expression analyses and significantly outperformed classifiers that use single data types, such as microarray expression signatures. Image processing also allowed us to describe and validate an independent prognostic factor based on quantitative analysis of spatial patterns between stromal cells, which are not detectable by molecular assays. Our quantitative, image-based method could benefit any large-scale cancer study by refining and complementing molecular assays of tumor samples.

  4. Genomic profiling of plasmablastic lymphoma using array comparative genomic hybridization (aCGH: revealing significant overlapping genomic lesions with diffuse large B-cell lymphoma

    Directory of Open Access Journals (Sweden)

    Lu Xin-Yan

    2009-11-01

    Full Text Available Abstract Background Plasmablastic lymphoma (PL is a subtype of diffuse large B-cell lymphoma (DLBCL. Studies have suggested that tumors with PL morphology represent a group of neoplasms with clinopathologic characteristics corresponding to different entities including extramedullary plasmablastic tumors associated with plasma cell myeloma (PCM. The goal of the current study was to evaluate the genetic similarities and differences among PL, DLBCL (AIDS-related and non AIDS-related and PCM using array-based comparative genomic hybridization. Results Examination of genomic data in PL revealed that the most frequent segmental gain (> 40% include: 1p36.11-1p36.33, 1p34.1-1p36.13, 1q21.1-1q23.1, 7q11.2-7q11.23, 11q12-11q13.2 and 22q12.2-22q13.3. This correlated with segmental gains occurring in high frequency in DLBCL (AIDS-related and non AIDS-related cases. There were some segmental gains and some segmental loss that occurred in PL but not in the other types of lymphoma suggesting that these foci may contain genes responsible for the differentiation of this lymphoma. Additionally, some segmental gains and some segmental loss occurred only in PL and AIDS associated DLBCL suggesting that these foci may be associated with HIV infection. Furthermore, some segmental gains and some segmental loss occurred only in PL and PCM suggesting that these lesions may be related to plasmacytic differentiation. Conclusion To the best of our knowledge, the current study represents the first genomic exploration of PL. The genomic aberration pattern of PL appears to be more similar to that of DLBCL (AIDS-related or non AIDS-related than to PCM. Our findings suggest that PL may remain best classified as a subtype of DLBCL at least at the genome level.

  5. Quantitative Segmentation of Fluorescence Microscopy Images of Heterogeneous Tissue: Application to the Detection of Residual Disease in Tumor Margins.

    Science.gov (United States)

    Mueller, Jenna L; Harmany, Zachary T; Mito, Jeffrey K; Kennedy, Stephanie A; Kim, Yongbaek; Dodd, Leslie; Geradts, Joseph; Kirsch, David G; Willett, Rebecca M; Brown, J Quincy; Ramanujam, Nimmi

    2013-01-01

    To develop a robust tool for quantitative in situ pathology that allows visualization of heterogeneous tissue morphology and segmentation and quantification of image features. TISSUE EXCISED FROM A GENETICALLY ENGINEERED MOUSE MODEL OF SARCOMA WAS IMAGED USING A SUBCELLULAR RESOLUTION MICROENDOSCOPE AFTER TOPICAL APPLICATION OF A FLUORESCENT ANATOMICAL CONTRAST AGENT: acriflavine. An algorithm based on sparse component analysis (SCA) and the circle transform (CT) was developed for image segmentation and quantification of distinct tissue types. The accuracy of our approach was quantified through simulations of tumor and muscle images. Specifically, tumor, muscle, and tumor+muscle tissue images were simulated because these tissue types were most commonly observed in sarcoma margins. Simulations were based on tissue characteristics observed in pathology slides. The potential clinical utility of our approach was evaluated by imaging excised margins and the tumor bed in a cohort of mice after surgical resection of sarcoma. Simulation experiments revealed that SCA+CT achieved the lowest errors for larger nuclear sizes and for higher contrast ratios (nuclei intensity/background intensity). For imaging of tumor margins, SCA+CT effectively isolated nuclei from tumor, muscle, adipose, and tumor+muscle tissue types. Differences in density were correctly identified with SCA+CT in a cohort of ex vivo and in vivo images, thus illustrating the diagnostic potential of our approach. The combination of a subcellular-resolution microendoscope, acriflavine staining, and SCA+CT can be used to accurately isolate nuclei and quantify their density in anatomical images of heterogeneous tissue.

  6. Genomic Heterogeneity of Methicillin Resistant Staphylococcus aureus Associated with Variation in Severity of Illness among Children with Acute Hematogenous Osteomyelitis.

    Directory of Open Access Journals (Sweden)

    Claudia Gaviria-Agudelo

    Full Text Available The association between severity of illness of children with osteomyelitis caused by Methicillin-resistant Staphylococcus aureus (MRSA and genomic variation of the causative organism has not been previously investigated. The purpose of this study is to assess genomic heterogeneity among MRSA isolates from children with osteomyelitis who have diverse severity of illness.Children with osteomyelitis were prospectively studied between 2010 and 2011. Severity of illness of the affected children was determined from clinical and laboratory parameters. MRSA isolates were analyzed with next generation sequencing (NGS and optical mapping. Sequence data was used for multi-locus sequence typing (MLST, phylogenetic analysis by maximum likelihood (PAML, and identification of virulence genes and single nucleotide polymorphisms (SNP relative to reference strains.The twelve children studied demonstrated severity of illness scores ranging from 0 (mild to 9 (severe. All isolates were USA300, ST 8, SCC mec IVa MRSA by MLST. The isolates differed from reference strains by 2 insertions (40 Kb each and 2 deletions (10 and 25 Kb but had no rearrangements or copy number variations. There was a higher occurrence of virulence genes among study isolates when compared to the reference strains (p = 0.0124. There were an average of 11 nonsynonymous SNPs per strain. PAML demonstrated heterogeneity of study isolates from each other and from the reference strains.Genomic heterogeneity exists among MRSA isolates causing osteomyelitis among children in a single community. These variations may play a role in the pathogenesis of variation in clinical severity among these children.

  7. A sequence-based survey of the complex structural organization of tumor genomes

    Energy Technology Data Exchange (ETDEWEB)

    Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav; Yu, Peng; Wu, Chunxiao; Huang, Guiqing; Linardopoulou, Elena V.; Trask, Barbara J.; Waldman, Frederic; Costello, Joseph; Pienta, Kenneth J.; Mills, Gordon B.; Bajsarowicz, Krystyna; Kobayashi, Yasuko; Sridharan, Shivaranjani; Paris, Pamela; Tao, Quanzhou; Aerni, Sarah J.; Brown, Raymond P.; Bashir, Ali; Gray, Joe W.; Cheng, Jan-Fang; de Jong, Pieter; Nefedov, Mikhail; Ried, Thomas; Padilla-Nash, Hesed M.; Collins, Colin C.

    2008-04-03

    The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison of the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.

  8. Quantitative Segmentation of Fluorescence Microscopy Images of Heterogeneous Tissue: Application to the Detection of Residual Disease in Tumor Margins.

    Directory of Open Access Journals (Sweden)

    Jenna L Mueller

    Full Text Available To develop a robust tool for quantitative in situ pathology that allows visualization of heterogeneous tissue morphology and segmentation and quantification of image features.TISSUE EXCISED FROM A GENETICALLY ENGINEERED MOUSE MODEL OF SARCOMA WAS IMAGED USING A SUBCELLULAR RESOLUTION MICROENDOSCOPE AFTER TOPICAL APPLICATION OF A FLUORESCENT ANATOMICAL CONTRAST AGENT: acriflavine. An algorithm based on sparse component analysis (SCA and the circle transform (CT was developed for image segmentation and quantification of distinct tissue types. The accuracy of our approach was quantified through simulations of tumor and muscle images. Specifically, tumor, muscle, and tumor+muscle tissue images were simulated because these tissue types were most commonly observed in sarcoma margins. Simulations were based on tissue characteristics observed in pathology slides. The potential clinical utility of our approach was evaluated by imaging excised margins and the tumor bed in a cohort of mice after surgical resection of sarcoma.Simulation experiments revealed that SCA+CT achieved the lowest errors for larger nuclear sizes and for higher contrast ratios (nuclei intensity/background intensity. For imaging of tumor margins, SCA+CT effectively isolated nuclei from tumor, muscle, adipose, and tumor+muscle tissue types. Differences in density were correctly identified with SCA+CT in a cohort of ex vivo and in vivo images, thus illustrating the diagnostic potential of our approach.The combination of a subcellular-resolution microendoscope, acriflavine staining, and SCA+CT can be used to accurately isolate nuclei and quantify their density in anatomical images of heterogeneous tissue.

  9. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations.

    Science.gov (United States)

    Uwimana, Brigitte; Smulders, Marinus J M; Hooftman, Danny A P; Hartman, Yorike; van Tienderen, Peter H; Jansen, Johannes; McHale, Leah K; Michelmore, Richard W; Visser, Richard G F; van de Wiel, Clemens C M

    2012-03-26

    After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural populations. The likelihood of introgression of transgenes will not only be determined by fitness effects from the transgene itself but also by the crop genes linked to it. Although lettuce is generally regarded as self-pollinating, outbreeding does occur at a low frequency. Backcrossing to wild lettuce is a likely pathway to introgression along with selfing, due to the high frequency of wild individuals relative to the rarely occurring crop-wild hybrids. To test the effect of backcrossing on the vigour of inter-specific hybrids, Lactuca serriola, the closest wild relative of cultivated lettuce, was crossed with L. sativa and the F(1) hybrid was backcrossed to L. serriola to generate BC(1) and BC(2) populations. Experiments were conducted on progeny from selfed plants of the backcrossing families (BC(1)S(1) and BC(2)S(1)). Plant vigour of these two backcrossing populations was determined in the greenhouse under non-stress and abiotic stress conditions (salinity, drought, and nutrient deficiency). Despite the decreasing contribution of crop genomic blocks in the backcross populations, the BC(1)S(1) and BC(2)S(1) hybrids were characterized by a substantial genetic variation under both non-stress and stress conditions. Hybrids were identified that performed equally or better than the wild genotypes, indicating that two backcrossing events did not eliminate the effect of the crop genomic segments that contributed to the vigour of the BC(1) and BC(2) hybrids. QTLs for plant vigour under non-stress and the various stress conditions were detected in the two populations with positive as well as negative effects from the crop. As it was shown that the crop contributed QTLs with either a positive

  10. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations

    Directory of Open Access Journals (Sweden)

    Uwimana Brigitte

    2012-03-01

    Full Text Available Abstract Background After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (transgenes could become established in natural populations. The likelihood of introgression of transgenes will not only be determined by fitness effects from the transgene itself but also by the crop genes linked to it. Although lettuce is generally regarded as self-pollinating, outbreeding does occur at a low frequency. Backcrossing to wild lettuce is a likely pathway to introgression along with selfing, due to the high frequency of wild individuals relative to the rarely occurring crop-wild hybrids. To test the effect of backcrossing on the vigour of inter-specific hybrids, Lactuca serriola, the closest wild relative of cultivated lettuce, was crossed with L. sativa and the F1 hybrid was backcrossed to L. serriola to generate BC1 and BC2 populations. Experiments were conducted on progeny from selfed plants of the backcrossing families (BC1S1 and BC2S1. Plant vigour of these two backcrossing populations was determined in the greenhouse under non-stress and abiotic stress conditions (salinity, drought, and nutrient deficiency. Results Despite the decreasing contribution of crop genomic blocks in the backcross populations, the BC1S1 and BC2S1 hybrids were characterized by a substantial genetic variation under both non-stress and stress conditions. Hybrids were identified that performed equally or better than the wild genotypes, indicating that two backcrossing events did not eliminate the effect of the crop genomic segments that contributed to the vigour of the BC1 and BC2 hybrids. QTLs for plant vigour under non-stress and the various stress conditions were detected in the two populations with positive as well as negative effects from the crop. Conclusion As it was shown that the crop

  11. Crop to wild introgression in lettuce: following the fate of crop genome segments in backcross populations

    Science.gov (United States)

    2012-01-01

    Background After crop-wild hybridization, some of the crop genomic segments may become established in wild populations through selfing of the hybrids or through backcrosses to the wild parent. This constitutes a possible route through which crop (trans)genes could become established in natural populations. The likelihood of introgression of transgenes will not only be determined by fitness effects from the transgene itself but also by the crop genes linked to it. Although lettuce is generally regarded as self-pollinating, outbreeding does occur at a low frequency. Backcrossing to wild lettuce is a likely pathway to introgression along with selfing, due to the high frequency of wild individuals relative to the rarely occurring crop-wild hybrids. To test the effect of backcrossing on the vigour of inter-specific hybrids, Lactuca serriola, the closest wild relative of cultivated lettuce, was crossed with L. sativa and the F1 hybrid was backcrossed to L. serriola to generate BC1 and BC2 populations. Experiments were conducted on progeny from selfed plants of the backcrossing families (BC1S1 and BC2S1). Plant vigour of these two backcrossing populations was determined in the greenhouse under non-stress and abiotic stress conditions (salinity, drought, and nutrient deficiency). Results Despite the decreasing contribution of crop genomic blocks in the backcross populations, the BC1S1 and BC2S1 hybrids were characterized by a substantial genetic variation under both non-stress and stress conditions. Hybrids were identified that performed equally or better than the wild genotypes, indicating that two backcrossing events did not eliminate the effect of the crop genomic segments that contributed to the vigour of the BC1 and BC2 hybrids. QTLs for plant vigour under non-stress and the various stress conditions were detected in the two populations with positive as well as negative effects from the crop. Conclusion As it was shown that the crop contributed QTLs with either a

  12. Analysis of IAV Replication and Co-infection Dynamics by a Versatile RNA Viral Genome Labeling Method

    Directory of Open Access Journals (Sweden)

    Dan Dou

    2017-07-01

    Full Text Available Genome delivery to the proper cellular compartment for transcription and replication is a primary goal of viruses. However, methods for analyzing viral genome localization and differentiating genomes with high identity are lacking, making it difficult to investigate entry-related processes and co-examine heterogeneous RNA viral populations. Here, we present an RNA labeling approach for single-cell analysis of RNA viral replication and co-infection dynamics in situ, which uses the versatility of padlock probes. We applied this method to identify influenza A virus (IAV infections in cells and lung tissue with single-nucleotide specificity and to classify entry and replication stages by gene segment localization. Extending the classification strategy to co-infections of IAVs with single-nucleotide variations, we found that the dependence on intracellular trafficking places a time restriction on secondary co-infections necessary for genome reassortment. Altogether, these data demonstrate how RNA viral genome labeling can help dissect entry and co-infections.

  13. Grass genomes

    OpenAIRE

    Bennetzen, Jeffrey L.; SanMiguel, Phillip; Chen, Mingsheng; Tikhonov, Alexander; Francki, Michael; Avramova, Zoya

    1998-01-01

    For the most part, studies of grass genome structure have been limited to the generation of whole-genome genetic maps or the fine structure and sequence analysis of single genes or gene clusters. We have investigated large contiguous segments of the genomes of maize, sorghum, and rice, primarily focusing on intergenic spaces. Our data indicate that much (>50%) of the maize genome is composed of interspersed repetitive DNAs, primarily nested retrotransposons that in...

  14. Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas

    Science.gov (United States)

    Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent

    2012-01-01

    Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126

  15. Molecular cytogenetic and genomic analyses reveal new insights into the origin of the wheat B genome.

    Science.gov (United States)

    Zhang, Wei; Zhang, Mingyi; Zhu, Xianwen; Cao, Yaping; Sun, Qing; Ma, Guojia; Chao, Shiaoman; Yan, Changhui; Xu, Steven S; Cai, Xiwen

    2018-02-01

    This work pinpointed the goatgrass chromosomal segment in the wheat B genome using modern cytogenetic and genomic technologies, and provided novel insights into the origin of the wheat B genome. Wheat is a typical allopolyploid with three homoeologous subgenomes (A, B, and D). The donors of the subgenomes A and D had been identified, but not for the subgenome B. The goatgrass Aegilops speltoides (genome SS) has been controversially considered a possible candidate for the donor of the wheat B genome. However, the relationship of the Ae. speltoides S genome with the wheat B genome remains largely obscure. The present study assessed the homology of the B and S genomes using an integrative cytogenetic and genomic approach, and revealed the contribution of Ae. speltoides to the origin of the wheat B genome. We discovered noticeable homology between wheat chromosome 1B and Ae. speltoides chromosome 1S, but not between other chromosomes in the B and S genomes. An Ae. speltoides-originated segment spanning a genomic region of approximately 10.46 Mb was detected on the long arm of wheat chromosome 1B (1BL). The Ae. speltoides-originated segment on 1BL was found to co-evolve with the rest of the B genome. Evidently, Ae. speltoides had been involved in the origin of the wheat B genome, but should not be considered an exclusive donor of this genome. The wheat B genome might have a polyphyletic origin with multiple ancestors involved, including Ae. speltoides. These novel findings will facilitate genome studies in wheat and other polyploids.

  16. Phenotypic and genetic heterogeneity in a genome-wide linkage study of asthma families

    Directory of Open Access Journals (Sweden)

    Schuster Antje

    2005-01-01

    Full Text Available Abstract Background Asthma is a complex genetic disease with more than 20 genome-wide scans conducted so far. Regions on almost every chromosome have been linked to asthma and several genes have been associated. However, most of these associations are weak and are still awaiting replication. Methods In this study, we conducted a second-stage genome-wide scan with 408 microsatellite markers on 201 asthma-affected sib pair families and defined clinical subgroups to identify phenotype-genotype relations. Results The lowest P value for asthma in the total sample was 0.003 on chromosome 11, while several of the clinical subsets reached lower significance levels than in the overall sample. Suggestive evidence for linkage (p = 0.0007 was found for total IgE on chromosomes 1, 7 and again on chromosome 11, as well as for HDM asthma on chromosome 12. Weaker linkage signals could be found on chromosomes 4 and 5 for early onset and HDM, and, newly described, on chromosome 2 for severe asthma and on chromosome 9 for hay fever. Conclusions This phenotypic dissection underlines the importance of detailed clinical characterisations and the extreme genetic heterogeneity of asthma.

  17. Moving Segmentation Up the Supply-Chain: Supply Chain Segmentation and Artificial Neural Networks

    OpenAIRE

    Erevelles, Sunil; Fukawa, Nobuyuki

    2008-01-01

    This paper explained the concept of supply-side segmentation and transvectional alignment, and applies these concepts in the artificial neural network (ANN). To the best of our knowledge, no research has applied ANN in explaining the heterogeneity of both the supply-side and demand-side of a market in forming relational entity that consists of firms at all levels of the supply chain and the demand chain. The ANN offers a way of operationalizing the concept of supply-side segmentation. In toda...

  18. Gene conversion in the rice genome

    DEFF Research Database (Denmark)

    Xu, Shuqing; Clark, Terry; Zheng, Hongkun

    2008-01-01

    -chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P ... is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less...... involved in conversion events. CONCLUSION: The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes....

  19. Genomic selection in mink yield higher accuracies with a Bayesian approach allowing for heterogeneous variance than a GBLUP model

    DEFF Research Database (Denmark)

    Villumsen, Trine Michelle; Su, Guosheng; Cai, Zexi

    2018-01-01

    by sequencing. Four live grading traits and four traits on dried pelts for size and quality were analysed. GWAS analysis detected significant SNPs for all the traits. The single-trait Bayesian model resulted in higher accuracies for the genomic predictions than the single-trait GBLUP model, especially......The accuracy of genomic prediction for mink was compared for single-trait and multiple-trait GBLUP models and Bayesian models that allowed for heterogeneous (co)variance structure over the genome. The mink population consisted of 2,103 brown minks genotyped with the method of genotyping...... for the traits measured on dried pelts. We expected the multiple-trait models to be superior to the single trait models since the multiple-trait model can make use of information when traits are correlated. However, we did not find a general improvement in accuracies with the multiple-trait models compared...

  20. Incompatibility and competitive exclusion of genomic segments between sibling Drosophila species.

    Science.gov (United States)

    Fang, Shu; Yukilevich, Roman; Chen, Ying; Turissini, David A; Zeng, Kai; Boussy, Ian A; Wu, Chung-I

    2012-06-01

    The extent and nature of genetic incompatibilities between incipient races and sibling species is of fundamental importance to our view of speciation. However, with the exception of hybrid inviability and sterility factors, little is known about the extent of other, more subtle genetic incompatibilities between incipient species. Here we experimentally demonstrate the prevalence of such genetic incompatibilities between two young allopatric sibling species, Drosophila simulans and D. sechellia. Our experiments took advantage of 12 introgression lines that carried random introgressed D. sechellia segments in different parts of the D. simulans genome. First, we found that these introgression lines did not show any measurable sterility or inviability effects. To study if these sechellia introgressions in a simulans background contained other fitness consequences, we competed and genetically tracked the marked alleles within each introgression against the wild-type alleles for 20 generations. Strikingly, all marked D. sechellia introgression alleles rapidly decreased in frequency in only 6 to 7 generations. We then developed computer simulations to model our competition results. These simulations indicated that selection against D. sechellia introgression alleles was high (average s = 0.43) and that the marker alleles and the incompatible alleles did not separate in 78% of the introgressions. The latter result likely implies that most introgressions contain multiple genetic incompatibilities. Thus, this study reveals that, even at early stages of speciation, many parts of the genome diverge to a point where introducing foreign elements has detrimental fitness consequences, but which cannot be seen using standard sterility and inviability assays.

  1. Potential for La Crosse virus segment reassortment in nature

    Directory of Open Access Journals (Sweden)

    Geske Dave

    2008-12-01

    Full Text Available Abstract The evolutionary success of La Crosse virus (LACV, family Bunyaviridae is due to its ability to adapt to changing conditions through intramolecular genetic changes and segment reassortment. Vertical transmission of LACV in mosquitoes increases the potential for segment reassortment. Studies were conducted to determine if segment reassortment was occurring in naturally infected Aedes triseriatus from Wisconsin and Minnesota in 2000, 2004, 2006 and 2007. Mosquito eggs were collected from various sites in Wisconsin and Minnesota. They were reared in the laboratory and adults were tested for LACV antigen by immunofluorescence assay. RNA was isolated from the abdomen of infected mosquitoes and portions of the small (S, medium (M and large (L viral genome segments were amplified by RT-PCR and sequenced. Overall, the viral sequences from 40 infected mosquitoes and 5 virus isolates were analyzed. Phylogenetic and linkage disequilibrium analyses revealed that approximately 25% of infected mosquitoes and viruses contained reassorted genome segments, suggesting that LACV segment reassortment is frequent in nature.

  2. International Good Market Segmentation and Financial Market Structure

    OpenAIRE

    Basak, Suleyman; Croitoru, Benjamin

    2003-01-01

    While financial markets have recently become more complete and international capital flows well liberalized, markets for goods remain segmented. To investigate how more complete security markets may relieve the effects of this segmentation, we examine a series of two-country economies with internationally segmented good markets, distinguished by the available financial securities. We show that, under heterogeneity within countries, the financial structure matters: even with internationally co...

  3. A re-sequencing based assessment of genomic heterogeneity and fast neutron-induced deletions in a common bean cultivar

    Directory of Open Access Journals (Sweden)

    Jamie A. O'Rourke

    2013-06-01

    Full Text Available A small fast neutron mutant population has been established from Phaseolus vulgaris cv. Red Hawk. We leveraged the available P. vulgaris genome sequence and high throughput next generation DNA sequencing to examine the genomic structure of five Phaseolus vulgaris cv. Red Hawk fast neutron mutants with striking visual phenotypes. Analysis of these genomes identified three classes of structural variation; between cultivar variation, natural variation within the fast neutron mutant population, and fast neutron induced mutagenesis. Our analyses focused on the latter two classes. We identified 23 large deletions (>40 bp common to multiple individuals, illustrating residual heterogeneity and regions of structural variation within the common bean cv. Red Hawk. An additional 18 large deletions were identified in individual mutant plants. These deletions, ranging in size from 40 bp to 43,000 bp, are potentially the result of fast neutron mutagenesis. Six of the 18 deletions lie near or within gene coding regions, identifying potential candidate genes causing the mutant phenotype.

  4. A genome-wide association study of autism using the Simons Simplex Collection: Does reducing phenotypic heterogeneity in autism increase genetic homogeneity?

    Science.gov (United States)

    Chaste, Pauline; Klei, Lambertus; Sanders, Stephan J; Hus, Vanessa; Murtha, Michael T; Lowe, Jennifer K; Willsey, A Jeremy; Moreno-De-Luca, Daniel; Yu, Timothy W; Fombonne, Eric; Geschwind, Daniel; Grice, Dorothy E; Ledbetter, David H; Mane, Shrikant M; Martin, Donna M; Morrow, Eric M; Walsh, Christopher A; Sutcliffe, James S; Lese Martin, Christa; Beaudet, Arthur L; Lord, Catherine; State, Matthew W; Cook, Edwin H; Devlin, Bernie

    2015-05-01

    Phenotypic heterogeneity in autism has long been conjectured to be a major hindrance to the discovery of genetic risk factors, leading to numerous attempts to stratify children based on phenotype to increase power of discovery studies. This approach, however, is based on the hypothesis that phenotypic heterogeneity closely maps to genetic variation, which has not been tested. Our study examines the impact of subphenotyping of a well-characterized autism spectrum disorder (ASD) sample on genetic homogeneity and the ability to discover common genetic variants conferring liability to ASD. Genome-wide genotypic data of 2576 families from the Simons Simplex Collection were analyzed in the overall sample and phenotypic subgroups defined on the basis of diagnosis, IQ, and symptom profiles. We conducted a family-based association study, as well as estimating heritability and evaluating allele scores for each phenotypic subgroup. Association analyses revealed no genome-wide significant association signal. Subphenotyping did not increase power substantially. Moreover, allele scores built from the most associated single nucleotide polymorphisms, based on the odds ratio in the full sample, predicted case status in subsets of the sample equally well and heritability estimates were very similar for all subgroups. In genome-wide association analysis of the Simons Simplex Collection sample, reducing phenotypic heterogeneity had at most a modest impact on genetic homogeneity. Our results are based on a relatively small sample, one with greater homogeneity than the entire population; if they apply more broadly, they imply that analysis of subphenotypes is not a productive path forward for discovering genetic risk variants in ASD. Copyright © 2015 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  5. Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome

    NARCIS (Netherlands)

    Sharp, Andrew J.; Hansen, Sierra; Selzer, Rebecca R.; Cheng, Ze; Regan, Regina; Hurst, Jane A.; Stewart, Helen; Price, Sue M.; Blair, Edward; Hennekam, Raoul C.; Fitzpatrick, Carrie A.; Segraves, Rick; Richmond, Todd A.; Guiver, Cheryl; Albertson, Donna G.; Pinkel, Daniel; Eis, Peggy S.; Schwartz, Stuart; Knight, Samantha J. L.; Eichler, Evan E.

    2006-01-01

    Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic

  6. The variability of the large genomic segment of Tahyna orthobunyavirus and an all-atom exploration of its anti-viral drug resistance

    Czech Academy of Sciences Publication Activity Database

    Kilian, Patrik; Valdés, James J.; Lecina-Casas, D.; Chrudimský, T.; Růžek, Daniel

    2013-01-01

    Roč. 20, 2013-Dec (2013), s. 304-311 ISSN 1567-1348 R&D Projects: GA ČR GAP502/11/2116; GA MŠk(CZ) EE2.3.30.0032 Institutional support: RVO:60077344 Keywords : Tahyna virus * Orthobunyavirus * California complex * Genetic variability * Large genomic segment Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.264, year: 2013

  7. FRAMEWORK FOR COMPARING SEGMENTATION ALGORITHMS

    Directory of Open Access Journals (Sweden)

    G. Sithole

    2015-05-01

    Full Text Available The notion of a ‘Best’ segmentation does not exist. A segmentation algorithm is chosen based on the features it yields, the properties of the segments (point sets it generates, and the complexity of its algorithm. The segmentation is then assessed based on a variety of metrics such as homogeneity, heterogeneity, fragmentation, etc. Even after an algorithm is chosen its performance is still uncertain because the landscape/scenarios represented in a point cloud have a strong influence on the eventual segmentation. Thus selecting an appropriate segmentation algorithm is a process of trial and error. Automating the selection of segmentation algorithms and their parameters first requires methods to evaluate segmentations. Three common approaches for evaluating segmentation algorithms are ‘goodness methods’, ‘discrepancy methods’ and ‘benchmarks’. Benchmarks are considered the most comprehensive method of evaluation. This paper shortcomings in current benchmark methods are identified and a framework is proposed that permits both a visual and numerical evaluation of segmentations for different algorithms, algorithm parameters and evaluation metrics. The concept of the framework is demonstrated on a real point cloud. Current results are promising and suggest that it can be used to predict the performance of segmentation algorithms.

  8. The infinite sites model of genome evolution.

    Science.gov (United States)

    Ma, Jian; Ratan, Aakrosh; Raney, Brian J; Suh, Bernard B; Miller, Webb; Haussler, David

    2008-09-23

    We formalize the problem of recovering the evolutionary history of a set of genomes that are related to an unseen common ancestor genome by operations of speciation, deletion, insertion, duplication, and rearrangement of segments of bases. The problem is examined in the limit as the number of bases in each genome goes to infinity. In this limit, the chromosomes are represented by continuous circles or line segments. For such an infinite-sites model, we present a polynomial-time algorithm to find the most parsimonious evolutionary history of any set of related present-day genomes.

  9. Genomic segments RNA1 and RNA2 of Prunus necrotic ringspot virus codetermine viral pathogenicity to adapt to alternating natural Prunus hosts.

    Science.gov (United States)

    Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

    2013-05-01

    Prunus necrotic ringspot virus (PNRSV) affects Prunus fruit production worldwide. To date, numerous PNRSV isolates with diverse pathological properties have been documented. To study the pathogenicity of PNRSV, which directly or indirectly determines the economic losses of infected fruit trees, we have recently sequenced the complete genome of peach isolate Pch12 and cherry isolate Chr3, belonging to the pathogenically aggressive PV32 group and mild PV96 group, respectively. Here, we constructed the Chr3- and Pch12-derived full-length cDNA clones that were infectious in the experimental host cucumber and their respective natural Prunus hosts. Pch12-derived clones induced much more severe symptoms than Chr3 in cucumber, and the pathogenicity discrepancy between Chr3 and Pch12 was associated with virus accumulation. By reassortment of genomic segments, swapping of partial genomic segments, and site-directed mutagenesis, we identified the 3' terminal nucleotide sequence (1C region) in RNA1 and amino acid K at residue 279 in RNA2-encoded P2 as the severe virulence determinants in Pch12. Gain-of-function experiments demonstrated that both the 1C region and K279 of Pch12 were required for severe virulence and high levels of viral accumulation. Our results suggest that PNRSV RNA1 and RNA2 codetermine viral pathogenicity to adapt to alternating natural Prunus hosts, likely through mediating viral accumulation.

  10. Biotic Drivers of Spatial Heterogeneity and Implications for River Ecosystems

    Science.gov (United States)

    Wohl, Ellen

    2017-04-01

    Rivers throughout the northern hemisphere have been simplified and homogenized by the removal of beavers and instream wood, along with numerous forms of channel engineering and flow regulation. Loss of spatial heterogeneity in river corridors - channels and floodplains - affects downstream fluxes of water, sediment, organic matter, and nutrients, as well as stream metabolism, biomass, and biodiversity. Recent work in streams of the Colorado Rocky Mountains illustrates how the presence of beavers and instream wood can facilitate spatial heterogeneity by creating stable, persistent, multithread channel planform and high channel-floodplain and channel-hyporheic zone connectivity. This spatial heterogeneity facilitates retention of water in pools, floodplain wetlands, and hyporheic storage. Suspended sediment, particulate organic matter (POM), and solutes are also more likely to be retained in these stream segments than in more uniform stream segments with greater downstream conveyance. Retention of POM and solutes equates to greater volumes of organic carbon storage per unit valley length and greater rates of nitrogen uptake. Spatially heterogeneous stream segments also exhibit greater biomass and biodiversity of aquatic macroinvertebrates, salmonid fish, and riparian spiders than do more uniform stream segments. These significant differences in stream form and function are unlikely to be unique to this field area and can provide a conceptual model for understanding and restoring ecosystem functions in other rivers.

  11. Systematic discovery of unannotated genes in 11 yeast species using a database of orthologous genomic segments

    LENUS (Irish Health Repository)

    OhEigeartaigh, Sean S

    2011-07-26

    Abstract Background In standard BLAST searches, no information other than the sequences of the query and the database entries is considered. However, in situations where two genes from different species have only borderline similarity in a BLAST search, the discovery that the genes are located within a region of conserved gene order (synteny) can provide additional evidence that they are orthologs. Thus, for interpreting borderline search results, it would be useful to know whether the syntenic context of a database hit is similar to that of the query. This principle has often been used in investigations of particular genes or genomic regions, but to our knowledge it has never been implemented systematically. Results We made use of the synteny information contained in the Yeast Gene Order Browser database for 11 yeast species to carry out a systematic search for protein-coding genes that were overlooked in the original annotations of one or more yeast genomes but which are syntenic with their orthologs. Such genes tend to have been overlooked because they are short, highly divergent, or contain introns. The key features of our software - called SearchDOGS - are that the database entries are classified into sets of genomic segments that are already known to be orthologous, and that very weak BLAST hits are retained for further analysis if their genomic location is similar to that of the query. Using SearchDOGS we identified 595 additional protein-coding genes among the 11 yeast species, including two new genes in Saccharomyces cerevisiae. We found additional genes for the mating pheromone a-factor in six species including Kluyveromyces lactis. Conclusions SearchDOGS has proven highly successful for identifying overlooked genes in the yeast genomes. We anticipate that our approach can be adapted for study of further groups of species, such as bacterial genomes. More generally, the concept of doing sequence similarity searches against databases to which external

  12. Novel approach for identification of influenza virus host range and zoonotic transmissible sequences by determination of host-related associative positions in viral genome segments.

    Science.gov (United States)

    Kargarfard, Fatemeh; Sami, Ashkan; Mohammadi-Dehcheshmeh, Manijeh; Ebrahimie, Esmaeil

    2016-11-16

    Recent (2013 and 2009) zoonotic transmission of avian or porcine influenza to humans highlights an increase in host range by evading species barriers. Gene reassortment or antigenic shift between viruses from two or more hosts can generate a new life-threatening virus when the new shuffled virus is no longer recognized by antibodies existing within human populations. There is no large scale study to help understand the underlying mechanisms of host transmission. Furthermore, there is no clear understanding of how different segments of the influenza genome contribute in the final determination of host range. To obtain insight into the rules underpinning host range determination, various supervised machine learning algorithms were employed to mine reassortment changes in different viral segments in a range of hosts. Our multi-host dataset contained whole segments of 674 influenza strains organized into three host categories: avian, human, and swine. Some of the sequences were assigned to multiple hosts. In point of fact, the datasets are a form of multi-labeled dataset and we utilized a multi-label learning method to identify discriminative sequence sites. Then algorithms such as CBA, Ripper, and decision tree were applied to extract informative and descriptive association rules for each viral protein segment. We found informative rules in all segments that are common within the same host class but varied between different hosts. For example, for infection of an avian host, HA14V and NS1230S were the most important discriminative and combinatorial positions. Host range identification is facilitated by high support combined rules in this study. Our major goal was to detect discriminative genomic positions that were able to identify multi host viruses, because such viruses are likely to cause pandemic or disastrous epidemics.

  13. Interference, heterogeneity and disease gene mapping

    Energy Technology Data Exchange (ETDEWEB)

    Keats, B. [Louisiana State Univ. Medical Center, New Orleans, LA (United States)

    1996-12-31

    The Human Genome Project has had a major impact on genetic research over the past five years. The number of mapped genes is now over 3,000 compared with approximately 1,600 in 1989 and only about 260 ten years before that. The realization that extensive variation could be detected in anonymous DNA segments greatly enhanced the potential for mapping by linkage analysis. Previously, linkage studies had depended on polymorphisms that could be detected in red blood cell antigens, proteins (revealed by electrophoresis and isoelectric focusing), and cytogenetic heteromorphisms. The identification of thousands of polymorphic DNA markers throughout the human genome has led to the construction of high density genetic linkage maps. These maps provide the data necessary to test hypotheses concerning differences in recombination rates and levels of interference. They are also important for disease gene mapping because the existence of these genes must be inferred from the phenotype. Showing linkage of a disease gene to a DNA marker is the first step towards isolating the disease gene, determining its protein product, and developing effective therapies. However, interpretation of results is not always straightforward. Factors such as etiological heterogeneity and undetected irregular segregation can lead to confusing linkage results and incorrect conclusions about the locations of disease genes. This paper will discuss these phenomena and present examples that illustrate the problems, as well as approaches to dealing with them. 23 refs., 3 figs., 3 tabs.

  14. Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome.

    Science.gov (United States)

    Sharp, Andrew J; Hansen, Sierra; Selzer, Rebecca R; Cheng, Ze; Regan, Regina; Hurst, Jane A; Stewart, Helen; Price, Sue M; Blair, Edward; Hennekam, Raoul C; Fitzpatrick, Carrie A; Segraves, Rick; Richmond, Todd A; Guiver, Cheryl; Albertson, Donna G; Pinkel, Daniel; Eis, Peggy S; Schwartz, Stuart; Knight, Samantha J L; Eichler, Evan E

    2006-09-01

    Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic disorders. We tested 290 individuals with mental retardation by BAC array comparative genomic hybridization and identified 16 pathogenic rearrangements, including de novo microdeletions of 17q21.31 found in four individuals. Using oligonucleotide arrays, we refined the breakpoints of this microdeletion, defining a 478-kb critical region containing six genes that were deleted in all four individuals. We mapped the breakpoints of this deletion and of four other pathogenic rearrangements in 1q21.1, 15q13, 15q24 and 17q12 to flanking segmental duplications, suggesting that these are also sites of recurrent rearrangement. In common with the 17q21.31 deletion, these breakpoint regions are sites of copy number polymorphism in controls, indicating that these may be inherently unstable genomic regions.

  15. Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops

    Directory of Open Access Journals (Sweden)

    Gendrault-Jacquemard A

    2005-07-01

    Full Text Available Abstract Background Public databases now contain multitude of complete bacterial genomes, including several genomes of the same species. The available data offers new opportunities to address questions about bacterial genome evolution, a task that requires reliable fine comparison data of closely related genomes. Recent analyses have shown, using pairwise whole genome alignments, that it is possible to segment bacterial genomes into a common conserved backbone and strain-specific sequences called loops. Results Here, we generalize this approach and propose a strategy that allows systematic and non-biased genome segmentation based on multiple genome alignments. Segmentation analyses, as applied to 13 different bacterial species, confirmed the feasibility of our approach to discern the 'mosaic' organization of bacterial genomes. Segmentation results are available through a Web interface permitting functional analysis, extraction and visualization of the backbone/loops structure of documented genomes. To illustrate the potential of this approach, we performed a precise analysis of the mosaic organization of three E. coli strains and functional characterization of the loops. Conclusion The segmentation results including the backbone/loops structure of 13 bacterial species genomes are new and available for use by the scientific community at the URL: http://genome.jouy.inra.fr/mosaic.

  16. Equatorial segment of the mid-atlantic ridge

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-12-31

    The Equatorial Segment of the Mid-Atlantic Ridge is a part of this mid-oceanic ridge limited by a cluster of fracture zones - Cape Verde, Marathon, Mercury, Vema, Doldrums, Vernadsky and Sierra Leone - in the North, and a similar cluster of fracture zones - St Paul, Romanche and Chain - in the South. During recent decades, following the publication of the 5. edition of the General Bathymetric Chart of the Oceans (GEBCO), there has been a great deal of geological-geophysical research and mapping of the World Ocean. The results have led to the development of a number of theories concerning the essential heterogeneity of the structure of the ocean floor and, in particular, the heterogeneity of the structure and segmentation of mid-oceanic ridges. Research on the nature of such segmentation is of great importance for an understanding of the processes of development of such ridges and oceanic basins as a whole. Chapter 20 is dedicated to the study of the atlantic ocean mantle by using (Th.U)Th, (Th/U)pb and K/Ti systematics 380 refs.

  17. Equatorial segment of the mid-atlantic ridge

    International Nuclear Information System (INIS)

    1996-01-01

    The Equatorial Segment of the Mid-Atlantic Ridge is a part of this mid-oceanic ridge limited by a cluster of fracture zones - Cape Verde, Marathon, Mercury, Vema, Doldrums, Vernadsky and Sierra Leone - in the North, and a similar cluster of fracture zones - St Paul, Romanche and Chain - in the South. During recent decades, following the publication of the 5. edition of the General Bathymetric Chart of the Oceans (GEBCO), there has been a great deal of geological-geophysical research and mapping of the World Ocean. The results have led to the development of a number of theories concerning the essential heterogeneity of the structure of the ocean floor and, in particular, the heterogeneity of the structure and segmentation of mid-oceanic ridges. Research on the nature of such segmentation is of great importance for an understanding of the processes of development of such ridges and oceanic basins as a whole. Chapter 20 is dedicated to the study of the atlantic ocean mantle by using (Th.U)Th, (Th/U)pb and K/Ti systematics

  18. Equatorial segment of the mid-atlantic ridge

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1996-12-31

    The Equatorial Segment of the Mid-Atlantic Ridge is a part of this mid-oceanic ridge limited by a cluster of fracture zones - Cape Verde, Marathon, Mercury, Vema, Doldrums, Vernadsky and Sierra Leone - in the North, and a similar cluster of fracture zones - St Paul, Romanche and Chain - in the South. During recent decades, following the publication of the 5. edition of the General Bathymetric Chart of the Oceans (GEBCO), there has been a great deal of geological-geophysical research and mapping of the World Ocean. The results have led to the development of a number of theories concerning the essential heterogeneity of the structure of the ocean floor and, in particular, the heterogeneity of the structure and segmentation of mid-oceanic ridges. Research on the nature of such segmentation is of great importance for an understanding of the processes of development of such ridges and oceanic basins as a whole. Chapter 20 is dedicated to the study of the atlantic ocean mantle by using (Th.U)Th, (Th/U)pb and K/Ti systematics 380 refs.

  19. Deciphering cancer heterogeneity: the biological space

    Directory of Open Access Journals (Sweden)

    Stephanie eRoessler

    2014-04-01

    Full Text Available Most lethal solid tumors including hepatocellular carcinoma (HCC are considered incurable due to extensive heterogeneity in clinical presentation and tumor biology. Tumor heterogeneity may result from different cells of origin, patient ethnicity, etiology, underlying disease and diversity of genomic and epigenomic changes which drive tumor development. Cancer genomic heterogeneity thereby impedes treatment options and poses a significant challenge to cancer management. Studies of the HCC genome have revealed that although various genomic signatures identified in different HCC subgroups share a common prognosis, each carries unique molecular changes which are linked to different sets of cancer hallmarks whose misregulation has been proposed by Hanahan and Weinberg to be essential for tumorigenesis. We hypothesize that these specific sets of cancer hallmarks collectively occupy different tumor biological space representing the misregulation of different biological processes. In principle, a combination of different cancer hallmarks can result in new convergent molecular networks that are unique to each tumor subgroup and represent ideal druggable targets. Due to the ability of the tumor to adapt to external factors such as treatment or changes in the tumor microenvironment, the tumor biological space is elastic. Our ability to identify distinct groups of cancer patients with similar tumor biology who are most likely to respond to a specific therapy would have a significant impact on improving patient outcome. It is currently a challenge to identify a particular hallmark or a newly emerged convergent molecular network for a particular tumor. Thus, it is anticipated that the integration of multiple levels of data such as genomic mutations, somatic copy number aberration, gene expression, proteomics, and metabolomics, may help us grasp the tumor biological space occupied by each individual, leading to improved therapeutic intervention and outcome.

  20. Cancer heterogeneity and imaging.

    Science.gov (United States)

    O'Connor, James P B

    2017-04-01

    There is interest in identifying and quantifying tumor heterogeneity at the genomic, tissue pathology and clinical imaging scales, as this may help better understand tumor biology and may yield useful biomarkers for guiding therapy-based decision making. This review focuses on the role and value of using x-ray, CT, MRI and PET based imaging methods that identify, measure and map tumor heterogeneity. In particular we highlight the potential value of these techniques and the key challenges required to validate and qualify these biomarkers for clinical use. Copyright © 2016. Published by Elsevier Ltd.

  1. Molecular Characterization of Bombyx mori Cytoplasmic Polyhedrosis Virus Genome Segment 4

    Science.gov (United States)

    Ikeda, Keiko; Nagaoka, Sumiharu; Winkler, Stefan; Kotani, Kumiko; Yagi, Hiroaki; Nakanishi, Kae; Miyajima, Shigetoshi; Kobayashi, Jun; Mori, Hajime

    2001-01-01

    The complete nucleotide sequence of the genome segment 4 (S4) of Bombyx mori cytoplasmic polyhedrosis virus (BmCPV) was determined. The 3,259-nucleotide sequence contains a single long open reading frame which spans nucleotides 14 to 3187 and which is predicted to encode a protein with a molecular mass of about 130 kDa. Western blot analysis showed that S4 encodes BmCPV protein VP3, which is one of the outer components of the BmCPV virion. Sequence analysis of the deduced amino acid sequence of BmCPV VP3 revealed possible sequence homology with proteins from rice ragged stunt virus (RRSV) S2, Nilaparvata lugens reovirus S4, and Fiji disease fijivirus S4. This may suggest that plant reoviruses originated from insect viruses and that RRSV emerged more recently than other plant reoviruses. A chimeric protein consisting of BmCPV VP3 and green fluorescent protein (GFP) was constructed and expressed with BmCPV polyhedrin using a baculovirus expression vector. The VP3-GFP chimera was incorporated into BmCPV polyhedra and released under alkaline conditions. The results indicate that specific interactions occur between BmCPV polyhedrin and VP3 which might facilitate BmCPV virion occlusion into the polyhedra. PMID:11134312

  2. Comparative transcriptional and genomic analysis of Plasmodium falciparum field isolates.

    Directory of Open Access Journals (Sweden)

    Margaret J Mackinnon

    2009-10-01

    Full Text Available Mechanisms for differential regulation of gene expression may underlie much of the phenotypic variation and adaptability of malaria parasites. Here we describe transcriptional variation among culture-adapted field isolates of Plasmodium falciparum, the species responsible for most malarial disease. It was found that genes coding for parasite protein export into the red cell cytosol and onto its surface, and genes coding for sexual stage proteins involved in parasite transmission are up-regulated in field isolates compared with long-term laboratory isolates. Much of this variability was associated with the loss of small or large chromosomal segments, or other forms of gene copy number variation that are prevalent in the P. falciparum genome (copy number variants, CNVs. Expression levels of genes inside these segments were correlated to that of genes outside and adjacent to the segment boundaries, and this association declined with distance from the CNV boundary. This observation could not be explained by copy number variation in these adjacent genes. This suggests a local-acting regulatory role for CNVs in transcription of neighboring genes and helps explain the chromosomal clustering that we observed here. Transcriptional co-regulation of physical clusters of adaptive genes may provide a way for the parasite to readily adapt to its highly heterogeneous and strongly selective environment.

  3. Supplementary Material for: Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA; Vos, M. de; Louw, GE; Merwe, RG van der; Dippenaar, A.; Streicher, EM; Abdallah, AM; Sampson, SL; Victor, TC; Dolby, T.; Simpson, JA; Helden, PD van; Warren, RM; Pain, Arnab

    2015-01-01

    Abstract Background Whole genome sequencing has revolutionised the interrogation of mycobacterial genomes. Recent studies have reported conflicting findings on the genomic stability of Mycobacterium tuberculosis during the evolution of drug

  4. Short segment search method for phylogenetic analysis using nested sliding windows

    Science.gov (United States)

    Iskandar, A. A.; Bustamam, A.; Trimarsanto, H.

    2017-10-01

    To analyze phylogenetics in Bioinformatics, coding DNA sequences (CDS) segment is needed for maximal accuracy. However, analysis by CDS cost a lot of time and money, so a short representative segment by CDS, which is envelope protein segment or non-structural 3 (NS3) segment is necessary. After sliding window is implemented, a better short segment than envelope protein segment and NS3 is found. This paper will discuss a mathematical method to analyze sequences using nested sliding window to find a short segment which is representative for the whole genome. The result shows that our method can find a short segment which more representative about 6.57% in topological view to CDS segment than an Envelope segment or NS3 segment.

  5. An improved segmented gamma scanning for radioactive waste drums

    International Nuclear Information System (INIS)

    Liu Cheng; Wang Dezhong; Bai Yunfei; Qian Nan

    2010-01-01

    In this paper, the equivalent radius of radioactive sources in each segment is determined by analyzing the different responses of the two identical detectors, and an improved segmented gamma scanning is used to assay waste drums containing mainly organic materials, and proved by an established simulation model. The simulated radioactivity distributions in homogenous waste drum and an experimental heterogeneous waste drum were compared with those of traditional segmented gamma scanning. The results show that our method is good in performance and can be used for analyzing the waste drums. (authors)

  6. The W22 genome: a foundation for maize functional genomics and transposon biology

    Science.gov (United States)

    The maize W22 inbred has served as a platform for maize genetics since the mid twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using small-read sequencing technologies. We show that significant structural heterogeneity exists in ...

  7. An Improved Algorithm Based on Minimum Spanning Tree for Multi-scale Segmentation of Remote Sensing Imagery

    Directory of Open Access Journals (Sweden)

    LI Hui

    2015-07-01

    Full Text Available As the basis of object-oriented information extraction from remote sensing imagery,image segmentation using multiple image features,exploiting spatial context information, and by a multi-scale approach are currently the research focuses. Using an optimization approach of the graph theory, an improved multi-scale image segmentation method is proposed. In this method, the image is applied with a coherent enhancement anisotropic diffusion filter followed by a minimum spanning tree segmentation approach, and the resulting segments are merged with reference to a minimum heterogeneity criterion.The heterogeneity criterion is defined as a function of the spectral characteristics and shape parameters of segments. The purpose of the merging step is to realize the multi-scale image segmentation. Tested on two images, the proposed method was visually and quantitatively compared with the segmentation method employed in the eCognition software. The results show that the proposed method is effective and outperforms the latter on areas with subtle spectral differences.

  8. Pricing Liquidity Risk with Heterogeneous Investment Horizons

    NARCIS (Netherlands)

    Beber, Alessandro; Driessen, Joost; Neuberger, A.; Tuijp, P

    We develop an asset pricing model with stochastic transaction costs and investors with heterogeneous horizons. Depending on their horizon, investors hold different sets of assets in equilibrium. This generates segmentation and spillover effects for expected returns, where the liquidity (risk)

  9. A decision-theoretic approach for segmental classification

    OpenAIRE

    Yau, Christopher; Holmes, Christopher C.

    2013-01-01

    This paper is concerned with statistical methods for the segmental classification of linear sequence data where the task is to segment and classify the data according to an underlying hidden discrete state sequence. Such analysis is commonplace in the empirical sciences including genomics, finance and speech processing. In particular, we are interested in answering the following question: given data $y$ and a statistical model $\\pi(x,y)$ of the hidden states $x$, what should we report as the ...

  10. Pricing Liquidity Risk with Heterogeneous Investment Horizons

    NARCIS (Netherlands)

    Beber, A.; Driessen, J.; Tuijp, P.F.A.

    2012-01-01

    We develop a new asset pricing model with stochastic transaction costs and investors with heterogenous horizons. Short-term investors hold only liquid assets in equilibrium. This generates segmentation effects in the pricing of liquid versus illiquid assets. Specifically, the liquidity (risk) premia

  11. Intra-tumor genetic heterogeneity and mortality in head and neck cancer: analysis of data from the Cancer Genome Atlas.

    Science.gov (United States)

    Mroz, Edmund A; Tward, Aaron D; Tward, Aaron M; Hammon, Rebecca J; Ren, Yin; Rocco, James W

    2015-02-01

    Although the involvement of intra-tumor genetic heterogeneity in tumor progression, treatment resistance, and metastasis is established, genetic heterogeneity is seldom examined in clinical trials or practice. Many studies of heterogeneity have had prespecified markers for tumor subpopulations, limiting their generalizability, or have involved massive efforts such as separate analysis of hundreds of individual cells, limiting their clinical use. We recently developed a general measure of intra-tumor genetic heterogeneity based on whole-exome sequencing (WES) of bulk tumor DNA, called mutant-allele tumor heterogeneity (MATH). Here, we examine data collected as part of a large, multi-institutional study to validate this measure and determine whether intra-tumor heterogeneity is itself related to mortality. Clinical and WES data were obtained from The Cancer Genome Atlas in October 2013 for 305 patients with head and neck squamous cell carcinoma (HNSCC), from 14 institutions. Initial pathologic diagnoses were between 1992 and 2011 (median, 2008). Median time to death for 131 deceased patients was 14 mo; median follow-up of living patients was 22 mo. Tumor MATH values were calculated from WES results. Despite the multiple head and neck tumor subsites and the variety of treatments, we found in this retrospective analysis a substantial relation of high MATH values to decreased overall survival (Cox proportional hazards analysis: hazard ratio for high/low heterogeneity, 2.2; 95% CI 1.4 to 3.3). This relation of intra-tumor heterogeneity to survival was not due to intra-tumor heterogeneity's associations with other clinical or molecular characteristics, including age, human papillomavirus status, tumor grade and TP53 mutation, and N classification. MATH improved prognostication over that provided by traditional clinical and molecular characteristics, maintained a significant relation to survival in multivariate analyses, and distinguished outcomes among patients having

  12. Comparative genome analysis of VSP-II and SNPs reveals heterogenic variation in contemporary strains of Vibrio cholerae O1 isolated from cholera patients in Kolkata, India.

    Science.gov (United States)

    Imamura, Daisuke; Morita, Masatomo; Sekizuka, Tsuyoshi; Mizuno, Tamaki; Takemura, Taichiro; Yamashiro, Tetsu; Chowdhury, Goutam; Pazhani, Gururaja P; Mukhopadhyay, Asish K; Ramamurthy, Thandavarayan; Miyoshi, Shin-Ichi; Kuroda, Makoto; Shinoda, Sumio; Ohnishi, Makoto

    2017-02-01

    Cholera is an acute diarrheal disease and a major public health problem in many developing countries in Asia, Africa, and Latin America. Since the Bay of Bengal is considered the epicenter for the seventh cholera pandemic, it is important to understand the genetic dynamism of Vibrio cholerae from Kolkata, as a representative of the Bengal region. We analyzed whole genome sequence data of V. cholerae O1 isolated from cholera patients in Kolkata, India, from 2007 to 2014 and identified the heterogeneous genomic region in these strains. In addition, we carried out a phylogenetic analysis based on the whole genome single nucleotide polymorphisms to determine the genetic lineage of strains in Kolkata. This analysis revealed the heterogeneity of the Vibrio seventh pandemic island (VSP)-II in Kolkata strains. The ctxB genotype was also heterogeneous and was highly related to VSP-II types. In addition, phylogenetic analysis revealed the shifts in predominant strains in Kolkata. Two distinct lineages, 1 and 2, were found between 2007 and 2010. However, the proportion changed markedly in 2010 and lineage 2 strains were predominant thereafter. Lineage 2 can be divided into four sublineages, I, II, III and IV. The results of this study indicate that lineages 1 and 2-I were concurrently prevalent between 2007 and 2009, and lineage 2-III observed in 2010, followed by the predominance of lineage 2-IV in 2011 and continued until 2014. Our findings demonstrate that the epidemic of cholera in Kolkata was caused by several distinct strains that have been constantly changing within the genetic lineages of V. cholerae O1 in recent years.

  13. GENOMIC FEATURES OF COTESIA PLUTELLAE POLYDNAVIRUS

    Institute of Scientific and Technical Information of China (English)

    LIUCai-ling; ZHUXiang-xiong; FuWen-jun; ZHAOMu-jun

    2003-01-01

    Polydnavirus was purified from the calyx fluid of Cotesia plutellae ovary. The genomic features of C. plutellae polydnavirus (CpPDV) were investigated. The viral genome consists of at least 12 different segments and the aggregate genome size is a lower estimate of 80kbp. By partial digestion of CpPDV DNA with BamHI and subsequent ligation with BamHI-cut plasmid Bluescript, a representative library of CpPDV genome was obtained.

  14. Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

    Science.gov (United States)

    Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

    2012-12-01

    In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  15. A conserved segmental duplication within ELA.

    Science.gov (United States)

    Brinkmeyer-Langford, C L; Murphy, W J; Childers, C P; Skow, L C

    2010-12-01

    The assembled genomic sequence of the horse major histocompatibility complex (MHC) (equine lymphocyte antigen, ELA) is very similar to the homologous human HLA, with the notable exception of a large segmental duplication at the boundary of ELA class I and class III that is absent in HLA. The segmental duplication consists of a ∼ 710 kb region of at least 11 repeated blocks: 10 blocks each contain an MHC class I-like sequence and the helicase domain portion of a BAT1-like sequence, and the remaining unit contains the full-length BAT1 gene. Similar genomic features were found in other Perissodactyls, indicating an ancient origin, which is consistent with phylogenetic analyses. Reverse-transcriptase PCR (RT-PCR) of mRNA from peripheral white blood cells of healthy and chronically or acutely infected horses detected transcription from predicted open reading frames in several of the duplicated blocks. This duplication is not present in the sequenced MHCs of most other mammals, although a similar feature at the same relative position is present in the feline MHC (FLA). Striking sequence conservation throughout Perissodactyl evolution is consistent with a functional role for at least some of the genes included within this segmental duplication. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.

  16. Assessment of the relationship between morphological emphysema phenotype and corresponding pulmonary perfusion pattern on a segmental level

    International Nuclear Information System (INIS)

    Bryant, Mark; Kauczor, Hans-Ulrich; Ley, Sebastian; Eberhardt, Ralf; Herth, Felix; Menezes, Ravi; Sedlaczek, Oliver; Ley-Zaporozhan, Julia

    2015-01-01

    Distinct morphological emphysema phenotypes were assessed by CT to show characteristic perfusion defect patterns. Forty-one patients with severe emphysema (GOLD III/IV) underwent three-dimensional high resolution computed tomography (3D-HRCT) and contrast-enhanced magnetic resonance (MR) perfusion. 3D-HRCT data was visually analyzed for emphysema phenotyping and quantification by consensus of three experts in chest-radiology. The predominant phenotype per segment was categorized as normal, centrilobular, panlobular or paraseptal. Segmental lung perfusion was visually analyzed using six patterns of pulmonary perfusion (1-normal; 2-mild homogeneous reduction in perfusion; 3-heterogeneous perfusion without focal defects; 4-heterogeneous perfusion with focal defects; 5-heterogeneous absence of perfusion; 6-homogeneous absence of perfusion), with the extent of the defect given as a percentage. 730 segments were evaluated. CT categorized 566 (78 %) as centrilobular, 159 (22 %) as panlobular and 5 (<1 %) as paraseptal with no normals. Scores with regards to MR perfusion patterns were: 1-0; 2-0; 3-28 (4 %); 4-425 (58 %); 5-169 (23 %); 6-108 (15 %). The predominant perfusion pattern matched as follows: 70 % centrilobular emphysema - heterogeneous perfusion with focal defects (score 4); 42 % panlobular - homogeneous absence of perfusion (score 5); and 43 % panlobular - heterogeneous absence of perfusion (score 6). MR pulmonary perfusion patterns correlate with the CT phenotype at a segmental level in patients with severe emphysema. (orig.)

  17. Assessment of the relationship between morphological emphysema phenotype and corresponding pulmonary perfusion pattern on a segmental level

    Energy Technology Data Exchange (ETDEWEB)

    Bryant, Mark; Kauczor, Hans-Ulrich [University of Heidelberg, Department of Diagnostic and Interventional Radiology, Heidelberg (Germany); Member of German Lung Research Center DZL, Translational Lung Research Center TLRC-H, Heidelberg (Germany); Ley, Sebastian [Chirurgische Klinik Dr. Rinecker, Department of Diagnostic and Interventional Radiology, Munich (Germany); Ludwig Maximilians University, Department of Clinical Radiology, Munich (Germany); Eberhardt, Ralf; Herth, Felix [Thoraxklinik University of Heidelberg, Department of Pneumology and Critical Care Medicine, Heidelberg (Germany); Member of German Lung Research Center DZL, Translational Lung Research Center TLRC-H, Heidelberg (Germany); Menezes, Ravi [University of Toronto, Medical Imaging, Toronto (Canada); Sedlaczek, Oliver [University of Heidelberg, Department of Diagnostic and Interventional Radiology, Heidelberg (Germany); German Cancer Research Center, Department of Radiology, Heidelberg (Germany); Member of German Lung Research Center DZL, Translational Lung Research Center TLRC-H, Heidelberg (Germany); Ley-Zaporozhan, Julia [University of Heidelberg, Department of Diagnostic and Interventional Radiology, Heidelberg (Germany); Ludwig Maximilians University, Department of Clinical Radiology, Munich (Germany)

    2015-01-15

    Distinct morphological emphysema phenotypes were assessed by CT to show characteristic perfusion defect patterns. Forty-one patients with severe emphysema (GOLD III/IV) underwent three-dimensional high resolution computed tomography (3D-HRCT) and contrast-enhanced magnetic resonance (MR) perfusion. 3D-HRCT data was visually analyzed for emphysema phenotyping and quantification by consensus of three experts in chest-radiology. The predominant phenotype per segment was categorized as normal, centrilobular, panlobular or paraseptal. Segmental lung perfusion was visually analyzed using six patterns of pulmonary perfusion (1-normal; 2-mild homogeneous reduction in perfusion; 3-heterogeneous perfusion without focal defects; 4-heterogeneous perfusion with focal defects; 5-heterogeneous absence of perfusion; 6-homogeneous absence of perfusion), with the extent of the defect given as a percentage. 730 segments were evaluated. CT categorized 566 (78 %) as centrilobular, 159 (22 %) as panlobular and 5 (<1 %) as paraseptal with no normals. Scores with regards to MR perfusion patterns were: 1-0; 2-0; 3-28 (4 %); 4-425 (58 %); 5-169 (23 %); 6-108 (15 %). The predominant perfusion pattern matched as follows: 70 % centrilobular emphysema - heterogeneous perfusion with focal defects (score 4); 42 % panlobular - homogeneous absence of perfusion (score 5); and 43 % panlobular - heterogeneous absence of perfusion (score 6). MR pulmonary perfusion patterns correlate with the CT phenotype at a segmental level in patients with severe emphysema. (orig.)

  18. An Inhibitory Motif on the 5’UTR of Several Rotavirus Genome Segments Affects Protein Expression and Reverse Genetics Strategies

    Science.gov (United States)

    Papa, Guido; Eichwald, Catherine; Burrone, Oscar R.

    2016-01-01

    Rotavirus genome consists of eleven segments of dsRNA, each encoding one single protein. Viral mRNAs contain an open reading frame (ORF) flanked by relatively short untranslated regions (UTRs), whose role in the viral cycle remains elusive. Here we investigated the role of 5’UTRs in T7 polymerase-driven cDNAs expression in uninfected cells. The 5’UTRs of eight genome segments (gs3, gs5-6, gs7-11) of the simian SA11 strain showed a strong inhibitory effect on the expression of viral proteins. Decreased protein expression was due to both compromised transcription and translation and was independent of the ORF and the 3’UTR sequences. Analysis of several mutants of the 21-nucleotide long 5’UTR of gs 11 defined an inhibitory motif (IM) represented by its primary sequence rather than its secondary structure. IM was mapped to the 5’ terminal 6-nucleotide long pyrimidine-rich tract 5’-GGY(U/A)UY-3’. The 5’ terminal position within the mRNA was shown to be essentially required, as inhibitory activity was lost when IM was moved to an internal position. We identified two mutations (insertion of a G upstream the 5’UTR and the U to A mutation of the fifth nucleotide of IM) that render IM non-functional and increase the transcription and translation rate to levels that could considerably improve the efficiency of virus helper-free reverse genetics strategies. PMID:27846320

  19. Flexibility and symmetry of prokaryotic genome rearrangement reveal lineage-associated core-gene-defined genome organizational frameworks.

    Science.gov (United States)

    Kang, Yu; Gu, Chaohao; Yuan, Lina; Wang, Yue; Zhu, Yanmin; Li, Xinna; Luo, Qibin; Xiao, Jingfa; Jiang, Daquan; Qian, Minping; Ahmed Khan, Aftab; Chen, Fei; Zhang, Zhang; Yu, Jun

    2014-11-25

    The prokaryotic pangenome partitions genes into core and dispensable genes. The order of core genes, albeit assumed to be stable under selection in general, is frequently interrupted by horizontal gene transfer and rearrangement, but how a core-gene-defined genome maintains its stability or flexibility remains to be investigated. Based on data from 30 species, including 425 genomes from six phyla, we grouped core genes into syntenic blocks in the context of a pangenome according to their stability across multiple isolates. A subset of the core genes, often species specific and lineage associated, formed a core-gene-defined genome organizational framework (cGOF). Such cGOFs are either single segmental (one-third of the species analyzed) or multisegmental (the rest). Multisegment cGOFs were further classified into symmetric or asymmetric according to segment orientations toward the origin-terminus axis. The cGOFs in Gram-positive species are exclusively symmetric and often reversible in orientation, as opposed to those of the Gram-negative bacteria, which are all asymmetric and irreversible. Meanwhile, all species showing strong strand-biased gene distribution contain symmetric cGOFs and often specific DnaE (α subunit of DNA polymerase III) isoforms. Furthermore, functional evaluations revealed that cGOF genes are hub associated with regard to cellular activities, and the stability of cGOF provides efficient indexes for scaffold orientation as demonstrated by assembling virtual and empirical genome drafts. cGOFs show species specificity, and the symmetry of multisegmental cGOFs is conserved among taxa and constrained by DNA polymerase-centric strand-biased gene distribution. The definition of species-specific cGOFs provides powerful guidance for genome assembly and other structure-based analysis. Prokaryotic genomes are frequently interrupted by horizontal gene transfer (HGT) and rearrangement. To know whether there is a set of genes not only conserved in position

  20. Advances in the translational genomics of neuroblastoma

    Science.gov (United States)

    Bosse, Kristopher R.; Maris, John M.

    2015-01-01

    Neuroblastoma is an embryonal malignancy that commonly affects young children and is remarkably heterogenous in its malignant potential. Recently, the genetic basis of neuroblastoma has come into focus, which has catalyzed not only a more comprehensive understanding of neuroblastoma tumorigenesis, but has also revealed novel oncogenic vulnerabilities that are being leveraged therapeutically. Neuroblastoma is a model pediatric solid tumor in its use of recurrent genomic alterations, such as high-level MYCN amplification, for risk stratification. Given the relative paucity of recurrent activating somatic point mutations or gene fusions in primary neuroblastoma tumors studied at initial diagnosis, innovative treatment approaches beyond small molecules targeting mutated or dysregulated kinases will be required moving forward to achieve noticeable improvements in overall patient survival. However, the clonally acquired, oncogenic aberrations in relapsed neuroblastomas are currently being defined and may offer an opportunity to improve patient outcomes with molecularly targeted therapy directed towards aberrantly regulated pathways in relapsed disease. This review will summarize the current state of knowledge of neuroblastoma genetics and genomics, highlighting the improved prognostication and potential therapeutic opportunities that have arisen from recent advances in understanding germline predisposition, recurrent segmental chromosomal alterations, somatic point mutations and translocations, and clonal evolution in relapsed neuroblastoma. PMID:26539795

  1. Identifying uniformly mutated segments within repeats.

    Science.gov (United States)

    Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

    2004-12-01

    Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of a n-coin model for given S, subject a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l4logl), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).

  2. Comparative genomics reveals insights into avian genome evolution and adaptation

    Science.gov (United States)

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  3. PREPAID TELECOM CUSTOMERS SEGMENTATION USING THE K-MEAN ALGORITHM

    Directory of Open Access Journals (Sweden)

    Marar Liviu Ioan

    2012-07-01

    Full Text Available The scope of relationship marketing is to retain customers and win their loyalty. This can be achieved if the companies’ products and services are developed and sold considering customers’ demands. Fulfilling customers’ demands, taken as the starting point of relationship marketing, can be obtained by acknowledging that the customers’ needs and wishes are heterogeneous. The segmentation of the customers’ base allows operators to overcome this because it illustrates the whole heterogeneous market as the sum of smaller homogeneous markets. The concept of segmentation relies on the high probability of persons grouped into segments based on common demands and behaviours to have a similar response to marketing strategies. This article focuses on the segmentation of a telecom customer base according to specific and noticeable criteria of a certain service. Although the segmentation concept is widely approached in professional literature, articles on the segmentation of a telecom customer base are very scarce, due to the strategic nature of this information. Market segmentation is carried out based on how customers spent their money on credit recharging, on making calls, on sending SMS and on Internet navigation. The method used for customer segmentation is the K-mean cluster analysis. To assess the internal cohesion of the clusters we employed the average sum of squares error indicator, and to determine the differences among the clusters we used the ANOVA and the post-hoc Tukey tests. The analyses revealed seven customer segments with different features and behaviours. The results enable the telecom company to conceive marketing strategies and planning which lead to better understanding of its customers’ needs and ultimately to a more efficient relationship with the subscribers and enhanced customer satisfaction. At the same time, the results enable the description and characterization of expenditure patterns

  4. Genome instability in Lactobacillus rhamnosus GG

    NARCIS (Netherlands)

    Sybesma, W.; Molenaar, D.; IJcken, W. van; Venema, K.; Korta, R.

    2013-01-01

    We describe here a comparative genome analysis of three dairy product isolates of Lactobacillus rhamnosus GG (LGG) and the ATCC 53103 reference strain to the published genome sequence of L. rhamnosus GG. The analysis showed that in two of three isolates, major DNA segments were missing from the

  5. Genome-Wide Association Analysis Reveals Genetic Heterogeneity of Sjögren's Syndrome According to Ancestry

    DEFF Research Database (Denmark)

    Taylor, Kimberly E; Wong, Quenna; Levine, David M

    2017-01-01

    common protocol-directed methods. The aim of this study was to examine the genetic etiology of Sjögren's syndrome (SS) across ancestry and disease subsets. METHODS: We performed genome-wide association study analyses using SICCA subjects and external controls obtained from dbGaP data sets, one using all......OBJECTIVE: The Sjögren's International Collaborative Clinical Alliance (SICCA) is an international data registry and biorepository derived from a multisite observational study of participants in whom genotyping was performed on the Omni2.5M platform and who had undergone deep phenotyping using...... subphenotype distributions differ by ethnicity, and whether this contributes to the heterogeneity of genetic associations. RESULTS: We observed significant associations in established regions of the major histocompatibility complex (MHC), IRF5, and STAT4 (P = 3 × 10(-42) , P = 3 × 10(-14) , and P = 9 × 10...

  6. The structures of bovine herpesvirus 1 virion and concatemeric DNA: implications for cleavage and packaging of herpesvirus genomes

    International Nuclear Information System (INIS)

    Schynts, Frederic; McVoy, Michael A.; Meurens, Francois; Detry, Bruno; Epstein, Alberto L.; Thiry, Etienne

    2003-01-01

    Herpesvirus genomes are often characterized by the presence of direct and inverted repeats that delineate their grouping into six structural classes. Class D genomes consist of a long (L) segment and a short (S) segment. The latter is flanked by large inverted repeats. DNA replication produces concatemers of head-to-tail linked genomes that are cleaved into unit genomes during the process of packaging DNA into capsids. Packaged class D genomes are an equimolar mixture of two isomers in which S is in either of two orientations, presumably a consequence of homologous recombination between the inverted repeats. The L segment remains predominantly fixed in a prototype (P) orientation; however, low levels of genomes having inverted L (I L ) segments have been reported for some class D herpesviruses. Inefficient formation of class D I L genomes has been attributed to infrequent L segment inversion, but recent detection of frequent inverted L segments in equine herpesvirus 1 concatemers [Virology 229 (1997) 415-420] suggests that the defect may be at the level of cleavage and packaging rather than inversion. In this study, the structures of virion and concatemeric DNA of another class D herpesvirus, bovine herpesvirus 1, were determined. Virion DNA contained low levels of I L genomes, whereas concatemeric DNA contained significant amounts of L segments in both P and I L orientations. However, concatemeric termini exhibited a preponderance of L termini derived from P isomers which was comparable to the preponderance of P genomes found in virion DNA. Thus, the defect in formation of I L genomes appears to lie at the level of concatemer cleavage. These results have important implications for the mechanisms by which herpesvirus DNA cleavage and packaging occur

  7. Signals of historical interlocus gene conversion in human segmental duplications.

    Directory of Open Access Journals (Sweden)

    Beth L Dumont

    Full Text Available Standard methods of DNA sequence analysis assume that sequences evolve independently, yet this assumption may not be appropriate for segmental duplications that exchange variants via interlocus gene conversion (IGC. Here, we use high quality multiple sequence alignments from well-annotated segmental duplications to systematically identify IGC signals in the human reference genome. Our analysis combines two complementary methods: (i a paralog quartet method that uses DNA sequence simulations to identify a statistical excess of sites consistent with inter-paralog exchange, and (ii the alignment-based method implemented in the GENECONV program. One-quarter (25.4% of the paralog families in our analysis harbor clear IGC signals by the quartet approach. Using GENECONV, we identify 1477 gene conversion tracks that cumulatively span 1.54 Mb of the genome. Our analyses confirm the previously reported high rates of IGC in subtelomeric regions and Y-chromosome palindromes, and identify multiple novel IGC hotspots, including the pregnancy specific glycoproteins and the neuroblastoma breakpoint gene families. Although the duplication history of a paralog family is described by a single tree, we show that IGC has introduced incredible site-to-site variation in the evolutionary relationships among paralogs in the human genome. Our findings indicate that IGC has left significant footprints in patterns of sequence diversity across segmental duplications in the human genome, out-pacing the contributions of single base mutation by orders of magnitude. Collectively, the IGC signals we report comprise a catalog that will provide a critical reference for interpreting observed patterns of DNA sequence variation across duplicated genomic regions, including targets of recent adaptive evolution in humans.

  8. Normalization in Unsupervised Segmentation Parameter Optimization: A Solution Based on Local Regression Trend Analysis

    Directory of Open Access Journals (Sweden)

    Stefanos Georganos

    2018-02-01

    Full Text Available In object-based image analysis (OBIA, the appropriate parametrization of segmentation algorithms is crucial for obtaining satisfactory image classification results. One of the ways this can be done is by unsupervised segmentation parameter optimization (USPO. A popular USPO method does this through the optimization of a “global score” (GS, which minimizes intrasegment heterogeneity and maximizes intersegment heterogeneity. However, the calculated GS values are sensitive to the minimum and maximum ranges of the candidate segmentations. Previous research proposed the use of fixed minimum/maximum threshold values for the intrasegment/intersegment heterogeneity measures to deal with the sensitivity of user-defined ranges, but the performance of this approach has not been investigated in detail. In the context of a remote sensing very-high-resolution urban application, we show the limitations of the fixed threshold approach, both in a theoretical and applied manner, and instead propose a novel solution to identify the range of candidate segmentations using local regression trend analysis. We found that the proposed approach showed significant improvements over the use of fixed minimum/maximum values, is less subjective than user-defined threshold values and, thus, can be of merit for a fully automated procedure and big data applications.

  9. Tumor Heterogeneity and Drug Resistance

    International Nuclear Information System (INIS)

    Kucerova, L.; Skolekova, S.; Kozovska, Z.

    2015-01-01

    New generation of sequencing methodologies revealed unexpected complexity and genomic alterations linked with the tumor subtypes. This diversity exists across the tumor types, histologic tumor subtypes and subsets of the tumor cells within the same tumor. This phenomenon is termed tumor heterogeneity. Regardless of its origin and mechanisms of development it has a major impact in the clinical setting. Genetic, phenotypic and expression pattern diversity of tumors plays critical role in the selection of suitable treatment and also in the prognosis prediction. Intratumoral heterogeneity plays a key role in the intrinsic and acquired chemoresistance to cytotoxic and targeted therapies. In this review we focus on the mechanisms of intratumoral and inter tumoral heterogeneity and their relationship to the drug resistance. Understanding of the mechanisms and spatiotemporal dynamics of tumor heterogeneity development before and during the therapy is important for the ability to design individual treatment protocols suitable in the given molecular context. (author)

  10. Intratumor and Intertumor Heterogeneity in Melanoma

    Directory of Open Access Journals (Sweden)

    Tomasz M. Grzywa

    2017-12-01

    Full Text Available Melanoma is a cancer that exhibits one of the most aggressive and heterogeneous features. The incidence rate escalates. A high number of clones harboring various mutations contribute to an exceptional level of intratumor heterogeneity of melanoma. It also refers to metastases which may originate from different subclones of primary lesion. Such component of the neoplasm biology is termed intertumor and intratumor heterogeneity. These levels of tumor heterogeneity hinder accurate diagnosis and effective treatment. The increasing number of research on the topic reflects the need for understanding limitation or failure of contemporary therapies. Majority of analyses concentrate on mutations in cancer-related genes. Novel high-throughput techniques reveal even higher degree of variations within a lesion. Consolidation of theories and researches indicates new routes for treatment options such as targets for immunotherapy. The demand for personalized approach in melanoma treatment requires extensive knowledge on intratumor and intertumor heterogeneity on the level of genome, transcriptome/proteome, and epigenome. Thus, achievements in exploration of melanoma variety are described in details. Particularly, the issue of tumor heterogeneity or homogeneity given BRAF mutations is discussed.

  11. Validating PET segmentation of thoracic lesions-is 4D PET necessary?

    DEFF Research Database (Denmark)

    Nielsen, M. S.; Carl, J.

    2017-01-01

    Respiratory-induced motions are prone to degrade the positron emission tomography (PET) signal with the consequent loss of image information and unreliable segmentations. This phantom study aims to assess the discrepancies relative to stationary PET segmentations, of widely used semiautomatic PET...... segmentation methods on heterogeneous target lesions influenced by motion during image acquisition. Three target lesions included dual F-18 Fluoro-deoxy-glucose (FDG) tracer concentrations as high-and low tracer activities relative to the background. Four different tracer concentration arrangements were...... segmented using three SUV threshold methods (Max40%, SUV40% and 2.5SUV) and a gradient based method (GradientSeg). Segmentations in static 3D-PET scans (PETsta) specified the reference conditions for the individual segmentation methods, target lesions and tracer concentrations. The motion included PET...

  12. Informational segmentation in international capital markets

    OpenAIRE

    Wahl, Jack E.

    1988-01-01

    The economic influence of barriers to international information acquisition and, hence, of informational segmentation in international capital markets depends heavily upon the prevailing level of risk aversion. We find that these barriers are likely to have second order economic impact only. Furthermore, improving international informational integration is likely to Increase all asset prices when causing less heterogeneity of international subjective probability beliefs.

  13. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes.

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-09-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5' truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3' end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families.

  14. Delineating slowly and rapidly evolving fractions of the Drosophila genome.

    Science.gov (United States)

    Keith, Jonathan M; Adams, Peter; Stephen, Stuart; Mattick, John S

    2008-05-01

    Evolutionary conservation is an important indicator of function and a major component of bioinformatic methods to identify non-protein-coding genes. We present a new Bayesian method for segmenting pairwise alignments of eukaryotic genomes while simultaneously classifying segments into slowly and rapidly evolving fractions. We also describe an information criterion similar to the Akaike Information Criterion (AIC) for determining the number of classes. Working with pairwise alignments enables detection of differences in conservation patterns among closely related species. We analyzed three whole-genome and three partial-genome pairwise alignments among eight Drosophila species. Three distinct classes of conservation level were detected. Sequences comprising the most slowly evolving component were consistent across a range of species pairs, and constituted approximately 62-66% of the D. melanogaster genome. Almost all (>90%) of the aligned protein-coding sequence is in this fraction, suggesting much of it (comprising the majority of the Drosophila genome, including approximately 56% of non-protein-coding sequences) is functional. The size and content of the most rapidly evolving component was species dependent, and varied from 1.6% to 4.8%. This fraction is also enriched for protein-coding sequence (while containing significant amounts of non-protein-coding sequence), suggesting it is under positive selection. We also classified segments according to conservation and GC content simultaneously. This analysis identified numerous sub-classes of those identified on the basis of conservation alone, but was nevertheless consistent with that classification. Software, data, and results available at www.maths.qut.edu.au/-keithj/. Genomic segments comprising the conservation classes available in BED format.

  15. Segmental allotetraploidy and allelic interactions in buffelgrass (Pennisetum ciliare (L.) Link syn. Cenchrus ciliaris L.) as revealed by genome mapping.

    Science.gov (United States)

    Jessup, R W; Burson, B L; Burow, O; Wang, Y W; Chang, C; Li, Z; Paterson, A H; Hussey, M A

    2003-04-01

    Linkage analyses increasingly complement cytological and traditional plant breeding techniques by providing valuable information regarding genome organization and transmission genetics of complex polyploid species. This study reports a genome map of buffelgrass (Pennisetum ciliare (L.) Link syn. Cenchrus ciliaris L.). Maternal and paternal maps were constructed with restriction fragment length polymorphisms (RFLPs) segregating in 87 F1 progeny from an intraspecific cross between two heterozygous genotypes. A survey of 862 heterologous cDNAs and gDNAs from across the Poaceae, as well as 443 buffelgrass cDNAs, yielded 100 and 360 polymorphic probes, respectively. The maternal map included 322 RFLPs, 47 linkage groups, and 3464 cM, whereas the paternal map contained 245 RFLPs, 42 linkage groups, and 2757 cM. Approximately 70 to 80% of the buffelgrass genome was covered, and the average marker spacing was 10.8 and 11.3 cM on the respective maps. Preferential pairing was indicated between many linkage groups, which supports cytological reports that buffelgrass is a segmental allotetraploid. More preferential pairing (disomy) was found in the maternal than paternal parent across linkage groups (55 vs. 38%) and loci (48 vs. 15%). Comparison of interval lengths in 15 allelic bridges indicated significantly less meiotic recombination in paternal gametes. Allelic interactions were detected in four regions of the maternal map and were absent in the paternal map.

  16. Family genome browser: visualizing genomes with pedigree information.

    Science.gov (United States)

    Juan, Liran; Liu, Yongzhuang; Wang, Yongtian; Teng, Mingxiang; Zang, Tianyi; Wang, Yadong

    2015-07-15

    Families with inherited diseases are widely used in Mendelian/complex disease studies. Owing to the advances in high-throughput sequencing technologies, family genome sequencing becomes more and more prevalent. Visualizing family genomes can greatly facilitate human genetics studies and personalized medicine. However, due to the complex genetic relationships and high similarities among genomes of consanguineous family members, family genomes are difficult to be visualized in traditional genome visualization framework. How to visualize the family genome variants and their functions with integrated pedigree information remains a critical challenge. We developed the Family Genome Browser (FGB) to provide comprehensive analysis and visualization for family genomes. The FGB can visualize family genomes in both individual level and variant level effectively, through integrating genome data with pedigree information. Family genome analysis, including determination of parental origin of the variants, detection of de novo mutations, identification of potential recombination events and identical-by-decent segments, etc., can be performed flexibly. Diverse annotations for the family genome variants, such as dbSNP memberships, linkage disequilibriums, genes, variant effects, potential phenotypes, etc., are illustrated as well. Moreover, the FGB can automatically search de novo mutations and compound heterozygous variants for a selected individual, and guide investigators to find high-risk genes with flexible navigation options. These features enable users to investigate and understand family genomes intuitively and systematically. The FGB is available at http://mlg.hit.edu.cn/FGB/. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. Assessment of heterogeneity between European Populations: a Baltic and Danish replication case-control study of SNPs from a recent European ulcerative colitis genome wide association study

    DEFF Research Database (Denmark)

    Andersen, Vibeke; Ernst, Anja; Sventoraityte, Jurgita

    2011-01-01

    the combined Baltic, Danish, and Norwegian panel versus the combined German, British, Belgian, and Greek panel (rs7520292 (P = 0.001), rs12518307 (P = 0.007), and rs2395609 (TCP11) (P = 0.01), respectively). No SNP reached genome-wide significance in the combined analyses of all the panels. Conclusions......Background: Differences in the genetic architecture of inflammatory bowel disease between different European countries and ethnicities have previously been reported. In the present study, we wanted to assess the role of 11 newly identified UC risk variants, derived from a recent European UC genome...... wide association study (GWAS) (Franke et al., 2010), for 1) association with UC in the Nordic countries, 2) for population heterogeneity between the Nordic countries and the rest of Europe, and, 3) eventually, to drive some of the previous findings towards overall genome-wide significance. Methods...

  18. DEMARCATE: Density-based magnetic resonance image clustering for assessing tumor heterogeneity in cancer.

    Science.gov (United States)

    Saha, Abhijoy; Banerjee, Sayantan; Kurtek, Sebastian; Narang, Shivali; Lee, Joonsang; Rao, Ganesh; Martinez, Juan; Bharath, Karthik; Rao, Arvind U K; Baladandayuthapani, Veerabhadran

    2016-01-01

    Tumor heterogeneity is a crucial area of cancer research wherein inter- and intra-tumor differences are investigated to assess and monitor disease development and progression, especially in cancer. The proliferation of imaging and linked genomic data has enabled us to evaluate tumor heterogeneity on multiple levels. In this work, we examine magnetic resonance imaging (MRI) in patients with brain cancer to assess image-based tumor heterogeneity. Standard approaches to this problem use scalar summary measures (e.g., intensity-based histogram statistics) that do not adequately capture the complete and finer scale information in the voxel-level data. In this paper, we introduce a novel technique, DEMARCATE (DEnsity-based MAgnetic Resonance image Clustering for Assessing Tumor hEterogeneity) to explore the entire tumor heterogeneity density profiles (THDPs) obtained from the full tumor voxel space. THDPs are smoothed representations of the probability density function of the tumor images. We develop tools for analyzing such objects under the Fisher-Rao Riemannian framework that allows us to construct metrics for THDP comparisons across patients, which can be used in conjunction with standard clustering approaches. Our analyses of The Cancer Genome Atlas (TCGA) based Glioblastoma dataset reveal two significant clusters of patients with marked differences in tumor morphology, genomic characteristics and prognostic clinical outcomes. In addition, we see enrichment of image-based clusters with known molecular subtypes of glioblastoma multiforme, which further validates our representation of tumor heterogeneity and subsequent clustering techniques.

  19. A weighted U statistic for association analyses considering genetic heterogeneity.

    Science.gov (United States)

    Wei, Changshuai; Elston, Robert C; Lu, Qing

    2016-07-20

    Converging evidence suggests that common complex diseases with the same or similar clinical manifestations could have different underlying genetic etiologies. While current research interests have shifted toward uncovering rare variants and structural variations predisposing to human diseases, the impact of heterogeneity in genetic studies of complex diseases has been largely overlooked. Most of the existing statistical methods assume the disease under investigation has a homogeneous genetic effect and could, therefore, have low power if the disease undergoes heterogeneous pathophysiological and etiological processes. In this paper, we propose a heterogeneity-weighted U (HWU) method for association analyses considering genetic heterogeneity. HWU can be applied to various types of phenotypes (e.g., binary and continuous) and is computationally efficient for high-dimensional genetic data. Through simulations, we showed the advantage of HWU when the underlying genetic etiology of a disease was heterogeneous, as well as the robustness of HWU against different model assumptions (e.g., phenotype distributions). Using HWU, we conducted a genome-wide analysis of nicotine dependence from the Study of Addiction: Genetics and Environments dataset. The genome-wide analysis of nearly one million genetic markers took 7h, identifying heterogeneous effects of two new genes (i.e., CYP3A5 and IKBKB) on nicotine dependence. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  20. Heterogeneous chromatin target model

    International Nuclear Information System (INIS)

    Watanabe, Makoto

    1996-01-01

    The higher order structure of the entangled chromatin fibers in a chromosome plays a key role in molecular control mechanism involved in chromosome mutation due to ionizing radiations or chemical mutagens. The condensed superstructure of chromatin is not so rigid and regular as has been postulated in general. We have proposed a rheological explanation for the flexible network system ('chromatin network') that consists of the fluctuating assembly of nucleosome clusters linked with supertwisting DNA in a chromatin fiber ('Supertwisting Particulate Model'). We have proposed a 'Heterosensitive Target Model' for cellular radiosensitivity that is a modification of 'Heterogeneous Target Model'. The heterogeneity of chromatin target is derived from the highly condensed organization of chromatin segments consist of unstable and fragile sites in the fluctuating assembly of nucleosome clusters, namely 'supranucleosomal particles' or 'superbeads'. The models have been principally supported by our electron microscopic experiments employing 'surface - spreading whole - mount technique' since 1967. However, some deformation and artifacts in the chromatin structure are inevitable with these electron microscopic procedures. On the contrary, the 'atomic force microscope (AFM)' can be operated in liquid as well as in the air. A living specimen can be examined without any preparative procedures. Micromanipulation of the isolated chromosome is also possible by the precise positional control of a cantilever on the nanometer scale. The living human chromosomes were submerged in a solution of culture medium and observed by AFM using a liquid immersion cell. The surface - spreading whole - mount technique was applicable for this observation. The particulate chromatin segments of nucleosome clusters were clearly observed within mitotic human chromosomes in a living hydrated condition. These findings support the heterogeneity of chromatin target in a living cell. (J.P.N.)

  1. Adaptive stress response in segmental progeria resembles long-lived dwarfism and calorie restriction in mice

    NARCIS (Netherlands)

    H.W.M. van de Ven (Marieke); J.-O. Andressoo (Jaan-Olle); V.B. Holcomb (Valerie); M.M. von Lindern (Marieke); W.M.C. Jong (Willeke); C.I. de Zeeuw (Chris); Y. Suh (Yousin); P. Hasty (Paul); J.H.J. Hoeijmakers (Jan); G.T.J. van der Horst (Gijsbertus); J.R. Mitchell (James)

    2006-01-01

    textabstractHow congenital defects causing genome instability can result in the pleiotropic symptoms reminiscent of aging but in a segmental and accelerated fashion remains largely unknown. Most segmental progerias are associated with accelerated fibroblast senescence, suggesting that cellular

  2. SAAS-CNV: A Joint Segmentation Approach on Aggregated and Allele Specific Signals for the Identification of Somatic Copy Number Alterations with Next-Generation Sequencing Data.

    Science.gov (United States)

    Zhang, Zhongyang; Hao, Ke

    2015-11-01

    Cancer genomes exhibit profound somatic copy number alterations (SCNAs). Studying tumor SCNAs using massively parallel sequencing provides unprecedented resolution and meanwhile gives rise to new challenges in data analysis, complicated by tumor aneuploidy and heterogeneity as well as normal cell contamination. While the majority of read depth based methods utilize total sequencing depth alone for SCNA inference, the allele specific signals are undervalued. We proposed a joint segmentation and inference approach using both signals to meet some of the challenges. Our method consists of four major steps: 1) extracting read depth supporting reference and alternative alleles at each SNP/Indel locus and comparing the total read depth and alternative allele proportion between tumor and matched normal sample; 2) performing joint segmentation on the two signal dimensions; 3) correcting the copy number baseline from which the SCNA state is determined; 4) calling SCNA state for each segment based on both signal dimensions. The method is applicable to whole exome/genome sequencing (WES/WGS) as well as SNP array data in a tumor-control study. We applied the method to a dataset containing no SCNAs to test the specificity, created by pairing sequencing replicates of a single HapMap sample as normal/tumor pairs, as well as a large-scale WGS dataset consisting of 88 liver tumors along with adjacent normal tissues. Compared with representative methods, our method demonstrated improved accuracy, scalability to large cancer studies, capability in handling both sequencing and SNP array data, and the potential to improve the estimation of tumor ploidy and purity.

  3. Intratumor and Intertumor Heterogeneity in Melanoma.

    Science.gov (United States)

    Grzywa, Tomasz M; Paskal, Wiktor; Włodarski, Paweł K

    2017-12-01

    Melanoma is a cancer that exhibits one of the most aggressive and heterogeneous features. The incidence rate escalates. A high number of clones harboring various mutations contribute to an exceptional level of intratumor heterogeneity of melanoma. It also refers to metastases which may originate from different subclones of primary lesion. Such component of the neoplasm biology is termed intertumor and intratumor heterogeneity. These levels of tumor heterogeneity hinder accurate diagnosis and effective treatment. The increasing number of research on the topic reflects the need for understanding limitation or failure of contemporary therapies. Majority of analyses concentrate on mutations in cancer-related genes. Novel high-throughput techniques reveal even higher degree of variations within a lesion. Consolidation of theories and researches indicates new routes for treatment options such as targets for immunotherapy. The demand for personalized approach in melanoma treatment requires extensive knowledge on intratumor and intertumor heterogeneity on the level of genome, transcriptome/proteome, and epigenome. Thus, achievements in exploration of melanoma variety are described in details. Particularly, the issue of tumor heterogeneity or homogeneity given BRAF mutations is discussed. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  4. Enhancing yeast transcription analysis through integration of heterogeneous data

    DEFF Research Database (Denmark)

    Grotkjær, Thomas; Nielsen, Jens

    2004-01-01

    of Saccharomyces cerevisiae whole genome transcription data. A special focus is on the quantitative aspects of normalisation and mathematical modelling approaches, since they are expected to play an increasing role in future DNA microarray analysis studies. Data analysis is exemplified with cluster analysis......DNA microarray technology enables the simultaneous measurement of the transcript level of thousands of genes. Primary analysis can be done with basic statistical tools and cluster analysis, but effective and in depth analysis of the vast amount of transcription data requires integration with data...... from several heterogeneous data Sources, such as upstream promoter sequences, genome-scale metabolic models, annotation databases and other experimental data. In this review, we discuss how experimental design, normalisation, heterogeneous data and mathematical modelling can enhance analysis...

  5. Imaging intratumor heterogeneity: role in therapy response, resistance, and clinical outcome.

    Science.gov (United States)

    O'Connor, James P B; Rose, Chris J; Waterton, John C; Carano, Richard A D; Parker, Geoff J M; Jackson, Alan

    2015-01-15

    Tumors exhibit genomic and phenotypic heterogeneity, which has prognostic significance and may influence response to therapy. Imaging can quantify the spatial variation in architecture and function of individual tumors through quantifying basic biophysical parameters such as CT density or MRI signal relaxation rate; through measurements of blood flow, hypoxia, metabolism, cell death, and other phenotypic features; and through mapping the spatial distribution of biochemical pathways and cell signaling networks using PET, MRI, and other emerging molecular imaging techniques. These methods can establish whether one tumor is more or less heterogeneous than another and can identify subregions with differing biology. In this article, we review the image analysis methods currently used to quantify spatial heterogeneity within tumors. We discuss how analysis of intratumor heterogeneity can provide benefit over more simple biomarkers such as tumor size and average function. We consider how imaging methods can be integrated with genomic and pathology data, instead of being developed in isolation. Finally, we identify the challenges that must be overcome before measurements of intratumoral heterogeneity can be used routinely to guide patient care. ©2014 American Association for Cancer Research.

  6. Extreme-Scale De Novo Genome Assembly

    Energy Technology Data Exchange (ETDEWEB)

    Georganas, Evangelos [Intel Corporation, Santa Clara, CA (United States); Hofmeyr, Steven [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Egan, Rob [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Buluc, Aydin [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Oliker, Leonid [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Rokhsar, Daniel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Yelick, Katherine [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.

    2017-09-26

    De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and the large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.

  7. Lung tumor segmentation in PET images using graph cuts.

    Science.gov (United States)

    Ballangan, Cherry; Wang, Xiuying; Fulham, Michael; Eberl, Stefan; Feng, David Dagan

    2013-03-01

    The aim of segmentation of tumor regions in positron emission tomography (PET) is to provide more accurate measurements of tumor size and extension into adjacent structures, than is possible with visual assessment alone and hence improve patient management decisions. We propose a segmentation energy function for the graph cuts technique to improve lung tumor segmentation with PET. Our segmentation energy is based on an analysis of the tumor voxels in PET images combined with a standardized uptake value (SUV) cost function and a monotonic downhill SUV feature. The monotonic downhill feature avoids segmentation leakage into surrounding tissues with similar or higher PET tracer uptake than the tumor and the SUV cost function improves the boundary definition and also addresses situations where the lung tumor is heterogeneous. We evaluated the method in 42 clinical PET volumes from patients with non-small cell lung cancer (NSCLC). Our method improves segmentation and performs better than region growing approaches, the watershed technique, fuzzy-c-means, region-based active contour and tumor customized downhill. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  8. Adaptive stress response in segmental progeria resembles long-lived dwarfism and calorie restriction in mice

    NARCIS (Netherlands)

    van de Ven, Marieke; Andressoo, Jaan-Olle; Holcomb, Valerie B.; von Lindern, Marieke; Jong, Willeke M. C.; de Zeeuw, Chris I.; Suh, Yousin; Hasty, Paul; Hoeijmakers, Jan H. J.; van der Horst, Gijsbertus T. J.; Mitchell, James R.

    2006-01-01

    How congenital defects causing genome instability can result in the pleiotropic symptoms reminiscent of aging but in a segmental and accelerated fashion remains largely unknown. Most segmental progerias are associated with accelerated fibroblast senescence, suggesting that cellular senescence is a

  9. Flow heterogeneity following global no-flow ischemia in isolated rabbit heart

    International Nuclear Information System (INIS)

    Marshall, Robert C.; Powers-Risius, Patricia; Reutter, Bryan W.; Schustz, Amy M.; Kuo, Chaincy; Huesman, Michelle K.; Huesman, Ronald H.

    2002-01-01

    The purpose of this study was to evaluate flow heterogeneity and impaired reflow during reperfusion following 60 min global no-flow ischemia in the isolated rabbit heart. Radiolabeled microspheres were used to measure relative flow in small left ventricular (LV) segments in five ischemia + reperfused hearts and in five non-ischemic controls. Although variable in the post-ischemic hearts, flow heterogeneity was increased relative to pre-ischemia for the whole LV (0.92 plus or minus 0.41 vs. 0.37 plus or minus 0.07, P < 0.05) as well as the subendocardium (Endo) and subepicardium (Epi) considered separately (endo: 1.28 plus or minus 0.74 vs. 0.30 plus or minus 0.09; epi: 0.69 plus or minus 0.22 vs. 0.38 plus or minus 0.08; P < 0.05 for both comparisons) during early reperfusion. There were also segments with abnormally reduced reflow. The number of segments with abnormally reduced reflow increased as flow heterogeneity increased. Abnormally reduced reflow indicates that regional ischemia can persist despite restoration of normal global flow. In addition, the relationship between regional and global flow is altered and venous outflow is derived from regions with continued perfusion and not the whole LV. These observations emphasize the need to quantify regional reflow during reperfusion following sustained no-flow ischemia in the isolated rabbit heart

  10. Flow heterogeneity following global no-flow ischemia in isolated rabbit heart

    Energy Technology Data Exchange (ETDEWEB)

    Marshall, Robert C.; Powers-Risius, Patricia; Reutter, Bryan W.; Schustz, Amy M.; Kuo, Chaincy; Huesman, Michelle K.; Huesman, Ronald H.

    2003-02-01

    The purpose of this study was to evaluate flow heterogeneity and impaired reflow during reperfusion following 60 min global no-flow ischemia in the isolated rabbit heart. Radiolabeled microspheres were used to measure relative flow in small left ventricular (LV) segments in five ischemia + reperfused hearts and in five non-ischemic controls. Although variable in the post-ischemic hearts, flow heterogeneity was increased relative to pre-ischemia for the whole LV (0.92 plus or minus 0.41 vs. 0.37 plus or minus 0.07, P < 0.05) as well as the subendocardium (Endo) and subepicardium (Epi) considered separately (endo: 1.28 plus or minus 0.74 vs. 0.30 plus or minus 0.09; epi: 0.69 plus or minus 0.22 vs. 0.38 plus or minus 0.08; P < 0.05 for both comparisons) during early reperfusion. There were also segments with abnormally reduced reflow. The number of segments with abnormally reduced reflow increased as flow heterogeneity increased. Abnormally reduced reflow indicates that regional ischemia can persist despite restoration of normal global flow. In addition, the relationship between regional and global flow is altered and venous outflow is derived from regions with continued perfusion and not the whole LV. These observations emphasize the need to quantify regional reflow during reperfusion following sustained no-flow ischemia in the isolated rabbit heart.

  11. Document flow segmentation for business applications

    Science.gov (United States)

    Daher, Hani; Belaïd, Abdel

    2013-12-01

    The aim of this paper is to propose a document flow supervised segmentation approach applied to real world heterogeneous documents. Our algorithm treats the flow of documents as couples of consecutive pages and studies the relationship that exists between them. At first, sets of features are extracted from the pages where we propose an approach to model the couple of pages into a single feature vector representation. This representation will be provided to a binary classifier which classifies the relationship as either segmentation or continuity. In case of segmentation, we consider that we have a complete document and the analysis of the flow continues by starting a new document. In case of continuity, the couple of pages are assimilated to the same document and the analysis continues on the flow. If there is an uncertainty on whether the relationship between the couple of pages should be classified as a continuity or segmentation, a rejection is decided and the pages analyzed until this point are considered as a "fragment". The first classification already provides good results approaching 90% on certain documents, which is high at this level of the system.

  12. Targeted Identification of Short Interspersed Nuclear Element Families Shows Their Widespread Existence and Extreme Heterogeneity in Plant Genomes[W

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-01-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5′ truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3′ end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families. PMID:21908723

  13. Heterogeneous slip and rupture models of the San Andreas fault zone based upon three-dimensional earthquake tomography

    Energy Technology Data Exchange (ETDEWEB)

    Foxall, William [Univ. of California, Berkeley, CA (United States)

    1992-11-01

    Crystal fault zones exhibit spatially heterogeneous slip behavior at all scales, slip being partitioned between stable frictional sliding, or fault creep, and unstable earthquake rupture. An understanding the mechanisms underlying slip segmentation is fundamental to research into fault dynamics and the physics of earthquake generation. This thesis investigates the influence that large-scale along-strike heterogeneity in fault zone lithology has on slip segmentation. Large-scale transitions from the stable block sliding of the Central 4D Creeping Section of the San Andreas, fault to the locked 1906 and 1857 earthquake segments takes place along the Loma Prieta and Parkfield sections of the fault, respectively, the transitions being accomplished in part by the generation of earthquakes in the magnitude range 6 (Parkfield) to 7 (Loma Prieta). Information on sub-surface lithology interpreted from the Loma Prieta and Parkfield three-dimensional crustal velocity models computed by Michelini (1991) is integrated with information on slip behavior provided by the distributions of earthquakes located using, the three-dimensional models and by surface creep data to study the relationships between large-scale lithological heterogeneity and slip segmentation along these two sections of the fault zone.

  14. Heterogeneous periodicity of drosophila mtDNA: new refutations of neutral and nearly neutral evolution

    Directory of Open Access Journals (Sweden)

    Carlos Y Valenzuela

    2011-01-01

    Full Text Available We found a consistent 3-site periodicity of the X²9 values for the heterogeneity of the distribution of the second base in relation to the first base of dinucleotides separated by 0 (contiguous, 1, 2, 3 ... 17 (K nucleotide sites in Drosophila mtDNA. Triplets of X²9 values were found where the first was over 300 and the second and third ranged between 37 and 114 (previous studies. In this study, the periodicity was significant until separation of 2011K, and a structure of deviations from randomness among dinucleotides was found. The most deviant dinucleotides were G-G, G-C and C-G for the first, second and third element of the triplet, respectively. In these three cases there were more dinucleotides observed than expected. This inter-bases correlation and periodicity may be related to the tertiary structure of circular DNA, like that of prokaryotes and mitochondria, to protect and preserve it. The mtDNA with 19.517 bp was divided into four equal segments of 4.879 bp. The fourth sub-segment presented a very low proportion of G and C, the internucleotide interaction was weaker in this sub-segment and no periodicity was found. The maintenance of this mtDNA structure and organization for millions of generations, in spite of a high recurrent mutation rate, does not support the notion of neutralism or near neutralism. The high level of internucleotide interaction and periodicity indicate that every nucleotide is co-adapted with the residual genome.

  15. Implementing Genome-Driven Oncology

    Science.gov (United States)

    Hyman, David M.; Taylor, Barry S.; Baselga, José

    2017-01-01

    Early successes in identifying and targeting individual oncogenic drivers, together with the increasing feasibility of sequencing tumor genomes, have brought forth the promise of genome-driven oncology care. As we expand the breadth and depth of genomic analyses, the biological and clinical complexity of its implementation will be unparalleled. Challenges include target credentialing and validation, implementing drug combinations, clinical trial designs, targeting tumor heterogeneity, and deploying technologies beyond DNA sequencing, among others. We review how contemporary approaches are tackling these challenges and will ultimately serve as an engine for biological discovery and increase our insight into cancer and its treatment. PMID:28187282

  16. Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

    Science.gov (United States)

    Wen, Chiu-Ming

    2017-08-01

    An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.

  17. Speculative segmented sum for sparse matrix-vector multiplication on heterogeneous processors

    DEFF Research Database (Denmark)

    Liu, Weifeng; Vinter, Brian

    2015-01-01

    of the same chip is triggered to re-arrange the predicted partial sums for a correct resulting vector. On three heterogeneous processors from Intel, AMD and nVidia, using 20 sparse matrices as a benchmark suite, the experimental results show that our method obtains significant performance improvement over...

  18. Assessment of heterogeneity between European Populations: a Baltic and Danish replication case-control study of SNPs from a recent European ulcerative colitis genome wide association study.

    Science.gov (United States)

    Andersen, Vibeke; Ernst, Anja; Sventoraityte, Jurgita; Kupcinskas, Limas; Jacobsen, Bent A; Krarup, Henrik B; Vogel, Ulla; Jonaitis, Laimas; Denapiene, Goda; Kiudelis, Gediminas; Balschun, Tobias; Franke, Andre

    2011-10-13

    Differences in the genetic architecture of inflammatory bowel disease between different European countries and ethnicities have previously been reported. In the present study, we wanted to assess the role of 11 newly identified UC risk variants, derived from a recent European UC genome wide association study (GWAS) (Franke et al., 2010), for 1) association with UC in the Nordic countries, 2) for population heterogeneity between the Nordic countries and the rest of Europe, and, 3) eventually, to drive some of the previous findings towards overall genome-wide significance. Eleven SNPs were replicated in a Danish sample consisting of 560 UC patients and 796 controls and nine missing SNPs of the German GWAS study were successfully genotyped in the Baltic sample comprising 441 UC cases and 1156 controls. The independent replication data was then jointly analysed with the original data and systematic comparisons of the findings between ethnicities were made. Pearson's χ2, Breslow-Day (BD) and Cochran-Mantel-Haenszel (CMH) tests were used for association analyses and heterogeneity testing. The rs5771069 (IL17REL) SNP was not associated with UC in the Danish panel. The rs5771069 (IL17REL) SNP was significantly associated with UC in the combined Baltic, Danish and Norwegian UC study sample driven by the Norwegian panel (OR = 0.89, 95% CI: 0.79-0.98, P = 0.02). No association was found between rs7809799 (SMURF1/KPNA7) and UC (OR = 1.20, 95% CI: 0.95-1.52, P = 0.10) or between UC and all other remaining SNPs. We had 94% chance of detecting an association for rs7809799 (SMURF1/KPNA7) in the combined replication sample, whereas the power were 55% or lower for the remaining SNPs.Statistically significant PBD was found for OR heterogeneity between the combined Baltic, Danish, and Norwegian panel versus the combined German, British, Belgian, and Greek panel (rs7520292 (P = 0.001), rs12518307 (P = 0.007), and rs2395609 (TCP11) (P = 0.01), respectively).No SNP reached genome

  19. Integrating mean and variance heterogeneities to identify differentially expressed genes.

    Science.gov (United States)

    Ouyang, Weiwei; An, Qiang; Zhao, Jinying; Qin, Huaizhen

    2016-12-06

    In functional genomics studies, tests on mean heterogeneity have been widely employed to identify differentially expressed genes with distinct mean expression levels under different experimental conditions. Variance heterogeneity (aka, the difference between condition-specific variances) of gene expression levels is simply neglected or calibrated for as an impediment. The mean heterogeneity in the expression level of a gene reflects one aspect of its distribution alteration; and variance heterogeneity induced by condition change may reflect another aspect. Change in condition may alter both mean and some higher-order characteristics of the distributions of expression levels of susceptible genes. In this report, we put forth a conception of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to the change in experimental condition. We mathematically proved the null independence of existent mean heterogeneity tests and variance heterogeneity tests. Based on the independence, we proposed an integrative mean-variance test (IMVT) to combine gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors under comprehensive simulations of normality and Laplace settings. For moderate samples, the IMVT well controlled type I error rates, and so did existent mean heterogeneity test (i.e., the Welch t test (WT), the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), but the likelihood ratio test (LRT) severely inflated type I error rates. In presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B raised solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment

  20. Unsupervised motion-based object segmentation refined by color

    Science.gov (United States)

    Piek, Matthijs C.; Braspenning, Ralph; Varekamp, Chris

    2003-06-01

    For various applications, such as data compression, structure from motion, medical imaging and video enhancement, there is a need for an algorithm that divides video sequences into independently moving objects. Because our focus is on video enhancement and structure from motion for consumer electronics, we strive for a low complexity solution. For still images, several approaches exist based on colour, but these lack in both speed and segmentation quality. For instance, colour-based watershed algorithms produce a so-called oversegmentation with many segments covering each single physical object. Other colour segmentation approaches exist which somehow limit the number of segments to reduce this oversegmentation problem. However, this often results in inaccurate edges or even missed objects. Most likely, colour is an inherently insufficient cue for real world object segmentation, because real world objects can display complex combinations of colours. For video sequences, however, an additional cue is available, namely the motion of objects. When different objects in a scene have different motion, the motion cue alone is often enough to reliably distinguish objects from one another and the background. However, because of the lack of sufficient resolution of efficient motion estimators, like the 3DRS block matcher, the resulting segmentation is not at pixel resolution, but at block resolution. Existing pixel resolution motion estimators are more sensitive to noise, suffer more from aperture problems or have less correspondence to the true motion of objects when compared to block-based approaches or are too computationally expensive. From its tendency to oversegmentation it is apparent that colour segmentation is particularly effective near edges of homogeneously coloured areas. On the other hand, block-based true motion estimation is particularly effective in heterogeneous areas, because heterogeneous areas improve the chance a block is unique and thus decrease the

  1. Diversity of Pseudomonas Genomes, Including Populus-Associated Isolates, as Revealed by Comparative Genome Analysis.

    Science.gov (United States)

    Jun, Se-Ran; Wassenaar, Trudy M; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A; Ussery, David W

    2016-01-01

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants. Copyright © 2015 Jun et al.

  2. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Baumgarten Andrew

    2004-06-01

    Full Text Available Abstract Background Most genes in Arabidopsis thaliana are members of gene families. How do the members of gene families arise, and how are gene family copy numbers maintained? Some gene families may evolve primarily through tandem duplication and high rates of birth and death in clusters, and others through infrequent polyploidy or large-scale segmental duplications and subsequent losses. Results Our approach to understanding the mechanisms of gene family evolution was to construct phylogenies for 50 large gene families in Arabidopsis thaliana, identify large internal segmental duplications in Arabidopsis, map gene duplications onto the segmental duplications, and use this information to identify which nodes in each phylogeny arose due to segmental or tandem duplication. Examples of six gene families exemplifying characteristic modes are described. Distributions of gene family sizes and patterns of duplication by genomic distance are also described in order to characterize patterns of local duplication and copy number for large gene families. Both gene family size and duplication by distance closely follow power-law distributions. Conclusions Combining information about genomic segmental duplications, gene family phylogenies, and gene positions provides a method to evaluate contributions of tandem duplication and segmental genome duplication in the generation and maintenance of gene families. These differences appear to correspond meaningfully to differences in functional roles of the members of the gene families.

  3. Molecular evolution of avian reovirus: evidence for genetic diversity and reassortment of the S-class genome segments and multiple cocirculating lineages

    International Nuclear Information System (INIS)

    Liu, Hung J.; Lee, Long H.; Hsu, Hsiao W.; Kuo, Liam C.; Liao, Ming H.

    2003-01-01

    Nucleotide sequences of the S-class genome segments of 17 field-isolates and vaccine strains of avian reovirus (ARV) isolated over a 23-year period from different hosts, pathotypes, and geographic locations were examined and analyzed to define phylogenetic profiles and evolutionary mechanism. The S1 genome segment showed noticeably higher divergence than the other S-class genes. The σC-encoding gene has evolved into six distinct lineages. In contrast, the other S-class genes showed less divergence than that of the σC-encoding gene and have evolved into two to three major distinct lineages, respectively. Comparative sequence analysis provided evidence indicating extensive sequence divergence between ARV and other orthoreoviruses. The evolutionary trees of each gene were distinct, suggesting that these genes evolve in an independent manner. Furthermore, variable topologies were the result of frequent genetic reassortment among multiple cocirculating lineages. Results showed genetic diversity correlated more closely with date of isolation and geographic sites than with host species and pathotypes. This is the first evidence demonstrating genetic variability among circulating ARVs through a combination of evolutionary mechanisms involving multiple cocirculating lineages and genetic reassortment. The evolutionary rates and patterns of base substitutions were examined. The evolutionary rate for the σC-encoding gene and σC protein was higher than for the other S-class genes and other family of viruses. With the exception of the σC-encoding gene, which nonsynonymous substitutions predominate over synonymous, the evolutionary process of the other S-class genes can be explained by the neutral theory of molecular evolution. Results revealed that synonymous substitutions predominate over nonsynonymous in the S-class genes, even though genetic diversity and substitution rates vary among the viruses

  4. Genomic Perturbations Reveal Distinct Regulatory Networks in Intrahepatic Cholangiocarcinoma

    DEFF Research Database (Denmark)

    Nepal, Chirag; O'Rourke, Colm J; Oliveira, Douglas Vnp

    2018-01-01

    Intrahepatic cholangiocarcinoma (iCCA) remains a highly heterogeneous malignancy that has eluded effective patient stratification to date. The extent to which such heterogeneity can be influenced by individual driver mutations remains to be evaluated. Here, we analyzed genomic (whole-exome sequen...

  5. Translocations of chromosome end-segments and facultative heterochromatin promote meiotic ring formation in evening primroses.

    Science.gov (United States)

    Golczyk, Hieronim; Massouh, Amid; Greiner, Stephan

    2014-03-01

    Due to reciprocal chromosomal translocations, many species of Oenothera (evening primrose) form permanent multichromosomal meiotic rings. However, regular bivalent pairing is also observed. Chiasmata are restricted to chromosomal ends, which makes homologous recombination virtually undetectable. Genetic diversity is achieved by changing linkage relations of chromosomes in rings and bivalents via hybridization and reciprocal translocations. Although the structural prerequisite for this system is enigmatic, whole-arm translocations are widely assumed to be the mechanistic driving force. We demonstrate that this prerequisite is genome compartmentation into two epigenetically defined chromatin fractions. The first one facultatively condenses in cycling cells into chromocenters negative both for histone H3 dimethylated at lysine 4 and for C-banding, and forms huge condensed middle chromosome regions on prophase chromosomes. Remarkably, it decondenses in differentiating cells. The second fraction is euchromatin confined to distal chromosome segments, positive for histone H3 lysine 4 dimethylation and for histone H3 lysine 27 trimethylation. The end-segments are deprived of canonical telomeres but capped with constitutive heterochromatin. This genomic organization promotes translocation breakpoints between the two chromatin fractions, thus facilitating exchanges of end-segments. We challenge the whole-arm translocation hypothesis by demonstrating why reciprocal translocations of chromosomal end-segments should strongly promote meiotic rings and evolution toward permanent translocation heterozygosity. Reshuffled end-segments, each possessing a major crossover hot spot, can furthermore explain meiotic compatibility between genomes with different translocation histories.

  6. Genetic determinants of P wave duration and PR segment

    NARCIS (Netherlands)

    Verweij, Niek; Mateo Leach, Irene; van den Boogaard, Malou; van Veldhuisen, Dirk J.; Christoffels, Vincent M.; Hillege, Hans L.; van Gilst, Wiek H.; Barnett, Phil; de Boer, Rudolf A.; van der Harst, Pim

    2014-01-01

    The PR interval on the ECG reflects atrial depolarization and atrioventricular nodal delay which can be partially differentiated by P wave duration and PR segment, respectively. Genome-wide association studies have identified several genetic loci for PR interval, but it remains to be determined

  7. Genetic Determinants of P Wave Duration and PR Segment

    NARCIS (Netherlands)

    Verweij, Niek; Mateo Leach, Irene; van den Boogaard, Malou; van Veldhuisen, Dirk J.; Christoffels, Vincent M.; Hillege, Hans L.; van Gilst, Wiek H.; Barnett, Phil; de Boer, Rudolf A.; van der Harst, Pim

    Background-The PR interval on the ECG reflects atrial depolarization and atrioventricular nodal delay which can be partially differentiated by P wave duration and PR segment, respectively. Genome-wide association studies have identified several genetic loci for PR interval, but it remains to be

  8. Three-Dimensional Segmentation of the Tumor in Computed Tomographic Images of Neuroblastoma

    OpenAIRE

    Deglint, Hanford J.; Rangayyan, Rangaraj M.; Ayres, Fábio J.; Boag, Graham S.; Zuffo, Marcelo K.

    2006-01-01

    Segmentation of the tumor in neuroblastoma is complicated by the fact that the mass is almost always heterogeneous in nature; furthermore, viable tumor, necrosis, and normal tissue are often intermixed. Tumor definition and diagnosis require the analysis of the spatial distribution and Hounsfield unit (HU) values of voxels in computed tomography (CT) images, coupled with a knowledge of normal anatomy. Segmentation and analysis of the tissue composition of the tumor can assist in quantitative ...

  9. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  10. Composition and genomic organization of arthropod Hox clusters.

    Science.gov (United States)

    Pace, Ryan M; Grbić, Miodrag; Nagy, Lisa M

    2016-01-01

    The ancestral arthropod is believed to have had a clustered arrangement of ten Hox genes. Within arthropods, Hox gene mutations result in transformation of segment identities. Despite the fact that variation in segment number/character was common in the diversification of arthropods, few examples of Hox gene gains/losses have been correlated with morphological evolution. Furthermore, a full appreciation of the variation in the genomic arrangement of Hox genes in extant arthropods has not been recognized, as genome sequences from each major arthropod clade have not been reported until recently. Initial genomic analysis of the chelicerate Tetranychus urticae suggested that loss of Hox genes and Hox gene clustering might be more common than previously assumed. To further characterize the genomic evolution of arthropod Hox genes, we compared the genomic arrangement and general characteristics of Hox genes from representative taxa from each arthropod subphylum. In agreement with others, we find arthropods generally contain ten Hox genes arranged in a common orientation in the genome, with an increasing number of sampled species missing either Hox3 or abdominal-A orthologs. The genomic clustering of Hox genes in species we surveyed varies significantly, ranging from 0.3 to 13.6 Mb. In all species sampled, arthropod Hox genes are dispersed in the genome relative to the vertebrate Mus musculus. Differences in Hox cluster size arise from variation in the number of intervening genes, intergenic spacing, and the size of introns and UTRs. In the arthropods surveyed, Hox gene duplications are rare and four microRNAs are, in general, conserved in similar genomic positions relative to the Hox genes. The tightly clustered Hox complexes found in the vertebrates are not evident within arthropods, and differential patterns of Hox gene dispersion are found throughout the arthropods. The comparative genomic data continue to support an ancestral arthropod Hox cluster of ten genes with

  11. Bacillus subtilis genome diversity.

    Science.gov (United States)

    Earl, Ashlee M; Losick, Richard; Kolter, Roberto

    2007-02-01

    Microarray-based comparative genomic hybridization (M-CGH) is a powerful method for rapidly identifying regions of genome diversity among closely related organisms. We used M-CGH to examine the genome diversity of 17 strains belonging to the nonpathogenic species Bacillus subtilis. Our M-CGH results indicate that there is considerable genetic heterogeneity among members of this species; nearly one-third of Bsu168-specific genes exhibited variability, as measured by the microarray hybridization intensities. The variable loci include those encoding proteins involved in antibiotic production, cell wall synthesis, sporulation, and germination. The diversity in these genes may reflect this organism's ability to survive in diverse natural settings.

  12. Risk & Hedging Behavior: The Role and Determinants of Latent Heterogeneity

    NARCIS (Netherlands)

    Pennings, J.M.E.; Garcia, P.

    2010-01-01

    The notion of heterogeneous behavior is well grounded in economic theory. Recently it has been shown in a hedging context that the influence of risk attitudes and risk perceptions varies for different segments using a generalized mixture regression model. Here, using recently developed individual

  13. Genetic heterogeneity of L-Zagreb mumps virus vaccine strain.

    Science.gov (United States)

    Kosutic-Gulija, Tanja; Forcic, Dubravko; Santak, Maja; Ramljak, Ana; Mateljak-Lukacevic, Sanja; Mazuran, Renata

    2008-07-10

    The most often used mumps vaccine strains Jeryl Lynn (JL), RIT4385, Urabe-AM9, L-Zagreb and L-3 differ in immunogenicity and reactogenicity. Previous analyses showed that JL, Urabe-AM9 and L-3 are genetically heterogeneous. We identified the heterogeneity of L-Zagreb throughout the entire genome. Two major variants were defined: variant A being identical to the consensus sequence of viral seeds and vaccine(s) and variant B which differs from variant A in three nucleotide positions. The difference between viral variants in L-Zagreb strain is insufficient for distinct viral strains to be defined. We demonstrated that proportion of variants in L-Zagreb viral population depends on cell substrate used for viral replication in vitro and in vivo. L-Zagreb strain should be considered as a single strain composed of at least two variant viral genomes.

  14. GDTN: Genome-Based Delay Tolerant Network Formation in Heterogeneous 5G Using Inter-UA Collaboration.

    Science.gov (United States)

    You, Ilsun; Sharma, Vishal; Atiquzzaman, Mohammed; Choo, Kim-Kwang Raymond

    2016-01-01

    With a more Internet-savvy and sophisticated user base, there are more demands for interactive applications and services. However, it is a challenge for existing radio access networks (e.g. 3G and 4G) to cope with the increasingly demanding requirements such as higher data rates and wider coverage area. One potential solution is the inter-collaborative deployment of multiple radio devices in a 5G setting designed to meet exacting user demands, and facilitate the high data rate requirements in the underlying networks. These heterogeneous 5G networks can readily resolve the data rate and coverage challenges. Networks established using the hybridization of existing networks have diverse military and civilian applications. However, there are inherent limitations in such networks such as irregular breakdown, node failures, and halts during speed transmissions. In recent years, there have been attempts to integrate heterogeneous 5G networks with existing ad hoc networks to provide a robust solution for delay-tolerant transmissions in the form of packet switched networks. However, continuous connectivity is still required in these networks, in order to efficiently regulate the flow to allow the formation of a robust network. Therefore, in this paper, we present a novel network formation consisting of nodes from different network maneuvered by Unmanned Aircraft (UA). The proposed model utilizes the features of a biological aspect of genomes and forms a delay tolerant network with existing network models. This allows us to provide continuous and robust connectivity. We then demonstrate that the proposed network model has an efficient data delivery, lower overheads and lesser delays with high convergence rate in comparison to existing approaches, based on evaluations in both real-time testbed and simulation environment.

  15. Image Segmentation Parameter Optimization Considering Within- and Between-Segment Heterogeneity at Multiple Scale Levels: Test Case for Mapping Residential Areas Using Landsat Imagery

    Directory of Open Access Journals (Sweden)

    Brian A. Johnson

    2015-10-01

    Full Text Available Multi-scale/multi-level geographic object-based image analysis (MS-GEOBIA methods are becoming widely-used in remote sensing because single-scale/single-level (SS-GEOBIA methods are often unable to obtain an accurate segmentation and classification of all land use/land cover (LULC types in an image. However, there have been few comparisons between SS-GEOBIA and MS-GEOBIA approaches for the purpose of mapping a specific LULC type, so it is not well understood which is more appropriate for this task. In addition, there are few methods for automating the selection of segmentation parameters for MS-GEOBIA, while manual selection (i.e., trial-and-error approach of parameters can be quite challenging and time-consuming. In this study, we examined SS-GEOBIA and MS-GEOBIA approaches for extracting residential areas in Landsat 8 imagery, and compared naïve and parameter-optimized segmentation approaches to assess whether unsupervised segmentation parameter optimization (USPO could improve the extraction of residential areas. Our main findings were: (i the MS-GEOBIA approaches achieved higher classification accuracies than the SS-GEOBIA approach, and (ii USPO resulted in more accurate MS-GEOBIA classification results while reducing the number of segmentation levels and classification variables considerably.

  16. Heterochromatin heterogeneity in Hypostomus prope unae (Steindachner, 1878 (Siluriformes, Loricariidae from Northeastern Brazil

    Directory of Open Access Journals (Sweden)

    Jamille Bitencourt

    2011-11-01

    Full Text Available Cytogenetic analyses using C-banding and chromosomal digestion by several restriction enzymes were carried out in four populations (named A, B, C and D of Hypostomus prope unae (Loricariidae, Hypostominae from Contas river basin, northeastern Brazil. These populations share 2n=76 and single NORs on the second metacentric pair but exclusive karyotype forms for each locality. Populations A and B presented conspicuous terminal and interstitial heterochromatic blocks on most of acrocentric chromosomes and equivalent to NORs with differences in both position and bearing pair. Population D showed evident marks at interstitial regions and interspersed with nucleolar region while population C presented interstitial and terminal heterochromatin segments, non-coincident with NORs. The banding pattern after digestion with the endonucleases Alu I, Bam HI, Hae III and Dde I revealed a remarkable heterogeneity within heterochromatin, allowing the identification of distinctive clusters of repeated DNA in the studied populations, besides specific patterns along euchromatic regions. The analysis using restriction enzymes has proved to be highly informative, characterizing population differences and peculiarities in the genome organization of H. prope unae.

  17. Assessment of heterogeneity between European Populations: a Baltic and Danish replication case-control study of SNPs from a recent European ulcerative colitis genome wide association study

    Directory of Open Access Journals (Sweden)

    Jonaitis Laimas

    2011-10-01

    Full Text Available Abstract Background Differences in the genetic architecture of inflammatory bowel disease between different European countries and ethnicities have previously been reported. In the present study, we wanted to assess the role of 11 newly identified UC risk variants, derived from a recent European UC genome wide association study (GWAS (Franke et al., 2010, for 1 association with UC in the Nordic countries, 2 for population heterogeneity between the Nordic countries and the rest of Europe, and, 3 eventually, to drive some of the previous findings towards overall genome-wide significance. Methods Eleven SNPs were replicated in a Danish sample consisting of 560 UC patients and 796 controls and nine missing SNPs of the German GWAS study were successfully genotyped in the Baltic sample comprising 441 UC cases and 1156 controls. The independent replication data was then jointly analysed with the original data and systematic comparisons of the findings between ethnicities were made. Pearson's χ2, Breslow-Day (BD and Cochran-Mantel-Haenszel (CMH tests were used for association analyses and heterogeneity testing. Results The rs5771069 (IL17REL SNP was not associated with UC in the Danish panel. The rs5771069 (IL17REL SNP was significantly associated with UC in the combined Baltic, Danish and Norwegian UC study sample driven by the Norwegian panel (OR = 0.89, 95% CI: 0.79-0.98, P = 0.02. No association was found between rs7809799 (SMURF1/KPNA7 and UC (OR = 1.20, 95% CI: 0.95-1.52, P = 0.10 or between UC and all other remaining SNPs. We had 94% chance of detecting an association for rs7809799 (SMURF1/KPNA7 in the combined replication sample, whereas the power were 55% or lower for the remaining SNPs. Statistically significant PBD was found for OR heterogeneity between the combined Baltic, Danish, and Norwegian panel versus the combined German, British, Belgian, and Greek panel (rs7520292 (P = 0.001, rs12518307 (P = 0.007, and rs2395609 (TCP11 (P = 0

  18. Genetic variants influencing phenotypic variance heterogeneity.

    Science.gov (United States)

    Ek, Weronica E; Rask-Andersen, Mathias; Karlsson, Torgny; Enroth, Stefan; Gyllensten, Ulf; Johansson, Åsa

    2018-03-01

    Most genetic studies identify genetic variants associated with disease risk or with the mean value of a quantitative trait. More rarely, genetic variants associated with variance heterogeneity are considered. In this study, we have identified such variance single-nucleotide polymorphisms (vSNPs) and examined if these represent biological gene × gene or gene × environment interactions or statistical artifacts caused by multiple linked genetic variants influencing the same phenotype. We have performed a genome-wide study, to identify vSNPs associated with variance heterogeneity in DNA methylation levels. Genotype data from over 10 million single-nucleotide polymorphisms (SNPs), and DNA methylation levels at over 430 000 CpG sites, were analyzed in 729 individuals. We identified vSNPs for 7195 CpG sites (P mean DNA methylation levels. We further showed that variance heterogeneity between genotypes mainly represents additional, often rare, SNPs in linkage disequilibrium (LD) with the respective vSNP and for some vSNPs, multiple low frequency variants co-segregating with one of the vSNP alleles. Therefore, our results suggest that variance heterogeneity of DNA methylation mainly represents phenotypic effects by multiple SNPs, rather than biological interactions. Such effects may also be important for interpreting variance heterogeneity of more complex clinical phenotypes.

  19. Two-stage atlas subset selection in multi-atlas based image segmentation

    Energy Technology Data Exchange (ETDEWEB)

    Zhao, Tingting, E-mail: tingtingzhao@mednet.ucla.edu; Ruan, Dan, E-mail: druan@mednet.ucla.edu [The Department of Radiation Oncology, University of California, Los Angeles, California 90095 (United States)

    2015-06-15

    Purpose: Fast growing access to large databases and cloud stored data presents a unique opportunity for multi-atlas based image segmentation and also presents challenges in heterogeneous atlas quality and computation burden. This work aims to develop a novel two-stage method tailored to the special needs in the face of large atlas collection with varied quality, so that high-accuracy segmentation can be achieved with low computational cost. Methods: An atlas subset selection scheme is proposed to substitute a significant portion of the computationally expensive full-fledged registration in the conventional scheme with a low-cost alternative. More specifically, the authors introduce a two-stage atlas subset selection method. In the first stage, an augmented subset is obtained based on a low-cost registration configuration and a preliminary relevance metric; in the second stage, the subset is further narrowed down to a fusion set of desired size, based on full-fledged registration and a refined relevance metric. An inference model is developed to characterize the relationship between the preliminary and refined relevance metrics, and a proper augmented subset size is derived to ensure that the desired atlases survive the preliminary selection with high probability. Results: The performance of the proposed scheme has been assessed with cross validation based on two clinical datasets consisting of manually segmented prostate and brain magnetic resonance images, respectively. The proposed scheme demonstrates comparable end-to-end segmentation performance as the conventional single-stage selection method, but with significant computation reduction. Compared with the alternative computation reduction method, their scheme improves the mean and medium Dice similarity coefficient value from (0.74, 0.78) to (0.83, 0.85) and from (0.82, 0.84) to (0.95, 0.95) for prostate and corpus callosum segmentation, respectively, with statistical significance. Conclusions: The authors

  20. Two-stage atlas subset selection in multi-atlas based image segmentation.

    Science.gov (United States)

    Zhao, Tingting; Ruan, Dan

    2015-06-01

    Fast growing access to large databases and cloud stored data presents a unique opportunity for multi-atlas based image segmentation and also presents challenges in heterogeneous atlas quality and computation burden. This work aims to develop a novel two-stage method tailored to the special needs in the face of large atlas collection with varied quality, so that high-accuracy segmentation can be achieved with low computational cost. An atlas subset selection scheme is proposed to substitute a significant portion of the computationally expensive full-fledged registration in the conventional scheme with a low-cost alternative. More specifically, the authors introduce a two-stage atlas subset selection method. In the first stage, an augmented subset is obtained based on a low-cost registration configuration and a preliminary relevance metric; in the second stage, the subset is further narrowed down to a fusion set of desired size, based on full-fledged registration and a refined relevance metric. An inference model is developed to characterize the relationship between the preliminary and refined relevance metrics, and a proper augmented subset size is derived to ensure that the desired atlases survive the preliminary selection with high probability. The performance of the proposed scheme has been assessed with cross validation based on two clinical datasets consisting of manually segmented prostate and brain magnetic resonance images, respectively. The proposed scheme demonstrates comparable end-to-end segmentation performance as the conventional single-stage selection method, but with significant computation reduction. Compared with the alternative computation reduction method, their scheme improves the mean and medium Dice similarity coefficient value from (0.74, 0.78) to (0.83, 0.85) and from (0.82, 0.84) to (0.95, 0.95) for prostate and corpus callosum segmentation, respectively, with statistical significance. The authors have developed a novel two-stage atlas

  1. Two-stage atlas subset selection in multi-atlas based image segmentation

    International Nuclear Information System (INIS)

    Zhao, Tingting; Ruan, Dan

    2015-01-01

    Purpose: Fast growing access to large databases and cloud stored data presents a unique opportunity for multi-atlas based image segmentation and also presents challenges in heterogeneous atlas quality and computation burden. This work aims to develop a novel two-stage method tailored to the special needs in the face of large atlas collection with varied quality, so that high-accuracy segmentation can be achieved with low computational cost. Methods: An atlas subset selection scheme is proposed to substitute a significant portion of the computationally expensive full-fledged registration in the conventional scheme with a low-cost alternative. More specifically, the authors introduce a two-stage atlas subset selection method. In the first stage, an augmented subset is obtained based on a low-cost registration configuration and a preliminary relevance metric; in the second stage, the subset is further narrowed down to a fusion set of desired size, based on full-fledged registration and a refined relevance metric. An inference model is developed to characterize the relationship between the preliminary and refined relevance metrics, and a proper augmented subset size is derived to ensure that the desired atlases survive the preliminary selection with high probability. Results: The performance of the proposed scheme has been assessed with cross validation based on two clinical datasets consisting of manually segmented prostate and brain magnetic resonance images, respectively. The proposed scheme demonstrates comparable end-to-end segmentation performance as the conventional single-stage selection method, but with significant computation reduction. Compared with the alternative computation reduction method, their scheme improves the mean and medium Dice similarity coefficient value from (0.74, 0.78) to (0.83, 0.85) and from (0.82, 0.84) to (0.95, 0.95) for prostate and corpus callosum segmentation, respectively, with statistical significance. Conclusions: The authors

  2. Genetic heterogeneity of L-Zagreb mumps virus vaccine strain

    Directory of Open Access Journals (Sweden)

    Mateljak-Lukacevic Sanja

    2008-07-01

    Full Text Available Abstract Background The most often used mumps vaccine strains Jeryl Lynn (JL, RIT4385, Urabe-AM9, L-Zagreb and L-3 differ in immunogenicity and reactogenicity. Previous analyses showed that JL, Urabe-AM9 and L-3 are genetically heterogeneous. Results We identified the heterogeneity of L-Zagreb throughout the entire genome. Two major variants were defined: variant A being identical to the consensus sequence of viral seeds and vaccine(s and variant B which differs from variant A in three nucleotide positions. The difference between viral variants in L-Zagreb strain is insufficient for distinct viral strains to be defined. We demonstrated that proportion of variants in L-Zagreb viral population depends on cell substrate used for viral replication in vitro and in vivo. Conclusion L-Zagreb strain should be considered as a single strain composed of at least two variant viral genomes.

  3. Genetic heterogeneity of L-Zagreb mumps virus vaccine strain

    Science.gov (United States)

    Kosutic-Gulija, Tanja; Forcic, Dubravko; Šantak, Maja; Ramljak, Ana; Mateljak-Lukacevic, Sanja; Mazuran, Renata

    2008-01-01

    Background The most often used mumps vaccine strains Jeryl Lynn (JL), RIT4385, Urabe-AM9, L-Zagreb and L-3 differ in immunogenicity and reactogenicity. Previous analyses showed that JL, Urabe-AM9 and L-3 are genetically heterogeneous. Results We identified the heterogeneity of L-Zagreb throughout the entire genome. Two major variants were defined: variant A being identical to the consensus sequence of viral seeds and vaccine(s) and variant B which differs from variant A in three nucleotide positions. The difference between viral variants in L-Zagreb strain is insufficient for distinct viral strains to be defined. We demonstrated that proportion of variants in L-Zagreb viral population depends on cell substrate used for viral replication in vitro and in vivo. Conclusion L-Zagreb strain should be considered as a single strain composed of at least two variant viral genomes. PMID:18616793

  4. Universal global imprints of genome growth and evolution--equivalent length and cumulative mutation density.

    Directory of Open Access Journals (Sweden)

    Hong-Da Chen

    Full Text Available BACKGROUND: Segmental duplication is widely held to be an important mode of genome growth and evolution. Yet how this would affect the global structure of genomes has been little discussed. METHODS/PRINCIPAL FINDINGS: Here, we show that equivalent length, or L(e, a quantity determined by the variance of fluctuating part of the distribution of the k-mer frequencies in a genome, characterizes the latter's global structure. We computed the L(es of 865 complete chromosomes and found that they have nearly universal but (k-dependent values. The differences among the L(e of a chromosome and those of its coding and non-coding parts were found to be slight. CONCLUSIONS: We verified that these non-trivial results are natural consequences of a genome growth model characterized by random segmental duplication and random point mutation, but not of any model whose dominant growth mechanism is not segmental duplication. Our study also indicates that genomes have a nearly universal cumulative "point" mutation density of about 0.73 mutations per site that is compatible with the relatively low mutation rates of (1-5 x 10(-3/site/Mya previously determined by sequence comparison for the human and E. coli genomes.

  5. Heterogeneity of mammary lesions represent molecular differences

    International Nuclear Information System (INIS)

    Namba, Ruria; Gregg, Jeffrey P; Maglione, Jeannie E; Davis, Ryan R; Baron, Colin A; Liu, Stephenie; Carmack, Condie E; Young, Lawrence JT; Borowsky, Alexander D; Cardiff, Robert D

    2006-01-01

    Human breast cancer is a heterogeneous disease, histopathologically, molecularly and phenotypically. The molecular basis of this heterogeneity is not well understood. We have used a mouse model of DCIS that consists of unique lines of mammary intraepithelial neoplasia (MIN) outgrowths, the premalignant lesion in the mouse that progress to invasive carcinoma, to understand the molecular changes that are characteristic to certain phenotypes. Each MIN-O line has distinguishable morphologies, metastatic potentials and estrogen dependencies. We utilized oligonucleotide expression arrays and high resolution array comparative genomic hybridization (aCGH) to investigate whole genome expression patterns and whole genome aberrations in both the MIN-O and tumor from four different MIN-O lines that each have different phenotypes. From the whole genome analysis at 35 kb resolution, we found that chromosome 1, 2, 10, and 11 were frequently associated with whole chromosome gains in the MIN-Os. In particular, two MIN-O lines had the majority of the chromosome gains. Although we did not find any whole chromosome loss, we identified 3 recurring chromosome losses (2F1-2, 3E4, 17E2) and two chromosome copy number gains on chromosome 11. These interstitial deletions and duplications were verified with a custom made array designed to interrogate the specific regions at approximately 550 bp resolution. We demonstrated that expression and genomic changes are present in the early premalignant lesions and that these molecular profiles can be correlated to phenotype (metastasis and estrogen responsiveness). We also identified expression changes associated with genomic instability. Progression to invasive carcinoma was associated with few additional changes in gene expression and genomic organization. Therefore, in the MIN-O mice, early premalignant lesions have the major molecular and genetic changes required and these changes have important phenotypic significance. In contrast, the changes

  6. RadGenomics project

    Energy Technology Data Exchange (ETDEWEB)

    Iwakawa, Mayumi; Imai, Takashi; Harada, Yoshinobu [National Inst. of Radiological Sciences, Chiba (Japan). Frontier Research Center] [and others

    2002-06-01

    Human health is determined by a complex interplay of factors, predominantly between genetic susceptibility, environmental conditions and aging. The ultimate aim of the RadGenomics (Radiation Genomics) project is to understand the implications of heterogeneity in responses to ionizing radiation arising from genetic variation between individuals in the human population. The rapid progression of the human genome sequencing and the recent development of new technologies in molecular genetics are providing us with new opportunities to understand the genetic basis of individual differences in susceptibility to natural and/or artificial environmental factors, including radiation exposure. The RadGenomics project will inevitably lead to improved protocols for personalized radiotherapy and reductions in the potential side effects of such treatment. The project will contribute to future research into the molecular mechanisms of radiation sensitivity in humans and will stimulate the development of new high-throughput technologies for a broader application of biological and medical sciences. The staff members are specialists in a variety of fields, including genome science, radiation biology, medical science, molecular biology, and informatics, and have joined the RadGenomics project from various universities, companies, and research institutes. The project started in April 2001. (author)

  7. RadGenomics project

    International Nuclear Information System (INIS)

    Iwakawa, Mayumi; Imai, Takashi; Harada, Yoshinobu

    2002-01-01

    Human health is determined by a complex interplay of factors, predominantly between genetic susceptibility, environmental conditions and aging. The ultimate aim of the RadGenomics (Radiation Genomics) project is to understand the implications of heterogeneity in responses to ionizing radiation arising from genetic variation between individuals in the human population. The rapid progression of the human genome sequencing and the recent development of new technologies in molecular genetics are providing us with new opportunities to understand the genetic basis of individual differences in susceptibility to natural and/or artificial environmental factors, including radiation exposure. The RadGenomics project will inevitably lead to improved protocols for personalized radiotherapy and reductions in the potential side effects of such treatment. The project will contribute to future research into the molecular mechanisms of radiation sensitivity in humans and will stimulate the development of new high-throughput technologies for a broader application of biological and medical sciences. The staff members are specialists in a variety of fields, including genome science, radiation biology, medical science, molecular biology, and informatics, and have joined the RadGenomics project from various universities, companies, and research institutes. The project started in April 2001. (author)

  8. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration.

    Science.gov (United States)

    Thorvaldsdóttir, Helga; Robinson, James T; Mesirov, Jill P

    2013-03-01

    Data visualization is an essential component of genomic data analysis. However, the size and diversity of the data sets produced by today's sequencing and array-based profiling methods present major challenges to visualization tools. The Integrative Genomics Viewer (IGV) is a high-performance viewer that efficiently handles large heterogeneous data sets, while providing a smooth and intuitive user experience at all levels of genome resolution. A key characteristic of IGV is its focus on the integrative nature of genomic studies, with support for both array-based and next-generation sequencing data, and the integration of clinical and phenotypic data. Although IGV is often used to view genomic data from public sources, its primary emphasis is to support researchers who wish to visualize and explore their own data sets or those from colleagues. To that end, IGV supports flexible loading of local and remote data sets, and is optimized to provide high-performance data visualization and exploration on standard desktop systems. IGV is freely available for download from http://www.broadinstitute.org/igv, under a GNU LGPL open-source license.

  9. Whole genome sequencing reveals genomic heterogeneity and antibiotic purification in Mycobacterium tuberculosis isolates

    KAUST Repository

    Black, PA; de Vos, M.; Louw, GE; van der Merwe, RG; Dippenaar, A.; Streicher, EM; Abdallah, A. M.; Sampson, SL; Victor, TC; Dolby, T.; Simpson, JA; van Helden, PD; Warren, RM; Pain, Arnab

    2015-01-01

    Our study demonstrated true levels of genetic diversity within an M. tuberculosis population and showed that genetic diversity may be re-defined when a selective pressure, such as drug exposure, is imposed on M. tuberculosis populations during the course of infection. This suggests that the genome of M. tuberculosis is more dynamic than previously thought, suggesting preparedness to respond to a changing environment.

  10. Draft genome sequence of the Coccolithovirus Emiliania huxleyi virus 203.

    Science.gov (United States)

    Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

    2011-12-01

    The Coccolithoviridae are a recently discovered group of viruses that infect the marine coccolithophorid Emiliania huxleyi. Emiliania huxleyi virus 203 (EhV-203) has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 400 kbp, consisting of 464 coding sequences (CDSs). Here we describe the genomic features of EhV-203 together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.

  11. Quantification of the heterogeneity of prognostic cellular biomarkers in ewing sarcoma using automated image and random survival forest analysis.

    Directory of Open Access Journals (Sweden)

    Claudia Bühnemann

    Full Text Available Driven by genomic somatic variation, tumour tissues are typically heterogeneous, yet unbiased quantitative methods are rarely used to analyse heterogeneity at the protein level. Motivated by this problem, we developed automated image segmentation of images of multiple biomarkers in Ewing sarcoma to generate distributions of biomarkers between and within tumour cells. We further integrate high dimensional data with patient clinical outcomes utilising random survival forest (RSF machine learning. Using material from cohorts of genetically diagnosed Ewing sarcoma with EWSR1 chromosomal translocations, confocal images of tissue microarrays were segmented with level sets and watershed algorithms. Each cell nucleus and cytoplasm were identified in relation to DAPI and CD99, respectively, and protein biomarkers (e.g. Ki67, pS6, Foxo3a, EGR1, MAPK localised relative to nuclear and cytoplasmic regions of each cell in order to generate image feature distributions. The image distribution features were analysed with RSF in relation to known overall patient survival from three separate cohorts (185 informative cases. Variation in pre-analytical processing resulted in elimination of a high number of non-informative images that had poor DAPI localisation or biomarker preservation (67 cases, 36%. The distribution of image features for biomarkers in the remaining high quality material (118 cases, 104 features per case were analysed by RSF with feature selection, and performance assessed using internal cross-validation, rather than a separate validation cohort. A prognostic classifier for Ewing sarcoma with low cross-validation error rates (0.36 was comprised of multiple features, including the Ki67 proliferative marker and a sub-population of cells with low cytoplasmic/nuclear ratio of CD99. Through elimination of bias, the evaluation of high-dimensionality biomarker distribution within cell populations of a tumour using random forest analysis in quality

  12. Quantification of the heterogeneity of prognostic cellular biomarkers in ewing sarcoma using automated image and random survival forest analysis.

    Science.gov (United States)

    Bühnemann, Claudia; Li, Simon; Yu, Haiyue; Branford White, Harriet; Schäfer, Karl L; Llombart-Bosch, Antonio; Machado, Isidro; Picci, Piero; Hogendoorn, Pancras C W; Athanasou, Nicholas A; Noble, J Alison; Hassan, A Bassim

    2014-01-01

    Driven by genomic somatic variation, tumour tissues are typically heterogeneous, yet unbiased quantitative methods are rarely used to analyse heterogeneity at the protein level. Motivated by this problem, we developed automated image segmentation of images of multiple biomarkers in Ewing sarcoma to generate distributions of biomarkers between and within tumour cells. We further integrate high dimensional data with patient clinical outcomes utilising random survival forest (RSF) machine learning. Using material from cohorts of genetically diagnosed Ewing sarcoma with EWSR1 chromosomal translocations, confocal images of tissue microarrays were segmented with level sets and watershed algorithms. Each cell nucleus and cytoplasm were identified in relation to DAPI and CD99, respectively, and protein biomarkers (e.g. Ki67, pS6, Foxo3a, EGR1, MAPK) localised relative to nuclear and cytoplasmic regions of each cell in order to generate image feature distributions. The image distribution features were analysed with RSF in relation to known overall patient survival from three separate cohorts (185 informative cases). Variation in pre-analytical processing resulted in elimination of a high number of non-informative images that had poor DAPI localisation or biomarker preservation (67 cases, 36%). The distribution of image features for biomarkers in the remaining high quality material (118 cases, 104 features per case) were analysed by RSF with feature selection, and performance assessed using internal cross-validation, rather than a separate validation cohort. A prognostic classifier for Ewing sarcoma with low cross-validation error rates (0.36) was comprised of multiple features, including the Ki67 proliferative marker and a sub-population of cells with low cytoplasmic/nuclear ratio of CD99. Through elimination of bias, the evaluation of high-dimensionality biomarker distribution within cell populations of a tumour using random forest analysis in quality controlled tumour

  13. Homogeneous versus heterogeneous probes for microbial ecological microarrays.

    Science.gov (United States)

    Bae, Jin-Woo; Park, Yong-Ha

    2006-07-01

    Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.

  14. Translocations of Chromosome End-Segments and Facultative Heterochromatin Promote Meiotic Ring Formation in Evening Primroses[W][OPEN

    Science.gov (United States)

    Golczyk, Hieronim; Massouh, Amid; Greiner, Stephan

    2014-01-01

    Due to reciprocal chromosomal translocations, many species of Oenothera (evening primrose) form permanent multichromosomal meiotic rings. However, regular bivalent pairing is also observed. Chiasmata are restricted to chromosomal ends, which makes homologous recombination virtually undetectable. Genetic diversity is achieved by changing linkage relations of chromosomes in rings and bivalents via hybridization and reciprocal translocations. Although the structural prerequisite for this system is enigmatic, whole-arm translocations are widely assumed to be the mechanistic driving force. We demonstrate that this prerequisite is genome compartmentation into two epigenetically defined chromatin fractions. The first one facultatively condenses in cycling cells into chromocenters negative both for histone H3 dimethylated at lysine 4 and for C-banding, and forms huge condensed middle chromosome regions on prophase chromosomes. Remarkably, it decondenses in differentiating cells. The second fraction is euchromatin confined to distal chromosome segments, positive for histone H3 lysine 4 dimethylation and for histone H3 lysine 27 trimethylation. The end-segments are deprived of canonical telomeres but capped with constitutive heterochromatin. This genomic organization promotes translocation breakpoints between the two chromatin fractions, thus facilitating exchanges of end-segments. We challenge the whole-arm translocation hypothesis by demonstrating why reciprocal translocations of chromosomal end-segments should strongly promote meiotic rings and evolution toward permanent translocation heterozygosity. Reshuffled end-segments, each possessing a major crossover hot spot, can furthermore explain meiotic compatibility between genomes with different translocation histories. PMID:24681616

  15. The diploid genome sequence of an individual human.

    Directory of Open Access Journals (Sweden)

    Samuel Levy

    2007-09-01

    Full Text Available Presented here is a genome sequence of an individual human. It was produced from approximately 32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel included 3,213,401 single nucleotide polymorphisms (SNPs, 53,823 block substitutions (2-206 bp, 292,102 heterozygous insertion/deletion events (indels(1-571 bp, 559,473 homozygous indels (1-82,711 bp, 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor, however they involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information.

  16. Genomics of Escherichia and Shigella

    Science.gov (United States)

    Perna, Nicole T.

    The laboratory workhorse Escherichia coli K-12 is among the most intensively studied living organisms on earth, and this single strain serves as the model system behind much of our understanding of prokaryotic molecular biology. Dense genome sequencing and recent insightful comparative analyses are making the species E. coli, as a whole, an emerging system for studying prokaryotic population genetics and the relationship between system-scale, or genome-scale, molecular evolution and complex traits like host range and pathogenic potential. Genomic perspective has revealed a coherent but dynamic species united by intraspecific gene flow via homologous lateral or horizontal transfer and differentiated by content flux mediated by acquisition of DNA segments from interspecies transfers.

  17. Contextually guided very-high-resolution imagery classification with semantic segments

    Science.gov (United States)

    Zhao, Wenzhi; Du, Shihong; Wang, Qiao; Emery, William J.

    2017-10-01

    Contextual information, revealing relationships and dependencies between image objects, is one of the most important information for the successful interpretation of very-high-resolution (VHR) remote sensing imagery. Over the last decade, geographic object-based image analysis (GEOBIA) technique has been widely used to first divide images into homogeneous parts, and then to assign semantic labels according to the properties of image segments. However, due to the complexity and heterogeneity of VHR images, segments without semantic labels (i.e., semantic-free segments) generated with low-level features often fail to represent geographic entities (such as building roofs usually be partitioned into chimney/antenna/shadow parts). As a result, it is hard to capture contextual information across geographic entities when using semantic-free segments. In contrast to low-level features, "deep" features can be used to build robust segments with accurate labels (i.e., semantic segments) in order to represent geographic entities at higher levels. Based on these semantic segments, semantic graphs can be constructed to capture contextual information in VHR images. In this paper, semantic segments were first explored with convolutional neural networks (CNN) and a conditional random field (CRF) model was then applied to model the contextual information between semantic segments. Experimental results on two challenging VHR datasets (i.e., the Vaihingen and Beijing scenes) indicate that the proposed method is an improvement over existing image classification techniques in classification performance (overall accuracy ranges from 82% to 96%).

  18. A methodology for texture feature-based quality assessment in nucleus segmentation of histopathology image

    Directory of Open Access Journals (Sweden)

    Si Wen

    2017-01-01

    Full Text Available Context: Image segmentation pipelines often are sensitive to algorithm input parameters. Algorithm parameters optimized for a set of images do not necessarily produce good-quality-segmentation results for other images. Even within an image, some regions may not be well segmented due to a number of factors, including multiple pieces of tissue with distinct characteristics, differences in staining of the tissue, normal versus tumor regions, and tumor heterogeneity. Evaluation of quality of segmentation results is an important step in image analysis. It is very labor intensive to do quality assessment manually with large image datasets because a whole-slide tissue image may have hundreds of thousands of nuclei. Semi-automatic mechanisms are needed to assist researchers and application developers to detect image regions with bad segmentations efficiently. Aims: Our goal is to develop and evaluate a machine-learning-based semi-automated workflow to assess quality of nucleus segmentation results in a large set of whole-slide tissue images. Methods: We propose a quality control methodology, in which machine-learning algorithms are trained with image intensity and texture features to produce a classification model. This model is applied to image patches in a whole-slide tissue image to predict the quality of nucleus segmentation in each patch. The training step of our methodology involves the selection and labeling of regions by a pathologist in a set of images to create the training dataset. The image regions are partitioned into patches. A set of intensity and texture features is computed for each patch. A classifier is trained with the features and the labels assigned by the pathologist. At the end of this process, a classification model is generated. The classification step applies the classification model to unlabeled test images. Each test image is partitioned into patches. The classification model is applied to each patch to predict the patch

  19. Phenotypic and genotypic heterogeneity of Lynch syndrome: a complex diagnostic challenge.

    Science.gov (United States)

    Lynch, Henry T; Lanspa, Stephen; Shaw, Trudy; Casey, Murray Joseph; Rendell, Marc; Stacey, Mark; Townley, Theresa; Snyder, Carrie; Hitchins, Megan; Bailey-Wilson, Joan

    2018-07-01

    Lynch syndrome is the hereditary disorder that most frequently predisposes to colorectal cancer as well as predisposing to a number of extracolonic cancers, most prominently endometrial cancer. It is caused by germline mutations in the mismatch repair genes. Both its phenotype and genotype show marked heterogeneity. This review gives a historical overview of the syndrome, its heterogeneity, its genomic landscape, and its implications for complex diagnosis, genetic counseling and putative implications for immunotherapy.

  20. Draft genome sequence of the coccolithovirus Emiliania huxleyi virus 202.

    Science.gov (United States)

    Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

    2012-02-01

    Emiliania huxleyi virus 202 (EhV-202) is a member of the Coccolithoviridae, a group of viruses that infect the marine coccolithophorid Emiliania huxleyi. EhV-202 has a 160- to 180-nm-diameter icosahedral structure and a genome of approximately 407 kbp, consisting of 485 coding sequences (CDSs). Here we describe the genomic features of EhV-202, together with a draft genome sequence and its annotation, highlighting the homology and heterogeneity of this genome in comparison with the EhV-86 reference genome.

  1. SNUGB: a versatile genome browser supporting comparative and functional fungal genomics

    Directory of Open Access Journals (Sweden)

    Kim Seungill

    2008-12-01

    Full Text Available Abstract Background Since the full genome sequences of Saccharomyces cerevisiae were released in 1996, genome sequences of over 90 fungal species have become publicly available. The heterogeneous formats of genome sequences archived in different sequencing centers hampered the integration of the data for efficient and comprehensive comparative analyses. The Comparative Fungal Genomics Platform (CFGP was developed to archive these data via a single standardized format that can support multifaceted and integrated analyses of the data. To facilitate efficient data visualization and utilization within and across species based on the architecture of CFGP and associated databases, a new genome browser was needed. Results The Seoul National University Genome Browser (SNUGB integrates various types of genomic information derived from 98 fungal/oomycete (137 datasets and 34 plant and animal (38 datasets species, graphically presents germane features and properties of each genome, and supports comparison between genomes. The SNUGB provides three different forms of the data presentation interface, including diagram, table, and text, and six different display options to support visualization and utilization of the stored information. Information for individual species can be quickly accessed via a new tool named the taxonomy browser. In addition, SNUGB offers four useful data annotation/analysis functions, including 'BLAST annotation.' The modular design of SNUGB makes its adoption to support other comparative genomic platforms easy and facilitates continuous expansion. Conclusion The SNUGB serves as a powerful platform supporting comparative and functional genomics within the fungal kingdom and also across other kingdoms. All data and functions are available at the web site http://genomebrowser.snu.ac.kr/.

  2. Chromosomal Speciation in the Genomics Era: Disentangling Phylogenetic Evolution of Rock-wallabies.

    Science.gov (United States)

    Potter, Sally; Bragg, Jason G; Blom, Mozes P K; Deakin, Janine E; Kirkpatrick, Mark; Eldridge, Mark D B; Moritz, Craig

    2017-01-01

    The association of chromosome rearrangements (CRs) with speciation is well established, and there is a long history of theory and evidence relating to "chromosomal speciation." Genomic sequencing has the potential to provide new insights into how reorganization of genome structure promotes divergence, and in model systems has demonstrated reduced gene flow in rearranged segments. However, there are limits to what we can understand from a small number of model systems, which each only tell us about one episode of chromosomal speciation. Progressing from patterns of association between chromosome (and genic) change, to understanding processes of speciation requires both comparative studies across diverse systems and integration of genome-scale sequence comparisons with other lines of evidence. Here, we showcase a promising example of chromosomal speciation in a non-model organism, the endemic Australian marsupial genus Petrogale . We present initial phylogenetic results from exon-capture that resolve a history of divergence associated with extensive and repeated CRs. Yet it remains challenging to disentangle gene tree heterogeneity caused by recent divergence and gene flow in this and other such recent radiations. We outline a way forward for better integration of comparative genomic sequence data with evidence from molecular cytogenetics, and analyses of shifts in the recombination landscape and potential disruption of meiotic segregation and epigenetic programming. In all likelihood, CRs impact multiple cellular processes and these effects need to be considered together, along with effects of genic divergence. Understanding the effects of CRs together with genic divergence will require development of more integrative theory and inference methods. Together, new data and analysis tools will combine to shed light on long standing questions of how chromosome and genic divergence promote speciation.

  3. Adaptive stress response in segmental progeria resembles long-lived dwarfism and calorie restriction in mice

    OpenAIRE

    Ven, Marieke; Andressoo, Jaan-Olle; Holcomb, Valerie; Lindern, Marieke; Jong, Willeke; Zeeuw, Chris; Suh, Yousin; Hasty, Paul; Hoeijmakers, Jan; Horst, Gijsbertus; Mitchell, James

    2006-01-01

    textabstractHow congenital defects causing genome instability can result in the pleiotropic symptoms reminiscent of aging but in a segmental and accelerated fashion remains largely unknown. Most segmental progerias are associated with accelerated fibroblast senescence, suggesting that cellular senescence is a likely contributing mechanism. Contrary to expectations, neither accelerated senescence nor acute oxidative stress hypersensitivity was detected in primary fibroblast or erythroblast cul...

  4. Applying mealtime functionality to tailor protein-enriched meals to older consumer segments

    NARCIS (Netherlands)

    Uijl, den Louise C.; Jager, Gerry; Zandstra, Elizabeth H.; Graaf, de Kees; Kremer, Stefanie

    2017-01-01

    The older adults group is highly heterogeneous, and its members do not always meet their recommended protein intake. We explored mealtime functionality as a basis for tailoring protein-enriched (PE) meal concepts to two senior consumer segments: 1) cosy socialisers, who eat mainly for cosiness

  5. Macroscopic heterogeneity of liver fat: an MR-based study in type-2 diabetic patients

    International Nuclear Information System (INIS)

    Capitan, Violaine; Lefevre, Pierre-Henri; Favelier, Sylvain; Loffroy, Romaric; Krause, Denis; Petit, Jean-Michel; Aho, Serge; Hillon, Patrick; Cercueil, Jean-Pierre; Guiu, Boris

    2012-01-01

    To assess the heterogeneity of liver fat deposition with MR of the liver in type-2 diabetic (T2D) patients. We enrolled 121 consecutive T2D patients. The reference standard was 3.0-T 1 H-MR spectroscopy. Hepatic steatosis was defined as liver fat content (LFC) ≥5.56 %. A triple-echo gradient-echo sequence corrected for T1 recovery and T2* decay was used to calculate LFC in left and right livers and hepatic segments. Analyses were performed using a linear mixed model. Fifty-nine (48.8 %) patients had liver steatosis, whereas 62 (51.2 %) did not. Steatosis was greater in the right than in the left liver (P < 0.0001) [mean difference: 1.32 % (range: 0.01-8.75 %)]. In seven patients (5.8 %), LFC was <5.56 % in one side of the liver, whereas it was ≥5.56 % in the other. Steatosis of the left and right liver was heterogeneous at the segmental level in both non-steatotic (P < 0.001 and P < 0.0001 respectively) and steatotic (P < 0.0001 and P = 0.0002 respectively) patients [mean maximum difference: 3.98 % (range: 0.74-19.32 %)]. In 23 patients (19 %), LFC was <5.56 % in one segment, whereas it was ≥5.56 % in at least one other. Overall, the mean segmental/lobar variability of steatosis is low. However, segmental variability can sometimes lead to a misdiagnosis. (orig.)

  6. Macroscopic heterogeneity of liver fat: an MR-based study in type-2 diabetic patients

    Energy Technology Data Exchange (ETDEWEB)

    Capitan, Violaine; Lefevre, Pierre-Henri; Favelier, Sylvain; Loffroy, Romaric; Krause, Denis [CHU (University Hospital), Department of Radiology, 14 rue Paul Gaffarel, BP 77908, Dijon (France); CHU (University Hospital), BP 77908, Dijon (France); Petit, Jean-Michel [CHU (University Hospital), Department of Endocrinology, Diabetology, and Metabolic Diseases, BP 77908, Dijon (France); CHU (University Hospital), BP 77908, Dijon (France); Aho, Serge [CHU (University Hospital), Department of Biostatistics and Medical Informatics, Dijon (France); CHU (University Hospital), BP 77908, Dijon (France); Hillon, Patrick [University of Burgundy, INSERM U866, BP 87900, Dijon (France); CHU (University Hospital), Department of Hepatology, BP 77908, Dijon (France); CHU (University Hospital), BP 77908, Dijon (France); Cercueil, Jean-Pierre; Guiu, Boris [CHU (University Hospital), Department of Radiology, 14 rue Paul Gaffarel, BP 77908, Dijon (France); University of Burgundy, INSERM U866, BP 87900, Dijon (France); CHU (University Hospital), BP 77908, Dijon (France)

    2012-10-15

    To assess the heterogeneity of liver fat deposition with MR of the liver in type-2 diabetic (T2D) patients. We enrolled 121 consecutive T2D patients. The reference standard was 3.0-T {sup 1}H-MR spectroscopy. Hepatic steatosis was defined as liver fat content (LFC) {>=}5.56 %. A triple-echo gradient-echo sequence corrected for T1 recovery and T2* decay was used to calculate LFC in left and right livers and hepatic segments. Analyses were performed using a linear mixed model. Fifty-nine (48.8 %) patients had liver steatosis, whereas 62 (51.2 %) did not. Steatosis was greater in the right than in the left liver (P < 0.0001) [mean difference: 1.32 % (range: 0.01-8.75 %)]. In seven patients (5.8 %), LFC was <5.56 % in one side of the liver, whereas it was {>=}5.56 % in the other. Steatosis of the left and right liver was heterogeneous at the segmental level in both non-steatotic (P < 0.001 and P < 0.0001 respectively) and steatotic (P < 0.0001 and P = 0.0002 respectively) patients [mean maximum difference: 3.98 % (range: 0.74-19.32 %)]. In 23 patients (19 %), LFC was <5.56 % in one segment, whereas it was {>=}5.56 % in at least one other. Overall, the mean segmental/lobar variability of steatosis is low. However, segmental variability can sometimes lead to a misdiagnosis. (orig.)

  7. Genomic heterogeneity among human and nonhuman strains of hepatitis A virus

    International Nuclear Information System (INIS)

    Lemon, S.M.; Chao, S.F.; Jansen, R.W.; Binn, L.N.; LeDuc, J.W.

    1987-01-01

    Cloned cDNA probes derived from the P1 and P2 regions of the genome of HM175 virus, a reference strain of human hepatitis A virus (HAV), failed to hybridize under standard stringency criteria with RNA from PA21 and PA33 viruses, two epizootiologically related HAV strains recovered from naturally infected New World owl monkeys. Hybridization of these probes to PA21 RNA was only evident under reduced stringency conditions. However, cDNA representing the 5' nontranslated region of the MH175 genome hybridized equally to HM175 and PA21 RNA under standard stringency conditions, while a probe derived from the 3', 1400 bases of the genome yielded a reduced hybridization signal with PA21 RNA. In contrast, no differences could be discerned between HM175 virus and three other HAV strains of human origin (GR8, LV374, and MS1) in any region of the genome, unless increased stringency conditions were used. These results suggest that PA21 and PA33 are unique among HAV isolates and may represent a virus native to the owl monkey. Despite extremely poor homology within the P1 region, which encodes capsid polypeptides, monoclonal antibody analysis confirmed that the immunodominant neutralization epitopes of HAV were highly conserved between HM175 and PA21 viruses. These data provide molecular evidence for the existence of HAV strains unique to nonhuman species and indicate that strict conservation of antigenic function may accompany substantial genetic divergence in HAV

  8. Phenotypic Heterogeneity of Genomically-Diverse Isolates of Streptococcus mutans

    Science.gov (United States)

    Palmer, Sara R.; Miller, James H.; Abranches, Jacqueline; Zeng, Lin; Lefebure, Tristan; Richards, Vincent P.; Lemos, José A.; Stanhope, Michael J.; Burne, Robert A.

    2013-01-01

    High coverage, whole genome shotgun (WGS) sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates, were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat) and exposure to competence stimulating peptide (CSP). Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease. PMID:23613838

  9. Phenotypic heterogeneity of genomically-diverse isolates of Streptococcus mutans.

    Directory of Open Access Journals (Sweden)

    Sara R Palmer

    Full Text Available High coverage, whole genome shotgun (WGS sequencing of 57 geographically- and genetically-diverse isolates of Streptococcus mutans from individuals of known dental caries status was recently completed. Of the 57 sequenced strains, fifteen isolates, were selected based primarily on differences in gene content and phenotypic characteristics known to affect virulence and compared with the reference strain UA159. A high degree of variability in these properties was observed between strains, with a broad spectrum of sensitivities to low pH, oxidative stress (air and paraquat and exposure to competence stimulating peptide (CSP. Significant differences in autolytic behavior and in biofilm development in glucose or sucrose were also observed. Natural genetic competence varied among isolates, and this was correlated to the presence or absence of competence genes, comCDE and comX, and to bacteriocins. In general strains that lacked the ability to become competent possessed fewer genes for bacteriocins and immunity proteins or contained polymorphic variants of these genes. WGS sequence analysis of the pan-genome revealed, for the first time, components of a Type VII secretion system in several S. mutans strains, as well as two putative ORFs that encode possible collagen binding proteins located upstream of the cnm gene, which is associated with host cell invasiveness. The virulence of these particular strains was assessed in a wax-worm model. This is the first study to combine a comprehensive analysis of key virulence-related phenotypes with extensive genomic analysis of a pathogen that evolved closely with humans. Our analysis highlights the phenotypic diversity of S. mutans isolates and indicates that the species has evolved a variety of adaptive strategies to persist in the human oral cavity and, when conditions are favorable, to initiate disease.

  10. Genome position specific priors for genomic prediction

    DEFF Research Database (Denmark)

    Brøndum, Rasmus Froberg; Su, Guosheng; Lund, Mogens Sandø

    2012-01-01

    casual mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects...... for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed Results...

  11. Quantifying heterogeneity in human tumours using MRI and PET.

    Science.gov (United States)

    Asselin, Marie-Claude; O'Connor, James P B; Boellaard, Ronald; Thacker, Neil A; Jackson, Alan

    2012-03-01

    Most tumours, even those of the same histological type and grade, demonstrate considerable biological heterogeneity. Variations in genomic subtype, growth factor expression and local microenvironmental factors can result in regional variations within individual tumours. For example, localised variations in tumour cell proliferation, cell death, metabolic activity and vascular structure will be accompanied by variations in oxygenation status, pH and drug delivery that may directly affect therapeutic response. Documenting and quantifying regional heterogeneity within the tumour requires histological or imaging techniques. There is increasing evidence that quantitative imaging biomarkers can be used in vivo to provide important, reproducible and repeatable estimates of tumoural heterogeneity. In this article we review the imaging methods available to provide appropriate biomarkers of tumour structure and function. We also discuss the significant technical issues involved in the quantitative estimation of heterogeneity and the range of descriptive metrics that can be derived. Finally, we have reviewed the existing clinical evidence that heterogeneity metrics provide additional useful information in drug discovery and development and in clinical practice. Copyright © 2012 Elsevier Ltd. All rights reserved.

  12. An embedded system for image segmentation and multimodal registration in noninvasive skin cancer screening.

    Science.gov (United States)

    Diaz, Silvana; Soto, Javier E; Inostroza, Fabian; Godoy, Sebastian E; Figueroa, Miguel

    2017-07-01

    We present a heterogeneous architecture for image registration and multimodal segmentation on an embedded system for noninvasive skin cancer screening. The architecture combines Otsu thresholding and the random walker algorithm to perform image segmentation, and features a hardware implementation of the Harris corner detection algorithm to perform region-of-interest detection and image registration. Running on a Xilinx XC7Z020 reconfigurable system-on-a-chip, our prototype computes the initial segmentation of a 400×400-pixel region of interest in the visible spectrum in 12.1 seconds, and registers infrared images against this region at 540 frames per second, while consuming 1.9W.

  13. CpGislandEVO: A Database and Genome Browser for Comparative Evolutionary Genomics of CpG Islands

    Directory of Open Access Journals (Sweden)

    Guillermo Barturen

    2013-01-01

    Full Text Available Hypomethylated, CpG-rich DNA segments (CpG islands, CGIs are epigenome markers involved in key biological processes. Aberrant methylation is implicated in the appearance of several disorders as cancer, immunodeficiency, or centromere instability. Furthermore, methylation differences at promoter regions between human and chimpanzee strongly associate with genes involved in neurological/psychological disorders and cancers. Therefore, the evolutionary comparative analyses of CGIs can provide insights on the functional role of these epigenome markers in both health and disease. Given the lack of specific tools, we developed CpGislandEVO. Briefly, we first compile a database of statistically significant CGIs for the best assembled mammalian genome sequences available to date. Second, by means of a coupled browser front-end, we focus on the CGIs overlapping orthologous genes extracted from OrthoDB, thus ensuring the comparison between CGIs located on truly homologous genome segments. This allows comparing the main compositional features between homologous CGIs. Finally, to facilitate nucleotide comparisons, we lifted genome coordinates between assemblies from different species, which enables the analysis of sequence divergence by direct count of nucleotide substitutions and indels occurring between homologous CGIs. The resulting CpGislandEVO database, linking together CGIs and single-cytosine DNA methylation data from several mammalian species, is freely available at our website.

  14. Reconstruction of putative DNA virus from endogenous rice tungro bacilliform virus-like sequences in the rice genome: implications for integration and evolution

    Directory of Open Access Journals (Sweden)

    Kishima Yuji

    2004-10-01

    Full Text Available Abstract Background Plant genomes contain various kinds of repetitive sequences such as transposable elements, microsatellites, tandem repeats and virus-like sequences. Most of them, with the exception of virus-like sequences, do not allow us to trace their origins nor to follow the process of their integration into the host genome. Recent discoveries of virus-like sequences in plant genomes led us to set the objective of elucidating the origin of the repetitive sequences. Endogenous rice tungro bacilliform virus (RTBV-like sequences (ERTBVs have been found throughout the rice genome. Here, we reconstructed putative virus structures from RTBV-like sequences in the rice genome and characterized to understand evolutionary implication, integration manner and involvements of endogenous virus segments in the corresponding disease response. Results We have collected ERTBVs from the rice genomes. They contain rearranged structures and no intact ORFs. The identified ERTBV segments were shown to be phylogenetically divided into three clusters. For each phylogenetic cluster, we were able to make a consensus alignment for a circular virus-like structure carrying two complete ORFs. Comparisons of DNA and amino acid sequences suggested the closely relationship between ERTBV and RTBV. The Oryza AA-genome species vary in the ERTBV copy number. The species carrying low-copy-number of ERTBV segments have been reported to be extremely susceptible to RTBV. The DNA methylation state of the ERTBV sequences was correlated with their copy number in the genome. Conclusions These ERTBV segments are unlikely to have functional potential as a virus. However, these sequences facilitate to establish putative virus that provided information underlying virus integration and evolutionary relationship with existing virus. Comparison of ERTBV among the Oryza AA-genome species allowed us to speculate a possible role of endogenous virus segments against its related disease.

  15. Elucidating the triplicated ancestral genome structure of radish based on chromosome-level comparison with the Brassica genomes.

    Science.gov (United States)

    Jeong, Young-Min; Kim, Namshin; Ahn, Byung Ohg; Oh, Mijin; Chung, Won-Hyong; Chung, Hee; Jeong, Seongmun; Lim, Ki-Byung; Hwang, Yoon-Jung; Kim, Goon-Bo; Baek, Seunghoon; Choi, Sang-Bong; Hyung, Dae-Jin; Lee, Seung-Won; Sohn, Seong-Han; Kwon, Soo-Jin; Jin, Mina; Seol, Young-Joo; Chae, Won Byoung; Choi, Keun Jin; Park, Beom-Seok; Yu, Hee-Ju; Mun, Jeong-Hwan

    2016-07-01

    This study presents a chromosome-scale draft genome sequence of radish that is assembled into nine chromosomal pseudomolecules. A comprehensive comparative genome analysis with the Brassica genomes provides genomic evidences on the evolution of the mesohexaploid radish genome. Radish (Raphanus sativus L.) is an agronomically important root vegetable crop and its origin and phylogenetic position in the tribe Brassiceae is controversial. Here we present a comprehensive analysis of the radish genome based on the chromosome sequences of R. sativus cv. WK10039. The radish genome was sequenced and assembled into 426.2 Mb spanning >98 % of the gene space, of which 344.0 Mb were integrated into nine chromosome pseudomolecules. Approximately 36 % of the genome was repetitive sequences and 46,514 protein-coding genes were predicted and annotated. Comparative mapping of the tPCK-like ancestral genome revealed that the radish genome has intermediate characteristics between the Brassica A/C and B genomes in the triplicated segments, suggesting an internal origin from the genus Brassica. The evolutionary characteristics shared between radish and other Brassica species provided genomic evidences that the current form of nine chromosomes in radish was rearranged from the chromosomes of hexaploid progenitor. Overall, this study provides a chromosome-scale draft genome sequence of radish as well as novel insight into evolution of the mesohexaploid genomes in the tribe Brassiceae.

  16. Using web services for linking genomic data to medical information systems.

    Science.gov (United States)

    Maojo, V; Crespo, J; de la Calle, G; Barreiro, J; Garcia-Remesal, M

    2007-01-01

    To develop a new perspective for biomedical information systems, regarding the introduction of ideas, methods and tools related to the new scenario of genomic medicine. Technological aspects related to the analysis and integration of heterogeneous clinical and genomic data include mapping clinical and genetic concepts, potential future standards or the development of integrated biomedical ontologies. In this clinicomics scenario, we describe the use of Web services technologies to improve access to and integrate different information sources. We give a concrete example of the use of Web services technologies: the OntoFusion project. Web services provide new biomedical informatics (BMI) approaches related to genomic medicine. Customized workflows will aid research tasks by linking heterogeneous Web services. Two significant examples of these European Commission-funded efforts are the INFOBIOMED Network of Excellence and the Advancing Clinico-Genomic Trials on Cancer (ACGT) integrated project. Supplying medical researchers and practitioners with omics data and biologists with clinical datasets can help to develop genomic medicine. BMI is contributing by providing the informatics methods and technological infrastructure needed for these collaborative efforts.

  17. Central focused convolutional neural networks: Developing a data-driven model for lung nodule segmentation.

    Science.gov (United States)

    Wang, Shuo; Zhou, Mu; Liu, Zaiyi; Liu, Zhenyu; Gu, Dongsheng; Zang, Yali; Dong, Di; Gevaert, Olivier; Tian, Jie

    2017-08-01

    Accurate lung nodule segmentation from computed tomography (CT) images is of great importance for image-driven lung cancer analysis. However, the heterogeneity of lung nodules and the presence of similar visual characteristics between nodules and their surroundings make it difficult for robust nodule segmentation. In this study, we propose a data-driven model, termed the Central Focused Convolutional Neural Networks (CF-CNN), to segment lung nodules from heterogeneous CT images. Our approach combines two key insights: 1) the proposed model captures a diverse set of nodule-sensitive features from both 3-D and 2-D CT images simultaneously; 2) when classifying an image voxel, the effects of its neighbor voxels can vary according to their spatial locations. We describe this phenomenon by proposing a novel central pooling layer retaining much information on voxel patch center, followed by a multi-scale patch learning strategy. Moreover, we design a weighted sampling to facilitate the model training, where training samples are selected according to their degree of segmentation difficulty. The proposed method has been extensively evaluated on the public LIDC dataset including 893 nodules and an independent dataset with 74 nodules from Guangdong General Hospital (GDGH). We showed that CF-CNN achieved superior segmentation performance with average dice scores of 82.15% and 80.02% for the two datasets respectively. Moreover, we compared our results with the inter-radiologists consistency on LIDC dataset, showing a difference in average dice score of only 1.98%. Copyright © 2017. Published by Elsevier B.V.

  18. Whole-Genome Sequence of the Purple Photosynthetic Bacterium Rhodovulum sulfidophilum Strain W4

    OpenAIRE

    Masuda, Shinji; Hori, Koichi; Maruyama, Fumito; Ren, Shukun; Sugimoto, Saori; Yamamoto, Nozomi; Mori, Hiroshi; Yamada, Takuji; Sato, Shusei; Tabata, Satoshi; Ohta, Hiroyuki; Kurokawa, Ken

    2013-01-01

    We report the draft genome sequence of the purple photosynthetic bacterium Rhodovulum sulfidophilum. The photosynthesis gene cluster comprises two segments?a unique feature among photosynthesis gene clusters of purple bacteria. The genome information will be useful for further analysis of bacterial photosynthesis.

  19. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    Directory of Open Access Journals (Sweden)

    Daniel Garrido-Sanz

    Full Text Available The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as

  20. Hypereosinophilic syndrome: CT findings in patients with hepatic lobar or segmental involvement

    Energy Technology Data Exchange (ETDEWEB)

    Lim, Jae Hoon; Lee, Won Jae [Sungkyunkwan University School of Medicine, Seoul (Korea, Republic of); Lee, Dong Ho [Kyunghee University Hospital, Seoul (Korea, Republic of); Nam, Kyung Jin [Donga University College of Medicine, Pusan (Korea, Republic of)

    2000-06-01

    The purpose of this study was to describe the CT findings of hepatic hypereosinophilic syndrome in which hepatic lobes or segments were involved. Seven patients with hypereosinophilic syndrome with hepatic lobar or segmental involvement were included in our study. In all seven, diagnosis was based on liver biopsy and the results of corticosteroid treatment. CT findings were retrospectively reviewed by three radiologists, who reached a consensus. Biopsy specimens were examined, with special reference to portal and periportal inflammation. CT demonstrated well-defined, homogeneous or heterogeneous low attenuation with a straight margin limited to a hepatic lobe (n = 2), segments (n = 3), or subsegments (n = 2), particularly during the portal phase. Where there was subsegmental involvement, lesions were multiple, ovoid or wedge-shaped, and showed low attenuation. In two patients with lobar or segmental involvement, segmental portal vein narrowing was observed. Histopathologic examination disclosed eosinophilic infiltration in the periportal area, sinusoids and central veins, as well as portal phlebitis. Hypereosinophilic syndrome may involve the presence of hepatic lobar, segmental, or subsegmental low-attenuated lesions, as seen on CT images. Their presence may be related to damage of the liver parenchyma and to portal phlebitis.

  1. Hypereosinophilic syndrome: CT findings in patients with hepatic lobar or segmental involvement

    International Nuclear Information System (INIS)

    Lim, Jae Hoon; Lee, Won Jae; Lee, Dong Ho; Nam, Kyung Jin

    2000-01-01

    The purpose of this study was to describe the CT findings of hepatic hypereosinophilic syndrome in which hepatic lobes or segments were involved. Seven patients with hypereosinophilic syndrome with hepatic lobar or segmental involvement were included in our study. In all seven, diagnosis was based on liver biopsy and the results of corticosteroid treatment. CT findings were retrospectively reviewed by three radiologists, who reached a consensus. Biopsy specimens were examined, with special reference to portal and periportal inflammation. CT demonstrated well-defined, homogeneous or heterogeneous low attenuation with a straight margin limited to a hepatic lobe (n = 2), segments (n = 3), or subsegments (n = 2), particularly during the portal phase. Where there was subsegmental involvement, lesions were multiple, ovoid or wedge-shaped, and showed low attenuation. In two patients with lobar or segmental involvement, segmental portal vein narrowing was observed. Histopathologic examination disclosed eosinophilic infiltration in the periportal area, sinusoids and central veins, as well as portal phlebitis. Hypereosinophilic syndrome may involve the presence of hepatic lobar, segmental, or subsegmental low-attenuated lesions, as seen on CT images. Their presence may be related to damage of the liver parenchyma and to portal phlebitis

  2. Comparative genomic assessment of Multi-Locus Sequence Typing: rapid accumulation of genomic heterogeneity among clonal isolates of Campylobacter jejuni

    Directory of Open Access Journals (Sweden)

    Nash John HE

    2008-08-01

    Full Text Available Abstract Background Multi-Locus Sequence Typing (MLST has emerged as a leading molecular typing method owing to its high ability to discriminate among bacterial isolates, the relative ease with which data acquisition and analysis can be standardized, and the high portability of the resulting sequence data. While MLST has been successfully applied to the study of the population structure for a number of different bacterial species, it has also provided compelling evidence for high rates of recombination in some species. We have analyzed a set of Campylobacter jejuni strains using MLST and Comparative Genomic Hybridization (CGH on a full-genome microarray in order to determine whether recombination and high levels of genomic mosaicism adversely affect the inference of strain relationships based on the analysis of a restricted number of genetic loci. Results Our results indicate that, in general, there is significant concordance between strain relationships established by MLST and those based on shared gene content as established by CGH. While MLST has significant predictive power with respect to overall genome similarity of isolates, we also found evidence for significant differences in genomic content among strains that would otherwise appear to be highly related based on their MLST profiles. Conclusion The extensive genomic mosaicism between closely related strains has important implications in the context of establishing strain to strain relationships because it suggests that the exact gene content of strains, and by extension their phenotype, is less likely to be "predicted" based on a small number of typing loci. This in turn suggests that a greater emphasis should be placed on analyzing genes of clinical interest as we forge ahead with the next generation of molecular typing methods.

  3. A Simple Predictive Enhancer Syntax for Hindbrain Patterning Is Conserved in Vertebrate Genomes.

    Directory of Open Access Journals (Sweden)

    Joseph Grice

    Full Text Available Determining the function of regulatory elements is fundamental for our understanding of development, disease and evolution. However, the sequence features that mediate these functions are often unclear and the prediction of tissue-specific expression patterns from sequence alone is non-trivial. Previous functional studies have demonstrated a link between PBX-HOX and MEIS/PREP binding interactions and hindbrain enhancer activity, but the defining grammar of these sites, if any exists, has remained elusive.Here, we identify a shared sequence signature (syntax within a heterogeneous set of conserved vertebrate hindbrain enhancers composed of spatially co-occurring PBX-HOX and MEIS/PREP transcription factor binding motifs. We use this syntax to accurately predict hindbrain enhancers in 89% of cases (67/75 predicted elements from a set of conserved non-coding elements (CNEs. Furthermore, mutagenesis of the sites abolishes activity or generates ectopic expression, demonstrating their requirement for segmentally restricted enhancer activity in the hindbrain. We refine and use our syntax to predict over 3,000 hindbrain enhancers across the human genome. These sequences tend to be located near developmental transcription factors and are enriched in known hindbrain activating elements, demonstrating the predictive power of this simple model.Our findings support the theory that hundreds of CNEs, and perhaps thousands of regions across the human genome, function to coordinate gene expression in the developing hindbrain. We speculate that deeply conserved sequences of this kind contributed to the co-option of new genes into the hindbrain gene regulatory network during early vertebrate evolution by linking patterns of hox expression to downstream genes involved in segmentation and patterning, and evolutionarily newer instances may have continued to contribute to lineage-specific elaboration of the hindbrain.

  4. An Efficient Parallel Multi-Scale Segmentation Method for Remote Sensing Imagery

    Directory of Open Access Journals (Sweden)

    Haiyan Gu

    2018-04-01

    Full Text Available Remote sensing (RS image segmentation is an essential step in geographic object-based image analysis (GEOBIA to ultimately derive “meaningful objects”. While many segmentation methods exist, most of them are not efficient for large data sets. Thus, the goal of this research is to develop an efficient parallel multi-scale segmentation method for RS imagery by combining graph theory and the fractal net evolution approach (FNEA. Specifically, a minimum spanning tree (MST algorithm in graph theory is proposed to be combined with a minimum heterogeneity rule (MHR algorithm that is used in FNEA. The MST algorithm is used for the initial segmentation while the MHR algorithm is used for object merging. An efficient implementation of the segmentation strategy is presented using data partition and the “reverse searching-forward processing” chain based on message passing interface (MPI parallel technology. Segmentation results of the proposed method using images from multiple sensors (airborne, SPECIM AISA EAGLE II, WorldView-2, RADARSAT-2 and different selected landscapes (residential/industrial, residential/agriculture covering four test sites indicated its efficiency in accuracy and speed. We conclude that the proposed method is applicable and efficient for the segmentation of a variety of RS imagery (airborne optical, satellite optical, SAR, high-spectral, while the accuracy is comparable with that of the FNEA method.

  5. Pathways to Genome-targeted Therapies in Serous Ovarian Cancer.

    Science.gov (United States)

    Axelrod, Joshua; Delaney, Joe

    2017-07-01

    Genome sequencing technologies and corresponding oncology publications have generated enormous publicly available datasets for many cancer types. While this has enabled new treatments, and in some limited cases lifetime management of the disease, the treatment options for serous ovarian cancer remain dismal. This review summarizes recent advances in our understanding of ovarian cancer, with a focus on heterogeneity, functional genomics, and actionable data.

  6. Transcriptome sequencing in prostate cancer identifies inter-tumor heterogeneity

    Directory of Open Access Journals (Sweden)

    Janet Mendonca

    2015-06-01

    Full Text Available Given the dearth of gene mutations in prostate cancer, [1] ,[2] it is likely that genomic rearrangements play a significant role in the evolution of prostate cancer. However, in the search for recurrent genomic alterations, "private alterations" have received less attention. Such alterations may provide insights into the evolution, behavior, and clinical outcome of an individual tumor. In a recent report in "Genome Biology" Wyatt et al. [3] defines unique alterations in a cohort of high-risk prostate cancer patient with a lethal phenotype. Utilizing a transcriptome sequencing approach they observe high inter-tumor heterogeneity; however, the genes altered distill into three distinct cancer-relevant pathways. Their analysis reveals the presence of several non-ETS fusions, which may contribute to the phenotype of individual tumors, and have significance for disease progression.

  7. Endogenous viral elements in animal genomes.

    Directory of Open Access Journals (Sweden)

    Aris Katzourakis

    2010-11-01

    Full Text Available Integration into the nuclear genome of germ line cells can lead to vertical inheritance of retroviral genes as host alleles. For other viruses, germ line integration has only rarely been documented. Nonetheless, we identified endogenous viral elements (EVEs derived from ten non-retroviral families by systematic in silico screening of animal genomes, including the first endogenous representatives of double-stranded RNA, reverse-transcribing DNA, and segmented RNA viruses, and the first endogenous DNA viruses in mammalian genomes. Phylogenetic and genomic analysis of EVEs across multiple host species revealed novel information about the origin and evolution of diverse virus groups. Furthermore, several of the elements identified here encode intact open reading frames or are expressed as mRNA. For one element in the primate lineage, we provide statistically robust evidence for exaptation. Our findings establish that genetic material derived from all known viral genome types and replication strategies can enter the animal germ line, greatly broadening the scope of paleovirological studies and indicating a more significant evolutionary role for gene flow from virus to animal genomes than has previously been recognized.

  8. Forces shaping the fastest evolving regions in the human genome

    DEFF Research Database (Denmark)

    Pollard, Katherine S; Salama, Sofie R; King, Bryan

    2006-01-01

    Comparative genomics allow us to search the human genome for segments that were extensively changed in the last approximately 5 million years since divergence from our common ancestor with chimpanzee, but are highly conserved in other species and thus are likely to be functional. We found 202...... genomic elements that are highly conserved in vertebrates but show evidence of significantly accelerated substitution rates in human. These are mostly in non-coding DNA, often near genes associated with transcription and DNA binding. Resequencing confirmed that the five most accelerated elements...... contributed to accelerated evolution of the fastest evolving elements in the human genome....

  9. Systemic risk and heterogeneous leverage in banking networks

    Science.gov (United States)

    Kuzubaş, Tolga Umut; Saltoğlu, Burak; Sever, Can

    2016-11-01

    This study probes systemic risk implications of leverage heterogeneity in banking networks. We show that the presence of heterogeneous leverages drastically changes the systemic effects of defaults and the nature of the contagion in interbank markets. Using financial leverage data from the US banking system, through simulations, we analyze the systemic significance of different types of borrowers, the evolution of the network, the consequences of interbank market size and the impact of market segmentation. Our study is related to the recent Basel III regulations on systemic risk and the treatment of the Global Systemically Important Banks (GSIBs). We also assess the extent to which the recent capital surcharges on GSIBs may curb financial fragility. We show the effectiveness of surcharge policy for the most-levered banks vis-a-vis uniform capital injection.

  10. Isolation of Retroelement from Plant Genomic DNA

    OpenAIRE

    sprotocols

    2014-01-01

    Author: Pat Heslop-Harrison ### Abstract: Retroelements and their derivatives are an ubiquitous and abundant component of plant genomes. From the 1990s, PCR based techniques have been developed to isolate the elements from genomic DNA of different plants, and the methods and primers used are presented here. Major classes of retroelements include the Ty1-copia, the Ty3-gypsy and the LINE (non-LTR) groups. Mixed PCR products representing the full heterogeneous pool of retrotransposo...

  11. Analysis of Vehicle-Following Heterogeneity Using Self-Organizing Feature Maps

    Directory of Open Access Journals (Sweden)

    Jie Yang

    2014-01-01

    Full Text Available A self-organizing feature map (SOM was used to represent vehicle-following and to analyze the heterogeneities in vehicle-following behavior. The SOM was constructed in such a way that the prototype vectors represented vehicle-following stimuli (the follower’s velocity, relative velocity, and gap while the output signals represented the response (the follower’s acceleration. Vehicle trajectories collected at a northbound segment of Interstate 80 Freeway at Emeryville, CA, were used to train the SOM. The trajectory information of two selected pairs of passenger cars was then fed into the trained SOM to identify similar stimuli experienced by the followers. The observed responses, when the stimuli were classified by the SOM into the same category, were compared to discover the interdriver heterogeneity. The acceleration profile of another passenger car was analyzed in the same fashion to observe the interdriver heterogeneity. The distribution of responses derived from data sets of car-following-car and car-following-truck, respectively, was compared to ascertain inter-vehicle-type heterogeneity.

  12. Sequence heterogeneity accelerates protein search for targets on DNA

    International Nuclear Information System (INIS)

    Shvets, Alexey A.; Kolomeisky, Anatoly B.

    2015-01-01

    The process of protein search for specific binding sites on DNA is fundamentally important since it marks the beginning of all major biological processes. We present a theoretical investigation that probes the role of DNA sequence symmetry, heterogeneity, and chemical composition in the protein search dynamics. Using a discrete-state stochastic approach with a first-passage events analysis, which takes into account the most relevant physical-chemical processes, a full analytical description of the search dynamics is obtained. It is found that, contrary to existing views, the protein search is generally faster on DNA with more heterogeneous sequences. In addition, the search dynamics might be affected by the chemical composition near the target site. The physical origins of these phenomena are discussed. Our results suggest that biological processes might be effectively regulated by modifying chemical composition, symmetry, and heterogeneity of a genome

  13. Sequence heterogeneity accelerates protein search for targets on DNA

    Energy Technology Data Exchange (ETDEWEB)

    Shvets, Alexey A.; Kolomeisky, Anatoly B., E-mail: tolya@rice.edu [Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005 (United States)

    2015-12-28

    The process of protein search for specific binding sites on DNA is fundamentally important since it marks the beginning of all major biological processes. We present a theoretical investigation that probes the role of DNA sequence symmetry, heterogeneity, and chemical composition in the protein search dynamics. Using a discrete-state stochastic approach with a first-passage events analysis, which takes into account the most relevant physical-chemical processes, a full analytical description of the search dynamics is obtained. It is found that, contrary to existing views, the protein search is generally faster on DNA with more heterogeneous sequences. In addition, the search dynamics might be affected by the chemical composition near the target site. The physical origins of these phenomena are discussed. Our results suggest that biological processes might be effectively regulated by modifying chemical composition, symmetry, and heterogeneity of a genome.

  14. ChloroMitoCU: Codon patterns across organelle genomes for functional genomics and evolutionary applications.

    Science.gov (United States)

    Sablok, Gaurav; Chen, Ting-Wen; Lee, Chi-Ching; Yang, Chi; Gan, Ruei-Chi; Wegrzyn, Jill L; Porta, Nicola L; Nayak, Kinshuk C; Huang, Po-Jung; Varotto, Claudio; Tang, Petrus

    2017-06-01

    Organelle genomes are widely thought to have arisen from reduction events involving cyanobacterial and archaeal genomes, in the case of chloroplasts, or α-proteobacterial genomes, in the case of mitochondria. Heterogeneity in base composition and codon preference has long been the subject of investigation of topics ranging from phylogenetic distortion to the design of overexpression cassettes for transgenic expression. From the overexpression point of view, it is critical to systematically analyze the codon usage patterns of the organelle genomes. In light of the importance of codon usage patterns in the development of hyper-expression organelle transgenics, we present ChloroMitoCU, the first-ever curated, web-based reference catalog of the codon usage patterns in organelle genomes. ChloroMitoCU contains the pre-compiled codon usage patterns of 328 chloroplast genomes (29,960 CDS) and 3,502 mitochondrial genomes (49,066 CDS), enabling genome-wide exploration and comparative analysis of codon usage patterns across species. ChloroMitoCU allows the phylogenetic comparison of codon usage patterns across organelle genomes, the prediction of codon usage patterns based on user-submitted transcripts or assembled organelle genes, and comparative analysis with the pre-compiled patterns across species of interest. ChloroMitoCU can increase our understanding of the biased patterns of codon usage in organelle genomes across multiple clades. ChloroMitoCU can be accessed at: http://chloromitocu.cgu.edu.tw/. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  15. Image-based computational quantification and visualization of genetic alterations and tumour heterogeneity.

    Science.gov (United States)

    Zhong, Qing; Rüschoff, Jan H; Guo, Tiannan; Gabrani, Maria; Schüffler, Peter J; Rechsteiner, Markus; Liu, Yansheng; Fuchs, Thomas J; Rupp, Niels J; Fankhauser, Christian; Buhmann, Joachim M; Perner, Sven; Poyet, Cédric; Blattner, Miriam; Soldini, Davide; Moch, Holger; Rubin, Mark A; Noske, Aurelia; Rüschoff, Josef; Haffner, Michael C; Jochum, Wolfram; Wild, Peter J

    2016-04-07

    Recent large-scale genome analyses of human tissue samples have uncovered a high degree of genetic alterations and tumour heterogeneity in most tumour entities, independent of morphological phenotypes and histopathological characteristics. Assessment of genetic copy-number variation (CNV) and tumour heterogeneity by fluorescence in situ hybridization (ISH) provides additional tissue morphology at single-cell resolution, but it is labour intensive with limited throughput and high inter-observer variability. We present an integrative method combining bright-field dual-colour chromogenic and silver ISH assays with an image-based computational workflow (ISHProfiler), for accurate detection of molecular signals, high-throughput evaluation of CNV, expressive visualization of multi-level heterogeneity (cellular, inter- and intra-tumour heterogeneity), and objective quantification of heterogeneous genetic deletions (PTEN) and amplifications (19q12, HER2) in diverse human tumours (prostate, endometrial, ovarian and gastric), using various tissue sizes and different scanners, with unprecedented throughput and reproducibility.

  16. Segmenting patients and physicians using preferences from discrete choice experiments.

    Science.gov (United States)

    Deal, Ken

    2014-01-01

    People often form groups or segments that have similar interests and needs and seek similar benefits from health providers. Health organizations need to understand whether the same health treatments, prevention programs, services, and products should be applied to everyone in the relevant population or whether different treatments need to be provided to each of several segments that are relatively homogeneous internally but heterogeneous among segments. Our objective was to explain the purposes, benefits, and methods of segmentation for health organizations, and to illustrate the process of segmenting health populations based on preference coefficients from a discrete choice conjoint experiment (DCE) using an example study of prevention of cyberbullying among university students. We followed a two-level procedure for investigating segmentation incorporating several methods for forming segments in Level 1 using DCE preference coefficients and testing their quality, reproducibility, and usability by health decision makers. Covariates (demographic, behavioral, lifestyle, and health state variables) were included in Level 2 to further evaluate quality and to support the scoring of large databases and developing typing tools for assigning those in the relevant population, but not in the sample, to the segments. Several segmentation solution candidates were found during the Level 1 analysis, and the relationship of the preference coefficients to the segments was investigated using predictive methods. Those segmentations were tested for their quality and reproducibility and three were found to be very close in quality. While one seemed better than others in the Level 1 analysis, another was very similar in quality and proved ultimately better in predicting segment membership using covariates in Level 2. The two segments in the final solution were profiled for attributes that would support the development and acceptance of cyberbullying prevention programs among university

  17. A Retrospective Study on Genetic Heterogeneity within Treponema Strains: Subpopulations Are Genetically Distinct in a Limited Number of Positions.

    Directory of Open Access Journals (Sweden)

    Darina Čejková

    Full Text Available Pathogenic uncultivable treponemes comprise human and animal pathogens including agents of syphilis, yaws, bejel, pinta, and venereal spirochetosis in rabbits and hares. A set of 10 treponemal genome sequences including those of 4 Treponema pallidum ssp. pallidum (TPA strains (Nichols, DAL-1, Mexico A, SS14, 4 T. p. ssp. pertenue (TPE strains (CDC-2, Gauthier, Samoa D, Fribourg-Blanc, 1 T. p. ssp. endemicum (TEN strain (Bosnia A and one strain (Cuniculi A of Treponema paraluisleporidarum ecovar Cuniculus (TPLC were examined with respect to the presence of nucleotide intrastrain heterogeneous sites.The number of identified intrastrain heterogeneous sites in individual genomes ranged between 0 and 7. Altogether, 23 intrastrain heterogeneous sites (in 17 genes were found in 5 out of 10 investigated treponemal genomes including TPA strains Nichols (n = 5, DAL-1 (n = 4, and SS14 (n = 7, TPE strain Samoa D (n = 1, and TEN strain Bosnia A (n = 5. Although only one heterogeneous site was identified among 4 tested TPE strains, 16 such sites were identified among 4 TPA strains. Heterogeneous sites were mostly strain-specific and were identified in four tpr genes (tprC, GI, I, K, in genes involved in bacterial motility and chemotaxis (fliI, cheC-fliY, in genes involved in cell structure (murC, translation (prfA, general and DNA metabolism (putative SAM dependent methyltransferase, topA, and in seven hypothetical genes.Heterogeneous sites likely represent both the selection of adaptive changes during infection of the host as well as an ongoing diversifying evolutionary process.

  18. A Retrospective Study on Genetic Heterogeneity within Treponema Strains: Subpopulations Are Genetically Distinct in a Limited Number of Positions.

    Science.gov (United States)

    Čejková, Darina; Strouhal, Michal; Norris, Steven J; Weinstock, George M; Šmajs, David

    2015-01-01

    Pathogenic uncultivable treponemes comprise human and animal pathogens including agents of syphilis, yaws, bejel, pinta, and venereal spirochetosis in rabbits and hares. A set of 10 treponemal genome sequences including those of 4 Treponema pallidum ssp. pallidum (TPA) strains (Nichols, DAL-1, Mexico A, SS14), 4 T. p. ssp. pertenue (TPE) strains (CDC-2, Gauthier, Samoa D, Fribourg-Blanc), 1 T. p. ssp. endemicum (TEN) strain (Bosnia A) and one strain (Cuniculi A) of Treponema paraluisleporidarum ecovar Cuniculus (TPLC) were examined with respect to the presence of nucleotide intrastrain heterogeneous sites. The number of identified intrastrain heterogeneous sites in individual genomes ranged between 0 and 7. Altogether, 23 intrastrain heterogeneous sites (in 17 genes) were found in 5 out of 10 investigated treponemal genomes including TPA strains Nichols (n = 5), DAL-1 (n = 4), and SS14 (n = 7), TPE strain Samoa D (n = 1), and TEN strain Bosnia A (n = 5). Although only one heterogeneous site was identified among 4 tested TPE strains, 16 such sites were identified among 4 TPA strains. Heterogeneous sites were mostly strain-specific and were identified in four tpr genes (tprC, GI, I, K), in genes involved in bacterial motility and chemotaxis (fliI, cheC-fliY), in genes involved in cell structure (murC), translation (prfA), general and DNA metabolism (putative SAM dependent methyltransferase, topA), and in seven hypothetical genes. Heterogeneous sites likely represent both the selection of adaptive changes during infection of the host as well as an ongoing diversifying evolutionary process.

  19. FROM2D to 3d Supervised Segmentation and Classification for Cultural Heritage Applications

    Science.gov (United States)

    Grilli, E.; Dininno, D.; Petrucci, G.; Remondino, F.

    2018-05-01

    The digital management of architectural heritage information is still a complex problem, as a heritage object requires an integrated representation of various types of information in order to develop appropriate restoration or conservation strategies. Currently, there is extensive research focused on automatic procedures of segmentation and classification of 3D point clouds or meshes, which can accelerate the study of a monument and integrate it with heterogeneous information and attributes, useful to characterize and describe the surveyed object. The aim of this study is to propose an optimal, repeatable and reliable procedure to manage various types of 3D surveying data and associate them with heterogeneous information and attributes to characterize and describe the surveyed object. In particular, this paper presents an approach for classifying 3D heritage models, starting from the segmentation of their textures based on supervised machine learning methods. Experimental results run on three different case studies demonstrate that the proposed approach is effective and with many further potentials.

  20. Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals

    DEFF Research Database (Denmark)

    Hellmann, Ines; Mang, Yuan; Gu, Zhiping

    2008-01-01

    We introduce a simple, broadly applicable method for obtaining estimates of nucleotide diversity from genomic shotgun sequencing data. The method takes into account the special nature of these data: random sampling of genomic segments from one or more individuals and a relatively high error rate...... for individual reads. Applying this method to data from the Celera human genome sequencing and SNP discovery project, we obtain estimates of nucleotide diversity in windows spanning the human genome and show that the diversity to divergence ratio is reduced in regions of low recombination. Furthermore, we show...

  1. Weighting training images by maximizing distribution similarity for supervised segmentation across scanners

    DEFF Research Database (Denmark)

    van Opbroek, Annegreet; Vernooij, Meike W; Ikram, M.Arfan

    2015-01-01

    Many automatic segmentation methods are based on supervised machine learning. Such methods have proven to perform well, on the condition that they are trained on a sufficiently large manually labeled training set that is representative of the images to segment. However, due to differences between...... scanners, scanning parameters, and patients such a training set may be difficult to obtain. We present a transfer-learning approach to segmentation by multi-feature voxelwise classification. The presented method can be trained using a heterogeneous set of training images that may be obtained with different...... scanners than the target image. In our approach each training image is given a weight based on the distribution of its voxels in the feature space. These image weights are chosen as to minimize the difference between the weighted probability density function (PDF) of the voxels of the training images...

  2. Genome chaos: survival strategy during crisis.

    Science.gov (United States)

    Liu, Guo; Stevens, Joshua B; Horne, Steven D; Abdallah, Batoul Y; Ye, Karen J; Bremer, Steven W; Ye, Christine J; Chen, David J; Heng, Henry H

    2014-01-01

    Genome chaos, a process of complex, rapid genome re-organization, results in the formation of chaotic genomes, which is followed by the potential to establish stable genomes. It was initially detected through cytogenetic analyses, and recently confirmed by whole-genome sequencing efforts which identified multiple subtypes including "chromothripsis", "chromoplexy", "chromoanasynthesis", and "chromoanagenesis". Although genome chaos occurs commonly in tumors, both the mechanism and detailed aspects of the process are unknown due to the inability of observing its evolution over time in clinical samples. Here, an experimental system to monitor the evolutionary process of genome chaos was developed to elucidate its mechanisms. Genome chaos occurs following exposure to chemotherapeutics with different mechanisms, which act collectively as stressors. Characterization of the karyotype and its dynamic changes prior to, during, and after induction of genome chaos demonstrates that chromosome fragmentation (C-Frag) occurs just prior to chaotic genome formation. Chaotic genomes seem to form by random rejoining of chromosomal fragments, in part through non-homologous end joining (NHEJ). Stress induced genome chaos results in increased karyotypic heterogeneity. Such increased evolutionary potential is demonstrated by the identification of increased transcriptome dynamics associated with high levels of karyotypic variance. In contrast to impacting on a limited number of cancer genes, re-organized genomes lead to new system dynamics essential for cancer evolution. Genome chaos acts as a mechanism of rapid, adaptive, genome-based evolution that plays an essential role in promoting rapid macroevolution of new genome-defined systems during crisis, which may explain some unwanted consequences of cancer treatment.

  3. Intra-tumor heterogeneity in breast cancer has limited impact on transcriptomic-based molecular profiling.

    Science.gov (United States)

    Karthik, Govindasamy-Muralidharan; Rantalainen, Mattias; Stålhammar, Gustav; Lövrot, John; Ullah, Ikram; Alkodsi, Amjad; Ma, Ran; Wedlund, Lena; Lindberg, Johan; Frisell, Jan; Bergh, Jonas; Hartman, Johan

    2017-11-29

    Transcriptomic profiling of breast tumors provides opportunity for subtyping and molecular-based patient stratification. In diagnostic applications the specimen profiled should be representative of the expression profile of the whole tumor and ideally capture properties of the most aggressive part of the tumor. However, breast cancers commonly exhibit intra-tumor heterogeneity at molecular, genomic and in phenotypic level, which can arise during tumor evolution. Currently it is not established to what extent a random sampling approach may influence molecular breast cancer diagnostics. In this study we applied RNA-sequencing to quantify gene expression in 43 pieces (2-5 pieces per tumor) from 12 breast tumors (Cohort 1). We determined molecular subtype and transcriptomic grade for all tumor pieces and analysed to what extent pieces originating from the same tumors are concordant or discordant with each other. Additionally, we validated our finding in an independent cohort consisting of 19 pieces (2-6 pieces per tumor) from 6 breast tumors (Cohort 2) profiled using microarray technique. Exome sequencing was also performed on this cohort, to investigate the extent of intra-tumor genomic heterogeneity versus the intra-tumor molecular subtype classifications. Molecular subtyping was consistent in 11 out of 12 tumors and transcriptomic grade assignments were consistent in 11 out of 12 tumors as well. Molecular subtype predictions revealed consistent subtypes in four out of six patients in this cohort 2. Interestingly, we observed extensive intra-tumor genomic heterogeneity in these tumor pieces but not in their molecular subtype classifications. Our results suggest that macroscopic intra-tumoral transcriptomic heterogeneity is limited and unlikely to have an impact on molecular diagnostics for most patients.

  4. Segmentation: Identification of consumer segments

    DEFF Research Database (Denmark)

    Høg, Esben

    2005-01-01

    It is very common to categorise people, especially in the advertising business. Also traditional marketing theory has taken in consumer segments as a favorite topic. Segmentation is closely related to the broader concept of classification. From a historical point of view, classification has its...... origin in other sciences as for example biology, anthropology etc. From an economic point of view, it is called segmentation when specific scientific techniques are used to classify consumers to different characteristic groupings. What is the purpose of segmentation? For example, to be able to obtain...... a basic understanding of grouping people. Advertising agencies may use segmentation totarget advertisements, while food companies may usesegmentation to develop products to various groups of consumers. MAPP has for example investigated the positioning of fish in relation to other food products...

  5. Phylogenetic analysis of Puumala virus strains from Central Europe highlights the need for a full-genome perspective on hantavirus evolution.

    Science.gov (United States)

    Szabó, Róbert; Radosa, Lukáš; Ličková, Martina; Sláviková, Monika; Heroldová, Marta; Stanko, Michal; Pejčoch, Milan; Osterberg, Anja; Laenen, Lies; Schex, Susanne; Ulrich, Rainer G; Essbauer, Sandra; Maes, Piet; Klempa, Boris

    2017-12-01

    Puumala virus (PUUV), carried by bank voles (Myodes glareolus), is the medically most important hantavirus in Central and Western Europe. In this study, a total of 523 bank voles (408 from Germany, 72 from Slovakia, and 43 from Czech Republic) collected between the years 2007-2012 were analyzed for the presence of hantavirus RNA. Partial PUUV genome segment sequences were obtained from 51 voles. Phylogenetic analyses of all three genome segments showed that the newfound strains cluster with other Central and Western European PUUV strains. The new sequences from Šumava (Bohemian Forest), Czech Republic, are most closely related to the strains from the neighboring Bavarian Forest, a known hantavirus disease outbreak region. Interestingly, the Slovak strains clustered with the sequences from Bohemian and Bavarian Forests only in the M but not S segment analyses. This well-supported topological incongruence suggests a segment reassortment event or, as we analyzed only partial sequences, homologous recombination. Our data highlight the necessity of sequencing all three hantavirus genome segments and of a broader bank vole screening not only in recognized endemic foci but also in regions with no reported human hantavirus disease cases.

  6. Gamma ray scanner systems for nondestructive assay of heterogeneous waste barrels

    International Nuclear Information System (INIS)

    Martz, H.E.; Decman, B.J.; Roberson, G.P.; Levai, F.

    1997-01-01

    Traditional gamma safeguards measurements have usually been performed using a segmented gamma scanning (SGS) system. The accuracy of this technique relies on the assumption that the sample matrix and the activity are both uniform for a segment. Waste barrels are often highly heterogeneous, span a wide range of composition and matrix type. The primary sources of error are all directly or indirectly related to a non-uniform measurement response associated with unknown radioactive source spatial distribution and heterogeneity of the matrix. These errors can be significantly reduced by some imaging techniques that measure exact spatial locations of sources and attenuation maps. In this paper we describe a joint R ampersand D effort between the Lawrence Livermore National Laboratory (LLNL) and the Institute of Nuclear Techniques (INT) of the Technical University, Budapest, to compare results obtained by two different gamma-ray nondestructive assay (NDA) systems used for imaging waste barrels. The basic principles are the same, but the approaches are different. Key factors to judge the adequacy of a method are the detection limit and the accuracy. Test drums representing waste to be measured are used to determine basic parameters of these techniques

  7. Whole-genome sequencing and comprehensive molecular profiling identify new driver mutations in gastric cancer

    NARCIS (Netherlands)

    Wang, Kai; Yuen, Siu Tsan; Xu, Jiangchun; Lee, Siu Po; Yan, Helen H N; Shi, Stephanie T; Siu, Hoi Cheong; Deng, Shibing; Chu, Kent Man; Law, Simon; Chan, Kok Hoe; Chan, Annie S Y; Tsui, Wai Yin; Ho, Siu Lun; Chan, Anthony K W; Man, Jonathan L K; Foglizzo, Valentina; Ng, Man Kin; Chan, April S; Ching, Yick Pang; Cheng, Grace H W; Xie, Tao; Fernandez, Julio; Li, Vivian S W; Clevers, Hans; Rejto, Paul A; Mao, Mao; Leung, Suet Yi

    Gastric cancer is a heterogeneous disease with diverse molecular and histological subtypes. We performed whole-genome sequencing in 100 tumor-normal pairs, along with DNA copy number, gene expression and methylation profiling, for integrative genomic analysis. We found subtype-specific genetic and

  8. The Genome Sequence of Taurine Cattle: A Window to Ruminant Biology and Evolution

    OpenAIRE

    Elsik, Christine G.; Tellam, Ross L.; Worley, Kim C.; Gibbs, Richard A.; Abatepaulo, Antonio R. R.; Abbey, Colette A.; Adelson, David L.; Aerts, Jan; Ahola, Virpi; Alexander, Lee; Alioto, Tyler; Almeida, Iassudara G.; Amadio, Ariel F.; Anatriello, Elen; Antonarakis, Stylianos E.

    2009-01-01

    To understand the biology and evolution of ruminants, the cattle genome was sequenced to about sevenfold coverage. The cattle genome contains a minimum of 22,000 genes, with a core set of 14,345 orthologs shared among seven mammalian species of which 1217 are absent or undetected in noneutherian (marsupial or monotreme) genomes. Cattle-specific evolutionary breakpoint regions in chromosomes have a higher density of segmental duplications, enrichment of repetitive elements, and species-specifi...

  9. Advertising exposures for a seasonal good in a segmented market

    OpenAIRE

    Daniela Favaretto; Luca Grosset; Bruno Viscolani

    2011-01-01

    The optimal control problem of determining advertising efforts for a seasonal good in a heterogeneous market is considered. We characterize optimal advertising exposures under different conditions: the general situation in which several wide-spectrum media are available, under the assumption of additive advertising effects on goodwill evolution, the ideal situation in which the advertising process can reach selectively each segment and the more realistic one in which a single medium reaches s...

  10. Complete genome sequences of two strains of Treponema pallidum subsp. pertenue from Ghana, Africa: Identical genome sequences in samples isolated more than 7 years apart.

    Directory of Open Access Journals (Sweden)

    Michal Strouhal

    2017-09-01

    Full Text Available Treponema pallidum subsp. pertenue (TPE is the causative agent of yaws, a multi-stage disease, endemic in tropical regions of Africa, Asia, Oceania, and South America. To date, four TPE strains have been completely sequenced including three TPE strains of human origin (Samoa D, CDC-2, and Gauthier and one TPE strain (Fribourg-Blanc isolated from a baboon. All TPE strains are highly similar to T. pallidum subsp. pallidum (TPA strains. The mutation rate in syphilis and related treponemes has not been experimentally determined yet.Complete genomes of two TPE strains, CDC 2575 and Ghana-051, that infected patients in Ghana and were isolated in 1980 and 1988, respectively, were sequenced and analyzed. Both strains had identical consensus genome nucleotide sequences raising the question whether TPE CDC 2575 and Ghana-051 represent two different strains. Several lines of evidence support the fact that both strains represent independent samples including regions showing intrastrain heterogeneity (13 and 5 intrastrain heterogeneous sites in TPE Ghana-051 and TPE CDC 2575, respectively. Four of these heterogeneous sites were found in both genomes but the frequency of alternative alleles differed. The identical consensus genome sequences were used to estimate the upper limit of the yaws treponeme evolution rate, which was 4.1 x 10-10 nucleotide changes per site per generation.The estimated upper limit for the mutation rate of TPE was slightly lower than the mutation rate of E. coli, which was determined during a long-term experiment. Given the known diversity between TPA and TPE genomes and the assumption that both TPA and TPE have a similar mutation rate, the most recent common ancestor of syphilis and yaws treponemes appears to be more than ten thousand years old and likely even older.

  11. Mitochondrial genome sequencing helps show the evolutionary mechanism of mitochondrial genome formation in Brassica

    Science.gov (United States)

    2011-01-01

    Background Angiosperm mitochondrial genomes are more complex than those of other organisms. Analyses of the mitochondrial genome sequences of at least 11 angiosperm species have showed several common properties; these cannot easily explain, however, how the diverse mitotypes evolved within each genus or species. We analyzed the evolutionary relationships of Brassica mitotypes by sequencing. Results We sequenced the mitotypes of cam (Brassica rapa), ole (B. oleracea), jun (B. juncea), and car (B. carinata) and analyzed them together with two previously sequenced mitotypes of B. napus (pol and nap). The sizes of whole single circular genomes of cam, jun, ole, and car are 219,747 bp, 219,766 bp, 360,271 bp, and 232,241 bp, respectively. The mitochondrial genome of ole is largest as a resulting of the duplication of a 141.8 kb segment. The jun mitotype is the result of an inherited cam mitotype, and pol is also derived from the cam mitotype with evolutionary modifications. Genes with known functions are conserved in all mitotypes, but clear variation in open reading frames (ORFs) with unknown functions among the six mitotypes was observed. Sequence relationship analysis showed that there has been genome compaction and inheritance in the course of Brassica mitotype evolution. Conclusions We have sequenced four Brassica mitotypes, compared six Brassica mitotypes and suggested a mechanism for mitochondrial genome formation in Brassica, including evolutionary events such as inheritance, duplication, rearrangement, genome compaction, and mutation. PMID:21988783

  12. Usher syndrome type 1-associated cadherins shape the photoreceptor outer segment.

    Science.gov (United States)

    Schietroma, Cataldo; Parain, Karine; Estivalet, Amrit; Aghaie, Asadollah; Boutet de Monvel, Jacques; Picaud, Serge; Sahel, José-Alain; Perron, Muriel; El-Amraoui, Aziz; Petit, Christine

    2017-06-05

    Usher syndrome type 1 (USH1) causes combined hearing and sight defects, but how mutations in USH1 genes lead to retinal dystrophy in patients remains elusive. The USH1 protein complex is associated with calyceal processes, which are microvilli of unknown function surrounding the base of the photoreceptor outer segment. We show that in Xenopus tropicalis , these processes are connected to the outer-segment membrane by links composed of protocadherin-15 (USH1F protein). Protocadherin-15 deficiency, obtained by a knockdown approach, leads to impaired photoreceptor function and abnormally shaped photoreceptor outer segments. Rod basal outer disks displayed excessive outgrowth, and cone outer segments were curved, with lamellae of heterogeneous sizes, defects also observed upon knockdown of Cdh23 , encoding cadherin-23 (USH1D protein). The calyceal processes were virtually absent in cones and displayed markedly reduced F-actin content in rods, suggesting that protocadherin-15-containing links are essential for their development and/or maintenance. We propose that calyceal processes, together with their associated links, control the sizing of rod disks and cone lamellae throughout their daily renewal. © 2017 Schietroma et al.

  13. Capturing Structural Heterogeneity in Chromatin Fibers.

    Science.gov (United States)

    Ekundayo, Babatunde; Richmond, Timothy J; Schalch, Thomas

    2017-10-13

    Chromatin fiber organization is implicated in processes such as transcription, DNA repair and chromosome segregation, but how nucleosomes interact to form higher-order structure remains poorly understood. We solved two crystal structures of tetranucleosomes with approximately 11-bp DNA linker length at 5.8 and 6.7 Å resolution. Minimal intramolecular nucleosome-nucleosome interactions result in a fiber model resembling a flat ribbon that is compatible with a two-start helical architecture, and that exposes histone and DNA surfaces to the environment. The differences in the two structures combined with electron microscopy reveal heterogeneous structural states, and we used site-specific chemical crosslinking to assess the diversity of nucleosome-nucleosome interactions through identification of structure-sensitive crosslink sites that provide a means to characterize fibers in solution. The chromatin fiber architectures observed here provide a basis for understanding heterogeneous chromatin higher-order structures as they occur in a genomic context. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Genomes of ubiquitous marine and hypersaline Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira spp. encode a diversity of mechanisms to sustain chemolithoautotrophy in heterogeneous environments.

    Science.gov (United States)

    Scott, Kathleen M; Williams, John; Porter, Cody M B; Russel, Sydney; Harmer, Tara L; Paul, John H; Antonen, Kirsten M; Bridges, Megan K; Camper, Gary J; Campla, Christie K; Casella, Leila G; Chase, Eva; Conrad, James W; Cruz, Mercedez C; Dunlap, Darren S; Duran, Laura; Fahsbender, Elizabeth M; Goldsmith, Dawn B; Keeley, Ryan F; Kondoff, Matthew R; Kussy, Breanna I; Lane, Marannda K; Lawler, Stephanie; Leigh, Brittany A; Lewis, Courtney; Lostal, Lygia M; Marking, Devon; Mancera, Paola A; McClenthan, Evan C; McIntyre, Emily A; Mine, Jessica A; Modi, Swapnil; Moore, Brittney D; Morgan, William A; Nelson, Kaleigh M; Nguyen, Kimmy N; Ogburn, Nicholas; Parrino, David G; Pedapudi, Anangamanjari D; Pelham, Rebecca P; Preece, Amanda M; Rampersad, Elizabeth A; Richardson, Jason C; Rodgers, Christina M; Schaffer, Brent L; Sheridan, Nancy E; Solone, Michael R; Staley, Zachery R; Tabuchi, Maki; Waide, Ramond J; Wanjugi, Pauline W; Young, Suzanne; Clum, Alicia; Daum, Chris; Huntemann, Marcel; Ivanova, Natalia; Kyrpides, Nikos; Mikhailova, Natalia; Palaniappan, Krishnaveni; Pillay, Manoj; Reddy, T B K; Shapiro, Nicole; Stamatis, Dimitrios; Varghese, Neha; Woyke, Tanja; Boden, Rich; Freyermuth, Sharyn K; Kerfeld, Cheryl A

    2018-03-09

    Chemolithoautotrophic bacteria from the genera Hydrogenovibrio, Thiomicrorhabdus and Thiomicrospira are common, sometimes dominant, isolates from sulfidic habitats including hydrothermal vents, soda and salt lakes and marine sediments. Their genome sequences confirm their membership in a deeply branching clade of the Gammaproteobacteria. Several adaptations to heterogeneous habitats are apparent. Their genomes include large numbers of genes for sensing and responding to their environment (EAL- and GGDEF-domain proteins and methyl-accepting chemotaxis proteins) despite their small sizes (2.1-3.1 Mbp). An array of sulfur-oxidizing complexes are encoded, likely to facilitate these organisms' use of multiple forms of reduced sulfur as electron donors. Hydrogenase genes are present in some taxa, including group 1d and 2b hydrogenases in Hydrogenovibrio marinus and H. thermophilus MA2-6, acquired via horizontal gene transfer. In addition to high-affinity cbb 3 cytochrome c oxidase, some also encode cytochrome bd-type quinol oxidase or ba 3 -type cytochrome c oxidase, which could facilitate growth under different oxygen tensions, or maintain redox balance. Carboxysome operons are present in most, with genes downstream encoding transporters from four evolutionarily distinct families, which may act with the carboxysomes to form CO 2 concentrating mechanisms. These adaptations to habitat variability likely contribute to the cosmopolitan distribution of these organisms. © 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.

  15. Genomic diversity within the Enterobacter cloacae complex.

    Directory of Open Access Journals (Sweden)

    Armand Paauw

    Full Text Available BACKGROUND: Isolates of the Enterobacter cloacae complex have been increasingly isolated as nosocomial pathogens, but phenotypic identification of the E. cloacae complex is unreliable and irreproducible. Identification of species based on currently available genotyping tools is already superior to phenotypic identification, but the taxonomy of isolates belonging to this complex is cumbersome. METHODOLOGY/PRINCIPAL FINDINGS: This study shows that multilocus sequence analysis and comparative genomic hybridization based on a mixed genome array is a powerful method for studying species assignment within the E. cloacae complex. The E. cloacae complex is shown to be evolutionarily divided into two clades that are genetically distinct from each other. The younger first clade is genetically more homogenous, contains the Enterobacter hormaechei species and is the most frequently cultured Enterobacter species in hospitals. The second and older clade consists of several (subspecies that are genetically more heterogeneous. Genetic markers were identified that could discriminate between the two clades and cluster 1. CONCLUSIONS/SIGNIFICANCE: Based on genomic differences it is concluded that some previously defined (clonal and heterogenic (subspecies of the E. cloacae complex have to be redefined because of disagreements with known or proposed nomenclature. However, further improved identification of the redefined species will be possible based on novel markers presented here.

  16. Efficient assembly of de novo human artificial chromosomes from large genomic loci

    Directory of Open Access Journals (Sweden)

    Stromberg Gregory

    2005-07-01

    Full Text Available Abstract Background Human Artificial Chromosomes (HACs are potentially useful vectors for gene transfer studies and for functional annotation of the genome because of their suitability for cloning, manipulating and transferring large segments of the genome. However, development of HACs for the transfer of large genomic loci into mammalian cells has been limited by difficulties in manipulating high-molecular weight DNA, as well as by the low overall frequencies of de novo HAC formation. Indeed, to date, only a small number of large (>100 kb genomic loci have been reported to be successfully packaged into de novo HACs. Results We have developed novel methodologies to enable efficient assembly of HAC vectors containing any genomic locus of interest. We report here the creation of a novel, bimolecular system based on bacterial artificial chromosomes (BACs for the construction of HACs incorporating any defined genomic region. We have utilized this vector system to rapidly design, construct and validate multiple de novo HACs containing large (100–200 kb genomic loci including therapeutically significant genes for human growth hormone (HGH, polycystic kidney disease (PKD1 and ß-globin. We report significant differences in the ability of different genomic loci to support de novo HAC formation, suggesting possible effects of cis-acting genomic elements. Finally, as a proof of principle, we have observed sustained ß-globin gene expression from HACs incorporating the entire 200 kb ß-globin genomic locus for over 90 days in the absence of selection. Conclusion Taken together, these results are significant for the development of HAC vector technology, as they enable high-throughput assembly and functional validation of HACs containing any large genomic locus. We have evaluated the impact of different genomic loci on the frequency of HAC formation and identified segments of genomic DNA that appear to facilitate de novo HAC formation. These genomic loci

  17. Construction of chromosome segment substitution lines enables QTL mapping for flowering and morphological traits in Brassica rapa

    Directory of Open Access Journals (Sweden)

    Xiaonan eLi

    2015-06-01

    Full Text Available Chromosome segment substitution lines (CSSLs represent a powerful method for precise quantitative trait loci (QTL detection of complex agronomical traits in plants. In this study, we used a marker-assisted backcrossing strategy to develop a population consisting of 63 CSSLs, derived from backcrossing of the F1 generated from a cross between two Brassica rapa subspecies: ‘Chiifu’ (ssp. pekinensis, the Brassica A genome-represented line used as the donor, and ‘49caixin’ (ssp. parachinensis, a non-heading cultivar used as the recipient. The 63 CSSLs covered 87.95% of the B. rapa genome. Among them, 39 lines carried a single segment; 15 lines, two segments; and nine lines, three or more segments of the donor parent chromosomes. To verify the potential advantage of these CSSL lines, we used them to locate QTL for six morphology-related traits. A total of 58 QTL were located on eight chromosomes for all six traits: 17 for flowering time, 14 each for bolting time and plant height, 6 for plant diameter, 2 for leaf width, and 5 for flowering stalk diameter. Co-localized QTL were mainly distributed on eight genomic regions in A01, A02, A05, A06, A08, A09, and A10, present in the corresponding CSSLs. Moreover, new chromosomal fragments that harbored QTL were identified using the findings of previous studies. The CSSL population constructed in our study paves the way for fine mapping and cloning of candidate genes involved in late bolting, flowering, and plant architecture-related traits in B. rapa. Furthermore, it has great potential for future marker-aided gene/QTL pyramiding of other interesting traits in B. rapa breeding.

  18. Whole-Genome Analysis of a Novel Fish Reovirus (MsReV Discloses Aquareovirus Genomic Structure Relationship with Host in Saline Environments

    Directory of Open Access Journals (Sweden)

    Zhong-Yuan Chen

    2015-08-01

    Full Text Available Aquareoviruses are serious pathogens of aquatic animals. Here, genome characterization and functional gene analysis of a novel aquareovirus, largemouth bass Micropterus salmoides reovirus (MsReV, was described. It comprises 11 dsRNA segments (S1–S11 covering 24,024 bp, and encodes 12 putative proteins including the inclusion forming-related protein NS87 and the fusion-associated small transmembrane (FAST protein NS22. The function of NS22 was confirmed by expression in fish cells. Subsequently, MsReV was compared with two representative aquareoviruses, saltwater fish turbot Scophthalmus maximus reovirus (SMReV and freshwater fish grass carp reovirus strain 109 (GCReV-109. MsReV NS87 and NS22 genes have the same structure and function with those of SMReV, whereas GCReV-109 is either missing the coiled-coil region in NS79 or the gene-encoding NS22. Significant similarities are also revealed among equivalent genome segments between MsReV and SMReV, but a difference is found between MsReV and GCReV-109. Furthermore, phylogenetic analysis showed that 13 aquareoviruses could be divided into freshwater and saline environments subgroups, and MsReV was closely related to SMReV in saline environments. Consequently, these viruses from hosts in saline environments have more genomic structural similarities than the viruses from hosts in freshwater. This is the first study of the relationships between aquareovirus genomic structure and their host environments.

  19. Identification and characterization of viral defective RNA genomes in influenza B virus.

    Science.gov (United States)

    Sheng, Zizhang; Liu, Runxia; Yu, Jieshi; Ran, Zhiguang; Newkirk, Simon J; An, Wenfeng; Li, Feng; Wang, Dan

    2018-04-01

    Influenza B virus (FLUBV) is an important pathogen that infects humans and causes seasonal influenza epidemics. To date, little is known about defective genomes of FLUBV and their roles in viral replication. In this study, by using a next-generation sequencing approach, we analyzed total mRNAs extracted from A549 cells infected with B/Brisbane/60/2008 virus (Victoria lineage), and identified four defective FLUBV genomes with two (PB1∆A and PB1∆B) from the polymerase basic subunit 1 (PB1) segment and the other two (M∆A and M∆B) from the matrix (M) protein-encoding segment. These defective genomes contained significant deletions in the central regions with each having the potential for encoding a novel polypeptide. Significantly, each of the discovered defective RNAs can potently inhibit the replication of B/Yamanashi/166/98 (Yamagata lineage). Furthermore, PB1∆A was able to interfere modestly with influenza A virus (FLUAV) replication. In summary, our study provides important initial insights into FLUBV defective-interfering genomes, which can be further explored to achieve better understanding of the replication, pathogenesis and evolution of FLUBV.

  20. An overview on genome organization of marine organisms.

    Science.gov (United States)

    Costantini, Maria

    2015-12-01

    In this review we will concentrate on some general genome features of marine organisms and their evolution, ranging from vertebrate to invertebrates until unicellular organisms. Before genome sequencing, the ultracentrifugation in CsCl led to high resolution of mammalian DNA (without seeing at the sequence). The analytical profile of human DNA showed that the vertebrate genome is a mosaic of isochores, typically megabase-size DNA segments that belong in a small number of families characterized by different GC levels. The recent availability of a number of fully sequenced genomes allowed mapping very precisely the isochores, based on DNA sequences. Since isochores are tightly linked to biological properties such as gene density, replication timing and recombination, the new level of detail provided by the isochore map helped the understanding of genome structure, function and evolution. This led the current level of knowledge and to further insights. Copyright © 2015. Published by Elsevier B.V.

  1. International team with Virginia Tech participation maps genome of dengue and yellow fever mosquito

    OpenAIRE

    Trulove, Susan

    2007-01-01

    Developing new strategies to prevent and control yellow fever and dengue fever has become more possible with the completion of the first draft of the genome sequence of Aedes aegypti mosquito by scientists led by Vishvanath Nene at The Institute for Genomic Research (TIGR) and David Severson at the University of Notre Dame. The genome is the complete set of genetic material including genes and other segments of DNA in an organism.

  2. Status of the segment interconnect, cable segment ancillary logic, and the cable segment hybrid driver projects

    International Nuclear Information System (INIS)

    Swoboda, C.; Barsotti, E.; Chappa, S.; Downing, R.; Goeransson, G.; Lensy, D.; Moore, G.; Rotolo, C.; Urish, J.

    1985-01-01

    The FASTBUS Segment Interconnect (SI) provides a communication path between two otherwise independent, asynchronous bus segments. In particular, the Segment Interconnect links a backplane crate segment to a cable segment. All standard FASTBUS address and data transactions can be passed through the SI or any number of SIs and segments in a path. Thus systems of arbitrary connection complexity can be formed, allowing simultaneous independent processing, yet still permitting devices associated with one segment to be accessed from others. The model S1 Segment Interconnect and the Cable Segment Ancillary Logic covered in this report comply with all the mandatory features stated in the FASTBUS specification document DOE/ER-0189. A block diagram of the SI is shown

  3. A Heterogeneous Nuclear Ribonucleoprotein A/B-Related Protein Binds to Single-Stranded DNA near the 5′ End or within the Genome of Feline Parvovirus and Can Modify Virus Replication

    Science.gov (United States)

    Wang, Dai; Parrish, Colin R.

    1999-01-01

    Phage display of cDNA clones prepared from feline cells was used to identify host cell proteins that bound to DNA-containing feline panleukopenia virus (FPV) capsids but not to empty capsids. One gene found in several clones encoded a heterogeneous nuclear ribonucleoprotein (hnRNP)-related protein (DBP40) that was very similar in sequence to the A/B-type hnRNP proteins. DBP40 bound specifically to oligonucleotides representing a sequence near the 5′ end of the genome which is exposed on the outside of the full capsid but did not bind most other terminal sequences. Adding purified DBP40 to an in vitro fill-in reaction using viral DNA as a template inhibited the production of the second strand after nucleotide (nt) 289 but prior to nt 469. DBP40 bound to various regions of the viral genome, including a region between nt 295 and 330 of the viral genome which has been associated with transcriptional attenuation of the parvovirus minute virus of mice, which is mediated by a stem-loop structure of the DNA and cellular proteins. Overexpression of the protein in feline cells from a plasmid vector made them largely resistant to FPV infection. Mutagenesis of the protein binding site within the 5′ end viral genome did not affect replication of the virus. PMID:10438866

  4. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola

    OpenAIRE

    Iwanowicz, Luke R.; Iwanowicz, Deborah D.; Adams, Cynthia R.; Lewis, Teresa D.; Brandt, Thomas M.; Cornman, Robert S.; Sanders, Lakyn

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1).

  5. Complete genome sequence of a novel aquareovirus that infects the endangered fountain darter, Etheostoma fonticola

    Science.gov (United States)

    Iwanowicz, Luke R.; Iwanowicz, Deborah; Adams, Cynthia; Lewis, Teresa D.; Brandt, Thomas M.; Cornman, Robert S.; Sanders, Lakyn R.

    2016-01-01

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1).

  6. "Tandem duplication-random loss" is not a real feature of oyster mitochondrial genomes

    Directory of Open Access Journals (Sweden)

    Zhang Guofan

    2009-02-01

    Full Text Available Abstract Duplications and rearrangements of coding genes are major themes in the evolution of mitochondrial genomes, bearing important consequences in the function of mitochondria and the fitness of organisms. Yu et al. (BMC Genomics 2008, 9:477 reported the complete mt genome sequence of the oyster Crassostrea hongkongensis (16,475 bp and found that a DNA segment containing four tRNA genes (trnK1, trnC, trnQ1 and trnN, a duplicated (rrnS and a split rRNA gene (rrnL5' was absent compared with that of two other Crassostrea species. It was suggested that the absence was a novel case of "tandem duplication-random loss" with evolutionary significance. We independently sequenced the complete mt genome of three C. hongkongensis individuals, all of which were 18,622 bp and contained the segment that was missing in Yu et al.'s sequence. Further, we designed primers, verified sequences and demonstrated that the sequence loss in Yu et al.'s study was an artifact caused by placing primers in a duplicated region. The duplication and split of ribosomal RNA genes are unique for Crassostrea oysters and not lost in C. hongkongensis. Our study highlights the need for caution when amplifying and sequencing through duplicated regions of the genome.

  7. Mapping soil heterogeneity using RapidEye satellite images

    Science.gov (United States)

    Piccard, Isabelle; Eerens, Herman; Dong, Qinghan; Gobin, Anne; Goffart, Jean-Pierre; Curnel, Yannick; Planchon, Viviane

    2016-04-01

    In the frame of BELCAM, a project funded by the Belgian Science Policy Office (BELSPO), researchers from UCL, ULg, CRA-W and VITO aim to set up a collaborative system to develop and deliver relevant information for agricultural monitoring in Belgium. The main objective is to develop remote sensing methods and processing chains able to ingest crowd sourcing data, provided by farmers or associated partners, and to deliver in return relevant and up-to-date information for crop monitoring at the field and district level based on Sentinel-1 and -2 satellite imagery. One of the developments within BELCAM concerns an automatic procedure to detect soil heterogeneity within a parcel using optical high resolution images. Such heterogeneity maps can be used to adjust farming practices according to the detected heterogeneity. This heterogeneity may for instance be caused by differences in mineral composition of the soil, organic matter content, soil moisture or soil texture. Local differences in plant growth may be indicative for differences in soil characteristics. As such remote sensing derived vegetation indices may be used to reveal soil heterogeneity. VITO started to delineate homogeneous zones within parcels by analyzing a series of RapidEye images acquired in 2015 (as a precursor for Sentinel-2). Both unsupervised classification (ISODATA, K-means) and segmentation techniques were tested. Heterogeneity maps were generated from images acquired at different moments during the season (13 May, 30 June, 17 July, 31 August, 11 September and 1 November 2015). Tests were performed using blue, green, red, red edge and NIR reflectances separately and using derived indices such as NDVI, fAPAR, CIrededge, NDRE2. The results for selected winter wheat, maize and potato fields were evaluated together with experts from the collaborating agricultural research centers. For a few fields UAV images and/or yield measurements were available for comparison.

  8. The role of duplications in the evolution of genomes highlights the need for evolutionary-based approaches in comparative genomics

    Directory of Open Access Journals (Sweden)

    Levasseur Anthony

    2011-02-01

    Full Text Available Abstract Understanding the evolutionary plasticity of the genome requires a global, comparative approach in which genetic events are considered both in a phylogenetic framework and with regard to population genetics and environmental variables. In the mechanisms that generate adaptive and non-adaptive changes in genomes, segmental duplications (duplication of individual genes or genomic regions and polyploidization (whole genome duplications are well-known driving forces. The probability of fixation and maintenance of duplicates depends on many variables, including population sizes and selection regimes experienced by the corresponding genes: a combination of stochastic and adaptive mechanisms has shaped all genomes. A survey of experimental work shows that the distinction made between fixation and maintenance of duplicates still needs to be conceptualized and mathematically modeled. Here we review the mechanisms that increase or decrease the probability of fixation or maintenance of duplicated genes, and examine the outcome of these events on the adaptation of the organisms. Reviewers This article was reviewed by Dr. Etienne Joly, Dr. Lutz Walter and Dr. W. Ford Doolittle.

  9. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Short read sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes, as they are mostly...... fragmented, incomplete and often contaminated with foreign DNA and with no robust strategies to validate the quality. The value of these `draft genomes` have limited, lasting value to the scientific community, as gene synteny is broken and the uncertainty of what is missing. The genetic material most often...... missed is important multi-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. We demonstrate that using nanopore long reads it is now possible to overcome these issues and make complete genomes from...

  10. Getting complete genomes from complex samples using nanopore sequencing

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Albertsen, Mads

    Background Short read DNA sequencing and metagenomic binning workflows have made it possible to extract bacterial genome bins from environmental microbial samples containing hundreds to thousands of different species. However, these genome bins often do not represent complete genomes......, as they are mostly fragmented, incomplete and often contaminated with foreign DNA. The value of these `draft genomes` have limited, lasting value to the scientific community, as gene synteny is broken and there is some uncertainty of what is missing1. The genetic material most often missed is important multi......-copy and/or conserved marker genes such as the 16S rRNA gene, as sequence micro-heterogeneity prevents assembly of these genes in the de novo assembly. However, long read sequencing technologies are emerging promising an end to fragmented genome assemblies2. Experimental design We extracted DNA from a full...

  11. Copy number variation in the bovine genome

    DEFF Research Database (Denmark)

    Fadista, João; Thomsen, Bo; Holm, Lars-Erik

    2010-01-01

    to genetic variation in cattle. Results We designed and used a set of NimbleGen CGH arrays that tile across the assayable portion of the cattle genome with approximately 6.3 million probes, at a median probe spacing of 301 bp. This study reports the highest resolution map of copy number variation...... in the cattle genome, with 304 CNV regions (CNVRs) being identified among the genomes of 20 bovine samples from 4 dairy and beef breeds. The CNVRs identified covered 0.68% (22 Mb) of the genome, and ranged in size from 1.7 to 2,031 kb (median size 16.7 kb). About 20% of the CNVs co-localized with segmental...... duplications, while 30% encompass genes, of which the majority is involved in environmental response. About 10% of the human orthologous of these genes are associated with human disease susceptibility and, hence, may have important phenotypic consequences. Conclusions Together, this analysis provides a useful...

  12. Information Extraction of High Resolution Remote Sensing Images Based on the Calculation of Optimal Segmentation Parameters

    Science.gov (United States)

    Zhu, Hongchun; Cai, Lijie; Liu, Haiying; Huang, Wei

    2016-01-01

    Multi-scale image segmentation and the selection of optimal segmentation parameters are the key processes in the object-oriented information extraction of high-resolution remote sensing images. The accuracy of remote sensing special subject information depends on this extraction. On the basis of WorldView-2 high-resolution data, the optimal segmentation parameters methodof object-oriented image segmentation and high-resolution image information extraction, the following processes were conducted in this study. Firstly, the best combination of the bands and weights was determined for the information extraction of high-resolution remote sensing image. An improved weighted mean-variance method was proposed andused to calculatethe optimal segmentation scale. Thereafter, the best shape factor parameter and compact factor parameters were computed with the use of the control variables and the combination of the heterogeneity and homogeneity indexes. Different types of image segmentation parameters were obtained according to the surface features. The high-resolution remote sensing images were multi-scale segmented with the optimal segmentation parameters. Ahierarchical network structure was established by setting the information extraction rules to achieve object-oriented information extraction. This study presents an effective and practical method that can explain expert input judgment by reproducible quantitative measurements. Furthermore the results of this procedure may be incorporated into a classification scheme. PMID:27362762

  13. Defining functional DNA elements in the human genome

    Science.gov (United States)

    Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

    2014-01-01

    With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594

  14. Segmentation Scheme for Safety Enhancement of Engineered Safety Features Component Control System

    International Nuclear Information System (INIS)

    Lee, Sangseok; Sohn, Kwangyoung; Lee, Junku; Park, Geunok

    2013-01-01

    Common Caused Failure (CCF) or undetectable failure would adversely impact safety functions of ESF-CCS in the existing nuclear power plants. We propose the segmentation scheme to solve these problems. Main function assignment to segments in the proposed segmentation scheme is based on functional dependency and critical function success path by using the dependency depth matrix. The segment has functional independence and physical isolation. The segmentation structure is that prohibit failure propagation to others from undetectable failures. Therefore, the segmentation system structure has robustness to undetectable failures. The segmentation system structure has functional diversity. The specific function in the segment defected by CCF, the specific function could be maintained by diverse control function that assigned to other segments. Device level control signals and system level control signals are separated and also control signal and status signals are separated due to signal transmission paths are allocated independently based on signal type. In this kind of design, single device failure or failures on signal path in the channel couldn't result in the loss of all segmented functions simultaneously. Thus the proposed segmentation function is the design scheme that improves availability of safety functions. In conventional ESF-CCS, the single controller generates the signal to control the multiple safety functions, and the reliability is achieved by multiplication within the channel. This design has a drawback causing the loss of multiple functions due to the CCF (Common Cause Failure) and single failure Heterogeneous controller guarantees the diversity ensuring the execution of safety functions against the CCF and single failure, but requiring a lot of resources like manpower and cost. The segmentation technology based on the compartmentalization and functional diversification decreases the CCF and single failure nonetheless the identical types of controllers

  15. Segmentation Scheme for Safety Enhancement of Engineered Safety Features Component Control System

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Sangseok; Sohn, Kwangyoung [Korea Reliability Technology and System, Daejeon (Korea, Republic of); Lee, Junku; Park, Geunok [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

    2013-05-15

    Common Caused Failure (CCF) or undetectable failure would adversely impact safety functions of ESF-CCS in the existing nuclear power plants. We propose the segmentation scheme to solve these problems. Main function assignment to segments in the proposed segmentation scheme is based on functional dependency and critical function success path by using the dependency depth matrix. The segment has functional independence and physical isolation. The segmentation structure is that prohibit failure propagation to others from undetectable failures. Therefore, the segmentation system structure has robustness to undetectable failures. The segmentation system structure has functional diversity. The specific function in the segment defected by CCF, the specific function could be maintained by diverse control function that assigned to other segments. Device level control signals and system level control signals are separated and also control signal and status signals are separated due to signal transmission paths are allocated independently based on signal type. In this kind of design, single device failure or failures on signal path in the channel couldn't result in the loss of all segmented functions simultaneously. Thus the proposed segmentation function is the design scheme that improves availability of safety functions. In conventional ESF-CCS, the single controller generates the signal to control the multiple safety functions, and the reliability is achieved by multiplication within the channel. This design has a drawback causing the loss of multiple functions due to the CCF (Common Cause Failure) and single failure Heterogeneous controller guarantees the diversity ensuring the execution of safety functions against the CCF and single failure, but requiring a lot of resources like manpower and cost. The segmentation technology based on the compartmentalization and functional diversification decreases the CCF and single failure nonetheless the identical types of

  16. Pricing of drugs with heterogeneous health insurance coverage.

    Science.gov (United States)

    Ferrara, Ida; Missios, Paul

    2012-03-01

    In this paper, we examine the role of insurance coverage in explaining the generic competition paradox in a two-stage game involving a single producer of brand-name drugs and n quantity-competing producers of generic drugs. Independently of brand loyalty, which some studies rely upon to explain the paradox, we show that heterogeneity in insurance coverage may result in higher prices of brand-name drugs following generic entry. With market segmentation based on insurance coverage present in both the pre- and post-entry stages, the paradox can arise when the two types of drugs are highly substitutable and the market is quite profitable but does not have to arise when the two types of drugs are highly differentiated. However, with market segmentation occurring only after generic entry, the paradox can arise when the two types of drugs are weakly substitutable, provided, however, that the industry is not very profitable. In both cases, that is, when market segmentation is present in the pre-entry stage and when it is not, the paradox becomes more likely to arise as the market expands and/or insurance companies decrease deductibles applied on the purchase of generic drugs. Copyright © 2012 Elsevier B.V. All rights reserved.

  17. Ultrasound image-based thyroid nodule automatic segmentation using convolutional neural networks.

    Science.gov (United States)

    Ma, Jinlian; Wu, Fa; Jiang, Tian'an; Zhao, Qiyu; Kong, Dexing

    2017-11-01

    Delineation of thyroid nodule boundaries from ultrasound images plays an important role in calculation of clinical indices and diagnosis of thyroid diseases. However, it is challenging for accurate and automatic segmentation of thyroid nodules because of their heterogeneous appearance and components similar to the background. In this study, we employ a deep convolutional neural network (CNN) to automatically segment thyroid nodules from ultrasound images. Our CNN-based method formulates a thyroid nodule segmentation problem as a patch classification task, where the relationship among patches is ignored. Specifically, the CNN used image patches from images of normal thyroids and thyroid nodules as inputs and then generated the segmentation probability maps as outputs. A multi-view strategy is used to improve the performance of the CNN-based model. Additionally, we compared the performance of our approach with that of the commonly used segmentation methods on the same dataset. The experimental results suggest that our proposed method outperforms prior methods on thyroid nodule segmentation. Moreover, the results show that the CNN-based model is able to delineate multiple nodules in thyroid ultrasound images accurately and effectively. In detail, our CNN-based model can achieve an average of the overlap metric, dice ratio, true positive rate, false positive rate, and modified Hausdorff distance as [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text] on overall folds, respectively. Our proposed method is fully automatic without any user interaction. Quantitative results also indicate that our method is so efficient and accurate that it can be good enough to replace the time-consuming and tedious manual segmentation approach, demonstrating the potential clinical applications.

  18. Ranked retrieval of segmented nuclei for objective assessment of cancer gene repositioning

    Directory of Open Access Journals (Sweden)

    Cukierski William J

    2012-09-01

    correlation coefficients between the gene position measurements were above 0.9 (p Conclusions Accurate segmentation is necessary to automate quantitative image analysis for applications such as gene repositioning. However, due to heterogeneity within images and across different applications, no segmentation algorithm provides a satisfactory solution. Automated assessment of segmentations by ranked retrieval is capable of reducing or even eliminating the need to select segmented objects by hand and represents a significant improvement over binary classification. The method can be extended to other high-throughput applications requiring accurate detection of cells or nuclei across a range of biomedical applications.

  19. Usher syndrome type 1–associated cadherins shape the photoreceptor outer segment

    Science.gov (United States)

    Parain, Karine; Aghaie, Asadollah; Picaud, Serge

    2017-01-01

    Usher syndrome type 1 (USH1) causes combined hearing and sight defects, but how mutations in USH1 genes lead to retinal dystrophy in patients remains elusive. The USH1 protein complex is associated with calyceal processes, which are microvilli of unknown function surrounding the base of the photoreceptor outer segment. We show that in Xenopus tropicalis, these processes are connected to the outer-segment membrane by links composed of protocadherin-15 (USH1F protein). Protocadherin-15 deficiency, obtained by a knockdown approach, leads to impaired photoreceptor function and abnormally shaped photoreceptor outer segments. Rod basal outer disks displayed excessive outgrowth, and cone outer segments were curved, with lamellae of heterogeneous sizes, defects also observed upon knockdown of Cdh23, encoding cadherin-23 (USH1D protein). The calyceal processes were virtually absent in cones and displayed markedly reduced F-actin content in rods, suggesting that protocadherin-15–containing links are essential for their development and/or maintenance. We propose that calyceal processes, together with their associated links, control the sizing of rod disks and cone lamellae throughout their daily renewal. PMID:28495838

  20. Segmented block copolymers with monodisperse aramide end-segments

    NARCIS (Netherlands)

    Araichimani, A.; Gaymans, R.J.

    2008-01-01

    Segmented block copolymers were synthesized using monodisperse diaramide (TT) as hard segments and PTMO with a molecular weight of 2 900 g · mol-1 as soft segments. The aramide: PTMO segment ratio was increased from 1:1 to 2:1 thereby changing the structure from a high molecular weight multi-block

  1. Genome-wide investigation reveals high evolutionary rates in annual model plants.

    Science.gov (United States)

    Yue, Jia-Xing; Li, Jinpeng; Wang, Dan; Araki, Hitoshi; Tian, Dacheng; Yang, Sihai

    2010-11-09

    Rates of molecular evolution vary widely among species. While significant deviations from molecular clock have been found in many taxa, effects of life histories on molecular evolution are not fully understood. In plants, annual/perennial life history traits have long been suspected to influence the evolutionary rates at the molecular level. To date, however, the number of genes investigated on this subject is limited and the conclusions are mixed. To evaluate the possible heterogeneity in evolutionary rates between annual and perennial plants at the genomic level, we investigated 85 nuclear housekeeping genes, 10 non-housekeeping families, and 34 chloroplast genes using the genomic data from model plants including Arabidopsis thaliana and Medicago truncatula for annuals and grape (Vitis vinifera) and popular (Populus trichocarpa) for perennials. According to the cross-comparisons among the four species, 74-82% of the nuclear genes and 71-97% of the chloroplast genes suggested higher rates of molecular evolution in the two annuals than those in the two perennials. The significant heterogeneity in evolutionary rate between annuals and perennials was consistently found both in nonsynonymous sites and synonymous sites. While a linear correlation of evolutionary rates in orthologous genes between species was observed in nonsynonymous sites, the correlation was weak or invisible in synonymous sites. This tendency was clearer in nuclear genes than in chloroplast genes, in which the overall evolutionary rate was small. The slope of the regression line was consistently lower than unity, further confirming the higher evolutionary rate in annuals at the genomic level. The higher evolutionary rate in annuals than in perennials appears to be a universal phenomenon both in nuclear and chloroplast genomes in the four dicot model plants we investigated. Therefore, such heterogeneity in evolutionary rate should result from factors that have genome-wide influence, most likely those

  2. Segmentation of consumer's markets and evaluation of market's segments

    OpenAIRE

    ŠVECOVÁ, Iveta

    2013-01-01

    The goal of this bachelor thesis was to explain a possibly segmentation of consumer´s markets for a chosen company, and to present a suitable goods offer, so it would be suitable to the needs of selected segments. The work is divided into theoretical and practical part. First part describes marketing, segmentation, segmentation of consumer's markets, consumer's market, market's segments a other terms. Second part describes an evaluation of questionnaire survey, discovering of market's segment...

  3. NeuroBlocks – Visual Tracking of Segmentation and Proofreading for Large Connectomics Projects

    KAUST Repository

    Al-Awami, Ali

    2015-08-12

    In the field of connectomics, neuroscientists acquire electron microscopy volumes at nanometer resolution in order to reconstruct a detailed wiring diagram of the neurons in the brain. The resulting image volumes, which often are hundreds of terabytes in size, need to be segmented to identify cell boundaries, synapses, and important cell organelles. However, the segmentation process of a single volume is very complex, time-intensive, and usually performed using a diverse set of tools and many users. To tackle the associated challenges, this paper presents NeuroBlocks, which is a novel visualization system for tracking the state, progress, and evolution of very large volumetric segmentation data in neuroscience. NeuroBlocks is a multi-user web-based application that seamlessly integrates the diverse set of tools that neuroscientists currently use for manual and semi-automatic segmentation, proofreading, visualization, and analysis. NeuroBlocks is the first system that integrates this heterogeneous tool set, providing crucial support for the management, provenance, accountability, and auditing of large-scale segmentations. We describe the design of NeuroBlocks, starting with an analysis of the domain-specific tasks, their inherent challenges, and our subsequent task abstraction and visual representation. We demonstrate the utility of our design based on two case studies that focus on different user roles and their respective requirements for performing and tracking the progress of segmentation and proofreading in a large real-world connectomics project.

  4. Microbial comparative pan-genomics using binomial mixture models

    DEFF Research Database (Denmark)

    Ussery, David; Snipen, L; Almøy, T

    2009-01-01

    The size of the core- and pan-genome of bacterial species is a topic of increasing interest due to the growing number of sequenced prokaryote genomes, many from the same species. Attempts to estimate these quantities have been made, using regression methods or mixture models. We extend the latter...... approach by using statistical ideas developed for capture-recapture problems in ecology and epidemiology. RESULTS: We estimate core- and pan-genome sizes for 16 different bacterial species. The results reveal a complex dependency structure for most species, manifested as heterogeneous detection...... probabilities. Estimated pan-genome sizes range from small (around 2600 gene families) in Buchnera aphidicola to large (around 43000 gene families) in Escherichia coli. Results for Echerichia coli show that as more data become available, a larger diversity is estimated, indicating an extensive pool of rarely...

  5. Exploring unobserved heterogeneity in bicyclists' red-light running behaviors at different crossing facilities.

    Science.gov (United States)

    Guo, Yanyong; Li, Zhibin; Wu, Yao; Xu, Chengcheng

    2018-06-01

    Bicyclists running the red light at crossing facilities increase the potential of colliding with motor vehicles. Exploring the contributing factors could improve the prediction of running red-light probability and develop countermeasures to reduce such behaviors. However, individuals could have unobserved heterogeneities in running a red light, which make the accurate prediction more challenging. Traditional models assume that factor parameters are fixed and cannot capture the varying impacts on red-light running behaviors. In this study, we employed the full Bayesian random parameters logistic regression approach to account for the unobserved heterogeneous effects. Two types of crossing facilities were considered which were the signalized intersection crosswalks and the road segment crosswalks. Electric and conventional bikes were distinguished in the modeling. Data were collected from 16 crosswalks in urban area of Nanjing, China. Factors such as individual characteristics, road geometric design, environmental features, and traffic variables were examined. Model comparison indicates that the full Bayesian random parameters logistic regression approach is statistically superior to the standard logistic regression model. More red-light runners are predicted at signalized intersection crosswalks than at road segment crosswalks. Factors affecting red-light running behaviors are gender, age, bike type, road width, presence of raised median, separation width, signal type, green ratio, bike and vehicle volume, and average vehicle speed. Factors associated with the unobserved heterogeneity are gender, bike type, signal type, separation width, and bike volume. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. A Novel Unsupervised Segmentation Quality Evaluation Method for Remote Sensing Images.

    Science.gov (United States)

    Gao, Han; Tang, Yunwei; Jing, Linhai; Li, Hui; Ding, Haifeng

    2017-10-24

    The segmentation of a high spatial resolution remote sensing image is a critical step in geographic object-based image analysis (GEOBIA). Evaluating the performance of segmentation without ground truth data, i.e., unsupervised evaluation, is important for the comparison of segmentation algorithms and the automatic selection of optimal parameters. This unsupervised strategy currently faces several challenges in practice, such as difficulties in designing effective indicators and limitations of the spectral values in the feature representation. This study proposes a novel unsupervised evaluation method to quantitatively measure the quality of segmentation results to overcome these problems. In this method, multiple spectral and spatial features of images are first extracted simultaneously and then integrated into a feature set to improve the quality of the feature representation of ground objects. The indicators designed for spatial stratified heterogeneity and spatial autocorrelation are included to estimate the properties of the segments in this integrated feature set. These two indicators are then combined into a global assessment metric as the final quality score. The trade-offs of the combined indicators are accounted for using a strategy based on the Mahalanobis distance, which can be exhibited geometrically. The method is tested on two segmentation algorithms and three testing images. The proposed method is compared with two existing unsupervised methods and a supervised method to confirm its capabilities. Through comparison and visual analysis, the results verified the effectiveness of the proposed method and demonstrated the reliability and improvements of this method with respect to other methods.

  7. A Novel Unsupervised Segmentation Quality Evaluation Method for Remote Sensing Images

    Directory of Open Access Journals (Sweden)

    Han Gao

    2017-10-01

    Full Text Available The segmentation of a high spatial resolution remote sensing image is a critical step in geographic object-based image analysis (GEOBIA. Evaluating the performance of segmentation without ground truth data, i.e., unsupervised evaluation, is important for the comparison of segmentation algorithms and the automatic selection of optimal parameters. This unsupervised strategy currently faces several challenges in practice, such as difficulties in designing effective indicators and limitations of the spectral values in the feature representation. This study proposes a novel unsupervised evaluation method to quantitatively measure the quality of segmentation results to overcome these problems. In this method, multiple spectral and spatial features of images are first extracted simultaneously and then integrated into a feature set to improve the quality of the feature representation of ground objects. The indicators designed for spatial stratified heterogeneity and spatial autocorrelation are included to estimate the properties of the segments in this integrated feature set. These two indicators are then combined into a global assessment metric as the final quality score. The trade-offs of the combined indicators are accounted for using a strategy based on the Mahalanobis distance, which can be exhibited geometrically. The method is tested on two segmentation algorithms and three testing images. The proposed method is compared with two existing unsupervised methods and a supervised method to confirm its capabilities. Through comparison and visual analysis, the results verified the effectiveness of the proposed method and demonstrated the reliability and improvements of this method with respect to other methods.

  8. Highly syntenic regions in the genomes of soybean, Medicago truncatula, and Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Roe Bruce A

    2005-08-01

    Full Text Available Abstract Background Recent genome sequencing enables mega-base scale comparisons between related genomes. Comparisons between animals, plants, fungi, and bacteria demonstrate extensive synteny tempered by rearrangements. Within the legume plant family, glimpses of synteny have also been observed. Characterizing syntenic relationships in legumes is important in transferring knowledge from model legumes to crops that are important sources of protein, fixed nitrogen, and health-promoting compounds. Results We have uncovered two large soybean regions exhibiting synteny with M. truncatula and with a network of segmentally duplicated regions in Arabidopsis. In all, syntenic regions comprise over 500 predicted genes spanning 3 Mb. Up to 75% of soybean genes are colinear with M. truncatula, including one region in which 33 of 35 soybean predicted genes with database support are colinear to M. truncatula. In some regions, 60% of soybean genes share colinearity with a network of A. thaliana duplications. One region is especially interesting because this 500 kbp segment of soybean is syntenic to two paralogous regions in M. truncatula on different chromosomes. Phylogenetic analysis of individual genes within these regions demonstrates that one is orthologous to the soybean region, with which it also shows substantially denser synteny and significantly lower levels of synonymous nucleotide substitutions. The other M. truncatula region is inferred to be paralogous, presumably resulting from a duplication event preceding speciation. Conclusion The presence of well-defined M. truncatula segments showing orthologous and paralogous relationships with soybean allows us to explore the evolution of contiguous genomic regions in the context of ancient genome duplication and speciation events.

  9. CD4 is expressed on a heterogeneous subset of hematopoietic progenitors, which persistently harbor CXCR4 and CCR5-tropic HIV proviral genomes in vivo.

    Directory of Open Access Journals (Sweden)

    Nadia T Sebastian

    2017-07-01

    Full Text Available Latent HIV infection of long-lived cells is a barrier to viral clearance. Hematopoietic stem and progenitor cells are a heterogeneous population of cells, some of which are long-lived. CXCR4-tropic HIVs infect a broad range of HSPC subtypes, including hematopoietic stem cells, which are multi-potent and long-lived. However, CCR5-tropic HIV infection is limited to more differentiated progenitor cells with life spans that are less well understood. Consistent with emerging data that restricted progenitor cells can be long-lived, we detected persistent HIV in restricted HSPC populations from optimally treated people. Further, genotypic and phenotypic analysis of amplified env alleles from donor samples indicated that both CXCR4- and CCR5-tropic viruses persisted in HSPCs. RNA profiling confirmed expression of HIV receptor RNA in a pattern that was consistent with in vitro and in vivo results. In addition, we characterized a CD4high HSPC sub-population that was preferentially targeted by a variety of CXCR4- and CCR5-tropic HIVs in vitro. Finally, we present strong evidence that HIV proviral genomes of both tropisms can be transmitted to CD4-negative daughter cells of multiple lineages in vivo. In some cases, the transmitted proviral genomes contained signature deletions that inactivated the virus, eliminating the possibility that coincidental infection explains the results. These data support a model in which both stem and non-stem cell progenitors serve as persistent reservoirs for CXCR4- and CCR5-tropic HIV proviral genomes that can be passed to daughter cells.

  10. CD4 is expressed on a heterogeneous subset of hematopoietic progenitors, which persistently harbor CXCR4 and CCR5-tropic HIV proviral genomes in vivo.

    Science.gov (United States)

    Sebastian, Nadia T; Zaikos, Thomas D; Terry, Valeri; Taschuk, Frances; McNamara, Lucy A; Onafuwa-Nuga, Adewunmi; Yucha, Ryan; Signer, Robert A J; Riddell, James; Bixby, Dale; Markowitz, Norman; Morrison, Sean J; Collins, Kathleen L

    2017-07-01

    Latent HIV infection of long-lived cells is a barrier to viral clearance. Hematopoietic stem and progenitor cells are a heterogeneous population of cells, some of which are long-lived. CXCR4-tropic HIVs infect a broad range of HSPC subtypes, including hematopoietic stem cells, which are multi-potent and long-lived. However, CCR5-tropic HIV infection is limited to more differentiated progenitor cells with life spans that are less well understood. Consistent with emerging data that restricted progenitor cells can be long-lived, we detected persistent HIV in restricted HSPC populations from optimally treated people. Further, genotypic and phenotypic analysis of amplified env alleles from donor samples indicated that both CXCR4- and CCR5-tropic viruses persisted in HSPCs. RNA profiling confirmed expression of HIV receptor RNA in a pattern that was consistent with in vitro and in vivo results. In addition, we characterized a CD4high HSPC sub-population that was preferentially targeted by a variety of CXCR4- and CCR5-tropic HIVs in vitro. Finally, we present strong evidence that HIV proviral genomes of both tropisms can be transmitted to CD4-negative daughter cells of multiple lineages in vivo. In some cases, the transmitted proviral genomes contained signature deletions that inactivated the virus, eliminating the possibility that coincidental infection explains the results. These data support a model in which both stem and non-stem cell progenitors serve as persistent reservoirs for CXCR4- and CCR5-tropic HIV proviral genomes that can be passed to daughter cells.

  11. Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae).

    Science.gov (United States)

    Kolano, Bozena; Bednara, Edyta; Weiss-Schneeweiss, Hanna

    2013-10-01

    High heterogeneity was observed among conserved domains of reverse transcriptase ( rt ) isolated from quinoa. Only one Ty1- copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51 %) and 39 % of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.

  12. Disruption of Specific RNA-RNA Interactions in a Double-Stranded RNA Virus Inhibits Genome Packaging and Virus Infectivity.

    Science.gov (United States)

    Fajardo, Teodoro; Sung, Po-Yu; Roy, Polly

    2015-12-01

    Bluetongue virus (BTV) causes hemorrhagic disease in economically important livestock. The BTV genome is organized into ten discrete double-stranded RNA molecules (S1-S10) which have been suggested to follow a sequential packaging pathway from smallest to largest segment during virus capsid assembly. To substantiate and extend these studies, we have investigated the RNA sorting and packaging mechanisms with a new experimental approach using inhibitory oligonucleotides. Putative packaging signals present in the 3'untranslated regions of BTV segments were targeted by a number of nuclease resistant oligoribonucleotides (ORNs) and their effects on virus replication in cell culture were assessed. ORNs complementary to the 3' UTR of BTV RNAs significantly inhibited virus replication without affecting protein synthesis. Same ORNs were found to inhibit complex formation when added to a novel RNA-RNA interaction assay which measured the formation of supramolecular complexes between and among different RNA segments. ORNs targeting the 3'UTR of BTV segment 10, the smallest RNA segment, were shown to be the most potent and deletions or substitution mutations of the targeted sequences diminished the RNA complexes and abolished the recovery of viable viruses using reverse genetics. Cell-free capsid assembly/RNA packaging assay also confirmed that the inhibitory ORNs could interfere with RNA packaging and further substitution mutations within the putative RNA packaging sequence have identified the recognition sequence concerned. Exchange of 3'UTR between segments have further demonstrated that RNA recognition was segment specific, most likely acting as part of the secondary structure of the entire genomic segment. Our data confirm that genome packaging in this segmented dsRNA virus occurs via the formation of supramolecular complexes formed by the interaction of specific sequences located in the 3' UTRs. Additionally, the inhibition of packaging in-trans with inhibitory ORNs

  13. Cascade of chromosomal rearrangements caused by a heterogeneous T-DNA integration supports the double-stranded break repair model for T-DNA integration.

    Science.gov (United States)

    Hu, Yufei; Chen, Zhiyu; Zhuang, Chuxiong; Huang, Jilei

    2017-06-01

    Transferred DNA (T-DNA) from Agrobacterium tumefaciens can be integrated into the plant genome. The double-stranded break repair (DSBR) pathway is a major model for T-DNA integration. From this model, we expect that two ends of a T-DNA molecule would invade into a single DNA double-stranded break (DSB) or independent DSBs in the plant genome. We call the later phenomenon a heterogeneous T-DNA integration, which has never been observed. In this work, we demonstrated it in an Arabidopsis T-DNA insertion mutant seb19. To resolve the chromosomal structural changes caused by T-DNA integration at both the nucleotide and chromosome levels, we performed inverse PCR, genome resequencing, fluorescence in situ hybridization and linkage analysis. We found, in seb19, a single T-DNA connected two different chromosomal loci and caused complex chromosomal rearrangements. The specific break-junction pattern in seb19 is consistent with the result of heterogeneous T-DNA integration but not of recombination between two T-DNA insertions. We demonstrated that, in seb19, heterogeneous T-DNA integration evoked a cascade of incorrect repair of seven DSBs on chromosomes 4 and 5, and then produced translocation, inversion, duplication and deletion. Heterogeneous T-DNA integration supports the DSBR model and suggests that two ends of a T-DNA molecule could be integrated into the plant genome independently. Our results also show a new origin of chromosomal abnormalities. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  14. Microbial comparative pan-genomics using binomial mixture models

    Directory of Open Access Journals (Sweden)

    Ussery David W

    2009-08-01

    Full Text Available Abstract Background The size of the core- and pan-genome of bacterial species is a topic of increasing interest due to the growing number of sequenced prokaryote genomes, many from the same species. Attempts to estimate these quantities have been made, using regression methods or mixture models. We extend the latter approach by using statistical ideas developed for capture-recapture problems in ecology and epidemiology. Results We estimate core- and pan-genome sizes for 16 different bacterial species. The results reveal a complex dependency structure for most species, manifested as heterogeneous detection probabilities. Estimated pan-genome sizes range from small (around 2600 gene families in Buchnera aphidicola to large (around 43000 gene families in Escherichia coli. Results for Echerichia coli show that as more data become available, a larger diversity is estimated, indicating an extensive pool of rarely occurring genes in the population. Conclusion Analyzing pan-genomics data with binomial mixture models is a way to handle dependencies between genomes, which we find is always present. A bottleneck in the estimation procedure is the annotation of rarely occurring genes.

  15. Identifying anti-growth factors for human cancer cell lines through genome-scale metabolic modeling

    DEFF Research Database (Denmark)

    Ghaffari, Pouyan; Mardinoglu, Adil; Asplund, Anna

    2015-01-01

    Human cancer cell lines are used as important model systems to study molecular mechanisms associated with tumor growth, hereunder how genomic and biological heterogeneity found in primary tumors affect cellular phenotypes. We reconstructed Genome scale metabolic models (GEMs) for eleven cell lines...... based on RNA-Seq data and validated the functionality of these models with data from metabolite profiling. We used cell line-specific GEMs to analyze the differences in the metabolism of cancer cell lines, and to explore the heterogeneous expression of the metabolic subsystems. Furthermore, we predicted...... for inhibition of cell growth may provide leads for the development of efficient cancer treatment strategies....

  16. Population genetic inference from personal genome data: impact of ancestry and admixture on human genomic variation.

    Science.gov (United States)

    Kidd, Jeffrey M; Gravel, Simon; Byrnes, Jake; Moreno-Estrada, Andres; Musharoff, Shaila; Bryc, Katarzyna; Degenhardt, Jeremiah D; Brisbin, Abra; Sheth, Vrunda; Chen, Rong; McLaughlin, Stephen F; Peckham, Heather E; Omberg, Larsson; Bormann Chung, Christina A; Stanley, Sarah; Pearlstein, Kevin; Levandowsky, Elizabeth; Acevedo-Acevedo, Suehelay; Auton, Adam; Keinan, Alon; Acuña-Alonzo, Victor; Barquera-Lozano, Rodrigo; Canizales-Quinteros, Samuel; Eng, Celeste; Burchard, Esteban G; Russell, Archie; Reynolds, Andy; Clark, Andrew G; Reese, Martin G; Lincoln, Stephen E; Butte, Atul J; De La Vega, Francisco M; Bustamante, Carlos D

    2012-10-05

    Full sequencing of individual human genomes has greatly expanded our understanding of human genetic variation and population history. Here, we present a systematic analysis of 50 human genomes from 11 diverse global populations sequenced at high coverage. Our sample includes 12 individuals who have admixed ancestry and who have varying degrees of recent (within the last 500 years) African, Native American, and European ancestry. We found over 21 million single-nucleotide variants that contribute to a 1.75-fold range in nucleotide heterozygosity across diverse human genomes. This heterozygosity ranged from a high of one heterozygous site per kilobase in west African genomes to a low of 0.57 heterozygous sites per kilobase in segments inferred to have diploid Native American ancestry from the genomes of Mexican and Puerto Rican individuals. We show evidence of all three continental ancestries in the genomes of Mexican, Puerto Rican, and African American populations, and the genome-wide statistics are highly consistent across individuals from a population once ancestry proportions have been accounted for. Using a generalized linear model, we identified subtle variations across populations in the proportion of neutral versus deleterious variation and found that genome-wide statistics vary in admixed populations even once ancestry proportions have been factored in. We further infer that multiple periods of gene flow shaped the diversity of admixed populations in the Americas-70% of the European ancestry in today's African Americans dates back to European gene flow happening only 7-8 generations ago. Copyright © 2012 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  17. Avoiding Pitfalls in the Statistical Analysis of Heterogeneous Tumors

    Directory of Open Access Journals (Sweden)

    Judith-Anne W. Chapman

    2009-01-01

    Full Text Available Information about tumors is usually obtained from a single assessment of a tumor sample, performed at some point in the course of the development and progression of the tumor, with patient characteristics being surrogates for natural history context. Differences between cells within individual tumors (intratumor heterogeneity and between tumors of different patients (intertumor heterogeneity may mean that a small sample is not representative of the tumor as a whole, particularly for solid tumors which are the focus of this paper. This issue is of increasing importance as high-throughput technologies generate large multi-feature data sets in the areas of genomics, proteomics, and image analysis. Three potential pitfalls in statistical analysis are discussed (sampling, cut-points, and validation and suggestions are made about how to avoid these pitfalls.

  18. Closing the gap between knowledge and clinical application: challenges for genomic translation.

    Science.gov (United States)

    Burke, Wylie; Korngiebel, Diane M

    2015-01-01

    Despite early predictions and rapid progress in research, the introduction of personal genomics into clinical practice has been slow. Several factors contribute to this translational gap between knowledge and clinical application. The evidence available to support genetic test use is often limited, and implementation of new testing programs can be challenging. In addition, the heterogeneity of genomic risk information points to the need for strategies to select and deliver the information most appropriate for particular clinical needs. Accomplishing these tasks also requires recognition that some expectations for personal genomics are unrealistic, notably expectations concerning the clinical utility of genomic risk assessment for common complex diseases. Efforts are needed to improve the body of evidence addressing clinical outcomes for genomics, apply implementation science to personal genomics, and develop realistic goals for genomic risk assessment. In addition, translational research should emphasize the broader benefits of genomic knowledge, including applications of genomic research that provide clinical benefit outside the context of personal genomic risk.

  19. Genome mapping and characterization of the Anopheles gambiae heterochromatin

    Directory of Open Access Journals (Sweden)

    Sharakhova Maria V

    2010-08-01

    Full Text Available Abstract Background Heterochromatin plays an important role in chromosome function and gene regulation. Despite the availability of polytene chromosomes and genome sequence, the heterochromatin of the major malaria vector Anopheles gambiae has not been mapped and characterized. Results To determine the extent of heterochromatin within the An. gambiae genome, genes were physically mapped to the euchromatin-heterochromatin transition zone of polytene chromosomes. The study found that a minimum of 232 genes reside in 16.6 Mb of mapped heterochromatin. Gene ontology analysis revealed that heterochromatin is enriched in genes with DNA-binding and regulatory activities. Immunostaining of the An. gambiae chromosomes with antibodies against Drosophila melanogaster heterochromatin protein 1 (HP1 and the nuclear envelope protein lamin Dm0 identified the major invariable sites of the proteins' localization in all regions of pericentric heterochromatin, diffuse intercalary heterochromatin, and euchromatic region 9C of the 2R arm, but not in the compact intercalary heterochromatin. To better understand the molecular differences among chromatin types, novel Bayesian statistical models were developed to analyze genome features. The study found that heterochromatin and euchromatin differ in gene density and the coverage of retroelements and segmental duplications. The pericentric heterochromatin had the highest coverage of retroelements and tandem repeats, while intercalary heterochromatin was enriched with segmental duplications. We also provide evidence that the diffuse intercalary heterochromatin has a higher coverage of DNA transposable elements, minisatellites, and satellites than does the compact intercalary heterochromatin. The investigation of 42-Mb assembly of unmapped genomic scaffolds showed that it has molecular characteristics similar to cytologically mapped heterochromatin. Conclusions Our results demonstrate that Anopheles polytene chromosomes

  20. SINEs as driving forces in genome evolution.

    Science.gov (United States)

    Schmitz, J

    2012-01-01

    SINEs are short interspersed elements derived from cellular RNAs that repetitively retropose via RNA intermediates and integrate more or less randomly back into the genome. SINEs propagate almost entirely vertically within their host cells and, once established in the germline, are passed on from generation to generation. As non-autonomous elements, their reverse transcription (from RNA to cDNA) and genomic integration depends on the activity of the enzymatic machinery of autonomous retrotransposons, such as long interspersed elements (LINEs). SINEs are widely distributed in eukaryotes, but are especially effectively propagated in mammalian species. For example, more than a million Alu-SINE copies populate the human genome (approximately 13% of genomic space), and few master copies of them are still active. In the organisms where they occur, SINEs are a challenge to genomic integrity, but in the long term also can serve as beneficial building blocks for evolution, contributing to phenotypic heterogeneity and modifying gene regulatory networks. They substantially expand the genomic space and introduce structural variation to the genome. SINEs have the potential to mutate genes, to alter gene expression, and to generate new parts of genes. A balanced distribution and controlled activity of such properties is crucial to maintaining the organism's dynamic and thriving evolution. Copyright © 2012 S. Karger AG, Basel.

  1. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

    Science.gov (United States)

    Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

    2017-01-01

    Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.

  2. Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

    Directory of Open Access Journals (Sweden)

    Matthias Christen

    Full Text Available Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.

  3. Heterogeneity and compartmentalization of Pneumocystis carinii f. sp. hominis genotypes in autopsy lungs

    DEFF Research Database (Denmark)

    Helweg-Larsen, J; Lundgren, Bettina; Lundgren, Jens Dilling

    2001-01-01

    The extent and importance of genotype heterogeneity of Pneumocystis carinii f. sp. hominis within lungs have not previously been investigated. Two hundred forty PCR clones obtained from respiratory specimens and lung segments from three patients with fatal P. carinii pneumonia were investigated....... Not all genotypes present in the lungs at autopsy were detected in the diagnostic respiratory samples. Compartmentalization of specific ITS and mtLSU rRNA sequence types was observed in different lung segments. In conclusion, the interpretation of genotype data and in particular ITS sequence types...... in the assessment of epidemiological questions should be cautious since genotyping done on respiratory samples cannot a priori be assumed to represent all genotypes present within the lung....

  4. Complete Genome Sequence of a Novel Aquareovirus That Infects the Endangered Fountain Darter, Etheostoma fonticola.

    Science.gov (United States)

    Iwanowicz, Luke R; Iwanowicz, Deborah D; Adams, Cynthia R; Lewis, Teresa D; Brandt, Thomas M; Cornman, Robert S; Sanders, Lakyn

    2016-12-22

    Here, we report the complete genome of a novel aquareovirus isolated from clinically normal fountain darters, Etheostoma fonticola, inhabiting the San Marcos River, Texas, USA. The complete genome consists of 23,958 bp consisting of 11 segments that range from 783 bp (S11) to 3,866 bp (S1). Copyright © 2016 Iwanowicz et al.

  5. Quantitative measure of randomness and order for complete genomes

    Science.gov (United States)

    Kong, Sing-Guan; Fan, Wen-Lang; Chen, Hong-Da; Wigger, Jan; Torda, Andrew E.; Lee, H. C.

    2009-06-01

    We propose an order index, ϕ , which gives a quantitative measure of randomness and order of complete genomic sequences. It maps genomes to a number from 0 (random and of infinite length) to 1 (fully ordered) and applies regardless of sequence length. The 786 complete genomic sequences in GenBank were found to have ϕ values in a very narrow range, ϕg=0.031-0.015+0.028 . We show this implies that genomes are halfway toward being completely random, or, at the “edge of chaos.” We further show that artificial “genomes” converted from literary classics have ϕ ’s that almost exactly coincide with ϕg , but sequences of low information content do not. We infer that ϕg represents a high information-capacity “fixed point” in sequence space, and that genomes are driven to it by the dynamics of a robust growth and evolution process. We show that a growth process characterized by random segmental duplication can robustly drive genomes to the fixed point.

  6. Immunoglobulin Genomics in the Guinea Pig (Cavia porcellus)

    Science.gov (United States)

    Guo, Yongchen; Bao, Yonghua; Meng, Qingwen; Hu, Xiaoxiang; Meng, Qingyong; Ren, Liming; Li, Ning; Zhao, Yaofeng

    2012-01-01

    In science, the guinea pig is known as one of the gold standards for modeling human disease. It is especially important as a molecular and cellular biology model for studying the human immune system, as its immunological genes are more similar to human genes than are those of mice. The utility of the guinea pig as a model organism can be further enhanced by further characterization of the genes encoding components of the immune system. Here, we report the genomic organization of the guinea pig immunoglobulin (Ig) heavy and light chain genes. The guinea pig IgH locus is located in genomic scaffolds 54 and 75, and spans approximately 6,480 kb. 507 VH segments (94 potentially functional genes and 413 pseudogenes), 41 DH segments, six JH segments, four constant region genes (μ, γ, ε, and α), and one reverse δ remnant fragment were identified within the two scaffolds. Many VH pseudogenes were found within the guinea pig, and likely constituted a potential donor pool for gene conversion during evolution. The Igκ locus mapped to a 4,029 kb region of scaffold 37 and 24 is composed of 349 Vκ (111 potentially functional genes and 238 pseudogenes), three Jκ and one Cκ genes. The Igλ locus spans 1,642 kb in scaffold 4 and consists of 142 Vλ (58 potentially functional genes and 84 pseudogenes) and 11 Jλ -Cλ clusters. Phylogenetic analysis suggested the guinea pig’s large germline VH gene segments appear to form limited gene families. Therefore, this species may generate antibody diversity via a gene conversion-like mechanism associated with its pseudogene reserves. PMID:22761756

  7. Comparative methods for PET image segmentation in pharyngolaryngeal squamous cell carcinoma

    Energy Technology Data Exchange (ETDEWEB)

    Zaidi, Habib [Geneva University Hospital, Division of Nuclear Medicine and Molecular Imaging, Geneva (Switzerland); Geneva University, Geneva Neuroscience Center, Geneva (Switzerland); University of Groningen, Department of Nuclear Medicine and Molecular Imaging, University Medical Center Groningen, Groningen (Netherlands); Abdoli, Mehrsima [University of Groningen, Department of Nuclear Medicine and Molecular Imaging, University Medical Center Groningen, Groningen (Netherlands); Fuentes, Carolina Llina [Geneva University Hospital, Division of Nuclear Medicine and Molecular Imaging, Geneva (Switzerland); Naqa, Issam M.El [McGill University, Department of Medical Physics, Montreal (Canada)

    2012-05-15

    Several methods have been proposed for the segmentation of {sup 18}F-FDG uptake in PET. In this study, we assessed the performance of four categories of {sup 18}F-FDG PET image segmentation techniques in pharyngolaryngeal squamous cell carcinoma using clinical studies where the surgical specimen served as the benchmark. Nine PET image segmentation techniques were compared including: five thresholding methods; the level set technique (active contour); the stochastic expectation-maximization approach; fuzzy clustering-based segmentation (FCM); and a variant of FCM, the spatial wavelet-based algorithm (FCM-SW) which incorporates spatial information during the segmentation process, thus allowing the handling of uptake in heterogeneous lesions. These algorithms were evaluated using clinical studies in which the segmentation results were compared to the 3-D biological tumour volume (BTV) defined by histology in PET images of seven patients with T3-T4 laryngeal squamous cell carcinoma who underwent a total laryngectomy. The macroscopic tumour specimens were collected ''en bloc'', frozen and cut into 1.7- to 2-mm thick slices, then digitized for use as reference. The clinical results suggested that four of the thresholding methods and expectation-maximization overestimated the average tumour volume, while a contrast-oriented thresholding method, the level set technique and the FCM-SW algorithm underestimated it, with the FCM-SW algorithm providing relatively the highest accuracy in terms of volume determination (-5.9 {+-} 11.9%) and overlap index. The mean overlap index varied between 0.27 and 0.54 for the different image segmentation techniques. The FCM-SW segmentation technique showed the best compromise in terms of 3-D overlap index and statistical analysis results with values of 0.54 (0.26-0.72) for the overlap index. The BTVs delineated using the FCM-SW segmentation technique were seemingly the most accurate and approximated closely the 3-D BTVs

  8. Multi-scale Gaussian representation and outline-learning based cell image segmentation

    Science.gov (United States)

    2013-01-01

    Background High-throughput genome-wide screening to study gene-specific functions, e.g. for drug discovery, demands fast automated image analysis methods to assist in unraveling the full potential of such studies. Image segmentation is typically at the forefront of such analysis as the performance of the subsequent steps, for example, cell classification, cell tracking etc., often relies on the results of segmentation. Methods We present a cell cytoplasm segmentation framework which first separates cell cytoplasm from image background using novel approach of image enhancement and coefficient of variation of multi-scale Gaussian scale-space representation. A novel outline-learning based classification method is developed using regularized logistic regression with embedded feature selection which classifies image pixels as outline/non-outline to give cytoplasm outlines. Refinement of the detected outlines to separate cells from each other is performed in a post-processing step where the nuclei segmentation is used as contextual information. Results and conclusions We evaluate the proposed segmentation methodology using two challenging test cases, presenting images with completely different characteristics, with cells of varying size, shape, texture and degrees of overlap. The feature selection and classification framework for outline detection produces very simple sparse models which use only a small subset of the large, generic feature set, that is, only 7 and 5 features for the two cases. Quantitative comparison of the results for the two test cases against state-of-the-art methods show that our methodology outperforms them with an increase of 4-9% in segmentation accuracy with maximum accuracy of 93%. Finally, the results obtained for diverse datasets demonstrate that our framework not only produces accurate segmentation but also generalizes well to different segmentation tasks. PMID:24267488

  9. Whole-genome sequencing of a laboratory-evolved yeast strain

    Directory of Open Access Journals (Sweden)

    Dunham Maitreya J

    2010-02-01

    Full Text Available Abstract Background Experimental evolution of microbial populations provides a unique opportunity to study evolutionary adaptation in response to controlled selective pressures. However, until recently it has been difficult to identify the precise genetic changes underlying adaptation at a genome-wide scale. New DNA sequencing technologies now allow the genome of parental and evolved strains of microorganisms to be rapidly determined. Results We sequenced >93.5% of the genome of a laboratory-evolved strain of the yeast Saccharomyces cerevisiae and its ancestor at >28× depth. Both single nucleotide polymorphisms and copy number amplifications were found, with specific gains over array-based methodologies previously used to analyze these genomes. Applying a segmentation algorithm to quantify structural changes, we determined the approximate genomic boundaries of a 5× gene amplification. These boundaries guided the recovery of breakpoint sequences, which provide insights into the nature of a complex genomic rearrangement. Conclusions This study suggests that whole-genome sequencing can provide a rapid approach to uncover the genetic basis of evolutionary adaptations, with further applications in the study of laboratory selections and mutagenesis screens. In addition, we show how single-end, short read sequencing data can provide detailed information about structural rearrangements, and generate predictions about the genomic features and processes that underlie genome plasticity.

  10. ReMixT: clone-specific genomic structure estimation in cancer.

    Science.gov (United States)

    McPherson, Andrew W; Roth, Andrew; Ha, Gavin; Chauve, Cedric; Steif, Adi; de Souza, Camila P E; Eirew, Peter; Bouchard-Côté, Alexandre; Aparicio, Sam; Sahinalp, S Cenk; Shah, Sohrab P

    2017-07-27

    Somatic evolution of malignant cells produces tumors composed of multiple clonal populations, distinguished in part by rearrangements and copy number changes affecting chromosomal segments. Whole genome sequencing mixes the signals of sampled populations, diluting the signals of clone-specific aberrations, and complicating estimation of clone-specific genotypes. We introduce ReMixT, a method to unmix tumor and contaminating normal signals and jointly predict mixture proportions, clone-specific segment copy number, and clone specificity of breakpoints. ReMixT is free, open-source software and is available at http://bitbucket.org/dranew/remixt .

  11. Impact of image segmentation on high-content screening data quality for SK-BR-3 cells

    Directory of Open Access Journals (Sweden)

    Li Yizheng

    2007-09-01

    Full Text Available Abstract Background High content screening (HCS is a powerful method for the exploration of cellular signalling and morphology that is rapidly being adopted in cancer research. HCS uses automated microscopy to collect images of cultured cells. The images are subjected to segmentation algorithms to identify cellular structures and quantitate their morphology, for hundreds to millions of individual cells. However, image analysis may be imperfect, especially for "HCS-unfriendly" cell lines whose morphology is not well handled by current image segmentation algorithms. We asked if segmentation errors were common for a clinically relevant cell line, if such errors had measurable effects on the data, and if HCS data could be improved by automated identification of well-segmented cells. Results Cases of poor cell body segmentation occurred frequently for the SK-BR-3 cell line. We trained classifiers to identify SK-BR-3 cells that were well segmented. On an independent test set created by human review of cell images, our optimal support-vector machine classifier identified well-segmented cells with 81% accuracy. The dose responses of morphological features were measurably different in well- and poorly-segmented populations. Elimination of the poorly-segmented cell population increased the purity of DNA content distributions, while appropriately retaining biological heterogeneity, and simultaneously increasing our ability to resolve specific morphological changes in perturbed cells. Conclusion Image segmentation has a measurable impact on HCS data. The application of a multivariate shape-based filter to identify well-segmented cells improved HCS data quality for an HCS-unfriendly cell line, and could be a valuable post-processing step for some HCS datasets.

  12. Patterns and effects of GC3 heterogeneity and parsimony informative sites on the phylogenetic tree of genes.

    Science.gov (United States)

    Ma, Shuai; Wu, Qi; Hu, Yibo; Wei, Fuwen

    2018-05-20

    The explosive growth in genomic data has provided novel insights into the conflicting signals hidden in phylogenetic trees. Although some studies have explored the effects of the GC content and parsimony informative sites (PIS) on the phylogenetic tree, the effect of the heterogeneity of the GC content at the first/second/third codon position on parsimony informative sites (GC1/2/3 PIS ) among different species and the effect of PIS on phylogenetic tree construction remain largely unexplored. Here, we used two different mammal genomic datasets to explore the patterns of GC1/2/3 PIS heterogeneity and the effect of PIS on the phylogenetic tree of genes: (i) all GC1/2/3 PIS have obvious heterogeneity between different mammals, and the levels of heterogeneity are GC3 PIS  > GC2 PIS  > GC1 PIS ; (ii) the number of PIS is positively correlated with the metrics of "good" gene tree topologies, and excluding the third codon position (C3) decreases the quality of gene trees by removing too many PIS. These results provide novel insights into the heterogeneity pattern of GC1/2/3 PIS in mammals and the relationship between GC3/PIS and gene trees. Additionally, it is necessary to carefully consider whether to exclude C3 to improve the quality of gene trees, especially in the super-tree method. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Overview on Clinical Relevance of Intra-Tumor Heterogeneity.

    Science.gov (United States)

    Stanta, Giorgio; Bonin, Serena

    2018-01-01

    Today, clinical evaluation of tumor heterogeneity is an emergent issue to improve clinical oncology. In particular, intra-tumor heterogeneity (ITH) is closely related to cancer progression, resistance to therapy, and recurrences. It is interconnected with complex molecular mechanisms including spatial and temporal phenomena, which are often peculiar for every single patient. This review tries to describe all the types of ITH including morphohistological ITH, and at the molecular level clonal ITH derived from genomic instability and nonclonal ITH derived from microenvironment interaction. It is important to consider the different types of ITH as a whole for any patient to investigate on cancer progression, prognosis, and treatment opportunities. From a practical point of view, analytical methods that are widely accessible today, or will be in the near future, are evaluated to investigate the complex pattern of ITH in a reproducible way for a clinical application.

  14. Overview on Clinical Relevance of Intra-Tumor Heterogeneity

    Directory of Open Access Journals (Sweden)

    Giorgio Stanta

    2018-04-01

    Full Text Available Today, clinical evaluation of tumor heterogeneity is an emergent issue to improve clinical oncology. In particular, intra-tumor heterogeneity (ITH is closely related to cancer progression, resistance to therapy, and recurrences. It is interconnected with complex molecular mechanisms including spatial and temporal phenomena, which are often peculiar for every single patient. This review tries to describe all the types of ITH including morphohistological ITH, and at the molecular level clonal ITH derived from genomic instability and nonclonal ITH derived from microenvironment interaction. It is important to consider the different types of ITH as a whole for any patient to investigate on cancer progression, prognosis, and treatment opportunities. From a practical point of view, analytical methods that are widely accessible today, or will be in the near future, are evaluated to investigate the complex pattern of ITH in a reproducible way for a clinical application.

  15. Genome-wide selection signatures in Pinzgau cattle

    Directory of Open Access Journals (Sweden)

    Radovan Kasarda

    2015-08-01

    Full Text Available The aim of this study was to identify the evidence of recent selection based on estimation of the integrated Haplotype Score (iHS, population differentiation index (FST and characterize affected regions near QTL associated with traits under strong selection in Pinzgau cattle. In total 21 Austrian and 19 Slovak purebreed bulls genotyped with Illumina bovineHD and  bovineSNP50 BeadChip were used to identify genomic regions under selection. Only autosomal loci with call rate higher than 90%, minor allele frequency higher than 0.01 and Hardy-Weinberg equlibrium limit of 0.001 were included in the subsequent analyses of selection sweeps presence. The final dataset was consisted from 30538 SNPs with 81.86 kb average adjacent SNPs spacing. The iHS score were averaged into non-overlapping 500 kb segments across the genome. The FST values were also plotted against genome position based on sliding windows approach and averaged over 8 consecutive SNPs. Based on integrated Haplotype Score evaluation only 7 regions with iHS score higher than 1.7 was found. The average iHS score observed for each adjacent syntenic regions indicated slight effect of recent selection in analysed group of Pinzgau bulls. The level of genetic differentiation between Austrian and Slovak bulls estimated based on FST index was low. Only 24% of FST values calculated for each SNP was greather than 0.01. By using sliding windows approach was found that 5% of analysed windows had higher value than 0.01. Our results indicated use of similar selection scheme in breeding programs of Slovak and Austrian Pinzgau bulls. The evidence for genome-wide association between signatures of selection and regions affecting complex traits such as milk production was insignificant, because the loci in segments identified as affected by selection were very distant from each other. Identification of genomic regions that may be under pressure of selection for phenotypic traits to better understanding of the

  16. Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

    Science.gov (United States)

    Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

    2015-10-01

    Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  17. Perspectives of Integrative Cancer Genomics in Next Generation Sequencing Era

    Directory of Open Access Journals (Sweden)

    So Mee Kwon

    2012-06-01

    Full Text Available The explosive development of genomics technologies including microarrays and next generation sequencing (NGS has provided comprehensive maps of cancer genomes, including the expression of mRNAs and microRNAs, DNA copy numbers, sequence variations, and epigenetic changes. These genome-wide profiles of the genetic aberrations could reveal the candidates for diagnostic and/or prognostic biomarkers as well as mechanistic insights into tumor development and progression. Recent efforts to establish the huge cancer genome compendium and integrative omics analyses, so-called "integromics", have extended our understanding on the cancer genome, showing its daunting complexity and heterogeneity. However, the challenges of the structured integration, sharing, and interpretation of the big omics data still remain to be resolved. Here, we review several issues raised in cancer omics data analysis, including NGS, focusing particularly on the study design and analysis strategies. This might be helpful to understand the current trends and strategies of the rapidly evolving cancer genomics research.

  18. Remote sensing image segmentation using local sparse structure constrained latent low rank representation

    Science.gov (United States)

    Tian, Shu; Zhang, Ye; Yan, Yimin; Su, Nan; Zhang, Junping

    2016-09-01

    Latent low-rank representation (LatLRR) has been attached considerable attention in the field of remote sensing image segmentation, due to its effectiveness in exploring the multiple subspace structures of data. However, the increasingly heterogeneous texture information in the high spatial resolution remote sensing images, leads to more severe interference of pixels in local neighborhood, and the LatLRR fails to capture the local complex structure information. Therefore, we present a local sparse structure constrainted latent low-rank representation (LSSLatLRR) segmentation method, which explicitly imposes the local sparse structure constraint on LatLRR to capture the intrinsic local structure in manifold structure feature subspaces. The whole segmentation framework can be viewed as two stages in cascade. In the first stage, we use the local histogram transform to extract the texture local histogram features (LHOG) at each pixel, which can efficiently capture the complex and micro-texture pattern. In the second stage, a local sparse structure (LSS) formulation is established on LHOG, which aims to preserve the local intrinsic structure and enhance the relationship between pixels having similar local characteristics. Meanwhile, by integrating the LSS and the LatLRR, we can efficiently capture the local sparse and low-rank structure in the mixture of feature subspace, and we adopt the subspace segmentation method to improve the segmentation accuracy. Experimental results on the remote sensing images with different spatial resolution show that, compared with three state-of-the-art image segmentation methods, the proposed method achieves more accurate segmentation results.

  19. Genome-Wide Association Analysis Reveals Genetic Heterogeneity of Sjögren's Syndrome According to Ancestry.

    Science.gov (United States)

    Taylor, Kimberly E; Wong, Quenna; Levine, David M; McHugh, Caitlin; Laurie, Cathy; Doheny, Kimberly; Lam, Mi Y; Baer, Alan N; Challacombe, Stephen; Lanfranchi, Hector; Schiødt, Morten; Srinivasan, M; Umehara, Hisanori; Vivino, Frederick B; Zhao, Yan; Shiboski, Stephen C; Daniels, Troy E; Greenspan, John S; Shiboski, Caroline H; Criswell, Lindsey A

    2017-06-01

    The Sjögren's International Collaborative Clinical Alliance (SICCA) is an international data registry and biorepository derived from a multisite observational study of participants in whom genotyping was performed on the Omni2.5M platform and who had undergone deep phenotyping using common protocol-directed methods. The aim of this study was to examine the genetic etiology of Sjögren's syndrome (SS) across ancestry and disease subsets. We performed genome-wide association study analyses using SICCA subjects and external controls obtained from dbGaP data sets, one using all participants (1,405 cases, 1,622 SICCA controls, and 3,125 external controls), one using European participants (585, 966, and 580, respectively), and one using Asian participants (460, 224, and 901, respectively) with ancestry adjustments via principal components analyses. We also investigated whether subphenotype distributions differ by ethnicity, and whether this contributes to the heterogeneity of genetic associations. We observed significant associations in established regions of the major histocompatibility complex (MHC), IRF5, and STAT4 (P = 3 × 10 -42 , P = 3 × 10 -14 , and P = 9 × 10 -10 , respectively), and several novel suggestive regions (those with 2 or more associations at P ancestry (P = 4 × 10 -15 and P = 4 × 10 -5 , respectively), but that subphenotype differences did not explain most of the ancestry differences in genetic associations. Genetic associations with SS differ markedly according to ancestry; however, this is not explained by differences in subphenotypes. © 2017, The Authors. Arthritis & Rheumatology published by Wiley Periodicals, Inc. on behalf of American College of Rheumatology.

  20. High-resolution comparative mapping among man, cattle and mouse suggests a role for repeat sequences in mammalian genome evolution

    Directory of Open Access Journals (Sweden)

    Rodolphe François

    2006-08-01

    Full Text Available Abstract Background Comparative mapping provides new insights into the evolutionary history of genomes. In particular, recent studies in mammals have suggested a role for segmental duplication in genome evolution. In some species such as Drosophila or maize, transposable elements (TEs have been shown to be involved in chromosomal rearrangements. In this work, we have explored the presence of interspersed repeats in regions of chromosomal rearrangements, using an updated high-resolution integrated comparative map among cattle, man and mouse. Results The bovine, human and mouse comparative autosomal map has been constructed using data from bovine genetic and physical maps and from FISH-mapping studies. We confirm most previous results but also reveal some discrepancies. A total of 211 conserved segments have been identified between cattle and man, of which 33 are new segments and 72 correspond to extended, previously known segments. The resulting map covers 91% and 90% of the human and bovine genomes, respectively. Analysis of breakpoint regions revealed a high density of species-specific interspersed repeats in the human and mouse genomes. Conclusion Analysis of the breakpoint regions has revealed specific repeat density patterns, suggesting that TEs may have played a significant role in chromosome evolution and genome plasticity. However, we cannot rule out that repeats and breakpoints accumulate independently in the few same regions where modifications are better tolerated. Likewise, we cannot ascertain whether increased TE density is the cause or the consequence of chromosome rearrangements. Nevertheless, the identification of high density repeat clusters combined with a well-documented repeat phylogeny should highlight probable breakpoints, and permit their precise dating. Combining new statistical models taking the present information into account should help reconstruct ancestral karyotypes.

  1. The 8p23 inversion polymorphism determines local recombination heterogeneity across human populations.

    Science.gov (United States)

    Alves, Joao M; Chikhi, Lounès; Amorim, António; Lopes, Alexandra M

    2014-04-01

    For decades, chromosomal inversions have been regarded as fascinating evolutionary elements as they are expected to suppress recombination between chromosomes with opposite orientations, leading to the accumulation of genetic differences between the two configurations over time. Here, making use of publicly available population genotype data for the largest polymorphic inversion in the human genome (8p23-inv), we assessed whether this inhibitory effect of inversion rearrangements led to significant differences in the recombination landscape of two homologous DNA segments, with opposite orientation. Our analysis revealed that the accumulation of genetic differentiation is positively correlated with the variation in recombination profiles. The observed recombination dissimilarity between inversion types is consistent across all populations analyzed and surpasses the effects of geographic structure, suggesting that both structures (orientations) have been evolving independently over an extended period of time, despite being subjected to the very same demographic history. Aside this mainly independent evolution, we also identified a short segment (350 kb, inversion) in the central region of the inversion where the genetic divergence between the two structural haplotypes is diminished. Although it is difficult to demonstrate it, this could be due to gene flow (possibly via double-crossing over events), which is consistent with the higher recombination rates surrounding this segment. This study demonstrates for the first time that chromosomal inversions influence the recombination landscape at a fine-scale and highlights the role of these rearrangements as drivers of genome evolution.

  2. Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    Science.gov (United States)

    Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

    2014-07-04

    Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between mitochondrial genomes of peppers and tobacco that are included in Solanaceae revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, large number of small sequence segments which were absent or found on different locations in Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was

  3. ℓ1/2-norm regularized nonnegative low-rank and sparse affinity graph for remote sensing image segmentation

    Science.gov (United States)

    Tian, Shu; Zhang, Ye; Yan, Yiming; Su, Nan

    2016-10-01

    Segmentation of real-world remote sensing images is a challenge due to the complex texture information with high heterogeneity. Thus, graph-based image segmentation methods have been attracting great attention in the field of remote sensing. However, most of the traditional graph-based approaches fail to capture the intrinsic structure of the feature space and are sensitive to noises. A ℓ-norm regularization-based graph segmentation method is proposed to segment remote sensing images. First, we use the occlusion of the random texture model (ORTM) to extract the local histogram features. Then, a ℓ-norm regularized low-rank and sparse representation (LNNLRS) is implemented to construct a ℓ-regularized nonnegative low-rank and sparse graph (LNNLRS-graph), by the union of feature subspaces. Moreover, the LNNLRS-graph has a high ability to discriminate the manifold intrinsic structure of highly homogeneous texture information. Meanwhile, the LNNLRS representation takes advantage of the low-rank and sparse characteristics to remove the noises and corrupted data. Last, we introduce the LNNLRS-graph into the graph regularization nonnegative matrix factorization to enhance the segmentation accuracy. The experimental results using remote sensing images show that when compared to five state-of-the-art image segmentation methods, the proposed method achieves more accurate segmentation results.

  4. An alternative covariance estimator to investigate genetic heterogeneity in populations.

    Science.gov (United States)

    Heslot, Nicolas; Jannink, Jean-Luc

    2015-11-26

    For genomic prediction and genome-wide association studies (GWAS) using mixed models, covariance between individuals is estimated using molecular markers. Based on the properties of mixed models, using available molecular data for prediction is optimal if this covariance is known. Under this assumption, adding individuals to the analysis should never be detrimental. However, some empirical studies showed that increasing training population size decreased prediction accuracy. Recently, results from theoretical models indicated that even if marker density is high and the genetic architecture of traits is controlled by many loci with small additive effects, the covariance between individuals, which depends on relationships at causal loci, is not always well estimated by the whole-genome kinship. We propose an alternative covariance estimator named K-kernel, to account for potential genetic heterogeneity between populations that is characterized by a lack of genetic correlation, and to limit the information flow between a priori unknown populations in a trait-specific manner. This is similar to a multi-trait model and parameters are estimated by REML and, in extreme cases, it can allow for an independent genetic architecture between populations. As such, K-kernel is useful to study the problem of the design of training populations. K-kernel was compared to other covariance estimators or kernels to examine its fit to the data, cross-validated accuracy and suitability for GWAS on several datasets. It provides a significantly better fit to the data than the genomic best linear unbiased prediction model and, in some cases it performs better than other kernels such as the Gaussian kernel, as shown by an empirical null distribution. In GWAS simulations, alternative kernels control type I errors as well as or better than the classical whole-genome kinship and increase statistical power. No or small gains were observed in cross-validated prediction accuracy. This alternative

  5. The complex jujube genome provides insights into fruit tree biology.

    Science.gov (United States)

    Liu, Meng-Jun; Zhao, Jin; Cai, Qing-Le; Liu, Guo-Cheng; Wang, Jiu-Rui; Zhao, Zhi-Hui; Liu, Ping; Dai, Li; Yan, Guijun; Wang, Wen-Jiang; Li, Xian-Song; Chen, Yan; Sun, Yu-Dong; Liu, Zhi-Guo; Lin, Min-Juan; Xiao, Jing; Chen, Ying-Ying; Li, Xiao-Feng; Wu, Bin; Ma, Yong; Jian, Jian-Bo; Yang, Wei; Yuan, Zan; Sun, Xue-Chao; Wei, Yan-Li; Yu, Li-Li; Zhang, Chi; Liao, Sheng-Guang; He, Rong-Jun; Guang, Xuan-Min; Wang, Zhuo; Zhang, Yue-Yang; Luo, Long-Hai

    2014-10-28

    The jujube (Ziziphus jujuba Mill.), a member of family Rhamnaceae, is a major dry fruit and a traditional herbal medicine for more than one billion people. Here we present a high-quality sequence for the complex jujube genome, the first genome sequence of Rhamnaceae, using an integrated strategy. The final assembly spans 437.65 Mb (98.6% of the estimated) with 321.45 Mb anchored to the 12 pseudo-chromosomes and contains 32,808 genes. The jujube genome has undergone frequent inter-chromosome fusions and segmental duplications, but no recent whole-genome duplication. Further analyses of the jujube-specific genes and transcriptome data from 15 tissues reveal the molecular mechanisms underlying some specific properties of the jujube. Its high vitamin C content can be attributed to a unique high level expression of genes involved in both biosynthesis and regeneration. Our study provides insights into jujube-specific biology and valuable genomic resources for the improvement of Rhamnaceae plants and other fruit trees.

  6. A survey of innovation through duplication in the reduced genomes of twelve parasites.

    Directory of Open Access Journals (Sweden)

    Jeremy D DeBarry

    Full Text Available We characterize the prevalence, distribution, divergence, and putative functions of detectable two-copy paralogs and segmental duplications in the Apicomplexa, a phylum of parasitic protists. Apicomplexans are mostly obligate intracellular parasites responsible for human and animal diseases (e.g. malaria and toxoplasmosis. Gene loss is a major force in the phylum. Genomes are small and protein-encoding gene repertoires are reduced. Despite this genomic streamlining, duplications and gene family amplifications are present. The potential for innovation introduced by duplications is of particular interest. We compared genomes of twelve apicomplexans across four lineages and used orthology and genome cartography to map distributions of duplications against genome architectures. Segmental duplications appear limited to five species. Where present, they correspond to regions enriched for multi-copy and species-specific genes, pointing toward roles in adaptation and innovation. We found a phylum-wide association of duplications with dynamic chromosome regions and syntenic breakpoints. Trends in the distribution of duplicated genes indicate that recent, species-specific duplicates are often tandem while most others have been dispersed by genome rearrangements. These trends show a relationship between genome architecture and gene duplication. Functional analysis reveals: proteases, which are vital to a parasitic lifecycle, to be prominent in putative recent duplications; a pair of paralogous genes in Toxoplasma gondii previously shown to produce the rate-limiting step in dopamine synthesis in mammalian cells, a possible link to the modification of host behavior; and phylum-wide differences in expression and subcellular localization, indicative of modes of divergence. We have uncovered trends in multiple modes of duplicate divergence including sequence, intron content, expression, subcellular localization, and functions of putative recent duplicates that

  7. Understanding Cancer Genome and Its Evolution by Next Generation Sequencing

    DEFF Research Database (Denmark)

    Hou, Yong

    Cancer will cause 13 million deaths by the year of 2030, ranking the second leading cause of death worldwide. Previous studies indicate that most of the cancers originate from cells that acquired somatic mutations and evolved as Darwin Theory. Ten biological insights of cancer have been summarized...... recently. Cutting-age technologies like next generation sequencing (NGS) enable exploring cancer genome and evolution much more efficiently. However, integrated cancer genome sequencing studies showed great inter-/intra-tumoral heterogeneity (ITH) and complex evolution patterns beyond the cancer biological...... knowledge we previously know. There is very limited knowledge of East Asia lung cancer genome except enrichment of EGFR mutations and lack of KRAS mutations. We carried out integrated genomic, transcriptomic and methylomic analysis of 335 primary Chinese lung adenocarcinomas (LUAD) and 35 corresponding...

  8. Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host-specific subpopulations

    KAUST Repository

    Diez Benavente, Ernest

    2017-09-18

    The macaque parasite Plasmodium knowlesi is a significant concern in Malaysia where cases of human infection are increasing. Parasites infecting humans originate from genetically distinct subpopulations associated with the long-tailed (Macaca fascicularis (Mf)) or pig-tailed macaques (Macaca nemestrina (Mn)). We used a new high-quality reference genome to re-evaluate previously described subpopulations among human and macaque isolates from Malaysian-Borneo and Peninsular-Malaysia. Nuclear genomes were dimorphic, as expected, but new evidence of chromosomal-segment exchanges between subpopulations was found. A large segment on chromosome 8 originating from the Mn subpopulation and containing genes encoding proteins expressed in mosquito-borne parasite stages, was found in Mf genotypes. By contrast, non-recombining organelle genomes partitioned into 3 deeply branched lineages, unlinked with nuclear genomic dimorphism. Subpopulations which diverged in isolation have re-connected, possibly due to deforestation and disruption of wild macaque habitats. The resulting genomic mosaics reveal traits selected by host-vector-parasite interactions in a setting of ecological transition.

  9. Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host-specific subpopulations

    KAUST Repository

    Diez Benavente, Ernest; Florez de Sessions, Paola; Moon, Robert W.; Holder, Anthony A.; Blackman, Michael J.; Roper, Cally; Drakeley, Christopher J.; Pain, Arnab; Sutherland, Colin J.; Hibberd, Martin L.; Campino, Susana; Clark, Taane G.

    2017-01-01

    The macaque parasite Plasmodium knowlesi is a significant concern in Malaysia where cases of human infection are increasing. Parasites infecting humans originate from genetically distinct subpopulations associated with the long-tailed (Macaca fascicularis (Mf)) or pig-tailed macaques (Macaca nemestrina (Mn)). We used a new high-quality reference genome to re-evaluate previously described subpopulations among human and macaque isolates from Malaysian-Borneo and Peninsular-Malaysia. Nuclear genomes were dimorphic, as expected, but new evidence of chromosomal-segment exchanges between subpopulations was found. A large segment on chromosome 8 originating from the Mn subpopulation and containing genes encoding proteins expressed in mosquito-borne parasite stages, was found in Mf genotypes. By contrast, non-recombining organelle genomes partitioned into 3 deeply branched lineages, unlinked with nuclear genomic dimorphism. Subpopulations which diverged in isolation have re-connected, possibly due to deforestation and disruption of wild macaque habitats. The resulting genomic mosaics reveal traits selected by host-vector-parasite interactions in a setting of ecological transition.

  10. Whole-genome analyses of DS-1-like human G2P[4] and G8P[4] rotavirus strains from Eastern, Western and Southern Africa.

    Science.gov (United States)

    Nyaga, Martin M; Stucker, Karla M; Esona, Mathew D; Jere, Khuzwayo C; Mwinyi, Bakari; Shonhai, Annie; Tsolenyanu, Enyonam; Mulindwa, Augustine; Chibumbya, Julia N; Adolfine, Hokororo; Halpin, Rebecca A; Roy, Sunando; Stockwell, Timothy B; Berejena, Chipo; Seheri, Mapaseka L; Mwenda, Jason M; Steele, A Duncan; Wentworth, David E; Mphahlele, M Jeffrey

    2014-10-01

    Group A rotaviruses (RVAs) with distinct G and P genotype combinations have been reported globally. We report the genome composition and possible origin of seven G8P[4] and five G2P[4] human RVA strains based on the genetic evolution of all 11 genome segments at the nucleotide level. Twelve RVA ELISA positive stool samples collected in the representative countries of Eastern, Southern and West Africa during the 2007-2012 surveillance seasons were subjected to sequencing using the Ion Torrent PGM and Illumina MiSeq platforms. A reference-based assembly was performed using CLC Bio's clc_ref_assemble_long program, and full-genome consensus sequences were obtained. With the exception of the neutralising antigen, VP7, all study strains exhibited the DS-1-like genome constellation (P[4]-I2-R2-C2-M2-A2-N2-T2-E2-H2) and clustered phylogenetically with reference strains having a DS-1-like genetic backbone. Comparison of the nucleotide and amino acid sequences with selected global cognate genome segments revealed nucleotide and amino acid sequence identities of 81.7-100 % and 90.6-100 %, respectively, with NSP4 gene segment showing the most diversity among the strains. Bayesian analyses of all gene sequences to estimate the time of divergence of the lineage indicated that divergence times ranged from 16 to 44 years, except for the NSP4 gene where the lineage seemed to arise in the more distant past at an estimated 203 years ago. However, the long-term effects of changes found within the NSP4 genome segment should be further explored, and thus we recommend continued whole-genome analyses from larger sample sets to determine the evolutionary mechanisms of the DS-1-like strains collected in Africa.

  11. glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data

    Directory of Open Access Journals (Sweden)

    Andrew Paul Hutchins

    2014-01-01

    Full Text Available Genomic datasets and the tools to analyze them have proliferated at an astonishing rate. However, such tools are often poorly integrated with each other: each program typically produces its own custom output in a variety of non-standard file formats. Here we present glbase, a framework that uses a flexible set of descriptors that can quickly parse non-binary data files. glbase includes many functions to intersect two lists of data, including operations on genomic interval data and support for the efficient random access to huge genomic data files. Many glbase functions can produce graphical outputs, including scatter plots, heatmaps, boxplots and other common analytical displays of high-throughput data such as RNA-seq, ChIP-seq and microarray expression data. glbase is designed to rapidly bring biological data into a Python-based analytical environment to facilitate analysis and data processing. In summary, glbase is a flexible and multifunctional toolkit that allows the combination and analysis of high-throughput data (especially next-generation sequencing and genome-wide data, and which has been instrumental in the analysis of complex data sets. glbase is freely available at http://bitbucket.org/oaxiom/glbase/.

  12. glbase: a framework for combining, analyzing and displaying heterogeneous genomic and high-throughput sequencing data.

    Science.gov (United States)

    Hutchins, Andrew Paul; Jauch, Ralf; Dyla, Mateusz; Miranda-Saavedra, Diego

    2014-01-01

    Genomic datasets and the tools to analyze them have proliferated at an astonishing rate. However, such tools are often poorly integrated with each other: each program typically produces its own custom output in a variety of non-standard file formats. Here we present glbase, a framework that uses a flexible set of descriptors that can quickly parse non-binary data files. glbase includes many functions to intersect two lists of data, including operations on genomic interval data and support for the efficient random access to huge genomic data files. Many glbase functions can produce graphical outputs, including scatter plots, heatmaps, boxplots and other common analytical displays of high-throughput data such as RNA-seq, ChIP-seq and microarray expression data. glbase is designed to rapidly bring biological data into a Python-based analytical environment to facilitate analysis and data processing. In summary, glbase is a flexible and multifunctional toolkit that allows the combination and analysis of high-throughput data (especially next-generation sequencing and genome-wide data), and which has been instrumental in the analysis of complex data sets. glbase is freely available at http://bitbucket.org/oaxiom/glbase/.

  13. Analysis of the coding potential of the partially overlapping 3' ORF in segment 5 of the plant fijiviruses

    Directory of Open Access Journals (Sweden)

    Atkins John F

    2009-03-01

    Full Text Available Abstract The plant-infecting members of the genus Fijivirus (family Reoviridae have linear dsRNA genomes divided into 10 segments, two of which contain two substantial and non-overlapping ORFs, while the remaining eight are apparently monocistronic. However, one of these – namely segment 5 – contains a second long ORF (~200+ codons that overlaps the 3' end of the major ORF (~920–940 codons in the +1 reading frame. In this report, we use bioinformatic techniques to analyze the pattern of base variations across an alignment of fijivirus segment 5 sequences, and show that this 3' ORF has a strong coding signature. Possible translation mechanisms for this unusually positioned ORF are discussed.

  14. Estimating phylogenetic trees from genome-scale data.

    Science.gov (United States)

    Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

    2015-12-01

    The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.

  15. Human genome-microbiome interaction: metagenomics frontiers for the aetiopathology of autoimmune diseases.

    Science.gov (United States)

    Gundogdu, Aycan; Nalbantoglu, Ufuk

    2017-04-01

    A short while ago, the human genome and microbiome were analysed simultaneously for the first time as a multi-omic approach. The analyses of heterogeneous population cohorts showed that microbiome components were associated with human genome variations. In-depth analysis of these results reveals that the majority of those relationships are between immune pathways and autoimmune disease-associated microbiome components. Thus, it can be hypothesized that autoimmunity may be associated with homeostatic disequilibrium of the human-microbiome interactome. Further analysis of human genome-human microbiome relationships in disease contexts with tailored systems biology approaches may yield insights into disease pathogenesis and prognosis.

  16. On the multiple depots vehicle routing problem with heterogeneous fleet capacity and velocity

    Science.gov (United States)

    Hanum, F.; Hartono, A. P.; Bakhtiar, T.

    2018-03-01

    This current manuscript concerns with the optimization problem arising in a route determination of products distribution. The problem is formulated in the form of multiple depots and time windowed vehicle routing problem with heterogeneous capacity and velocity of fleet. Model includes a number of constraints such as route continuity, multiple depots availability and serving time in addition to generic constraints. In dealing with the unique feature of heterogeneous velocity, we generate a number of velocity profiles along the road segments, which then converted into traveling-time tables. An illustrative example of rice distribution among villages by bureau of logistics is provided. Exact approach is utilized to determine the optimal solution in term of vehicle routes and starting time of service.

  17. The highly heterogeneous methylated genomes and diverse restriction-modification systems of bloom-forming Microcystis.

    Science.gov (United States)

    Zhao, Liang; Song, Yulong; Li, Lin; Gan, Nanqin; Brand, Jerry J; Song, Lirong

    2018-05-01

    The occurrence of harmful Microcystis blooms is increasing in frequency in a myriad of freshwater ecosystems. Despite considerable research pertaining to the cause and nature of these blooms, the molecular mechanisms behind the cosmopolitan distribution and phenotypic diversity in Microcystis are still unclear. We compared the patterns and extent of DNA methylation in three strains of Microcystis, PCC 7806SL, NIES-2549 and FACHB-1757, using Single Molecule Real-Time (SMRT) sequencing technology. Intact restriction-modification (R-M) systems were identified from the genomes of these strains, and from two previously sequenced strains of Microcystis, NIES-843 and TAIHU98. A large number of methylation motifs and R-M genes were identified in these strains, which differ substantially among different strains. Of the 35 motifs identified, eighteen had not previously been reported. Strain NIES-843 contains a larger number of total putative methyltransferase genes than have been reported previously from any bacterial genome. Genomic comparisons reveal that methyltransferases (some partial) may have been acquired from the environment through horizontal gene transfer. Copyright © 2018 Elsevier B.V. All rights reserved.

  18. transPLANT Resources for Triticeae Genomic Data

    Directory of Open Access Journals (Sweden)

    Manuel Spannagl

    2016-03-01

    Full Text Available The genome sequences of many important Triticeae species, including bread wheat ( L. and barley ( L., remained uncharacterized for a long time because their high repeat content, large sizes, and polyploidy. As a result of improvements in sequencing technologies and novel analyses strategies, several of these have recently been deciphered. These efforts have generated new insights into Triticeae biology and genome organization and have important implications for downstream usage by breeders, experimental biologists, and comparative genomicists. transPLANT ( is an EU-funded project aimed at constructing hardware, software, and data infrastructure for genome-scale research in the life sciences. Since the Triticeae data are intrinsically complex, heterogenous, and distributed, the transPLANT consortium has undertaken efforts to develop common data formats and tools that enable the exchange and integration of data from distributed resources. Here we present an overview of the individual Triticeae genome resources hosted by transPLANT partners, introduce the objectives of transPLANT, and outline common developments and interfaces supporting integrated data access.

  19. Sequencing and comparative genome analysis of two pathogenic Streptococcus gallolyticus subspecies: genome plasticity, adaptation and virulence.

    Directory of Open Access Journals (Sweden)

    I-Hsuan Lin

    Full Text Available Streptococcus gallolyticus infections in humans are often associated with bacteremia, infective endocarditis and colon cancers. The disease manifestations are different depending on the subspecies of S. gallolyticus causing the infection. Here, we present the complete genomes of S. gallolyticus ATCC 43143 (biotype I and S. pasteurianus ATCC 43144 (biotype II.2. The genomic differences between the two biotypes were characterized with comparative genomic analyses. The chromosome of ATCC 43143 and ATCC 43144 are 2,36 and 2,10 Mb in length and encode 2246 and 1869 CDS respectively. The organization and genomic contents of both genomes were most similar to the recently published S. gallolyticus UCN34, where 2073 (92% and 1607 (86% of the ATCC 43143 and ATCC 43144 CDS were conserved in UCN34 respectively. There are around 600 CDS conserved in all Streptococcus genomes, indicating the Streptococcus genus has a small core-genome (constitute around 30% of total CDS and substantial evolutionary plasticity. We identified eight and five regions of genome plasticity in ATCC 43143 and ATCC 43144 respectively. Within these regions, several proteins were recognized to contribute to the fitness and virulence of each of the two subspecies. We have also predicted putative cell-surface associated proteins that could play a role in adherence to host tissues, leading to persistent infections causing sub-acute and chronic diseases in humans. This study showed evidence that the S. gallolyticus still possesses genes making it suitable in a rumen environment, whereas the ability for S. pasteurianus to live in rumen is reduced. The genome heterogeneity and genetic diversity among the two biotypes, especially membrane and lipoproteins, most likely contribute to the differences in the pathogenesis of the two S. gallolyticus biotypes and the type of disease an infected patient eventually develops.

  20. Urban tourist complexes as Multi-product companies: Market segmentation and product differentiation in Amsterdam

    OpenAIRE

    Romao, J.; Neuts, B.; Nijkamp, P.; Leeuwen, E.S. van

    2012-01-01

    The tourism sector is evolving into an advanced industrial sector. Modern tourism presupposes an attractive portfolio of tourist services for a varied set of visitors. Meanwhile, tourism destinations have turned into multifaceted tourist complexes comprising a broad package of amenities that satisfy the needs of a heterogeneous group of clients. Such tourist complexes may be regarded as export-oriented multi-product companies, characterized by spatial and functional market segmentation and by...

  1. Automated Segmentation of Nuclei in Breast Cancer Histopathology Images.

    Science.gov (United States)

    Paramanandam, Maqlin; O'Byrne, Michael; Ghosh, Bidisha; Mammen, Joy John; Manipadam, Marie Therese; Thamburaj, Robinson; Pakrashi, Vikram

    2016-01-01

    The process of Nuclei detection in high-grade breast cancer images is quite challenging in the case of image processing techniques due to certain heterogeneous characteristics of cancer nuclei such as enlarged and irregularly shaped nuclei, highly coarse chromatin marginalized to the nuclei periphery and visible nucleoli. Recent reviews state that existing techniques show appreciable segmentation accuracy on breast histopathology images whose nuclei are dispersed and regular in texture and shape; however, typical cancer nuclei are often clustered and have irregular texture and shape properties. This paper proposes a novel segmentation algorithm for detecting individual nuclei from Hematoxylin and Eosin (H&E) stained breast histopathology images. This detection framework estimates a nuclei saliency map using tensor voting followed by boundary extraction of the nuclei on the saliency map using a Loopy Back Propagation (LBP) algorithm on a Markov Random Field (MRF). The method was tested on both whole-slide images and frames of breast cancer histopathology images. Experimental results demonstrate high segmentation performance with efficient precision, recall and dice-coefficient rates, upon testing high-grade breast cancer images containing several thousand nuclei. In addition to the optimal performance on the highly complex images presented in this paper, this method also gave appreciable results in comparison with two recently published methods-Wienert et al. (2012) and Veta et al. (2013), which were tested using their own datasets.

  2. Automated Segmentation of Nuclei in Breast Cancer Histopathology Images.

    Directory of Open Access Journals (Sweden)

    Maqlin Paramanandam

    Full Text Available The process of Nuclei detection in high-grade breast cancer images is quite challenging in the case of image processing techniques due to certain heterogeneous characteristics of cancer nuclei such as enlarged and irregularly shaped nuclei, highly coarse chromatin marginalized to the nuclei periphery and visible nucleoli. Recent reviews state that existing techniques show appreciable segmentation accuracy on breast histopathology images whose nuclei are dispersed and regular in texture and shape; however, typical cancer nuclei are often clustered and have irregular texture and shape properties. This paper proposes a novel segmentation algorithm for detecting individual nuclei from Hematoxylin and Eosin (H&E stained breast histopathology images. This detection framework estimates a nuclei saliency map using tensor voting followed by boundary extraction of the nuclei on the saliency map using a Loopy Back Propagation (LBP algorithm on a Markov Random Field (MRF. The method was tested on both whole-slide images and frames of breast cancer histopathology images. Experimental results demonstrate high segmentation performance with efficient precision, recall and dice-coefficient rates, upon testing high-grade breast cancer images containing several thousand nuclei. In addition to the optimal performance on the highly complex images presented in this paper, this method also gave appreciable results in comparison with two recently published methods-Wienert et al. (2012 and Veta et al. (2013, which were tested using their own datasets.

  3. Progressive epicardial coronary blood flow reduction fails to produce ST-segment depression at normal heart rates.

    Science.gov (United States)

    de Chantal, Marilyn; Diodati, Jean G; Nasmith, James B; Amyot, Robert; LeBlanc, A Robert; Schampaert, Erick; Pharand, Chantal

    2006-12-01

    ST-segment depression is commonly seen in patients with acute coronary syndromes. Most authors have attributed it to transient reductions in coronary blood flow due to nonocclusive thrombus formation on a disrupted atherosclerotic plaque and dynamic focal vasospasm at the site of coronary artery stenosis. However, ST-segment depression was never reproduced in classic animal models of coronary stenosis without the presence of tachycardia. We hypothesized that ST-segment depression occurring during acute coronary syndromes is not entirely explained by changes in epicardial coronary artery resistance and thus evaluated the effect of a slow, progressive epicardial coronary artery occlusion on the ECG and regional myocardial blood flow in anesthetized pigs. Slow, progressive occlusion over 72 min (SD 27) of the left anterior descending coronary artery in 20 anesthetized pigs led to a 90% decrease in coronary blood flow and the development of ST-segment elevation associated with homogeneous and transmural myocardial blood flow reductions, confirmed by microspheres and myocardial contrast echocardiography. ST-segment depression was not observed in any ECG lead before the development of ST-segment elevation. At normal heart rates, progressive epicardial stenosis of a coronary artery results in myocardial ischemia associated with homogeneous, transmural reduction in regional myocardial blood flow and ST-segment elevation, without preceding ST-segment depression. Thus, in coronary syndromes with ST-segment depression and predominant subendocardial ischemia, factors other than mere increases in epicardial coronary resistance must be invoked to explain the heterogeneous parietal distribution of flow and associated ECG changes.

  4. Platform for Quantitative Evaluation of Spatial Intratumoral Heterogeneity in Multiplexed Fluorescence Images.

    Science.gov (United States)

    Spagnolo, Daniel M; Al-Kofahi, Yousef; Zhu, Peihong; Lezon, Timothy R; Gough, Albert; Stern, Andrew M; Lee, Adrian V; Ginty, Fiona; Sarachan, Brion; Taylor, D Lansing; Chennubhotla, S Chakra

    2017-11-01

    We introduce THRIVE (Tumor Heterogeneity Research Interactive Visualization Environment), an open-source tool developed to assist cancer researchers in interactive hypothesis testing. The focus of this tool is to quantify spatial intratumoral heterogeneity (ITH), and the interactions between different cell phenotypes and noncellular constituents. Specifically, we foresee applications in phenotyping cells within tumor microenvironments, recognizing tumor boundaries, identifying degrees of immune infiltration and epithelial/stromal separation, and identification of heterotypic signaling networks underlying microdomains. The THRIVE platform provides an integrated workflow for analyzing whole-slide immunofluorescence images and tissue microarrays, including algorithms for segmentation, quantification, and heterogeneity analysis. THRIVE promotes flexible deployment, a maintainable code base using open-source libraries, and an extensible framework for customizing algorithms with ease. THRIVE was designed with highly multiplexed immunofluorescence images in mind, and, by providing a platform to efficiently analyze high-dimensional immunofluorescence signals, we hope to advance these data toward mainstream adoption in cancer research. Cancer Res; 77(21); e71-74. ©2017 AACR . ©2017 American Association for Cancer Research.

  5. The genomic distribution of intraspecific and interspecific sequence divergence of human segmental duplications relative to human/chimpanzee chromosomal rearrangements

    Directory of Open Access Journals (Sweden)

    Eichler Evan E

    2008-08-01

    Full Text Available Abstract Background It has been suggested that chromosomal rearrangements harbor the molecular footprint of the biological phenomena which they induce, in the form, for instance, of changes in the sequence divergence rates of linked genes. So far, all the studies of these potential associations have focused on the relationship between structural changes and the rates of evolution of single-copy DNA and have tried to exclude segmental duplications (SDs. This is paradoxical, since SDs are one of the primary forces driving the evolution of structure and function in our genomes and have been linked not only with novel genes acquiring new functions, but also with overall higher DNA sequence divergence and major chromosomal rearrangements. Results Here we take the opposite view and focus on SDs. We analyze several of the features of SDs, including the rates of intraspecific divergence between paralogous copies of human SDs and of interspecific divergence between human SDs and chimpanzee DNA. We study how divergence measures relate to chromosomal rearrangements, while considering other factors that affect evolutionary rates in single copy DNA. Conclusion We find that interspecific SD divergence behaves similarly to divergence of single-copy DNA. In contrast, old and recent paralogous copies of SDs do present different patterns of intraspecific divergence. Also, we show that some relatively recent SDs accumulate in regions that carry inversions in sister lineages.

  6. Performance of Genomic Selection in Mice

    OpenAIRE

    Legarra, Andrés; Robert-Granié, Christèle; Manfredi, Eduardo; Elsen, Jean-Michel

    2008-01-01

    Selection plans in plant and animal breeding are driven by genetic evaluation. Recent developments suggest using massive genetic marker information, known as “genomic selection.” There is little evidence of its performance, though. We empirically compared three strategies for selection: (1) use of pedigree and phenotypic information, (2) use of genomewide markers and phenotypic information, and (3) the combination of both. We analyzed four traits from a heterogeneous mouse population (http://...

  7. Insights into the genome structure and copy-number variation of Eimeria tenella

    Directory of Open Access Journals (Sweden)

    Lim Lik-Sin

    2012-08-01

    Full Text Available Abstract Background Eimeria is a genus of parasites in the same phylum (Apicomplexa as human parasites such as Toxoplasma, Cryptosporidium and the malaria parasite Plasmodium. As an apicomplexan whose life-cycle involves a single host, Eimeria is a convenient model for understanding this group of organisms. Although the genomes of the Apicomplexa are diverse, that of Eimeria is unique in being composed of large alternating blocks of sequence with very different characteristics - an arrangement seen in no other organism. This arrangement has impeded efforts to fully sequence the genome of Eimeria, which remains the last of the major apicomplexans to be fully analyzed. In order to increase the value of the genome sequence data and aid in the effort to gain a better understanding of the Eimeria tenella genome, we constructed a whole genome map for the parasite. Results A total of 1245 contigs representing 70.0% of the whole genome assembly sequences (Wellcome Trust Sanger Institute were selected and subjected to marker selection. Subsequently, 2482 HAPPY markers were developed and typed. Of these, 795 were considered as usable markers, and utilized in the construction of a HAPPY map. Markers developed from chromosomally-assigned genes were then integrated into the HAPPY map and this aided the assignment of a number of linkage groups to their respective chromosomes. BAC-end sequences and contigs from whole genome sequencing were also integrated to improve and validate the HAPPY map. This resulted in an integrated HAPPY map consisting of 60 linkage groups that covers approximately half of the estimated 60 Mb genome. Further analysis suggests that the segmental organization first seen in Chromosome 1 is present throughout the genome, with repeat-poor (P regions alternating with repeat-rich (R regions. Evidence of copy-number variation between strains was also uncovered. Conclusions This paper describes the application of a whole genome mapping

  8. Genome-wide Association for Major Depression Through Age at Onset Stratification: Major Depressive Disorder Working Group of the Psychiatric Genomics Consortium.

    Science.gov (United States)

    Power, Robert A; Tansey, Katherine E; Buttenschøn, Henriette Nørmølle; Cohen-Woods, Sarah; Bigdeli, Tim; Hall, Lynsey S; Kutalik, Zoltán; Lee, S Hong; Ripke, Stephan; Steinberg, Stacy; Teumer, Alexander; Viktorin, Alexander; Wray, Naomi R; Arolt, Volker; Baune, Bernard T; Boomsma, Dorret I; Børglum, Anders D; Byrne, Enda M; Castelao, Enrique; Craddock, Nick; Craig, Ian W; Dannlowski, Udo; Deary, Ian J; Degenhardt, Franziska; Forstner, Andreas J; Gordon, Scott D; Grabe, Hans J; Grove, Jakob; Hamilton, Steven P; Hayward, Caroline; Heath, Andrew C; Hocking, Lynne J; Homuth, Georg; Hottenga, Jouke J; Kloiber, Stefan; Krogh, Jesper; Landén, Mikael; Lang, Maren; Levinson, Douglas F; Lichtenstein, Paul; Lucae, Susanne; MacIntyre, Donald J; Madden, Pamela; Magnusson, Patrik K E; Martin, Nicholas G; McIntosh, Andrew M; Middeldorp, Christel M; Milaneschi, Yuri; Montgomery, Grant W; Mors, Ole; Müller-Myhsok, Bertram; Nyholt, Dale R; Oskarsson, Hogni; Owen, Michael J; Padmanabhan, Sandosh; Penninx, Brenda W J H; Pergadia, Michele L; Porteous, David J; Potash, James B; Preisig, Martin; Rivera, Margarita; Shi, Jianxin; Shyn, Stanley I; Sigurdsson, Engilbert; Smit, Johannes H; Smith, Blair H; Stefansson, Hreinn; Stefansson, Kari; Strohmaier, Jana; Sullivan, Patrick F; Thomson, Pippa; Thorgeirsson, Thorgeir E; Van der Auwera, Sandra; Weissman, Myrna M; Breen, Gerome; Lewis, Cathryn M

    2017-02-15

    Major depressive disorder (MDD) is a disabling mood disorder, and despite a known heritable component, a large meta-analysis of genome-wide association studies revealed no replicable genetic risk variants. Given prior evidence of heterogeneity by age at onset in MDD, we tested whether genome-wide significant risk variants for MDD could be identified in cases subdivided by age at onset. Discovery case-control genome-wide association studies were performed where cases were stratified using increasing/decreasing age-at-onset cutoffs; significant single nucleotide polymorphisms were tested in nine independent replication samples, giving a total sample of 22,158 cases and 133,749 control subjects for subsetting. Polygenic score analysis was used to examine whether differences in shared genetic risk exists between earlier and adult-onset MDD with commonly comorbid disorders of schizophrenia, bipolar disorder, Alzheimer's disease, and coronary artery disease. We identified one replicated genome-wide significant locus associated with adult-onset (>27 years) MDD (rs7647854, odds ratio: 1.16, 95% confidence interval: 1.11-1.21, p = 5.2 × 10 -11 ). Using polygenic score analyses, we show that earlier-onset MDD is genetically more similar to schizophrenia and bipolar disorder than adult-onset MDD. We demonstrate that using additional phenotype data previously collected by genetic studies to tackle phenotypic heterogeneity in MDD can successfully lead to the discovery of genetic risk factor despite reduced sample size. Furthermore, our results suggest that the genetic susceptibility to MDD differs between adult- and earlier-onset MDD, with earlier-onset cases having a greater genetic overlap with schizophrenia and bipolar disorder. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  9. Brookhaven segment interconnect

    International Nuclear Information System (INIS)

    Morse, W.M.; Benenson, G.; Leipuner, L.B.

    1983-01-01

    We have performed a high energy physics experiment using a multisegment Brookhaven FASTBUS system. The system was composed of three crate segments and two cable segments. We discuss the segment interconnect module which permits communication between the various segments

  10. Employing Measures of Heterogeneity and an Object-Based Approach to Extrapolate Tree Species Distribution Data

    Directory of Open Access Journals (Sweden)

    Trevor G. Jones

    2014-07-01

    Full Text Available Information derived from high spatial resolution remotely sensed data is critical for the effective management of forested ecosystems. However, high spatial resolution data-sets are typically costly to acquire and process and usually provide limited geographic coverage. In contrast, moderate spatial resolution remotely sensed data, while not able to provide the spectral or spatial detail required for certain types of products and applications, offer inexpensive, comprehensive landscape-level coverage. This study assessed using an object-based approach to extrapolate detailed tree species heterogeneity beyond the extent of hyperspectral/LiDAR flightlines to the broader area covered by a Landsat scene. Using image segments, regression trees established ecologically decipherable relationships between tree species heterogeneity and the spectral properties of Landsat segments. The spectral properties of Landsat bands 4 (i.e., NIR: 0.76–0.90 µm, 5 (i.e., SWIR: 1.55–1.75 µm and 7 (SWIR: 2.08–2.35 µm were consistently selected as predictor variables, explaining approximately 50% of variance in richness and diversity. Results have important ramifications for ongoing management initiatives in the study area and are applicable to wide range of applications.

  11. Involvement of Disperse Repetitive Sequences in Wheat/Rye Genome Adjustment

    Directory of Open Access Journals (Sweden)

    Manuela Silva

    2012-07-01

    Full Text Available The union of different genomes in the same nucleus frequently results in hybrid genotypes with improved genome plasticity related to both genome remodeling events and changes in gene expression. Most modern cereal crops are polyploid species. Triticale, synthesized by the cross between wheat and rye, constitutes an excellent model to study polyploidization functional implications. We intend to attain a deeper knowledge of dispersed repetitive sequence involvement in parental genome reshuffle in triticale and in wheat-rye addition lines that have the entire wheat genome plus each rye chromosome pair. Through Random Amplified Polymorphic DNA (RAPD analysis with OPH20 10-mer primer we unraveled clear alterations corresponding to the loss of specific bands from both parental genomes. Moreover, the sequential nature of those events was revealed by the increased absence of rye-origin bands in wheat-rye addition lines in comparison with triticale. Remodeled band sequencing revealed that both repetitive and coding genome domains are affected in wheat-rye hybrid genotypes. Additionally, the amplification and sequencing of pSc20H internal segments showed that the disappearance of parental bands may result from restricted sequence alterations and unraveled the involvement of wheat/rye related repetitive sequences in genome adjustment needed for hybrid plant stabilization.

  12. Genomic prediction based on data from three layer lines using non-linear regression models.

    Science.gov (United States)

    Huang, Heyun; Windig, Jack J; Vereijken, Addie; Calus, Mario P L

    2014-11-06

    Most studies on genomic prediction with reference populations that include multiple lines or breeds have used linear models. Data heterogeneity due to using multiple populations may conflict with model assumptions used in linear regression methods. In an attempt to alleviate potential discrepancies between assumptions of linear models and multi-population data, two types of alternative models were used: (1) a multi-trait genomic best linear unbiased prediction (GBLUP) model that modelled trait by line combinations as separate but correlated traits and (2) non-linear models based on kernel learning. These models were compared to conventional linear models for genomic prediction for two lines of brown layer hens (B1 and B2) and one line of white hens (W1). The three lines each had 1004 to 1023 training and 238 to 240 validation animals. Prediction accuracy was evaluated by estimating the correlation between observed phenotypes and predicted breeding values. When the training dataset included only data from the evaluated line, non-linear models yielded at best a similar accuracy as linear models. In some cases, when adding a distantly related line, the linear models showed a slight decrease in performance, while non-linear models generally showed no change in accuracy. When only information from a closely related line was used for training, linear models and non-linear radial basis function (RBF) kernel models performed similarly. The multi-trait GBLUP model took advantage of the estimated genetic correlations between the lines. Combining linear and non-linear models improved the accuracy of multi-line genomic prediction. Linear models and non-linear RBF models performed very similarly for genomic prediction, despite the expectation that non-linear models could deal better with the heterogeneous multi-population data. This heterogeneity of the data can be overcome by modelling trait by line combinations as separate but correlated traits, which avoids the occasional

  13. Mobility Behavior of the Elderly: an attitude-based segmentation approach for a heterogeneous target group

    DEFF Research Database (Denmark)

    Haustein, Sonja

    2012-01-01

    The western population is ageing. Based on the assumption that the elderly are a quite heterogeneous population group with an increasing impact on the transport system, mobility types of the elderly were identified. By means of 1,500 standardized telephone interviews, mobility behavior and possib...... of the diverse lifestyles, attitudes, travel behavior and needs of the elderly. Furthermore, it identifies starting points for the reduction of car use....

  14. Isolation of a Genomic Region Affecting Most Components of Metabolic Syndrome in a Chromosome-16 Congenic Rat Model.

    Directory of Open Access Journals (Sweden)

    Lucie Šedová

    Full Text Available Metabolic syndrome is a highly prevalent human disease with substantial genomic and environmental components. Previous studies indicate the presence of significant genetic determinants of several features of metabolic syndrome on rat chromosome 16 (RNO16 and the syntenic regions of human genome. We derived the SHR.BN16 congenic strain by introgression of a limited RNO16 region from the Brown Norway congenic strain (BN-Lx into the genomic background of the spontaneously hypertensive rat (SHR strain. We compared the morphometric, metabolic, and hemodynamic profiles of adult male SHR and SHR.BN16 rats. We also compared in silico the DNA sequences for the differential segment in the BN-Lx and SHR parental strains. SHR.BN16 congenic rats had significantly lower weight, decreased concentrations of total triglycerides and cholesterol, and improved glucose tolerance compared with SHR rats. The concentrations of insulin, free fatty acids, and adiponectin were comparable between the two strains. SHR.BN16 rats had significantly lower systolic (18-28 mmHg difference and diastolic (10-15 mmHg difference blood pressure throughout the experiment (repeated-measures ANOVA, P < 0.001. The differential segment spans approximately 22 Mb of the telomeric part of the short arm of RNO16. The in silico analyses revealed over 1200 DNA variants between the BN-Lx and SHR genomes in the SHR.BN16 differential segment, 44 of which lead to missense mutations, and only eight of which (in Asb14, Il17rd, Itih1, Syt15, Ercc6, RGD1564958, Tmem161a, and Gatad2a genes are predicted to be damaging to the protein product. Furthermore, a number of genes within the RNO16 differential segment associated with metabolic syndrome components in human studies showed polymorphisms between SHR and BN-Lx (including Lpl, Nrg3, Pbx4, Cilp2, and Stab1. Our novel congenic rat model demonstrates that a limited genomic region on RNO16 in the SHR significantly affects many of the features of metabolic

  15. Advances in Genomics of Entomopathogenic Fungi.

    Science.gov (United States)

    Wang, J B; St Leger, R J; Wang, C

    2016-01-01

    Fungi are the commonest pathogens of insects and crucial regulators of insect populations. The rapid advance of genome technologies has revolutionized our understanding of entomopathogenic fungi with multiple Metarhizium spp. sequenced, as well as Beauveria bassiana, Cordyceps militaris, and Ophiocordyceps sinensis among others. Phylogenomic analysis suggests that the ancestors of many of these fungi were plant endophytes or pathogens, with entomopathogenicity being an acquired characteristic. These fungi now occupy a wide range of habitats and hosts, and their genomes have provided a wealth of information on the evolution of virulence-related characteristics, as well as the protein families and genomic structure associated with ecological and econutritional heterogeneity, genome evolution, and host range diversification. In particular, their evolutionary transition from plant pathogens or endophytes to insect pathogens provides a novel perspective on how new functional mechanisms important for host switching and virulence are acquired. Importantly, genomic resources have helped make entomopathogenic fungi ideal model systems for answering basic questions in parasitology, entomology, and speciation. At the same time, identifying the selective forces that act upon entomopathogen fitness traits could underpin both the development of new mycoinsecticides and further our understanding of the natural roles of these fungi in nature. These roles frequently include mutualistic relationships with plants. Genomics has also facilitated the rapid identification of genes encoding biologically useful molecules, with implications for the development of pharmaceuticals and the use of these fungi as bioreactors. Copyright © 2016 Elsevier Inc. All rights reserved.

  16. A novel genome-information content-based statistic for genome-wide association analysis designed for next-generation sequencing data.

    Science.gov (United States)

    Luo, Li; Zhu, Yun; Xiong, Momiao

    2012-06-01

    The genome-wide association studies (GWAS) designed for next-generation sequencing data involve testing association of genomic variants, including common, low frequency, and rare variants. The current strategies for association studies are well developed for identifying association of common variants with the common diseases, but may be ill-suited when large amounts of allelic heterogeneity are present in sequence data. Recently, group tests that analyze their collective frequency differences between cases and controls shift the current variant-by-variant analysis paradigm for GWAS of common variants to the collective test of multiple variants in the association analysis of rare variants. However, group tests ignore differences in genetic effects among SNPs at different genomic locations. As an alternative to group tests, we developed a novel genome-information content-based statistics for testing association of the entire allele frequency spectrum of genomic variation with the diseases. To evaluate the performance of the proposed statistics, we use large-scale simulations based on whole genome low coverage pilot data in the 1000 Genomes Project to calculate the type 1 error rates and power of seven alternative statistics: a genome-information content-based statistic, the generalized T(2), collapsing method, multivariate and collapsing (CMC) method, individual χ(2) test, weighted-sum statistic, and variable threshold statistic. Finally, we apply the seven statistics to published resequencing dataset from ANGPTL3, ANGPTL4, ANGPTL5, and ANGPTL6 genes in the Dallas Heart Study. We report that the genome-information content-based statistic has significantly improved type 1 error rates and higher power than the other six statistics in both simulated and empirical datasets.

  17. CoCoNUT: an efficient system for the comparison and analysis of genomes

    Directory of Open Access Journals (Sweden)

    Kurtz Stefan

    2008-11-01

    Full Text Available Abstract Background Comparative genomics is the analysis and comparison of genomes from different species. This area of research is driven by the large number of sequenced genomes and heavily relies on efficient algorithms and software to perform pairwise and multiple genome comparisons. Results Most of the software tools available are tailored for one specific task. In contrast, we have developed a novel system CoCoNUT (Computational Comparative geNomics Utility Toolkit that allows solving several different tasks in a unified framework: (1 finding regions of high similarity among multiple genomic sequences and aligning them, (2 comparing two draft or multi-chromosomal genomes, (3 locating large segmental duplications in large genomic sequences, and (4 mapping cDNA/EST to genomic sequences. Conclusion CoCoNUT is competitive with other software tools w.r.t. the quality of the results. The use of state of the art algorithms and data structures allows CoCoNUT to solve comparative genomics tasks more efficiently than previous tools. With the improved user interface (including an interactive visualization component, CoCoNUT provides a unified, versatile, and easy-to-use software tool for large scale studies in comparative genomics.

  18. Active Segmentation.

    Science.gov (United States)

    Mishra, Ajay; Aloimonos, Yiannis

    2009-01-01

    The human visual system observes and understands a scene/image by making a series of fixations. Every fixation point lies inside a particular region of arbitrary shape and size in the scene which can either be an object or just a part of it. We define as a basic segmentation problem the task of segmenting that region containing the fixation point. Segmenting the region containing the fixation is equivalent to finding the enclosing contour- a connected set of boundary edge fragments in the edge map of the scene - around the fixation. This enclosing contour should be a depth boundary.We present here a novel algorithm that finds this bounding contour and achieves the segmentation of one object, given the fixation. The proposed segmentation framework combines monocular cues (color/intensity/texture) with stereo and/or motion, in a cue independent manner. The semantic robots of the immediate future will be able to use this algorithm to automatically find objects in any environment. The capability of automatically segmenting objects in their visual field can bring the visual processing to the next level. Our approach is different from current approaches. While existing work attempts to segment the whole scene at once into many areas, we segment only one image region, specifically the one containing the fixation point. Experiments with real imagery collected by our active robot and from the known databases 1 demonstrate the promise of the approach.

  19. Quantification of within-sample genetic heterogeneity from SNP-array data

    DEFF Research Database (Denmark)

    Martinez, Pierre; Kimberley, Christopher; Birkbak, Nicolai Juul

    2017-01-01

    Intra-tumour genetic heterogeneity (ITH) fosters drug resistance and is a critical hurdle to clinical treatment. ITH can be well-measured using multi-region sampling but this is costly and challenging to implement. There is therefore a need for tools to estimate ITH in individual samples, using...... standard genomic data such as SNP-arrays, that could be implemented routinely. We designed two novel scores S and R, respectively based on the Shannon diversity index and Ripley's L statistic of spatial homogeneity, to quantify ITH in single SNP-array samples. We created in-silico and in-vitro mixtures...... sequencing data but heterogeneity in the fraction of tumour cells present across samples hampered accurate quantification. The prognostic potential of both scores was moderate but significantly predictive of survival in several tumour types (corrected p = 0.03). Our work thus shows how individual SNP...

  20. Intertumoral Heterogeneity within Medulloblastoma Subgroups.

    Science.gov (United States)

    Cavalli, Florence M G; Remke, Marc; Rampasek, Ladislav; Peacock, John; Shih, David J H; Luu, Betty; Garzia, Livia; Torchia, Jonathon; Nor, Carolina; Morrissy, A Sorana; Agnihotri, Sameer; Thompson, Yuan Yao; Kuzan-Fischer, Claudia M; Farooq, Hamza; Isaev, Keren; Daniels, Craig; Cho, Byung-Kyu; Kim, Seung-Ki; Wang, Kyu-Chang; Lee, Ji Yeoun; Grajkowska, Wieslawa A; Perek-Polnik, Marta; Vasiljevic, Alexandre; Faure-Conter, Cecile; Jouvet, Anne; Giannini, Caterina; Nageswara Rao, Amulya A; Li, Kay Ka Wai; Ng, Ho-Keung; Eberhart, Charles G; Pollack, Ian F; Hamilton, Ronald L; Gillespie, G Yancey; Olson, James M; Leary, Sarah; Weiss, William A; Lach, Boleslaw; Chambless, Lola B; Thompson, Reid C; Cooper, Michael K; Vibhakar, Rajeev; Hauser, Peter; van Veelen, Marie-Lise C; Kros, Johan M; French, Pim J; Ra, Young Shin; Kumabe, Toshihiro; López-Aguilar, Enrique; Zitterbart, Karel; Sterba, Jaroslav; Finocchiaro, Gaetano; Massimino, Maura; Van Meir, Erwin G; Osuka, Satoru; Shofuda, Tomoko; Klekner, Almos; Zollo, Massimo; Leonard, Jeffrey R; Rubin, Joshua B; Jabado, Nada; Albrecht, Steffen; Mora, Jaume; Van Meter, Timothy E; Jung, Shin; Moore, Andrew S; Hallahan, Andrew R; Chan, Jennifer A; Tirapelli, Daniela P C; Carlotti, Carlos G; Fouladi, Maryam; Pimentel, José; Faria, Claudia C; Saad, Ali G; Massimi, Luca; Liau, Linda M; Wheeler, Helen; Nakamura, Hideo; Elbabaa, Samer K; Perezpeña-Diazconti, Mario; Chico Ponce de León, Fernando; Robinson, Shenandoah; Zapotocky, Michal; Lassaletta, Alvaro; Huang, Annie; Hawkins, Cynthia E; Tabori, Uri; Bouffet, Eric; Bartels, Ute; Dirks, Peter B; Rutka, James T; Bader, Gary D; Reimand, Jüri; Goldenberg, Anna; Ramaswamy, Vijay; Taylor, Michael D

    2017-06-12

    While molecular subgrouping has revolutionized medulloblastoma classification, the extent of heterogeneity within subgroups is unknown. Similarity network fusion (SNF) applied to genome-wide DNA methylation and gene expression data across 763 primary samples identifies very homogeneous clusters of patients, supporting the presence of medulloblastoma subtypes. After integration of somatic copy-number alterations, and clinical features specific to each cluster, we identify 12 different subtypes of medulloblastoma. Integrative analysis using SNF further delineates group 3 from group 4 medulloblastoma, which is not as readily apparent through analyses of individual data types. Two clear subtypes of infants with Sonic Hedgehog medulloblastoma with disparate outcomes and biology are identified. Medulloblastoma subtypes identified through integrative clustering have important implications for stratification of future clinical trials. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. A Regression-based K nearest neighbor algorithm for gene function prediction from heterogeneous data

    Directory of Open Access Journals (Sweden)

    Ruzzo Walter L

    2006-03-01

    Full Text Available Abstract Background As a variety of functional genomic and proteomic techniques become available, there is an increasing need for functional analysis methodologies that integrate heterogeneous data sources. Methods In this paper, we address this issue by proposing a general framework for gene function prediction based on the k-nearest-neighbor (KNN algorithm. The choice of KNN is motivated by its simplicity, flexibility to incorporate different data types and adaptability to irregular feature spaces. A weakness of traditional KNN methods, especially when handling heterogeneous data, is that performance is subject to the often ad hoc choice of similarity metric. To address this weakness, we apply regression methods to infer a similarity metric as a weighted combination of a set of base similarity measures, which helps to locate the neighbors that are most likely to be in the same class as the target gene. We also suggest a novel voting scheme to generate confidence scores that estimate the accuracy of predictions. The method gracefully extends to multi-way classification problems. Results We apply this technique to gene function prediction according to three well-known Escherichia coli classification schemes suggested by biologists, using information derived from microarray and genome sequencing data. We demonstrate that our algorithm dramatically outperforms the naive KNN methods and is competitive with support vector machine (SVM algorithms for integrating heterogenous data. We also show that by combining different data sources, prediction accuracy can improve significantly. Conclusion Our extension of KNN with automatic feature weighting, multi-class prediction, and probabilistic inference, enhance prediction accuracy significantly while remaining efficient, intuitive and flexible. This general framework can also be applied to similar classification problems involving heterogeneous datasets.

  2. Clinical and pathological heterogeneity of cervical intraepithelial neoplasia grade 3.

    Directory of Open Access Journals (Sweden)

    Hannah P Yang

    Full Text Available Cervical intraepithelial neoplasia grade 3 (CIN3, the immediate cervical cancer precursor, is a target of cervical cancer prevention. However, less than half of CIN3s will progress to cancer. Routine treatment of all CIN3s and the majority of CIN2s may lead to overtreatment of many lesions that would not progress. To improve our understanding of CIN3 natural history, we performed a detailed characterization of CIN3 heterogeneity in a large referral population in the US.We examined 309 CIN3 cases in the SUCCEED, a large population-based study of women with abnormal cervical cancer screening results. Histology information for 12 individual loop electrosurgical excision procedure (LEEP segments was evaluated for each woman. We performed case-case comparisons of CIN3s to analyze determinants of heterogeneity and screening test performance.CIN3 cases varied substantially by size (1-10 LEEP segments and by presentation with concomitant CIN2 and CIN1. All grades of CINs were equally distributed over the cervical surface. In half of the women, CIN3 lesions were found as multiple distinct lesions on the cervix. Women with large and solitary CIN3 lesions were more likely to be older, have longer sexual activity span, and have fewer multiple high risk HPV infections. Screening frequency, but not HPV16 positivity, was an important predictor of CIN3 size. Large CIN3 lesions were also characterized by high-grade clinical test results.We demonstrate substantial heterogeneity in clinical and pathological presentation of CIN3 in a US population. Time since sexual debut and participation in screening were predictors of CIN3 size. We did not observe a preferential site of CIN3 on the cervical surface that could serve as a target for cervical biopsy. Cervical cancer screening procedures were more likely to detect larger CIN3s, suggesting that CIN3s detected by multiple independent diagnostic tests may represent cases with increased risk of invasion.

  3. Partitioning of genetic variation between regulatory and coding gene segments: the predominance of software variation in genes encoding introvert proteins.

    Science.gov (United States)

    Mitchison, A

    1997-01-01

    In considering genetic variation in eukaryotes, a fundamental distinction can be made between variation in regulatory (software) and coding (hardware) gene segments. For quantitative traits the bulk of variation, particularly that near the population mean, appears to reside in regulatory segments. The main exceptions to this rule concern proteins which handle extrinsic substances, here termed extrovert proteins. The immune system includes an unusually large proportion of this exceptional category, but even so its chief source of variation may well be polymorphism in regulatory gene segments. The main evidence for this view emerges from genome scanning for quantitative trait loci (QTL), which in the case of the immune system points to a major contribution of pro-inflammatory cytokine genes. Further support comes from sequencing of major histocompatibility complex (Mhc) class II promoters, where a high level of polymorphism has been detected. These Mhc promoters appear to act, in part at least, by gating the back-signal from T cells into antigen-presenting cells. Both these forms of polymorphism are likely to be sustained by the need for flexibility in the immune response. Future work on promoter polymorphism is likely to benefit from the input from genome informatics.

  4. GeoSegmenter: A statistically learned Chinese word segmenter for the geoscience domain

    Science.gov (United States)

    Huang, Lan; Du, Youfu; Chen, Gongyang

    2015-03-01

    Unlike English, the Chinese language has no space between words. Segmenting texts into words, known as the Chinese word segmentation (CWS) problem, thus becomes a fundamental issue for processing Chinese documents and the first step in many text mining applications, including information retrieval, machine translation and knowledge acquisition. However, for the geoscience subject domain, the CWS problem remains unsolved. Although a generic segmenter can be applied to process geoscience documents, they lack the domain specific knowledge and consequently their segmentation accuracy drops dramatically. This motivated us to develop a segmenter specifically for the geoscience subject domain: the GeoSegmenter. We first proposed a generic two-step framework for domain specific CWS. Following this framework, we built GeoSegmenter using conditional random fields, a principled statistical framework for sequence learning. Specifically, GeoSegmenter first identifies general terms by using a generic baseline segmenter. Then it recognises geoscience terms by learning and applying a model that can transform the initial segmentation into the goal segmentation. Empirical experimental results on geoscience documents and benchmark datasets showed that GeoSegmenter could effectively recognise both geoscience terms and general terms.

  5. Biomimetic heterogenous elastic tissue development.

    Science.gov (United States)

    Tsai, Kai Jen; Dixon, Simon; Hale, Luke Richard; Darbyshire, Arnold; Martin, Daniel; de Mel, Achala

    2017-01-01

    There is an unmet need for artificial tissue to address current limitations with donor organs and problems with donor site morbidity. Despite the success with sophisticated tissue engineering endeavours, which employ cells as building blocks, they are limited to dedicated labs suitable for cell culture, with associated high costs and long tissue maturation times before available for clinical use. Direct 3D printing presents rapid, bespoke, acellular solutions for skull and bone repair or replacement, and can potentially address the need for elastic tissue, which is a major constituent of smooth muscle, cartilage, ligaments and connective tissue that support organs. Thermoplastic polyurethanes are one of the most versatile elastomeric polymers. Their segmented block copolymeric nature, comprising of hard and soft segments allows for an almost limitless potential to control physical properties and mechanical behaviour. Here we show direct 3D printing of biocompatible thermoplastic polyurethanes with Fused Deposition Modelling, with a view to presenting cell independent in-situ tissue substitutes. This method can expeditiously and economically produce heterogenous, biomimetic elastic tissue substitutes with controlled porosity to potentially facilitate vascularisation. The flexibility of this application is shown here with tubular constructs as exemplars. We demonstrate how these 3D printed constructs can be post-processed to incorporate bioactive molecules. This efficacious strategy, when combined with the privileges of digital healthcare, can be used to produce bespoke elastic tissue substitutes in-situ, independent of extensive cell culture and may be developed as a point-of-care therapy approach.

  6. Single-segment and double-segment INTACS for post-LASIK ectasia.

    Directory of Open Access Journals (Sweden)

    Hassan Hashemi

    2014-09-01

    Full Text Available The objective of the present study was to compare single segment and double segment INTACS rings in the treatment of post-LASIK ectasia. In this interventional study, 26 eyes with post-LASIK ectasia were assessed. Ectasia was defined as progressive myopia regardless of astigmatism, along with topographic evidence of inferior steepening of the cornea after LASIK. We excluded those with a history of intraocular surgery, certain eye conditions, and immune disorders, as well as monocular, pregnant and lactating patients. A total of 11 eyes had double ring and 15 eyes had single ring implantation. Visual and refractive outcomes were compared with preoperative values based on the number of implanted INTACS rings. Pre and postoperative spherical equivalent were -3.92 and -2.29 diopter (P=0.007. The spherical equivalent decreased by 1 ± 3.2 diopter in the single-segment group and 2.56 ± 1.58 diopter in the double-segment group (P=0.165. Mean preoperative astigmatism was 2.38 ± 1.93 diopter which decreased to 2.14 ± 1.1 diopter after surgery (P=0.508; 0.87 ± 1.98 diopter decrease in the single-segment group and 0.67 ± 1.2 diopter increase in the double-segment group (P=0.025. Nineteen patients (75% gained one or two lines, and only three, who were all in the double-segment group, lost one or two lines of best corrected visual acuity. The spherical equivalent and vision significantly decreased in all patients. In these post-LASIK ectasia patients, the spherical equivalent was corrected better with two segments compared to single segment implantation; nonetheless, the level of astigmatism in the single-segment group was significantly better than that in the double-segment group.

  7. Sequence analysis of the whole genomes of five African human G9 rotavirus strains.

    Science.gov (United States)

    Nyaga, Martin M; Jere, Khuzwayo C; Peenze, Ina; Mlera, Luwanika; van Dijk, Alberdina A; Seheri, Mapaseka L; Mphahlele, M Jeffrey

    2013-06-01

    The G9 rotaviruses are amongst the most common global rotavirus strains causing severe childhood diarrhoea. However, the whole genomes of only a few G9 rotaviruses have been fully sequenced and characterised of which only one G9P[6] and one G9P[8] are from Africa. We determined the consensus sequence of the whole genomes of five African human group A G9 rotavirus strains, four G9P[8] strains and one G9P[6] strain collected in Cameroon (central Africa), Kenya (eastern Africa), South Africa and Zimbabwe (southern Africa) in 1999, 2009 and 2010. Strain RVA/Human-wt/ZWE/MRC-DPRU1723/2009/G9P[8] from Zimbabwe, RVA/Human-wt/ZAF/MRC-DPRU4677/2010/G9P[8] from South Africa, RVA/Human-wt/CMR/1424/2009/G9P[8] from Cameroon and RVA/Human-wt/KEN/MRC-DPRU2427/2010/G9P[8] from Kenya were on a Wa-like genetic backbone and were genotyped as G9-P[8]-I1-R1-C1-M1-A1-N1-T1-E1-H1. Strain RVA/Human-wt/ZAF/MRC-DPRU9317/1999/G9P[6] from South Africa was genotyped as G9-P[6]-I2-R2-C2-M2-A2-N1-T2-E2-H2. Rotavirus A strain MRC-DPRU9317 is the second G9 strain to be reported on a DS-1-like genetic backbone, the other being RVA/Human-wt/ZAF/GR10924/1999/G9P[6]. MRC-DPRU9317 was found to be a reassortant between DS-1-like (I2, R2, C2, M2, A2, T2, E2 and H2) and Wa-like (N1) genome segments. All the genome segments of the five strains grouped strictly according to their genotype Wa- or DS-1-like clusters. Within their respective genotypes, the genome segments of the three G9 study strains from southern Africa clustered most closely with rotaviruses from the same geographical origin and with those with the same G and P types. The highest nucleotide identity of genome segments of the study strains from eastern and central Africa regions on a Wa-like backbone was not limited to rotaviruses with G9P[8] genotypes only, they were also closely related to G12P[6], G8P[8], G1P[8] and G11P[25] rotaviruses, indicating a close inter-genotype relationship between the G9 and other rotavirus genotypes

  8. Synonymous Codon Usage Analysis of Thirty Two Mycobacteriophage Genomes

    Directory of Open Access Journals (Sweden)

    Sameer Hassan

    2009-01-01

    Full Text Available Synonymous codon usage of protein coding genes of thirty two completely sequenced mycobacteriophage genomes was studied using multivariate statistical analysis. One of the major factors influencing codon usage is identified to be compositional bias. Codons ending with either C or G are preferred in highly expressed genes among which C ending codons are highly preferred over G ending codons. A strong negative correlation between effective number of codons (Nc and GC3s content was also observed, showing that the codon usage was effected by gene nucleotide composition. Translational selection is also identified to play a role in shaping the codon usage operative at the level of translational accuracy. High level of heterogeneity is seen among and between the genomes. Length of genes is also identified to influence the codon usage in 11 out of 32 phage genomes. Mycobacteriophage Cooper is identified to be the highly biased genome with better translation efficiency comparing well with the host specific tRNA genes.

  9. Background based Gaussian mixture model lesion segmentation in PET

    Energy Technology Data Exchange (ETDEWEB)

    Soffientini, Chiara Dolores, E-mail: chiaradolores.soffientini@polimi.it; Baselli, Giuseppe [DEIB, Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Piazza Leonardo da Vinci 32, Milan 20133 (Italy); De Bernardi, Elisabetta [Department of Medicine and Surgery, Tecnomed Foundation, University of Milano—Bicocca, Monza 20900 (Italy); Zito, Felicia; Castellani, Massimo [Nuclear Medicine Department, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, via Francesco Sforza 35, Milan 20122 (Italy)

    2016-05-15

    demonstrated. The inclusion of the spatial prior improved segmentation accuracy only for lesions surrounded by heterogeneous background: in the relevant simulation subset, the median VE significantly decreased from 13% to 7%. Results on clinical data were found in accordance with simulations, with absolute VE <7%, Dice >0.85, CE <0.30, and HD <0.81. Conclusions: The sole introduction of constraints based on background modeling outperformed standard GMM and the other tested algorithms. Insertion of a spatial prior improved the accuracy for realistic cases of objects in heterogeneous backgrounds. Moreover, robustness against initialization supports the applicability in a clinical setting. In conclusion, application-driven constraints can generally improve the capabilities of GMM and statistical clustering algorithms.

  10. Comparison of fish assemblages in two disjoined segments of an oxbow lake in relation to connectivity

    Science.gov (United States)

    Dembkowski, Daniel J.; Miranda, Leandro E.

    2011-01-01

    Disconnection between adjacent habitat patches is one of the most notable factors contributing to the decreased biotic integrity of global ecosystems. Connectivity is especially threatened in river–floodplain ecosystems in which channel modifications have disrupted the lateral links between the main river channel and floodplain lakes. In this study, we examined the interaction between the interconnectedness of floodplain lakes and main river channels and fish assemblage descriptors. Fish assemblages in two segments of an oxbow lake, one connected to and the other isolated from the Yazoo River, Mississippi, were estimated with daytime boat electrofishing during 2007–2010. The frequency of connection for the connected segment ranged from zero to seven individual events per year (mean, ∼2). The timing of most connection events reflected regional precipitation patterns. Greater species richness, diversity, and evenness were observed in the connected segment. Additionally, the connected segment had a greater abundance of piscivores and periodic life history strategists. All fishes collected solely in the connected segment were typically riverine in nature, whereas fishes collected only in the disconnected segment were more lacustrine adapted. These results suggest that periodic connection and the associated habitat heterogeneity that it provides are important for maintaining fish species richness and diversity in large-river floodplain lakes. We suggest that maintenance or restoration of connection be an integral part of fluvial ecosystem management plans.

  11. Initial assessment of a model relating intratumoral genetic heterogeneity to radiological morphology

    Science.gov (United States)

    Noterdaeme, O; Kelly, M; Friend, P; Soonowalla, Z; Steers, G; Brady, M

    2010-01-01

    Tumour heterogeneity has major implications for tumour development and response to therapy. Tumour heterogeneity results from mutations in the genes responsible for mismatch repair or maintenance of chromosomal stability. Cells with different genetic properties may grow at different rates and exhibit different resistance to therapeutic interventions. To date, there exists no approach to non-invasively assess tumour heterogeneity. Here we present a biologically inspired model of tumour growth, which relates intratumoral genetic heterogeneity to gross morphology visible on radiological images. The model represents the development of a tumour as a set of expanding spheres, each sphere representing a distinct clonal centre, with the sprouting of new spheres corresponding to new clonal centres. Each clonal centre may possess different characteristics relating to genetic composition, growth rate and response to treatment. We present a clinical example for which the model accurately tracks tumour growth and shows the correspondence to genetic variation (as determined by array comparative genomic hybridisation). One clinical implication of our work is that the assessment of heterogeneous tumours using Response Evaluation Criteria In Solid Tumours (RECIST) or volume measurements may not accurately reflect tumour growth, stability or the response to treatment. We believe that this is the first model linking the macro-scale appearance of tumours to their genetic composition. We anticipate that our model will provide a more informative way to assess the response of heterogeneous tumours to treatment, which is of increasing importance with the development of novel targeted anti-cancer treatments. PMID:19690073

  12. Distribution of segmental duplications in the context of higher order chromatin organisation of human chromosome 7

    DEFF Research Database (Denmark)

    Ebert, Grit; Steininger, Anne; Weißmann, Robert

    2014-01-01

    of the Williams-Beuren syndrome locus we demonstrate by cross-species comparison that these SDs have inserted at the borders of a topological domain and that they flank regions with distinct DNA conformation. CONCLUSIONS: Our study suggests a link of nuclear architecture and the propagation of SDs across......BACKGROUND: Segmental duplications (SDs) are not evenly distributed along chromosomes. The reasons for this biased susceptibility to SD insertion are poorly understood. Accumulation of SDs is associated with increased genomic instability, which can lead to structural variants and genomic disorders...... chromosome 7, either by promoting regional SD insertion or by contributing to the establishment of higher order chromatin organisation themselves. The latter could compensate for the high risk of structural rearrangements and thus may have contributed to their evolutionary fixation in the human genome....

  13. SU-E-J-128: Two-Stage Atlas Selection in Multi-Atlas-Based Image Segmentation

    International Nuclear Information System (INIS)

    Zhao, T; Ruan, D

    2015-01-01

    Purpose: In the new era of big data, multi-atlas-based image segmentation is challenged by heterogeneous atlas quality and high computation burden from extensive atlas collection, demanding efficient identification of the most relevant atlases. This study aims to develop a two-stage atlas selection scheme to achieve computational economy with performance guarantee. Methods: We develop a low-cost fusion set selection scheme by introducing a preliminary selection to trim full atlas collection into an augmented subset, alleviating the need for extensive full-fledged registrations. More specifically, fusion set selection is performed in two successive steps: preliminary selection and refinement. An augmented subset is first roughly selected from the whole atlas collection with a simple registration scheme and the corresponding preliminary relevance metric; the augmented subset is further refined into the desired fusion set size, using full-fledged registration and the associated relevance metric. The main novelty of this work is the introduction of an inference model to relate the preliminary and refined relevance metrics, based on which the augmented subset size is rigorously derived to ensure the desired atlases survive the preliminary selection with high probability. Results: The performance and complexity of the proposed two-stage atlas selection method were assessed using a collection of 30 prostate MR images. It achieved comparable segmentation accuracy as the conventional one-stage method with full-fledged registration, but significantly reduced computation time to 1/3 (from 30.82 to 11.04 min per segmentation). Compared with alternative one-stage cost-saving approach, the proposed scheme yielded superior performance with mean and medium DSC of (0.83, 0.85) compared to (0.74, 0.78). Conclusion: This work has developed a model-guided two-stage atlas selection scheme to achieve significant cost reduction while guaranteeing high segmentation accuracy. The benefit

  14. Unexpected heterogeneity derived from Cas9 ribonucleoprotein-introduced clonal cells at the HPRT1 locus.

    Science.gov (United States)

    Sakuma, Tetsushi; Mochida, Keiji; Nakade, Shota; Ezure, Toru; Minagawa, Sachi; Yamamoto, Takashi

    2018-04-01

    Single-cell cloning is an essential technique for establishing genome-edited cell clones mediated by programmable nucleases such as CRISPR-Cas9. However, residual genome-editing activity after single-cell cloning may cause heterogeneity in the clonal cells. Previous studies showed efficient mutagenesis and rapid degradation of CRISPR-Cas9 components in cultured cells by introducing Cas9 ribonucleoproteins (RNPs). In this study, we investigated how the timing for single-cell cloning of Cas9 RNP-transfected cells affected the heterogeneity of the resultant clones. We carried out transfection of Cas9 RNPs targeting several loci in the HPRT1 gene in HCT116 cells, followed by single-cell cloning at 24, 48, 72 hr and 1 week post-transfection. After approximately 3 weeks of incubation, the clonal cells were collected and genotyped by high-resolution microchip electrophoresis and Sanger sequencing. Unexpectedly, long-term incubation before single-cell cloning resulted in highly heterogeneous clones. We used a lipofection method for transfection, and the media containing transfectable RNPs were not removed before single-cell cloning. Therefore, the active Cas9 RNPs were considered to be continuously incorporated into cells during the precloning incubation. Our findings provide a warning that lipofection of Cas9 RNPs may cause continuous introduction of gene mutations depending on the experimental procedures. © 2018 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.

  15. Meta-analysis for genome-wide association studies using case-control design: application and practice.

    Science.gov (United States)

    Shim, Sungryul; Kim, Jiyoung; Jung, Wonguen; Shin, In-Soo; Bae, Jong-Myon

    2016-01-01

    This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA). The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy-Weinberg equilibrium (HWE) in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The 'genhwcci' and 'metan' commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the 'metareg' command of STATA should be conducted to evaluate related factors of heterogeneities.

  16. Prokaryote genome fluidity: toward a system approach of the mobilome.

    Science.gov (United States)

    Toussaint, Ariane; Chandler, Mick

    2012-01-01

    The importance of horizontal/lateral gene transfer (LGT) in shaping the genomes of prokaryotic organisms has been recognized in recent years as a result of analysis of the increasing number of available genome sequences. LGT is largely due to the transfer and recombination activities of mobile genetic elements (MGEs). Bacterial and archaeal genomes are mosaics of vertically and horizontally transmitted DNA segments. This generates reticulate relationships between members of the prokaryotic world that are better represented by networks than by "classical" phylogenetic trees. In this review we summarize the nature and activities of MGEs, and the problems that presently limit their analysis on a large scale. We propose routes to improve their annotation in the flow of genomic and metagenomic sequences that currently exist and those that become available. We describe network analysis of evolutionary relationships among some MGE categories and sketch out possible developments of this type of approach to get more insight into the role of the mobilome in bacterial adaptation and evolution.

  17. Genetic dissection of the Drosophila melanogaster female head transcriptome reveals widespread allelic heterogeneity.

    Science.gov (United States)

    King, Elizabeth G; Sanderson, Brian J; McNeil, Casey L; Long, Anthony D; Macdonald, Stuart J

    2014-05-01

    Modern genetic mapping is plagued by the "missing heritability" problem, which refers to the discordance between the estimated heritabilities of quantitative traits and the variance accounted for by mapped causative variants. One major potential explanation for the missing heritability is allelic heterogeneity, in which there are multiple causative variants at each causative gene with only a fraction having been identified. The majority of genome-wide association studies (GWAS) implicitly assume that a single SNP can explain all the variance for a causative locus. However, if allelic heterogeneity is prevalent, a substantial amount of genetic variance will remain unexplained. In this paper, we take a haplotype-based mapping approach and quantify the number of alleles segregating at each locus using a large set of 7922 eQTL contributing to regulatory variation in the Drosophila melanogaster female head. Not only does this study provide a comprehensive eQTL map for a major community genetic resource, the Drosophila Synthetic Population Resource, but it also provides a direct test of the allelic heterogeneity hypothesis. We find that 95% of cis-eQTLs and 78% of trans-eQTLs are due to multiple alleles, demonstrating that allelic heterogeneity is widespread in Drosophila eQTL. Allelic heterogeneity likely contributes significantly to the missing heritability problem common in GWAS studies.

  18. Assessment of heterogeneity in types of vegetables served by main household food preparers and food decision influencers.

    Science.gov (United States)

    Yi, Sunghwan; Kanetkar, Vinay; Brauer, Paula

    2015-10-01

    While vegetables are often studied as one food group, global measures may mask variation in the types and forms of vegetables preferred by different individuals. To explore preferences for and perceptions of vegetables, we assessed main food preparers based on their preparation of eight specific vegetables and mushrooms. An online self-report survey. Ontario, Canada. Measures included perceived benefits and obstacles of vegetables, convenience orientation and variety seeking in meal preparation. Of the 4517 randomly selected consumers who received the invitation, 1013 responded to the survey (22·4 % response). Data from the main food preparers were analysed (n 756). Latent profile analysis indicated three segments of food preparers. More open to new recipes, the 'crucifer lover' segment (13 %) prepared and consumed substantially more Brussels sprouts, broccoli and asparagus than the other segments. Although similar to the 'average consumer' segment (54 %) in many ways, the 'frozen vegetable user' segment (33 %) used significantly more frozen vegetables than the other segments due to higher prioritization of time and convenience in meal preparation and stronger 'healthy=not tasty' perception. Perception of specific vegetables on taste, healthiness, ease of preparation and cost varied significantly across the three consumer segments. Crucifer lovers also differed with respect to shopping and cooking habits compared with the frozen vegetable users. The substantial heterogeneity in the types of vegetables consumed and perceptions across the three consumer segments has implications for the development of new approaches to promoting these foods.

  19. Ontology based heterogeneous materials database integration and semantic query

    Science.gov (United States)

    Zhao, Shuai; Qian, Quan

    2017-10-01

    Materials digital data, high throughput experiments and high throughput computations are regarded as three key pillars of materials genome initiatives. With the fast growth of materials data, the integration and sharing of data is very urgent, that has gradually become a hot topic of materials informatics. Due to the lack of semantic description, it is difficult to integrate data deeply in semantic level when adopting the conventional heterogeneous database integration approaches such as federal database or data warehouse. In this paper, a semantic integration method is proposed to create the semantic ontology by extracting the database schema semi-automatically. Other heterogeneous databases are integrated to the ontology by means of relational algebra and the rooted graph. Based on integrated ontology, semantic query can be done using SPARQL. During the experiments, two world famous First Principle Computational databases, OQMD and Materials Project are used as the integration targets, which show the availability and effectiveness of our method.

  20. Heterogeneity of uridine incorporation along the rabbit nephron. I. Autoradiographic study

    International Nuclear Information System (INIS)

    Vandewalle, A.; Farman, N.; Cluzeaud, F.; Bonvalet, J.P.

    1984-01-01

    An autoradiographic study of uridine labeling in tubular segments microdissected from the rabbit kidney is presented. Kidney pyramids were incubated for 60 min with low (66 nM) and high (66μM) [ 3 H]-uridine concentration. At the two concentrations studied the labeling was almost exclusively nuclear in all segments studied. At the low concentration, labeling predominated in the macula densa (MD = 63.88 +/- 6.15 silver grains/100 μm 2 , n = 11), cortical ascending limb (CAL = 19.65 +/- 1.65, n = 15), and initial distal tubule (DCT/sub a/ = 24.31 +/- 2.70, n = 6). It was minimal in the proximal tubule (PCT 2 = 9.14 +/- 1.61, n = 16) and in the cortical (CCT = 5.23 +/- 0.75, n = 18) and medullary (MCT = 5.52 +/- 1.10, n = 12) collecting ducts. At a high concentration, the profile of labeling was roughly similar except for a relative increase in labeling much more pronounced in collecting ducts (CCT = +373, MCT = +323%) than in the other structures (MD = -14, CAL = +66, DCT/sub a/ = +49, PCT = +9%). Pulse-chase experiments do not show evidence for differences in turnover or degradation rates of RNA between segments, at least in the PCT and the connecting part of the CCT. Analysis of the results at low and high concentration suggests that the observed heterogeneity in uridine labeling depends on both variable endogenous nucleoside pools and different rates of uridine incorporation into RNA from one segment to another

  1. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk.

    Science.gov (United States)

    Waller, Rosalie G; Darlington, Todd M; Wei, Xiaomu; Madsen, Michael J; Thomas, Alun; Curtin, Karen; Coon, Hilary; Rajamanickam, Venkatesh; Musinsky, Justin; Jayabalan, David; Atanackovic, Djordje; Rajkumar, S Vincent; Kumar, Shaji; Slager, Susan; Middha, Mridu; Galia, Perrine; Demangel, Delphine; Salama, Mohamed; Joseph, Vijai; McKay, James; Offit, Kenneth; Klein, Robert J; Lipkin, Steven M; Dumontet, Charles; Vachon, Celine M; Camp, Nicola J

    2018-02-01

    The high-risk pedigree (HRP) design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS) method in 11 extended, Utah, multiple myeloma (MM) HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM) cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu), a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val), a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.

  2. Novel pedigree analysis implicates DNA repair and chromatin remodeling in multiple myeloma risk.

    Directory of Open Access Journals (Sweden)

    Rosalie G Waller

    2018-02-01

    Full Text Available The high-risk pedigree (HRP design is an established strategy to discover rare, highly-penetrant, Mendelian-like causal variants. Its success, however, in complex traits has been modest, largely due to challenges of genetic heterogeneity and complex inheritance models. We describe a HRP strategy that addresses intra-familial heterogeneity, and identifies inherited segments important for mapping regulatory risk. We apply this new Shared Genomic Segment (SGS method in 11 extended, Utah, multiple myeloma (MM HRPs, and subsequent exome sequencing in SGS regions of interest in 1063 MM / MGUS (monoclonal gammopathy of undetermined significance-a precursor to MM cases and 964 controls from a jointly-called collaborative resource, including cases from the initial 11 HRPs. One genome-wide significant 1.8 Mb shared segment was found at 6q16. Exome sequencing in this region revealed predicted deleterious variants in USP45 (p.Gln691* and p.Gln621Glu, a gene known to influence DNA repair through endonuclease regulation. Additionally, a 1.2 Mb segment at 1p36.11 is inherited in two Utah HRPs, with coding variants identified in ARID1A (p.Ser90Gly and p.Met890Val, a key gene in the SWI/SNF chromatin remodeling complex. Our results provide compelling statistical and genetic evidence for segregating risk variants for MM. In addition, we demonstrate a novel strategy to use large HRPs for risk-variant discovery more generally in complex traits.

  3. A Horizontally Transferred Autonomous Helitron Became a Full Polydnavirus Segment in Cotesia vestalis

    Directory of Open Access Journals (Sweden)

    Pedro Heringer

    2017-12-01

    Full Text Available Bracoviruses associate symbiotically with thousands of parasitoid wasp species in the family Braconidae, working as virulence gene vectors, and allowing the development of wasp larvae within hosts. These viruses are composed of multiple DNA circles that are packaged into infective particles, and injected together with wasp’s eggs during parasitization. One of the viral segments of Cotesia vestalis bracovirus contains a gene that has been previously described as a helicase of unknown origin. Here, we demonstrate that this gene is a Rep/Helicase from an intact Helitron transposable element that covers the viral segment almost entirely. We also provide evidence that this element underwent at least two horizontal transfers, which appear to have occurred consecutively: first from a Drosophila host ancestor to the genome of the parasitoid wasp C. vestalis and its bracovirus, and then from C. vestalis to a lepidopteran host (Bombyx mori. Our results reinforce the idea of parasitoid wasps as frequent agents of horizontal transfers in eukaryotes. Additionally, this Helitron-bracovirus segment is the first example of a transposable element that effectively became a whole viral circle.

  4. Automatic Segmentation and Quantification of Filamentous Structures in Electron Tomography.

    Science.gov (United States)

    Loss, Leandro A; Bebis, George; Chang, Hang; Auer, Manfred; Sarkar, Purbasha; Parvin, Bahram

    2012-10-01

    Electron tomography is a promising technology for imaging ultrastructures at nanoscale resolutions. However, image and quantitative analyses are often hindered by high levels of noise, staining heterogeneity, and material damage either as a result of the electron beam or sample preparation. We have developed and built a framework that allows for automatic segmentation and quantification of filamentous objects in 3D electron tomography. Our approach consists of three steps: (i) local enhancement of filaments by Hessian filtering; (ii) detection and completion (e.g., gap filling) of filamentous structures through tensor voting; and (iii) delineation of the filamentous networks. Our approach allows for quantification of filamentous networks in terms of their compositional and morphological features. We first validate our approach using a set of specifically designed synthetic data. We then apply our segmentation framework to tomograms of plant cell walls that have undergone different chemical treatments for polysaccharide extraction. The subsequent compositional and morphological analyses of the plant cell walls reveal their organizational characteristics and the effects of the different chemical protocols on specific polysaccharides.

  5. Genome-wide expressions in autologous eutopic and ectopic endometrium of fertile women with endometriosis

    OpenAIRE

    Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata

    2012-01-01

    Abstract Background In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eu...

  6. Accounting for segment correlations in segmented gamma-ray scans

    International Nuclear Information System (INIS)

    Sheppard, G.A.; Prettyman, T.H.; Piquette, E.C.

    1994-01-01

    In a typical segmented gamma-ray scanner (SGS), the detector's field of view is collimated so that a complete horizontal slice or segment of the desired thickness is visible. Ordinarily, the collimator is not deep enough to exclude gamma rays emitted from sample volumes above and below the segment aligned with the collimator. This can lead to assay biases, particularly for certain radioactive-material distributions. Another consequence of the collimator's low aspect ratio is that segment assays at the top and bottom of the sample are biased low because the detector's field of view is not filled. This effect is ordinarily countered by placing the sample on a low-Z pedestal and scanning one or more segment thicknesses below and above the sample. This takes extra time, however, We have investigated a number of techniques that both account for correlated segments and correct for end effects in SGS assays. Also, we have developed an algorithm that facilitates estimates of assay precision. Six calculation methods have been compared by evaluating the results of thousands of simulated, assays for three types of gamma-ray source distribution and ten masses. We will report on these computational studies and their experimental verification

  7. Nanostructural Features of Radiation Cured Networks for High Performance Composites: From Incurred Heterogeneities to Tailored Nanocomposites

    International Nuclear Information System (INIS)

    Krzeminski, Mickael; Ranoux, Guillaume; Coqueret, Xavier; Molinari, Michael; Chabbert, Brigitte; Aguié, Véronique; Defoort, Brigitte

    2011-01-01

    The radiation-induced polymerization of multiacrylates is suspected to generate heterogeneous networks at various dimension scales. In order to gain an insight into the polymer microstructure, a combination of analytic methods was used to quantify polymer segment mobility in the different domains [4,5]. Model epoxy or ethoxylated bis-phenol A diacrylates, EPAC and ETAC respectively, were used as precursors of representative networks for our investigations

  8. Nanostructural Features of Radiation Cured Networks for High Performance Composites: From Incurred Heterogeneities to Tailored Nanocomposites

    Energy Technology Data Exchange (ETDEWEB)

    Krzeminski, Mickael [Institut de Chimie Moléculaire de Reims (France); EADS Astrium, BP 20011, 33165 Saint Médard en Jalles Cedex (France); Ranoux, Guillaume; Coqueret, Xavier [Institut de Chimie Moléculaire de Reims (France); Molinari, Michael [Laboratoire des Microscopies et d’Etude des Nanostructures (France); Chabbert, Brigitte; Aguié, Véronique [UMR INRA Fractionnement des Agro-ressources et Environnement, Université de Reims Champagne Ardenne - 51687 Reims (France); Defoort, Brigitte [EADS Astrium, BP 20011, 33165 Saint Médard en Jalles Cedex, (France)

    2011-07-01

    The radiation-induced polymerization of multiacrylates is suspected to generate heterogeneous networks at various dimension scales. In order to gain an insight into the polymer microstructure, a combination of analytic methods was used to quantify polymer segment mobility in the different domains [4,5]. Model epoxy or ethoxylated bis-phenol A diacrylates, EPAC and ETAC respectively, were used as precursors of representative networks for our investigations.

  9. The Sorghum bicolor genome and the diversification of grasses

    Energy Technology Data Exchange (ETDEWEB)

    Paterson, Andrew H.; Bowers, John E.; Bruggmann, Remy; dubchak, Inna; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hellsten, Uffe; Mitros, Therese; Poliakov, Alexander; Schmutz, Jeremy; Spannagl, Manuel; Tang, Haibo; Wang, Xiyin; Wicker, Thomas; Bharti, Arvind K.; Chapman, Jarrod; Feltus, F. Alex; Gowik, Udo; Grigoriev, Igor V.; Lyons, Eric; Maher, Christopher A.; Martis, Mihaela; Marechania, Apurva; Otillar, Robert P.; Penning, Bryan W.; Salamov, Asaf. A.; Wang, Yu; Zhang, Lifang; Carpita, Nicholas C.; Freeling, Michael; Gingle, Alan R.; hash, C. Thomas; Keller, Beat; Klein, Patricia; Kresovich, Stephen; McCann, Maureen C.; Ming, Ray; Peterson, Daniel G.; ur-Rahman, Mehboob-; Ware, Doreen; Westhoff, Peter; Mayer, Klaus F. X.; Messing, Joachim; Rokhsar, Daniel S.

    2008-08-20

    Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approx730-megabase Sorghum bicolor (L.) Moench genome, placing approx98percent of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approx75percent larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approx70 million years ago, most duplicated gene sets lost one member before the sorghum rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24percent of genes are grass-specific and 7percent are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.

  10. Multi-scale structural community organisation of the human genome.

    Science.gov (United States)

    Boulos, Rasha E; Tremblay, Nicolas; Arneodo, Alain; Borgnat, Pierre; Audit, Benjamin

    2017-04-11

    Structural interaction frequency matrices between all genome loci are now experimentally achievable thanks to high-throughput chromosome conformation capture technologies. This ensues a new methodological challenge for computational biology which consists in objectively extracting from these data the structural motifs characteristic of genome organisation. We deployed the fast multi-scale community mining algorithm based on spectral graph wavelets to characterise the networks of intra-chromosomal interactions in human cell lines. We observed that there exist structural domains of all sizes up to chromosome length and demonstrated that the set of structural communities forms a hierarchy of chromosome segments. Hence, at all scales, chromosome folding predominantly involves interactions between neighbouring sites rather than the formation of links between distant loci. Multi-scale structural decomposition of human chromosomes provides an original framework to question structural organisation and its relationship to functional regulation across the scales. By construction the proposed methodology is independent of the precise assembly of the reference genome and is thus directly applicable to genomes whose assembly is not fully determined.

  11. Dynamics of genome rearrangement in bacterial populations.

    Directory of Open Access Journals (Sweden)

    Aaron E Darling

    2008-07-01

    Full Text Available Genome structure variation has profound impacts on phenotype in organisms ranging from microbes to humans, yet little is known about how natural selection acts on genome arrangement. Pathogenic bacteria such as Yersinia pestis, which causes bubonic and pneumonic plague, often exhibit a high degree of genomic rearrangement. The recent availability of several Yersinia genomes offers an unprecedented opportunity to study the evolution of genome structure and arrangement. We introduce a set of statistical methods to study patterns of rearrangement in circular chromosomes and apply them to the Yersinia. We constructed a multiple alignment of eight Yersinia genomes using Mauve software to identify 78 conserved segments that are internally free from genome rearrangement. Based on the alignment, we applied Bayesian statistical methods to infer the phylogenetic inversion history of Yersinia. The sampling of genome arrangement reconstructions contains seven parsimonious tree topologies, each having different histories of 79 inversions. Topologies with a greater number of inversions also exist, but were sampled less frequently. The inversion phylogenies agree with results suggested by SNP patterns. We then analyzed reconstructed inversion histories to identify patterns of rearrangement. We confirm an over-representation of "symmetric inversions"-inversions with endpoints that are equally distant from the origin of chromosomal replication. Ancestral genome arrangements demonstrate moderate preference for replichore balance in Yersinia. We found that all inversions are shorter than expected under a neutral model, whereas inversions acting within a single replichore are much shorter than expected. We also found evidence for a canonical configuration of the origin and terminus of replication. Finally, breakpoint reuse analysis reveals that inversions with endpoints proximal to the origin of DNA replication are nearly three times more frequent. Our findings

  12. High-Resolution Replication Profiles Define the Stochastic Nature of Genome Replication Initiation and Termination

    Directory of Open Access Journals (Sweden)

    Michelle Hawkins

    2013-11-01

    Full Text Available Eukaryotic genome replication is stochastic, and each cell uses a different cohort of replication origins. We demonstrate that interpreting high-resolution Saccharomyces cerevisiae genome replication data with a mathematical model allows quantification of the stochastic nature of genome replication, including the efficiency of each origin and the distribution of termination events. Single-cell measurements support the inferred values for stochastic origin activation time. A strain, in which three origins were inactivated, confirmed that the distribution of termination events is primarily dictated by the stochastic activation time of origins. Cell-to-cell variability in origin activity ensures that termination events are widely distributed across virtually the whole genome. We propose that the heterogeneity in origin usage contributes to genome stability by limiting potentially deleterious events from accumulating at particular loci.

  13. Genome reorganization in Nicotiana asymmetric somatic hybrids analysed by in situ hybridization

    International Nuclear Information System (INIS)

    Parokonny, A.S.; Kenton, A.Y.; Gleba, Y.Y.; Bennett, M.D.

    1992-01-01

    In situ hybridization was used to examine genome reorganization in asymmetric somatic hybrids between Nicotiana plumbaginifolia and Nicotiana sylvestris obtained by fusion of gamma-irradiated protoplasts from one of the parents (donor) with non-irradiated protoplasts from the other (recipient). Probing with biotinylated total genomic DNA from either the donor or the recipient species unequivocally identified genetic material from both parents in 31 regenerant plants, each originating from a different nuclear hybrid colony. This method, termed genomic in situ hybridization (GISH), allowed intergenomic translocations containing chromosome segments from both species to be recognized in four regenerants. A probe homologous to the consensus sequence of the Arabidopsis thaliana telomeric repeat (5'-TTTAGGG-3')n, identified telomeres on all chromosomes, including 'mini-chromosomes' originating from the irradiated donor genome. Genomic in situ hybridization to plant chromosomes provides a rapid and reliable means of screening for recombinant genotypes in asymmetric somatic hybrids. Used in combination with other DNA probes, it also contributes to a greater understanding of the events responsible for genomic recovery and restabilization following genetic manipulation in vitro

  14. CrusView: a Java-based visualization platform for comparative genomics analyses in Brassicaceae species.

    Science.gov (United States)

    Chen, Hao; Wang, Xiangfeng

    2013-09-01

    In plants and animals, chromosomal breakage and fusion events based on conserved syntenic genomic blocks lead to conserved patterns of karyotype evolution among species of the same family. However, karyotype information has not been well utilized in genomic comparison studies. We present CrusView, a Java-based bioinformatic application utilizing Standard Widget Toolkit/Swing graphics libraries and a SQLite database for performing visualized analyses of comparative genomics data in Brassicaceae (crucifer) plants. Compared with similar software and databases, one of the unique features of CrusView is its integration of karyotype information when comparing two genomes. This feature allows users to perform karyotype-based genome assembly and karyotype-assisted genome synteny analyses with preset karyotype patterns of the Brassicaceae genomes. Additionally, CrusView is a local program, which gives its users high flexibility when analyzing unpublished genomes and allows users to upload self-defined genomic information so that they can visually study the associations between genome structural variations and genetic elements, including chromosomal rearrangements, genomic macrosynteny, gene families, high-frequency recombination sites, and tandem and segmental duplications between related species. This tool will greatly facilitate karyotype, chromosome, and genome evolution studies using visualized comparative genomics approaches in Brassicaceae species. CrusView is freely available at http://www.cmbb.arizona.edu/CrusView/.

  15. Guided genome halving: hardness, heuristics and the history of the Hemiascomycetes.

    Science.gov (United States)

    Zheng, Chunfang; Zhu, Qian; Adam, Zaky; Sankoff, David

    2008-07-01

    Some present day species have incurred a whole genome doubling event in their evolutionary history, and this is reflected today in patterns of duplicated segments scattered throughout their chromosomes. These duplications may be used as data to 'halve' the genome, i.e. to reconstruct the ancestral genome at the moment of doubling, but the solution is often highly nonunique. To resolve this problem, we take account of outgroups, external reference genomes, to guide and narrow down the search. We improve on a previous, computationally costly, 'brute force' method by adapting the genome halving algorithm of El-Mabrouk and Sankoff so that it rapidly and accurately constructs an ancestor close the outgroups, prior to a local optimization heuristic. We apply this to reconstruct the predoubling ancestor of Saccharomyces cerevisiae and Candida glabrata, guided by the genomes of three other yeasts that diverged before the genome doubling event. We analyze the results in terms (1) of the minimum evolution criterion, (2) how close the genome halving result is to the final (local) minimum and (3) how close the final result is to an ancestor manually constructed by an expert with access to additional information. We also visualize the set of reconstructed ancestors using classic multidimensional scaling to see what aspects of the two doubled and three unduplicated genomes influence the differences among the reconstructions. The experimental software is available on request.

  16. Phenotype heterogeneity in cancer cell populations

    International Nuclear Information System (INIS)

    Almeida, Luis; Chisholm, Rebecca; Clairambault, Jean; Escargueil, Alexandre; Lorenzi, Tommaso; Lorz, Alexander; Trélat, Emmanuel

    2016-01-01

    Phenotype heterogeneity in cancer cell populations, be it of genetic, epigenetic or stochastic origin, has been identified as a main source of resistance to drug treatments and a major source of therapeutic failures in cancers. The molecular mechanisms of drug resistance are partly understood at the single cell level (e.g., overexpression of ABC transporters or of detoxication enzymes), but poorly predictable in tumours, where they are hypothesised to rely on heterogeneity at the cell population scale, which is thus the right level to describe cancer growth and optimise its control by therapeutic strategies in the clinic. We review a few results from the biological literature on the subject, and from mathematical models that have been published to predict and control evolution towards drug resistance in cancer cell populations. We propose, based on the latter, optimisation strategies of combined treatments to limit emergence of drug resistance to cytotoxic drugs in cancer cell populations, in the monoclonal situation, which limited as it is still retains consistent features of cell population heterogeneity. The polyclonal situation, that may be understood as “bet hedging” of the tumour, thus protecting itself from different sources of drug insults, may lie beyond such strategies and will need further developments. In the monoclonal situation, we have designed an optimised therapeutic strategy relying on a scheduled combination of cytotoxic and cytostatic treatments that can be adapted to different situations of cancer treatments. Finally, we review arguments for biological theoretical frameworks proposed at different time and development scales, the so-called atavistic model (diachronic view relying on Darwinian genotype selection in the coursof billions of years) and the Waddington-like epigenetic landscape endowed with evolutionary quasi-potential (synchronic view relying on Lamarckian phenotype instruction of a given genome by reversible mechanisms), to

  17. Phenotype heterogeneity in cancer cell populations

    Energy Technology Data Exchange (ETDEWEB)

    Almeida, Luis [CNRS UMR 7598, LJLL, & INRIA MAMBA team, Sorbonne Universités, UPMC Univ Paris 06, Boîte courrier 187, 4 Pl. Jussieu, 75252 Paris cedex 05, France, luis@ann.jussieu.fr (France); Chisholm, Rebecca [School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia, rebecca.chisholm@gmail.com (Australia); Clairambault, Jean [INRIA MAMBA team & LJLL, UMR 7598, Sorbonne Universités, UPMC Univ Paris 06, Boîte courrier 187, 4 Pl. Jussieu, 75252 Paris cedex 05, France, jean.clairambault@inria.fr, Corresponding author (France); Escargueil, Alexandre [INSERM “Cancer Biology and Therapeutics”, Sorbonne Universités, UPMC Univ Paris 06, UMR-S 938, CDR St Antoine, Hôpital St Antoine, 184 Fbg. St Antoine, 75571 Paris cedex 12, France, alexandre.escargueil@upmc.fr (France); Lorenzi, Tommaso [CMLA, ENS Cachan, 61, Av. du Président Wilson, 94230 Cachan cedex & INRIA MAMBA team, & LJLL, UMR 7598, UPMC Univ Paris 06, Boîte courrier 187, 4 Pl. Jussieu, 75252 Paris cedex 05, France, tommaso.lorenzi@gmail.com (France); Lorz, Alexander [Sorbonne Universités, UPMC Univ Paris 06, LJLL, UMR 7598 & INRIA Boîte courrier 187, 4 Pl. Jussieu, 75252 Paris cedex 05, France, alex.lorz@ann.jussieu.fr (France); Trélat, Emmanuel [Institut Universitaire de France, Sorbonne Universités, UPMC Univ Paris 06, LJLL, UMR 7598, Boîte courrier 187, UPMC Univ Paris 06, 4 Pl. Jussieu, 75252 Paris cedex 05, France, emmanuel.trelat@upmc.fr (France)

    2016-06-08

    Phenotype heterogeneity in cancer cell populations, be it of genetic, epigenetic or stochastic origin, has been identified as a main source of resistance to drug treatments and a major source of therapeutic failures in cancers. The molecular mechanisms of drug resistance are partly understood at the single cell level (e.g., overexpression of ABC transporters or of detoxication enzymes), but poorly predictable in tumours, where they are hypothesised to rely on heterogeneity at the cell population scale, which is thus the right level to describe cancer growth and optimise its control by therapeutic strategies in the clinic. We review a few results from the biological literature on the subject, and from mathematical models that have been published to predict and control evolution towards drug resistance in cancer cell populations. We propose, based on the latter, optimisation strategies of combined treatments to limit emergence of drug resistance to cytotoxic drugs in cancer cell populations, in the monoclonal situation, which limited as it is still retains consistent features of cell population heterogeneity. The polyclonal situation, that may be understood as “bet hedging” of the tumour, thus protecting itself from different sources of drug insults, may lie beyond such strategies and will need further developments. In the monoclonal situation, we have designed an optimised therapeutic strategy relying on a scheduled combination of cytotoxic and cytostatic treatments that can be adapted to different situations of cancer treatments. Finally, we review arguments for biological theoretical frameworks proposed at different time and development scales, the so-called atavistic model (diachronic view relying on Darwinian genotype selection in the coursof billions of years) and the Waddington-like epigenetic landscape endowed with evolutionary quasi-potential (synchronic view relying on Lamarckian phenotype instruction of a given genome by reversible mechanisms), to

  18. Phenotype heterogeneity in cancer cell populations

    Science.gov (United States)

    Almeida, Luis; Chisholm, Rebecca; Clairambault, Jean; Escargueil, Alexandre; Lorenzi, Tommaso; Lorz, Alexander; Trélat, Emmanuel

    2016-06-01

    Phenotype heterogeneity in cancer cell populations, be it of genetic, epigenetic or stochastic origin, has been identified as a main source of resistance to drug treatments and a major source of therapeutic failures in cancers. The molecular mechanisms of drug resistance are partly understood at the single cell level (e.g., overexpression of ABC transporters or of detoxication enzymes), but poorly predictable in tumours, where they are hypothesised to rely on heterogeneity at the cell population scale, which is thus the right level to describe cancer growth and optimise its control by therapeutic strategies in the clinic. We review a few results from the biological literature on the subject, and from mathematical models that have been published to predict and control evolution towards drug resistance in cancer cell populations. We propose, based on the latter, optimisation strategies of combined treatments to limit emergence of drug resistance to cytotoxic drugs in cancer cell populations, in the monoclonal situation, which limited as it is still retains consistent features of cell population heterogeneity. The polyclonal situation, that may be understood as "bet hedging" of the tumour, thus protecting itself from different sources of drug insults, may lie beyond such strategies and will need further developments. In the monoclonal situation, we have designed an optimised therapeutic strategy relying on a scheduled combination of cytotoxic and cytostatic treatments that can be adapted to different situations of cancer treatments. Finally, we review arguments for biological theoretical frameworks proposed at different time and development scales, the so-called atavistic model (diachronic view relying on Darwinian genotype selection in the coursof billions of years) and the Waddington-like epigenetic landscape endowed with evolutionary quasi-potential (synchronic view relying on Lamarckian phenotype instruction of a given genome by reversible mechanisms), to

  19. Three-dimensional segmentation of the tumor mass in computed tomographic images of neuroblastoma

    Science.gov (United States)

    Deglint, Hanford J.; Rangayyan, Rangaraj M.; Boag, Graham S.

    2004-05-01

    Tumor definition and diagnosis require the analysis of the spatial distribution and Hounsfield unit (HU) values of voxels in computed tomography (CT) images, coupled with a knowledge of normal anatomy. Segmentation of the tumor in neuroblastoma is complicated by the fact that the mass is almost always heterogeneous in nature; furthermore, viable tumor, necrosis, fibrosis, and normal tissue are often intermixed. Rather than attempt to separate these tissue types into distinct regions, we propose to explore methods to delineate the normal structures expected in abdominal CT images, remove them from further consideration, and examine the remaining parts of the images for the tumor mass. We explore the use of fuzzy connectivity for this purpose. Expert knowledge provided by the radiologist in the form of the expected structures and their shapes, HU values, and radiological characteristics are also incorporated in the segmentation algorithm. Segmentation and analysis of the tissue composition of the tumor can assist in quantitative assessment of the response to chemotherapy and in the planning of delayed surgery for resection of the tumor. The performance of the algorithm is evaluated using cases acquired from the Alberta Children's Hospital.

  20. Pathway-based outlier method reveals heterogeneous genomic structure of autism in blood transcriptome.

    Science.gov (United States)

    Campbell, Malcolm G; Kohane, Isaac S; Kong, Sek Won

    2013-09-24

    Decades of research strongly suggest that the genetic etiology of autism spectrum disorders (ASDs) is heterogeneous. However, most published studies focus on group differences between cases and controls. In contrast, we hypothesized that the heterogeneity of the disorder could be characterized by identifying pathways for which individuals are outliers rather than pathways representative of shared group differences of the ASD diagnosis. Two previously published blood gene expression data sets--the Translational Genetics Research Institute (TGen) dataset (70 cases and 60 unrelated controls) and the Simons Simplex Consortium (Simons) dataset (221 probands and 191 unaffected family members)--were analyzed. All individuals of each dataset were projected to biological pathways, and each sample's Mahalanobis distance from a pooled centroid was calculated to compare the number of case and control outliers for each pathway. Analysis of a set of blood gene expression profiles from 70 ASD and 60 unrelated controls revealed three pathways whose outliers were significantly overrepresented in the ASD cases: neuron development including axonogenesis and neurite development (29% of ASD, 3% of control), nitric oxide signaling (29%, 3%), and skeletal development (27%, 3%). Overall, 50% of cases and 8% of controls were outliers in one of these three pathways, which could not be identified using group comparison or gene-level outlier methods. In an independently collected data set consisting of 221 ASD and 191 unaffected family members, outliers in the neurogenesis pathway were heavily biased towards cases (20.8% of ASD, 12.0% of control). Interestingly, neurogenesis outliers were more common among unaffected family members (Simons) than unrelated controls (TGen), but the statistical significance of this effect was marginal (Chi squared P < 0.09). Unlike group difference approaches, our analysis identified the samples within the case and control groups that manifested each expression

  1. Hydrophilic segmented block copolymers based on poly(ethylene oxide) and monodisperse amide segments

    NARCIS (Netherlands)

    Husken, D.; Feijen, Jan; Gaymans, R.J.

    2007-01-01

    Segmented block copolymers based on poly(ethylene oxide) (PEO) flexible segments and monodisperse crystallizable bisester tetra-amide segments were made via a polycondensation reaction. The molecular weight of the PEO segments varied from 600 to 4600 g/mol and a bisester tetra-amide segment (T6T6T)

  2. The genomic landscape at a late stage of stickleback speciation: High genomic divergence interspersed by small localized regions of introgression.

    Directory of Open Access Journals (Sweden)

    Mark Ravinet

    2018-05-01

    Full Text Available Speciation is a continuous process and analysis of species pairs at different stages of divergence provides insight into how it unfolds. Previous genomic studies on young species pairs have revealed peaks of divergence and heterogeneous genomic differentiation. Yet less known is how localised peaks of differentiation progress to genome-wide divergence during the later stages of speciation in the presence of persistent gene flow. Spanning the speciation continuum, stickleback species pairs are ideal for investigating how genomic divergence builds up during speciation. However, attention has largely focused on young postglacial species pairs, with little knowledge of the genomic signatures of divergence and introgression in older stickleback systems. The Japanese stickleback species pair, composed of the Pacific Ocean three-spined stickleback (Gasterosteus aculeatus and the Japan Sea stickleback (G. nipponicus, which co-occur in the Japanese islands, is at a late stage of speciation. Divergence likely started well before the end of the last glacial period and crosses between Japan Sea females and Pacific Ocean males result in hybrid male sterility. Here we use coalescent analyses and Approximate Bayesian Computation to show that the two species split approximately 0.68-1 million years ago but that they have continued to exchange genes at a low rate throughout divergence. Population genomic data revealed that, despite gene flow, a high level of genomic differentiation is maintained across the majority of the genome. However, we identified multiple, small regions of introgression, occurring mainly in areas of low recombination rate. Our results demonstrate that a high level of genome-wide divergence can establish in the face of persistent introgression and that gene flow can be localized to small genomic regions at the later stages of speciation with gene flow.

  3. The genomic landscape at a late stage of stickleback speciation: High genomic divergence interspersed by small localized regions of introgression.

    Science.gov (United States)

    Ravinet, Mark; Yoshida, Kohta; Shigenobu, Shuji; Toyoda, Atsushi; Fujiyama, Asao; Kitano, Jun

    2018-05-01

    Speciation is a continuous process and analysis of species pairs at different stages of divergence provides insight into how it unfolds. Previous genomic studies on young species pairs have revealed peaks of divergence and heterogeneous genomic differentiation. Yet less known is how localised peaks of differentiation progress to genome-wide divergence during the later stages of speciation in the presence of persistent gene flow. Spanning the speciation continuum, stickleback species pairs are ideal for investigating how genomic divergence builds up during speciation. However, attention has largely focused on young postglacial species pairs, with little knowledge of the genomic signatures of divergence and introgression in older stickleback systems. The Japanese stickleback species pair, composed of the Pacific Ocean three-spined stickleback (Gasterosteus aculeatus) and the Japan Sea stickleback (G. nipponicus), which co-occur in the Japanese islands, is at a late stage of speciation. Divergence likely started well before the end of the last glacial period and crosses between Japan Sea females and Pacific Ocean males result in hybrid male sterility. Here we use coalescent analyses and Approximate Bayesian Computation to show that the two species split approximately 0.68-1 million years ago but that they have continued to exchange genes at a low rate throughout divergence. Population genomic data revealed that, despite gene flow, a high level of genomic differentiation is maintained across the majority of the genome. However, we identified multiple, small regions of introgression, occurring mainly in areas of low recombination rate. Our results demonstrate that a high level of genome-wide divergence can establish in the face of persistent introgression and that gene flow can be localized to small genomic regions at the later stages of speciation with gene flow.

  4. Meta-analysis for genome-wide association studies using case-control design: application and practice

    Directory of Open Access Journals (Sweden)

    Sungryul Shim

    2016-12-01

    Full Text Available This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA. The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy–Weinberg equilibrium (HWE in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The ‘genhwcci’ and ‘metan’ commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the ‘metareg’ command of STATA should be conducted to evaluate related factors of heterogeneities.

  5. Spinal segmental dysgenesis

    Directory of Open Access Journals (Sweden)

    N Mahomed

    2009-06-01

    Full Text Available Spinal segmental dysgenesis is a rare congenital spinal abnormality , seen in neonates and infants in which a segment of the spine and spinal cord fails to develop normally . The condition is segmental with normal vertebrae above and below the malformation. This condition is commonly associated with various abnormalities that affect the heart, genitourinary, gastrointestinal tract and skeletal system. We report two cases of spinal segmental dysgenesis and the associated abnormalities.

  6. Segmentation of solid subregion of high grade gliomas in MRI images based on active contour model (ACM)

    Science.gov (United States)

    Seow, P.; Win, M. T.; Wong, J. H. D.; Abdullah, N. A.; Ramli, N.

    2016-03-01

    Gliomas are tumours arising from the interstitial tissue of the brain which are heterogeneous, infiltrative and possess ill-defined borders. Tumour subregions (e.g. solid enhancing part, edema and necrosis) are often used for tumour characterisation. Tumour demarcation into substructures facilitates glioma staging and provides essential information. Manual segmentation had several drawbacks that include laborious, time consuming, subjected to intra and inter-rater variability and hindered by diversity in the appearance of tumour tissues. In this work, active contour model (ACM) was used to segment the solid enhancing subregion of the tumour. 2D brain image acquisition data using 3T MRI fast spoiled gradient echo sequence in post gadolinium of four histologically proven high-grade glioma patients were obtained. Preprocessing of the images which includes subtraction and skull stripping were performed and then followed by ACM segmentation. The results of the automatic segmentation method were compared against the manual delineation of the tumour by a trainee radiologist. Both results were further validated by an experienced neuroradiologist and a brief quantitative evaluations (pixel area and difference ratio) were performed. Preliminary results of the clinical data showed the potential of ACM model in the application of fast and large scale tumour segmentation in medical imaging.

  7. Segmentation of solid subregion of high grade gliomas in MRI images based on active contour model (ACM)

    International Nuclear Information System (INIS)

    Seow, P; Win, M T; Wong, J H D; Ramli, N; Abdullah, N A

    2016-01-01

    Gliomas are tumours arising from the interstitial tissue of the brain which are heterogeneous, infiltrative and possess ill-defined borders. Tumour subregions (e.g. solid enhancing part, edema and necrosis) are often used for tumour characterisation. Tumour demarcation into substructures facilitates glioma staging and provides essential information. Manual segmentation had several drawbacks that include laborious, time consuming, subjected to intra and inter-rater variability and hindered by diversity in the appearance of tumour tissues. In this work, active contour model (ACM) was used to segment the solid enhancing subregion of the tumour. 2D brain image acquisition data using 3T MRI fast spoiled gradient echo sequence in post gadolinium of four histologically proven high-grade glioma patients were obtained. Preprocessing of the images which includes subtraction and skull stripping were performed and then followed by ACM segmentation. The results of the automatic segmentation method were compared against the manual delineation of the tumour by a trainee radiologist. Both results were further validated by an experienced neuroradiologist and a brief quantitative evaluations (pixel area and difference ratio) were performed. Preliminary results of the clinical data showed the potential of ACM model in the application of fast and large scale tumour segmentation in medical imaging. (paper)

  8. From tomographic images to fault heterogeneities

    Directory of Open Access Journals (Sweden)

    A. Amato

    1994-06-01

    Full Text Available Local Earthquake Tomography (LET is a useful tool for imaging lateral heterogeneities in the upper crust. The pattern of P- and S-wave velocity anomalies, in relation to the seismicity distribution along active fault zones. can shed light on the existence of discrete seismogenic patches. Recent tomographic studies in well monitored seismic areas have shown that the regions with large seismic moment release generally correspond to high velocity zones (HVZ's. In this paper, we discuss the relationship between the seismogenic behavior of faults and the velocity structure of fault zones as inferred from seismic tomography. First, we review some recent tomographic studies in active strike-slip faults. We show examples from different segments of the San Andreas fault system (Parkfield, Loma Prieta, where detailed studies have been carried out in recent years. We also show two applications of LET to thrust faults (Coalinga, Friuli. Then, we focus on the Irpinia normal fault zone (South-Central Italy, where a Ms = 6.9 earthquake occurred in 1980 and many thousands of attershock travel time data are available. We find that earthquake hypocenters concentrate in HVZ's, whereas low velocity zones (LVZ’ s appear to be relatively aseismic. The main HVZ's along which the mainshock rupture bas propagated may correspond to velocity weakening fault regions, whereas the LVZ's are probably related to weak materials undergoing stable slip (velocity strengthening. A correlation exists between this HVZ and the area with larger coseismic slip along the fault, according to both surface evidence (a fault scarp as high as 1 m and strong ground motion waveform modeling. Smaller wave-length, low-velocity anomalies detected along the fault may be the expression of velocity strengthening sections, where aseismic slip occurs. According to our results, the rupture at the nucleation depth (~ 10-12 km is continuous for the whole fault lenoth (~ 30 km, whereas at shallow depth

  9. Relationship between Deleterious Variation, Genomic Autozygosity, and Disease Risk: Insights from The 1000 Genomes Project.

    Science.gov (United States)

    Pemberton, Trevor J; Szpiech, Zachary A

    2018-04-05

    Genomic regions of autozygosity (ROAs) represent segments of individual genomes that are homozygous for haplotypes inherited identical-by-descent (IBD) from a common ancestor. ROAs are nonuniformly distributed across the genome, and increased ROA levels are a reported risk factor for numerous complex diseases. Previously, we hypothesized that long ROAs are enriched for deleterious homozygotes as a result of young haplotypes with recent deleterious mutations-relatively untouched by purifying selection-being paired IBD as a consequence of recent parental relatedness, a pattern supported by ROA and whole-exome sequence data on 27 individuals. Here, we significantly bolster support for our hypothesis and expand upon our original analyses using ROA and whole-genome sequence data on 2,436 individuals from The 1000 Genomes Project. Considering CADD deleteriousness scores, we reaffirm our previous observation that long ROAs are enriched for damaging homozygotes worldwide. We show that strongly damaging homozygotes experience greater enrichment than weaker damaging homozygotes, while overall enrichment varies appreciably among populations. Mendelian disease genes and those encoding FDA-approved drug targets have significantly increased rates of gain in damaging homozygotes with increasing ROA coverage relative to all other genes. In genes implicated in eight complex phenotypes for which ROA levels have been identified as a risk factor, rates of gain in damaging homozygotes vary across phenotypes and populations but frequently differ significantly from non-disease genes. These findings highlight the potential confounding effects of population background in the assessment of associations between ROA levels and complex disease risk, which might underlie reported inconsistencies in ROA-phenotype associations. Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  10. Large epidemic thresholds emerge in heterogeneous networks of heterogeneous nodes

    Science.gov (United States)

    Yang, Hui; Tang, Ming; Gross, Thilo

    2015-08-01

    One of the famous results of network science states that networks with heterogeneous connectivity are more susceptible to epidemic spreading than their more homogeneous counterparts. In particular, in networks of identical nodes it has been shown that network heterogeneity, i.e. a broad degree distribution, can lower the epidemic threshold at which epidemics can invade the system. Network heterogeneity can thus allow diseases with lower transmission probabilities to persist and spread. However, it has been pointed out that networks in which the properties of nodes are intrinsically heterogeneous can be very resilient to disease spreading. Heterogeneity in structure can enhance or diminish the resilience of networks with heterogeneous nodes, depending on the correlations between the topological and intrinsic properties. Here, we consider a plausible scenario where people have intrinsic differences in susceptibility and adapt their social network structure to the presence of the disease. We show that the resilience of networks with heterogeneous connectivity can surpass those of networks with homogeneous connectivity. For epidemiology, this implies that network heterogeneity should not be studied in isolation, it is instead the heterogeneity of infection risk that determines the likelihood of outbreaks.

  11. Large epidemic thresholds emerge in heterogeneous networks of heterogeneous nodes.

    Science.gov (United States)

    Yang, Hui; Tang, Ming; Gross, Thilo

    2015-08-21

    One of the famous results of network science states that networks with heterogeneous connectivity are more susceptible to epidemic spreading than their more homogeneous counterparts. In particular, in networks of identical nodes it has been shown that network heterogeneity, i.e. a broad degree distribution, can lower the epidemic threshold at which epidemics can invade the system. Network heterogeneity can thus allow diseases with lower transmission probabilities to persist and spread. However, it has been pointed out that networks in which the properties of nodes are intrinsically heterogeneous can be very resilient to disease spreading. Heterogeneity in structure can enhance or diminish the resilience of networks with heterogeneous nodes, depending on the correlations between the topological and intrinsic properties. Here, we consider a plausible scenario where people have intrinsic differences in susceptibility and adapt their social network structure to the presence of the disease. We show that the resilience of networks with heterogeneous connectivity can surpass those of networks with homogeneous connectivity. For epidemiology, this implies that network heterogeneity should not be studied in isolation, it is instead the heterogeneity of infection risk that determines the likelihood of outbreaks.

  12. Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.

    Science.gov (United States)

    Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S K; Mammel, Mark K; Tarr, Phillip I; Eppinger, Mark

    2016-01-01

    Multi isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North America outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulse-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and

  13. Genome network medicine: innovation to overcome huge challenges in cancer therapy.

    Science.gov (United States)

    Roukos, Dimitrios H

    2014-01-01

    The post-ENCODE era shapes now a new biomedical research direction for understanding transcriptional and signaling networks driving gene expression and core cellular processes such as cell fate, survival, and apoptosis. Over the past half century, the Francis Crick 'central dogma' of single n gene/protein-phenotype (trait/disease) has defined biology, human physiology, disease, diagnostics, and drugs discovery. However, the ENCODE project and several other genomic studies using high-throughput sequencing technologies, computational strategies, and imaging techniques to visualize regulatory networks, provide evidence that transcriptional process and gene expression are regulated by highly complex dynamic molecular and signaling networks. This Focus article describes the linear experimentation-based limitations of diagnostics and therapeutics to cure advanced cancer and the need to move on from reductionist to network-based approaches. With evident a wide genomic heterogeneity, the power and challenges of next-generation sequencing (NGS) technologies to identify a patient's personal mutational landscape for tailoring the best target drugs in the individual patient are discussed. However, the available drugs are not capable of targeting aberrant signaling networks and research on functional transcriptional heterogeneity and functional genome organization is poorly understood. Therefore, the future clinical genome network medicine aiming at overcoming multiple problems in the new fields of regulatory DNA mapping, noncoding RNA, enhancer RNAs, and dynamic complexity of transcriptional circuitry are also discussed expecting in new innovation technology and strong appreciation of clinical data and evidence-based medicine. The problematic and potential solutions in the discovery of next-generation, molecular, and signaling circuitry-based biomarkers and drugs are explored. © 2013 Wiley Periodicals, Inc.

  14. Automatic Melody Segmentation

    NARCIS (Netherlands)

    Rodríguez López, Marcelo

    2016-01-01

    The work presented in this dissertation investigates music segmentation. In the field of Musicology, segmentation refers to a score analysis technique, whereby notated pieces or passages of these pieces are divided into “units” referred to as sections, periods, phrases, and so on. Segmentation

  15. Recurrent reciprocal genomic rearrangements of 17q12 are associated with renal disease, diabetes, and epilepsy

    DEFF Research Database (Denmark)

    Mefford, Heather C; Clauin, Severine; Sharp, Andrew J

    2007-01-01

    predisposed to recurrent rearrangement, by array-based comparative genomic hybridization. We found that 6% of fetal material showed evidence of microdeletion or microduplication, including three independent events that likely resulted from unequal crossing-over between segmental duplications. One...

  16. Updates on genome-wide association findings in eating disorders and future application to precision medicine.

    Science.gov (United States)

    Breithaupt, Lauren; Hubel, Christopher; Bulik, Cynthia M

    2018-02-22

    Heterogeneity, frequent diagnostic fluctuation across presentations, and global concerns with the absence of effective treatments all encourage science that moves the field toward individualized or precision medicine in eating disorders. We review recent advances in psychiatric genetics focusing on genome-wide association studies (GWAS) in eating disorders and enumerate the prospects and challenges of a genomics-driven approach towards personalized intervention. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  17. Hippocampal unified multi-atlas network (HUMAN): protocol and scale validation of a novel segmentation tool.

    Science.gov (United States)

    Amoroso, N; Errico, R; Bruno, S; Chincarini, A; Garuccio, E; Sensi, F; Tangaro, S; Tateo, A; Bellotti, R

    2015-11-21

    In this study we present a novel fully automated Hippocampal Unified Multi-Atlas-Networks (HUMAN) algorithm for the segmentation of the hippocampus in structural magnetic resonance imaging. In multi-atlas approaches atlas selection is of crucial importance for the accuracy of the segmentation. Here we present an optimized method based on the definition of a small peri-hippocampal region to target the atlas learning with linear and non-linear embedded manifolds. All atlases were co-registered to a data driven template resulting in a computationally efficient method that requires only one test registration. The optimal atlases identified were used to train dedicated artificial neural networks whose labels were then propagated and fused to obtain the final segmentation. To quantify data heterogeneity and protocol inherent effects, HUMAN was tested on two independent data sets provided by the Alzheimer's Disease Neuroimaging Initiative and the Open Access Series of Imaging Studies. HUMAN is accurate and achieves state-of-the-art performance (Dice[Formula: see text] and Dice[Formula: see text]). It is also a robust method that remains stable when applied to the whole hippocampus or to sub-regions (patches). HUMAN also compares favorably with a basic multi-atlas approach and a benchmark segmentation tool such as FreeSurfer.

  18. Genetic dissection of the Drosophila melanogaster female head transcriptome reveals widespread allelic heterogeneity.

    Directory of Open Access Journals (Sweden)

    Elizabeth G King

    2014-05-01

    Full Text Available Modern genetic mapping is plagued by the "missing heritability" problem, which refers to the discordance between the estimated heritabilities of quantitative traits and the variance accounted for by mapped causative variants. One major potential explanation for the missing heritability is allelic heterogeneity, in which there are multiple causative variants at each causative gene with only a fraction having been identified. The majority of genome-wide association studies (GWAS implicitly assume that a single SNP can explain all the variance for a causative locus. However, if allelic heterogeneity is prevalent, a substantial amount of genetic variance will remain unexplained. In this paper, we take a haplotype-based mapping approach and quantify the number of alleles segregating at each locus using a large set of 7922 eQTL contributing to regulatory variation in the Drosophila melanogaster female head. Not only does this study provide a comprehensive eQTL map for a major community genetic resource, the Drosophila Synthetic Population Resource, but it also provides a direct test of the allelic heterogeneity hypothesis. We find that 95% of cis-eQTLs and 78% of trans-eQTLs are due to multiple alleles, demonstrating that allelic heterogeneity is widespread in Drosophila eQTL. Allelic heterogeneity likely contributes significantly to the missing heritability problem common in GWAS studies.

  19. Genomic diversity of drug-resistant Mycobacterium tuberculosis isolates in Lisbon Portugal: Towards tuberculosis genomic epidemiology

    Directory of Open Access Journals (Sweden)

    João Perdigão

    2015-01-01

    : Lisboa3 and Q1. The data presented by this study contributes to the expanding knowledge of MTB genomic diversity yielding insights on microevolution and identification of novel compensatory mutations. Additionally, the analysis of nonsynonymous/ synonymous ratios revealed heterogeneities across the chromosome, genotype and Clusters of Orthologous Groups, highlighting possible and different evolution strategies. Overall, the results that are presented support the notion of an increasing genomic diversity that may support both setting and host adaptation.

  20. Genomic diversity of drug-resistant Mycobacterium tuberculosis isolates in Lisbon Portugal: Towards tuberculosis genomic epidemiology

    KAUST Repository

    Perdigã o, Joã o; Silva, Hugo; Machado, Diana; Macedo, Rita; Maltez, Fernando; Silva, Carla; Jordao, Luisa; Couto, Isabel; Mallard, Kim; Coll, Francesc; Hill-Cawthorne, Grant A.; McNerney, Ruth; Pain, Arnab; Clark, Taane G.; Viveiros, Miguel; Portugal, Isabel

    2015-01-01

    1. The data presented by this study contributes to the expanding knowledge of MTB genomic diversity yielding insights on microevolution and identification of novel compensatory mutations. Additionally, the analysis of non-synonymous/synonymous ratios revealed heterogeneities across the chromosome, genotype and Clusters of Orthologous Groups, highlighting possible and different evolution strategies. Overall, the results that are presented support the notion of an increasing genomic diversity that may support both setting and host adaptation.

  1. Genomic diversity of drug-resistant Mycobacterium tuberculosis isolates in Lisbon Portugal: Towards tuberculosis genomic epidemiology

    KAUST Repository

    Perdigão, João

    2015-03-01

    1. The data presented by this study contributes to the expanding knowledge of MTB genomic diversity yielding insights on microevolution and identification of novel compensatory mutations. Additionally, the analysis of non-synonymous/synonymous ratios revealed heterogeneities across the chromosome, genotype and Clusters of Orthologous Groups, highlighting possible and different evolution strategies. Overall, the results that are presented support the notion of an increasing genomic diversity that may support both setting and host adaptation.

  2. Genome-wide maps of alkylation damage, repair, and mutagenesis in yeast reveal mechanisms of mutational heterogeneity.

    Science.gov (United States)

    Mao, Peng; Brown, Alexander J; Malc, Ewa P; Mieczkowski, Piotr A; Smerdon, Michael J; Roberts, Steven A; Wyrick, John J

    2017-10-01

    DNA base damage is an important contributor to genome instability, but how the formation and repair of these lesions is affected by the genomic landscape and contributes to mutagenesis is unknown. Here, we describe genome-wide maps of DNA base damage, repair, and mutagenesis at single nucleotide resolution in yeast treated with the alkylating agent methyl methanesulfonate (MMS). Analysis of these maps revealed that base excision repair (BER) of alkylation damage is significantly modulated by chromatin, with faster repair in nucleosome-depleted regions, and slower repair and higher mutation density within strongly positioned nucleosomes. Both the translational and rotational settings of lesions within nucleosomes significantly influence BER efficiency; moreover, this effect is asymmetric relative to the nucleosome dyad axis and is regulated by histone modifications. Our data also indicate that MMS-induced mutations at adenine nucleotides are significantly enriched on the nontranscribed strand (NTS) of yeast genes, particularly in BER-deficient strains, due to higher damage formation on the NTS and transcription-coupled repair of the transcribed strand (TS). These findings reveal the influence of chromatin on repair and mutagenesis of base lesions on a genome-wide scale and suggest a novel mechanism for transcription-associated mutation asymmetry, which is frequently observed in human cancers. © 2017 Mao et al.; Published by Cold Spring Harbor Laboratory Press.

  3. A Practical Approach to Tumor Heterogeneity in Clinical Research and Diagnostics.

    Science.gov (United States)

    Stanta, Giorgio; Bonin, Serena

    2018-01-01

    This Pathobiology issue tries to better define the complex phenomenon of intratumor heterogeneity (ITH), mostly from a practical point of view. This topic has been chosen because ITH is a central issue in tumor development and has to be investigated directly in patient tissue and immediately applied in the treatment of the presenting patient. Different types of ITH should be considered: clonal genetic and epigenetic evolution, morphological heterogeneity, and tumor sampling, heterogeneity resulting from microenvironmental autocrine and paracrine interaction, and stochastic plasticity related to different functional cell efficiencies. For a higher level of reproducibility in clinical research and diagnostics, it is necessary to establish standardized analytical methods, including microdissection. In situ techniques can be pivotal to explore tumor microenvironment and can be improved with associated digital analysis. Liquid biopsies for plasma DNA analysis are at present the best method to study recurrent tumors with treatment adaptation, and widespread clinical use could be beneficial. The different types of tumor genomic instabilities could have pragmatic applications to rank ITH for clinical applications: treatment approaches differ in patients with a high nucleotide mutation rate and patients with high copy number alterations. © 2017 S. Karger AG, Basel.

  4. Variance heterogeneity in Saccharomyces cerevisiae expression data: trans-regulation and epistasis.

    Science.gov (United States)

    Nelson, Ronald M; Pettersson, Mats E; Li, Xidan; Carlborg, Örjan

    2013-01-01

    Here, we describe the results from the first variance heterogeneity Genome Wide Association Study (VGWAS) on yeast expression data. Using this forward genetics approach, we show that the genetic regulation of gene-expression in the budding yeast, Saccharomyces cerevisiae, includes mechanisms that can lead to variance heterogeneity in the expression between genotypes. Additionally, we performed a mean effect association study (GWAS). Comparing the mean and variance heterogeneity analyses, we find that the mean expression level is under genetic regulation from a larger absolute number of loci but that a higher proportion of the variance controlling loci were trans-regulated. Both mean and variance regulating loci cluster in regulatory hotspots that affect a large number of phenotypes; a single variance-controlling locus, mapping close to DIA2, was found to be involved in more than 10% of the significant associations. It has been suggested in the literature that variance-heterogeneity between the genotypes might be due to genetic interactions. We therefore screened the multi-locus genotype-phenotype maps for several traits where multiple associations were found, for indications of epistasis. Several examples of two and three locus genetic interactions were found to involve variance-controlling loci, with reports from the literature corroborating the functional connections between the loci. By using a new analytical approach to re-analyze a powerful existing dataset, we are thus able to both provide novel insights to the genetic mechanisms involved in the regulation of gene-expression in budding yeast and experimentally validate epistasis as an important mechanism underlying genetic variance-heterogeneity between genotypes.

  5. Heterogeneity of uridine incorporation along the rabbit nephron. I. Autoradiographic study

    Energy Technology Data Exchange (ETDEWEB)

    Vandewalle, A.; Farman, N.; Cluzeaud, F.; Bonvalet, J.P.

    1984-04-01

    An autoradiographic study of uridine labeling in tubular segments microdissected from the rabbit kidney is presented. Kidney pyramids were incubated for 60 min with low (66 nM) and high (66..mu..M) (/sup 3/H)-uridine concentration. At the two concentrations studied the labeling was almost exclusively nuclear in all segments studied. At the low concentration, labeling predominated in the macula densa (MD = 63.88 +/- 6.15 silver grains/100 ..mu..m/sup 2/, n = 11), cortical ascending limb (CAL = 19.65 +/- 1.65, n = 15), and initial distal tubule (DCT/sub a/ = 24.31 +/- 2.70, n = 6). It was minimal in the proximal tubule (PCT/sub 2/ = 9.14 +/- 1.61, n = 16) and in the cortical (CCT = 5.23 +/- 0.75, n = 18) and medullary (MCT = 5.52 +/- 1.10, n = 12) collecting ducts. At a high concentration, the profile of labeling was roughly similar except for a relative increase in labeling much more pronounced in collecting ducts (CCT = +373, MCT = +323%) than in the other structures (MD = -14, CAL = +66, DCT/sub a/ = +49, PCT = +9%). Pulse-chase experiments do not show evidence for differences in turnover or degradation rates of RNA between segments, at least in the PCT and the connecting part of the CCT. Analysis of the results at low and high concentration suggests that the observed heterogeneity in uridine labeling depends on both variable endogenous nucleoside pools and different rates of uridine incorporation into RNA from one segment to another.

  6. CrusView: A Java-Based Visualization Platform for Comparative Genomics Analyses in Brassicaceae Species[OPEN

    Science.gov (United States)

    Chen, Hao; Wang, Xiangfeng

    2013-01-01

    In plants and animals, chromosomal breakage and fusion events based on conserved syntenic genomic blocks lead to conserved patterns of karyotype evolution among species of the same family. However, karyotype information has not been well utilized in genomic comparison studies. We present CrusView, a Java-based bioinformatic application utilizing Standard Widget Toolkit/Swing graphics libraries and a SQLite database for performing visualized analyses of comparative genomics data in Brassicaceae (crucifer) plants. Compared with similar software and databases, one of the unique features of CrusView is its integration of karyotype information when comparing two genomes. This feature allows users to perform karyotype-based genome assembly and karyotype-assisted genome synteny analyses with preset karyotype patterns of the Brassicaceae genomes. Additionally, CrusView is a local program, which gives its users high flexibility when analyzing unpublished genomes and allows users to upload self-defined genomic information so that they can visually study the associations between genome structural variations and genetic elements, including chromosomal rearrangements, genomic macrosynteny, gene families, high-frequency recombination sites, and tandem and segmental duplications between related species. This tool will greatly facilitate karyotype, chromosome, and genome evolution studies using visualized comparative genomics approaches in Brassicaceae species. CrusView is freely available at http://www.cmbb.arizona.edu/CrusView/. PMID:23898041

  7. The Genomic and Transcriptomic Landscape of a HeLa Cell Line

    Science.gov (United States)

    Landry, Jonathan J. M.; Pyl, Paul Theodor; Rausch, Tobias; Zichner, Thomas; Tekkedil, Manu M.; Stütz, Adrian M.; Jauch, Anna; Aiyar, Raeka S.; Pau, Gregoire; Delhomme, Nicolas; Gagneur, Julien; Korbel, Jan O.; Huber, Wolfgang; Steinmetz, Lars M.

    2013-01-01

    HeLa is the most widely used model cell line for studying human cellular and molecular biology. To date, no genomic reference for this cell line has been released, and experiments have relied on the human reference genome. Effective design and interpretation of molecular genetic studies performed using HeLa cells require accurate genomic information. Here we present a detailed genomic and transcriptomic characterization of a HeLa cell line. We performed DNA and RNA sequencing of a HeLa Kyoto cell line and analyzed its mutational portfolio and gene expression profile. Segmentation of the genome according to copy number revealed a remarkably high level of aneuploidy and numerous large structural variants at unprecedented resolution. Some of the extensive genomic rearrangements are indicative of catastrophic chromosome shattering, known as chromothripsis. Our analysis of the HeLa gene expression profile revealed that several pathways, including cell cycle and DNA repair, exhibit significantly different expression patterns from those in normal human tissues. Our results provide the first detailed account of genomic variants in the HeLa genome, yielding insight into their impact on gene expression and cellular function as well as their origins. This study underscores the importance of accounting for the strikingly aberrant characteristics of HeLa cells when designing and interpreting experiments, and has implications for the use of HeLa as a model of human biology. PMID:23550136

  8. Segmented trapped vortex cavity

    Science.gov (United States)

    Grammel, Jr., Leonard Paul (Inventor); Pennekamp, David Lance (Inventor); Winslow, Jr., Ralph Henry (Inventor)

    2010-01-01

    An annular trapped vortex cavity assembly segment comprising includes a cavity forward wall, a cavity aft wall, and a cavity radially outer wall there between defining a cavity segment therein. A cavity opening extends between the forward and aft walls at a radially inner end of the assembly segment. Radially spaced apart pluralities of air injection first and second holes extend through the forward and aft walls respectively. The segment may include first and second expansion joint features at distal first and second ends respectively of the segment. The segment may include a forward subcomponent including the cavity forward wall attached to an aft subcomponent including the cavity aft wall. The forward and aft subcomponents include forward and aft portions of the cavity radially outer wall respectively. A ring of the segments may be circumferentially disposed about an axis to form an annular segmented vortex cavity assembly.

  9. Transformation of natural genetic variation into Haemophilus influenzae genomes.

    Directory of Open Access Journals (Sweden)

    Joshua Chang Mell

    2011-07-01

    Full Text Available Many bacteria are able to efficiently bind and take up double-stranded DNA fragments, and the resulting natural transformation shapes bacterial genomes, transmits antibiotic resistance, and allows escape from immune surveillance. The genomes of many competent pathogens show evidence of extensive historical recombination between lineages, but the actual recombination events have not been well characterized. We used DNA from a clinical isolate of Haemophilus influenzae to transform competent cells of a laboratory strain. To identify which of the ~40,000 polymorphic differences had recombined into the genomes of four transformed clones, their genomes and their donor and recipient parents were deep sequenced to high coverage. Each clone was found to contain ~1000 donor polymorphisms in 3-6 contiguous runs (8.1±4.5 kb in length that collectively comprised ~1-3% of each transformed chromosome. Seven donor-specific insertions and deletions were also acquired as parts of larger donor segments, but the presence of other structural variation flanking 12 of 32 recombination breakpoints suggested that these often disrupt the progress of recombination events. This is the first genome-wide analysis of chromosomes directly transformed with DNA from a divergent genotype, connecting experimental studies of transformation with the high levels of natural genetic variation found in isolates of the same species.

  10. Performance Analysis of Segmentation of Hyperspectral Images Based on Color Image Segmentation

    Directory of Open Access Journals (Sweden)

    Praveen Agarwal

    2017-06-01

    Full Text Available Image segmentation is a fundamental approach in the field of image processing and based on user’s application .This paper propose an original and simple segmentation strategy based on the EM approach that resolves many informatics problems about hyperspectral images which are observed by airborne sensors. In a first step, to simplify the input color textured image into a color image without texture. The final segmentation is simply achieved by a spatially color segmentation using feature vector with the set of color values contained around the pixel to be classified with some mathematical equations. The spatial constraint allows taking into account the inherent spatial relationships of any image and its color. This approach provides effective PSNR for the segmented image. These results have the better performance as the segmented images are compared with Watershed & Region Growing Algorithm and provide effective segmentation for the Spectral Images & Medical Images.

  11. On detection and assessment of statistical significance of Genomic Islands

    Directory of Open Access Journals (Sweden)

    Chaudhuri Probal

    2008-04-01

    Full Text Available Abstract Background Many of the available methods for detecting Genomic Islands (GIs in prokaryotic genomes use markers such as transposons, proximal tRNAs, flanking repeats etc., or they use other supervised techniques requiring training datasets. Most of these methods are primarily based on the biases in GC content or codon and amino acid usage of the islands. However, these methods either do not use any formal statistical test of significance or use statistical tests for which the critical values and the P-values are not adequately justified. We propose a method, which is unsupervised in nature and uses Monte-Carlo statistical tests based on randomly selected segments of a chromosome. Such tests are supported by precise statistical distribution theory, and consequently, the resulting P-values are quite reliable for making the decision. Results Our algorithm (named Design-Island, an acronym for Detection of Statistically Significant Genomic Island runs in two phases. Some 'putative GIs' are identified in the first phase, and those are refined into smaller segments containing horizontally acquired genes in the refinement phase. This method is applied to Salmonella typhi CT18 genome leading to the discovery of several new pathogenicity, antibiotic resistance and metabolic islands that were missed by earlier methods. Many of these islands contain mobile genetic elements like phage-mediated genes, transposons, integrase and IS elements confirming their horizontal acquirement. Conclusion The proposed method is based on statistical tests supported by precise distribution theory and reliable P-values along with a technique for visualizing statistically significant islands. The performance of our method is better than many other well known methods in terms of their sensitivity and accuracy, and in terms of specificity, it is comparable to other methods.

  12. Genomic landscape of human diversity across Madagascar

    Science.gov (United States)

    Pierron, Denis; Heiske, Margit; Razafindrazaka, Harilanto; Rakoto, Ignace; Rabetokotany, Nelly; Ravololomanga, Bodo; Rakotozafy, Lucien M.-A.; Rakotomalala, Mireille Mialy; Razafiarivony, Michel; Rasoarifetra, Bako; Raharijesy, Miakabola Andriamampianina; Razafindralambo, Lolona; Ramilisonina; Fanony, Fulgence; Lejamble, Sendra; Thomas, Olivier; Mohamed Abdallah, Ahmed; Rocher, Christophe; Arachiche, Amal; Tonaso, Laure; Pereda-loth, Veronica; Schiavinato, Stéphanie; Brucato, Nicolas; Ricaut, Francois-Xavier; Kusuma, Pradiptajati; Sudoyo, Herawati; Ni, Shengyu; Boland, Anne; Deleuze, Jean-Francois; Beaujard, Philippe; Grange, Philippe; Adelaar, Sander; Stoneking, Mark; Rakotoarisoa, Jean-Aimé; Radimilahy, Chantal; Letellier, Thierry

    2017-01-01

    Although situated ∼400 km from the east coast of Africa, Madagascar exhibits cultural, linguistic, and genetic traits from both Southeast Asia and Eastern Africa. The settlement history remains contentious; we therefore used a grid-based approach to sample at high resolution the genomic diversity (including maternal lineages, paternal lineages, and genome-wide data) across 257 villages and 2,704 Malagasy individuals. We find a common Bantu and Austronesian descent for all Malagasy individuals with a limited paternal contribution from Europe and the Middle East. Admixture and demographic growth happened recently, suggesting a rapid settlement of Madagascar during the last millennium. However, the distribution of African and Asian ancestry across the island reveals that the admixture was sex biased and happened heterogeneously across Madagascar, suggesting independent colonization of Madagascar from Africa and Asia rather than settlement by an already admixed population. In addition, there are geographic influences on the present genomic diversity, independent of the admixture, showing that a few centuries is sufficient to produce detectable genetic structure in human populations. PMID:28716916

  13. Genome BLAST distance phylogenies inferred from whole plastid and whole mitochondrion genome sequences

    Directory of Open Access Journals (Sweden)

    Holland Barbara R

    2006-07-01

    Full Text Available Abstract Background Phylogenetic methods which do not rely on multiple sequence alignments are important tools in inferring trees directly from completely sequenced genomes. Here, we extend the recently described Genome BLAST Distance Phylogeny (GBDP strategy to compute phylogenetic trees from all completely sequenced plastid genomes currently available and from a selection of mitochondrial genomes representing the major eukaryotic lineages. BLASTN, TBLASTX, or combinations of both are used to locate high-scoring segment pairs (HSPs between two sequences from which pairwise similarities and distances are computed in different ways resulting in a total of 96 GBDP variants. The suitability of these distance formulae for phylogeny reconstruction is directly estimated by computing a recently described measure of "treelikeness", the so-called δ value, from the respective distance matrices. Additionally, we compare the trees inferred from these matrices using UPGMA, NJ, BIONJ, FastME, or STC, respectively, with the NCBI taxonomy tree of the taxa under study. Results Our results indicate that, at this taxonomic level, plastid genomes are much more valuable for inferring phylogenies than are mitochondrial genomes, and that distances based on breakpoints are of little use. Distances based on the proportion of "matched" HSP length to average genome length were best for tree estimation. Additionally we found that using TBLASTX instead of BLASTN and, particularly, combining TBLASTX and BLASTN leads to a small but significant increase in accuracy. Other factors do not significantly affect the phylogenetic outcome. The BIONJ algorithm results in phylogenies most in accordance with the current NCBI taxonomy, with NJ and FastME performing insignificantly worse, and STC performing as well if applied to high quality distance matrices. δ values are found to be a reliable predictor of phylogenetic accuracy. Conclusion Using the most treelike distance matrices, as

  14. Genome Compositional Organization in Gars Shows More Similarities to Mammals than to Other Ray-Finned Fish.

    Science.gov (United States)

    Symonová, Radka; Majtánová, Zuzana; Arias-Rodriguez, Lenin; Mořkovský, Libor; Kořínková, Tereza; Cavin, Lionel; Pokorná, Martina Johnson; Doležálková, Marie; Flajšhans, Martin; Normandeau, Eric; Ráb, Petr; Meyer, Axel; Bernatchez, Louis

    2017-11-01

    Genomic GC content can vary locally, and GC-rich regions are usually associated with increased DNA thermostability in thermophilic prokaryotes and warm-blooded eukaryotes. Among vertebrates, fish and amphibians appeared to possess a distinctly less heterogeneous AT/GC organization in their genomes, whereas cytogenetically detectable GC heterogeneity has so far only been documented in mammals and birds. The subject of our study is the gar, an ancient "living fossil" of a basal ray-finned fish lineage, known from the Cretaceous period. We carried out cytogenomic analysis in two gar genera (Atractosteus and Lepisosteus) uncovering a GC chromosomal pattern uncharacteristic for fish. Bioinformatic analysis of the spotted gar (Lepisosteus oculatus) confirmed a GC compartmentalization on GC profiles of linkage groups. This indicates a rather mammalian mode of compositional organization on gar chromosomes. Gars are thus the only analyzed extant ray-finned fishes with a GC compartmentalized genome. Since gars are cold-blooded anamniotes, our results contradict the generally accepted hypothesis that the phylogenomic onset of GC compartmentalization occurred near the origin of amniotes. Ecophysiological findings of other authors indicate a metabolic similarity of gars with mammals. We hypothesize that gars might have undergone convergent evolution with the tetrapod lineages leading to mammals on both metabolic and genomic levels. Their metabolic adaptations might have left footprints in their compositional genome evolution, as proposed by the metabolic rate hypothesis. The genome organization described here in gars sheds new light on the compositional genome evolution in vertebrates generally and contributes to better understanding of the complexities of the mechanisms involved in this process. © 2016 Wiley Periodicals, Inc.

  15. Segmental Vitiligo.

    Science.gov (United States)

    van Geel, Nanja; Speeckaert, Reinhart

    2017-04-01

    Segmental vitiligo is characterized by its early onset, rapid stabilization, and unilateral distribution. Recent evidence suggests that segmental and nonsegmental vitiligo could represent variants of the same disease spectrum. Observational studies with respect to its distribution pattern point to a possible role of cutaneous mosaicism, whereas the original stated dermatomal distribution seems to be a misnomer. Although the exact pathogenic mechanism behind the melanocyte destruction is still unknown, increasing evidence has been published on the autoimmune/inflammatory theory of segmental vitiligo. Copyright © 2016 Elsevier Inc. All rights reserved.

  16. Segmental vitiligo with segmental morphea: An autoimmune link?

    Directory of Open Access Journals (Sweden)

    Pravesh Yadav

    2014-01-01

    Full Text Available An 18-year old girl with segmental vitiligo involving the left side of the trunk and left upper limb with segmental morphea involving the right side of trunk and right upper limb without any deeper involvement is illustrated. There was no history of preceding drug intake, vaccination, trauma, radiation therapy, infection, or hormonal therapy. Family history of stable vitiligo in her brother and a history of type II diabetes mellitus in the father were elicited. Screening for autoimmune diseases and antithyroid antibody was negative. An autoimmune link explaining the co-occurrence has been proposed. Cutaneous mosiacism could explain the presence of both the pathologies in a segmental distribution.

  17. Genomic prediction based on data from three layer lines using non-linear regression models

    NARCIS (Netherlands)

    Huang, H.; Windig, J.J.; Vereijken, A.; Calus, M.P.L.

    2014-01-01

    Background - Most studies on genomic prediction with reference populations that include multiple lines or breeds have used linear models. Data heterogeneity due to using multiple populations may conflict with model assumptions used in linear regression methods. Methods - In an attempt to alleviate

  18. Market Segmentation in Business Technology Base: The Case of Segmentation of Sparkling

    Directory of Open Access Journals (Sweden)

    Valéria Riscarolli

    2014-08-01

    Full Text Available A common market segmentation premise for products and services rules consumer behavior as the segmentation center piece. Would this be the logic for segmentation used by small technology based companies? In this article we target at determining the principles of market segmentation used by a vitiwinery company, as research object. This company is recognized by its products excellence, either in domestic as well as in the foreign market, among 13 distinct countries. The research method used is a case study, through information from the company’s CEOs and crossed by primary information from observation and formal registries and documents of the company. In this research we look at sparkling wines market segmentation. Main results indicate that the winery studied considers only technological elements as the basis to build a market segment. One may conclude that a market segmentation for this company is based upon technological dominion of sparkling wines production, aligned with a premium-price policy. In the company, directorship believes that as sparkling wines market is still incipient in the country, sparkling wine market segments will form and consolidate after the evolution of consumers tasting preferences, depending on technologies that boost sparkling wines quality. 

  19. Producing genome structure populations with the dynamic and automated PGS software.

    Science.gov (United States)

    Hua, Nan; Tjong, Harianto; Shin, Hanjun; Gong, Ke; Zhou, Xianghong Jasmine; Alber, Frank

    2018-05-01

    Chromosome conformation capture technologies such as Hi-C are widely used to investigate the spatial organization of genomes. Because genome structures can vary considerably between individual cells of a population, interpreting ensemble-averaged Hi-C data can be challenging, in particular for long-range and interchromosomal interactions. We pioneered a probabilistic approach for the generation of a population of distinct diploid 3D genome structures consistent with all the chromatin-chromatin interaction probabilities from Hi-C experiments. Each structure in the population is a physical model of the genome in 3D. Analysis of these models yields new insights into the causes and the functional properties of the genome's organization in space and time. We provide a user-friendly software package, called PGS, which runs on local machines (for practice runs) and high-performance computing platforms. PGS takes a genome-wide Hi-C contact frequency matrix, along with information about genome segmentation, and produces an ensemble of 3D genome structures entirely consistent with the input. The software automatically generates an analysis report, and provides tools to extract and analyze the 3D coordinates of specific domains. Basic Linux command-line knowledge is sufficient for using this software. A typical running time of the pipeline is ∼3 d with 300 cores on a computer cluster to generate a population of 1,000 diploid genome structures at topological-associated domain (TAD)-level resolution.

  20. The sea lamprey meiotic map improves resolution of ancient vertebrate genome duplications.

    Science.gov (United States)

    Smith, Jeramiah J; Keinath, Melissa C

    2015-08-01

    It is generally accepted that many genes present in vertebrate genomes owe their origin to two whole-genome duplications that occurred deep in the ancestry of the vertebrate lineage. However, details regarding the timing and outcome of these duplications are not well resolved. We present high-density meiotic and comparative genomic maps for the sea lamprey (Petromyzon marinus), a representative of an ancient lineage that diverged from all other vertebrates ∼550 million years ago. Linkage analyses yielded a total of 95 linkage groups, similar to the estimated number of germline chromosomes (1n ∼ 99), spanning a total of 5570.25 cM. Comparative mapping data yield strong support for the hypothesis that a single whole-genome duplication occurred in the basal vertebrate lineage, but do not strongly support a hypothetical second event. Rather, these comparative maps reveal several evolutionarily independent segmental duplications occurring over the last 600+ million years of chordate evolution. This refined history of vertebrate genome duplication should permit more precise investigations of vertebrate evolution. © 2015 Smith and Keinath; Published by Cold Spring Harbor Laboratory Press.

  1. Statistical properties of thermodynamically predicted RNA secondary structures in viral genomes

    Science.gov (United States)

    Spanò, M.; Lillo, F.; Miccichè, S.; Mantegna, R. N.

    2008-10-01

    By performing a comprehensive study on 1832 segments of 1212 complete genomes of viruses, we show that in viral genomes the hairpin structures of thermodynamically predicted RNA secondary structures are more abundant than expected under a simple random null hypothesis. The detected hairpin structures of RNA secondary structures are present both in coding and in noncoding regions for the four groups of viruses categorized as dsDNA, dsRNA, ssDNA and ssRNA. For all groups, hairpin structures of RNA secondary structures are detected more frequently than expected for a random null hypothesis in noncoding rather than in coding regions. However, potential RNA secondary structures are also present in coding regions of dsDNA group. In fact, we detect evolutionary conserved RNA secondary structures in conserved coding and noncoding regions of a large set of complete genomes of dsDNA herpesviruses.

  2. A Parvovirus B19 synthetic genome: sequence features and functional competence.

    Science.gov (United States)

    Manaresi, Elisabetta; Conti, Ilaria; Bua, Gloria; Bonvicini, Francesca; Gallinella, Giorgio

    2017-08-01

    Central to genetic studies for Parvovirus B19 (B19V) is the availability of genomic clones that may possess functional competence and ability to generate infectious virus. In our study, we established a new model genetic system for Parvovirus B19. A synthetic approach was followed, by design of a reference genome sequence, by generation of a corresponding artificial construct and its molecular cloning in a complete and functional form, and by setup of an efficient strategy to generate infectious virus, via transfection in UT7/EpoS1 cells and amplification in erythroid progenitor cells. The synthetic genome was able to generate virus with biological properties paralleling those of native virus, its infectious activity being dependent on the preservation of self-complementarity and sequence heterogeneity within the terminal regions. A virus of defined genome sequence, obtained from controlled cell culture conditions, can constitute a reference tool for investigation of the structural and functional characteristics of the virus. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. The three-dimensional genome organization of Drosophila melanogaster through data integration.

    Science.gov (United States)

    Li, Qingjiao; Tjong, Harianto; Li, Xiao; Gong, Ke; Zhou, Xianghong Jasmine; Chiolo, Irene; Alber, Frank

    2017-07-31

    Genome structures are dynamic and non-randomly organized in the nucleus of higher eukaryotes. To maximize the accuracy and coverage of three-dimensional genome structural models, it is important to integrate all available sources of experimental information about a genome's organization. It remains a major challenge to integrate such data from various complementary experimental methods. Here, we present an approach for data integration to determine a population of complete three-dimensional genome structures that are statistically consistent with data from both genome-wide chromosome conformation capture (Hi-C) and lamina-DamID experiments. Our structures resolve the genome at the resolution of topological domains, and reproduce simultaneously both sets of experimental data. Importantly, this data deconvolution framework allows for structural heterogeneity between cells, and hence accounts for the expected plasticity of genome structures. As a case study we choose Drosophila melanogaster embryonic cells, for which both data types are available. Our three-dimensional genome structures have strong predictive power for structural features not directly visible in the initial data sets, and reproduce experimental hallmarks of the D. melanogaster genome organization from independent and our own imaging experiments. Also they reveal a number of new insights about genome organization and its functional relevance, including the preferred locations of heterochromatic satellites of different chromosomes, and observations about homologous pairing that cannot be directly observed in the original Hi-C or lamina-DamID data. Our approach allows systematic integration of Hi-C and lamina-DamID data for complete three-dimensional genome structure calculation, while also explicitly considering genome structural variability.

  4. Fluence map segmentation

    International Nuclear Information System (INIS)

    Rosenwald, J.-C.

    2008-01-01

    The lecture addressed the following topics: 'Interpreting' the fluence map; The sequencer; Reasons for difference between desired and actual fluence map; Principle of 'Step and Shoot' segmentation; Large number of solutions for given fluence map; Optimizing 'step and shoot' segmentation; The interdigitation constraint; Main algorithms; Conclusions on segmentation algorithms (static mode); Optimizing intensity levels and monitor units; Sliding window sequencing; Synchronization to avoid the tongue-and-groove effect; Accounting for physical characteristics of MLC; Importance of corrections for leaf transmission and offset; Accounting for MLC mechanical constraints; The 'complexity' factor; Incorporating the sequencing into optimization algorithm; Data transfer to the treatment machine; Interface between R and V and accelerator; and Conclusions on fluence map segmentation (Segmentation is part of the overall inverse planning procedure; 'Step and Shoot' and 'Dynamic' options are available for most TPS (depending on accelerator model; The segmentation phase tends to come into the optimization loop; The physical characteristics of the MLC have a large influence on final dose distribution; The IMRT plans (MU and relative dose distribution) must be carefully validated). (P.A.)

  5. Strategic market segmentation

    Directory of Open Access Journals (Sweden)

    Maričić Branko R.

    2015-01-01

    Full Text Available Strategic planning of marketing activities is the basis of business success in modern business environment. Customers are not homogenous in their preferences and expectations. Formulating an adequate marketing strategy, focused on realization of company's strategic objectives, requires segmented approach to the market that appreciates differences in expectations and preferences of customers. One of significant activities in strategic planning of marketing activities is market segmentation. Strategic planning imposes a need to plan marketing activities according to strategically important segments on the long term basis. At the same time, there is a need to revise and adapt marketing activities on the short term basis. There are number of criteria based on which market segmentation is performed. The paper will consider effectiveness and efficiency of different market segmentation criteria based on empirical research of customer expectations and preferences. The analysis will include traditional criteria and criteria based on behavioral model. The research implications will be analyzed from the perspective of selection of the most adequate market segmentation criteria in strategic planning of marketing activities.

  6. Supervised variational model with statistical inference and its application in medical image segmentation.

    Science.gov (United States)

    Li, Changyang; Wang, Xiuying; Eberl, Stefan; Fulham, Michael; Yin, Yong; Dagan Feng, David

    2015-01-01

    Automated and general medical image segmentation can be challenging because the foreground and the background may have complicated and overlapping density distributions in medical imaging. Conventional region-based level set algorithms often assume piecewise constant or piecewise smooth for segments, which are implausible for general medical image segmentation. Furthermore, low contrast and noise make identification of the boundaries between foreground and background difficult for edge-based level set algorithms. Thus, to address these problems, we suggest a supervised variational level set segmentation model to harness the statistical region energy functional with a weighted probability approximation. Our approach models the region density distributions by using the mixture-of-mixtures Gaussian model to better approximate real intensity distributions and distinguish statistical intensity differences between foreground and background. The region-based statistical model in our algorithm can intuitively provide better performance on noisy images. We constructed a weighted probability map on graphs to incorporate spatial indications from user input with a contextual constraint based on the minimization of contextual graphs energy functional. We measured the performance of our approach on ten noisy synthetic images and 58 medical datasets with heterogeneous intensities and ill-defined boundaries and compared our technique to the Chan-Vese region-based level set model, the geodesic active contour model with distance regularization, and the random walker model. Our method consistently achieved the highest Dice similarity coefficient when compared to the other methods.

  7. Why segmentation matters: Experience-driven segmentation errors impair "morpheme" learning.

    Science.gov (United States)

    Finn, Amy S; Hudson Kam, Carla L

    2015-09-01

    We ask whether an adult learner's knowledge of their native language impedes statistical learning in a new language beyond just word segmentation (as previously shown). In particular, we examine the impact of native-language word-form phonotactics on learners' ability to segment words into their component morphemes and learn phonologically triggered variation of morphemes. We find that learning is impaired when words and component morphemes are structured to conflict with a learner's native-language phonotactic system, but not when native-language phonotactics do not conflict with morpheme boundaries in the artificial language. A learner's native-language knowledge can therefore have a cascading impact affecting word segmentation and the morphological variation that relies upon proper segmentation. These results show that getting word segmentation right early in learning is deeply important for learning other aspects of language, even those (morphology) that are known to pose a great difficulty for adult language learners. (c) 2015 APA, all rights reserved).

  8. Consumer segmentation as a means to investigate emotional associations to meals.

    Science.gov (United States)

    Piqueras-Fiszman, Betina; Jaeger, Sara R

    2016-10-01

    Consumers naturally associate emotions to meal occasions and understanding these can advance knowledge of food-related behaviours and attitudes. The present study used an online survey to investigate the emotional associations that people have with recalled meals: 'memorable' (MM) and 'routine' evening (RM). Heterogeneity in the studied consumer population (UK adults, n = 576 and 571, respectively) was accounted for using a data-driven approach to establish emotion-based segments. Two groups of people were identified with very different emotional response patterns to recalled meals. For 'memorable' and 'routine' meals the majority of people (Cluster 1) held strong positive and weak negative emotional associations. In Cluster 2, positive emotions remained more strongly associated than negative emotions, but much less so. In accordance with findings based on other response variables (e.g., preference, attitudes), psychographic variables accounted better for the heterogeneity found in the emotion associations than socio-demographic variables. Participants' level of meal engagement and difficulty in describing feelings (DDF scale) were the two most important predictors of cluster membership. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Evolution of gastropod mitochondrial genome arrangements

    Directory of Open Access Journals (Sweden)

    Zardoya Rafael

    2008-02-01

    Full Text Available Abstract Background Gastropod mitochondrial genomes exhibit an unusually great variety of gene orders compared to other metazoan mitochondrial genome such as e.g those of vertebrates. Hence, gastropod mitochondrial genomes constitute a good model system to study patterns, rates, and mechanisms of mitochondrial genome rearrangement. However, this kind of evolutionary comparative analysis requires a robust phylogenetic framework of the group under study, which has been elusive so far for gastropods in spite of the efforts carried out during the last two decades. Here, we report the complete nucleotide sequence of five mitochondrial genomes of gastropods (Pyramidella dolabrata, Ascobulla fragilis, Siphonaria pectinata, Onchidella celtica, and Myosotella myosotis, and we analyze them together with another ten complete mitochondrial genomes of gastropods currently available in molecular databases in order to reconstruct the phylogenetic relationships among the main lineages of gastropods. Results Comparative analyses with other mollusk mitochondrial genomes allowed us to describe molecular features and general trends in the evolution of mitochondrial genome organization in gastropods. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (ME, MP, ML, BI arrived at a single topology, which was used to reconstruct the evolution of mitochondrial gene rearrangements in the group. Conclusion Four main lineages were identified within gastropods: Caenogastropoda, Vetigastropoda, Patellogastropoda, and Heterobranchia. Caenogastropoda and Vetigastropoda are sister taxa, as well as, Patellogastropoda and Heterobranchia. This result rejects the validity of the derived clade Apogastropoda (Caenogastropoda + Heterobranchia. The position of Patellogastropoda remains unclear likely due to long-branch attraction biases. Within Heterobranchia, the most heterogeneous group of gastropods, neither Euthyneura (because of the inclusion of P

  10. Volumetric segmentation of ADC maps and utility of standard deviation as measure of tumor heterogeneity in soft tissue tumors.

    Science.gov (United States)

    Singer, Adam D; Pattany, Pradip M; Fayad, Laura M; Tresley, Jonathan; Subhawong, Ty K

    2016-01-01

    Determine interobserver concordance of semiautomated three-dimensional volumetric and two-dimensional manual measurements of apparent diffusion coefficient (ADC) values in soft tissue masses (STMs) and explore standard deviation (SD) as a measure of tumor ADC heterogeneity. Concordance correlation coefficients for mean ADC increased with more extensive sampling. Agreement on the SD of tumor ADC values was better for large regions of interest and multislice methods. Correlation between mean and SD ADC was low, suggesting that these parameters are relatively independent. Mean ADC of STMs can be determined by volumetric quantification with high interobserver agreement. STM heterogeneity merits further investigation as a potential imaging biomarker that complements other functional magnetic resonance imaging parameters. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Comparative genome analysis of Pseudogymnoascus spp. reveals primarily clonal evolution with small genome fragments exchanged between lineages.

    Science.gov (United States)

    Leushkin, Evgeny V; Logacheva, Maria D; Penin, Aleksey A; Sutormin, Roman A; Gerasimov, Evgeny S; Kochkina, Galina A; Ivanushkina, Natalia E; Vasilenko, Oleg V; Kondrashov, Alexey S; Ozerskaya, Svetlana M

    2015-05-21

    Pseudogymnoascus spp. is a wide group of fungi lineages in the family Pseudorotiaceae including an aggressive pathogen of bats P. destructans. Although several lineages of P. spp. were shown to produce ascospores in culture, the vast majority of P. spp. demonstrates no evidence of sexual reproduction. P. spp. can tolerate a wide range of different temperatures and salinities and can survive even in permafrost layer. Adaptability of P. spp. to different environments is accompanied by extremely variable morphology and physiology. We sequenced genotypes of 14 strains of P. spp., 5 of which were extracted from permafrost, 1 from a cryopeg, a layer of unfrozen ground in permafrost, and 8 from temperate surface environments. All sequenced genotypes are haploid. Nucleotide diversity among these genomes is very high, with a typical evolutionary distance at synonymous sites dS ≈ 0.5, suggesting that the last common ancestor of these strains lived >50 Mya. The strains extracted from permafrost do not form a separate clade. Instead, each permafrost strain has close relatives from temperate environments. We observed a strictly clonal population structure with no conflicting topologies for ~99% of genome sequences. However, there is a number of short (~100-10,000 nt) genomic segments with the total length of 67.6 Kb which possess phylogenetic patterns strikingly different from the rest of the genome. The most remarkable case is a MAT-locus, which has 2 distinct alleles interspersed along the whole-genome phylogenetic tree. Predominantly clonal structure of genome sequences is consistent with the observations that sexual reproduction is rare in P. spp. Small number of regions with noncanonical phylogenies seem to arise due to some recombination events between derived lineages of P. spp., with MAT-locus being transferred on multiple occasions. All sequenced strains have heterothallic configuration of MAT-locus.

  12. The Rosa genome provides new insights into the domestication of modern roses.

    Science.gov (United States)

    Raymond, Olivier; Gouzy, Jérôme; Just, Jérémy; Badouin, Hélène; Verdenaud, Marion; Lemainque, Arnaud; Vergne, Philippe; Moja, Sandrine; Choisne, Nathalie; Pont, Caroline; Carrère, Sébastien; Caissard, Jean-Claude; Couloux, Arnaud; Cottret, Ludovic; Aury, Jean-Marc; Szécsi, Judit; Latrasse, David; Madoui, Mohammed-Amin; François, Léa; Fu, Xiaopeng; Yang, Shu-Hua; Dubois, Annick; Piola, Florence; Larrieu, Antoine; Perez, Magali; Labadie, Karine; Perrier, Lauriane; Govetto, Benjamin; Labrousse, Yoan; Villand, Priscilla; Bardoux, Claudia; Boltz, Véronique; Lopez-Roques, Céline; Heitzler, Pascal; Vernoux, Teva; Vandenbussche, Michiel; Quesneville, Hadi; Boualem, Adnane; Bendahmane, Abdelhafid; Liu, Chang; Le Bris, Manuel; Salse, Jérôme; Baudino, Sylvie; Benhamed, Moussa; Wincker, Patrick; Bendahmane, Mohammed

    2018-06-01

    Roses have high cultural and economic importance as ornamental plants and in the perfume industry. We report the rose whole-genome sequencing and assembly and resequencing of major genotypes that contributed to rose domestication. We generated a homozygous genotype from a heterozygous diploid modern rose progenitor, Rosa chinensis 'Old Blush'. Using single-molecule real-time sequencing and a meta-assembly approach, we obtained one of the most comprehensive plant genomes to date. Diversity analyses highlighted the mosaic origin of 'La France', one of the first hybrids combining the growth vigor of European species and the recurrent blooming of Chinese species. Genomic segments of Chinese ancestry identified new candidate genes for recurrent blooming. Reconstructing regulatory and secondary metabolism pathways allowed us to propose a model of interconnected regulation of scent and flower color. This genome provides a foundation for understanding the mechanisms governing rose traits and should accelerate improvement in roses, Rosaceae and ornamentals.

  13. Cluster analysis as a tool of guests segmentation by the degree of their demand

    Directory of Open Access Journals (Sweden)

    Damijan Mumel

    2002-01-01

    Full Text Available Authors demonstrate the use of cluster analysis in finding out (ascertaining the homogenity/heterogenity of guests as to the degree of their demand. The degree of guests’ demand is defined according to the importance of perceived service quality components measured by SERVQUAL, which was adopted and adapted, according to the specifics of health spa industry in Slovenia. Goals of the article are: (a the identification of the profile of importance of general health spa service quality components, and (b the identification of groups of guests (segments according to the degree of their demand in the research in 1991 compared with 1999. Cluster analysis serves as useful tool for guest segmentation since it reveals the existence of important differences in the structure of guests in the year 1991 compared with the year 1999. The results serve as a useful database for management in health spas.

  14. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  15. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    Energy Technology Data Exchange (ETDEWEB)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  16. Anterior segment developmental anomalies in a 33-week-old fetus with MIDAS syndrome.

    Science.gov (United States)

    Herwig, Martina C; Loeffler, Karin U; Gembruch, Ulrich; Kuchelmeister, Klaus; Müller, Annette M

    2014-01-01

    We report anterior segment abnormalities in both eyes of a 33-week-old fetus endorsing the diagnosis of MIDAS (microphthalmia, dermal aplasia, and sclerocornea) syndrome. After abortion, the fetus was examined by a standard pediatric autopsy that included macroscopic and microscopic examination of both eyes. Postmortem findings included craniofacial stigmata (such as hypertelorism, a flat nose and low-set ears) and an agenesis of the corpus callosum. Array comparative genomic hybridization revealed a deletion of the short arm of the X chromosome (region Xp22.2 to p22.32). Ophthalmopathologic examination of the eyes revealed microphthalmia with anterior segment developmental anomalies, in particular sclerocornea and Peters' anomaly, respectively. General pathology findings plus the ocular findings allowed the diagnosis of MIDAS syndrome. A discussion of differential diagnoses is provided. This case report indicates that ophthalmopathologic investigation of fetal eyes can be of great value for the further classification of syndromes.

  17. Genomics meets applied ecology: Characterizing habitat quality for sloths in a tropical agroecosystem.

    Science.gov (United States)

    Fountain, Emily D; Kang, Jung Koo; Tempel, Douglas J; Palsbøll, Per J; Pauli, Jonathan N; Zachariah Peery, M

    2018-01-01

    Understanding how habitat quality in heterogeneous landscapes governs the distribution and fitness of individuals is a fundamental aspect of ecology. While mean individual fitness is generally considered a key to assessing habitat quality, a comprehensive understanding of habitat quality in heterogeneous landscapes requires estimates of dispersal rates among habitat types. The increasing accessibility of genomic approaches, combined with field-based demographic methods, provides novel opportunities for incorporating dispersal estimation into assessments of habitat quality. In this study, we integrated genomic kinship approaches with field-based estimates of fitness components and approximate Bayesian computation (ABC) procedures to estimate habitat-specific dispersal rates and characterize habitat quality in two-toed sloths (Choloepus hoffmanni) occurring in a Costa Rican agricultural ecosystem. Field-based observations indicated that birth and survival rates were similar in a sparsely shaded cacao farm and adjacent cattle pasture-forest mosaic. Sloth density was threefold higher in pasture compared with cacao, whereas home range size and overlap were greater in cacao compared with pasture. Dispersal rates were similar between the two habitats, as estimated using ABC procedures applied to the spatial distribution of pairs of related individuals identified using 3,431 single nucleotide polymorphism and 11 microsatellite locus genotypes. Our results indicate that crops produced under a sparse overstorey can, in some cases, constitute lower-quality habitat than pasture-forest mosaics for sloths, perhaps because of differences in food resources or predator communities. Finally, our study demonstrates that integrating field-based demographic approaches with genomic methods can provide a powerful means for characterizing habitat quality for animal populations occurring in heterogeneous landscapes. © 2017 John Wiley & Sons Ltd.

  18. Genome-wide meta-analysis identifies multiple novel associations and ethnic heterogeneity of psoriasis susceptibility

    NARCIS (Netherlands)

    Yin, Xianyong; Low, Hui Qi; Wang, Ling; Li, Yonghong; Ellinghaus, Eva; Han, Jiali; Estivill, Xavier; Sun, Liangdan; Zuo, Xianbo; Shen, Changbing; Zhu, Caihong; Zhang, Anping; Sanchez, Fabio; Padyukov, Leonid; Catanese, Joseph J; Krueger, Gerald G; Duffin, Kristina Callis; Mucha, Sören; Weichenthal, Michael; Weidinger, Stephan; Lieb, Wolfgang; Foo, Jia Nee; Li, Yi; Sim, Karseng; Liany, Herty; Irwan, Ishak; Teo, Yikying; Theng, Colin T S; Gupta, Rashmi; Bowcock, Anne; De Jager, Philip L; Qureshi, Abrar A; de Bakker, Paul I W; Seielstad, Mark; Liao, Wilson; Ståhle, Mona; Franke, Andre; Zhang, Xuejun; Liu, Jianjun

    2015-01-01

    Psoriasis is a common inflammatory skin disease with complex genetics and different degrees of prevalence across ethnic populations. Here we present the largest trans-ethnic genome-wide meta-analysis (GWMA) of psoriasis in 15,369 cases and 19,517 controls of Caucasian and Chinese ancestries. We

  19. Quantification of expression and methylation of the Igf2r imprinted gene in segmental trisomic mouse model

    Czech Academy of Sciences Publication Activity Database

    Vacík, Tomáš; Forejt, Jiří

    2003-01-01

    Roč. 82, - (2003), s. 261-268 ISSN 0888-7543 R&D Projects: GA MŠk LN00A079; GA ČR GV204/98/K015 Grant - others:HHMI(US) 555000306 Institutional research plan: CEZ:AV0Z5052915 Keywords : Genomic imprinting * dosage-sensitive genes * Ts43H segmental trisomy of chromosome 17 Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.488, year: 2003

  20. Draft genome sequence of four coccolithoviruses: Emiliania huxleyi virus EhV-88, EhV-201, EhV-207, and EhV-208.

    Science.gov (United States)

    Nissimov, Jozef I; Worthy, Charlotte A; Rooks, Paul; Napier, Johnathan A; Kimmance, Susan A; Henn, Matthew R; Ogata, Hiroyuki; Allen, Michael J

    2012-03-01

    The Coccolithoviridae are a group of viruses which infect the marine coccolithophorid microalga Emiliania huxleyi. The Emiliania huxleyi viruses (known as EhVs) described herein have 160- to 180-nm diameter icosahedral structures, have genomes of approximately 400 kbp, and consist of more than 450 predicted coding sequences (CDSs). Here, we describe the genomic features of four newly sequenced coccolithoviruses (EhV-88, EhV-201, EhV-207, and EhV-208) together with their draft genome sequences and their annotations, highlighting the homology and heterogeneity of these genomes to the EhV-86 model reference genome.

  1. Addressing challenges of heterogeneous tumor treatment through bispecific protein-mediated pretargeted drug delivery.

    Science.gov (United States)

    Yang, Qi; Parker, Christina L; McCallen, Justin D; Lai, Samuel K

    2015-12-28

    Tumors are frequently characterized by genomically and phenotypically distinct cancer cell subpopulations within the same tumor or between tumor lesions, a phenomenon termed tumor heterogeneity. These diverse cancer cell populations pose a major challenge to targeted delivery of diagnostic and/or therapeutic agents, as the conventional approach of conjugating individual ligands to nanoparticles is often unable to facilitate intracellular delivery to the full spectrum of cancer cells present in a given tumor lesion or patient. As a result, many cancers are only partially suppressed, leading to eventual tumor regrowth and/or the development of drug-resistant tumors. Pretargeting (multistep targeting) approaches involving the administration of 1) a cocktail of bispecific proteins that can collectively bind to the entirety of a mixed tumor population followed by 2) nanoparticles containing therapeutic and/or diagnostic agents that can bind to the bispecific proteins accumulated on the surface of target cells offer the potential to overcome many of the challenges associated with drug delivery to heterogeneous tumors. Despite its considerable success in improving the efficacy of radioimmunotherapy, the pretargeting strategy remains underexplored for a majority of nanoparticle therapeutic applications, especially for targeted delivery to heterogeneous tumors. In this review, we will present concepts in tumor heterogeneity, the shortcomings of conventional targeted systems, lessons learned from pretargeted radioimmunotherapy, and important considerations for harnessing the pretargeting strategy to improve nanoparticle delivery to heterogeneous tumors. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. Complete mtDNA genomes of Filipino ethnolinguistic groups: a melting pot of recent and ancient lineages in the Asia-Pacific region

    Science.gov (United States)

    Delfin, Frederick; Min-Shan Ko, Albert; Li, Mingkun; Gunnarsdóttir, Ellen D; Tabbada, Kristina A; Salvador, Jazelyn M; Calacal, Gayvelline C; Sagum, Minerva S; Datar, Francisco A; Padilla, Sabino G; De Ungria, Maria Corazon A; Stoneking, Mark

    2014-01-01

    The Philippines is a strategic point in the Asia-Pacific region for the study of human diversity, history and origins, as it is a cross-road for human migrations and consequently exhibits enormous ethnolinguistic diversity. Following on a previous in-depth study of Y-chromosome variation, here we provide new insights into the maternal genetic history of Filipino ethnolinguistic groups by surveying complete mitochondrial DNA (mtDNA) genomes from a total of 14 groups (11 groups in this study and 3 groups previously published) including previously published mtDNA hypervariable segment (HVS) data from Filipino regional center groups. Comparison of HVS data indicate genetic differences between ethnolinguistic and regional center groups. The complete mtDNA genomes of 14 ethnolinguistic groups reveal genetic aspects consistent with the Y-chromosome, namely: diversity and heterogeneity of groups, no support for a simple dichotomy between Negrito and non-Negrito groups, and different genetic affinities with Asia-Pacific groups that are both ancient and recent. Although some mtDNA haplogroups can be associated with the Austronesian expansion, there are others that associate with South Asia, Near Oceania and Australia that are consistent with a southern migration route for ethnolinguistic group ancestors into the Asia-Pacific, with a timeline that overlaps with the initial colonization of the Asia-Pacific region, the initial colonization of the Philippines and a possible separate post-colonization migration into the Philippine archipelago. PMID:23756438

  3. Blending genomes: distributive conjugal transfer in mycobacteria, a sexier form of HGT.

    Science.gov (United States)

    Gray, Todd A; Derbyshire, Keith M

    2018-04-18

    This review discusses a novel form of horizontal gene transfer (HGT) found in mycobacteria called Distributive Conjugal Transfer (DCT). While satisfying the criteria for conjugation, DCT occurs by a mechanism so distinct from oriT-mediated conjugation that it could be considered a fourth category of HGT. DCT involves the transfer of chromosomal DNA between mycobacteria and, most significantly, generates transconjugants with mosaic genomes of the parental strains. Multiple segments of donor chromosomal DNA can be co-transferred regardless of their location or the genetic selection and, as a result, the transconjugant genome contains many donor-derived segments; hence the name DCT. This distinguishing feature of DCT separates it from the other known mechanisms of HGT, which generally result in the introduction of a single, defined segment of DNA into the recipient chromosome (Fig. ). Moreover, these mosaic progeny are generated from a single conjugal event, which provides enormous capacity for rapid adaptation and evolution, again distinguishing it from the three classical modes of HGT. Unsurprisingly, the unusual mosaic products of DCT are generated by a conjugal mechanism that is also unusual. Here, we will describe the unique features of DCT and contrast those to other mechanisms of HGT, both from a mechanistic and an evolutionary perspective. Our focus will be on transfer of chromosomal DNA, as opposed to plasmid mobilization, because DCT mediates transfer of chromosomal DNA and is a chromosomally encoded process. © 2018 John Wiley & Sons Ltd.

  4. Whole genome detection of rotavirus mixed infections in human, porcine and bovine samples co-infected with various rotavirus strains collected from sub-Saharan Africa.

    Science.gov (United States)

    Nyaga, Martin M; Jere, Khuzwayo C; Esona, Mathew D; Seheri, Mapaseka L; Stucker, Karla M; Halpin, Rebecca A; Akopov, Asmik; Stockwell, Timothy B; Peenze, Ina; Diop, Amadou; Ndiaye, Kader; Boula, Angeline; Maphalala, Gugu; Berejena, Chipo; Mwenda, Jason M; Steele, A Duncan; Wentworth, David E; Mphahlele, M Jeffrey

    2015-04-01

    Group A rotaviruses (RVA) are among the main global causes of severe diarrhea in children under the age of 5years. Strain diversity, mixed infections and untypeable RVA strains are frequently reported in Africa. We analysed rotavirus-positive human stool samples (n=13) obtained from hospitalised children under the age of 5years who presented with acute gastroenteritis at sentinel hospital sites in six African countries, as well as bovine and porcine stool samples (n=1 each), to gain insights into rotavirus diversity and evolution. Polyacrylamide gel electrophoresis (PAGE) analysis and genotyping with G-(VP7) and P-specific (VP4) typing primers suggested that 13 of the 15 samples contained more than 11 segments and/or mixed G/P genotypes. Full-length amplicons for each segment were generated using RVA-specific primers and sequenced using the Ion Torrent and/or Illumina MiSeq next-generation sequencing platforms. Sequencing detected at least one segment in each sample for which duplicate sequences, often having distinct genotypes, existed. This supported and extended the PAGE and RT-PCR genotyping findings that suggested these samples were collected from individuals that had mixed rotavirus infections. The study reports the first porcine (MRC-DPRU1567) and bovine (MRC-DPRU3010) mixed infections. We also report a unique genome segment 9 (VP7), whose G9 genotype belongs to lineage VI and clusters with porcine reference strains. Previously, African G9 strains have all been in lineage III. Furthermore, additional RVA segments isolated from humans have a clear evolutionary relationship with porcine, bovine and ovine rotavirus sequences, indicating relatively recent interspecies transmission and reassortment. Thus, multiple RVA strains from sub-Saharan Africa are infecting mammalian hosts with unpredictable variations in their gene segment combinations. Whole-genome sequence analyses of mixed RVA strains underscore the considerable diversity of rotavirus sequences and

  5. Novel applications of array comparative genomic hybridization in molecular diagnostics.

    Science.gov (United States)

    Cheung, Sau W; Bi, Weimin

    2018-05-31

    In 2004, the implementation of array comparative genomic hybridization (array comparative genome hybridization [CGH]) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes in a high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Area covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and most efficient clinical tool in detecting genome wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.

  6. Comparative analysis of catfish BAC end sequences with the zebrafish genome

    Directory of Open Access Journals (Sweden)

    Abernathy Jason

    2009-12-01

    Full Text Available Abstract Background Comparative mapping is a powerful tool to transfer genomic information from sequenced genomes to closely related species for which whole genome sequence data are not yet available. However, such an approach is still very limited in catfish, the most important aquaculture species in the United States. This project was initiated to generate additional BAC end sequences and demonstrate their applications in comparative mapping in catfish. Results We reported the generation of 43,000 BAC end sequences and their applications for comparative genome analysis in catfish. Using these and the additional 20,000 existing BAC end sequences as a resource along with linkage mapping and existing physical map, conserved syntenic regions were identified between the catfish and zebrafish genomes. A total of 10,943 catfish BAC end sequences (17.3% had significant BLAST hits to the zebrafish genome (cutoff value ≤ e-5, of which 3,221 were unique gene hits, providing a platform for comparative mapping based on locations of these genes in catfish and zebrafish. Genetic linkage mapping of microsatellites associated with contigs allowed identification of large conserved genomic segments and construction of super scaffolds. Conclusion BAC end sequences and their associated polymorphic markers are great resources for comparative genome analysis in catfish. Highly conserved chromosomal regions were identified to exist between catfish and zebrafish. However, it appears that the level of conservation at local genomic regions are high while a high level of chromosomal shuffling and rearrangements exist between catfish and zebrafish genomes. Orthologous regions established through comparative analysis should facilitate both structural and functional genome analysis in catfish.

  7. Convolutional neural network regression for short-axis left ventricle segmentation in cardiac cine MR sequences.

    Science.gov (United States)

    Tan, Li Kuo; Liew, Yih Miin; Lim, Einly; McLaughlin, Robert A

    2017-07-01

    Automated left ventricular (LV) segmentation is crucial for efficient quantification of cardiac function and morphology to aid subsequent management of cardiac pathologies. In this paper, we parameterize the complete (all short axis slices and phases) LV segmentation task in terms of the radial distances between the LV centerpoint and the endo- and epicardial contours in polar space. We then utilize convolutional neural network regression to infer these parameters. Utilizing parameter regression, as opposed to conventional pixel classification, allows the network to inherently reflect domain-specific physical constraints. We have benchmarked our approach primarily against the publicly-available left ventricle segmentation challenge (LVSC) dataset, which consists of 100 training and 100 validation cardiac MRI cases representing a heterogeneous mix of cardiac pathologies and imaging parameters across multiple centers. Our approach attained a .77 Jaccard index, which is the highest published overall result in comparison to other automated algorithms. To test general applicability, we also evaluated against the Kaggle Second Annual Data Science Bowl, where the evaluation metric was the indirect clinical measures of LV volume rather than direct myocardial contours. Our approach attained a Continuous Ranked Probability Score (CRPS) of .0124, which would have ranked tenth in the original challenge. With this we demonstrate the effectiveness of convolutional neural network regression paired with domain-specific features in clinical segmentation. Copyright © 2017 Elsevier B.V. All rights reserved.

  8. Deformable meshes for medical image segmentation accurate automatic segmentation of anatomical structures

    CERN Document Server

    Kainmueller, Dagmar

    2014-01-01

    ? Segmentation of anatomical structures in medical image data is an essential task in clinical practice. Dagmar Kainmueller introduces methods for accurate fully automatic segmentation of anatomical structures in 3D medical image data. The author's core methodological contribution is a novel deformation model that overcomes limitations of state-of-the-art Deformable Surface approaches, hence allowing for accurate segmentation of tip- and ridge-shaped features of anatomical structures. As for practical contributions, she proposes application-specific segmentation pipelines for a range of anatom

  9. Micromechanics Based Failure Analysis of Heterogeneous Materials

    Science.gov (United States)

    Sertse, Hamsasew M.

    In recent decades, heterogeneous materials are extensively used in various industries such as aerospace, defense, automotive and others due to their desirable specific properties and excellent capability of accumulating damage. Despite their wide use, there are numerous challenges associated with the application of these materials. One of the main challenges is lack of accurate tools to predict the initiation, progression and final failure of these materials under various thermomechanical loading conditions. Although failure is usually treated at the macro and meso-scale level, the initiation and growth of failure is a complex phenomena across multiple scales. The objective of this work is to enable the mechanics of structure genome (MSG) and its companion code SwiftComp to analyze the initial failure (also called static failure), progressive failure, and fatigue failure of heterogeneous materials using micromechanics approach. The initial failure is evaluated at each numerical integration point using pointwise and nonlocal approach for each constituent of the heterogeneous materials. The effects of imperfect interfaces among constituents of heterogeneous materials are also investigated using a linear traction-displacement model. Moreover, the progressive and fatigue damage analyses are conducted using continuum damage mechanics (CDM) approach. The various failure criteria are also applied at a material point to analyze progressive damage in each constituent. The constitutive equation of a damaged material is formulated based on a consistent irreversible thermodynamics approach. The overall tangent modulus of uncoupled elastoplastic damage for negligible back stress effect is derived. The initiation of plasticity and damage in each constituent is evaluated at each numerical integration point using a nonlocal approach. The accumulated plastic strain and anisotropic damage evolution variables are iteratively solved using an incremental algorithm. The damage analyses

  10. Plant STAND P-loop NTPases: a current perspective of genome distribution, evolution, and function : Plant STAND P-loop NTPases: genomic organization, evolution, and molecular mechanism models contribute broadly to plant pathogen defense.

    Science.gov (United States)

    Arya, Preeti; Acharya, Vishal

    2018-02-01

    STAND P-loop NTPase is the common weapon used by plant and other organisms from all three kingdoms of life to defend themselves against pathogen invasion. The purpose of this study is to review comprehensively the latest finding of plant STAND P-loop NTPase related to their genomic distribution, evolution, and their mechanism of action. Earlier, the plant STAND P-loop NTPase known to be comprised of only NBS-LRRs/AP-ATPase/NB-ARC ATPase. However, recent finding suggests that genome of early green plants comprised of two types of STAND P-loop NTPases: (1) mammalian NACHT NTPases and (2) NBS-LRRs. Moreover, YchF (unconventional G protein and members of P-loop NTPase) subfamily has been reported to be exceptionally involved in biotic stress (in case of Oryza sativa), thereby a novel member of STAND P-loop NTPase in green plants. The lineage-specific expansion and genome duplication events are responsible for abundance of plant STAND P-loop NTPases; where "moderate tandem and low segmental duplication" trajectory followed in majority of plant species with few exception (equal contribution of tandem and segmental duplication). Since the past decades, systematic research is being investigated into NBS-LRR function supported the direct recognition of pathogen or pathogen effectors by the latest models proposed via 'integrated decoy' or 'sensor domains' model. Here, we integrate the recently published findings together with the previous literature on the genomic distribution, evolution, and distinct models proposed for functional molecular mechanism of plant STAND P-loop NTPases.

  11. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding.

    Directory of Open Access Journals (Sweden)

    Anil Raj

    Full Text Available Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.

  12. msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding.

    Science.gov (United States)

    Raj, Anil; Shim, Heejung; Gilad, Yoav; Pritchard, Jonathan K; Stephens, Matthew

    2015-01-01

    Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.

  13. Sensitive detection of chromosomal segments of distinct ancestry in admixed populations.

    Directory of Open Access Journals (Sweden)

    Alkes L Price

    2009-06-01

    Full Text Available Identifying the ancestry of chromosomal segments of distinct ancestry has a wide range of applications from disease mapping to learning about history. Most methods require the use of unlinked markers; but, using all markers from genome-wide scanning arrays, it should in principle be possible to infer the ancestry of even very small segments with exquisite accuracy. We describe a method, HAPMIX, which employs an explicit population genetic model to perform such local ancestry inference based on fine-scale variation data. We show that HAPMIX outperforms other methods, and we explore its utility for inferring ancestry, learning about ancestral populations, and inferring dates of admixture. We validate the method empirically by applying it to populations that have experienced recent and ancient admixture: 935 African Americans from the United States and 29 Mozabites from North Africa. HAPMIX will be of particular utility for mapping disease genes in recently admixed populations, as its accurate estimates of local ancestry permit admixture and case-control association signals to be combined, enabling more powerful tests of association than with either signal alone.

  14. Analysis tools for the interplay between genome layout and regulation.

    Science.gov (United States)

    Bouyioukos, Costas; Elati, Mohamed; Képès, François

    2016-06-06

    Genome layout and gene regulation appear to be interdependent. Understanding this interdependence is key to exploring the dynamic nature of chromosome conformation and to engineering functional genomes. Evidence for non-random genome layout, defined as the relative positioning of either co-functional or co-regulated genes, stems from two main approaches. Firstly, the analysis of contiguous genome segments across species, has highlighted the conservation of gene arrangement (synteny) along chromosomal regions. Secondly, the study of long-range interactions along a chromosome has emphasised regularities in the positioning of microbial genes that are co-regulated, co-expressed or evolutionarily correlated. While one-dimensional pattern analysis is a mature field, it is often powerless on biological datasets which tend to be incomplete, and partly incorrect. Moreover, there is a lack of comprehensive, user-friendly tools to systematically analyse, visualise, integrate and exploit regularities along genomes. Here we present the Genome REgulatory and Architecture Tools SCAN (GREAT:SCAN) software for the systematic study of the interplay between genome layout and gene expression regulation. SCAN is a collection of related and interconnected applications currently able to perform systematic analyses of genome regularities as well as to improve transcription factor binding sites (TFBS) and gene regulatory network predictions based on gene positional information. We demonstrate the capabilities of these tools by studying on one hand the regular patterns of genome layout in the major regulons of the bacterium Escherichia coli. On the other hand, we demonstrate the capabilities to improve TFBS prediction in microbes. Finally, we highlight, by visualisation of multivariate techniques, the interplay between position and sequence information for effective transcription regulation.

  15. Genetic variation within and among Danish brown trout ( Salmo trutta L) hatchery strains, assessed by PCR-RFLP analysis of mitochondrial DNA segments

    DEFF Research Database (Denmark)

    Hansen, Michael Møller; Mensberg, Karen-Lise Dons; Rasmussen, Gorm

    1997-01-01

    Eleven Danish brown trout hatchery strains were studied by PCR- RFLP analysis of the ND-I and ND-5/6 segments of the mitochondrial genome. For comparison, data from wild trout representing three Danish river systems also were included. Reduced variability in terms of nucleon diversity and number...

  16. Speaker segmentation and clustering

    OpenAIRE

    Kotti, M; Moschou, V; Kotropoulos, C

    2008-01-01

    07.08.13 KB. Ok to add the accepted version to Spiral, Elsevier says ok whlile mandate not enforced. This survey focuses on two challenging speech processing topics, namely: speaker segmentation and speaker clustering. Speaker segmentation aims at finding speaker change points in an audio stream, whereas speaker clustering aims at grouping speech segments based on speaker characteristics. Model-based, metric-based, and hybrid speaker segmentation algorithms are reviewed. Concerning speaker...

  17. Symposium on single cell analysis and genomic approaches, Experimental Biology 2017 Chicago, Illinois, April 23, 2017.

    Science.gov (United States)

    Coller, Hilary A

    2017-09-01

    Emerging technologies for the analysis of genome-wide information in single cells have the potential to transform many fields of biology, including our understanding of cell states, the response of cells to external stimuli, mosaicism, and intratumor heterogeneity. At Experimental Biology 2017 in Chicago, Physiological Genomics hosted a symposium in which five leaders in the field of single cell genomics presented their recent research. The speakers discussed emerging methodologies in single cell analysis and critical issues for the analysis of single cell data. Also discussed were applications of single cell genomics to understanding the different types of cells within an organism or tissue and the basis for cell-to-cell variability in response to stimuli. Copyright © 2017 the American Physiological Society.

  18. Fusion set selection with surrogate metric in multi-atlas based image segmentation

    International Nuclear Information System (INIS)

    Zhao, Tingting; Ruan, Dan

    2016-01-01

    Multi-atlas based image segmentation sees unprecedented opportunities but also demanding challenges in the big data era. Relevant atlas selection before label fusion plays a crucial role in reducing potential performance loss from heterogeneous data quality and high computation cost from extensive data. This paper starts with investigating the image similarity metric (termed ‘surrogate’), an alternative to the inaccessible geometric agreement metric (termed ‘oracle’) in atlas relevance assessment, and probes into the problem of how to select the ‘most-relevant’ atlases and how many such atlases to incorporate. We propose an inference model to relate the surrogates and the oracle geometric agreement metrics. Based on this model, we quantify the behavior of the surrogates in mimicking oracle metrics for atlas relevance ordering. Finally, analytical insights on the choice of fusion set size are presented from a probabilistic perspective, with the integrated goal of including the most relevant atlases and excluding the irrelevant ones. Empirical evidence and performance assessment are provided based on prostate and corpus callosum segmentation. (paper)

  19. Overview on the Role of Advance Genomics in Conservation Biology of Endangered Species.

    Science.gov (United States)

    Khan, Suliman; Nabi, Ghulam; Ullah, Muhammad Wajid; Yousaf, Muhammad; Manan, Sehrish; Siddique, Rabeea; Hou, Hongwei

    2016-01-01

    In the recent era, due to tremendous advancement in industrialization, pollution and other anthropogenic activities have created a serious scenario for biota survival. It has been reported that present biota is entering a "sixth" mass extinction, because of chronic exposure to anthropogenic activities. Various ex situ and in situ measures have been adopted for conservation of threatened and endangered plants and animal species; however, these have been limited due to various discrepancies associated with them. Current advancement in molecular technologies, especially, genomics, is playing a very crucial role in biodiversity conservation. Advance genomics helps in identifying the segments of genome responsible for adaptation. It can also improve our understanding about microevolution through a better understanding of selection, mutation, assertive matting, and recombination. Advance genomics helps in identifying genes that are essential for fitness and ultimately for developing modern and fast monitoring tools for endangered biodiversity. This review article focuses on the applications of advanced genomics mainly demographic, adaptive genetic variations, inbreeding, hybridization and introgression, and disease susceptibilities, in the conservation of threatened biota. In short, it provides the fundamentals for novice readers and advancement in genomics for the experts working for the conservation of endangered plant and animal species.

  20. Methodology of Segment Management Reporting on the Profitability of Agricultural Holding Interaction with Customers

    Directory of Open Access Journals (Sweden)

    Aleksandra Vasilyevna Glushchenko

    2015-12-01

    Full Text Available The state program of agricultural development and regulation of agricultural products, raw materials and food in a food embargo on the West European suppliers is aimed at the revitalization of the holding structures. The main purpose of agricultural holdings is to ensure food safety and to maximize the consolidated profit in resource-limited settings. The heterogeneous nature of the needs of customers, leading to different performance of agricultural holding interaction with them has an impact on the formulation and conduct of accounting and requires the formation of an aggregated and relevant information about the profitability of relationships with groups of customers and the long-term development strategy of agroformation interaction with them, so there is a need for research and development methodical bases of formation of the administrative reporting segment that meets the needs of modern practice. The purpose of this study is to develop a method of forming the segment management reporting on the profitability of agricultural holding interaction with customers. As part of the problem research, the authors used different scientific methods, such as analysis, synthesis, observation, group data and logic synthesis. The article discusses the necessity of segmentation agricultural holding customers by the criterion of “cooperation profitability”. The basic problem of generating information about the cost of trading in the accounting information system of agricultural holdings is dealt with; a method of forming the segment management reporting based on the results of the ABC analysis including calculation algorithm functional trade costs (Activity-Based Costing, is developed; rank order of agroholding customers is suggested in accordance with the calculated interval limits for them: Segment A - “highly profitable customers,” B - “problem customers” and C - “low-profit customers”; a set of registers and management accounting