WorldWideScience

Sample records for alphoid centromeric sequences

  1. Homologous subfamilies of human alphoid repetitive DNA on different nucleolus organizing chromosomes

    International Nuclear Information System (INIS)

    Joergensen, A.L.; Bostock, C.J.; Bak, A.L.

    1987-01-01

    The organization of alphoid repeated sequences on human nucleolus-organizing (NOR) chromosomes 13, 21, and 22 has been investigated. Analysis of hybridization of alphoid DNA probes to Southern transfers of restriction enzyme-digested DNA fragments from hybrid cells containing single human chromosomes shows that chromosomes 13 and 21 share one subfamily of alphoid repeats, whereas a different subfamily may be held in common by chromosomes 13 and 22. The sequences of cloned 680-base-pair EcoRI fragments of the alphoid DNA from chromosomes 13 and 21 show that the basic unit of this subfamily is indistinguishable on each chromosome. The sequence of cloned 1020-base-pair Xba I fragments from chromosome 22 is related to, but distinguishable from, that of the 680-base-pair EcoRI alphoid subfamily of chromosomes 13 and 21. These results suggest that, at some point after they originated and were homogenized, different subfamilies of alphoid sequences must have exchanged between chromosomes 13 and 21 and separately between chromosomes 13 and 22

  2. Analysis of DNA restriction fragments greater than 5.7 Mb in size from the centromeric region of human chromosomes.

    Science.gov (United States)

    Arn, P H; Li, X; Smith, C; Hsu, M; Schwartz, D C; Jabs, E W

    1991-01-01

    Pulsed electrophoresis was used to study the organization of the human centromeric region. Genomic DNA was digested with rare-cutting enzymes. DNA fragments from 0.2 to greater than 5.7 Mb were separated by electrophoresis and hybridized with alphoid and simple DNA repeats. Rare-cutting enzymes (Mlu I, Nar I, Not I, Nru I, Sal I, Sfi I, Sst II) demonstrated fewer restriction sites at centromeric regions than elsewhere in the genome. The enzyme Not I had the fewest restriction sites at centromeric regions. As much as 70% of these sequences from the centromeric region are present in Not I DNA fragments greater than 5.7 and estimated to be as large as 10 Mb in size. Other repetitive sequences such as short interspersed repeated segments (SINEs), long interspersed repeated segments (LINEs), ribosomal DNA, and mini-satellite DNA that are not enriched at the centromeric region, are not enriched in Not I fragments of greater than 5.7 Mb in size.

  3. De Novo Centromere Formation and Centromeric Sequence Expansion in Wheat and its Wide Hybrids.

    Directory of Open Access Journals (Sweden)

    Xiang Guo

    2016-04-01

    Full Text Available Centromeres typically contain tandem repeat sequences, but centromere function does not necessarily depend on these sequences. We identified functional centromeres with significant quantitative changes in the centromeric retrotransposons of wheat (CRW contents in wheat aneuploids (Triticum aestivum and the offspring of wheat wide hybrids. The CRW signals were strongly reduced or essentially lost in some wheat ditelosomic lines and in the addition lines from the wide hybrids. The total loss of the CRW sequences but the presence of CENH3 in these lines suggests that the centromeres were formed de novo. In wheat and its wide hybrids, which carry large complex genomes or no sequenced genome, we performed CENH3-ChIP-dot-blot methods alone or in combination with CENH3-ChIP-seq and identified the ectopic genomic sequences present at the new centromeres. In adcdition, the transcription of the identified DNA sequences was remarkably increased at the new centromere, suggesting that the transcription of the corresponding sequences may be associated with de novo centromere formation. Stable alien chromosomes with two and three regions containing CRW sequences induced by centromere breakage were observed in the wheat-Th. elongatum hybrid derivatives, but only one was a functional centromere. In wheat-rye (Secale cereale hybrids, the rye centromere-specific sequences spread along the chromosome arms and may have caused centromere expansion. Frequent and significant quantitative alterations in the centromere sequence via chromosomal rearrangement have been systematically described in wheat wide hybridizations, which may affect the retention or loss of the alien chromosomes in the hybrids. Thus, the centromere behavior in wide crosses likely has an important impact on the generation of biodiversity, which ultimately has implications for speciation.

  4. Utilization of a cloned alphoid repeating sequence of human DNA in the study of polymorphism of chromosomal heterochromatin regions

    International Nuclear Information System (INIS)

    Kruminya, A.R.; Kroshkina, V.G.; Yurov, Yu.B.; Aleksandrov, I.A.; Mitkevich, S.P.; Gindilis, V.M.

    1988-01-01

    The chromosomal distribution of the cloned PHS05 fragment of human alphoid DNA was studied by in situ hybridization in 38 individuals. It was shown that this DNA fraction is primarily localized in the pericentric regions of practically all chromosomes of the set. Significant interchromosomal differences and a weakly expressed interindividual polymorphism were discovered in the copying ability of this class of repeating DNA sequences; associations were not found between the results of hybridization and the pattern of Q-polymorphism

  5. Cloning and comparative mapping of a human chromosome 4-specific alpha satellite DNA sequence

    Energy Technology Data Exchange (ETDEWEB)

    D' Aiuto, L.; Marzella, R.; Archidiacono, N.; Rocchi, M. (Universita di Bari (Italy)); Antonacci, R. (Instituto Anatomia Umana Normale, Modena (Italy))

    1993-11-01

    The authors have isolated and characterized two human alphoid DNA clones: p4n1/4 and pZ4.1. Clone p4n1/4 identifies specifically the centromeric region of chromosome 4; pZ4.1 recognizes a subset of alphoid DNA shared by chromosomes 4 and 9. The specificity was determined using fluorescence in situ hybridization experiments on metaphase spreads and Southern blotting analysis of human-hamster somatic cell hybrids. The genomic organization of both subsets was also investigated. Comparative mapping on chimpanzee and gorilla chromosomes was performed. p4n1/4 hybridizes to chimpanzee chromosomes 11 and 13, homologs of human chromosomes 9 and 2q, respectively. On gorilla metaphase spreads, p4n1/4 hybridizes exclusively to the centromeric region of chromosome 19, partially homologous to human chromosome 17. No hybridization signal was detected on chromosome 3 of both chimpanzee and gorilla, in both species homolog of human chromosome 4. Identical comparative mapping results were obtained using pZ4.1 probe, although the latter recognizes an alphoid subset distinct from the one recognized by p4n1/4. The implications of these results in the evolution of centromeric regions of primate chromosomes are discussed. 33 refs., 4 figs.

  6. High quality maize centromere 10 sequence reveals evidence of frequent recombination events

    Directory of Open Access Journals (Sweden)

    Thomas Kai Wolfgruber

    2016-03-01

    Full Text Available The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR have presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 x 10-6 and 5 x 10-5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb of the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length centromeric retrotransposons from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. This repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to facilitate the repair of frequent DSBs in centromeres.

  7. Dynamic epigenetic states of maize centromeres

    Directory of Open Access Journals (Sweden)

    Yalin eLiu

    2015-10-01

    Full Text Available The centromere is a specialized chromosomal region identified as the major constriction, upon which the kinetochore complex is formed, ensuring accurate chromosome orientation and segregation during cell division. The rapid evolution of centromere DNA sequence and the conserved centromere function are two contradictory aspects of centromere biology. Indeed, the sole presence of genetic sequence is not sufficient for centromere formation. Various dicentric chromosomes with one inactive centromere have been recognized. It has also been found that de novo centromere formation is common on fragments in which centromeric DNA sequences are lost. Epigenetic factors play important roles in centromeric chromatin assembly and maintenance. Nondisjunction of the supernumerary B chromosome early prophase of meiosis I requires an active centromere. This review discusses recent studies in maize about genetic and epigenetic elements regulating formation and maintenance of centromere chromatin, as well as centromere behavior in meiosis.

  8. Recent advances in plant centromere biology.

    Science.gov (United States)

    Feng, Chao; Liu, YaLin; Su, HanDong; Wang, HeFei; Birchler, James; Han, FangPu

    2015-03-01

    The centromere, which is one of the essential parts of a chromosome, controls kinetochore formation and chromosome segregation during mitosis and meiosis. While centromere function is conserved in eukaryotes, the centromeric DNA sequences evolve rapidly and have few similarities among species. The histone H3 variant CENH3 (CENP-A in human), which mostly exists in centromeric nucleosomes, is a universal active centromere mark in eukaryotes and plays an essential role in centromere identity determination. The relationship between centromeric DNA sequences and centromere identity determination is one of the intriguing questions in studying centromere formation. Due to the discoveries in the past decades, including "neocentromeres" and "centromere inactivation", it is now believed that the centromere identity is determined by epigenetic mechanisms. This review will present recent progress in plant centromere biology.

  9. Boom-Bust Turnovers of Megabase-Sized Centromeric DNA in Solanum Species: Rapid Evolution of DNA Sequences Associated with Centromeres

    Czech Academy of Sciences Publication Activity Database

    Zhang, H.Q.; Koblížková, Andrea; Wang, K.; Gong, Z.Y.; Oliveira, L.; Torres, G.A.; Wu, Y.; Zhang, W.; Novák, Petr; Buell, C.R.; Macas, Jiří; Jiang, J.

    2014-01-01

    Roč. 26, č. 4 (2014), s. 1436-1447 ISSN 1040-4651 Institutional support: RVO:60077344 Keywords : Alpha-satellite DNA * repetitive sequences * rice centromeres Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 9.338, year: 2014

  10. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

    Directory of Open Access Journals (Sweden)

    Can Alkan

    2007-09-01

    Full Text Available The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.

  11. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

    Science.gov (United States)

    Alkan, Can; Ventura, Mario; Archidiacono, Nicoletta; Rocchi, Mariano; Sahinalp, S Cenk; Eichler, Evan E

    2007-09-01

    The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.

  12. Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

    Science.gov (United States)

    Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

    2011-01-01

    Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.

  13. Characterization of the genomic organization of the region bordering the centromere of chromosome V of Podospora anserina by direct sequencing.

    Science.gov (United States)

    Silar, Philippe; Barreau, Christian; Debuchy, Robert; Kicka, Sébastien; Turcq, Béatrice; Sainsard-Chanet, Annie; Sellem, Carole H; Billault, Alain; Cattolico, Laurence; Duprat, Simone; Weissenbach, Jean

    2003-08-01

    A Podospora anserina BAC library of 4800 clones has been constructed in the vector pBHYG allowing direct selection in fungi. Screening of the BAC collection for centromeric sequences of chromosome V allowed the recovery of clones localized on either sides of the centromere, but no BAC clone was found to contain the centromere. Seven BAC clones containing 322,195 and 156,244bp from either sides of the centromeric region were sequenced and annotated. One 5S rRNA gene, 5 tRNA genes, and 163 putative coding sequences (CDS) were identified. Among these, only six CDS seem specific to P. anserina. The gene density in the centromeric region is approximately one gene every 2.8kb. Extrapolation of this gene density to the whole genome of P. anserina suggests that the genome contains about 11,000 genes. Synteny analyses between P. anserina and Neurospora crassa show that co-linearity extends at the most to a few genes, suggesting rapid genome rearrangements between these two species.

  14. Structure, Function, and Evolution of Rice Centromeres

    Energy Technology Data Exchange (ETDEWEB)

    Jiang, Jiming

    2010-02-04

    The centromere is the most characteristic landmark of eukaryotic chromosomes. Centromeres function as the site for kinetochore assembly and spindle attachment, allowing for the faithful pairing and segregation of sister chromatids during cell division. Characterization of centromeric DNA is not only essential to understand the structure and organization of plant genomes, but it is also a critical step in the development of plant artificial chromosomes. The centromeres of most model eukaryotic species, consist predominantly of long arrays of satellite DNA. Determining the precise DNA boundary of a centromere has proven to be a difficult task in multicellular eukaryotes. We have successfully cloned and sequenced the centromere of rice chromosome 8 (Cen8), representing the first fully sequenced centromere from any multicellular eukaryotes. The functional core of Cen8 spans ~800 kb of DNA, which was determined by chromatin immunoprecipitation (ChIP) using an antibody against the rice centromere-specific H3 histone. We discovered 16 actively transcribed genes distributed throughout the Cen8 region. In addition to Cen8, we have characterized eight additional rice centromeres using the next generation sequencing technology. We discovered four subfamilies of the CRR retrotransposon that is highly enriched in rice centromeres. CRR elements are constitutively transcribed and different CRR subfamilies are differentially processed by RNAi. These results suggest that different CRR subfamilies may play different roles in the RNAi-mediated pathway for formation and maintenance of centromeric chromatin.

  15. The Past, Present, and Future of Human Centromere Genomics

    Directory of Open Access Journals (Sweden)

    Megan E. Aldrup-MacDonald

    2014-01-01

    Full Text Available The centromere is the chromosomal locus essential for chromosome inheritance and genome stability. Human centromeres are located at repetitive alpha satellite DNA arrays that compose approximately 5% of the genome. Contiguous alpha satellite DNA sequence is absent from the assembled reference genome, limiting current understanding of centromere organization and function. Here, we review the progress in centromere genomics spanning the discovery of the sequence to its molecular characterization and the work done during the Human Genome Project era to elucidate alpha satellite structure and sequence variation. We discuss exciting recent advances in alpha satellite sequence assembly that have provided important insight into the abundance and complex organization of this sequence on human chromosomes. In light of these new findings, we offer perspectives for future studies of human centromere assembly and function.

  16. Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

    Science.gov (United States)

    Cech, Jennifer N; Peichel, Catherine L

    2015-12-01

    Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chomatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.

  17. Centromere Destiny in Dicentric Chromosomes: New Insights from the Evolution of Human Chromosome 2 Ancestral Centromeric Region.

    Science.gov (United States)

    Chiatante, Giorgia; Giannuzzi, Giuliana; Calabrese, Francesco Maria; Eichler, Evan E; Ventura, Mario

    2017-07-01

    Dicentric chromosomes are products of genomic rearrangements that place two centromeres on the same chromosome. Due to the presence of two primary constrictions, they are inherently unstable and overcome their instability by epigenetically inactivating and/or deleting one of the two centromeres, thus resulting in functionally monocentric chromosomes that segregate normally during cell division. Our understanding to date of dicentric chromosome formation, behavior and fate has been largely inferred from observational studies in plants and humans as well as artificially produced de novo dicentrics in yeast and in human cells. We investigate the most recent product of a chromosome fusion event fixed in the human lineage, human chromosome 2, whose stability was acquired by the suppression of one centromere, resulting in a unique difference in chromosome number between humans (46 chromosomes) and our most closely related ape relatives (48 chromosomes). Using molecular cytogenetics, sequencing, and comparative sequence data, we deeply characterize the relicts of the chromosome 2q ancestral centromere and its flanking regions, gaining insight into the ancestral organization that can be easily broadened to all acrocentric chromosome centromeres. Moreover, our analyses offered the opportunity to trace the evolutionary history of rDNA and satellite III sequences among great apes, thus suggesting a new hypothesis for the preferential inactivation of some human centromeres, including IIq. Our results suggest two possible centromere inactivation models to explain the evolutionarily stabilization of human chromosome 2 over the last 5-6 million years. Our results strongly favor centromere excision through a one-step process. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Structure, organization, and sequence of alpha satellite DNA from human chromosome 17: evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome.

    Science.gov (United States)

    Waye, J S; Willard, H F

    1986-09-01

    The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.

  19. Homologous alpha satellite sequences on human acrocentric chromosomes with selectivity for chromosomes 13, 14, and 21: implications for recombination between nonhomologues and Robertsonian translocations

    Energy Technology Data Exchange (ETDEWEB)

    Choo, K H; Vissel, B; Brown, R; Filby, R G; Earle, E

    1988-02-25

    The authors report a new subfamily of alpha satellite DNA (pTRA-2) which is found on all the human acrocentric chromosomes. The alphoid nature of the cloned DNA was established by partial sequencing. Southern analysis of restriction enzyme-digested DNA fragments from mouse/human hybrid cells containing only human chromosome 21 showed that the predominant higher-order repeating unit for pTRA-2 is a 3.9 kb structure. Analysis of a consensus in situ hybridization profile derived from 13 normal individuals revealed the localization of 73% of all centromeric autoradiographic grains over the five acrocentric chromosomes, with the following distribution: 20.4%, 21.5%, 17.1%, 7.3% and 6.5% on chromosomes 13, 14, 21, 15 and 22 respectively. An average of 1.4% of grains was found on the centromere of each of the remaining 19 nonacrocentric chromosomes. These results indicate the presence of a common subfamily of alpha satellite DNA on the five acrocentric chromosomes and suggest an evolutionary process consistent with recombination exchange of sequences between the nonhomologues. The results further suggests that such exchanges are more selective for chromosomes 13, 14 and 21 than for chromosomes 15 and 22. The possible role of centromeric alpha satellite DNA in the aetiology of 13q14q and 14q21q Robertsonian translocation involving the common and nonrandom association of chromosomes 13 and 14, and 14 and 21 is discussed.

  20. Epigenetically-inherited centromere and neocentromere DNA replicates earliest in S-phase.

    Directory of Open Access Journals (Sweden)

    Amnon Koren

    2010-08-01

    Full Text Available Eukaryotic centromeres are maintained at specific chromosomal sites over many generations. In the budding yeast Saccharomyces cerevisiae, centromeres are genetic elements defined by a DNA sequence that is both necessary and sufficient for function; whereas, in most other eukaryotes, centromeres are maintained by poorly characterized epigenetic mechanisms in which DNA has a less definitive role. Here we use the pathogenic yeast Candida albicans as a model organism to study the DNA replication properties of centromeric DNA. By determining the genome-wide replication timing program of the C. albicans genome, we discovered that each centromere is associated with a replication origin that is the first to fire on its respective chromosome. Importantly, epigenetic formation of new ectopic centromeres (neocentromeres was accompanied by shifts in replication timing, such that a neocentromere became the first to replicate and became associated with origin recognition complex (ORC components. Furthermore, changing the level of the centromere-specific histone H3 isoform led to a concomitant change in levels of ORC association with centromere regions, further supporting the idea that centromere proteins determine origin activity. Finally, analysis of centromere-associated DNA revealed a replication-dependent sequence pattern characteristic of constitutively active replication origins. This strand-biased pattern is conserved, together with centromere position, among related strains and species, in a manner independent of primary DNA sequence. Thus, inheritance of centromere position is correlated with a constitutively active origin of replication that fires at a distinct early time. We suggest a model in which the distinct timing of DNA replication serves as an epigenetic mechanism for the inheritance of centromere position.

  1. Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis.

    Science.gov (United States)

    Mason, Annaliese S; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A P; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N

    2016-02-01

    Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. Copyright © 2016 by the Genetics Society of America.

  2. The major horse satellite DNA family is associated with centromere competence.

    Science.gov (United States)

    Cerutti, Federico; Gamba, Riccardo; Mazzagatti, Alice; Piras, Francesca M; Cappelletti, Eleonora; Belloni, Elisa; Nergadze, Solomon G; Raimondi, Elena; Giulotto, Elena

    2016-01-01

    The centromere is the specialized locus required for correct chromosome segregation during cell division. The DNA of most eukaryotic centromeres is composed of extended arrays of tandem repeats (satellite DNA). In the horse, we previously showed that, although the centromere of chromosome 11 is completely devoid of tandem repeat arrays, all other centromeres are characterized by the presence of satellite DNA. We isolated three horse satellite DNA sequences (37cen, 2P1 and EC137) and described their chromosomal localization in four species of the genus Equus. In the work presented here, using the ChIP-seq methodology, we showed that, in the horse, the 37cen satellite binds CENP-A, the centromere-specific histone-H3 variant. The 37cen sequence bound by CENP-A is GC-rich with 221 bp units organized in a head-to-tail fashion. The physical interaction of CENP-A with 37cen was confirmed through slot blot experiments. Immuno-FISH on stretched chromosomes and chromatin fibres demonstrated that the extension of satellite DNA stretches is variable and is not related to the organization of CENP-A binding domains. Finally, we proved that the centromeric satellite 37cen is transcriptionally active. Our data offer new insights into the organization of horse centromeres. Although three different satellite DNA families are cytogenetically located at centromeres, only the 37cen family is associated to the centromeric function. Moreover, similarly to other species, CENP-A binding domains are variable in size. The transcriptional competence of the 37cen satellite that we observed adds new evidence to the hypothesis that centromeric transcripts may be required for centromere function.

  3. A unique genomic sequence in the Wolf-Hirschhorn syndrome [WHS] region of humans is conserved in the great apes.

    Science.gov (United States)

    Tarzami, S T; Kringstein, A M; Conte, R A; Verma, R S

    1996-10-01

    The Wolf-Hirschhorn syndrome (WHS) is caused by a partial deletion in the short arm of chromosome 4 band 16.3 (4p 16.3). A unique-sequence human DNA probe (39 kb) localized within this region has been used to search for sequence homology in the apes' equivalent chromosome 3 by FISH-technique. The WHS loci are conserved in higher primates at the expected position. Nevertheless, a control probe, which detects alphoid sequences of the pericentromeric region of humans, is diverged in chimpanzee, gorilla, and orangutan. The conservation of WHS loci and divergence of DNA alphoid sequences have further added to the controversy concerning human descent.

  4. Uncoupling of satellite DNA and centromeric function in the genus Equus.

    Science.gov (United States)

    Piras, Francesca M; Nergadze, Solomon G; Magnani, Elisa; Bertoni, Livia; Attolini, Carmen; Khoriauli, Lela; Raimondi, Elena; Giulotto, Elena

    2010-02-12

    In a previous study, we showed that centromere repositioning, that is the shift along the chromosome of the centromeric function without DNA sequence rearrangement, has occurred frequently during the evolution of the genus Equus. In this work, the analysis of the chromosomal distribution of satellite tandem repeats in Equus caballus, E. asinus, E. grevyi, and E. burchelli highlighted two atypical features: 1) several centromeres, including the previously described evolutionary new centromeres (ENCs), seem to be devoid of satellite DNA, and 2) satellite repeats are often present at non-centromeric termini, probably corresponding to relics of ancestral now inactive centromeres. Immuno-FISH experiments using satellite DNA and antibodies against the kinetochore protein CENP-A demonstrated that satellite-less primary constrictions are actually endowed with centromeric function. The phylogenetic reconstruction of centromere repositioning events demonstrates that the acquisition of satellite DNA occurs after the formation of the centromere during evolution and that centromeres can function over millions of years and many generations without detectable satellite DNA. The rapidly evolving Equus species gave us the opportunity to identify different intermediate steps along the full maturation of ENCs.

  5. Uncoupling of satellite DNA and centromeric function in the genus Equus.

    Directory of Open Access Journals (Sweden)

    Francesca M Piras

    2010-02-01

    Full Text Available In a previous study, we showed that centromere repositioning, that is the shift along the chromosome of the centromeric function without DNA sequence rearrangement, has occurred frequently during the evolution of the genus Equus. In this work, the analysis of the chromosomal distribution of satellite tandem repeats in Equus caballus, E. asinus, E. grevyi, and E. burchelli highlighted two atypical features: 1 several centromeres, including the previously described evolutionary new centromeres (ENCs, seem to be devoid of satellite DNA, and 2 satellite repeats are often present at non-centromeric termini, probably corresponding to relics of ancestral now inactive centromeres. Immuno-FISH experiments using satellite DNA and antibodies against the kinetochore protein CENP-A demonstrated that satellite-less primary constrictions are actually endowed with centromeric function. The phylogenetic reconstruction of centromere repositioning events demonstrates that the acquisition of satellite DNA occurs after the formation of the centromere during evolution and that centromeres can function over millions of years and many generations without detectable satellite DNA. The rapidly evolving Equus species gave us the opportunity to identify different intermediate steps along the full maturation of ENCs.

  6. The rapidly evolving centromere-specific histone has stringent functional requirements in Arabidopsis thaliana.

    Science.gov (United States)

    Ravi, Maruthachalam; Kwong, Pak N; Menorca, Ron M G; Valencia, Joel T; Ramahi, Joseph S; Stewart, Jodi L; Tran, Robert K; Sundaresan, Venkatesan; Comai, Luca; Chan, Simon W-L

    2010-10-01

    Centromeres control chromosome inheritance in eukaryotes, yet their DNA structure and primary sequence are hypervariable. Most animals and plants have megabases of tandem repeats at their centromeres, unlike yeast with unique centromere sequences. Centromere function requires the centromere-specific histone CENH3 (CENP-A in human), which replaces histone H3 in centromeric nucleosomes. CENH3 evolves rapidly, particularly in its N-terminal tail domain. A portion of the CENH3 histone-fold domain, the CENP-A targeting domain (CATD), has been previously shown to confer kinetochore localization and centromere function when swapped into human H3. Furthermore, CENP-A in human cells can be functionally replaced by CENH3 from distantly related organisms including Saccharomyces cerevisiae. We have used cenh3-1 (a null mutant in Arabidopsis thaliana) to replace endogenous CENH3 with GFP-tagged variants. A H3.3 tail domain-CENH3 histone-fold domain chimera rescued viability of cenh3-1, but CENH3's lacking a tail domain were nonfunctional. In contrast to human results, H3 containing the A. thaliana CATD cannot complement cenh3-1. GFP-CENH3 from the sister species A. arenosa functionally replaces A. thaliana CENH3. GFP-CENH3 from the close relative Brassica rapa was targeted to centromeres, but did not complement cenh3-1, indicating that kinetochore localization and centromere function can be uncoupled. We conclude that CENH3 function in A. thaliana, an organism with large tandem repeat centromeres, has stringent requirements for functional complementation in mitosis.

  7. Repeatless and Repeat-Based Centromeres in Potato: Implications for Centromere Evolution[C][W

    Czech Academy of Sciences Publication Activity Database

    Gong, Z.; Wu, Y.; Koblížková, Andrea; Torres, G.A.; Wang, K.; Iovene, M.; Neumann, Pavel; Zhang, W.; Novák, Petr; Buell, C.R.; Macas, Jiří; Jiang, J.

    2012-01-01

    Roč. 24, č. 9 (2012), s. 3559-3574 ISSN 1040-4651 R&D Projects: GA MŠk(CZ) LH11058 Institutional research plan: CEZ:AV0Z50510513 Institutional support: RVO:60077344 Keywords : repetitive sequences * plant satellite repeats * Arabidopsis thaliana * rice centromere * wild potatoes Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 9.251, year: 2012

  8. Analysis of ParB-centromere interactions by multiplex SPR imaging reveals specific patterns for binding ParB in six centromeres of Burkholderiales chromosomes and plasmids.

    Science.gov (United States)

    Pillet, Flavien; Passot, Fanny Marie; Pasta, Franck; Anton Leberre, Véronique; Bouet, Jean-Yves

    2017-01-01

    Bacterial centromeres-also called parS, are cis-acting DNA sequences which, together with the proteins ParA and ParB, are involved in the segregation of chromosomes and plasmids. The specific binding of ParB to parS nucleates the assembly of a large ParB/DNA complex from which ParA-the motor protein, segregates the sister replicons. Closely related families of partition systems, called Bsr, were identified on the chromosomes and large plasmids of the multi-chromosomal bacterium Burkholderia cenocepacia and other species from the order Burkholeriales. The centromeres of the Bsr partition families are 16 bp palindromes, displaying similar base compositions, notably a central CG dinucleotide. Despite centromeres bind the cognate ParB with a narrow specificity, weak ParB-parS non cognate interactions were nevertheless detected between few Bsr partition systems of replicons not belonging to the same genome. These observations suggested that Bsr partition systems could have a common ancestry but that evolution mostly erased the possibilities of cross-reactions between them, in particular to prevent replicon incompatibility. To detect novel similarities between Bsr partition systems, we have analyzed the binding of six Bsr parS sequences and a wide collection of modified derivatives, to their cognate ParB. The study was carried out by Surface Plasmon Resonance imaging (SPRi) mulitplex analysis enabling a systematic survey of each nucleotide position within the centromere. We found that in each parS some positions could be changed while maintaining binding to ParB. Each centromere displays its own pattern of changes, but some positions are shared more or less widely. In addition from these changes we could speculate evolutionary links between these centromeres.

  9. Absence of positive selection on CenH3 in Luzula suggests that holokinetic chromosomes may suppress centromere drive.

    Science.gov (United States)

    Zedek, František; Bureš, Petr

    2016-12-01

    The centromere drive theory explains diversity of eukaryotic centromeres as a consequence of the recurrent conflict between centromeric repeats and centromeric histone H3 (CenH3), in which selfish centromeres exploit meiotic asymmetry and CenH3 evolves adaptively to counterbalance deleterious consequences of driving centromeres. Accordingly, adaptively evolving CenH3 has so far been observed only in eukaryotes with asymmetric meiosis. However, if such evolution is a consequence of centromere drive, it should depend not only on meiotic asymmetry but also on monocentric or holokinetic chromosomal structure. Selective pressures acting on CenH3 have never been investigated in organisms with holokinetic meiosis despite the fact that holokinetic chromosomes have been hypothesized to suppress centromere drive. Therefore, the present study evaluates selective pressures acting on the CenH3 gene in holokinetic organisms for the first time, specifically in the representatives of the plant genus Luzula (Juncaceae), in which the kinetochore formation is not co-localized with any type of centromeric repeat. PCR, cloning and sequencing, and database searches were used to obtain coding CenH3 sequences from Luzula species. Codon substitution models were employed to infer selective regimes acting on CenH3 in Luzula KEY RESULTS: In addition to the two previously published CenH3 sequences from L. nivea, 16 new CenH3 sequences have been isolated from 12 Luzula species. Two CenH3 isoforms in Luzula that originated by a duplication event prior to the divergence of analysed species were found. No signs of positive selection acting on CenH3 in Luzula were detected. Instead, evidence was found that selection on CenH3 of Luzula might have been relaxed. The results indicate that holokinetism itself may suppress centromere drive and, therefore, holokinetic chromosomes might have evolved as a defence against centromere drive. © The Author 2016. Published by Oxford University Press on behalf of

  10. Analysis of ParB-centromere interactions by multiplex SPR imaging reveals specific patterns for binding ParB in six centromeres of Burkholderiales chromosomes and plasmids.

    Directory of Open Access Journals (Sweden)

    Flavien Pillet

    Full Text Available Bacterial centromeres-also called parS, are cis-acting DNA sequences which, together with the proteins ParA and ParB, are involved in the segregation of chromosomes and plasmids. The specific binding of ParB to parS nucleates the assembly of a large ParB/DNA complex from which ParA-the motor protein, segregates the sister replicons. Closely related families of partition systems, called Bsr, were identified on the chromosomes and large plasmids of the multi-chromosomal bacterium Burkholderia cenocepacia and other species from the order Burkholeriales. The centromeres of the Bsr partition families are 16 bp palindromes, displaying similar base compositions, notably a central CG dinucleotide. Despite centromeres bind the cognate ParB with a narrow specificity, weak ParB-parS non cognate interactions were nevertheless detected between few Bsr partition systems of replicons not belonging to the same genome. These observations suggested that Bsr partition systems could have a common ancestry but that evolution mostly erased the possibilities of cross-reactions between them, in particular to prevent replicon incompatibility. To detect novel similarities between Bsr partition systems, we have analyzed the binding of six Bsr parS sequences and a wide collection of modified derivatives, to their cognate ParB. The study was carried out by Surface Plasmon Resonance imaging (SPRi mulitplex analysis enabling a systematic survey of each nucleotide position within the centromere. We found that in each parS some positions could be changed while maintaining binding to ParB. Each centromere displays its own pattern of changes, but some positions are shared more or less widely. In addition from these changes we could speculate evolutionary links between these centromeres.

  11. Most Uv-Induced Reciprocal Translocations in SORDARIA MACROSPORA Occur in or near Centromere Regions.

    Science.gov (United States)

    Leblon, G; Zickler, D; Lebilcot, S

    1986-02-01

    In fungi, translocations can be identified and classified by the patterns of ascospore abortion in asci from crosses of rearrangement x normal sequence. Previous studies of UV-induced rearrangements in Sordaria macrospora revealed that a major class (called type III) appeared to be reciprocal translocations that were anomalous in producing an unexpected class of asci with four aborted ascospores in bbbbaaaa linear sequence (b = black; a = abortive). The present study shows that the anomalous type III rearrangements are, in fact, reciprocal translocations having both breakpoints within or adjacent to centromeres and that bbbbaaaa asci result from 3:1 disjunction from the translocation quadrivalent.-Electron microscopic observations of synaptonemal complexes enable centromeres to be visualized. Lengths of synaptonemal complexes lateral elements in translocation quadrivalents accurately reflect chromosome arm lengths, enabling breakpoints to be located reliably in centromere regions. All genetic data are consistent with the behavior expected of translocations with breakpoints at centromeres.-Two-thirds of the UV-induced reciprocal translocations are of this type. Certain centromere regions are involved preferentially. Among 73 type-III translocations, there were but 13 of the 21 possible chromosome combinations and 20 of the 42 possible combinations of chromosome arms.

  12. Human Artificial Chromosomes with Alpha Satellite-Based De Novo Centromeres Show Increased Frequency of Nondisjunction and Anaphase Lag

    OpenAIRE

    Rudd, M. Katharine; Mays, Robert W.; Schwartz, Stuart; Willard, Huntington F.

    2003-01-01

    Human artificial chromosomes have been used to model requirements for human chromosome segregation and to explore the nature of sequences competent for centromere function. Normal human centromeres require specialized chromatin that consists of alpha satellite DNA complexed with epigenetically modified histones and centromere-specific proteins. While several types of alpha satellite DNA have been used to assemble de novo centromeres in artificial chromosome assays, the extent to which they fu...

  13. SWI/SNF-like chromatin remodeling factor Fun30 supports point centromere function in S. cerevisiae.

    Directory of Open Access Journals (Sweden)

    Mickaël Durand-Dubief

    2012-09-01

    Full Text Available Budding yeast centromeres are sequence-defined point centromeres and are, unlike in many other organisms, not embedded in heterochromatin. Here we show that Fun30, a poorly understood SWI/SNF-like chromatin remodeling factor conserved in humans, promotes point centromere function through the formation of correct chromatin architecture at centromeres. Our determination of the genome-wide binding and nucleosome positioning properties of Fun30 shows that this enzyme is consistently enriched over centromeres and that a majority of CENs show Fun30-dependent changes in flanking nucleosome position and/or CEN core micrococcal nuclease accessibility. Fun30 deletion leads to defects in histone variant Htz1 occupancy genome-wide, including at and around most centromeres. FUN30 genetically interacts with CSE4, coding for the centromere-specific variant of histone H3, and counteracts the detrimental effect of transcription through centromeres on chromosome segregation and suppresses transcriptional noise over centromere CEN3. Previous work has shown a requirement for fission yeast and mammalian homologs of Fun30 in heterochromatin assembly. As centromeres in budding yeast are not embedded in heterochromatin, our findings indicate a direct role of Fun30 in centromere chromatin by promoting correct chromatin architecture.

  14. Replicating centromeric chromatin: Spatial and temporal control of CENP-A assembly

    International Nuclear Information System (INIS)

    Nechemia-Arbely, Yael; Fachinetti, Daniele; Cleveland, Don W.

    2012-01-01

    The centromere is the fundamental unit for insuring chromosome inheritance. This complex region has a distinct type of chromatin in which histone H3 is replaced by a structurally different homologue identified in humans as CENP-A. In metazoans, specific DNA sequences are neither required nor sufficient for centromere identity. Rather, an epigenetic mark comprised of CENP-A containing chromatin is thought to be the major determinant of centromere identity. In this view, CENP-A deposition and chromatin assembly are fundamental processes for the maintenance of centromeric identity across mitotic and meiotic divisions. Several lines of evidence support CENP-A deposition in metazoans occurring at only one time in the cell cycle. Such cell cycle-dependent loading of CENP-A is found in divergent species from human to fission yeast, albeit with differences in the cell cycle point at which CENP-A is assembled. Cell cycle dependent CENP-A deposition requires multiple assembly factors for its deposition and maintenance. This review discusses the regulation of new CENP-A deposition and its relevance to centromere identity and inheritance.

  15. Rad51-Rad52 mediated maintenance of centromeric chromatin in Candida albicans.

    Directory of Open Access Journals (Sweden)

    Sreyoshi Mitra

    2014-04-01

    Full Text Available Specification of the centromere location in most eukaryotes is not solely dependent on the DNA sequence. However, the non-genetic determinants of centromere identity are not clearly defined. While multiple mechanisms, individually or in concert, may specify centromeres epigenetically, most studies in this area are focused on a universal factor, a centromere-specific histone H3 variant CENP-A, often considered as the epigenetic determinant of centromere identity. In spite of variable timing of its loading at centromeres across species, a replication coupled early S phase deposition of CENP-A is found in most yeast centromeres. Centromeres are the earliest replicating chromosomal regions in a pathogenic budding yeast Candida albicans. Using a 2-dimensional agarose gel electrophoresis assay, we identify replication origins (ORI7-LI and ORI7-RI proximal to an early replicating centromere (CEN7 in C. albicans. We show that the replication forks stall at CEN7 in a kinetochore dependent manner and fork stalling is reduced in the absence of the homologous recombination (HR proteins Rad51 and Rad52. Deletion of ORI7-RI causes a significant reduction in the stalled fork signal and an increased loss rate of the altered chromosome 7. The HR proteins, Rad51 and Rad52, have been shown to play a role in fork restart. Confocal microscopy shows declustered kinetochores in rad51 and rad52 mutants, which are evidence of kinetochore disintegrity. CENP-ACaCse4 levels at centromeres, as determined by chromatin immunoprecipitation (ChIP experiments, are reduced in absence of Rad51/Rad52 resulting in disruption of the kinetochore structure. Moreover, western blot analysis reveals that delocalized CENP-A molecules in HR mutants degrade in a similar fashion as in other kinetochore mutants described before. Finally, co-immunoprecipitation assays indicate that Rad51 and Rad52 physically interact with CENP-ACaCse4 in vivo. Thus, the HR proteins Rad51 and Rad52

  16. Molecular structures of centromeric heterochromatin and karyotypic evolution in the Siamese crocodile (Crocodylus siamensis) (Crocodylidae, Crocodylia).

    Science.gov (United States)

    Kawagoshi, Taiki; Nishida, Chizuko; Ota, Hidetoshi; Kumazawa, Yoshinori; Endo, Hideki; Matsuda, Yoichi

    2008-01-01

    Crocodilians have several unique karyotypic features, such as small diploid chromosome numbers (30-42) and the absence of dot-shaped microchromosomes. Of the extant crocodilian species, the Siamese crocodile (Crocodylus siamensis) has no more than 2n = 30, comprising mostly bi-armed chromosomes with large centromeric heterochromatin blocks. To investigate the molecular structures of C-heterochromatin and genomic compartmentalization in the karyotype, characterized by the disappearance of tiny microchromosomes and reduced chromosome number, we performed molecular cloning of centromeric repetitive sequences and chromosome mapping of the 18S-28S rDNA and telomeric (TTAGGG)( n ) sequences. The centromeric heterochromatin was composed mainly of two repetitive sequence families whose characteristics were quite different. Two types of GC-rich CSI-HindIII family sequences, the 305 bp CSI-HindIII-S (G+C content, 61.3%) and 424 bp CSI-HindIII-M (63.1%), were localized to the intensely PI-stained centric regions of all chromosomes, except for chromosome 2 with PI-negative heterochromatin. The 94 bp CSI-DraI (G+C content, 48.9%) was tandem-arrayed satellite DNA and localized to chromosome 2 and four pairs of small-sized chromosomes. The chromosomal size-dependent genomic compartmentalization that is supposedly unique to the Archosauromorpha was probably lost in the crocodilian lineage with the disappearance of microchromosomes followed by the homogenization of centromeric repetitive sequences between chromosomes, except for chromosome 2.

  17. Plasmodium falciparum centromeres display a unique epigenetic makeup and cluster prior to and during schizogony.

    Science.gov (United States)

    Hoeijmakers, Wieteke A M; Flueck, Christian; Françoijs, Kees-Jan; Smits, Arne H; Wetzel, Johanna; Volz, Jennifer C; Cowman, Alan F; Voss, Till; Stunnenberg, Hendrik G; Bártfai, Richárd

    2012-09-01

    Centromeres are essential for the faithful transmission of chromosomes to the next generation, therefore being essential in all eukaryotic organisms. The centromeres of Plasmodium falciparum, the causative agent of the most severe form of malaria, have been broadly mapped on most chromosomes, but their epigenetic composition remained undefined. Here, we reveal that the centromeric histone variant PfCENH3 occupies a 4-4.5 kb region on each P. falciparum chromosome, which is devoid of pericentric heterochromatin but harbours another histone variant, PfH2A.Z. These CENH3 covered regions pinpoint the exact position of the centromere on all chromosomes and revealed that all centromeric regions have similar size and sequence composition. Immunofluorescence assay of PfCENH3 strongly suggests that P. falciparum centromeres cluster to a single nuclear location prior to and during mitosis and cytokinesis but dissociate soon after invasion. In summary, we reveal a dynamic association of Plasmodium centromeres, which bear a unique epigenetic signature and conform to a strict structure. These findings suggest that DNA-associated and epigenetic elements play an important role in centromere establishment in this important human pathogen. © 2012 Blackwell Publishing Ltd.

  18. Cis-Acting Determinants Affecting Centromere Function, Sister-Chromatid Cohesion and Reciprocal Recombination during Meiosis in Saccharomyces Cerevisiae

    OpenAIRE

    Sears, D. D.; Hegemann, J. H.; Shero, J. H.; Hieter, P.

    1995-01-01

    We have employed a system that utilizes homologous pairs of human DNA-derived yeast artificial chromosomes (YACs) as marker chromosomes to assess the specific role (s) of conserved centromere DNA elements (CDEI, CDEII and CDEIII) in meiotic chromosome disjunction fidelity. Thirteen different centromere (CEN) mutations were tested for their effects on meiotic centromere function. YACs containing a wild-type CEN DNA sequence segregate with high fidelity in meiosis I (99% normal segregation) and...

  19. Centromere and telomere sequence alterations reflect the rapid genome evolution within the carnivorous plant genus Genlisea

    Czech Academy of Sciences Publication Activity Database

    Tran, T.D.; Cao, H.X.; Jovtchev, G.; Neumann, Pavel; Novák, Petr; Fojtová, M.; Vu, G.T.H.; Macas, Jiří; Fajkus, Jiří; Schubert, I.; Fuchs, J.

    2015-01-01

    Roč. 84, č. 6 (2015), s. 1087-1099 ISSN 0960-7412 R&D Projects: GA ČR GBP501/12/G090; GA ČR GA13-06943S Institutional support: RVO:60077344 ; RVO:68081707 Keywords : Centromeric tandem repeat * centromeric retrotransposons * Genlisea nigrocaulis, hispidula Subject RIV: EB - Genetics ; Molecular Biology; BO - Biophysics (BFU-R) Impact factor: 5.468, year: 2015

  20. Centromeric heterochromatin: the primordial segregation machine.

    Science.gov (United States)

    Bloom, Kerry S

    2014-01-01

    Centromeres are specialized domains of heterochromatin that provide the foundation for the kinetochore. Centromeric heterochromatin is characterized by specific histone modifications, a centromere-specific histone H3 variant (CENP-A), and the enrichment of cohesin, condensin, and topoisomerase II. Centromere DNA varies orders of magnitude in size from 125 bp (budding yeast) to several megabases (human). In metaphase, sister kinetochores on the surface of replicated chromosomes face away from each other, where they establish microtubule attachment and bi-orientation. Despite the disparity in centromere size, the distance between separated sister kinetochores is remarkably conserved (approximately 1 μm) throughout phylogeny. The centromere functions as a molecular spring that resists microtubule-based extensional forces in mitosis. This review explores the physical properties of DNA in order to understand how the molecular spring is built and how it contributes to the fidelity of chromosome segregation.

  1. Neocentromeres Provide Chromosome Segregation Accuracy and Centromere Clustering to Multiple Loci along a Candida albicans Chromosome.

    Directory of Open Access Journals (Sweden)

    Laura S Burrack

    2016-09-01

    Full Text Available Assembly of kinetochore complexes, involving greater than one hundred proteins, is essential for chromosome segregation and genome stability. Neocentromeres, or new centromeres, occur when kinetochores assemble de novo, at DNA loci not previously associated with kinetochore proteins, and they restore chromosome segregation to chromosomes lacking a functional centromere. Neocentromeres have been observed in a number of diseases and may play an evolutionary role in adaptation or speciation. However, the consequences of neocentromere formation on chromosome missegregation rates, gene expression, and three-dimensional (3D nuclear structure are not well understood. Here, we used Candida albicans, an organism with small, epigenetically-inherited centromeres, as a model system to study the functions of twenty different neocentromere loci along a single chromosome, chromosome 5. Comparison of neocentromere properties relative to native centromere functions revealed that all twenty neocentromeres mediated chromosome segregation, albeit to different degrees. Some neocentromeres also caused reduced levels of transcription from genes found within the neocentromere region. Furthermore, like native centromeres, neocentromeres clustered in 3D with active/functional centromeres, indicating that formation of a new centromere mediates the reorganization of 3D nuclear architecture. This demonstrates that centromere clustering depends on epigenetically defined function and not on the primary DNA sequence, and that neocentromere function is independent of its distance from the native centromere position. Together, the results show that a neocentromere can form at many loci along a chromosome and can support the assembly of a functional kinetochore that exhibits native centromere functions including chromosome segregation accuracy and centromere clustering within the nucleus.

  2. DNA topoisomerase III localizes to centromeres and affects centromeric CENP-A levels in fission yeast.

    Directory of Open Access Journals (Sweden)

    Ulrika Norman-Axelsson

    Full Text Available Centromeres are specialized chromatin regions marked by the presence of nucleosomes containing the centromere-specific histone H3 variant CENP-A, which is essential for chromosome segregation. Assembly and disassembly of nucleosomes is intimately linked to DNA topology, and DNA topoisomerases have previously been implicated in the dynamics of canonical H3 nucleosomes. Here we show that Schizosaccharomyces pombe Top3 and its partner Rqh1 are involved in controlling the levels of CENP-A(Cnp1 at centromeres. Both top3 and rqh1 mutants display defects in chromosome segregation. Using chromatin immunoprecipitation and tiling microarrays, we show that Top3, unlike Top1 and Top2, is highly enriched at centromeric central domains, demonstrating that Top3 is the major topoisomerase in this region. Moreover, centromeric Top3 occupancy positively correlates with CENP-A(Cnp1 occupancy. Intriguingly, both top3 and rqh1 mutants display increased relative enrichment of CENP-A(Cnp1 at centromeric central domains. Thus, Top3 and Rqh1 normally limit the levels of CENP-A(Cnp1 in this region. This new role is independent of the established function of Top3 and Rqh1 in homologous recombination downstream of Rad51. Therefore, we hypothesize that the Top3-Rqh1 complex has an important role in controlling centromere DNA topology, which in turn affects the dynamics of CENP-A(Cnp1 nucleosomes.

  3. Genetic mapping of centromeres in the nine Citrus clementina chromosomes using half-tetrad analysis and recombination patterns in unreduced and haploid gametes.

    Science.gov (United States)

    Aleza, Pablo; Cuenca, José; Hernández, María; Juárez, José; Navarro, Luis; Ollitrault, Patrick

    2015-03-08

    Mapping centromere locations in plant species provides essential information for the analysis of genetic structures and population dynamics. The centromere's position affects the distribution of crossovers along a chromosome and the parental heterozygosity restitution by 2n gametes is a direct function of the genetic distance to the centromere. Sexual polyploidisation is relatively frequent in Citrus species and is widely used to develop new seedless triploid cultivars. The study's objectives were to (i) map the positions of the centromeres of the nine Citrus clementina chromosomes; (ii) analyse the crossover interference in unreduced gametes; and (iii) establish the pattern of genetic recombination in haploid clementine gametes along each chromosome and its relationship with the centromere location and distribution of genic sequences. Triploid progenies were derived from unreduced megagametophytes produced by second-division restitution. Centromere positions were mapped genetically for all linkage groups using half-tetrad analysis. Inference of the physical locations of centromeres revealed one acrocentric, four metacentric and four submetacentric chromosomes. Crossover interference was observed in unreduced gametes, with variation seen between chromosome arms. For haploid gametes, a strong decrease in the recombination rate occurred in centromeric and pericentromeric regions, which contained a low density of genic sequences. In chromosomes VIII and IX, these low recombination rates extended beyond the pericentromeric regions. The genomic region corresponding to a genetic distance recombination pattern along each chromosome. However, regions with low recombination rates extended beyond the pericentromeric regions of some chromosomes into areas richer in genic sequences. The persistence of strong linkage disequilibrium between large numbers of genes promotes the stability of epistatic interactions and multilocus-controlled traits over successive generations but

  4. Identification and Preliminary Analysis of Several Centromere-associated Bacterial Artificial Chromosome Clones from a Diploid Wheat Library

    Institute of Scientific and Technical Information of China (English)

    2006-01-01

    Although the centromeres of some plants have been investigated previously, our knowledge of the wheat centromere is still very limited. To understand the structure and function of the wheat centromere, we used two centromeric repeats (RCS1 and CCS1-5ab) to obtain some centromere-associated bacterial artificial chromosome (BAC) clones in 32 RCS1-related BAC clones that had been screened out from a diploid wheat (Triticum boeoticum Boiss.; 2n=2x=14) BAC library. Southern hybridization results indicated that, of the 32 candidates,there were 28 RCS1-positive clones. Based on gel blot patterns, the frequency of RCS1 was approximately one copy every 69.4 kb in these 28 RCS1-positive BAC clones. More bands were detected when the same filter was probed with CCS1-5ab. Furthermore, the CCS1 bands covered all the bands detected by RCS1, which suggests that some CCS1 repeats were distributed together with RCS1. The frequency of CCS1 families was once every 35.8 kb, nearly twice that of RCS1. Fluorescence in situ hybridization (FISH) analysis indicated that the five BAC clones containing RCS1 and CCS1 sequences all detected signals at the centromeric regions in hexaploid wheat, but the signal intensities on the A-genome chromosomes were stronger than those on the B- and/or D-genome chromosomes. The FISH analysis among nine Triticeae cereals indicated that there were A-genomespecific (or rich) sequences dispersing on chromosome arms in the BAC clone TbBAC5. In addition, at the interphase cells, the centromeres of diploid species usually clustered at one pole and formed a ring-like allocation in the period before metaphase.

  5. Characterization of centromeric histone H3 (CENH3 variants in cultivated and wild carrots (Daucus sp..

    Directory of Open Access Journals (Sweden)

    Frank Dunemann

    Full Text Available In eukaryotes, centromeres are the assembly sites for the kinetochore, a multi-protein complex to which spindle microtubules are attached at mitosis and meiosis, thereby ensuring segregation of chromosomes during cell division. They are specified by incorporation of CENH3, a centromere specific histone H3 variant which replaces canonical histone H3 in the nucleosomes of functional centromeres. To lay a first foundation of a putative alternative haploidization strategy based on centromere-mediated genome elimination in cultivated carrots, in the presented research we aimed at the identification and cloning of functional CENH3 genes in Daucus carota and three distantly related wild species of genus Daucus varying in basic chromosome numbers. Based on mining the carrot transcriptome followed by a subsequent PCR-based cloning, homologous coding sequences for CENH3s of the four Daucus species were identified. The ORFs of the CENH3 variants were very similar, and an amino acid sequence length of 146 aa was found in three out of the four species. Comparison of Daucus CENH3 amino acid sequences with those of other plant CENH3s as well as their phylogenetic arrangement among other dicot CENH3s suggest that the identified genes are authentic CENH3 homologs. To verify the location of the CENH3 protein in the kinetochore regions of the Daucus chromosomes, a polyclonal antibody based on a peptide corresponding to the N-terminus of DcCENH3 was developed and used for anti-CENH3 immunostaining of mitotic root cells. The chromosomal location of CENH3 proteins in the centromere regions of the chromosomes could be confirmed. For genetic localization of the CENH3 gene in the carrot genome, a previously constructed linkage map for carrot was used for mapping a CENH3-specific simple sequence repeat (SSR marker, and the CENH3 locus was mapped on the carrot chromosome 9.

  6. Human artificial chromosomes with alpha satellite-based de novo centromeres show increased frequency of nondisjunction and anaphase lag.

    Science.gov (United States)

    Rudd, M Katharine; Mays, Robert W; Schwartz, Stuart; Willard, Huntington F

    2003-11-01

    Human artificial chromosomes have been used to model requirements for human chromosome segregation and to explore the nature of sequences competent for centromere function. Normal human centromeres require specialized chromatin that consists of alpha satellite DNA complexed with epigenetically modified histones and centromere-specific proteins. While several types of alpha satellite DNA have been used to assemble de novo centromeres in artificial chromosome assays, the extent to which they fully recapitulate normal centromere function has not been explored. Here, we have used two kinds of alpha satellite DNA, DXZ1 (from the X chromosome) and D17Z1 (from chromosome 17), to generate human artificial chromosomes. Although artificial chromosomes are mitotically stable over many months in culture, when we examined their segregation in individual cell divisions using an anaphase assay, artificial chromosomes exhibited more segregation errors than natural human chromosomes (P artificial chromosomes missegregate over a fivefold range, the data suggest that variable centromeric DNA content and/or epigenetic assembly can influence the mitotic behavior of artificial chromosomes.

  7. Premitotic assembly of human CENPs -T and -W switches centromeric chromatin to a mitotic state.

    Directory of Open Access Journals (Sweden)

    Lisa Prendergast

    2011-06-01

    Full Text Available Centromeres are differentiated chromatin domains, present once per chromosome, that direct segregation of the genome in mitosis and meiosis by specifying assembly of the kinetochore. They are distinct genetic loci in that their identity in most organisms is determined not by the DNA sequences they are associated with, but through specific chromatin composition and context. The core nucleosomal protein CENP-A/cenH3 plays a primary role in centromere determination in all species and directs assembly of a large complex of associated proteins in vertebrates. While CENP-A itself is stably transmitted from one generation to the next, the nature of the template for centromere replication and its relationship to kinetochore function are as yet poorly understood. Here, we investigate the assembly and inheritance of a histone fold complex of the centromere, the CENP-T/W complex, which is integrated with centromeric chromatin in association with canonical histone H3 nucleosomes. We have investigated the cell cycle regulation, timing of assembly, generational persistence, and requirement for function of CENPs -T and -W in the cell cycle in human cells. The CENP-T/W complex assembles through a dynamic exchange mechanism in late S-phase and G2, is required for mitosis in each cell cycle and does not persist across cell generations, properties reciprocal to those measured for CENP-A. We propose that the CENP-A and H3-CENP-T/W nucleosome components of the centromere are specialized for centromeric and kinetochore activities, respectively. Segregation of the assembly mechanisms for the two allows the cell to switch between chromatin configurations that reciprocally support the replication of the centromere and its conversion to a mitotic state on postreplicative chromatin.

  8. Somatic association of telocentric chromosomes carrying homologous centromeres in common wheat.

    Science.gov (United States)

    Mello-Sampayo, T

    1973-01-01

    telocentrics, or, as a possible alternative, common repeated sequences of DNA molecules around the centromere region.

  9. RNA Pol II promotes transcription of centromeric satellite DNA in beetles.

    Directory of Open Access Journals (Sweden)

    Zeljka Pezer

    Full Text Available Transcripts of centromeric satellite DNAs are known to play a role in heterochromatin formation as well as in establishment of the kinetochore. However, little is known about basic mechanisms of satellite DNA expression within constitutive heterochromatin and its regulation. Here we present comprehensive analysis of transcription of abundant centromeric satellite DNA, PRAT from beetle Palorus ratzeburgii (Coleoptera. This satellite is characterized by preservation and extreme sequence conservation among evolutionarily distant insect species. PRAT is expressed in all three developmental stages: larvae, pupae and adults at similar level. Transcripts are abundant comprising 0.033% of total RNA and are heterogeneous in size ranging from 0.5 kb up to more than 5 kb. Transcription proceeds from both strands but with 10 fold different expression intensity and transcripts are not processed into siRNAs. Most of the transcripts (80% are not polyadenylated and remain in the nucleus while a small portion is exported to the cytoplasm. Multiple, irregularly distributed transcription initiation sites as well as termination sites have been mapped within the PRAT sequence using primer extension and RLM-RACE. The presence of cap structure as well as poly(A tails in a portion of the transcripts indicate RNA polymerase II-dependent transcription and a putative polymerase II promoter site overlaps the most conserved part of the PRAT sequence. The treatment of larvae with alpha-amanitin decreases the level of PRAT transcripts at concentrations that selectively inhibit pol II activity. In conclusion, stable, RNA polymerase II dependant transcripts of abundant centromeric satellite DNA, not regulated by RNAi, have been identified and characterized. This study offers a basic understanding of expression of highly abundant heterochromatic DNA which in beetle species constitutes up to 50% of the genome.

  10. Genome-wide analysis reveals a cell cycle-dependent mechanism controlling centromere propagation.

    Science.gov (United States)

    Erhardt, Sylvia; Mellone, Barbara G; Betts, Craig M; Zhang, Weiguo; Karpen, Gary H; Straight, Aaron F

    2008-12-01

    Centromeres are the structural and functional foundation for kinetochore formation, spindle attachment, and chromosome segregation. In this study, we isolated factors required for centromere propagation using genome-wide RNA interference screening for defects in centromere protein A (CENP-A; centromere identifier [CID]) localization in Drosophila melanogaster. We identified the proteins CAL1 and CENP-C as essential factors for CID assembly at the centromere. CID, CAL1, and CENP-C coimmunoprecipitate and are mutually dependent for centromere localization and function. We also identified the mitotic cyclin A (CYCA) and the anaphase-promoting complex (APC) inhibitor RCA1/Emi1 as regulators of centromere propagation. We show that CYCA is centromere localized and that CYCA and RCA1/Emi1 couple centromere assembly to the cell cycle through regulation of the fizzy-related/CDH1 subunit of the APC. Our findings identify essential components of the epigenetic machinery that ensures proper specification and propagation of the centromere and suggest a mechanism for coordinating centromere inheritance with cell division.

  11. Genome-wide analysis reveals a cell cycle–dependent mechanism controlling centromere propagation

    Science.gov (United States)

    Erhardt, Sylvia; Mellone, Barbara G.; Betts, Craig M.; Zhang, Weiguo; Karpen, Gary H.; Straight, Aaron F.

    2008-01-01

    Centromeres are the structural and functional foundation for kinetochore formation, spindle attachment, and chromosome segregation. In this study, we isolated factors required for centromere propagation using genome-wide RNA interference screening for defects in centromere protein A (CENP-A; centromere identifier [CID]) localization in Drosophila melanogaster. We identified the proteins CAL1 and CENP-C as essential factors for CID assembly at the centromere. CID, CAL1, and CENP-C coimmunoprecipitate and are mutually dependent for centromere localization and function. We also identified the mitotic cyclin A (CYCA) and the anaphase-promoting complex (APC) inhibitor RCA1/Emi1 as regulators of centromere propagation. We show that CYCA is centromere localized and that CYCA and RCA1/Emi1 couple centromere assembly to the cell cycle through regulation of the fizzy-related/CDH1 subunit of the APC. Our findings identify essential components of the epigenetic machinery that ensures proper specification and propagation of the centromere and suggest a mechanism for coordinating centromere inheritance with cell division. PMID:19047461

  12. A regulatory effect of INMAP on centromere proteins: antisense INMAP induces CENP-B variation and centromeric halo.

    Directory of Open Access Journals (Sweden)

    Tan Tan

    Full Text Available CENP-B is a highly conserved protein that facilitates the assembly of specific centromere structures both in interphase nuclei and on mitotic chromosomes. INMAP is a conserved protein that localizes at nucleus in interphase cells and at mitotic apparatus in mitotic cells. Our previous results showed that INMAP over-expression leads to spindle defects, mitotic arrest and formation of polycentrosomal and multinuclear cells, indicating that INMAP may modulate the function of (a key protein(s in mitotic apparatus. In this study, we demonstrate that INMAP interacts with CENP-B and promotes cleavage of the N-terminal DNA binding domain from CENP-B. The cleaved CENP-B cannot associate with centromeres and thus lose its centromere-related functions. Consistent with these results, CENP-B in INMAP knockdown cells becomes more diffused around kinetochores. Although INMAP knockdown cells do not exhibit gross defects in mitotic spindle formation, these cells go through mitosis, especially prophase and metaphase, with different relative timing, indicating subtle abnormality. These results identify INMAP as a model regulator of CENP-B and support the notion that INMAP regulates mitosis through modulating CENP-B-mediated centromere organization.

  13. Different patterns of evolution in the centromeric and telomeric regions of group A and B haplotypes of the human killer cell Ig-like receptor locus.

    Directory of Open Access Journals (Sweden)

    Chul-Woo Pyo

    Full Text Available The fast evolving human KIR gene family encodes variable lymphocyte receptors specific for polymorphic HLA class I determinants. Nucleotide sequences for 24 representative human KIR haplotypes were determined. With three previously defined haplotypes, this gave a set of 12 group A and 15 group B haplotypes for assessment of KIR variation. The seven gene-content haplotypes are all combinations of four centromeric and two telomeric motifs. 2DL5, 2DS5 and 2DS3 can be present in centromeric and telomeric locations. With one exception, haplotypes having identical gene content differed in their combinations of KIR alleles. Sequence diversity varied between haplotype groups and between centromeric and telomeric halves of the KIR locus. The most variable A haplotype genes are in the telomeric half, whereas the most variable genes characterizing B haplotypes are in the centromeric half. Of the highly polymorphic genes, only the 3DL3 framework gene exhibits a similar diversity when carried by A and B haplotypes. Phylogenetic analysis and divergence time estimates, point to the centromeric gene-content motifs that distinguish A and B haplotypes having emerged ~6 million years ago, contemporaneously with the separation of human and chimpanzee ancestors. In contrast, the telomeric motifs that distinguish A and B haplotypes emerged more recently, ~1.7 million years ago, before the emergence of Homo sapiens. Thus the centromeric and telomeric motifs that typify A and B haplotypes have likely been present throughout human evolution. The results suggest the common ancestor of A and B haplotypes combined a B-like centromeric region with an A-like telomeric region.

  14. HACking the centromere chromatin code: insights from human artificial chromosomes.

    Science.gov (United States)

    Bergmann, Jan H; Martins, Nuno M C; Larionov, Vladimir; Masumoto, Hiroshi; Earnshaw, William C

    2012-07-01

    The centromere is a specialized chromosomal region that serves as the assembly site of the kinetochore. At the centromere, CENP-A nucleosomes form part of a chromatin landscape termed centrochromatin. This chromatin environment conveys epigenetic marks regulating kinetochore formation. Recent work sheds light on the intricate relationship between centrochromatin state, the CENP-A assembly pathway and the maintenance of centromere function. Here, we review the emerging picture of how chromatin affects mammalian kinetochore formation. We place particular emphasis on data obtained from Human Artificial Chromosome (HAC) biology and the targeted engineering of centrochromatin using synthetic HACs. We discuss implications of these findings, which indicate that a delicate balance of histone modifications and chromatin state dictates both de novo centromere formation and the maintenance of centromere identity in dividing cell populations.

  15. Meiotic Studies on Combinations of Chromosomes With Different Sized Centromeres in Maize

    Directory of Open Access Journals (Sweden)

    Fangpu Han

    2018-06-01

    Full Text Available Multiple centromere misdivision derivatives of a translocation between the supernumerary B chromosome and the short arm of chromosome 9 (TB-9Sb permit investigation of how centromeres of different sizes behave in meiosis in opposition or in competition with each other. In the first analysis, heterozygotes were produced between the normal TB-9Sb and derivatives of it that resulted from centromere misdivision that reduced the amounts of centromeric DNA. These heterozygotes could test whether these drastic differences would result in meiotic drive of the larger chromosome in female meiosis. Cytological determinations of the segregation of large and small centromeres among thousands of progeny of four combinations were made. The recovery of the larger centromere was at a few percent higher frequency in two of four combinations. However, examination of phosphorylated histone H2A-Thr133, a characteristic of active centromeres, showed a lack of correlation with the size of the centromeric DNA, suggesting an expansion of the basal protein features of the kinetochore in two of the three cases despite the reduction in the size of the underlying DNA. In the second analysis, plants containing different sizes of the B chromosome centromere were crossed to plants with TB-9Sb with a foldback duplication of 9S (TB-9Sb-Dp9. In the progeny, plants containing large and small versions of the B chromosome centromere were selected by FISH. A meiotic “tug of war” occurred in hybrid combinations by recombination between the normal 9S and the foldback duplication in those cases in which pairing occurred. Such pairing and recombination produce anaphase I bridges but in some cases the large and small centromeres progressed to the same pole. In one combination, new dicentric chromosomes were found in the progeny. Collectively, the results indicate that the size of the underlying DNA of a centromere does not dramatically affect its segregation properties or its ability

  16. Restructuring of Holocentric Centromeres During Meiosis in the Plant Rhynchospora pubera.

    Science.gov (United States)

    Marques, André; Schubert, Veit; Houben, Andreas; Pedrosa-Harand, Andrea

    2016-10-01

    Centromeres are responsible for the correct segregation of chromosomes during mitosis and meiosis. Holocentric chromosomes, characterized by multiple centromere units along each chromatid, have particular adaptations to ensure regular disjunction during meiosis. Here we show by detecting CENH3, CENP-C, tubulin, and centromeric repeats that holocentromeres may be organized differently in mitosis and meiosis of Rhynchospora pubera Contrasting to the mitotic linear holocentromere organization, meiotic centromeres show several clusters of centromere units (cluster-holocentromeres) during meiosis I. They accumulate along the poleward surface of bivalents where spindle fibers perpendicularly attach. During meiosis II, the cluster-holocentromeres are mostly present in the midregion of each chromatid. A linear holocentromere organization is restored after meiosis during pollen mitosis. Thus, a not yet described case of a cluster-holocentromere organization, showing a clear centromere restructuration between mitosis and meiosis, was identified in a holocentric organism. Copyright © 2016 by the Genetics Society of America.

  17. Centromeres Off the Hook: Massive Changes in Centromere Size and Structure Following Duplication of CenH3 Gene in Fabeae Species

    Czech Academy of Sciences Publication Activity Database

    Neumann, Pavel; Pavlíková, Zuzana; Koblížková, Andrea; Vrbová, Iva; Jedličková, Veronika; Novák, Petr; Macas, Jiří

    2015-01-01

    Roč. 32, č. 7 (2015), s. 1862-1879 ISSN 0737-4038 R&D Projects: GA ČR(CZ) GAP501/11/1843; GA MŠk(CZ) LH11058 Institutional support: RVO:60077344 Keywords : Centromere * CenH3 * centromere drive * chromosome Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 13.649, year: 2015

  18. Widespread Positive Selection Drives Differentiation of Centromeric Proteins in the Drosophila melanogaster subgroup.

    Science.gov (United States)

    Beck, Emily A; Llopart, Ana

    2015-11-25

    Rapid evolution of centromeric satellite repeats is thought to cause compensatory amino acid evolution in interacting centromere-associated kinetochore proteins. Cid, a protein that mediates kinetochore/centromere interactions, displays particularly high amino acid turnover. Rapid evolution of both Cid and centromeric satellite repeats led us to hypothesize that the apparent compensatory evolution may extend to interacting partners in the Condensin I complex (i.e., SMC2, SMC4, Cap-H, Cap-D2, and Cap-G) and HP1s. Missense mutations in these proteins often result in improper centromere formation and aberrant chromosome segregation, thus selection for maintained function and coevolution among proteins of the complex is likely strong. Here, we report evidence of rapid evolution and recurrent positive selection in seven centromere-associated proteins in species of the Drosophila melanogaster subgroup, and further postulate that positive selection on these proteins could be a result of centromere drive and compensatory changes, with kinetochore proteins competing for optimal spindle attachment.

  19. Centromeric DNA replication reconstitution reveals DNA loops and ATR checkpoint suppression.

    Science.gov (United States)

    Aze, Antoine; Sannino, Vincenzo; Soffientini, Paolo; Bachi, Angela; Costanzo, Vincenzo

    2016-06-01

    Half of the human genome is made up of repetitive DNA. However, mechanisms underlying replication of chromosome regions containing repetitive DNA are poorly understood. We reconstituted replication of defined human chromosome segments using bacterial artificial chromosomes in Xenopus laevis egg extract. Using this approach we characterized the chromatin assembly and replication dynamics of centromeric alpha-satellite DNA. Proteomic analysis of centromeric chromatin revealed replication-dependent enrichment of a network of DNA repair factors including the MSH2-6 complex, which was required for efficient centromeric DNA replication. However, contrary to expectations, the ATR-dependent checkpoint monitoring DNA replication fork arrest could not be activated on highly repetitive DNA due to the inability of the single-stranded DNA binding protein RPA to accumulate on chromatin. Electron microscopy of centromeric DNA and supercoil mapping revealed the presence of topoisomerase I-dependent DNA loops embedded in a protein matrix enriched for SMC2-4 proteins. This arrangement suppressed ATR signalling by preventing RPA hyper-loading, facilitating replication of centromeric DNA. These findings have important implications for our understanding of repetitive DNA metabolism and centromere organization under normal and stressful conditions.

  20. A segment of the apospory-specific genomic region is highly microsyntenic not only between the apomicts Pennisetum squamulatum and buffelgrass, but also with a rice chromosome 11 centromeric-proximal genomic region.

    Science.gov (United States)

    Gualtieri, Gustavo; Conner, Joann A; Morishige, Daryl T; Moore, L David; Mullet, John E; Ozias-Akins, Peggy

    2006-03-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory.

  1. RNAi and heterochromatin repress centromeric meiotic recombination

    DEFF Research Database (Denmark)

    Ellermeier, Chad; Higuchi, Emily C; Phadnis, Naina

    2010-01-01

    During meiosis, the formation of viable haploid gametes from diploid precursors requires that each homologous chromosome pair be properly segregated to produce an exact haploid set of chromosomes. Genetic recombination, which provides a physical connection between homologous chromosomes, is essen......During meiosis, the formation of viable haploid gametes from diploid precursors requires that each homologous chromosome pair be properly segregated to produce an exact haploid set of chromosomes. Genetic recombination, which provides a physical connection between homologous chromosomes....... Surprisingly, one mutant derepressed for recombination in the heterochromatic mating-type region during meiosis and several mutants derepressed for centromeric gene expression during mitotic growth are not derepressed for centromeric recombination during meiosis. These results reveal a complex relation between...... types of repression by heterochromatin. Our results also reveal a previously undemonstrated role for RNAi and heterochromatin in the repression of meiotic centromeric recombination and, potentially, in the prevention of birth defects by maintenance of proper chromosome segregation during meiosis....

  2. Molecular cloning and sequence analysis of hamster CENP-A cDNA

    Directory of Open Access Journals (Sweden)

    Valdivia Manuel M

    2002-05-01

    Full Text Available Abstract Background The centromere is a specialized locus that mediates chromosome movement during mitosis and meiosis. This chromosomal domain comprises a uniquely packaged form of heterochromatin that acts as a nucleus for the assembly of the kinetochore a trilaminar proteinaceous structure on the surface of each chromatid at the primary constriction. Kinetochores mediate interactions with the spindle fibers of the mitotic apparatus. Centromere protein A (CENP-A is a histone H3-like protein specifically located to the inner plate of kinetochore at active centromeres. CENP-A works as a component of specialized nucleosomes at centromeres bound to arrays of repeat satellite DNA. Results We have cloned the hamster homologue of human and mouse CENP-A. The cDNA isolated was found to contain an open reading frame encoding a polypeptide consisting of 129 amino acid residues with a C-terminal histone fold domain highly homologous to those of CENP-A and H3 sequences previously released. However, significant sequence divergence was found at the N-terminal region of hamster CENP-A that is five and eleven residues shorter than those of mouse and human respectively. Further, a human serine 7 residue, a target site for Aurora B kinase phosphorylation involved in the mechanism of cytokinesis, was not found in the hamster protein. A human autoepitope at the N-terminal region of CENP-A described in autoinmune diseases is not conserved in the hamster protein. Conclusions We have cloned the hamster cDNA for the centromeric protein CENP-A. Significant differences on protein sequence were found at the N-terminal tail of hamster CENP-A in comparison with that of human and mouse. Our results show a high degree of evolutionary divergence of kinetochore CENP-A proteins in mammals. This is related to the high diverse nucleotide repeat sequences found at the centromere DNA among species and support a current centromere model for kinetochore function and structural

  3. The cohesion protein SOLO associates with SMC1 and is required for synapsis, recombination, homolog bias and cohesion and pairing of centromeres in Drosophila Meiosis.

    Science.gov (United States)

    Yan, Rihui; McKee, Bruce D

    2013-01-01

    Cohesion between sister chromatids is mediated by cohesin and is essential for proper meiotic segregation of both sister chromatids and homologs. solo encodes a Drosophila meiosis-specific cohesion protein with no apparent sequence homology to cohesins that is required in male meiosis for centromere cohesion, proper orientation of sister centromeres and centromere enrichment of the cohesin subunit SMC1. In this study, we show that solo is involved in multiple aspects of meiosis in female Drosophila. Null mutations in solo caused the following phenotypes: 1) high frequencies of homolog and sister chromatid nondisjunction (NDJ) and sharply reduced frequencies of homolog exchange; 2) reduced transmission of a ring-X chromosome, an indicator of elevated frequencies of sister chromatid exchange (SCE); 3) premature loss of centromere pairing and cohesion during prophase I, as indicated by elevated foci counts of the centromere protein CID; 4) instability of the lateral elements (LE)s and central regions of synaptonemal complexes (SCs), as indicated by fragmented and spotty staining of the chromosome core/LE component SMC1 and the transverse filament protein C(3)G, respectively, at all stages of pachytene. SOLO and SMC1 are both enriched on centromeres throughout prophase I, co-align along the lateral elements of SCs and reciprocally co-immunoprecipitate from ovarian protein extracts. Our studies demonstrate that SOLO is closely associated with meiotic cohesin and required both for enrichment of cohesin on centromeres and stable assembly of cohesin into chromosome cores. These events underlie and are required for stable cohesion of centromeres, synapsis of homologous chromosomes, and a recombination mechanism that suppresses SCE to preferentially generate homolog crossovers (homolog bias). We propose that SOLO is a subunit of a specialized meiotic cohesin complex that mediates both centromeric and axial arm cohesion and promotes homolog bias as a component of chromosome

  4. A Segment of the Apospory-Specific Genomic Region Is Highly Microsyntenic Not Only between the Apomicts Pennisetum squamulatum and Buffelgrass, But Also with a Rice Chromosome 11 Centromeric-Proximal Genomic Region1[W

    Science.gov (United States)

    Gualtieri, Gustavo; Conner, Joann A.; Morishige, Daryl T.; Moore, L. David; Mullet, John E.; Ozias-Akins, Peggy

    2006-01-01

    Bacterial artificial chromosome (BAC) clones from apomicts Pennisetum squamulatum and buffelgrass (Cenchrus ciliaris), isolated with the apospory-specific genomic region (ASGR) marker ugt197, were assembled into contigs that were extended by chromosome walking. Gene-like sequences from contigs were identified by shotgun sequencing and BLAST searches, and used to isolate orthologous rice contigs. Additional gene-like sequences in the apomicts' contigs were identified by bioinformatics using fully sequenced BACs from orthologous rice contigs as templates, as well as by interspecies, whole-contig cross-hybridizations. Hierarchical contig orthology was rapidly assessed by constructing detailed long-range contig molecular maps showing the distribution of gene-like sequences and markers, and searching for microsyntenic patterns of sequence identity and spatial distribution within and across species contigs. We found microsynteny between P. squamulatum and buffelgrass contigs. Importantly, this approach also enabled us to isolate from within the rice (Oryza sativa) genome contig Rice A, which shows the highest microsynteny and is most orthologous to the ugt197-containing C1C buffelgrass contig. Contig Rice A belongs to the rice genome database contig 77 (according to the current September 12, 2003, rice fingerprint contig build) that maps proximal to the chromosome 11 centromere, a feature that interestingly correlates with the mapping of ASGR-linked BACs proximal to the centromere or centromere-like sequences. Thus, relatedness between these two orthologous contigs is supported both by their molecular microstructure and by their centromeric-proximal location. Our discoveries promote the use of a microsynteny-based positional-cloning approach using the rice genome as a template to aid in constructing the ASGR toward the isolation of genes underlying apospory. PMID:16415213

  5. Novel ZBTB24 Mutation Associated with Immunodeficiency, Centromere Instability, and Facial Anomalies Type-2 Syndrome Identified in a Patient with Very Early Onset Inflammatory Bowel Disease.

    Science.gov (United States)

    Conrad, Máire A; Dawany, Noor; Sullivan, Kathleen E; Devoto, Marcella; Kelsen, Judith R

    2017-12-01

    Very early onset inflammatory bowel disease, diagnosed in children ≤5 years old, can be the initial presentation of some primary immunodeficiencies. In this study, we describe a 17-month-old boy with recurrent infections, growth failure, facial anomalies, and inflammatory bowel disease. Immune evaluation, whole-exome sequencing, karyotyping, and methylation array were performed to evaluate the child's constellation of symptoms and examination findings. Whole-exome sequencing revealed that the child was homozygous for a novel variant in ZBTB24, the gene associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome. This describes the first case of inflammatory bowel disease associated with immunodeficiency, centromere instability, and facial anomalies type-2 syndrome in a child with a novel disease-causing mutation in ZBTB24 found on whole-exome sequencing.

  6. Differential Regulation of Strand-Specific Transcripts from Arabidopsis Centromeric Satellite Repeats.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Full Text Available Centromeres interact with the spindle apparatus to enable chromosome disjunction and typically contain thousands of tandemly arranged satellite repeats interspersed with retrotransposons. While their role has been obscure, centromeric repeats are epigenetically modified and centromere specification has a strong epigenetic component. In the yeast Schizosaccharomyces pombe, long heterochromatic repeats are transcribed and contribute to centromere function via RNA interference (RNAi. In the higher plant Arabidopsis thaliana, as in mammalian cells, centromeric satellite repeats are short (180 base pairs, are found in thousands of tandem copies, and are methylated. We have found transcripts from both strands of canonical, bulk Arabidopsis repeats. At least one subfamily of 180-base pair repeats is transcribed from only one strand and regulated by RNAi and histone modification. A second subfamily of repeats is also silenced, but silencing is lost on both strands in mutants in the CpG DNA methyltransferase MET1, the histone deacetylase HDA6/SIL1, or the chromatin remodeling ATPase DDM1. This regulation is due to transcription from Athila2 retrotransposons, which integrate in both orientations relative to the repeats, and differs between strains of Arabidopsis. Silencing lost in met1 or hda6 is reestablished in backcrosses to wild-type, but silencing lost in RNAi mutants and ddm1 is not. Twenty-four-nucleotide small interfering RNAs from centromeric repeats are retained in met1 and hda6, but not in ddm1, and may have a role in this epigenetic inheritance. Histone H3 lysine-9 dimethylation is associated with both classes of repeats. We propose roles for transcribed repeats in the epigenetic inheritance and evolution of centromeres.

  7. Molecular characterization and chromosomal distribution of a species-specific transcribed centromeric satellite repeat from the olive fruit fly, Bactrocera oleae.

    Directory of Open Access Journals (Sweden)

    Konstantina T Tsoumani

    Full Text Available Satellite repetitive sequences that accumulate in the heterochromatin consist a large fraction of a genome and due to their properties are suggested to be implicated in centromere function. Current knowledge of heterochromatic regions of Bactrocera oleae genome, the major pest of the olive tree, is practically nonexistent. In our effort to explore the repetitive DNA portion of B. oleae genome, a novel satellite sequence designated BoR300 was isolated and cloned. The present study describes the genomic organization, abundance and chromosomal distribution of BoR300 which is organized in tandem, forming arrays of 298 bp-long monomers. Sequence analysis showed an AT content of 60.4%, a CENP-B like-motif and a high curvature value based on predictive models. Comparative analysis among randomly selected monomers demonstrated a high degree of sequence homogeneity (88%-97% of BoR300 repeats, which are present at approximately 3,000 copies per haploid genome accounting for about 0.28% of the total genomic DNA, based on two independent qPCR approaches. In addition, expression of the repeat was also confirmed through RT-PCR, by which BoR300 transcripts were detected in both sexes. Fluorescence in situ hybridization (FISH of BoR300 on mitotic metaphases and polytene chromosomes revealed signals to the centromeres of two out of the six chromosomes which indicated a chromosome-specific centromeric localization. Moreover, BoR300 is not conserved in the closely related Bactrocera species tested and it is also absent in other dipterans, but it's rather restricted to the B. oleae genome. This feature of species-specificity attributed to BoR300 satellite makes it a good candidate as an identification probe of the insect among its relatives at early development stages.

  8. Evolutionary movement of centromeres in horse, donkey, and zebra.

    Science.gov (United States)

    Carbone, Lucia; Nergadze, Solomon G; Magnani, Elisa; Misceo, Doriana; Francesca Cardone, Maria; Roberto, Roberta; Bertoni, Livia; Attolini, Carmen; Francesca Piras, Maria; de Jong, Pieter; Raudsepp, Terje; Chowdhary, Bhanu P; Guérin, Gérard; Archidiacono, Nicoletta; Rocchi, Mariano; Giulotto, Elena

    2006-06-01

    Centromere repositioning (CR) is a recently discovered biological phenomenon consisting of the emergence of a new centromere along a chromosome and the inactivation of the old one. After a CR, the primary constriction and the centromeric function are localized in a new position while the order of physical markers on the chromosome remains unchanged. These events profoundly affect chromosomal architecture. Since horses, asses, and zebras, whose evolutionary divergence is relatively recent, show remarkable morphological similarity and capacity to interbreed despite their chromosomes differing considerably, we investigated the role of CR in the karyotype evolution of the genus Equus. Using appropriate panels of BAC clones in FISH experiments, we compared the centromere position and marker order arrangement among orthologous chromosomes of Burchelli's zebra (Equus burchelli), donkey (Equus asinus), and horse (Equus caballus). Surprisingly, at least eight CRs took place during the evolution of this genus. Even more surprisingly, five cases of CR have occurred in the donkey after its divergence from zebra, that is, in a very short evolutionary time (approximately 1 million years). These findings suggest that in some species the CR phenomenon could have played an important role in karyotype shaping, with potential consequences on population dynamics and speciation.

  9. Molecular cloning and characterization of satellite DNA sequences from constitutive heterochromatin of the habu snake (Protobothrops flavoviridis, Viperidae) and the Burmese python (Python bivittatus, Pythonidae).

    Science.gov (United States)

    Matsubara, Kazumi; Uno, Yoshinobu; Srikulnath, Kornsorn; Seki, Risako; Nishida, Chizuko; Matsuda, Yoichi

    2015-12-01

    Highly repetitive DNA sequences of the centromeric heterochromatin provide valuable molecular cytogenetic markers for the investigation of genomic compartmentalization in the macrochromosomes and microchromosomes of sauropsids. Here, the relationship between centromeric heterochromatin and karyotype evolution was examined using cloned repetitive DNA sequences from two snake species, the habu snake (Protobothrops flavoviridis, Crotalinae, Viperidae) and Burmese python (Python bivittatus, Pythonidae). Three satellite DNA (stDNA) families were isolated from the heterochromatin of these snakes: 168-bp PFL-MspI from P. flavoviridis and 196-bp PBI-DdeI and 174-bp PBI-MspI from P. bivittatus. The PFL-MspI and PBI-DdeI sequences were localized to the centromeric regions of most chromosomes in the respective species, suggesting that the two sequences were the major components of the centromeric heterochromatin in these organisms. The PBI-MspI sequence was localized to the pericentromeric region of four chromosome pairs. The PFL-MspI and the PBI-DdeI sequences were conserved only in the genome of closely related species, Gloydius blomhoffii (Crotalinae) and Python molurus, respectively, although their locations on the chromosomes were slightly different. In contrast, the PBI-MspI sequence was also in the genomes of P. molurus and Boa constrictor (Boidae), and additionally localized to the centromeric regions of eight chromosome pairs in B. constrictor, suggesting that this sequence originated in the genome of a common ancestor of Pythonidae and Boidae, approximately 86 million years ago. The three stDNA sequences showed no genomic compartmentalization between the macrochromosomes and microchromosomes, suggesting that homogenization of the centromeric and/or pericentromeric stDNA sequences occurred in the macrochromosomes and microchromosomes of these snakes.

  10. Satellite DNA Sequences in Canidae and Their Chromosome Distribution in Dog and Red Fox.

    Science.gov (United States)

    Vozdova, Miluse; Kubickova, Svatava; Cernohorska, Halina; Fröhlich, Jan; Rubes, Jiri

    2016-01-01

    Satellite DNA is a characteristic component of mammalian centromeric heterochromatin, and a comparative analysis of its evolutionary dynamics can be used for phylogenetic studies. We analysed satellite and satellite-like DNA sequences available in NCBI for 4 species of the family Canidae (red fox, Vulpes vulpes, VVU; domestic dog, Canis familiaris, CFA; arctic fox, Vulpes lagopus, VLA; raccoon dog, Nyctereutes procyonoides procyonoides, NPR) by comparative sequence analysis, which revealed 86-90% intraspecies and 76-79% interspecies similarity. Comparative fluorescence in situ hybridisation in the red fox and dog showed signals of the red fox satellite probe in canine and vulpine autosomal centromeres, on VVUY, B chromosomes, and in the distal parts of VVU9q and VVU10p which were shown to contain nucleolus organiser regions. The CFA satellite probe stained autosomal centromeres only in the dog. The CFA satellite-like DNA did not show any significant sequence similarity with the satellite DNA of any species analysed and was localised to the centromeres of 9 canine chromosome pairs. No significant heterochromatin block was detected on the B chromosomes of the red fox. Our results show extensive heterogeneity of satellite sequences among Canidae and prove close evolutionary relationships between the red and arctic fox. © 2017 S. Karger AG, Basel.

  11. Centromeric DNA characterization in the model grass Brachypodium distachyon provides insights on the evolution of the genus.

    Science.gov (United States)

    Li, Yinjia; Zuo, Sheng; Zhang, Zhiliang; Li, Zhanjie; Han, Jinlei; Chu, Zhaoqing; Hasterok, Robert; Wang, Kai

    2018-03-01

    Brachypodium distachyon is a well-established model monocot plant, and its small and compact genome has been used as an accurate reference for the much larger and often polyploid genomes of cereals such as Avena sativa (oats), Hordeum vulgare (barley) and Triticum aestivum (wheat). Centromeres are indispensable functional units of chromosomes and they play a core role in genome polyploidization events during evolution. As the Brachypodium genus contains about 20 species that differ significantly in terms of their basic chromosome numbers, genome size, ploidy levels and life strategies, studying their centromeres may provide important insight into the structure and evolution of the genome in this interesting and important genus. In this study, we isolated the centromeric DNA of the B. distachyon reference line Bd21 and characterized its composition via the chromatin immunoprecipitation of the nucleosomes that contain the centromere-specific histone CENH3. We revealed that the centromeres of Bd21 have the features of typical multicellular eukaryotic centromeres. Strikingly, these centromeres contain relatively few centromeric satellite DNAs; in particular, the centromere of chromosome 5 (Bd5) consists of only ~40 kb. Moreover, the centromeric retrotransposons in B. distachyon (CRBds) are evolutionarily young. These transposable elements are located both within and adjacent to the CENH3 binding domains, and have similar compositions. Moreover, based on the presence of CRBds in the centromeres, the species in this study can be grouped into two distinct lineages. This may provide new evidence regarding the phylogenetic relationships within the Brachypodium genus. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.

  12. Mps1 kinase-dependent Sgo2 centromere localisation mediates cohesin protection in mouse oocyte meiosis I.

    Science.gov (United States)

    El Yakoubi, Warif; Buffin, Eulalie; Cladière, Damien; Gryaznova, Yulia; Berenguer, Inés; Touati, Sandra A; Gómez, Rocío; Suja, José A; van Deursen, Jan M; Wassmann, Katja

    2017-09-25

    A key feature of meiosis is the step-wise removal of cohesin, the protein complex holding sister chromatids together, first from arms in meiosis I and then from the centromere region in meiosis II. Centromeric cohesin is protected by Sgo2 from Separase-mediated cleavage, in order to maintain sister chromatids together until their separation in meiosis II. Failures in step-wise cohesin removal result in aneuploid gametes, preventing the generation of healthy embryos. Here, we report that kinase activities of Bub1 and Mps1 are required for Sgo2 localisation to the centromere region. Mps1 inhibitor-treated oocytes are defective in centromeric cohesin protection, whereas oocytes devoid of Bub1 kinase activity, which cannot phosphorylate H2A at T121, are not perturbed in cohesin protection as long as Mps1 is functional. Mps1 and Bub1 kinase activities localise Sgo2 in meiosis I preferentially to the centromere and pericentromere respectively, indicating that Sgo2 at the centromere is required for protection.In meiosis I centromeric cohesin is protected by Sgo2 from Separase-mediated cleavage ensuring that sister chromatids are kept together until their separation in meiosis II. Here the authors demonstrate that Bub1 and Mps1 kinase activities are required for Sgo2 localisation to the centromere region.

  13. Recurrent Gene Duplication Leads to Diverse Repertoires of Centromeric Histones in Drosophila Species.

    Science.gov (United States)

    Kursel, Lisa E; Malik, Harmit S

    2017-06-01

    Despite their essential role in the process of chromosome segregation in most eukaryotes, centromeric histones show remarkable evolutionary lability. Not only have they been lost in multiple insect lineages, but they have also undergone gene duplication in multiple plant lineages. Based on detailed study of a handful of model organisms including Drosophila melanogaster, centromeric histone duplication is considered to be rare in animals. Using a detailed phylogenomic study, we find that Cid, the centromeric histone gene, has undergone at least four independent gene duplications during Drosophila evolution. We find duplicate Cid genes in D. eugracilis (Cid2), in the montium species subgroup (Cid3, Cid4) and in the entire Drosophila subgenus (Cid5). We show that Cid3, Cid4, and Cid5 all localize to centromeres in their respective species. Some Cid duplicates are primarily expressed in the male germline. With rare exceptions, Cid duplicates have been strictly retained after birth, suggesting that they perform nonredundant centromeric functions, independent from the ancestral Cid. Indeed, each duplicate encodes a distinct N-terminal tail, which may provide the basis for distinct protein-protein interactions. Finally, we show some Cid duplicates evolve under positive selection whereas others do not. Taken together, our results support the hypothesis that Drosophila Cid duplicates have subfunctionalized. Thus, these gene duplications provide an unprecedented opportunity to dissect the multiple roles of centromeric histones. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  14. Phylogeny of horse chromosome 5q in the genus Equus and centromere repositioning.

    Science.gov (United States)

    Piras, F M; Nergadze, S G; Poletto, V; Cerutti, F; Ryder, O A; Leeb, T; Raimondi, E; Giulotto, E

    2009-01-01

    Horses, asses and zebras belong to the genus Equus and are the only extant species of the family Equidae in the order Perissodactyla. In a previous work we demonstrated that a key factor in the rapid karyotypic evolution of this genus was evolutionary centromere repositioning, that is, the shift of the centromeric function to a new position without alteration of the order of markers along the chromosome. In search of previously undiscovered evolutionarily new centromeres, we traced the phylogeny of horse chromosome 5, analyzing the order of BAC markers, derived from a horse genomic library, in 7 Equus species (E. caballus, E. hemionus onager, E. kiang, E. asinus, E. grevyi, E. burchelli and E. zebra hartmannae). This analysis showed that repositioned centromeres are present in E. asinus (domestic donkey, EAS) chromosome 16 and in E. burchelli (Burchell's zebra, EBU) chromosome 17, confirming that centromere repositioning is a strikingly frequent phenomenon in this genus. The observation that the neocentromeres in EAS16 and EBU17 are in the same chromosomal position suggests that they may derive from the same event and therefore, E. asinus and E. burchelli may be more closely related than previously proposed; alternatively, 2 centromere repositioning events, involving the same chromosomal region, may have occurred independently in different lineages, pointing to the possible existence of hot spots for neocentromere formation. Our comparative analysis also showed that, while E. caballus chromosome 5 seems to represent the ancestral configuration, centric fission followed by independent fusion events gave rise to 3 different submetacentric chromosomes in other Equus lineages. (c) 2009 S. Karger AG, Basel.

  15. DNA deformability changes of single base pair mutants within CDE binding sites in S. Cerevisiae centromere DNA correlate with measured chromosomal loss rates and CDE binding site symmetries

    Directory of Open Access Journals (Sweden)

    Marx Kenneth A

    2006-03-01

    Full Text Available Abstract Background The centromeres in yeast (S. cerevisiae are organized by short DNA sequences (125 bp on each chromosome consisting of 2 conserved elements: CDEI and CDEIII spaced by a CDEII region. CDEI and CDEIII are critical sequence specific protein binding sites necessary for correct centromere formation and following assembly with proteins, are positioned near each other on a specialized nucleosome. Hegemann et al. BioEssays 1993, 15: 451–460 reported single base DNA mutants within the critical CDEI and CDEIII binding sites on the centromere of chromosome 6 and quantitated centromere loss of function, which they measured as loss rates for the different chromosome 6 mutants during cell division. Olson et al. Proc Natl Acad Sci USA 1998, 95: 11163–11168 reported the use of protein-DNA crystallography data to produce a DNA dinucleotide protein deformability energetic scale (PD-scale that describes local DNA deformability by sequence specific binding proteins. We have used the PD-scale to investigate the DNA sequence dependence of the yeast chromosome 6 mutants' loss rate data. Each single base mutant changes 2 PD-scale values at that changed base position relative to the wild type. In this study, we have utilized these mutants to demonstrate a correlation between the change in DNA deformability of the CDEI and CDEIII core sites and the overall experimentally measured chromosome loss rates of the chromosome 6 mutants. Results In the CDE I and CDEIII core binding regions an increase in the magnitude of change in deformability of chromosome 6 single base mutants with respect to the wild type correlates to an increase in the measured chromosome loss rate. These correlations were found to be significant relative to 105 Monte Carlo randomizations of the dinucleotide PD-scale applied to the same calculation. A net loss of deformability also tends to increase the loss rate. Binding site position specific, 4 data-point correlations were also

  16. Transgenerational propagation and quantitative maintenance of paternal centromeres depends on Cid/Cenp-A presence in Drosophila sperm.

    Directory of Open Access Journals (Sweden)

    Nitika Raychaudhuri

    Full Text Available In Drosophila melanogaster, as in many animal and plant species, centromere identity is specified epigenetically. In proliferating cells, a centromere-specific histone H3 variant (CenH3, named Cid in Drosophila and Cenp-A in humans, is a crucial component of the epigenetic centromere mark. Hence, maintenance of the amount and chromosomal location of CenH3 during mitotic proliferation is important. Interestingly, CenH3 may have different roles during meiosis and the onset of embryogenesis. In gametes of Caenorhabditis elegans, and possibly in plants, centromere marking is independent of CenH3. Moreover, male gamete differentiation in animals often includes global nucleosome for protamine exchange that potentially could remove CenH3 nucleosomes. Here we demonstrate that the control of Cid loading during male meiosis is distinct from the regulation observed during the mitotic cycles of early embryogenesis. But Cid is present in mature sperm. After strong Cid depletion in sperm, paternal centromeres fail to integrate into the gonomeric spindle of the first mitosis, resulting in gynogenetic haploid embryos. Furthermore, after moderate depletion, paternal centromeres are unable to re-acquire normal Cid levels in the next generation. We conclude that Cid in sperm is an essential component of the epigenetic centromere mark on paternal chromosomes and it exerts quantitative control over centromeric Cid levels throughout development. Hence, the amount of Cid that is loaded during each cell cycle appears to be determined primarily by the preexisting centromeric Cid, with little flexibility for compensation of accidental losses.

  17. Mislocalization of the Drosophila centromere-specific histone CIDpromotes formation of functional ectopic kinetochores

    Energy Technology Data Exchange (ETDEWEB)

    Heun, Patrick; Erhardt, Sylvia; Blower, Michael D.; Weiss,Samara; Skora, Andrew D.; Karpen, Gary H.

    2006-01-30

    The centromere-specific histone variant CENP-A (CID in Drosophila) is a structural and functional foundation for kinetochore formation and chromosome segregation. Here, we show that overexpressed CID is mislocalized into normally non-centromeric regions in Drosophila tissue culture cells and animals. Analysis of mitoses in living and fixed cells reveals that mitotic delays, anaphase bridges, chromosome fragmentation, and cell and organismal lethality are all direct consequences of CID mislocalization. In addition, proteins that are normally restricted to endogenous kinetochores assemble at a subset of ectopic CID incorporation regions. The presence of microtubule motors and binding proteins, spindle attachments, and aberrant chromosome morphologies demonstrate that these ectopic kinetochores are functional. We conclude that CID mislocalization promotes formation of ectopic centromeres and multicentric chromosomes, which causes chromosome missegregation, aneuploidy, and growth defects. Thus, CENP-A mislocalization is one possible mechanism for genome instability during cancer progression, as well as centromere plasticity during evolution.

  18. Epigenetic Regulation of Centromere Chromatin Stability by Dietary and Environmental Factors.

    Science.gov (United States)

    Hernández-Saavedra, Diego; Strakovsky, Rita S; Ostrosky-Wegman, Patricia; Pan, Yuan-Xiang

    2017-11-01

    The centromere is a genomic locus required for the segregation of the chromosomes during cell division. This chromosomal region together with pericentromeres has been found to be susceptible to damage, and thus the perturbation of the centromere could lead to the development of aneuploidic events. Metabolic abnormalities that underlie the generation of cancer include inflammation, oxidative stress, cell cycle deregulation, and numerous others. The micronucleus assay, an early clinical marker of cancer, has been shown to provide a reliable measure of genotoxic damage that may signal cancer initiation. In the current review, we will discuss the events that lead to micronucleus formation and centromeric and pericentromeric chromatin instability, as well transcripts emanating from these regions, which were previously thought to be inactive. Studies were selected in PubMed if they reported the effects of nutritional status (macro- and micronutrients) or environmental toxicant exposure on micronucleus frequency or any other chromosomal abnormality in humans, animals, or cell models. Mounting evidence from epidemiologic, environmental, and nutritional studies provides a novel perspective on the origination of aneuploidic events. Although substantial evidence exists describing the role that nutritional status and environmental toxicants have on the generation of micronuclei and other nuclear aberrations, limited information is available to describe the importance of macro- and micronutrients on centromeric and pericentromeric chromatin stability. Moving forward, studies that specifically address the direct link between nutritional status, excess, or deficiency and the epigenetic regulation of the centromere will provide much needed insight into the nutritional and environmental regulation of this chromosomal region and the initiation of aneuploidy. © 2017 American Society for Nutrition.

  19. Centromeres cluster de novo at the beginning of meiosis in Brachypodium distachyon.

    Directory of Open Access Journals (Sweden)

    Ruoyu Wen

    Full Text Available In most eukaryotes that have been studied, the telomeres cluster into a bouquet early in meiosis, and in wheat and its relatives and in Arabidopsis the centromeres pair at the same time. In Arabidopsis, the telomeres do not cluster as a typical telomere bouquet on the nuclear membrane but are associated with the nucleolus both somatically and at the onset of meiosis. We therefore assessed whether Brachypodium distachyon, a monocot species related to cereals and whose genome is approximately twice the size of Arabidopsis thaliana, also exhibited an atypical telomere bouquet and centromere pairing. In order to investigate the occurrence of a bouquet and centromere pairing in B distachyon, we first had to establish protocols for studying meiosis in this species. This enabled us to visualize chromosome behaviour in meiocytes derived from young B distachyon spikelets in three-dimensions by fluorescent in situ hybridization (FISH, and accurately to stage meiosis based on chromatin morphology in relation to spikelet size and the timing of sample collection. Surprisingly, this study revealed that the centromeres clustered as a single site at the same time as the telomeres also formed a bouquet or single cluster.

  20. Centromere separation and association in the nuclei of an interspecific hybrid between Torenia fournieri and T. baillonii (Scrophulariaceae) during mitosis and meiosis.

    Science.gov (United States)

    Kikuchi, Shinji; Tanaka, Hiroyuki; Wako, Toshiyuki; Tsujimoto, Hisashi

    2007-10-01

    In the nuclei of some interspecific hybrid and allopolyploid plant species, each genome occupies a separate spatial domain. To analyze this phenomenon, we studied localization of the centromeres in the nuclei of a hybrid between Torenia fournieri and T. baillonii during mitosis and meiosis using three-dimensional fluorescence in situ hybridization (3D-FISH) probed with species-specific centromere repeats. Centromeres of each genome were located separately in undifferentiated cells but not differentiated cells, suggesting that cell division might be the possible force causing centromere separation. However, no remarkable difference of dividing distance was detected between chromatids with different centromeres in anaphase and telophase, indicating that tension of the spindle fiber attached to each chromatid is not the cause of centromere separation in Torenia. In differentiated cells, centromeres in both genomes were not often observed for the expected chromosome number, indicating centromere association. In addition, association of centromeres from the same genome was observed at a higher frequency than between different genomes. This finding suggests that centromeres within one genome are spatially separated from those within the other. This close position may increase possibility of association between centromeres of the same genome. In meiotic prophase, all centromeres irrespective of the genome were associated in a certain portion of the nucleus. Since centromere association in the interspecific hybrid and amphiploid was tighter than that in the diploid parents, it is possible that this phenomenon may be involved in sorting and pairing of homologous chromosomes.

  1. Telomere-Centromere-Driven Genomic Instability Contributes to Karyotype Evolution in a Mouse Model of Melanoma

    Directory of Open Access Journals (Sweden)

    Amanda Gonçalves dos Santos Silva

    2010-01-01

    Full Text Available Aneuploidy and chromosomal instability (CIN are hallmarks of most solid tumors. These alterations may result from inaccurate chromosomal segregation during mitosis, which can occur through several mechanisms including defective telomere metabolism, centrosome amplification, dysfunctional centromeres, and/or defective spindle checkpoint control. In this work, we used an in vitro murine melanoma model that uses a cellular adhesion blockade as a transforming factor to characterize telomeric and centromeric alterations that accompany melanocyte transformation. To study the timing of the occurrence of telomere shortening in this transformation model, we analyzed the profile of telomere length by quantitative fluorescent in situ hybridization and found that telomere length significantly decreased as additional rounds of cell adhesion blockages were performed. Together with it, an increase in telomere-free ends and complex karyotypic aberrations were also found, which include Robertsonian fusions in 100% of metaphases of the metastatic melanoma cells. These findings are in agreement with the idea that telomere length abnormalities seem to be one of the earliest genetic alterations acquired in the multistep process of malignant transformation and that telomere abnormalities result in telomere aggregation, breakage-bridge-fusion cycles, and CIN. Another remarkable feature of this model is the abundance of centromeric instability manifested as centromere fragments and centromeric fusions. Taken together, our results illustrate for this melanoma model CIN with a structural signature of centromere breakage and telomeric loss.

  2. Meiosis-Specific Loading of the Centromere-Specific Histone CENH3 in Arabidopsis thaliana

    Science.gov (United States)

    Ravi, Maruthachalam; Shibata, Fukashi; Ramahi, Joseph S.; Nagaki, Kiyotaka; Chen, Changbin; Murata, Minoru; Chan, Simon W. L.

    2011-01-01

    Centromere behavior is specialized in meiosis I, so that sister chromatids of homologous chromosomes are pulled toward the same side of the spindle (through kinetochore mono-orientation) and chromosome number is reduced. Factors required for mono-orientation have been identified in yeast. However, comparatively little is known about how meiotic centromere behavior is specialized in animals and plants that typically have large tandem repeat centromeres. Kinetochores are nucleated by the centromere-specific histone CENH3. Unlike conventional histone H3s, CENH3 is rapidly evolving, particularly in its N-terminal tail domain. Here we describe chimeric variants of CENH3 with alterations in the N-terminal tail that are specifically defective in meiosis. Arabidopsis thaliana cenh3 mutants expressing a GFP-tagged chimeric protein containing the H3 N-terminal tail and the CENH3 C-terminus (termed GFP-tailswap) are sterile because of random meiotic chromosome segregation. These defects result from the specific depletion of GFP-tailswap protein from meiotic kinetochores, which contrasts with its normal localization in mitotic cells. Loss of the GFP-tailswap CENH3 variant in meiosis affects recruitment of the essential kinetochore protein MIS12. Our findings suggest that CENH3 loading dynamics might be regulated differently in mitosis and meiosis. As further support for our hypothesis, we show that GFP-tailswap protein is recruited back to centromeres in a subset of pollen grains in GFP-tailswap once they resume haploid mitosis. Meiotic recruitment of the GFP-tailswap CENH3 variant is not restored by removal of the meiosis-specific cohesin subunit REC8. Our results reveal the existence of a specialized loading pathway for CENH3 during meiosis that is likely to involve the hypervariable N-terminal tail. Meiosis-specific CENH3 dynamics may play a role in modulating meiotic centromere behavior. PMID:21695238

  3. Endogenous pararetrovirus sequences associated with 24 nt small RNAs at the centromeres of Fritillaria imperialis L. (Liliaceae), a species with a giant genome

    Czech Academy of Sciences Publication Activity Database

    Becher, H.; Ma, L.; Kelly, L.J.; Kovařík, Aleš; Leitch, I. J.; Leitch, Andrew R.

    2014-01-01

    Roč. 80, č. 5 (2014), s. 823-833 ISSN 0960-7412 R&D Projects: GA ČR(CZ) GA13-10057S Institutional support: RVO:68081707 Keywords : pararetrovirus * Fritillaria imperialis * centromere Subject RIV: BO - Biophysics Impact factor: 5.972, year: 2014

  4. SOLO: a meiotic protein required for centromere cohesion, coorientation, and SMC1 localization in Drosophila melanogaster.

    Science.gov (United States)

    Yan, Rihui; Thomas, Sharon E; Tsai, Jui-He; Yamada, Yukihiro; McKee, Bruce D

    2010-02-08

    Sister chromatid cohesion is essential to maintain stable connections between homologues and sister chromatids during meiosis and to establish correct centromere orientation patterns on the meiosis I and II spindles. However, the meiotic cohesion apparatus in Drosophila melanogaster remains largely uncharacterized. We describe a novel protein, sisters on the loose (SOLO), which is essential for meiotic cohesion in Drosophila. In solo mutants, sister centromeres separate before prometaphase I, disrupting meiosis I centromere orientation and causing nondisjunction of both homologous and sister chromatids. Centromeric foci of the cohesin protein SMC1 are absent in solo mutants at all meiotic stages. SOLO and SMC1 colocalize to meiotic centromeres from early prophase I until anaphase II in wild-type males, but both proteins disappear prematurely at anaphase I in mutants for mei-S332, which encodes the Drosophila homologue of the cohesin protector protein shugoshin. The solo mutant phenotypes and the localization patterns of SOLO and SMC1 indicate that they function together to maintain sister chromatid cohesion in Drosophila meiosis.

  5. APC/C-Cdc20 mediates deprotection of centromeric cohesin at meiosis II in yeast.

    Science.gov (United States)

    Jonak, Katarzyna; Zagoriy, Ievgeniia; Oz, Tugce; Graf, Peter; Rojas, Julie; Mengoli, Valentina; Zachariae, Wolfgang

    2017-06-18

    Cells undergoing meiosis produce haploid gametes through one round of DNA replication followed by 2 rounds of chromosome segregation. This requires that cohesin complexes, which establish sister chromatid cohesion during S phase, are removed in a stepwise manner. At meiosis I, the separase protease triggers the segregation of homologous chromosomes by cleaving cohesin's Rec8 subunit on chromosome arms. Cohesin persists at centromeres because the PP2A phosphatase, recruited by the shugoshin protein, dephosphorylates Rec8 and thereby protects it from cleavage. While chromatids disjoin upon cleavage of centromeric Rec8 at meiosis II, it was unclear how and when centromeric Rec8 is liberated from its protector PP2A. One proposal is that bipolar spindle forces separate PP2A from Rec8 as cells enter metaphase II. We show here that sister centromere biorientation is not sufficient to "deprotect" Rec8 at meiosis II in yeast. Instead, our data suggest that the ubiquitin-ligase APC/C Cdc20 removes PP2A from centromeres by targeting for degradation the shugoshin Sgo1 and the kinase Mps1. This implies that Rec8 remains protected until entry into anaphase II when it is phosphorylated concurrently with the activation of separase. Here, we provide further support for this model and speculate on its relevance to mammalian oocytes.

  6. Esperanto for histones: CENP-A, not CenH3, is the centromeric histone H3 variant.

    Science.gov (United States)

    Earnshaw, W C; Allshire, R C; Black, B E; Bloom, K; Brinkley, B R; Brown, W; Cheeseman, I M; Choo, K H A; Copenhaver, G P; Deluca, J G; Desai, A; Diekmann, S; Erhardt, S; Fitzgerald-Hayes, M; Foltz, D; Fukagawa, T; Gassmann, R; Gerlich, D W; Glover, D M; Gorbsky, G J; Harrison, S C; Heun, P; Hirota, T; Jansen, L E T; Karpen, G; Kops, G J P L; Lampson, M A; Lens, S M; Losada, A; Luger, K; Maiato, H; Maddox, P S; Margolis, R L; Masumoto, H; McAinsh, A D; Mellone, B G; Meraldi, P; Musacchio, A; Oegema, K; O'Neill, R J; Salmon, E D; Scott, K C; Straight, A F; Stukenberg, P T; Sullivan, B A; Sullivan, K F; Sunkel, C E; Swedlow, J R; Walczak, C E; Warburton, P E; Westermann, S; Willard, H F; Wordeman, L; Yanagida, M; Yen, T J; Yoda, K; Cleveland, D W

    2013-04-01

    The first centromeric protein identified in any species was CENP-A, a divergent member of the histone H3 family that was recognised by autoantibodies from patients with scleroderma-spectrum disease. It has recently been suggested to rename this protein CenH3. Here, we argue that the original name should be maintained both because it is the basis of a long established nomenclature for centromere proteins and because it avoids confusion due to the presence of canonical histone H3 at centromeres.

  7. Generation of an approximately 2.4 Mb human X centromere-based minichromosome by targeted telomere-associated chromosome fragmentation in DT40.

    Science.gov (United States)

    Mills, W; Critcher, R; Lee, C; Farr, C J

    1999-05-01

    A linear mammalian artificial chromosome (MAC) will require at least three types of functional element: a centromere, two telomeres and origins of replication. As yet, our understanding of these elements, as well as many other aspects of structure and organization which may be critical for a fully functional mammalian chromosome, remains poor. As a way of defining these various requirements, minichromosome reagents are being developed and analysed. Approaches for minichromosome generation fall into two broad categories: de novo assembly from candidate DNA sequences, or the fragmentation of an existing chromosome to reduce it to a minimal size. Here we describe the generation of a human minichromosome using the latter, top-down, approach. A human X chromosome, present in a DT40-human microcell hybrid, has been manipulated using homologous recombination and the targeted seeding of a de novo telomere. This strategy has generated a linear approximately 2.4 Mb human X centromere-based minichromosome capped by two artificially seeded telomeres: one immediately flanking the centromeric alpha-satellite DNA and the other targeted to the zinc finger gene ZXDA in Xp11.21. The chromosome retains an alpha-satellite domain of approximately 1. 8 Mb, a small array of gamma-satellite repeat ( approximately 40 kb) and approximately 400 kb of Xp proximal DNA sequence. The mitotic stability of this minichromosome has been examined, both in DT40 and following transfer into hamster and human cell lines. In all three backgrounds, the minichromosome is retained efficiently, but in the human and hamster microcell hybrids its copy number is poorly regulated. This approach of engineering well-defined chromosome reagents will allow key questions in MAC development (such as whether a lower size limit exists) to be addressed. In addition, the 2.4 Mb minichromosome described here has potential to be developed as a vector for gene delivery.

  8. The Role of Dicentric Chromosome Formation and Secondary Centromere Deletion in the Evolution of Myeloid Malignancy

    Science.gov (United States)

    MacKinnon, Ruth N.; Campbell, Lynda J.

    2011-01-01

    Dicentric chromosomes have been identified as instigators of the genome instability associated with cancer, but this instability is often resolved by one of a number of different secondary events. These include centromere inactivation, inversion, and intercentromeric deletion. Deletion or excision of one of the centromeres may be a significant occurrence in myeloid malignancy and other malignancies but has not previously been widely recognized, and our reports are the first describing centromere deletion in cancer cells. We review what is known about dicentric chromosomes and the mechanisms by which they can undergo stabilization in both constitutional and cancer genomes. The failure to identify centromere deletion in cancer cells until recently can be partly explained by the standard approaches to routine diagnostic cancer genome analysis, which do not identify centromeres in the context of chromosome organization. This hitherto hidden group of primary dicentric, secondary monocentric chromosomes, together with other unrecognized dicentric chromosomes, points to a greater role for dicentric chromosomes in cancer initiation and progression than is generally acknowledged. We present a model that predicts and explains a significant role for dicentric chromosomes in the formation of unbalanced translocations in malignancy. PMID:22567363

  9. The Role of Dicentric Chromosome Formation and Secondary Centromere Deletion in the Evolution of Myeloid Malignancy

    Directory of Open Access Journals (Sweden)

    Ruth N. MacKinnon

    2011-01-01

    Full Text Available Dicentric chromosomes have been identified as instigators of the genome instability associated with cancer, but this instability is often resolved by one of a number of different secondary events. These include centromere inactivation, inversion, and intercentromeric deletion. Deletion or excision of one of the centromeres may be a significant occurrence in myeloid malignancy and other malignancies but has not previously been widely recognized, and our reports are the first describing centromere deletion in cancer cells. We review what is known about dicentric chromosomes and the mechanisms by which they can undergo stabilization in both constitutional and cancer genomes. The failure to identify centromere deletion in cancer cells until recently can be partly explained by the standard approaches to routine diagnostic cancer genome analysis, which do not identify centromeres in the context of chromosome organization. This hitherto hidden group of primary dicentric, secondary monocentric chromosomes, together with other unrecognized dicentric chromosomes, points to a greater role for dicentric chromosomes in cancer initiation and progression than is generally acknowledged. We present a model that predicts and explains a significant role for dicentric chromosomes in the formation of unbalanced translocations in malignancy.

  10. Centromere pairing by a plasmid-encoded type I ParB protein

    DEFF Research Database (Denmark)

    Ringgaard, Simon; Löwe, Jan; Gerdes, Kenn

    2007-01-01

    The par2 locus of Escherichia coli plasmid pB171 encodes two trans-acting proteins, ParA and ParB, and two cis-acting sites, parC1 and parC2, to which ParB binds cooperatively. ParA is related to MinD and oscillates in helical structures and thereby positions ParB/parC-carrying plasmids regularly......, hence identifying the N terminus of ParB as a requirement for ParB-mediated centromere pairing. These observations suggest that centromere pairing is an important intermediate step in plasmid partitioning mediated by the common type I loci....

  11. Functional Identification of the Plasmodium Centromere and Generation of a Plasmodium Artificial Chromosome

    OpenAIRE

    Iwanaga, Shiroh; Khan, Shahid M.; Kaneko, Izumi; Christodoulou, Zoe; Newbold, Chris; Yuda, Masao; Janse, Chris J.; Waters, Andrew P.

    2010-01-01

    Summary The artificial chromosome represents a useful tool for gene transfer, both as cloning vectors and in chromosome biology research. To generate a Plasmodium artificial chromosome (PAC), we had to first functionally identify and characterize the parasite's centromere. A putative centromere (pbcen5) was cloned from chromosome 5 of the rodent parasite P. berghei based on a Plasmodium gene-synteny map. Plasmids containing pbcen5 were stably maintained in parasites during a blood-stage infec...

  12. Expressed Centromere Specific Histone 3 (CENH3 Variants in Cultivated Triploid and Wild Diploid Bananas (Musa spp.

    Directory of Open Access Journals (Sweden)

    Kariuki S. Muiruri

    2017-06-01

    Full Text Available Centromeres are specified by a centromere specific histone 3 (CENH3 protein, which exists in a complex environment, interacting with conserved proteins and rapidly evolving satellite DNA sequences. The interactions may become more challenging if multiple CENH3 versions are introduced into the zygote as this can affect post-zygotic mitosis and ultimately sexual reproduction. Here, we characterize CENH3 variant transcripts expressed in cultivated triploid and wild diploid progenitor bananas. We describe both splice- and allelic-[Single Nucleotide Polymorphisms (SNP] variants and their effects on the predicted secondary structures of protein. Expressed CENH3 transcripts from six banana genotypes were characterized and clustered into three groups (MusaCENH-1A, MusaCENH-1B, and MusaCENH-2 based on similarity. The CENH3 groups differed with SNPs as well as presence of indels resulting from retained and/or skipped exons. The CENH3 transcripts from different banana genotypes were spliced in either 7/6, 5/4 or 6/5 exons/introns. The 7/6 and the 5/4 exon/intron structures were found in both diploids and triploids, however, 7/6 was most predominant. The 6/5 exon/introns structure was a result of failure of the 7/6 to splice correctly. The various transcripts obtained were predicted to encode highly variable N-terminal tails and a relatively conserved C-terminal histone fold domain (HFD. The SNPs were predicted in some cases to affect the secondary structure of protein by lengthening or shorting the affected domains. Sequencing of banana CENH3 transcripts predicts SNP variations that affect amino acid sequences and alternatively spliced transcripts. Most of these changes affect the N-terminal tail of CENH3.

  13. Positioning of chromosomes in human spermatozoa is determined by ordered centromere arrangement.

    Directory of Open Access Journals (Sweden)

    Olga S Mudrak

    Full Text Available The intranuclear positioning of chromosomes (CHRs is a well-documented fact; however, mechanisms directing such ordering remain unclear. Unlike somatic cells, human spermatozoa contain distinct spatial markers and have asymmetric nuclei which make them a unique model for localizing CHR territories and matching peri-centromere domains. In this study, we established statistically preferential longitudinal and lateral positioning for eight CHRs. Both parameters demonstrated a correlation with the CHR gene densities but not with their sizes. Intranuclear non-random positioning of the CHRs was found to be driven by a specific linear order of centromeres physically interconnected in continuous arrays. In diploid spermatozoa, linear order of peri-centromeres was identical in two genome sets and essentially matched the arrangement established for haploid cells. We propose that the non-random longitudinal order of CHRs in human spermatozoa is generated during meiotic stages of spermatogenesis. The specific arrangement of sperm CHRs may serve as an epigenetic basis for differential transcription/replication and direct spatial CHR organization during early embryogenesis.

  14. Mps1 kinase-dependent Sgo2 centromere localisation mediates cohesin protection in mouse oocyte meiosis I

    NARCIS (Netherlands)

    Yakoubi, W. El; Buffin, E.; Cladiere, D.; Gryaznova, Y.; Berenguer, I.; Touati, S.A.; Gomez, R.; Suja, J.A.; Deursen, J.M.A. van; Wassmann, K.

    2017-01-01

    A key feature of meiosis is the step-wise removal of cohesin, the protein complex holding sister chromatids together, first from arms in meiosis I and then from the centromere region in meiosis II. Centromeric cohesin is protected by Sgo2 from Separase-mediated cleavage, in order to maintain sister

  15. Toxoplasma gondii chromodomain protein 1 binds to heterochromatin and colocalises with centromeres and telomeres at the nuclear periphery.

    Directory of Open Access Journals (Sweden)

    Mathieu Gissot

    Full Text Available BACKGROUND: Apicomplexan parasites are responsible for some of the most deadly parasitic diseases afflicting humans, including malaria and toxoplasmosis. These obligate intracellular parasites exhibit a complex life cycle and a coordinated cell cycle-dependant expression program. Their cell division is a coordinated multistep process. How this complex mechanism is organised remains poorly understood. METHODS AND FINDINGS: In this study, we provide evidence for a link between heterochromatin, cell division and the compartmentalisation of the nucleus in Toxoplasma gondii. We characterised a T. gondii chromodomain containing protein (named TgChromo1 that specifically binds to heterochromatin. Using ChIP-on-chip on a genome-wide scale, we report TgChromo1 enrichment at the peri-centromeric chromatin. In addition, we demonstrate that TgChromo1 is cell-cycle regulated and co-localised with markers of the centrocone. Through the loci-specific FISH technique for T. gondii, we confirmed that TgChromo1 occupies the same nuclear localisation as the peri-centromeric sequences. CONCLUSION: We propose that TgChromo1 may play a role in the sequestration of chromosomes at the nuclear periphery and in the process of T. gondii cell division.

  16. Centromere replication timing determines different forms of genomic instability in Saccharomyces cerevisiae checkpoint mutants during replication stress.

    Science.gov (United States)

    Feng, Wenyi; Bachant, Jeff; Collingwood, David; Raghuraman, M K; Brewer, Bonita J

    2009-12-01

    Yeast replication checkpoint mutants lose viability following transient exposure to hydroxyurea, a replication-impeding drug. In an effort to understand the basis for this lethality, we discovered that different events are responsible for inviability in checkpoint-deficient cells harboring mutations in the mec1 and rad53 genes. By monitoring genomewide replication dynamics of cells exposed to hydroxyurea, we show that cells with a checkpoint deficient allele of RAD53, rad53K227A, fail to duplicate centromeres. Following removal of the drug, however, rad53K227A cells recover substantial DNA replication, including replication through centromeres. Despite this recovery, the rad53K227A mutant fails to achieve biorientation of sister centromeres during recovery from hydroxyurea, leading to secondary activation of the spindle assembly checkpoint (SAC), aneuploidy, and lethal chromosome segregation errors. We demonstrate that cell lethality from this segregation defect could be partially remedied by reinforcing bipolar attachment. In contrast, cells with the mec1-1 sml1-1 mutations suffer from severely impaired replication resumption upon removal of hydroxyurea. mec1-1 sml1-1 cells can, however, duplicate at least some of their centromeres and achieve bipolar attachment, leading to abortive segregation and fragmentation of incompletely replicated chromosomes. Our results highlight the importance of replicating yeast centromeres early and reveal different mechanisms of cell death due to differences in replication fork progression.

  17. Separase Is Required for Homolog and Sister Disjunction during Drosophila melanogaster Male Meiosis, but Not for Biorientation of Sister Centromeres.

    Science.gov (United States)

    Blattner, Ariane C; Chaurasia, Soumya; McKee, Bruce D; Lehner, Christian F

    2016-04-01

    Spatially controlled release of sister chromatid cohesion during progression through the meiotic divisions is of paramount importance for error-free chromosome segregation during meiosis. Cohesion is mediated by the cohesin protein complex and cleavage of one of its subunits by the endoprotease separase removes cohesin first from chromosome arms during exit from meiosis I and later from the pericentromeric region during exit from meiosis II. At the onset of the meiotic divisions, cohesin has also been proposed to be present within the centromeric region for the unification of sister centromeres into a single functional entity, allowing bipolar orientation of paired homologs within the meiosis I spindle. Separase-mediated removal of centromeric cohesin during exit from meiosis I might explain sister centromere individualization which is essential for subsequent biorientation of sister centromeres during meiosis II. To characterize a potential involvement of separase in sister centromere individualization before meiosis II, we have studied meiosis in Drosophila melanogaster males where homologs are not paired in the canonical manner. Meiosis does not include meiotic recombination and synaptonemal complex formation in these males. Instead, an alternative homolog conjunction system keeps homologous chromosomes in pairs. Using independent strategies for spermatocyte-specific depletion of separase complex subunits in combination with time-lapse imaging, we demonstrate that separase is required for the inactivation of this alternative conjunction at anaphase I onset. Mutations that abolish alternative homolog conjunction therefore result in random segregation of univalents during meiosis I also after separase depletion. Interestingly, these univalents become bioriented during meiosis II, suggesting that sister centromere individualization before meiosis II does not require separase.

  18. Essential loci in centromeric heterochromatin of Drosophila melanogaster. I: the right arm of chromosome 2.

    Science.gov (United States)

    Coulthard, Alistair B; Alm, Christina; Cealiac, Iulia; Sinclair, Don A; Honda, Barry M; Rossi, Fabrizio; Dimitri, Patrizio; Hilliker, Arthur J

    2010-06-01

    With the most recent releases of the Drosophila melanogaster genome sequences, much of the previously absent heterochromatic sequences have now been annotated. We undertook an extensive genetic analysis of existing lethal mutations, as well as molecular mapping and sequence analysis (using a candidate gene approach) to identify as many essential genes as possible in the centromeric heterochromatin on the right arm of the second chromosome (2Rh) of D. melanogaster. We also utilized available RNA interference lines to knock down the expression of genes in 2Rh as another approach to identifying essential genes. In total, we verified the existence of eight novel essential loci in 2Rh: CG17665, CG17683, CG17684, CG17883, CG40127, CG41265, CG42595, and Atf6. Two of these essential loci, CG41265 and CG42595, are synonymous with the previously characterized loci l(2)41Ab and unextended, respectively. The genetic and molecular analysis of the previously reported locus, l(2)41Ae, revealed that this is not a single locus, but rather it is a large region of 2Rh that extends from unextended (CG42595) to CG17665 and includes four of the novel loci uncovered here.

  19. Nuclear organization in human sperm: preliminary evidence for altered sex chromosome centromere position in infertile males.

    Science.gov (United States)

    Finch, K A; Fonseka, K G L; Abogrein, A; Ioannou, D; Handyside, A H; Thornhill, A R; Hickson, N; Griffin, D K

    2008-06-01

    Many genetic defects with a chromosomal basis affect male reproduction via a range of different mechanisms. Chromosome position is a well-known marker of nuclear organization, and alterations in standard patterns can lead to disease phenotypes such as cancer, laminopathies and epilepsy. It has been demonstrated that normal mammalian sperm adopt a pattern with the centromeres aligning towards the nuclear centre. The purpose of this study was to test the hypothesis that altered chromosome position in the sperm head is associated with male infertility. The average nuclear positions of fluorescence in-situ hybridization signals for three centromeric probes (for chromosomes X, Y and 18) were compared in normoozoospermic men and in men with compromised semen parameters. In controls, the centromeres of chromosomes X, Y and 18 all occupied a central nuclear location. In infertile men the sex chromosomes appeared more likely to be distributed in a pattern not distinguishable from a random model. Our findings cast doubt on the reliability of centromeric probes for aneuploidy screening. The analysis of chromosome position in sperm heads should be further investigated for the screening of infertile men.

  20. The TubR-centromere complex adopts a double-ring segrosome structure in Type III partition systems.

    Science.gov (United States)

    Martín-García, Bárbara; Martín-González, Alejandro; Carrasco, Carolina; Hernández-Arriaga, Ana M; Ruíz-Quero, Rubén; Díaz-Orejas, Ramón; Aicart-Ramos, Clara; Moreno-Herrero, Fernando; Oliva, María A

    2018-05-14

    In prokaryotes, the centromere is a specialized segment of DNA that promotes the assembly of the segrosome upon binding of the Centromere Binding Protein (CBP). The segrosome structure exposes a specific surface for the interaction of the CBP with the motor protein that mediates DNA movement during cell division. Additionally, the CBP usually controls the transcriptional regulation of the segregation system as a cell cycle checkpoint. Correct segrosome functioning is therefore indispensable for accurate DNA segregation. Here, we combine biochemical reconstruction and structural and biophysical analysis to bring light to the architecture of the segrosome complex in Type III partition systems. We present the particular features of the centromere site, tubC, of the model system encoded in Clostridium botulinum prophage c-st. We find that the split centromere site contains two different iterons involved in the binding and spreading of the CBP, TubR. The resulting nucleoprotein complex consists of a novel double-ring structure that covers part of the predicted promoter. Single molecule data provides a mechanism for the formation of the segrosome structure based on DNA bending and unwinding upon TubR binding.

  1. Recombination patterns reveal information about centromere location on linkage maps

    DEFF Research Database (Denmark)

    Limborg, Morten T.; McKinney, Garrett J.; Seeb, Lisa W.

    2016-01-01

    . mykiss) characterized by low and unevenly distributed recombination – a general feature of male meiosis in many species. Further, a high frequency of double crossovers along chromosome arms in barley reduced resolution for locating centromeric regions on most linkage groups. Despite these limitations...

  2. Chromosome segregation regulation in human zygotes : Altered mitotic histone phosphorylation dynamics underlying centromeric targeting of the chromosomal passenger complex

    NARCIS (Netherlands)

    Van De Werken, C.; Avo Santos, M.; Laven, J. S E; Eleveld, C.; Fauser, B. C J M; Lens, S. M A; Baart, E. B.

    2015-01-01

    STUDY QUESTION Are the kinase feedback loops that regulate activation and centromeric targeting of the chromosomal passenger complex (CPC), functional during mitosis in human embryos? SUMMARY ANSWER Investigation of the regulatory kinase pathways involved in centromeric CPC targeting revealed normal

  3. Abnormal centromere-chromatid apposition (ACCA) and Peters' anomaly.

    Science.gov (United States)

    Wertelecki, W; Dev, V G; Superneau, D W

    1985-08-01

    Abnormal centromere-chromatid apposition (ACCA) was noted in a patient with Peters' anomaly. Previous reports of ACCA emphasized its association with tetraphocomelia and other congenital malformations (Roberts, SC Phocomelia, Pseudothalidomide Syndromes). This report expands the array of congenital malformations associated with ACCA and emphasizes the diagnostic importance of ocular defects for the ascertainment of additional cases of ACCA and its possible relationship with abnormal cell division.

  4. Regulation of Centromere Localization of the Drosophila Shugoshin MEI-S332 and Sister-Chromatid Cohesion in Meiosis

    Science.gov (United States)

    Nogueira, Cristina; Kashevsky, Helena; Pinto, Belinda; Clarke, Astrid; Orr-Weaver, Terry L.

    2014-01-01

    The Shugoshin (Sgo) protein family helps to ensure proper chromosome segregation by protecting cohesion at the centromere by preventing cleavage of the cohesin complex. Some Sgo proteins also influence other aspects of kinetochore-microtubule attachments. Although many Sgo members require Aurora B kinase to localize to the centromere, factors controlling delocalization are poorly understood and diverse. Moreover, it is not clear how Sgo function is inactivated and whether this is distinct from delocalization. We investigated these questions in Drosophila melanogaster, an organism with superb chromosome cytology to monitor Sgo localization and quantitative assays to test its function in sister-chromatid segregation in meiosis. Previous research showed that in mitosis in cell culture, phosphorylation of the Drosophila Sgo, MEI-S332, by Aurora B promotes centromere localization, whereas Polo phosphorylation promotes delocalization. These studies also suggested that MEI-S332 can be inactivated independently of delocalization, a conclusion supported here by localization and function studies in meiosis. Phosphoresistant and phosphomimetic mutants for the Aurora B and Polo phosphorylation sites were examined for effects on MEI-S332 localization and chromosome segregation in meiosis. Strikingly, MEI-S332 with a phosphomimetic mutation in the Aurora B phosphorylation site prematurely dissociates from the centromeres in meiosis I. Despite the absence of MEI-S332 on meiosis II centromeres in male meiosis, sister chromatids segregate normally, demonstrating that detectable levels of this Sgo are not essential for chromosome congression, kinetochore biorientation, or spindle assembly. PMID:25081981

  5. Clinical spectrum of immunodeficiency, centromeric instability and facial dysmorphism (ICF syndrome).

    NARCIS (Netherlands)

    Hagleitner, M.M.; Lankester, A.; Maraschio, P.; Hulten, M.; Fryns, J.P.; Schuetz, C.; Gimelli, G.; Davies, E.G.; Gennery, A.R.; Belohradsky, B.H.; Groot, R. de; Gerritsen, E.J.; Mattina, T.; Howard, P.J.; Fasth, A.; Reisli, I.; Furthner, D.; Slatter, M.A.; Cant, A.J.; Cazzola, G.; Dijken, P.J. van; Deuren, M. van; Greef, J.C. de; Maarel, S.M. van der; Weemaes, C.M.R.

    2008-01-01

    BACKGROUND: Immunodeficiency, centromeric instability and facial dysmorphism (ICF syndrome) is a rare autosomal recessive disease characterised by facial dysmorphism, immunoglobulin deficiency and branching of chromosomes 1, 9 and 16 after PHA stimulation of lymphocytes. Hypomethylation of DNA of a

  6. Conservation of the Centromere/Kinetochore Protein ZW10

    OpenAIRE

    Starr, Daniel A.; Williams, Byron C.; Li, Zexiao; Etemad-Moghadam, Bijan; Dawe, R. Kelly; Goldberg, Michael L.

    1997-01-01

    Mutations in the essential Drosophila melanogaster gene zw10 disrupt chromosome segregation, producing chromosomes that lag at the metaphase plate during anaphase of mitosis and both meiotic divisions. Recent evidence suggests that the product of this gene, DmZW10, acts at the kinetochore as part of a tension-sensing checkpoint at anaphase onset. DmZW10 displays an intriguing cell cycle–dependent intracellular distribution, apparently moving from the centromere/kinetochore at prometaphase to ...

  7. Loss of maternal ATRX results in centromere instability and aneuploidy in the mammalian oocyte and pre-implantation embryo.

    Directory of Open Access Journals (Sweden)

    Claudia Baumann

    2010-09-01

    Full Text Available The α-thalassemia/mental retardation X-linked protein (ATRX is a chromatin-remodeling factor known to regulate DNA methylation at repetitive sequences of the human genome. We have previously demonstrated that ATRX binds to pericentric heterochromatin domains in mouse oocytes at the metaphase II stage where it is involved in mediating chromosome alignment at the meiotic spindle. However, the role of ATRX in the functional differentiation of chromatin structure during meiosis is not known. To test ATRX function in the germ line, we developed an oocyte-specific transgenic RNAi knockdown mouse model. Our results demonstrate that ATRX is required for heterochromatin formation and maintenance of chromosome stability during meiosis. During prophase I arrest, ATRX is necessary to recruit the transcriptional regulator DAXX (death domain associated protein to pericentric heterochromatin. At the metaphase II stage, transgenic ATRX-RNAi oocytes exhibit abnormal chromosome morphology associated with reduced phosphorylation of histone 3 at serine 10 as well as chromosome segregation defects leading to aneuploidy and severely reduced fertility. Notably, a large proportion of ATRX-depleted oocytes and 1-cell stage embryos exhibit chromosome fragments and centromeric DNA-containing micronuclei. Our results provide novel evidence indicating that ATRX is required for centromere stability and the epigenetic control of heterochromatin function during meiosis and the transition to the first mitosis.

  8. Analysis of Primary Structural Determinants That Distinguish the Centromere-Specific Function of Histone Variant Cse4p from Histone H3

    OpenAIRE

    Keith, Kevin C.; Baker, Richard E.; Chen, Yinhuai; Harris, Kendra; Stoler, Sam; Fitzgerald-Hayes, Molly

    1999-01-01

    Cse4p is a variant of histone H3 that has an essential role in chromosome segregation and centromere chromatin structure in budding yeast. Cse4p has a unique 135-amino-acid N terminus and a C-terminal histone-fold domain that is more than 60% identical to histone H3 and the mammalian centromere protein CENP-A. Cse4p and CENP-A have biochemical properties similar to H3 and probably replace H3 in centromere-specific nucleosomes in yeasts and mammals, respectively. In order to identify regions o...

  9. B chromosomes are more frequent in mammals with acrocentric karyotypes: support for the theory of centromeric drive.

    Science.gov (United States)

    Palestis, Brian G; Burt, Austin; Jones, R Neil; Trivers, Robert

    2004-02-07

    The chromosomes of mammals tend to be either mostly acrocentric (having one long arm) or mostly bi-armed, with few species having intermediate karyotypes. The theory of centromeric drive suggests that this observation reflects a bias during female meiosis, favouring either more centromeres or fewer, and that the direction of this bias changes frequently over evolutionary time. B chromosomes are selfish genetic elements found in some individuals within some species. B chromosomes are often harmful, but persist because they drive (i.e. they are transmitted more frequently than expected). We predicted that species with mainly acrocentric chromosomes would be more likely to harbour B chromosomes than those with mainly bi-armed chromosomes, because female meiosis would favour more centromeres over fewer in species with one-armed chromosomes. Our results show that B chromosomes are indeed more common in species with acrocentric chromosomes, across all mammals, among rodents, among non-rodents and in a test of independent taxonomic contrasts. These results provide independent evidence supporting the theory of centromeric drive and also help to explain the distribution of selfish DNA across species. In addition, we demonstrate an association between the shape of the B chromosomes and the shape of the typical ('A') chromosomes.

  10. Chromosome segregation regulation in human zygotes: altered mitotic histone phosphorylation dynamics underlying centromeric targeting of the chromosomal passenger complex.

    Science.gov (United States)

    van de Werken, C; Avo Santos, M; Laven, J S E; Eleveld, C; Fauser, B C J M; Lens, S M A; Baart, E B

    2015-10-01

    Are the kinase feedback loops that regulate activation and centromeric targeting of the chromosomal passenger complex (CPC), functional during mitosis in human embryos? Investigation of the regulatory kinase pathways involved in centromeric CPC targeting revealed normal phosphorylation dynamics of histone H2A at T120 (H2ApT120) by Bub1 kinase and subsequent recruitment of Shugoshin, but phosphorylation of histone H3 at threonine 3 (H3pT3) by Haspin failed to show the expected centromeric enrichment on metaphase chromosomes in the zygote. Human cleavage stage embryos show high levels of chromosomal instability. What causes this high error rate is unknown, as mechanisms used to ensure proper chromosome segregation in mammalian embryos are poorly described. In this study, we investigated the pathways regulating CPC targeting to the inner centromere in human embryos. We characterized the distribution of the CPC in relation to activity of its two main centromeric targeting pathways: the Bub1-H2ApT120-Sgo-CPC and Haspin-H3pT3-CPC pathways. The study was conducted between May 2012 and March 2014 on human surplus embryos resulting from in vitro fertilization treatment and donated for research. In zygotes, nuclear envelope breakdown was monitored by time-lapse imaging to allow timed incubations with specific inhibitors to arrest at prometaphase and metaphase, and to interfere with Haspin and Aurora B/C kinase activity. Functionality of the targeting pathways was assessed through characterization of histone phosphorylation dynamics by immunofluorescent analysis, combined with gene expression by RT-qPCR and immunofluorescent localization of key pathway proteins. Immunofluorescent analysis of the CPC subunit Inner Centromere Protein revealed the pool of stably bound CPC proteins was not strictly confined to the inner centromere of prometaphase chromosomes in human zygotes, as observed in later stages of preimplantation development and somatic cells. Investigation of the

  11. Single molecule localization imaging of telomeres and centromeres using fluorescence in situ hybridization and semiconductor quantum dots.

    Science.gov (United States)

    Wang, Le; Zong, Shenfei; Wang, Zhuyuan; Lu, Ju; Chen, Chen; Zhang, Ruohu; Cui, Yiping

    2018-07-13

    Single molecule localization microscopy (SMLM) is a powerful tool for imaging biological targets at the nanoscale. In this report, we present SMLM imaging of telomeres and centromeres using fluorescence in situ hybridization (FISH). The FISH probes were fabricated by decorating CdSSe/ZnS quantum dots (QDs) with telomere or centromere complementary DNA strands. SMLM imaging experiments using commercially available peptide nucleic acid (PNA) probes labeled with organic fluorophores were also conducted to demonstrate the advantages of using QDs FISH probes. Compared with the PNA probes, the QDs probes have the following merits. First, the fluorescence blinking of QDs can be realized in aqueous solution or PBS buffer without thiol, which is a key buffer component for organic fluorophores' blinking. Second, fluorescence blinking of the QDs probe needs only one excitation light (i.e. 405 nm). While fluorescence blinking of the organic fluorophores usually requires two illumination lights, that is, the activation light (i.e. 405 nm) and the imaging light. Third, the high quantum yield, multiple switching times and a good optical stability make the QDs more suitable for long-term imaging. The localization precision achieved in telomeres and centromeres imaging experiments is about 30 nm, which is far beyond the diffraction limit. SMLM has enabled new insights into telomeres or centromeres on the molecular level, and it is even possible to determine the length of telomere and become a potential technique for telomere-related investigation.

  12. Perturbation of Incenp function impedes anaphase chromatid movements and chromosomal passenger protein flux at centromeres.

    Science.gov (United States)

    Ahonen, Leena J; Kukkonen, Anu M; Pouwels, Jeroen; Bolton, Margaret A; Jingle, Christopher D; Stukenberg, P Todd; Kallio, Marko J

    2009-02-01

    Incenp is an essential mitotic protein that, together with Aurora B, Survivin, and Borealin, forms the core of the chromosomal passenger protein complex (CPC). The CPC regulates various mitotic processes and functions to maintain genomic stability. The proper subcellular localization of the CPC and its full catalytic activity require the presence of each core subunit in the complex. We have investigated the mitotic tasks of the CPC using a function blocking antibody against Incenp microinjected into cells at different mitotic phases. This method allowed temporal analysis of CPC functions without perturbation of complex assembly or activity prior to injection. We have also studied the dynamic properties of Incenp and Aurora B using fusion protein photobleaching. We found that in early mitotic cells, Incenp and Aurora B exhibit dynamic turnover at centromeres, which is prevented by the anti-Incenp antibody. In these cells, the loss of centromeric CPC turnover is accompanied by forced mitotic exit without the execution of cytokinesis. Introduction of anti-Incenp antibody into early anaphase cells causes abnormalities in sister chromatid separation through defects in anaphase spindle functions. In summary, our data uncovers new mitotic roles for the CPC in anaphase and proposes that CPC turnover at centromeres modulates spindle assembly checkpoint signaling.

  13. Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

    Science.gov (United States)

    Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

    2016-01-01

    A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.

  14. Integration of hepatitis B virus DNA in chromosome-specific satellite sequences

    International Nuclear Information System (INIS)

    Shaul, Y.; Garcia, P.D.; Schonberg, S.; Rutter, W.J.

    1986-01-01

    The authors previously reported the cloning and detailed analysis of the integrated hepatitis B virus sequences in a human hepatoma cell line. They report here the integration of at least one of hepatitis B virus at human satellite DNA sequences. The majority of the cellular sequences identified by this satellite were organized as a multimeric composition of a 0.6-kilobase EcoRI fragment. This clone hybridized in situ almost exclusively to the centromeric heterochromatin of chromosomes 1 and 16 and to a lower extent to chromosome 2 and to the heterochromatic region of the Y chromosome. The immediate flanking host sequence appeared as a hierarchy of repeating units which were almost identical to a previously reported human satellite III DNA sequence

  15. A cytogenetic study of hospital workers occupationally exposed to radionuclides in Serbia. Premature centromere division as novel biomarker of exposure?

    Energy Technology Data Exchange (ETDEWEB)

    Pajic, Jelena; Rakic, Boban [Serbian Institute of Occupational Health ' ' Dr Dragomir Karajovic' ' , Belgrade (Serbia). Biodosimetry Dept.; Jovicic, Dubravka [Univ. ' ' Singidunum' ' , Belgrade (Serbia). Genotoxicology Dept.; Milovanovic, Aleksandar [Serbian Institute of Occupational Health ' ' Dr Dragomir Karajovic' ' , Belgrade (Serbia). Biodosimetry Dept.; Belgrade Univ. (Serbia). Occupational Health Dept.

    2016-04-15

    The health risk of chronic exposure to radionuclides includes changes in the genome (e.g., chromosomal aberrations and micronuclei) that increase chromosomal instability. There are also other phenomena, which seem to appear more frequently in metaphases of exposed persons (such as premature centromere division). The aim of this study was to discover whether or not there is correlation between incidence of named cytogenetic changes in persons occupationally exposed to radionuclides in comparison with unexposed control group, and if significant correlation is determined, can premature centromere division be consider as a biomarker of radiation exposure? The exposed group comprised 50 individuals occupationally exposed to radionuclides. The reference control group consisted of 40 unexposed individuals. Chromosomal aberrations, micronuclei and premature centromere division were analyzed according to a standard International Atomic Energy Agency protocol. Statistical analyses were performed using SPSS 17.0 statistics.The means for analyzed cytogenetic changes were significantly higher in the exposed group. Positive correlation between them was found in exposed group. Premature centromere division parameter PCD5-10 was selected as particularly suitable for separating groups (exposed/unexposed). Identification of other phenomena related to radionuclide exposure, beside well known, may clarify recent problems in radiobiology concerning the biological response to low doses of ionizing radiation and its consequences.

  16. A cytogenetic study of hospital workers occupationally exposed to radionuclides in Serbia. Premature centromere division as novel biomarker of exposure?

    International Nuclear Information System (INIS)

    Pajic, Jelena; Rakic, Boban; Jovicic, Dubravka; Milovanovic, Aleksandar; Belgrade Univ.

    2016-01-01

    The health risk of chronic exposure to radionuclides includes changes in the genome (e.g., chromosomal aberrations and micronuclei) that increase chromosomal instability. There are also other phenomena, which seem to appear more frequently in metaphases of exposed persons (such as premature centromere division). The aim of this study was to discover whether or not there is correlation between incidence of named cytogenetic changes in persons occupationally exposed to radionuclides in comparison with unexposed control group, and if significant correlation is determined, can premature centromere division be consider as a biomarker of radiation exposure? The exposed group comprised 50 individuals occupationally exposed to radionuclides. The reference control group consisted of 40 unexposed individuals. Chromosomal aberrations, micronuclei and premature centromere division were analyzed according to a standard International Atomic Energy Agency protocol. Statistical analyses were performed using SPSS 17.0 statistics.The means for analyzed cytogenetic changes were significantly higher in the exposed group. Positive correlation between them was found in exposed group. Premature centromere division parameter PCD5-10 was selected as particularly suitable for separating groups (exposed/unexposed). Identification of other phenomena related to radionuclide exposure, beside well known, may clarify recent problems in radiobiology concerning the biological response to low doses of ionizing radiation and its consequences.

  17. Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins

    OpenAIRE

    Meraldi, Patrick; McAinsh, Andrew D; Rheinbay, Esther; Sorger, Peter K

    2006-01-01

    Background: Kinetochores are large multi-protein structures that assemble on centromeric DNA (CEN DNA) and mediate the binding of chromosomes to microtubules. Comprising 125 base-pairs of CEN DNA and 70 or more protein components, Saccharomyces cerevisiae kinetochores are among the best understood. In contrast, most fungal, plant and animal cells assemble kinetochores on CENs that are longer and more complex, raising the question of whether kinetochore architecture has been conserved through ...

  18. Entropic fluctuations in DNA sequences

    Science.gov (United States)

    Thanos, Dimitrios; Li, Wentian; Provata, Astero

    2018-03-01

    The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that despite this reduction of information, LSE allows to extract meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.

  19. Epigenetic Histone Marks of Extended Meta-Polycentric Centromeres of Lathyrus and Pisum Chromosomes

    Czech Academy of Sciences Publication Activity Database

    Neumann, Pavel; Schubert, V.; Vrbová, Iva; Manning, Jasper Eugene; Houben, A.; Macas, Jiří

    2016-01-01

    Roč. 7, č. 234 (2016) ISSN 1664-462X R&D Projects: GA ČR(CZ) GAP501/11/1843 Institutional support: RVO:60077344 Keywords : Centromere structure * epigenetic modifications * histone phosphorylation * histone methylation Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 4.298, year: 2016

  20. Correlation between centromere protein-F autoantibodies and cancer analyzed by enzyme-linked immunosorbent assay

    DEFF Research Database (Denmark)

    Welner, Simon; Trier, Nicole Hartwig; Morten Frisch, Morten

    2013-01-01

    Centromere protein-F (CENP-F) is a large nuclear protein of 367 kDa, which is involved in multiple mitosis-related events such as proper assembly of the kinetochores, stabilization of heterochromatin, chromosome alignment and mitotic checkpoint signaling. Several studies have shown a correlation...

  1. Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

    Science.gov (United States)

    Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas

    2009-03-01

    Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.

  2. Fluorescence In Situ Hybridization (FISH-Based Karyotyping Reveals Rapid Evolution of Centromeric and Subtelomeric Repeats in Common Bean (Phaseolus vulgaris and Relatives

    Directory of Open Access Journals (Sweden)

    Aiko Iwata-Otsubo

    2016-04-01

    Full Text Available Fluorescence in situ hybridization (FISH-based karyotyping is a powerful cytogenetics tool to study chromosome organization, behavior, and chromosome evolution. Here, we developed a FISH-based karyotyping system using a probe mixture comprised of centromeric and subtelomeric satellite repeats, 5S rDNA, and chromosome-specific BAC clones in common bean, which enables one to unambiguously distinguish all 11 chromosome pairs. Furthermore, we applied the karyotyping system to several wild relatives and landraces of common bean from two distinct gene pools, as well as other related Phaseolus species, to investigate repeat evolution in the genus Phaseolus. Comparison of karyotype maps within common bean indicates that chromosomal distribution of the centromeric and subtelomeric satellite repeats is stable, whereas the copy number of the repeats was variable, indicating rapid amplification/reduction of the repeats in specific genomic regions. In Phaseolus species that diverged approximately 2–4 million yr ago, copy numbers of centromeric repeats were largely reduced or diverged, and chromosomal distributions have changed, suggesting rapid evolution of centromeric repeats. We also detected variation in the distribution pattern of subtelomeric repeats in Phaseolus species. The FISH-based karyotyping system revealed that satellite repeats are actively and rapidly evolving, forming genomic features unique to individual common bean accessions and Phaseolus species.

  3. B chromosomes and Robertsonian fusions of Dichroplus pratensis (Acrididae): intraspecific support for the centromeric drive theory.

    Science.gov (United States)

    Bidau, C J; Martí, D A

    2004-01-01

    We tested the centromeric drive theory of karyotypic evolution in the grasshopper Dichroplus pratensis, which is simultaneously polymorphic for eight Robertsonian fusions and two classes of B chromosomes. A logistic regression analysis performed on 53 natural populations from Argentina revealed that B chromosomes are more probably found in populations with a higher proportion of acrocentric chromosomes, as the theory predicts. Furthermore, frequencies of B-carrying individuals are significantly negatively correlated with the mean frequency of different Robertsonian fusions per individual. No significant correlations between presence/absence or frequency of Bs, and latitude or altitude of the sampled populations, were found. We thus provide the first intraspecific evidence supporting the centromeric drive theory in relation to the establishment of B chromosomes in natural populations. Copyright 2004 S. Karger AG, Basel

  4. Characterization of the oncogenic function of centromere protein F in hepatocellular carcinoma

    Energy Technology Data Exchange (ETDEWEB)

    Dai, Yongdong; Liu, Lulu; Zeng, Tingting; Zhu, Ying-Hui [State Key Laboratory of Oncology in Southern China, Sun Yat-Sen University Cancer Center, Guangzhou (China); Li, Jiangchao [Vascular Biology Research Institute, Guangdong Pharmaceutical University, Guangzhou (China); Chen, Leilei [Department of Clinical Oncology, The University of Hong Kong, Pokfulam, Hong Kong (China); Li, Yan; Yuan, Yun-Fei [State Key Laboratory of Oncology in Southern China, Sun Yat-Sen University Cancer Center, Guangzhou (China); Ma, Stephanie, E-mail: stefma@hku.hk [Department of Clinical Oncology, The University of Hong Kong, Pokfulam, Hong Kong (China); State Key Laboratory for Liver Research, The University of Hong Kong, Pokfulam, Hong Kong (China); Guan, Xin-Yuan, E-mail: xyguan@hkucc.hku.hk [State Key Laboratory of Oncology in Southern China, Sun Yat-Sen University Cancer Center, Guangzhou (China); Department of Clinical Oncology, The University of Hong Kong, Pokfulam, Hong Kong (China); State Key Laboratory for Liver Research, The University of Hong Kong, Pokfulam, Hong Kong (China)

    2013-07-12

    Highlights: •Overexpression of CENPF is frequently detected in HCC. •Upregulation of CENPF serves as an independent prognosis factor in HCC patients. •CENPF functions as an oncogene in HCC by promoting cell G2/M transition. -- Abstract: Centromere protein F (CENPF) is an essential nuclear protein associated with the centromere-kinetochore complex and plays a critical role in chromosome segregation during mitosis. Up-regulation of CENPF expression has previously been detected in several solid tumors. In this study, we aim to study the expression and functional role of CENPF in hepatocellular carcinoma (HCC). We found CENPF was frequently overexpressed in HCC as compared with non-tumor tissue. Up-regulated CENPF expression in HCC was positively correlated with serum AFP, venous invasion, advanced differentiation stage and a shorter overall survival. Cox regression analysis found that overexpression of CENPF was an independent prognosis factor in HCC. Functional studies found that silencing CENPF could decrease the ability of the cells to proliferate, form colonies and induce tumor formation in nude mice. Silencing CENPF also resulted in the cell cycle arrest at G2/M checkpoint by down-regulating cell cycle proteins cdc2 and cyclin B1. Our data suggest that CENPF is frequently overexpressed in HCC and plays a critical role in driving HCC tumorigenesis.

  5. MHF1-2/CENP-S-X performs distinct roles in centromere metabolism and genetic recombination.

    Science.gov (United States)

    Bhattacharjee, Sonali; Osman, Fekret; Feeney, Laura; Lorenz, Alexander; Bryer, Claire; Whitby, Matthew C

    2013-09-11

    The histone-fold proteins Mhf1/CENP-S and Mhf2/CENP-X perform two important functions in vertebrate cells. First, they are components of the constitutive centromere-associated network, aiding kinetochore assembly and function. Second, they work with the FANCM DNA translocase to promote DNA repair. However, it has been unclear whether there is crosstalk between these roles. We show that Mhf1 and Mhf2 in fission yeast, as in vertebrates, serve a dual function, aiding DNA repair/recombination and localizing to centromeres to promote chromosome segregation. Importantly, these functions are distinct, with the former being dependent on their interaction with the FANCM orthologue Fml1 and the latter not. Together with Fml1, they play a second role in aiding chromosome segregation by processing sister chromatid junctions. However, a failure of this activity does not manifest dramatically increased levels of chromosome missegregation due to the Mus81-Eme1 endonuclease, which acts as a failsafe to resolve DNA junctions before the end of mitosis.

  6. Comparative cytogenetic analysis of chromosomal aberrations and premature centromere division in persons exposed to radionuclides

    International Nuclear Information System (INIS)

    Jovicic, D.; Rakic, B.; Vukov, T.; Pajic, J.; Milacic, S.; Kovacevic, R.; Stevanovic, M.; Drakulic, D.; Bukvic, N.

    2009-01-01

    The aim of the research was to determine the presence of correlation between the frequency of premature centromere division (PCD) and chromosomal aberrations (CA) in metaphases in persons professionally exposed to radionuclides. Biological dosimetry was performed by conventional cytogenetic technique. The presence of PCD was confirmed by Fluorescent in situ hybridization (FISH). The L1.84 probe (specific for centromeric region of chromosome 18) was used. The analysis included 50 subjects employed in the Clinical Center of Serbia (C) (the average age of 45.24 ± 1.18 and the average exposition time 17.96 ± 1.15) and 40 subjects in control group (K) (the average age of 44.40 ± 0.98 and the average years of employment 19.67 ± 0.98 years) which were not exposed to genotoxic agents in their workplaces. The results showed that frequencies of CA and PCD were statistically significantly higher in subjects exposed to radionuclides than in the control group (Mann-Whitney U test, P [sr

  7. Genetic and physical mapping of two centromere-proximal regions of chromosome IV in Aspergillus nidulans

    DEFF Research Database (Denmark)

    Aleksenko, Alexei Y.; Nielsen, Michael Lynge; Clutterbuck, A.J.

    2001-01-01

    revision of the genetic map of the chromosome, including the position of the centromere, Comparison of physical and genetic maps indicates that meiotic recombination is low in subcentromeric DNA, its frequency being reduced from 1 crossover per 0.8 Mb to approximately 1 crossover per 5 Mb per meiosis...

  8. Novel Centromeric Loci of the Wine and Beer Yeast Dekkera bruxellensis CEN1 and CEN2

    DEFF Research Database (Denmark)

    Ishchuk, Olena P.; Vojvoda Zeljko, Tanja; Schifferdecker, Anna J.

    2016-01-01

    The wine and beer yeast Dekkera bruxellensis thrives in environments that are harsh and limiting, especially in concentrations with low oxygen and high ethanol. Its different strains' chromosomes greatly vary in number (karyotype). This study isolates two novel centromeric loci (CEN1 and CEN2...

  9. A DNA sequence element that advances replication origin activation time in Saccharomyces cerevisiae.

    Science.gov (United States)

    Pohl, Thomas J; Kolor, Katherine; Fangman, Walton L; Brewer, Bonita J; Raghuraman, M K

    2013-11-06

    Eukaryotic origins of DNA replication undergo activation at various times in S-phase, allowing the genome to be duplicated in a temporally staggered fashion. In the budding yeast Saccharomyces cerevisiae, the activation times of individual origins are not intrinsic to those origins but are instead governed by surrounding sequences. Currently, there are two examples of DNA sequences that are known to advance origin activation time, centromeres and forkhead transcription factor binding sites. By combining deletion and linker scanning mutational analysis with two-dimensional gel electrophoresis to measure fork direction in the context of a two-origin plasmid, we have identified and characterized a 19- to 23-bp and a larger 584-bp DNA sequence that are capable of advancing origin activation time.

  10. Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin

    Czech Academy of Sciences Publication Activity Database

    Marques, A.; Ribeiro, T.; Neumann, Pavel; Macas, Jiří; Novák, Petr; Schubert, V.; Pellino, M.; Fuchs, J.; Ma, W.; Kuhlmann, M.; Brandt, R.; Vanzela, A.L.L.; Beseda, Tomáš; Šimková, Hana; Pedrosa-Harand, A.; Houben, A.

    2015-01-01

    Roč. 112, č. 44 (2015), s. 13633-13638 ISSN 0027-8424 R&D Projects: GA ČR GBP501/12/G090; GA MŠk(CZ) LO1204 Institutional support: RVO:60077344 ; RVO:61389030 Keywords : Centromere * satellite DNA * holokinetic * chromosome Subject RIV: EB - Genetics ; Molecular Biology; EB - Genetics ; Molecular Biology (UEB-Q) Impact factor: 9.423, year: 2015

  11. Autoantibodies directed to centromere protein F in a patient with BRCA1 gene mutation

    OpenAIRE

    Moghaddas, Fiona; Joshua, Fredrick; Taylor, Roberta; Fritzler, Marvin J.; Toh, Ban Hock

    2016-01-01

    Background Autoantibodies directed to centromere protein F were first reported in 1993 and their association with malignancy has been well documented. Case We present the case of a 48-year-old Caucasian female with a BRCA1 gene mutation associated with bilateral breast cancer. Antinuclear autoantibody immunofluorescence performed for workup of possible inflammatory arthropathy showed a high titre cell cycle related nuclear speckled pattern, with subsequent confirmation by addressable laser be...

  12. A Molecular-Cytogenetic Method for Locating Genes to Pericentromeric Regions Facilitates a Genome-Wide Comparison of Syntency Between the Centrometric Regions of Wheat and Rice

    Science.gov (United States)

    Centromeres, because of their repeat structure and lack of sequence conservation, are difficult to assemble and compare across organisms. It was recently discovered that rice centromeres often contain genes. This suggested a method for studying centromere homologies between wheat and rice chromosome...

  13. Long-read sequencing data analysis for yeasts.

    Science.gov (United States)

    Yue, Jia-Xing; Liti, Gianni

    2018-06-01

    Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ∼41 h to generate a complete and well-annotated genome from ∼100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.

  14. Detection of chromosomal aberrations by fluorescence in situ hybridization in the first three postirradiation divisions of human lymphocytes

    International Nuclear Information System (INIS)

    Boei, J.J.W.A.; Vermeulen, S.; Natarajan, A.T.

    1996-01-01

    Chromosomal aberrations in human lymphocytes were analyzed by fluorescence in situ hybridization (FISH) in the first 3 postirradiation (0 and 2 Gy) divisions. Cells were grown in the presence of BrdU, collected at different sampling times (47, 70 and 91 h) and analyzed using an alphoid centromeric probe and PCR amplified DNA libraries for chromosomes 2 and 8. Following differential staining of sister chromatids, the analyzed cells were identified to be either in the first, second or third mitosis after irradiation. The frequencies of both dicentrics and fragments showed a reduction of about 50% after each cell generation, whereas translocations were more persistent. Cells within the same postirradiation division showed higher aberration frequencies when derived from later sampling times, indicating a delay in progression of aberrant cells. As a result, the frequencies for dicentrics and fragments remained rather constant at different sampling times if the cell cycle parameter was not taken into account. Thus, the average generation time of the lymphocytes had a clear effect on the obtained aberration frequencies. The described method allows the study of the persistence of chromosome damage using the FISH technique during 3 subsequent cell divisions in vitro

  15. Karyotypic evolution and organization of the highly repetitive DNA sequences in the Japanese shrew-moles, Dymecodon pilirostris and Urotrichus talpoides.

    Science.gov (United States)

    Nakata, A; Yoshimura, A; Kuro-o, M; Obara, Y

    2005-01-01

    The karyological relationship and organization of highly repetitive DNA sequences in Japanese shrew-moles were studied by zoo-blot hybridization and fluorescence in situ hybridization (FISH). When the genomic DNA of the eastern race of Urotrichus talpoides was digested with PstI, three fragments of highly repetitive DNA sequences, approximately 0.7, 0.9, and 1.4 kb in length, were observed as distinct bands. The results of FISH in the eastern race of U. talpoides using these three fragments separately as probes showed that the 0.7-kb PstI fragment was distributed in the centromeric regions of most chromosomes, and that the 0.9- and 1.4-kb fragments were predominantly located in the C-heterochromatin region of chromosome 13p. Although the western race of U. talpoides also had three PstI fragments, 0.9- and 1.4-kb PstI fragments were more ambiguous than those of the eastern race. The PstI- digested genomic DNA in Dymecodonpilirostris produced only a faint 0.9-kb band, and its signal patterns obtained by zoo-blot hybridization were clearly different from those of U. talpoides. The 0.7-kb fragment of U. talpoides hybridized strongly with the 0.9-kb fragment of D. pilirostris. In a FISH analysis, the 0.9-kb fragment of D. pilirostris hybridized with highly repetitive DNA in the centromeric regions of most chromosomes from both D. pilirostris and U. talpoides. Zoo-blot hybridization and FISH analyses suggest that the 0.9- and 1.4-kb PstI fragments were generated specifically in the genome of U. talpoides after the common ancestor differentiated into two extant shrew-mole species. A difference in the length of the centromeric elements between U. talpoides and D. pilirostris might be observed due to certain modifications of the repeating unit.

  16. Genome sequence analysis of the model grass Brachypodium distachyon: insights into grass genome evolution

    Energy Technology Data Exchange (ETDEWEB)

    Schulman, Al

    2009-08-09

    Three subfamilies of grasses, the Erhardtoideae (rice), the Panicoideae (maize, sorghum, sugar cane and millet), and the Pooideae (wheat, barley and cool season forage grasses) provide the basis of human nutrition and are poised to become major sources of renewable energy. Here we describe the complete genome sequence of the wild grass Brachypodium distachyon (Brachypodium), the first member of the Pooideae subfamily to be completely sequenced. Comparison of the Brachypodium, rice and sorghum genomes reveals a precise sequence- based history of genome evolution across a broad diversity of the grass family and identifies nested insertions of whole chromosomes into centromeric regions as a predominant mechanism driving chromosome evolution in the grasses. The relatively compact genome of Brachypodium is maintained by a balance of retroelement replication and loss. The complete genome sequence of Brachypodium, coupled to its exceptional promise as a model system for grass research, will support the development of new energy and food crops

  17. Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

    Science.gov (United States)

    Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J

    2009-01-01

    Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials

  18. Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

    Directory of Open Access Journals (Sweden)

    Green Eric D

    2009-07-01

    Full Text Available Abstract Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB, are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN, euchromatic regions (EU, and an evolutionary breakpoint (EB that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s and endogenous retroviruses (ERVs and a depletion of short interspersed nucleotide elements (SINEs shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33, known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the

  19. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change

    Energy Technology Data Exchange (ETDEWEB)

    Hu, Tina T.; Pattyn, Pedro; Bakker, Erica G.; Cao, Jun; Cheng, Jan-Fang; Clark, Richard M.; Fahlgren, Noah; Fawcett, Jeffrey A.; Grimwood, Jane; Gundlach, Heidrun; Haberer, Georg; Hollister, Jesse D.; Ossowski, Stephan; Ottilar, Robert P.; Salamov, Asaf A.; Schneeberger, Korbinian; Spannagl, Manuel; Wang, Xi; Yang, Liang; Nasrallah, Mikhail E.; Bergelson, Joy; Carrington, James C.; Gaut, Brandon S.; Schmutz, Jeremy; Mayer, Klaus F. X.; Van de Peer, Yves; Grigoriev, Igor V.; Nordborg, Magnus; Weigel, Detlef; Guo, Ya-Long

    2011-04-29

    In our manuscript, we present a high-quality genome sequence of the Arabidopsis thaliana relative, Arabidopsis lyrata, produced by dideoxy sequencing. We have performed the usual types of genome analysis (gene annotation, dN/dS studies etc. etc.), but this is relegated to the Supporting Information. Instead, we focus on what was a major motivation for sequencing this genome, namely to understand how A. thaliana lost half its genome in a few million years and lived to tell the tale. The rather surprising conclusion is that there is not a single genomic feature that accounts for the reduced genome, but that every aspect centromeres, intergenic regions, transposable elements, gene family number is affected through hundreds of thousands of cuts. This strongly suggests that overall genome size in itself is what has been under selection, a suggestion that is strongly supported by our demonstration (using population genetics data from A. thaliana) that new deletions seem to be driven to fixation.

  20. Mutations in CENPE define a novel kinetochore-centromeric mechanism for microcephalic primordial dwarfism.

    Science.gov (United States)

    Mirzaa, Ghayda M; Vitre, Benjamin; Carpenter, Gillian; Abramowicz, Iga; Gleeson, Joseph G; Paciorkowski, Alex R; Cleveland, Don W; Dobyns, William B; O'Driscoll, Mark

    2014-08-01

    Defects in centrosome, centrosomal-associated and spindle-associated proteins are the most frequent cause of primary microcephaly (PM) and microcephalic primordial dwarfism (MPD) syndromes in humans. Mitotic progression and segregation defects, microtubule spindle abnormalities and impaired DNA damage-induced G2-M cell cycle checkpoint proficiency have been documented in cell lines from these patients. This suggests that impaired mitotic entry, progression and exit strongly contribute to PM and MPD. Considering the vast protein networks involved in coordinating this cell cycle stage, the list of potential target genes that could underlie novel developmental disorders is large. One such complex network, with a direct microtubule-mediated physical connection to the centrosome, is the kinetochore. This centromeric-associated structure nucleates microtubule attachments onto mitotic chromosomes. Here, we described novel compound heterozygous variants in CENPE in two siblings who exhibit a profound MPD associated with developmental delay, simplified gyri and other isolated abnormalities. CENPE encodes centromere-associated protein E (CENP-E), a core kinetochore component functioning to mediate chromosome congression initially of misaligned chromosomes and in subsequent spindle microtubule capture during mitosis. Firstly, we present a comprehensive clinical description of these patients. Then, using patient cells we document abnormalities in spindle microtubule organization, mitotic progression and segregation, before modeling the cellular pathogenicity of these variants in an independent cell system. Our cellular analysis shows that a pathogenic defect in CENP-E, a kinetochore-core protein, largely phenocopies PCNT-mutated microcephalic osteodysplastic primordial dwarfism-type II patient cells. PCNT encodes a centrosome-associated protein. These results highlight a common underlying pathomechanism. Our findings provide the first evidence for a kinetochore-based route to

  1. Validation of rice genome sequence by optical mapping

    Directory of Open Access Journals (Sweden)

    Pape Louise

    2007-08-01

    Full Text Available Abstract Background Rice feeds much of the world, and possesses the simplest genome analyzed to date within the grass family, making it an economically relevant model system for other cereal crops. Although the rice genome is sequenced, validation and gap closing efforts require purely independent means for accurate finishing of sequence build data. Results To facilitate ongoing sequencing finishing and validation efforts, we have constructed a whole-genome SwaI optical restriction map of the rice genome. The physical map consists of 14 contigs, covering 12 chromosomes, with a total genome size of 382.17 Mb; this value is about 11% smaller than original estimates. 9 of the 14 optical map contigs are without gaps, covering chromosomes 1, 2, 3, 4, 5, 7, 8 10, and 12 in their entirety – including centromeres and telomeres. Alignments between optical and in silico restriction maps constructed from IRGSP (International Rice Genome Sequencing Project and TIGR (The Institute for Genomic Research genome sequence sources are comprehensive and informative, evidenced by map coverage across virtually all published gaps, discovery of new ones, and characterization of sequence misassemblies; all totalling ~14 Mb. Furthermore, since optical maps are ordered restriction maps, identified discordances are pinpointed on a reliable physical scaffold providing an independent resource for closure of gaps and rectification of misassemblies. Conclusion Analysis of sequence and optical mapping data effectively validates genome sequence assemblies constructed from large, repeat-rich genomes. Given this conclusion we envision new applications of such single molecule analysis that will merge advantages offered by high-resolution optical maps with inexpensive, but short sequence reads generated by emerging sequencing platforms. Lastly, map construction techniques presented here points the way to new types of comparative genome analysis that would focus on discernment of

  2. The Release 6 reference sequence of the Drosophila melanogaster genome.

    Science.gov (United States)

    Hoskins, Roger A; Carlson, Joseph W; Wan, Kenneth H; Park, Soo; Mendez, Ivonne; Galle, Samuel E; Booth, Benjamin W; Pfeiffer, Barret D; George, Reed A; Svirskas, Robert; Krzywinski, Martin; Schein, Jacqueline; Accardo, Maria Carmela; Damia, Elisabetta; Messina, Giovanni; Méndez-Lago, María; de Pablos, Beatriz; Demakova, Olga V; Andreyeva, Evgeniya N; Boldyreva, Lidiya V; Marra, Marco; Carvalho, A Bernardo; Dimitri, Patrizio; Villasante, Alfredo; Zhimulev, Igor F; Rubin, Gerald M; Karpen, Gary H; Celniker, Susan E

    2015-03-01

    Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy and middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. Further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads. © 2015 Hoskins et al.; Published by Cold Spring Harbor Laboratory Press.

  3. Immunohistochemical Assessment of Expression of Centromere Protein—A (CENPA) in Human Invasive Breast Cancer

    International Nuclear Information System (INIS)

    Rajput, Ashish B.; Hu, Nianping; Varma, Sonal; Chen, Chien-Hung; Ding, Keyue; Park, Paul C.; Chapman, Judy-Anne W.; SenGupta, Sandip K.; Madarnas, Yolanda; Elliott, Bruce E.; Feilotter, Harriet E.

    2011-01-01

    Abnormal cell division leading to the gain or loss of entire chromosomes and consequent genetic instability is a hallmark of cancer. Centromere protein –A (CENPA) is a centromere-specific histone-H3-like variant gene involved in regulating chromosome segregation during cell division. CENPA is one of the genes included in some of the commercially available RNA based prognostic assays for breast cancer (BCa)—the 70 gene signature MammaPrint ® and the five gene Molecular Grade Index (MGI SM ). Our aim was to assess the immunohistochemical (IHC) expression of CENPA in normal and malignant breast tissue. Clinically annotated triplicate core tissue microarrays of 63 invasive BCa and 20 normal breast samples were stained with a monoclonal antibody against CENPA and scored for percentage of visibly stained nuclei. Survival analyses with Kaplan–Meier (KM) estimate and Cox proportional hazards regression models were applied to assess the associations between CENPA expression and disease free survival (DFS). Average percentage of nuclei visibly stained with CENPA antibody was significantly higher (p = 0.02) in BCa than normal tissue. The 3-year DFS in tumors over-expressing CENPA (>50% stained nuclei) was 79% compared to 85% in low expression tumors (<50% stained nuclei). On multivariate analysis, IHC expression of CENPA showed weak association with DFS (HR > 60.07; p = 0.06) within our small cohort. To the best of our knowledge, this is the first published report evaluating the implications of increased IHC expression of CENPA in paraffin embedded breast tissue samples. Our finding that increased CENPA expression may be associated with shorter DFS in BCa supports its exploration as a potential prognostic biomarker

  4. Immunohistochemical Assessment of Expression of Centromere Protein—A (CENPA) in Human Invasive Breast Cancer

    Energy Technology Data Exchange (ETDEWEB)

    Rajput, Ashish B. [Department of Pathology and Molecular Medicine, Queen' s University, Kingston, ON K7L 3N6 (Canada); Hu, Nianping [Cancer Research institute, Queen' s University, Kingston, ON K7L 3N6 (Canada); Varma, Sonal; Chen, Chien-Hung [Department of Pathology and Molecular Medicine, Queen' s University, Kingston, ON K7L 3N6 (Canada); Ding, Keyue [NCIC Clinical Trials Group, Queen' s University, Kingston, ON K7L 3N6 (Canada); Park, Paul C. [Department of Pathology and Molecular Medicine, Queen' s University, Kingston, ON K7L 3N6 (Canada); Chapman, Judy-Anne W. [NCIC Clinical Trials Group, Queen' s University, Kingston, ON K7L 3N6 (Canada); SenGupta, Sandip K. [Department of Pathology and Molecular Medicine, Queen' s University, Kingston, ON K7L 3N6 (Canada); Madarnas, Yolanda [Cancer Research institute, Queen' s University, Kingston, ON K7L 3N6 (Canada); Department of Oncology, Cancer Center of Southeastern Ontario, Kingston, ON K7L 2V7 (Canada); Elliott, Bruce E.; Feilotter, Harriet E., E-mail: feilotth@kgh.kari.net [Department of Pathology and Molecular Medicine, Queen' s University, Kingston, ON K7L 3N6 (Canada)

    2011-12-06

    Abnormal cell division leading to the gain or loss of entire chromosomes and consequent genetic instability is a hallmark of cancer. Centromere protein –A (CENPA) is a centromere-specific histone-H3-like variant gene involved in regulating chromosome segregation during cell division. CENPA is one of the genes included in some of the commercially available RNA based prognostic assays for breast cancer (BCa)—the 70 gene signature MammaPrint{sup ®} and the five gene Molecular Grade Index (MGI{sup SM}). Our aim was to assess the immunohistochemical (IHC) expression of CENPA in normal and malignant breast tissue. Clinically annotated triplicate core tissue microarrays of 63 invasive BCa and 20 normal breast samples were stained with a monoclonal antibody against CENPA and scored for percentage of visibly stained nuclei. Survival analyses with Kaplan–Meier (KM) estimate and Cox proportional hazards regression models were applied to assess the associations between CENPA expression and disease free survival (DFS). Average percentage of nuclei visibly stained with CENPA antibody was significantly higher (p = 0.02) in BCa than normal tissue. The 3-year DFS in tumors over-expressing CENPA (>50% stained nuclei) was 79% compared to 85% in low expression tumors (<50% stained nuclei). On multivariate analysis, IHC expression of CENPA showed weak association with DFS (HR > 60.07; p = 0.06) within our small cohort. To the best of our knowledge, this is the first published report evaluating the implications of increased IHC expression of CENPA in paraffin embedded breast tissue samples. Our finding that increased CENPA expression may be associated with shorter DFS in BCa supports its exploration as a potential prognostic biomarker.

  5. On-line sorting of human chromosomes by centromeric index, and identification of sorted populations by GTG-banding and fluorescent in situ hybridization

    NARCIS (Netherlands)

    Boschman, G. A.; Rens, W.; Manders, E.; van Oven, C.; Barendsen, G. W.; Aten, J. A.

    1990-01-01

    Using slit-scan flow cytometry, the shape of human metaphase chromosomes, as expressed in their centromeric index (CI), and the DNA content of the chromosomes have been used as parameters in bivariate flow karyotyping. The resolution of the DNA vs CI flow karyogram of the larger chromosomes up to

  6. The Cell Cycle Timing of Centromeric Chromatin Assembly in Drosophila Meiosis Is Distinct from Mitosis Yet Requires CAL1 and CENP-C

    Science.gov (United States)

    Gorgescu, Walter; Tang, Jonathan; Costes, Sylvain V.; Karpen, Gary H.

    2012-01-01

    CENP-A (CID in flies) is the histone H3 variant essential for centromere specification, kinetochore formation, and chromosome segregation during cell division. Recent studies have elucidated major cell cycle mechanisms and factors critical for CENP-A incorporation in mitosis, predominantly in cultured cells. However, we do not understand the roles, regulation, and cell cycle timing of CENP-A assembly in somatic tissues in multicellular organisms and in meiosis, the specialized cell division cycle that gives rise to haploid gametes. Here we investigate the timing and requirements for CID assembly in mitotic tissues and male and female meiosis in Drosophila melanogaster, using fixed and live imaging combined with genetic approaches. We find that CID assembly initiates at late telophase and continues during G1 phase in somatic tissues in the organism, later than the metaphase assembly observed in cultured cells. Furthermore, CID assembly occurs at two distinct cell cycle phases during male meiosis: prophase of meiosis I and after exit from meiosis II, in spermatids. CID assembly in prophase I is also conserved in female meiosis. Interestingly, we observe a novel decrease in CID levels after the end of meiosis I and before meiosis II, which correlates temporally with changes in kinetochore organization and orientation. We also demonstrate that CID is retained on mature sperm despite the gross chromatin remodeling that occurs during protamine exchange. Finally, we show that the centromere proteins CAL1 and CENP-C are both required for CID assembly in meiosis and normal progression through spermatogenesis. We conclude that the cell cycle timing of CID assembly in meiosis is different from mitosis and that the efficient propagation of CID through meiotic divisions and on sperm is likely to be important for centromere specification in the developing zygote. PMID:23300382

  7. ParABS Systems of the Four Replicons of Burkholderia cenocepacia: New Chromosome Centromeres Confer Partition Specificity†

    Science.gov (United States)

    Dubarry, Nelly; Pasta, Franck; Lane, David

    2006-01-01

    Most bacterial chromosomes carry an analogue of the parABS systems that govern plasmid partition, but their role in chromosome partition is ambiguous. parABS systems might be particularly important for orderly segregation of multipartite genomes, where their role may thus be easier to evaluate. We have characterized parABS systems in Burkholderia cenocepacia, whose genome comprises three chromosomes and one low-copy-number plasmid. A single parAB locus and a set of ParB-binding (parS) centromere sites are located near the origin of each replicon. ParA and ParB of the longest chromosome are phylogenetically similar to analogues in other multichromosome and monochromosome bacteria but are distinct from those of smaller chromosomes. The latter form subgroups that correspond to the taxa of their hosts, indicating evolution from plasmids. The parS sites on the smaller chromosomes and the plasmid are similar to the “universal” parS of the main chromosome but with a sequence specific to their replicon. In an Escherichia coli plasmid stabilization test, each parAB exhibits partition activity only with the parS of its own replicon. Hence, parABS function is based on the independent partition of individual chromosomes rather than on a single communal system or network of interacting systems. Stabilization by the smaller chromosome and plasmid systems was enhanced by mutation of parS sites and a promoter internal to their parAB operons, suggesting autoregulatory mechanisms. The small chromosome ParBs were found to silence transcription, a property relevant to autoregulation. PMID:16452432

  8. Chromosomal structures and repetitive sequences divergence in Cucumis species revealed by comparative cytogenetic mapping.

    Science.gov (United States)

    Zhang, Yunxia; Cheng, Chunyan; Li, Ji; Yang, Shuqiong; Wang, Yunzhu; Li, Ziang; Chen, Jinfeng; Lou, Qunfeng

    2015-09-25

    Differentiation and copy number of repetitive sequences affect directly chromosome structure which contributes to reproductive isolation and speciation. Comparative cytogenetic mapping has been verified an efficient tool to elucidate the differentiation and distribution of repetitive sequences in genome. In present study, the distinct chromosomal structures of five Cucumis species were revealed through genomic in situ hybridization (GISH) technique and comparative cytogenetic mapping of major satellite repeats. Chromosome structures of five Cucumis species were investigated using GISH and comparative mapping of specific satellites. Southern hybridization was employed to study the proliferation of satellites, whose structural characteristics were helpful for analyzing chromosome evolution. Preferential distribution of repetitive DNAs at the subtelomeric regions was found in C. sativus, C hystrix and C. metuliferus, while majority was positioned at the pericentromeric heterochromatin regions in C. melo and C. anguria. Further, comparative GISH (cGISH) through using genomic DNA of other species as probes revealed high homology of repeats between C. sativus and C. hystrix. Specific satellites including 45S rDNA, Type I/II, Type III, Type IV, CentM and telomeric repeat were then comparatively mapped in these species. Type I/II and Type IV produced bright signals at the subtelomeric regions of C. sativus and C. hystrix simultaneously, which might explain the significance of their amplification in the divergence of Cucumis subgenus from the ancient ancestor. Unique positioning of Type III and CentM only at the centromeric domains of C. sativus and C. melo, respectively, combining with unique southern bands, revealed rapid evolutionary patterns of centromeric DNA in Cucumis. Obvious interstitial telomeric repeats were observed in chromosomes 1 and 2 of C. sativus, which might provide evidence of the fusion hypothesis of chromosome evolution from x = 12 to x = 7 in

  9. Identification of two new repetitive elements and chromosomal mapping of repetitive DNA sequences in the fish Gymnothorax unicolor (Anguilliformes: Muraenidae

    Directory of Open Access Journals (Sweden)

    E. Coluccia

    2011-05-01

    Full Text Available Muraenidae is a species-rich family, with relationships among genera and species and taxonomy that have not been completely clarified. Few cytogenetic studies have been conducted on this family, and all of them showed the same diploid chromosome number (2n=42 but with conspicuous karyotypic variation among species. The Mediterranean moray eel Gymnothorax unicolor was previously cytogenetically studied using classical techniques that allowed the characterization of its karyotype structure and the constitutive heterochromatin and argyrophilic nucleolar organizer regions (Ag-NORs distribution pattern. In the present study, we describe two new repetitive elements (called GuMboI and GuDdeI obtained from restricted genomic DNA of G. unicolor that were characterized by Southern blot and physically localized by in situ hybridization on metaphase chromosomes. As they are highly repetitive DNA sequences, they map in heterochromatic regions. However, while GuDdeI was localized in the centromeric regions, the GuMboI fraction was distributed on some centromeres and was co-localized with the nucleolus organizer region (NOR. Comparative analysis with other Mediterranean species such as Muraena helena pointed out that these DNA fractions are species-specific and could potentially be used for species discrimination. As a new contribution to the karyotype of this species, we found that the major ribosomal genes are localized on acrocentric chromosome 9 and that the telomeres of each chromosome are composed of a tandem repeat derived from a poly-TTAGGG DNA sequence, as it occurs in most vertebrate species. The results obtained add new information useful in comparative genomics at the chromosomal level and contribute to the cytogenetic knowledge regarding this fish family, which has not been extensively studied.

  10. Lack of independent prognostic and predictive value of centromere 17 copy number changes in breast cancer patients with known HER2 and TOP2A status

    DEFF Research Database (Denmark)

    Nielsen, Kirsten Vang; Ejlertsen, Bent; Møller, Susanne

    2011-01-01

    hybridization (FISH) with centromere 17 (CEN-17) and TOP2A was performed on 120 normal breast specimens. The diploid CEN-17 copy number was reduced from the expected two signals in whole nuclei to an average of 1.68 signals per nucleus in cut sections of normal breast. Ploidy levels determined in normal breast...

  11. A coordinated interdependent protein circuitry stabilizes the kinetochore ensemble to protect CENP-A in the human pathogenic yeast Candida albicans.

    Directory of Open Access Journals (Sweden)

    Jitendra Thakur

    Full Text Available Unlike most eukaryotes, a kinetochore is fully assembled early in the cell cycle in budding yeasts Saccharomyces cerevisiae and Candida albicans. These kinetochores are clustered together throughout the cell cycle. Kinetochore assembly on point centromeres of S. cerevisiae is considered to be a step-wise process that initiates with binding of inner kinetochore proteins on specific centromere DNA sequence motifs. In contrast, kinetochore formation in C. albicans, that carries regional centromeres of 3-5 kb long, has been shown to be a sequence independent but an epigenetically regulated event. In this study, we investigated the process of kinetochore assembly/disassembly in C. albicans. Localization dependence of various kinetochore proteins studied by confocal microscopy and chromatin immunoprecipitation (ChIP assays revealed that assembly of a kinetochore is a highly coordinated and interdependent event. Partial depletion of an essential kinetochore protein affects integrity of the kinetochore cluster. Further protein depletion results in complete collapse of the kinetochore architecture. In addition, GFP-tagged kinetochore proteins confirmed similar time-dependent disintegration upon gradual depletion of an outer kinetochore protein (Dam1. The loss of integrity of a kinetochore formed on centromeric chromatin was demonstrated by reduced binding of CENP-A and CENP-C at the centromeres. Most strikingly, Western blot analysis revealed that gradual depletion of any of these essential kinetochore proteins results in concomitant reduction in cellular protein levels of CENP-A. We further demonstrated that centromere bound CENP-A is protected from the proteosomal mediated degradation. Based on these results, we propose that a coordinated interdependent circuitry of several evolutionarily conserved essential kinetochore proteins ensures integrity of a kinetochore formed on the foundation of CENP-A containing centromeric chromatin.

  12. Dicentric chromosome aberration analysis using giemsa and centromere specific fluorescence in-situ hybridization for biological dosimetry: An inter- and intra-laboratory comparison in Indian laboratories

    International Nuclear Information System (INIS)

    Bhavani, M.; Tamizh Selvan, G.; Kaur, Harpreet; Adhikari, J.S.; Vijayalakshmi, J.; Venkatachalam, P.; Chaudhury, N.K.

    2014-01-01

    To facilitate efficient handling of large samples, an attempt towards networking of laboratories in India for biological dosimetry was carried out. Human peripheral blood samples were exposed to 60 Co γ-radiation for ten different doses (0–5 Gy) at a dose rate of 0.7 and 2 Gy/min. The chromosomal aberrations (CA) were scored in Giemsa-stained and fluorescence in-situ hybridization with centromere-specific probes. No significant difference (p>0.05) was observed in the CA yield for given doses except 4 and 5 Gy, between the laboratories, among the scorers and also staining methods adapted suggest the reliability and validates the inter-lab comparisons exercise for triage applications. - Highlights: • This is the first report from India on Networking for Biological Dosimetry preparedness using dicentric chromosomal (DC) aberration assay. • There is no significant difference in the in vitro dose response curve (Slope, Intercept, Curvature) constructed among the two labs. • No significant variation in the scoring of DC aberrations between the scorers irrespective of labs. • The DC results obtained by the labs from the Giemsa stained metaphase preparations were confirmed with centromere specific-FISH for further reliability and validity

  13. Kaustuv Sanyal

    Indian Academy of Sciences (India)

    Molecular evolution of the process of chromosome segregation: Lessons from fungal ... DNA sequence is rapidly changing across all forms of eukaryotic life. Centromeres in many budding yeast species are specified by the DNA sequence.

  14. Defining the Sequence Elements and Candidate Genes for the Coloboma Mutation.

    Directory of Open Access Journals (Sweden)

    Elizabeth A. Robb

    Full Text Available The chicken coloboma mutation exhibits features similar to human congenital developmental malformations such as ocular coloboma, cleft-palate, dwarfism, and polydactyly. The coloboma-associated region and encoded genes were investigated using advanced genomic, genetic, and gene expression technologies. Initially, the mutation was linked to a 990 kb region encoding 11 genes; the application of the genetic and genomic tools led to a reduction of the linked region to 176 kb and the elimination of 7 genes. Furthermore, bioinformatics analyses of capture array-next generation sequence data identified genetic elements including SNPs, insertions, deletions, gaps, chromosomal rearrangements, and miRNA binding sites within the introgressed causative region relative to the reference genome sequence. Coloboma-specific variants within exons, UTRs, and splice sites were studied for their contribution to the mutant phenotype. Our compiled results suggest three genes for future studies. The three candidate genes, SLC30A5 (a zinc transporter, CENPH (a centromere protein, and CDK7 (a cyclin-dependent kinase, are differentially expressed (compared to normal embryos at stages and in tissues affected by the coloboma mutation. Of these genes, two (SLC30A5 and CENPH are considered high-priority candidate based upon studies in other vertebrate model systems.

  15. Sequence Ready Characterization of the Pericentromeric Region of 19p12

    Energy Technology Data Exchange (ETDEWEB)

    Evan E. Eichler

    2006-08-31

    Current mapping and sequencing strategies have been inadequate within the proximal portion of 19p12 due, in part, to the presence of a recently expanded ZNF (zinc-finger) gene family and the presence of large (25-50 kb) inverted beta-satellite repeat structures which bracket this tandemly duplicated gene family. The virtual of absence of classically defined “unique” sequence within the region has hampered efforts to identify and characterize a suitable minimal tiling path of clones which can be used as templates required for finished sequencing of the region. The goal of this proposal is to develop and implement a novel sequence-anchor strategy to generate a contiguous BAC map of the most proximal portion of chromosome 19p12 for the purpose of complete sequence characterization. The target region will be an estimated 4.5 Mb of DNA extending from STS marker D19S450 (the beginning of the ZNF gene cluster) to the centromeric (alpha-satellite) junction of 19p11. The approach will entail 1) pre-selection of 19p12 BAC and cosmid clones (NIH approved library) utilizing both 19p12 -unique and 19p12-SPECIFIC repeat probes (Eichler et al., 1998); 2) the generation of a BAC/cosmid end-sequence map across the region with a density of one marker every 8kb; 3) the development of a second-generation of STS (sequence tagged sites) which will be used to identify and verify clonal overlap at the level of the sequence; 4) incorporation of these sequence-anchored overlapping clones into existing cosmid/BAC restriction maps developed at Livermore National Laboratory; and 5) validation of the organization of this region utilizing high-resolution FISH techniques (extended chromatin analysis) on monochromosomal 19 somatic cell hybrids and parental cell lines of source material. The data generated will be used in the selection of the most parsimonious tiling path of BAC clones to be sequenced as part of the JGI effort on chromosome 19 and should serve as a model for the sequence

  16. Transcriptional Response of Human Neurospheres to Helper-Dependent CAV-2 Vectors Involves the Modulation of DNA Damage Response, Microtubule and Centromere Gene Groups.

    Directory of Open Access Journals (Sweden)

    Stefania Piersanti

    Full Text Available Brain gene transfer using viral vectors will likely become a therapeutic option for several disorders. Helper-dependent (HD canine adenovirus type 2 vectors (CAV-2 are well suited for this goal. These vectors are poorly immunogenic, efficiently transduce neurons, are retrogradely transported to afferent structures in the brain and lead to long-term transgene expression. CAV-2 vectors are being exploited to unravel behavior, cognition, neural networks, axonal transport and therapy for orphan diseases. With the goal of better understanding and characterizing HD-CAV-2 for brain therapy, we analyzed the transcriptomic modulation induced by HD-CAV-2 in human differentiated neurospheres derived from midbrain progenitors. This 3D model system mimics several aspects of the dynamic nature of human brain. We found that differentiated neurospheres are readily transduced by HD-CAV-2 and that transduction generates two main transcriptional responses: a DNA damage response and alteration of centromeric and microtubule probes. Future investigations on the biochemistry of processes highlighted by probe modulations will help defining the implication of HD-CAV-2 and CAR receptor binding in enchaining these functional pathways. We suggest here that the modulation of DNA damage genes is related to viral DNA, while the alteration of centromeric and microtubule probes is possibly enchained by the interaction of the HD-CAV-2 fibre with CAR.

  17. Two structurally distinct inhibitors of glycogen synthase kinase 3 induced centromere positive micronuclei in human lymphoblastoid TK6 cells.

    Science.gov (United States)

    Mishima, Masayuki; Tanaka, Kenji; Takeiri, Akira; Harada, Asako; Kubo, Chiyomi; Sone, Sachiko; Nishimura, Yoshikazu; Tachibana, Yukako; Okazaki, Makoto

    2008-08-25

    Glycogen synthase kinase 3 (GSK3) is an attractive novel pharmacological target. Inhibition of GSK3 is recently regarded as one of the viable approaches to therapy for Alzheimer's disease, cancer, diabetes mellitus, osteoporosis, and bipolar mood disorder. Here, we have investigated the aneugenic potential of two potent and highly specific inhibitors of GSK3 by using an in vitro micronucleus test with human lymphoblastoid TK6 cells. One inhibitor was a newly synthesized maleimide derivative and the other was a previously known aminopyrimidine derivative. Both compounds elicited statistically significant and concentration-dependent increases in micronucleated cells. One hundred micronuclei (MN) of each were analyzed using centromeric DNA staining with fluorescence in situ hybridization. Both the two structurally distinct compounds induced centromere-positive micronuclei (CMN). Calculated from the frequency of MN cells and the percentage of CMN, CMN cell incidence after treatment with the maleimide compound at 1.2 microM, 2.4 microM, and 4.8 microM was 11.6, 27.7, and 56.3 per 1000 cells, respectively; the negative control was 4.5. CMN cell incidence after the treatment with the aminopyrimidine compound at 1.8 microM, 3.6 microM, and 5.4 microM was 6.7, 9.8 and 17.2 per 1000 cells, respectively. Both compounds exhibited concentration-dependent increase in the number of mitotic cells. The frequency of CMN cells correlated well with mitotic cell incidence after treatment with either compound. Furthermore, both inhibitors induced abnormal mitotic cells with asymmetric mitotic spindles and lagging anaphase chromosomes. These results lend further support to the hypothesis that the inhibition of GSK3 activity affects microtubule function and exhibits an aneugenic mode of action.

  18. Chromosome mapping of repetitive sequences in four Serrasalmidae species (Characiformes

    Directory of Open Access Journals (Sweden)

    Leila Braga Ribeiro

    2014-01-01

    Full Text Available The Serrasalmidae family is composed of a number of commercially interesting species, mainly in the Amazon region where most of these fishes occur. In the present study, we investigated the genomic organization of the 18S and 5S rDNA and telomeric sequences in mitotic chromosomes of four species from the basal clade of the Serrasalmidae family: Colossoma macropomum, Mylossoma aureum, M. duriventre, and Piaractus mesopotamicus, in order to understand the chromosomal evolution in the family. All the species studied had diploid numbers 2n = 54 and exclusively biarmed chromosomes, but variations of the karyotypic formulas were observed. C-banding resulted in similar patterns among the analyzed species, with heterochromatic blocks mainly present in centromeric regions. The 18S rDNA mapping of C. macropomum and P. mesopotamicus revealed multiple sites of this gene; 5S rDNA sites were detected in two chromosome pairs in all species, although not all of them were homeologs. Hybridization with a telomeric probe revealed signals in the terminal portions of chromosomes in all the species and an interstitial signal was observed in one pair of C. macropomum.

  19. The N-terminus of survivin is a mitochondrial-targeting sequence and Src regulator

    Science.gov (United States)

    Dunajová, Lucia; Cash, Emily; Markus, Robert; Rochette, Sophie; Townley, Amelia R.

    2016-01-01

    ABSTRACT Survivin (also known as BIRC5) is a cancer-associated protein that exists in several locations in the cell. Its cytoplasmic residence in interphase cells is governed by CRM1 (also known as XPO1)-mediated nuclear exportation, and its localisation during mitosis to the centromeres and midzone microtubules is that of a canonical chromosomal passenger protein. In addition to these well-established locations, survivin is also a mitochondrial protein, but how it gets there and its function therein is presently unclear. Here, we show that the first ten amino acids at the N-terminus of survivin are sufficient to target GFP to the mitochondria in vivo, and ectopic expression of this decapeptide decreases cell adhesion and accelerates proliferation. The data support a signalling mechanism in which this decapeptide regulates the tyrosine kinase Src, leading to reduced focal adhesion plaques and disruption of F-actin organisation. This strongly suggests that the N-terminus of survivin is a mitochondrial-targeting sequence that regulates Src, and that survivin acts in concert with Src to promote tumorigenesis. PMID:27246243

  20. Transcription of tandemly repetitive DNA: functional roles.

    Science.gov (United States)

    Biscotti, Maria Assunta; Canapa, Adriana; Forconi, Mariko; Olmo, Ettore; Barucca, Marco

    2015-09-01

    A considerable fraction of the eukaryotic genome is made up of satellite DNA constituted of tandemly repeated sequences. These elements are mainly located at centromeres, pericentromeres, and telomeres and are major components of constitutive heterochromatin. Although originally satellite DNA was thought silent and inert, an increasing number of studies are providing evidence on its transcriptional activity supporting, on the contrary, an unexpected dynamicity. This review summarizes the multiple structural roles of satellite noncoding RNAs at chromosome level. Indeed, satellite noncoding RNAs play a role in the establishment of a heterochromatic state at centromere and telomere. These highly condensed structures are indispensable to preserve chromosome integrity and genome stability, preventing recombination events, and ensuring the correct chromosome pairing and segregation. Moreover, these RNA molecules seem to be involved also in maintaining centromere identity and in elongation, capping, and replication of telomere. Finally, the abnormal variation of centromeric and pericentromeric DNA transcription across major eukaryotic lineages in stress condition and disease has evidenced the critical role that these transcripts may play and the potentially dire consequences for the organism.

  1. Isolation and Characterization of the Etheostoma tallapoosae (Teleostei: Percidae CENP-A Gene

    Directory of Open Access Journals (Sweden)

    Leos G. Kral

    2011-10-01

    Full Text Available Both centromeric alpha-satellite sequences as well as centromeric protein A (CENP-A are highly variable in eukaryotes. CENP-A, a histone H3 variant, is thought to act as the epigenetic “mark” for assembly of centromeric proteins. While most of the histone fold domain (HFD of the CENP-A is fairly well conserved, a portion of this HFD as well as the N-terminal tail show adaptive variation in both plants and animals. Such variation may establish reproductive barriers that may lead to speciation. The family Percidae contains over 200 species most of which are within the subfamily Etheostomatinae. This subfamily represents a species rich radiation of freshwater fishes in North America and these species exhibit both allopatric and sympatric distributions. In order to study the evolution of CENP-A in percid fish species, we have isolated and characterized the CENP-A gene from Etheostoma tallapoosae by PCR based gene walking. As a result of this study we have demonstrated that the Tallapoosa darter CENP-A gene HFD sequences can be isolated from genomic DNA by nested PCR in a manner that does not lead to the amplification of the highly sequence related histone H3 gene. We also demonstrated that PCR based walking can be subsequently used to isolate the rest of the CENP-A gene and adjacent gene sequences. These adjacent gene sequences provide us with a primer binding sites for PCR isolation of the CENP-A gene from other percid species of fishes. An initial comparison of three percid species shows that the N-terminal tail of the percid CENP-A gene shows adaptive evolution.

  2. Adaptive Evolution of CENP-A in Percid Fishes

    Directory of Open Access Journals (Sweden)

    Harriet N. A. Abbey

    2015-07-01

    Full Text Available Centromeric protein A (CENP-A is the epigenetic determinant of centromeres. This protein has been shown to be adaptively evolving in a number of animal and plant species. In a previous communication we were able to demonstrate that signs of adaptive evolution were detected in the comparison of CENP-A sequences from three percid fish species. In this study we isolated the CENP-A gene from eight additional species from the Percidae family. With these sequences and those previously obtained, we carried out a more robust statistical analysis of codon specific positive selection in CENP-A coding sequences of eleven percid species. We were able to demonstrate that at least two amino acid positions within the N-terminal tail are under strong positive selection and that one of these positions is potentially a substrate for phosphorylation. While nonsynonymous substitutions were detected in the histone fold domain, these were not statistically supported as resulting from positive selection.

  3. A third-generation microsatellite-based linkage map of the honey bee, Apis mellifera, and its comparison with the sequence-based physical map.

    Science.gov (United States)

    Solignac, Michel; Mougel, Florence; Vautrin, Dominique; Monnerot, Monique; Cornuet, Jean-Marie

    2007-01-01

    The honey bee is a key model for social behavior and this feature led to the selection of the species for genome sequencing. A genetic map is a necessary companion to the sequence. In addition, because there was originally no physical map for the honey bee genome project, a meiotic map was the only resource for organizing the sequence assembly on the chromosomes. We present the genetic (meiotic) map here and describe the main features that emerged from comparison with the sequence-based physical map. The genetic map of the honey bee is saturated and the chromosomes are oriented from the centromeric to the telomeric regions. The map is based on 2,008 markers and is about 40 Morgans (M) long, resulting in a marker density of one every 2.05 centiMorgans (cM). For the 186 megabases (Mb) of the genome mapped and assembled, this corresponds to a very high average recombination rate of 22.04 cM/Mb. Honey bee meiosis shows a relatively homogeneous recombination rate along and across chromosomes, as well as within and between individuals. Interference is higher than inferred from the Kosambi function of distance. In addition, numerous recombination hotspots are dispersed over the genome. The very large genetic length of the honey bee genome, its small physical size and an almost complete genome sequence with a relatively low number of genes suggest a very promising future for association mapping in the honey bee, particularly as the existence of haploid males allows easy bulk segregant analysis.

  4. Condensin HEAT subunits required for DNA repair, kinetochore/centromere function and ploidy maintenance in fission yeast.

    Directory of Open Access Journals (Sweden)

    Xingya Xu

    Full Text Available Condensin, a central player in eukaryotic chromosomal dynamics, contains five evolutionarily-conserved subunits. Two SMC (structural maintenance of chromosomes subunits contain ATPase, hinge, and coiled-coil domains. One non-SMC subunit is similar to bacterial kleisin, and two other non-SMC subunits contain HEAT (similar to armadillo repeats. Here we report isolation and characterization of 21 fission yeast (Schizosaccharomyces pombe mutants for three non-SMC subunits, created using error-prone mutagenesis that resulted in single-amino acid substitutions. Beside condensation, segregation, and DNA repair defects, similar to those observed in previously isolated SMC and cnd2 mutants, novel phenotypes were observed for mutants of HEAT-repeats containing Cnd1 and Cnd3 subunits. cnd3-L269P is hypersensitive to the microtubule poison, thiabendazole, revealing defects in kinetochore/centromere and spindle assembly checkpoints. Three cnd1 and three cnd3 mutants increased cell size and doubled DNA content, thereby eliminating the haploid state. Five of these mutations reside in helix B of HEAT repeats. Two non-SMC condensin subunits, Cnd1 and Cnd3, are thus implicated in ploidy maintenance.

  5. Are ribosomal DNA clusters rearrangement hotspots? A case study in the genus Mus (Rodentia, Muridae

    Directory of Open Access Journals (Sweden)

    Douzery Emmanuel JP

    2011-05-01

    Full Text Available Abstract Background Recent advances in comparative genomics have considerably improved our knowledge of the evolution of mammalian karyotype architecture. One of the breakthroughs was the preferential localization of evolutionary breakpoints in regions enriched in repetitive sequences (segmental duplications, telomeres and centromeres. In this context, we investigated the contribution of ribosomal genes to genome reshuffling since they are generally located in pericentromeric or subtelomeric regions, and form repeat clusters on different chromosomes. The target model was the genus Mus which exhibits a high rate of karyotypic change, a large fraction of which involves centromeres. Results The chromosomal distribution of rDNA clusters was determined by in situ hybridization of mouse probes in 19 species. Using a molecular-based reference tree, the phylogenetic distribution of clusters within the genus was reconstructed, and the temporal association between rDNA clusters, breakpoints and centromeres was tested by maximum likelihood analyses. Our results highlighted the following features of rDNA cluster dynamics in the genus Mus: i rDNA clusters showed extensive diversity in number between species and an almost exclusive pericentromeric location, ii a strong association between rDNA sites and centromeres was retrieved which may be related to their shared constraint of concerted evolution, iii 24% of the observed breakpoints mapped near an rDNA cluster, and iv a substantial rate of rDNA cluster change (insertion, deletion also occurred in the absence of chromosomal rearrangements. Conclusions This study on the dynamics of rDNA clusters within the genus Mus has revealed a strong evolutionary relationship between rDNA clusters and centromeres. Both of these genomic structures coincide with breakpoints in the genus Mus, suggesting that the accumulation of a large number of repeats in the centromeric region may contribute to the high level of chromosome

  6. Identification of a 450-bp region of human papillomavirus type 1 that promotes episomal replication in Saccharomyces cerevisiae

    International Nuclear Information System (INIS)

    Chattopadhyay, Anasuya; Schmidt, Martin C.; Khan, Saleem A.

    2005-01-01

    Human papillomaviruses (HPVs) replicate as nuclear plasmids in infected cells. Since the DNA replication machinery is generally conserved between humans and Saccharomyces cerevisiae, we studied whether HPV-1 DNA can replicate in yeast. Plasmids containing a selectable marker (with or without a yeast centromere) and either the full-length HPV-1 genome or various regions of the viral long control region (LCR) and the 3' end of the L1 gene were introduced into S. cerevisiae and their ability to replicate episomally was investigated. Our results show that HPV-1 sequences promote episomal replication of plasmids although the yeast centromere is required for plasmid retention. We have mapped the autonomously replicating sequence activity of HPV-1 DNA to a 450 base-pair sequence (HPV-1 nt 6783-7232) that includes 293 nucleotides from the 5' region of the viral LCR and 157 nucleotides from the 3' end of the L1 gene. The HPV-1 ARS does not include the binding sites for the viral E1 and E2 proteins, and these proteins are dispensable for replication in S. cerevisiae

  7. Killer Immunoglobulin-Like Receptor Allele Determination Using Next-Generation Sequencing Technology

    Directory of Open Access Journals (Sweden)

    Bercelin Maniangou

    2017-05-01

    Full Text Available The impact of natural killer (NK cell alloreactivity on hematopoietic stem cell transplantation (HSCT outcome is still debated due to the complexity of graft parameters, HLA class I environment, the nature of killer cell immunoglobulin-like receptor (KIR/KIR ligand genetic combinations studied, and KIR+ NK cell repertoire size. KIR genes are known to be polymorphic in terms of gene content, copy number variation, and number of alleles. These allelic polymorphisms may impact both the phenotype and function of KIR+ NK cells. We, therefore, speculate that polymorphisms may alter donor KIR+ NK cell phenotype/function thus modulating post-HSCT KIR+ NK cell alloreactivity. To investigate KIR allele polymorphisms of all KIR genes, we developed a next-generation sequencing (NGS technology on a MiSeq platform. To ensure the reliability and specificity of our method, genomic DNA from well-characterized cell lines were used; high-resolution KIR typing results obtained were then compared to those previously reported. Two different bioinformatic pipelines were used allowing the attribution of sequencing reads to specific KIR genes and the assignment of KIR alleles for each KIR gene. Our results demonstrated successful long-range KIR gene amplifications of all reference samples using intergenic KIR primers. The alignment of reads to the human genome reference (hg19 using BiRD pipeline or visualization of data using Profiler software demonstrated that all KIR genes were completely sequenced with a sufficient read depth (mean 317× for all loci and a high percentage of mapping (mean 93% for all loci. Comparison of high-resolution KIR typing obtained to those published data using exome capture resulted in a reported concordance rate of 95% for centromeric and telomeric KIR genes. Overall, our results suggest that NGS can be used to investigate the broad KIR allelic polymorphism. Hence, these data improve our knowledge, not only on KIR+ NK cell alloreactivity in

  8. Function of Junk: Pericentromeric Satellite DNA in Chromosome Maintenance.

    Science.gov (United States)

    Jagannathan, Madhav; Yamashita, Yukiko M

    2018-04-02

    Satellite DNAs are simple tandem repeats that exist at centromeric and pericentromeric regions on eukaryotic chromosomes. Unlike the centromeric satellite DNA that comprises the vast majority of natural centromeres, function(s) for the much more abundant pericentromeric satellite repeats are poorly understood. In fact, the lack of coding potential allied with rapid divergence of repeat sequences across eukaryotes has led to their dismissal as "junk DNA" or "selfish parasites." Although implicated in various biological processes, a conserved function for pericentromeric satellite DNA remains unidentified. We have addressed the role of satellite DNA through studying chromocenters, a cytological aggregation of pericentromeric satellite DNA from multiple chromosomes into DNA-dense nuclear foci. We have shown that multivalent satellite DNA-binding proteins cross-link pericentromeric satellite DNA on chromosomes into chromocenters. Disruption of chromocenters results in the formation of micronuclei, which arise by budding off the nucleus during interphase. We propose a model that satellite DNAs are critical chromosome elements that are recognized by satellite DNA-binding proteins and incorporated into chromocenters. We suggest that chromocenters function to preserve the entire chromosomal complement in a single nucleus, a fundamental and unquestioned feature of eukaryotic genomes. We speculate that the rapid divergence of satellite DNA sequences between closely related species results in discordant chromocenter function and may underlie speciation and hybrid incompatibility. © 2017 Jagannathan and Yamashita; Published by Cold Spring Harbor Laboratory Press.

  9. Breakpoint of an inversion of chromosome 14 in a T-cell leukemia: sequences downstream of the immunoglobulin heavy chain locus are implicated in tumorigenesis

    International Nuclear Information System (INIS)

    Baer, R.; Heppell, A.; Taylor, A.M.R.; Rabbitts, P.H.; Boullier, B.; Rabbitts, T.H.

    1987-01-01

    T-cell tumors are characterized by inversions or translocations of chromosome 14. The breakpoints of these karyotypic abnormalities occur in chromosome bands 14q11 and 14q32 - the same bands in which the T-cell receptor (TCR) α-chain and immunoglobulin heavy chain genes have been mapped, respectively. Patients with ataxia-telangiectasia are particularly prone to development of T-cell chronic lymphocytic leukemia with such chromosomal abnormalities. The authors describe DNA rearrangements of the TCR α-chain gene in an ataxia-telangiectasia-associated leukemia containing both a normal and an inverted chromosome 14. The normal chromosome 14 has undergone a productive join of TCR α-chain variable (V/sub α/) and joining (J/sub α/) gene segments. The other allele of the TCR α-chain gene features a DNA rearrangement, about 50 kilobases from the TCR α-chain constant (C/sub α/) gene, that represents the breakpoint of the chromosome 14 inversion; this breakpoint is comprised of a TCR J/sub α/) segment (from 14q11) fused to sequences derived from 14q32 but on the centromeric side of C/sub μ/. These results imply that 14q32 sequences located at an undetermined distance downstream of immunoglobulin C/sub μ/ locus can contribute to the development of T-cell tumors

  10. A Tandemly Arranged Pattern of Two 5S rDNA Arrays in Amolops mantzorum (Anura, Ranidae).

    Science.gov (United States)

    Liu, Ting; Song, Menghuan; Xia, Yun; Zeng, Xiaomao

    2017-01-01

    In an attempt to extend the knowledge of the 5S rDNA organization in anurans, the 5S rDNA sequences of Amolops mantzorum were isolated, characterized, and mapped by FISH. Two forms of 5S rDNA, type I (209 bp) and type II (about 870 bp), were found in specimens investigated from various populations. Both of them contained a 118-bp coding sequence, readily differentiated by their non-transcribed spacer (NTS) sizes and compositions. Four probes (the 5S rDNA coding sequences, the type I NTS, the type II NTS, and the entire type II 5S rDNA sequences) were respectively labeled with TAMRA or digoxigenin to hybridize with mitotic chromosomes for samples of all localities. It turned out that all probes showed the same signals that appeared in every centromeric region and in the telomeric regions of chromosome 5, without differences within or between populations. Obviously, both type I and type II of the 5S rDNA arrays arranged in tandem, which was contrasting with other frogs or fishes recorded to date. More interestingly, all the probes detected centromeric regions in all karyotypes, suggesting the presence of a satellite DNA family derived from 5S rDNA. © 2017 S. Karger AG, Basel.

  11. DNA gains at 8q23.2

    DEFF Research Database (Denmark)

    da Silva Veiga, Luciana Caricati; Bérgamo, Nádia Aparecida; dos Reis, Patrícia Pintor

    2003-01-01

    Gains or amplifications involving chromosome arm 8q are one of the most recurrent chromosomal alterations in head and neck tumors. To characterize previously reported gains, we performed fluorescence in situ hybridization (FISH) using the sequences BAC RP1179E1 and 8-centromere PMJ 128 as probes....

  12. Not para-, not peri-, but centric inversion of chromosome 12

    DEFF Research Database (Denmark)

    Silahtaroglu, A N; Hacihanefioglu, S; Güven, G S

    1998-01-01

    a break in the alphoid repeats followed by an inversion within the short arm, resulting in a pseudodicentric chromosome. Further FISH analyses using telomeric and subtelomeric probes showed that the other breakpoint was in the subtelomeric region of the short arm. The karyotype is designated 47,XXY,inv(12......)(p10p13.3). To our knowledge this is the first report of a case of "centric inversion"....

  13. Centromere Protein (CENP)-W Interacts with Heterogeneous Nuclear Ribonucleoprotein (hnRNP) U and May Contribute to Kinetochore-Microtubule Attachment in Mitotic Cells

    Science.gov (United States)

    Chun, Younghwa; Kim, Raehyung; Lee, Soojin

    2016-01-01

    Background Recent studies have shown that heterogeneous nuclear ribonucleoprotein U (hnRNP U), a component of the hnRNP complex, contributes to stabilize the kinetochore-microtubule interaction during mitosis. CENP-W was identified as an inner centromere component that plays crucial roles in the formation of a functional kinetochore complex. Results We report that hnRNP U interacts with CENP-W, and the interaction between hnRNP U and CENP-W mutually increased each other’s protein stability by inhibiting the proteasome-mediated degradation. Further, their co-localization was observed chiefly in the nuclear matrix region and at the microtubule-kinetochore interface during interphase and mitosis, respectively. Both microtubule-stabilizing and microtubule-destabilizing agents significantly decreased the protein stability of CENP-W. Furthermore, loss of microtubules and defects in microtubule organization were observed in CENP-W-depleted cells. Conclusion Our data imply that CENP-W plays an important role in the attachment and interaction between microtubules and kinetochore during mitosis. PMID:26881882

  14. The RSF1 histone-remodelling factor facilitates DNA double-strand break repair by recruiting centromeric and Fanconi Anaemia proteins.

    Directory of Open Access Journals (Sweden)

    Fabio Pessina

    2014-05-01

    Full Text Available ATM is a central regulator of the cellular responses to DNA double-strand breaks (DSBs. Here we identify a biochemical interaction between ATM and RSF1 and we characterise the role of RSF1 in this response. The ATM-RSF1 interaction is dependent upon both DSBs and ATM kinase activity. Together with SNF2H/SMARCA5, RSF1 forms the RSF chromatin-remodelling complex. Although RSF1 is specific to the RSF complex, SNF2H/SMARCA5 is a catalytic subunit of several other chromatin-remodelling complexes. Although not required for checkpoint signalling, RSF1 is required for efficient repair of DSBs via both end-joining and homology-directed repair. Specifically, the ATM-dependent recruitment to sites of DSBs of the histone fold proteins CENPS/MHF1 and CENPX/MHF2, previously identified at centromeres, is RSF1-dependent. In turn these proteins recruit and regulate the mono-ubiquitination of the Fanconi Anaemia proteins FANCD2 and FANCI. We propose that by depositing CENPS/MHF1 and CENPX/MHF2, the RSF complex either directly or indirectly contributes to the reorganisation of chromatin around DSBs that is required for efficient DNA repair.

  15. Karyotypes and Distribution of Tandem Repeat Sequences in Brassica nigra Determined by Fluorescence in situ Hybridization

    Czech Academy of Sciences Publication Activity Database

    Wang, G.; He, Q.; Macas, Jiří; Novák, Petr; Neumann, Pavel; Meng, D.; Zhao, H.; Guo, N.; Han, S.; Zong, M.; Jin, W.; Liu, F.

    2017-01-01

    Roč. 152, č. 3 (2017), s. 158-165 ISSN 1424-8581 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : asymmetric somatic hybridization * Fluorescence in situ hybridization * Karyotype * (Peri) centromere Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Genetics and heredity (medical genetics to be 3) Impact factor: 1.354, year: 2016

  16. Genetic, genomic, and molecular tools for studying the protoploid yeast, L. waltii.

    Science.gov (United States)

    Di Rienzi, Sara C; Lindstrom, Kimberly C; Lancaster, Ragina; Rolczynski, Lisa; Raghuraman, M K; Brewer, Bonita J

    2011-02-01

    Sequencing of the yeast Kluyveromyces waltii (recently renamed Lachancea waltii) provided evidence of a whole genome duplication event in the lineage leading to the well-studied Saccharomyces cerevisiae. While comparative genomic analyses of these yeasts have proven to be extremely instructive in modeling the loss or maintenance of gene duplicates, experimental tests of the ramifications following such genome alterations remain difficult. To transform L. waltii from an organism of the computational comparative genomic literature into an organism of the functional comparative genomic literature, we have developed genetic, molecular and genomic tools for working with L. waltii. In particular, we have characterized basic properties of L. waltii (growth, ploidy, molecular karyotype, mating type and the sexual cycle), developed transformation, cell cycle arrest and synchronization protocols, and have created centromeric and non-centromeric vectors as well as a genome browser for L. waltii. We hope that these tools will be used by the community to follow up on the ideas generated by sequence data and lead to a greater understanding of eukaryotic biology and genome evolution. 2010 John Wiley & Sons, Ltd.

  17. Development of techniques using DNA analysis method for detection/analysis of radiation-induced mutation. Development of an useful probe/primer and improvement of detection efficacy

    International Nuclear Information System (INIS)

    Maekawa, Hideaki; Tsuchida, Kozo; Hashido, Kazuo; Takada, Naoko; Kameoka, Yosuke; Hirata, Makoto

    1999-01-01

    Previously, it was demonstrated that detection of centromere became easy and reliable through fluorescent staining by FISH method using a probe of the sequence preserved in α-satelite DNA. Since it was, however, found inappropriate to detect dicentrics based on the relative amount of DNA probe on each chromosome. A prove which allows homogeneous detection of α-satelite DNA for each chromosome was constructed. A presumed sequence specific to kinetochore, CENP-B box was amplified by PCR method and the product DNA was used as a probe. However, the variation in amounts of probe DNA among chromosomes was decreased by only about 20%. Then, a program for image processing of the results obtained from FISH using α-satelite DNA was constructed to use as a marker for centromere. When compared with detection of abnormal chromosomes stained by the conventional method, calculation efficacy for only detection of centromere was improved by the use of this program. Calculation to discriminate the normal or not was still complicated and the detection efficacy was little improved. Chromosomal abnormalities in lymphocytes were used to detect the effects of radiation. In this method, it is needed to shift the phase of cells into metaphase. The mutation induced by radiation might be often repaired during shifting. To exclude this possibility, DNA extraction was conducted at a low temperature and immediately after exposure to 137 Cs, and a rapid genome detection method was established using the genome DNA. As the model genomes, the following three were used: 1) long chain repeated sequences widely dispersed over chromosome, 2) cluster genes, 3) single copy genes. The effects of radiation were detectable at 1-2 Gy for the long repeated sequences and at 7 Gy for the cluster genes, respectively, whereas no significant effects were observed at any Gy tested for the single copy genes. Amplification was marked in the cells exposed at 1-10 Gy (peak at 4 Gy), suggesting that these regions had

  18. Repetitive DNAs highlight the role of chromosomal fusions in the karyotype evolution of Dascyllus species (Pomacentridae, Perciformes).

    Science.gov (United States)

    Getlekha, Nuntaporn; Molina, Wagner Franco; de Bello Cioffi, Marcelo; Yano, Cassia Fernanda; Maneechot, Nuntiya; Bertollo, Luiz Antonio Carlos; Supiwong, Weerayuth; Tanomtong, Alongklod

    2016-04-01

    The Dascyllus genus consists of 11 species spread over vast regions of the Indo-Pacific, showing remarkable reductions in the diploid chromosome numbers (2n). The present study analyzed the karyotypes and other chromosomal characteristics of D. trimaculatus (2n = 48; 2st + 46a; NF = 50), D. carneus (2n = 48; 2st + 46a; NF = 50) and D. aruanus (2n = 30; 18m + 2st + 10a; NF = 50) from the Thailand Gulf (Pacific Ocean) and D. melanurus (2n = 48; 2st + 46a; NF = 50) from the Andaman Sea (Indian Ocean), employing conventional cytogenetic analyses and the chromosomal mapping of repetitive DNAs, using 18S and 5S rDNA, telomeric sequences and (CA)15, (GA)15, and (CAA)10 microsatellites as probes. The C-positive heterochromatin was found in the centromeric regions of most chromosomal pairs and 18S rDNA phenotypes were single in all species. However, in D. aruanus (2n = 30), which harbors nine metacentric pairs; the 5S rDNA sites were located in the centromeric region of the shortest one. The mapping of the telomeric sequences in D. aruanus revealed the presence of interstitial telomeric sites (ITS) in the centromeric region of four metacentric pairs, with one of these pairs also displaying an additional ITS in the long arms. Distinct chromosomal markers confirmed the reduction of the 2n by chromosomal fusions, highlighting the precise characterization of these rearrangements by the cytogenetic mapping of the repetitive DNAs.

  19. Fungal genome and mating system transitions facilitated by chromosomal translocations involving intercentromeric recombination.

    Directory of Open Access Journals (Sweden)

    Sheng Sun

    2017-08-01

    Full Text Available Species within the human pathogenic Cryptococcus species complex are major threats to public health, causing approximately 1 million annual infections globally. Cryptococcus amylolentus is the most closely known related species of the pathogenic Cryptococcus species complex, and it is non-pathogenic. Additionally, while pathogenic Cryptococcus species have bipolar mating systems with a single large mating type (MAT locus that represents a derived state in Basidiomycetes, C. amylolentus has a tetrapolar mating system with 2 MAT loci (P/R and HD located on different chromosomes. Thus, studying C. amylolentus will shed light on the transition from tetrapolar to bipolar mating systems in the pathogenic Cryptococcus species, as well as its possible link with the origin and evolution of pathogenesis. In this study, we sequenced, assembled, and annotated the genomes of 2 C. amylolentus isolates, CBS6039 and CBS6273, which are sexual and interfertile. Genome comparison between the 2 C. amylolentus isolates identified the boundaries and the complete gene contents of the P/R and HD MAT loci. Bioinformatic and chromatin immunoprecipitation sequencing (ChIP-seq analyses revealed that, similar to those of the pathogenic Cryptococcus species, C. amylolentus has regional centromeres (CENs that are enriched with species-specific transposable and repetitive DNA elements. Additionally, we found that while neither the P/R nor the HD locus is physically closely linked to its centromere in C. amylolentus, and the regions between the MAT loci and their respective centromeres show overall synteny between the 2 genomes, both MAT loci exhibit genetic linkage to their respective centromere during meiosis, suggesting the presence of recombinational suppressors and/or epistatic gene interactions in the MAT-CEN intervening regions. Furthermore, genomic comparisons between C. amylolentus and related pathogenic Cryptococcus species provide evidence that multiple chromosomal

  20. Unique small RNA signatures uncovered in the tammar wallaby genome

    Directory of Open Access Journals (Sweden)

    Lindsay James

    2012-10-01

    Full Text Available Abstract Background Small RNAs have proven to be essential regulatory molecules encoded within eukaryotic genomes. These short RNAs participate in a diverse array of cellular processes including gene regulation, chromatin dynamics and genome defense. The tammar wallaby, a marsupial mammal, is a powerful comparative model for studying the evolution of regulatory networks. As part of the genome sequencing initiative for the tammar, we have explored the evolution of each of the major classes of mammalian small RNAs in an Australian marsupial for the first time, including the first genome-scale analysis of the newest class of small RNAs, centromere repeat associated short interacting RNAs (crasiRNAs. Results Using next generation sequencing, we have characterized the major classes of small RNAs, micro (mi RNAs, piwi interacting (pi RNAs, and the centromere repeat associated short interacting (crasi RNAs in the tammar. We examined each of these small RNA classes with respect to the newly assembled tammar wallaby genome for gene and repeat features, salient features that define their canonical sequences, and the constitution of both highly conserved and species-specific members. Using a combination of miRNA hairpin predictions and co-mapping with miRBase entries, we identified a highly conserved cluster of miRNA genes on the X chromosome in the tammar and a total of 94 other predicted miRNA producing genes. Mapping all miRNAs to the tammar genome and comparing target genes among tammar, mouse and human, we identified 163 conserved target genes. An additional nine genes were identified in tammar that do not have an orthologous miRNA target in human and likely represent novel miRNA-regulated genes in the tammar. A survey of the tammar gonadal piRNAs shows that these small RNAs are enriched in retroelements and carry members from both marsupial and tammar-specific repeat classes. Lastly, this study includes the first in-depth analyses of the newly

  1. Identification of a fourth locus (EVR4) for familial exudative vitreoretinopathy (FEVR).

    Science.gov (United States)

    Toomes, Carmel; Downey, Louise M; Bottomley, Helen M; Scott, Sheila; Woodruff, Geoffrey; Trembath, Richard C; Inglehearn, Chris F

    2004-01-15

    Familial exudative vitreoretinopathy (FEVR) is a genetically heterogeneous inherited blinding disorder of the retinal vascular system. To date three loci have been mapped: EVR1 on chromosome 11q, EVR2 on chromosome Xp, and EVR3 on chromosome 11p. The gene underlying EVR3 remains unidentified whilst the EVR2 gene, which encodes the Norrie disease protein (NDP), was identified over a decade ago. More recently, FZD4, the gene that encodes the Wnt receptor Frizzled-4, was identified as the mutated gene at the EVR1 locus. The purpose of this study was to screen FZD4 in a large family previously proven to be linked to the EVR1 locus. PCR products were generated using genomic DNA from affected family members with primers designed to amplify the coding sequence of FZD4. The PCR products were screened for mutations by direct sequencing. Genotyping was performed in all available family members using fluorescently labeled microsatellite markers from chromosome 11q. Sequencing of the EVR1 gene, FZD4, in this family identified no mutation. To investigate this family further we performed high-resolution genotyping with markers spanning chromosome 11q. Haplotype analysis excluded FZD4 as the mutated gene in this family and identified a candidate region approximately 10 cM centromeric to EVR1. This new FEVR locus is flanked by markers D11S1368 (centromeric) and D11S937 (telomeric) and spans approximately 15 cM. High-resolution genotyping and haplotype analysis excluded FZD4 as the defective gene in a family previously linked to the EVR1 locus. The results indicate that the gene mutated in this family lies centromeric to the EVR1 gene, FZD4, and is also genetically distinct from the EVR3 locus. This new locus has been designated EVR4 and is the fourth FEVR locus to be described.

  2. Prognostic and predictive role of ESR1 status for postmenopausal patients with endocrine-responsive early breast cancer in the Danish cohort of the BIG 1-98 trial

    DEFF Research Database (Denmark)

    Ejlertsen, B; Aldridge, J; Nielsen, K V

    2012-01-01

    postmenopausal Danish women with early breast cancer randomly assigned to receive 5 years of letrozole, tamoxifen or a sequence of these agents in the Breast International Group 1-98 trial and who had ER ≥1% after central review. RESULTS: By FISH, 13.6% of patients had an ESR1-to-Centromere-6 (CEN-6) ratio ≥2...... (amplified), and 4.2% had ESR1-to-CEN-6 ratio...

  3. The Genome of the Basidiomycetous Yeast and Human Pathogen Cryptococcus neoformans

    OpenAIRE

    Loftus, Brendan J.; Fung, Eula; Roncaglia, Paola; Rowley, Don; Amedeo, Paolo; Bruno, Dan; Vamathevan, Jessica; Miranda, Molly; Anderson, Iain J.; Fraser, James A.; Allen, Jonathan E.; Bosdet, Ian E.; Brent, Michael R.; Chiu, Readman; Doering, Tamara L.

    2005-01-01

    Cryptococcus neoformans is a basidiomycetous yeast ubiquitous in the environment, a model for fungal pathogenesis, and an opportunistic human pathogen of global importance. We have sequenced its ~20-megabase genome, which contains ~6500 intron-rich gene structures and encodes a transcriptome abundant in alternatively spliced and antisense messages. The genome is rich in transposons, many of which cluster at candidate centromeric regions. The presence of these transposons may drive karyotype i...

  4. Telomere disruption results in non-random formation of de novo dicentric chromosomes involving acrocentric human chromosomes.

    Directory of Open Access Journals (Sweden)

    Kaitlin M Stimpson

    2010-08-01

    Full Text Available Genome rearrangement often produces chromosomes with two centromeres (dicentrics that are inherently unstable because of bridge formation and breakage during cell division. However, mammalian dicentrics, and particularly those in humans, can be quite stable, usually because one centromere is functionally silenced. Molecular mechanisms of centromere inactivation are poorly understood since there are few systems to experimentally create dicentric human chromosomes. Here, we describe a human cell culture model that enriches for de novo dicentrics. We demonstrate that transient disruption of human telomere structure non-randomly produces dicentric fusions involving acrocentric chromosomes. The induced dicentrics vary in structure near fusion breakpoints and like naturally-occurring dicentrics, exhibit various inter-centromeric distances. Many functional dicentrics persist for months after formation. Even those with distantly spaced centromeres remain functionally dicentric for 20 cell generations. Other dicentrics within the population reflect centromere inactivation. In some cases, centromere inactivation occurs by an apparently epigenetic mechanism. In other dicentrics, the size of the alpha-satellite DNA array associated with CENP-A is reduced compared to the same array before dicentric formation. Extra-chromosomal fragments that contained CENP-A often appear in the same cells as dicentrics. Some of these fragments are derived from the same alpha-satellite DNA array as inactivated centromeres. Our results indicate that dicentric human chromosomes undergo alternative fates after formation. Many retain two active centromeres and are stable through multiple cell divisions. Others undergo centromere inactivation. This event occurs within a broad temporal window and can involve deletion of chromatin that marks the locus as a site for CENP-A maintenance/replenishment.

  5. Detection and Automated Scoring of Dicentric Chromosomes in Nonstimulated Lymphocyte Prematurely Condensed Chromosomes After Telomere and Centromere Staining

    Energy Technology Data Exchange (ETDEWEB)

    M' kacher, Radhia [Laboratoire de Radiobiologie et Oncologie, Commissariat à l' Energie Atomique, Fontenay-aux-Roses (France); El Maalouf, Elie [Laboratoire de Radiobiologie et Oncologie, Commissariat à l' Energie Atomique, Fontenay-aux-Roses (France); Laboratoire Modélisation Intelligence Processus Systèmes (MIPS)–Groupe TIIM3D, Université de Haute-Alsace, Mulhouse (France); Terzoudi, Georgia [Laboratory of Radiobiology & Biodosimetry, National Center for Scientific Research Demokritos, Athens (Greece); Ricoul, Michelle [Laboratoire de Radiobiologie et Oncologie, Commissariat à l' Energie Atomique, Fontenay-aux-Roses (France); Heidingsfelder, Leonhard [MetaSystems, Altlussheim (Germany); Karachristou, Ionna [Laboratory of Radiobiology & Biodosimetry, National Center for Scientific Research Demokritos, Athens (Greece); Laplagne, Eric [Pole Concept, Paris (France); Hempel, William M. [Laboratoire de Radiobiologie et Oncologie, Commissariat à l' Energie Atomique, Fontenay-aux-Roses (France); Colicchio, Bruno; Dieterlen, Alain [Laboratoire Modélisation Intelligence Processus Systèmes (MIPS)–Groupe TIIM3D, Université de Haute-Alsace, Mulhouse (France); Pantelias, Gabriel [Laboratory of Radiobiology & Biodosimetry, National Center for Scientific Research Demokritos, Athens (Greece); Sabatier, Laure, E-mail: laure.sabatier@cea.fr [Laboratoire de Radiobiologie et Oncologie, Commissariat à l' Energie Atomique, Fontenay-aux-Roses (France)

    2015-03-01

    Purpose: To combine telomere and centromere (TC) staining of premature chromosome condensation (PCC) fusions to identify dicentrics, centric rings, and acentric chromosomes, making possible the realization of a dose–response curve and automation of the process. Methods and Materials: Blood samples from healthy donors were exposed to {sup 60}Co irradiation at varying doses up to 8 Gy, followed by a repair period of 8 hours. Premature chromosome condensation fusions were carried out, and TC staining using peptide nucleic acid probes was performed. Chromosomal aberration (CA) scoring was carried out manually and automatically using PCC-TCScore software, developed in our laboratory. Results: We successfully optimized the hybridization conditions and image capture parameters, to increase the sensitivity and effectiveness of CA scoring. Dicentrics, centric rings, and acentric chromosomes were rapidly and accurately detected, leading to a linear-quadratic dose–response curve by manual scoring at up to 8 Gy. Using PCC-TCScore software for automatic scoring, we were able to detect 95% of dicentrics and centric rings. Conclusion: The introduction of TC staining to the PCC fusion technique has made possible the rapid scoring of unstable CAs, including dicentrics, with a level of accuracy and ease not previously possible. This new approach can be used for biological dosimetry in radiation emergency medicine, where the rapid and accurate detection of dicentrics is a high priority using automated scoring. Because there is no culture time, this new approach can also be used for the follow-up of patients treated by genotoxic therapy, creating the possibility to perform the estimation of induced chromosomal aberrations immediately after the blood draw.

  6. Characterization of a chromosome-specific chimpanzee alpha satellite subset: Evolutionary relationship to subsets on human chromosomes

    Energy Technology Data Exchange (ETDEWEB)

    Warburton, P.E.; Gosden, J.; Lawson, D. [Western General Hospital, Edinburgh (United Kingdom)] [and others

    1996-04-15

    Alpha satellite DNA is a tandemly repeated DNA family found at the centromeres of all primate chromosomes examined. The fundamental repeat units of alpha satellite DNA are diverged 169- to 172-bp monomers, often found to be organized in chromosome-specific higher-order repeat units. The chromosomes of human (Homo sapiens (HSA)), chimpanzee (Pan troglodytes (PTR) and Pan paniscus), and gorilla (Gorilla gorilla) share a remarkable similarity and synteny. It is of interest to ask if alpha satellite arrays at centromeres of homologous chromosomes between these species are closely related (evolving in an orthologous manner) or if the evolutionary processes that homogenize and spread these arrays within and between chromosomes result in nonorthologous evolution of arrays. By using PCR primers specific for human chromosome 17-specific alpha satellite DNA, we have amplified, cloned, and characterized a chromosome-specific subset from the PTR chimpanzee genome. Hybridization both on Southern blots and in situ as well as sequence analysis show that this subset is most closely related, as expected, to sequences on HSA 17. However, in situ hybridization reveals that this subset is not found on the homologous chromosome in chimpanzee (PTR 19), but instead on PTR 12, which is homologous to HSA 2p. 40 refs., 3 figs.

  7. Diagnostic/prognostic molecular cytogenetic follow-up applied in satellited marker cases

    Energy Technology Data Exchange (ETDEWEB)

    Papenhausen, P.R.; Anderson, S.

    1994-09-01

    Special caution needs to be exercised in offering a good prognosis in Prader-Willi probe negative 15-derived marker cases, since it is clear that phenotypic effects can still be associated with the apparent presence of proximal sequences. We have had two postnatal cases in this category, one which was inherited from an unaffected paternal (non-mosaic) carrier, possibly demonstrating imprinting effects. Familial studies are continuing in this case. Although the D22/S9 locus appears diagnostic of cateye syndrome (CES), the dual specificity of the 14/22 centromeric probe leaves the possibility of a poor prognosis 14 derivation when the CES probe is negative. Therefore, it is imperative that proximal long arm 13, 14, 21 and more proximal 15 FISH probes be implemented so that a phenotypically correlated database may indicate the proper FISH probes necessary for accurate prognosis. Bisatellited markers is which a bipartite centromeric probe signal was found were considered to be higher risk than those with the single signal in counseling.

  8. Genome-wide Control of Heterochromatin Replication by the Telomere Capping Protein TRF2.

    Science.gov (United States)

    Mendez-Bermudez, Aaron; Lototska, Liudmyla; Bauwens, Serge; Giraud-Panis, Marie-Josèphe; Croce, Olivier; Jamet, Karine; Irizar, Agurtzane; Mowinckel, Macarena; Koundrioukoff, Stephane; Nottet, Nicolas; Almouzni, Genevieve; Teulade-Fichou, Mare-Paule; Schertzer, Michael; Perderiset, Mylène; Londoño-Vallejo, Arturo; Debatisse, Michelle; Gilson, Eric; Ye, Jing

    2018-05-03

    Hard-to-replicate regions of chromosomes (e.g., pericentromeres, centromeres, and telomeres) impede replication fork progression, eventually leading, in the event of replication stress, to chromosome fragility, aging, and cancer. Our knowledge of the mechanisms controlling the stability of these regions is essentially limited to telomeres, where fragility is counteracted by the shelterin proteins. Here we show that the shelterin subunit TRF2 ensures progression of the replication fork through pericentromeric heterochromatin, but not centromeric chromatin. In a process involving its N-terminal basic domain, TRF2 binds to pericentromeric Satellite III sequences during S phase, allowing the recruitment of the G-quadruplex-resolving helicase RTEL1 to facilitate fork progression. We also show that TRF2 is required for the stability of other heterochromatic regions localized throughout the genome, paving the way for future research on heterochromatic replication and its relationship with aging and cancer. Copyright © 2018 Elsevier Inc. All rights reserved.

  9. Universal sequence map (USM of arbitrary discrete sequences

    Directory of Open Access Journals (Sweden)

    Almeida Jonas S

    2002-02-01

    Full Text Available Abstract Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM, is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR. The latter enables the representation of 4 unit type sequences (like DNA as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules.

  10. Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

    Science.gov (United States)

    Deroost, Natacha; Coomans, Daphné

    2018-02-01

    We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.

  11. The Genome of the Basidiomycetous Yeast and Human Pathogen Cryptococcus neoformans

    Science.gov (United States)

    Loftus, Brendan J.; Fung, Eula; Roncaglia, Paola; Rowley, Don; Amedeo, Paolo; Bruno, Dan; Vamathevan, Jessica; Miranda, Molly; Anderson, Iain J.; Fraser, James A.; Allen, Jonathan E.; Bosdet, Ian E.; Brent, Michael R.; Chiu, Readman; Doering, Tamara L.; Donlin, Maureen J.; D’Souza, Cletus A.; Fox, Deborah S.; Grinberg, Viktoriya; Fu, Jianmin; Fukushima, Marilyn; Haas, Brian J.; Huang, James C.; Janbon, Guilhem; Jones, Steven J. M.; Koo, Hean L.; Krzywinski, Martin I.; Kwon-Chung, June K.; Lengeler, Klaus B.; Maiti, Rama; Marra, Marco A.; Marra, Robert E.; Mathewson, Carrie A.; Mitchell, Thomas G.; Pertea, Mihaela; Riggs, Florenta R.; Salzberg, Steven L.; Schein, Jacqueline E.; Shvartsbeyn, Alla; Shin, Heesun; Shumway, Martin; Specht, Charles A.; Suh, Bernard B.; Tenney, Aaron; Utterback, Terry R.; Wickes, Brian L.; Wortman, Jennifer R.; Wye, Natasja H.; Kronstad, James W.; Lodge, Jennifer K.; Heitman, Joseph; Davis, Ronald W.; Fraser, Claire M.; Hyman, Richard W.

    2012-01-01

    Cryptococcus neoformans is a basidiomycetous yeast ubiquitous in the environment, a model for fungal pathogenesis, and an opportunistic human pathogen of global importance. We have sequenced its ~20-megabase genome, which contains ~6500 intron-rich gene structures and encodes a transcriptome abundant in alternatively spliced and antisense messages. The genome is rich in transposons, many of which cluster at candidate centromeric regions. The presence of these transposons may drive karyotype instability and phenotypic variation. C. neoformans encodes unique genes that may contribute to its unusual virulence properties, and comparison of two phenotypically distinct strains reveals variation in gene content in addition to sequence polymorphisms between the genomes. PMID:15653466

  12. A mitosis-specific and R loop-driven ATR pathway promotes faithful chromosome segregation.

    Science.gov (United States)

    Kabeche, Lilian; Nguyen, Hai Dang; Buisson, Rémi; Zou, Lee

    2018-01-05

    The ataxia telangiectasia mutated and Rad3-related (ATR) kinase is crucial for DNA damage and replication stress responses. Here, we describe an unexpected role of ATR in mitosis. Acute inhibition or degradation of ATR in mitosis induces whole-chromosome missegregation. The effect of ATR ablation is not due to altered cyclin-dependent kinase 1 (CDK1) activity, DNA damage responses, or unscheduled DNA synthesis but to loss of an ATR function at centromeres. In mitosis, ATR localizes to centromeres through Aurora A-regulated association with centromere protein F (CENP-F), allowing ATR to engage replication protein A (RPA)-coated centromeric R loops. As ATR is activated at centromeres, it stimulates Aurora B through Chk1, preventing formation of lagging chromosomes. Thus, a mitosis-specific and R loop-driven ATR pathway acts at centromeres to promote faithful chromosome segregation, revealing functions of R loops and ATR in suppressing chromosome instability. Copyright © 2018, American Association for the Advancement of Science.

  13. Quantum-Sequencing: Fast electronic single DNA molecule sequencing

    Science.gov (United States)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.

  14. Multimodal sequence learning.

    Science.gov (United States)

    Kemény, Ferenc; Meier, Beat

    2016-02-01

    While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. The CENP-T C-Terminus Is Exclusively Proximal to H3.1 and not to H3.2 or H3.3

    Science.gov (United States)

    Abendroth, Christian; Hofmeister, Antje; Hake, Sandra B.; Kamweru, Paul K.; Miess, Elke; Dornblut, Carsten; Küffner, Isabell; Deng, Wen; Leonhardt, Heinrich; Orthaus, Sandra; Hoischen, Christian; Diekmann, Stephan

    2015-01-01

    The kinetochore proteins assemble onto centromeric chromatin and regulate DNA segregation during cell division. The inner kinetochore proteins bind centromeres while most outer kinetochore proteins assemble at centromeres during mitosis, connecting the complex to microtubules. The centromere–kinetochore complex contains specific nucleosomes and nucleosomal particles. CENP-A replaces canonical H3 in centromeric nucleosomes, defining centromeric chromatin. Next to CENP-A, the CCAN multi-protein complex settles which contains CENP-T/W/S/X. These four proteins are described to form a nucleosomal particle at centromeres. We had found the CENP-T C-terminus and the CENP-S termini next to histone H3.1 but not to CENP-A, suggesting that the Constitutive Centromere-Associated Network (CCAN) bridges a CENP-A- and a H3-containing nucleosome. Here, we show by in vivo FRET that this proximity between CENP-T and H3 is specific for H3.1 but neither for the H3.1 mutants H3.1C96A and H3.1C110A nor for H3.2 or H3.3. We also found CENP-M next to H3.1 but not to these H3.1 mutants. Consistently, we detected CENP-M next to CENP-S. These data elucidate the local molecular neighborhood of CCAN proteins next to a H3.1-containing centromeric nucleosome. They also indicate an exclusive position of H3.1 clearly distinct from H3.2, thus documenting a local, and potentially also functional, difference between H3.1 and H3.2. PMID:25775162

  16. The CENP-T C-Terminus Is Exclusively Proximal to H3.1 and not to H3.2 or H3.3

    Directory of Open Access Journals (Sweden)

    Christian Abendroth

    2015-03-01

    Full Text Available The kinetochore proteins assemble onto centromeric chromatin and regulate DNA segregation during cell division. The inner kinetochore proteins bind centromeres while most outer kinetochore proteins assemble at centromeres during mitosis, connecting the complex to microtubules. The centromere–kinetochore complex contains specific nucleosomes and nucleosomal particles. CENP-A replaces canonical H3 in centromeric nucleosomes, defining centromeric chromatin. Next to CENP-A, the CCAN multi-protein complex settles which contains CENP-T/W/S/X. These four proteins are described to form a nucleosomal particle at centromeres. We had found the CENP-T C-terminus and the CENP-S termini next to histone H3.1 but not to CENP-A, suggesting that the Constitutive Centromere-Associated Network (CCAN bridges a CENP-A- and a H3-containing nucleosome. Here, we show by in vivo FRET that this proximity between CENP-T and H3 is specific for H3.1 but neither for the H3.1 mutants H3.1C96A and H3.1C110A nor for H3.2 or H3.3. We also found CENP-M next to H3.1 but not to these H3.1 mutants. Consistently, we detected CENP-M next to CENP-S. These data elucidate the local molecular neighborhood of CCAN proteins next to a H3.1-containing centromeric nucleosome. They also indicate an exclusive position of H3.1 clearly distinct from H3.2, thus documenting a local, and potentially also functional, difference between H3.1 and H3.2.

  17. Novel expressed sequence tag- simple sequence repeats (EST ...

    African Journals Online (AJOL)

    Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...

  18. Genome Sequence Databases (Overview): Sequencing and Assembly

    Energy Technology Data Exchange (ETDEWEB)

    Lapidus, Alla L.

    2009-01-01

    From the date its role in heredity was discovered, DNA has been generating interest among scientists from different fields of knowledge: physicists have studied the three dimensional structure of the DNA molecule, biologists tried to decode the secrets of life hidden within these long molecules, and technologists invent and improve methods of DNA analysis. The analysis of the nucleotide sequence of DNA occupies a special place among the methods developed. Thanks to the variety of sequencing technologies available, the process of decoding the sequence of genomic DNA (or whole genome sequencing) has become robust and inexpensive. Meanwhile the assembly of whole genome sequences remains a challenging task. In addition to the need to assemble millions of DNA fragments of different length (from 35 bp (Solexa) to 800 bp (Sanger)), great interest in analysis of microbial communities (metagenomes) of different complexities raises new problems and pushes some new requirements for sequence assembly tools to the forefront. The genome assembly process can be divided into two steps: draft assembly and assembly improvement (finishing). Despite the fact that automatically performed assembly (or draft assembly) is capable of covering up to 98% of the genome, in most cases, it still contains incorrectly assembled reads. The error rate of the consensus sequence produced at this stage is about 1/2000 bp. A finished genome represents the genome assembly of much higher accuracy (with no gaps or incorrectly assembled areas) and quality ({approx}1 error/10,000 bp), validated through a number of computer and laboratory experiments.

  19. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  20. Blind sequence-length estimation of low-SNR cyclostationary sequences

    CSIR Research Space (South Africa)

    Vlok, JD

    2014-06-01

    Full Text Available Several existing direct-sequence spread spectrum (DSSS) detection and estimation algorithms assume prior knowledge of the symbol period or sequence length, although very few sequence-length estimation techniques are available in the literature...

  1. Dog Y chromosomal DNA sequence: identification, sequencing and SNP discovery

    Directory of Open Access Journals (Sweden)

    Kirkness Ewen

    2006-10-01

    Full Text Available Abstract Background Population genetic studies of dogs have so far mainly been based on analysis of mitochondrial DNA, describing only the history of female dogs. To get a picture of the male history, as well as a second independent marker, there is a need for studies of biallelic Y-chromosome polymorphisms. However, there are no biallelic polymorphisms reported, and only 3200 bp of non-repetitive dog Y-chromosome sequence deposited in GenBank, necessitating the identification of dog Y chromosome sequence and the search for polymorphisms therein. The genome has been only partially sequenced for one male dog, disallowing mapping of the sequence into specific chromosomes. However, by comparing the male genome sequence to the complete female dog genome sequence, candidate Y-chromosome sequence may be identified by exclusion. Results The male dog genome sequence was analysed by Blast search against the human genome to identify sequences with a best match to the human Y chromosome and to the female dog genome to identify those absent in the female genome. Candidate sequences were then tested for male specificity by PCR of five male and five female dogs. 32 sequences from the male genome, with a total length of 24 kbp, were identified as male specific, based on a match to the human Y chromosome, absence in the female dog genome and male specific PCR results. 14437 bp were then sequenced for 10 male dogs originating from Europe, Southwest Asia, Siberia, East Asia, Africa and America. Nine haplotypes were found, which were defined by 14 substitutions. The genetic distance between the haplotypes indicates that they originate from at least five wolf haplotypes. There was no obvious trend in the geographic distribution of the haplotypes. Conclusion We have identified 24159 bp of dog Y-chromosome sequence to be used for population genetic studies. We sequenced 14437 bp in a worldwide collection of dogs, identifying 14 SNPs for future SNP analyses, and

  2. Sequence assembly

    DEFF Research Database (Denmark)

    Scheibye-Alsing, Karsten; Hoffmann, S.; Frankel, Annett Maria

    2009-01-01

    Despite the rapidly increasing number of sequenced and re-sequenced genomes, many issues regarding the computational assembly of large-scale sequencing data have remain unresolved. Computational assembly is crucial in large genome projects as well for the evolving high-throughput technologies and...... in genomic DNA, highly expressed genes and alternative transcripts in EST sequences. We summarize existing comparisons of different assemblers and provide a detailed descriptions and directions for download of assembly programs at: http://genome.ku.dk/resources/assembly/methods.html....

  3. Localization of specific sequences and DNA single-strand breaks in individual UV-A-irradiated human lymphocytes by COMET FISH

    Science.gov (United States)

    Bock, Claudia; Rapp, Alexander; Dittmar, Heike; Monajembashi, Shamci; Greulich, Karl-Otto

    1999-01-01

    The COMET assay, a single cell electrophoresis technique which allows to separate electrophoretically fractionated DNA according to size has been combined with fluorescence in situ hybridization (FISH) which allows to localize specific genes or gene regions. This combination (COMET FISH) allows the detection of DNA single strand breaks in specific regions of the genome of human lymphocytes at the single cell level. Various types of DNA probes, e.g. centromere-, (alpha) - satellite-, telomere-, whole chromosome-, single copy- and region specific DNA probes have been used to investigate whether the UV-A induced DNA single strand breaks are distributed randomly all over the human genome or induced at specific sites ('hot spots'). In the investigated human peripheral blood lymphocytes all but one centromere reveal low sensitivity for UV-A irradiation (500 kJ/m2), while telomeres are randomly distributed over COMET heads and tails. The human chromosome 1 is fractionated by irradiation, but remains in the COMET head, indicating an only moderate degree of fractionation. Among three tested single copy probes, c- myc, p53 and p58, the p53 gene located on chromosome 17p13.1 and the p58 gene (1p36) appear to be located in UV-A stable regions of the human genome in 95% of 65 investigated lymphocytes. In contrast, the c-myc proto-oncogene (8q24) is found in the COMET tail in 90% of the 27 investigated lymphocytes and thus appears to be more sensitive to UV-A irradiation.

  4. Tidying up international nucleotide sequence databases: ecological, geographical and sequence quality annotation of its sequences of mycorrhizal fungi.

    Science.gov (United States)

    Tedersoo, Leho; Abarenkov, Kessy; Nilsson, R Henrik; Schüssler, Arthur; Grelet, Gwen-Aëlle; Kohout, Petr; Oja, Jane; Bonito, Gregory M; Veldre, Vilmar; Jairus, Teele; Ryberg, Martin; Larsson, Karl-Henrik; Kõljalg, Urmas

    2011-01-01

    Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi.

  5. Shotgun protein sequencing.

    Energy Technology Data Exchange (ETDEWEB)

    Faulon, Jean-Loup Michel; Heffelfinger, Grant S.

    2009-06-01

    A novel experimental and computational technique based on multiple enzymatic digestion of a protein or protein mixture that reconstructs protein sequences from sequences of overlapping peptides is described in this SAND report. This approach, analogous to shotgun sequencing of DNA, is to be used to sequence alternative spliced proteins, to identify post-translational modifications, and to sequence genetically engineered proteins.

  6. A New Approach to Dissect Nuclear Organization: TALE-Mediated Genome Visualization (TGV).

    Science.gov (United States)

    Miyanari, Yusuke

    2016-01-01

    Spatiotemporal organization of chromatin within the nucleus has so far remained elusive. Live visualization of nuclear remodeling could be a promising approach to understand its functional relevance in genome functions and mechanisms regulating genome architecture. Recent technological advances in live imaging of chromosomes begun to explore the biological roles of the movement of the chromatin within the nucleus. Here I describe a new technique, called TALE-mediated genome visualization (TGV), which allows us to visualize endogenous repetitive sequence including centromeric, pericentromeric, and telomeric repeats in living cells.

  7. Foundations of Sequence-to-Sequence Modeling for Time Series

    OpenAIRE

    Kuznetsov, Vitaly; Mariet, Zelda

    2018-01-01

    The availability of large amounts of time series data, paired with the performance of deep-learning algorithms on a broad class of problems, has recently led to significant interest in the use of sequence-to-sequence models for time series forecasting. We provide the first theoretical analysis of this time series forecasting framework. We include a comparison of sequence-to-sequence modeling to classical time series models, and as such our theory can serve as a quantitative guide for practiti...

  8. Detection of M-Sequences from Spike Sequence in Neuronal Networks

    Directory of Open Access Journals (Sweden)

    Yoshi Nishitani

    2012-01-01

    Full Text Available In circuit theory, it is well known that a linear feedback shift register (LFSR circuit generates pseudorandom bit sequences (PRBS, including an M-sequence with the maximum period of length. In this study, we tried to detect M-sequences known as a pseudorandom sequence generated by the LFSR circuit from time series patterns of stimulated action potentials. Stimulated action potentials were recorded from dissociated cultures of hippocampal neurons grown on a multielectrode array. We could find several M-sequences from a 3-stage LFSR circuit (M3. These results show the possibility of assembling LFSR circuits or its equivalent ones in a neuronal network. However, since the M3 pattern was composed of only four spike intervals, the possibility of an accidental detection was not zero. Then, we detected M-sequences from random spike sequences which were not generated from an LFSR circuit and compare the result with the number of M-sequences from the originally observed raster data. As a result, a significant difference was confirmed: a greater number of “0–1” reversed the 3-stage M-sequences occurred than would have accidentally be detected. This result suggests that some LFSR equivalent circuits are assembled in neuronal networks.

  9. Genome Sequencing

    DEFF Research Database (Denmark)

    Sato, Shusei; Andersen, Stig Uggerhøj

    2014-01-01

    The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based on transcr......The current Lotus japonicus reference genome sequence is based on a hybrid assembly of Sanger TAC/BAC, Sanger shotgun and Illumina shotgun sequencing data generated from the Miyakojima-MG20 accession. It covers nearly all expressed L. japonicus genes and has been annotated mainly based...

  10. Regulation of Budding Yeast CENP-A levels Prevents Misincorporation at Promoter Nucleosomes and Transcriptional Defects.

    Directory of Open Access Journals (Sweden)

    Erica M Hildebrand

    2016-03-01

    Full Text Available The exclusive localization of the histone H3 variant CENP-A to centromeres is essential for accurate chromosome segregation. Ubiquitin-mediated proteolysis helps to ensure that CENP-A does not mislocalize to euchromatin, which can lead to genomic instability. Consistent with this, overexpression of the budding yeast CENP-A(Cse4 is lethal in cells lacking Psh1, the E3 ubiquitin ligase that targets CENP-A(Cse4 for degradation. To identify additional mechanisms that prevent CENP-A(Cse4 misincorporation and lethality, we analyzed the genome-wide mislocalization pattern of overexpressed CENP-A(Cse4 in the presence and absence of Psh1 by chromatin immunoprecipitation followed by high throughput sequencing. We found that ectopic CENP-A(Cse4 is enriched at promoters that contain histone H2A.Z(Htz1 nucleosomes, but that H2A.Z(Htz1 is not required for CENP-A(Cse4 mislocalization. Instead, the INO80 complex, which removes H2A.Z(Htz1 from nucleosomes, promotes the ectopic deposition of CENP-A(Cse4. Transcriptional profiling revealed gene expression changes in the psh1Δ cells overexpressing CENP-A(Cse4. The down-regulated genes are enriched for CENP-A(Cse4 mislocalization to promoters, while the up-regulated genes correlate with those that are also transcriptionally up-regulated in an htz1Δ strain. Together, these data show that regulating centromeric nucleosome localization is not only critical for maintaining centromere function, but also for ensuring accurate promoter function and transcriptional regulation.

  11. Subchromosomal karyotype evolution in Equidae.

    Science.gov (United States)

    Musilova, P; Kubickova, S; Vahala, J; Rubes, J

    2013-04-01

    Equidae is a small family which comprises horses, African and Asiatic asses, and zebras. Despite equids having diverged quite recently, their karyotypes underwent rapid evolution which resulted in extensive differences among chromosome complements in respective species. Comparative mapping using whole-chromosome painting probes delineated genome-wide chromosome homologies among extant equids, enabling us to trace chromosome rearrangements that occurred during evolution. In the present study, we performed subchromosomal comparative mapping among seven Equidae species, representing the whole family. Region-specific painting and bacterial artificial chromosome probes were used to determine the orientation of evolutionarily conserved segments with respect to centromere positions. This allowed assessment of the configuration of all fusions occurring during the evolution of Equidae, as well as revealing discrepancies in centromere location caused by centromere repositioning or inversions. Our results indicate that the prevailing type of fusion in Equidae is centric fusion. Tandem fusions of the type telomere-telomere occur almost exclusively in the karyotype of Hartmann's zebra and are characteristic of this species' evolution. We revealed inversions in segments homologous to horse chromosomes 3p/10p and 13 in zebras and confirmed inversions in segments 4/31 in African ass, 7 in horse and 8p/20 in zebras. Furthermore, our mapping results suggested that centromere repositioning events occurred in segments homologous to horse chromosomes 7, 8q, 10p and 19 in the African ass and an element homologous to horse chromosome 16 in Asiatic asses. Centromere repositioning in chromosome 1 resulted in three different chromosome types occurring in extant species. Heterozygosity of the centromere position of this chromosome was observed in the kiang. Other subtle changes in centromere position were described in several evolutionary conserved chromosomal segments, suggesting that tiny

  12. [Complete genome sequencing and sequence analysis of BCG Tice].

    Science.gov (United States)

    Wang, Zhiming; Pan, Yuanlong; Wu, Jun; Zhu, Baoli

    2012-10-04

    The objective of this study is to obtain the complete genome sequence of Bacillus Calmette-Guerin Tice (BCG Tice), in order to provide more information about the molecular biology of BCG Tice and design more reasonable vaccines to prevent tuberculosis. We assembled the data from high-throughput sequencing with SOAPdenovo software, with many contigs and scaffolds obtained. There are many sequence gaps and physical gaps remained as a result of regional low coverage and low quality. We designed primers at the end of contigs and performed PCR amplification in order to link these contigs and scaffolds. With various enzymes to perform PCR amplification, adjustment of PCR reaction conditions, and combined with clone construction to sequence, all the gaps were finished. We obtained the complete genome sequence of BCG Tice and submitted it to GenBank of National Center for Biotechnology Information (NCBI). The genome of BCG Tice is 4334064 base pairs in length, with GC content 65.65%. The problems and strategies during the finishing step of BCG Tice sequencing are illuminated here, with the hope of affording some experience to those who are involved in the finishing step of genome sequencing. The microarray data were verified by our results.

  13. A Three-Dimensional Model of the Yeast Genome

    Science.gov (United States)

    Noble, William; Duan, Zhi-Jun; Andronescu, Mirela; Schutz, Kevin; McIlwain, Sean; Kim, Yoo Jung; Lee, Choli; Shendure, Jay; Fields, Stanley; Blau, C. Anthony

    Layered on top of information conveyed by DNA sequence and chromatin are higher order structures that encompass portions of chromosomes, entire chromosomes, and even whole genomes. Interphase chromosomes are not positioned randomly within the nucleus, but instead adopt preferred conformations. Disparate DNA elements co-localize into functionally defined aggregates or factories for transcription and DNA replication. In budding yeast, Drosophila and many other eukaryotes, chromosomes adopt a Rabl configuration, with arms extending from centromeres adjacent to the spindle pole body to telomeres that abut the nuclear envelope. Nonetheless, the topologies and spatial relationships of chromosomes remain poorly understood. Here we developed a method to globally capture intra- and inter-chromosomal interactions, and applied it to generate a map at kilobase resolution of the haploid genome of Saccharomyces cerevisiae. The map recapitulates known features of genome organization, thereby validating the method, and identifies new features. Extensive regional and higher order folding of individual chromosomes is observed. Chromosome XII exhibits a striking conformation that implicates the nucleolus as a formidable barrier to interaction between DNA sequences at either end. Inter-chromosomal contacts are anchored by centromeres and include interactions among transfer RNA genes, among origins of early DNA replication and among sites where chromosomal breakpoints occur. Finally, we constructed a three-dimensional model of the yeast genome. Our findings provide a glimpse of the interface between the form and function of a eukaryotic genome.

  14. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    Science.gov (United States)

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084

  15. Sequencing of chloroplast genome using whole cellular DNA and Solexa sequencing technology

    Directory of Open Access Journals (Sweden)

    Jian eWu

    2012-11-01

    Full Text Available Sequencing of the chloroplast genome using traditional sequencing methods has been difficult because of its size (>120 kb and the complicated procedures required to prepare templates. To explore the feasibility of sequencing the chloroplast genome using DNA extracted from whole cells and Solexa sequencing technology, we sequenced whole cellular DNA isolated from leaves of three Brassica rapa accessions with one lane per accession. In total, 246 Mb, 362Mb, 361 Mb sequence data were generated for the three accessions Chiifu-401-42, Z16 and FT, respectively. Microreads were assembled by reference-guided assembly using the cpDNA sequences of B. rapa, Arabidopsis thaliana, and Nicotiana tabacum. We achieved coverage of more than 99.96% of the cp genome in the three tested accessions using the B. rapa sequence as the reference. When A. thaliana or N. tabacum sequences were used as references, 99.7–99.8% or 95.5–99.7% of the B. rapa chloroplast genome was covered, respectively. These results demonstrated that sequencing of whole cellular DNA isolated from young leaves using the Illumina Genome Analyzer is an efficient method for high-throughput sequencing of chloroplast genome.

  16. Sequence Algebra, Sequence Decision Diagrams and Dynamic Fault Trees

    International Nuclear Information System (INIS)

    Rauzy, Antoine B.

    2011-01-01

    A large attention has been focused on the Dynamic Fault Trees in the past few years. By adding new gates to static (regular) Fault Trees, Dynamic Fault Trees aim to take into account dependencies among events. Merle et al. proposed recently an algebraic framework to give a formal interpretation to these gates. In this article, we extend Merle et al.'s work by adopting a slightly different perspective. We introduce Sequence Algebras that can be seen as Algebras of Basic Events, representing failures of non-repairable components. We show how to interpret Dynamic Fault Trees within this framework. Finally, we propose a new data structure to encode sets of sequences of Basic Events: Sequence Decision Diagrams. Sequence Decision Diagrams are very much inspired from Minato's Zero-Suppressed Binary Decision Diagrams. We show that all operations of Sequence Algebras can be performed on this data structure.

  17. The holocentric species Luzula elegans shows interplay between centromere and large-scale genome organization

    Czech Academy of Sciences Publication Activity Database

    Heckmann, S.; Macas, Jiří; Kumke, K.; Fuchs, J.; Schubert, V.; Ma, L.; Novák, Petr; Neumann, Pavel; Taudien, S.; Platzer, M.; Houben, A.

    2013-01-01

    Roč. 73, č. 4 (2013), s. 555-565 ISSN 0960-7412 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : Histone H3 * Polycentric chromosomes * Repetitive sequences * DNA-Replication Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 6.815, year: 2013

  18. Rapid Multiplex Small DNA Sequencing on the MinION Nanopore Sequencing Platform

    Directory of Open Access Journals (Sweden)

    Shan Wei

    2018-05-01

    Full Text Available Real-time sequencing of short DNA reads has a wide variety of clinical and research applications including screening for mutations, target sequences and aneuploidy. We recently demonstrated that MinION, a nanopore-based DNA sequencing device the size of a USB drive, could be used for short-read DNA sequencing. In this study, an ultra-rapid multiplex library preparation and sequencing method for the MinION is presented and applied to accurately test normal diploid and aneuploidy samples’ genomic DNA in under three hours, including library preparation and sequencing. This novel method shows great promise as a clinical diagnostic test for applications requiring rapid short-read DNA sequencing.

  19. Deep-sequencing protocols influence the results obtained in small-RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Joern Toedling

    Full Text Available Second-generation sequencing is a powerful method for identifying and quantifying small-RNA components of cells. However, little attention has been paid to the effects of the choice of sequencing platform and library preparation protocol on the results obtained. We present a thorough comparison of small-RNA sequencing libraries generated from the same embryonic stem cell lines, using different sequencing platforms, which represent the three major second-generation sequencing technologies, and protocols. We have analysed and compared the expression of microRNAs, as well as populations of small RNAs derived from repetitive elements. Despite the fact that different libraries display a good correlation between sequencing platforms, qualitative and quantitative variations in the results were found, depending on the protocol used. Thus, when comparing libraries from different biological samples, it is strongly recommended to use the same sequencing platform and protocol in order to ensure the biological relevance of the comparisons.

  20. Multilocus Sequence Typing of Total-Genome-Sequenced Bacteria

    DEFF Research Database (Denmark)

    Larsen, Mette Voldby; Cosentino, Salvatore; Rasmussen, Simon

    2012-01-01

    Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS...

  1. Roles of repetitive sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bell, G.I.

    1991-12-31

    The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.

  2. Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence.

    Science.gov (United States)

    Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E

    2016-01-01

    It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven

  3. Nonparametric combinatorial sequence models.

    Science.gov (United States)

    Wauthier, Fabian L; Jordan, Michael I; Jojic, Nebojsa

    2011-11-01

    This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences. If multiple such groups exist, complex interactions can emerge between them. Sequences of this kind arise frequently in biology but methodologies for analyzing them are still being developed. This article presents a nonparametric prior on sequences which allows combinatorial structures to emerge and which induces a posterior distribution over factorized sequence representations. We carry out experiments on three biological sequence families which indicate that combinatorial structures are indeed present and that combinatorial sequence models can more succinctly describe them than simpler mixture models. We conclude with an application to MHC binding prediction which highlights the utility of the posterior distribution over sequence representations induced by the prior. By integrating out the posterior, our method compares favorably to leading binding predictors.

  4. Rapid and Accurate Sequencing of Enterovirus Genomes Using MinION Nanopore Sequencer.

    Science.gov (United States)

    Wang, Ji; Ke, Yue Hua; Zhang, Yong; Huang, Ke Qiang; Wang, Lei; Shen, Xin Xin; Dong, Xiao Ping; Xu, Wen Bo; Ma, Xue Jun

    2017-10-01

    Knowledge of an enterovirus genome sequence is very important in epidemiological investigation to identify transmission patterns and ascertain the extent of an outbreak. The MinION sequencer is increasingly used to sequence various viral pathogens in many clinical situations because of its long reads, portability, real-time accessibility of sequenced data, and very low initial costs. However, information is lacking on MinION sequencing of enterovirus genomes. In this proof-of-concept study using Enterovirus 71 (EV71) and Coxsackievirus A16 (CA16) strains as examples, we established an amplicon-based whole genome sequencing method using MinION. We explored the accuracy, minimum sequencing time, discrimination and high-throughput sequencing ability of MinION, and compared its performance with Sanger sequencing. Within the first minute (min) of sequencing, the accuracy of MinION was 98.5% for the single EV71 strain and 94.12%-97.33% for 10 genetically-related CA16 strains. In as little as 14 min, 99% identity was reached for the single EV71 strain, and in 17 min (on average), 99% identity was achieved for 10 CA16 strains in a single run. MinION is suitable for whole genome sequencing of enteroviruses with sufficient accuracy and fine discrimination and has the potential as a fast, reliable and convenient method for routine use. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.

  5. LINE retrotransposon RNA is an essential structural and functional epigenetic component of a core neocentromeric chromatin.

    Directory of Open Access Journals (Sweden)

    Anderly C Chueh

    2009-01-01

    Full Text Available We have previously identified and characterized the phenomenon of ectopic human centromeres, known as neocentromeres. Human neocentromeres form epigenetically at euchromatic chromosomal sites and are structurally and functionally similar to normal human centromeres. Recent studies have indicated that neocentromere formation provides a major mechanism for centromere repositioning, karyotype evolution, and speciation. Using a marker chromosome mardel(10 containing a neocentromere formed at the normal chromosomal 10q25 region, we have previously mapped a 330-kb CENP-A-binding domain and described an increased prevalence of L1 retrotransposons in the underlying DNA sequences of the CENP-A-binding clusters. Here, we investigated the potential role of the L1 retrotransposons in the regulation of neocentromere activity. Determination of the transcriptional activity of a panel of full-length L1s (FL-L1s across a 6-Mb region spanning the 10q25 neocentromere chromatin identified one of the FL-L1 retrotransposons, designated FL-L1b and residing centrally within the CENP-A-binding clusters, to be transcriptionally active. We demonstrated the direct incorporation of the FL-L1b RNA transcripts into the CENP-A-associated chromatin. RNAi-mediated knockdown of the FL-L1b RNA transcripts led to a reduction in CENP-A binding and an impaired mitotic function of the 10q25 neocentromere. These results indicate that LINE retrotransposon RNA is a previously undescribed essential structural and functional component of the neocentromeric chromatin and that retrotransposable elements may serve as a critical epigenetic determinant in the chromatin remodelling events leading to neocentromere formation.

  6. PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities

    Directory of Open Access Journals (Sweden)

    Baldwin Stephen A

    2011-03-01

    Full Text Available Abstract Background Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. Results The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. Conclusions PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.

  7. PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.

    Science.gov (United States)

    Troshin, Peter V; Postis, Vincent Lg; Ashworth, Denise; Baldwin, Stephen A; McPherson, Michael J; Barton, Geoffrey J

    2011-03-07

    Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.

  8. Sequence embedding for fast construction of guide trees for multiple sequence alignment

    LENUS (Irish Health Repository)

    Blackshields, Gordon

    2010-05-14

    Abstract Background The most widely used multiple sequence alignment methods require sequences to be clustered as an initial step. Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires memory and time proportional to N 2 for N sequences. When N grows larger than 10,000 or so, this becomes increasingly prohibitive and can form a significant barrier to carrying out very large multiple alignments. Results In this paper, we have tested variations on a class of embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. Conclusions We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignment. Source code is available for download from http:\\/\\/www.clustal.org\\/mbed.tgz.

  9. The Schizosaccharomyces pombe JmjC-protein, Msc1, prevents H2A.Z localization in centromeric and subtelomeric chromatin domains.

    Directory of Open Access Journals (Sweden)

    Luke Buchanan

    2009-11-01

    Full Text Available Eukaryotic genomes are repetitively packaged into chromatin by nucleosomes, however they are regulated by the differences between nucleosomes, which establish various chromatin states. Local chromatin cues direct the inheritance and propagation of chromatin status via self-reinforcing epigenetic mechanisms. Replication-independent histone exchange could potentially perturb chromatin status if histone exchange chaperones, such as Swr1C, loaded histone variants into wrong sites. Here we show that in Schizosaccharomyces pombe, like Saccharomyces cerevisiae, Swr1C is required for loading H2A.Z into specific sites, including the promoters of lowly expressed genes. However S. pombe Swr1C has an extra subunit, Msc1, which is a JumonjiC-domain protein of the Lid/Jarid1 family. Deletion of Msc1 did not disrupt the S. pombe Swr1C or its ability to bind and load H2A.Z into euchromatin, however H2A.Z was ectopically found in the inner centromere and in subtelomeric chromatin. Normally this subtelomeric region not only lacks H2A.Z but also shows uniformly lower levels of H3K4me2, H4K5, and K12 acetylation than euchromatin and disproportionately contains the most lowly expressed genes during vegetative growth, including many meiotic-specific genes. Genes within and adjacent to subtelomeric chromatin become overexpressed in the absence of either Msc1, Swr1, or paradoxically H2A.Z itself. We also show that H2A.Z is N-terminally acetylated before, and lysine acetylated after, loading into chromatin and that it physically associates with the Nap1 histone chaperone. However, we find a negative correlation between the genomic distributions of H2A.Z and Nap1/Hrp1/Hrp3, suggesting that the Nap1 chaperones remove H2A.Z from chromatin. These data describe H2A.Z action in S. pombe and identify a new mode of chromatin surveillance and maintenance based on negative regulation of histone variant misincorporation.

  10. Determination of HER2 and p53 Mutations by Sequence Analysis Method and EGFR/Chromosome 7 Gene Status by Fluorescence in Situ Hybridization for the Predilection of Targeted Therapy Modalities in Immunohistochemically Triple Negative Breast Carcinomas in Turkish Population.

    Science.gov (United States)

    Pala, Emel Ebru; Bayol, Umit; Keskin, Elif Usturali; Ozguzer, Alp; Kucuk, Ulku; Ozer, Ozge; Koc, Altug

    2015-09-01

    Triple negative breast cancer (TNBC), an agressive subtype accounts nearly 15 % of all breast carcinomas. Conventional chemotherapy is the only treatment modality thus new, effective targeted therapy methods have been investigated. Epidermal growth factor receptor (EGFR) inhibitors give hope according to the recent studies results. Also therapeutic agents have been tried against aberrant p53 signal activity as TNBC show high p53 mutation rates. Our aim was to detect the incidence of mutations/amplifications identified in TNBC in our population. Here we used sequence analysis to detect HER2 (exon 18-23), p53 (exon 5-8) mutations; fluorescence in situ hybridization (FISH) method to analyse EGFR/chromosome 7 centromere gene status in 82 immunohistochemically TNBC. Basaloid phenotype was identified in 49 (59.8 %) patients. EGFR amplification was noted in 5 cases (6.1 %). All EGFR amplified cases showed EGFR overexpression by immunohistochemistry (IHC). p53 mutations were identified in 33 (40.2 %) cases. Almost 60 % of the basal like breast cancer cases showed p53 mutation. Only one case showed HER2 mutation (exon 20:g.36830_3). Our results showed that gene amplification is not the unique mechanism in EGFR overexpression. IHC might be used in the decision of anti-EGFR therapy in routine practice. p53 mutation rate was lower than the rates reported in the literature probably due to ethnic differences and low sensitivity of sanger sequences in general mutation screening. We also established the rarity of HER2 mutation in TNBC. In conclusion EGFR and p53 are the major targets in TNBC also for our population.

  11. Sequencing of BAC pools by different next generation sequencing platforms and strategies

    Directory of Open Access Journals (Sweden)

    Scholz Uwe

    2011-10-01

    Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.

  12. Long sequence correlation coprocessor

    Science.gov (United States)

    Gage, Douglas W.

    1994-09-01

    A long sequence correlation coprocessor (LSCC) accelerates the bitwise correlation of arbitrarily long digital sequences by calculating in parallel the correlation score for 16, for example, adjacent bit alignments between two binary sequences. The LSCC integrated circuit is incorporated into a computer system with memory storage buffers and a separate general purpose computer processor which serves as its controller. Each of the LSCC's set of sequential counters simultaneously tallies a separate correlation coefficient. During each LSCC clock cycle, computer enable logic associated with each counter compares one bit of a first sequence with one bit of a second sequence to increment the counter if the bits are the same. A shift register assures that the same bit of the first sequence is simultaneously compared to different bits of the second sequence to simultaneously calculate the correlation coefficient by the different counters to represent different alignments of the two sequences.

  13. Next-Generation Sequencing Platforms

    Science.gov (United States)

    Mardis, Elaine R.

    2013-06-01

    Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.

  14. Optimization of sequence alignment for simple sequence repeat regions

    Directory of Open Access Journals (Sweden)

    Ogbonnaya Francis C

    2011-07-01

    Full Text Available Abstract Background Microsatellites, or simple sequence repeats (SSRs, are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs. SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic

  15. Adaptive Processing for Sequence Alignment

    KAUST Repository

    Zidan, Mohammed A.; Bonny, Talal; Salama, Khaled N.

    2012-01-01

    Disclosed are various embodiments for adaptive processing for sequence alignment. In one embodiment, among others, a method includes obtaining a query sequence and a plurality of database sequences. A first portion of the plurality of database sequences is distributed to a central processing unit (CPU) and a second portion of the plurality of database sequences is distributed to a graphical processing unit (GPU) based upon a predetermined splitting ratio associated with the plurality of database sequences, where the database sequences of the first portion are shorter than the database sequences of the second portion. A first alignment score for the query sequence is determined with the CPU based upon the first portion of the plurality of database sequences and a second alignment score for the query sequence is determined with the GPU based upon the second portion of the plurality of database sequences.

  16. Adaptive Processing for Sequence Alignment

    KAUST Repository

    Zidan, Mohammed A.

    2012-01-26

    Disclosed are various embodiments for adaptive processing for sequence alignment. In one embodiment, among others, a method includes obtaining a query sequence and a plurality of database sequences. A first portion of the plurality of database sequences is distributed to a central processing unit (CPU) and a second portion of the plurality of database sequences is distributed to a graphical processing unit (GPU) based upon a predetermined splitting ratio associated with the plurality of database sequences, where the database sequences of the first portion are shorter than the database sequences of the second portion. A first alignment score for the query sequence is determined with the CPU based upon the first portion of the plurality of database sequences and a second alignment score for the query sequence is determined with the GPU based upon the second portion of the plurality of database sequences.

  17. Direct chloroplast sequencing: comparison of sequencing platforms and analysis tools for whole chloroplast barcoding.

    Directory of Open Access Journals (Sweden)

    Marta Brozynska

    Full Text Available Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina and Ion Torrent (Life Technology sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare. Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis.

  18. Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

    Science.gov (United States)

    Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

    2017-07-01

    PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.

  19. Allele Re-sequencing Technologies

    DEFF Research Database (Denmark)

    Byrne, Stephen; Farrell, Jacqueline Danielle; Asp, Torben

    2013-01-01

    The development of next-generation sequencing technologies has made sequencing an affordable approach for detection of genetic variations associated with various traits. However, the cost of whole genome re-sequencing still remains too high to be feasible for many plant species with large...... alternative to whole genome re-sequencing to identify causative genetic variations in plants. One challenge, however, will be efficient bioinformatics strategies for data handling and analysis from the increasing amount of sequence information....

  20. Design of Long Period Pseudo-Random Sequences from the Addition of -Sequences over

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of -sequences with pairwise-prime linear spans (AMPLS. Using -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to , a signal set is constructed.

  1. Error Analysis of Deep Sequencing of Phage Libraries: Peptides Censored in Sequencing

    Directory of Open Access Journals (Sweden)

    Wadim L. Matochko

    2013-01-01

    Full Text Available Next-generation sequencing techniques empower selection of ligands from phage-display libraries because they can detect low abundant clones and quantify changes in the copy numbers of clones without excessive selection rounds. Identification of errors in deep sequencing data is the most critical step in this process because these techniques have error rates >1%. Mechanisms that yield errors in Illumina and other techniques have been proposed, but no reports to date describe error analysis in phage libraries. Our paper focuses on error analysis of 7-mer peptide libraries sequenced by Illumina method. Low theoretical complexity of this phage library, as compared to complexity of long genetic reads and genomes, allowed us to describe this library using convenient linear vector and operator framework. We describe a phage library as N×1 frequency vector n=ni, where ni is the copy number of the ith sequence and N is the theoretical diversity, that is, the total number of all possible sequences. Any manipulation to the library is an operator acting on n. Selection, amplification, or sequencing could be described as a product of a N×N matrix and a stochastic sampling operator (Sa. The latter is a random diagonal matrix that describes sampling of a library. In this paper, we focus on the properties of Sa and use them to define the sequencing operator (Seq. Sequencing without any bias and errors is Seq=Sa IN, where IN is a N×N unity matrix. Any bias in sequencing changes IN to a nonunity matrix. We identified a diagonal censorship matrix (CEN, which describes elimination or statistically significant downsampling, of specific reads during the sequencing process.

  2. Log-balanced combinatorial sequences

    Directory of Open Access Journals (Sweden)

    Tomislav Došlic

    2005-01-01

    Full Text Available We consider log-convex sequences that satisfy an additional constraint imposed on their rate of growth. We call such sequences log-balanced. It is shown that all such sequences satisfy a pair of double inequalities. Sufficient conditions for log-balancedness are given for the case when the sequence satisfies a two- (or more- term linear recurrence. It is shown that many combinatorially interesting sequences belong to this class, and, as a consequence, that the above-mentioned double inequalities are valid for all of them.

  3. Clinical evaluation of further-developed MRCP sequences in comparison with standard MRCP sequences

    International Nuclear Information System (INIS)

    Hundt, W.; Scheidler, J.; Reiser, M.; Petsch, R.

    2002-01-01

    The purpose of this study was the comparison of technically improved single-shot magnetic resonance cholangiopancreatography (MRCP) sequences with standard single-shot rapid acquisition with relaxation enhancement (RARE) and half-Fourier acquired single-shot turbo spin-echo (HASTE) sequences in evaluating the normal and abnormal biliary duct system. The bile duct system of 45 patients was prospectively investigated on a 1.5-T MRI system. The investigation was performed with RARE and HASTE MR cholangiography sequences with standard and high spatial resolutions, and with a delayed-echo half-Fourier RARE (HASTE) sequence. Findings of the improved MRCP sequences were compared with the standard MRCP sequences. The level of confidence in assessing the diagnosis was divided into five groups. The Wilcoxon signed-rank test at a level of p<0.05 was applied. In 15 patients no pathology was found. The MRCP showed stenoses of the bile duct system in 10 patients and choledocholithiasis and cholecystolithiasis in 16 patients. In 12 patients a dilatation of the bile duct system was found. Comparison of the low- and high spatial resolution sequences and the short and long TE times of the half-Fourier RARE (HASTE) sequence revealed no statistically significant differences regarding accuracy of the examination. The diagnostic confidence level in assessing normal or pathological findings for the high-resolution RARE and half-Fourier RARE (HASTE) was significantly better than for the standard sequences. For the delayed-echo half-Fourier RARE (HASTE) sequence no statistically significant difference was seen. The high-resolution RARE and half-Fourier RARE (HASTE) sequences had a higher confidence level, but there was no significant difference in diagnosis in terms of detection and assessment of pathological changes in the biliary duct system compared with standard sequences. (orig.)

  4. Polyadenylated Sequencing Primers Enable Complete Readability of PCR Amplicons Analyzed by Dideoxynucleotide Sequencing

    Directory of Open Access Journals (Sweden)

    Martin Beránek

    2012-01-01

    Full Text Available Dideoxynucleotide DNA sequencing is one of the principal procedures in molecular biology. Loss of an initial part of nucleotides behind the 3' end of the sequencing primer limits the readability of sequenced amplicons. We present a method which extends the readability by using sequencing primers modified by polyadenylated tails attached to their 5' ends. Performing a polymerase chain reaction, we amplified eight amplicons of six human genes (AMELX, APOE, HFE, MBL2, SERPINA1 and TGFB1 ranging from 106 bp to 680 bp. Polyadenylation of the sequencing primers minimized the loss of bases in all amplicons. Complete sequences of shorter products (AMELX 106 bp, SERPINA1 121 bp, HFE 208 bp, APOE 244 bp, MBL2 317 bp were obtained. In addition, in the case of TGFB1 products (366 bp, 432 bp, and 680 bp, respectively, the lengths of sequencing readings were significantly longer if adenylated primers were used. Thus, single strand dideoxynucleotide sequencing with adenylated primers enables complete or near complete readability of short PCR amplicons.

  5. Next-generation sequencing technology a new tool for killer cell immunoglobulin-like receptor allele typing in hematopoietic stem cell transplantation.

    Science.gov (United States)

    Maniangou, B; Retière, C; Gagne, K

    2018-02-01

    Killer cell Immunoglobulin-like Receptor (KIR) genes are a family of genes located together within the leukocyte receptor cluster on human chromosome 19q13.4. To date, 17 KIR genes have been identified including nine inhibitory genes (2DL1/L2/L3/L4/L5A/L5B, 3DL1/L2/L3), six activating genes (2DS1/S2/S3/S4/S5, 3DS1) and two pseudogenes (2DP1, 3DP1) classified into group A (KIR A) and group B (KIR B) haplotypes. The number and the nature of KIR genes vary between the individuals. In addition, these KIR genes are known to be polymorphic at allelic level (907 alleles described in July 2017). KIR genes encode for receptors which are predominantly expressed by Natural Killer (NK) cells. KIR receptors recognize HLA class I molecules and are able to kill residual recipient leukemia cells, and thus reduce the likelihood of relapse. KIR alleles of Hematopoietic Stem Cell (HSC) donor would require to be known (Alicata et al. Eur J Immunol 2016) because the KIR allele polymorphism may affect both the KIR + NK cell phenotype and function (Gagne et al. Eur J Immunol 2013; Bari R, et al. Sci Rep 2016) as well as HSCT outcome (Boudreau et al. JCO 2017). The introduction of the Next Generation Sequencing (NGS) has overcome current conventional DNA sequencing method limitations, known to be time consuming. Recently, a novel NGS KIR allele typing approach of all KIR genes was developed by our team in Nantes from 30 reference DNAs (Maniangou et al. Front in Immunol 2017). This NGS KIR allele typing approach is simple, fast, reliable, specific and showed a concordance rate of 95% for centromeric and telomeric KIR genes in comparison with high-resolution KIR typing obtained to those published data using exome capture (Norman PJ et al. Am J Hum Genet 2016). This NGS KIR allele typing approach may also be used in reproduction and to better study KIR + NK cell implication in the control of viral infections. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  6. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Science.gov (United States)

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697

  7. Sequence Read Archive (SRA)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Sequence Read Archive (SRA) stores raw sequencing data from the next generation of sequencing platforms including Roche 454 GS System®, Illumina Genome...

  8. Heterochromatin and molecular characterization of DsmarMITE transposable element in the beetle Dichotomius schiffleri (Coleoptera: Scarabaeidae).

    Science.gov (United States)

    Xavier, Crislaine; Cabral-de-Mello, Diogo Cavalcanti; de Moura, Rita Cássia

    2014-12-01

    Cytogenetic studies of the Neotropical beetle genus Dichotomius (Scarabaeinae, Coleoptera) have shown dynamism for centromeric constitutive heterochromatin sequences. In the present work we studied the chromosomes and isolated repetitive sequences of Dichotomius schiffleri aiming to contribute to the understanding of coleopteran genome/chromosomal organization. Dichotomius schiffleri presented a conserved karyotype and heterochromatin distribution in comparison to other species of the genus with 2n = 18, biarmed chromosomes, and pericentromeric C-positive blocks. Similarly to heterochromatin distributional patterns, the highly and moderately repetitive DNA fraction (C 0 t-1 DNA) was detected in pericentromeric areas, contrasting with the euchromatic mapping of an isolated TE (named DsmarMITE). After structural analyses, the DsmarMITE was classified as a non-autonomous element of the type miniature inverted-repeat transposable element (MITE) with terminal inverted repeats similar to Mariner elements of insects from different orders. The euchromatic distribution for DsmarMITE indicates that it does not play a part in the dynamics of constitutive heterochromatin sequences.

  9. Bellerophon: a program to detect chimeric sequences in multiple sequence alignments.

    Science.gov (United States)

    Huber, Thomas; Faulkner, Geoffrey; Hugenholtz, Philip

    2004-09-22

    Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaption of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments. Bellerophon is available as an interactive web server at http://foo.maths.uq.edu.au/~huber/bellerophon.pl

  10. Supernumerary ring chromosome 20 characterized by fluorescence in situ hybridization

    NARCIS (Netherlands)

    Van Langen, Irene M.; Otter, Mariëlle A.; Aronson, Daniël C.; Overweg-Plandsoen, W.C.G.; Hennekam, Raoul C.M.; Leschot, Nico J.; Hoovers, Jan M.N.

    1996-01-01

    We report on a boy with mild dysmorphic features and developmental delay, in whom karyotyping showed an additional minute ring chromosome in 60% of metaphases. Fluorescence in situ hybridization (FISH) with a centromere specific probe demonstrated that the ring chromosome contained the centromeric

  11. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    DEFF Research Database (Denmark)

    de Souza, S J; Camargo, A A; Briones, M R

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central ...

  12. Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

    Science.gov (United States)

    Martin, Andrew C R

    2014-01-01

    The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.

  13. Automated cleaning and pre-processing of immunoglobulin gene sequences from high-throughput sequencing

    Directory of Open Access Journals (Sweden)

    Miri eMichaeli

    2012-12-01

    Full Text Available High throughput sequencing (HTS yields tens of thousands to millions of sequences that require a large amount of pre-processing work to clean various artifacts. Such cleaning cannot be performed manually. Existing programs are not suitable for immunoglobulin (Ig genes, which are variable and often highly mutated. This paper describes Ig-HTS-Cleaner (Ig High Throughput Sequencing Cleaner, a program containing a simple cleaning procedure that successfully deals with pre-processing of Ig sequences derived from HTS, and Ig-Indel-Identifier (Ig Insertion – Deletion Identifier, a program for identifying legitimate and artifact insertions and/or deletions (indels. Our programs were designed for analyzing Ig gene sequences obtained by 454 sequencing, but they are applicable to all types of sequences and sequencing platforms. Ig-HTS-Cleaner and Ig-Indel-Identifier have been implemented in Java and saved as executable JAR files, supported on Linux and MS Windows. No special requirements are needed in order to run the programs, except for correctly constructing the input files as explained in the text. The programs' performance has been tested and validated on real and simulated data sets.

  14. Multilocus Sequence Analysis and rpoB Sequencing of Mycobacterium abscessus (Sensu Lato) Strains▿

    Science.gov (United States)

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-01-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536T, M. massiliense CIP 108297T, and M. bolletii CIP 108541T) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the clustering

  15. Multilocus sequence analysis and rpoB sequencing of Mycobacterium abscessus (sensu lato) strains.

    Science.gov (United States)

    Macheras, Edouard; Roux, Anne-Laure; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby; Bodmer, Thomas; Cambau, Emmanuelle; Gaillard, Jean-Louis; Heym, Beate

    2011-02-01

    Mycobacterium abscessus, Mycobacterium bolletii, and Mycobacterium massiliense (Mycobacterium abscessus sensu lato) are closely related species that currently are identified by the sequencing of the rpoB gene. However, recent studies show that rpoB sequencing alone is insufficient to discriminate between these species, and some authors have questioned their current taxonomic classification. We studied here a large collection of M. abscessus (sensu lato) strains by partial rpoB sequencing (752 bp) and multilocus sequence analysis (MLSA). The final MLSA scheme developed was based on the partial sequences of eight housekeeping genes: argH, cya, glpK, gnd, murC, pgm, pta, and purH. The strains studied included the three type strains (M. abscessus CIP 104536(T), M. massiliense CIP 108297(T), and M. bolletii CIP 108541(T)) and 120 isolates recovered between 1997 and 2007 in France, Germany, Switzerland, and Brazil. The rpoB phylogenetic tree confirmed the existence of three main clusters, each comprising the type strain of one species. However, divergence values between the M. massiliense and M. bolletii clusters all were below 3% and between the M. abscessus and M. massiliense clusters were from 2.66 to 3.59%. The tree produced using the concatenated MLSA gene sequences (4,071 bp) also showed three main clusters, each comprising the type strain of one species. The M. abscessus cluster had a bootstrap value of 100% and was mostly compact. Bootstrap values for the M. massiliense and M. bolletii branches were much lower (71 and 61%, respectively), with the M. massiliense cluster having a fuzzy aspect. Mean (range) divergence values were 2.17% (1.13 to 2.58%) between the M. abscessus and M. massiliense clusters, 2.37% (1.5 to 2.85%) between the M. abscessus and M. bolletii clusters, and 2.28% (0.86 to 2.68%) between the M. massiliense and M. bolletii clusters. Adding the rpoB sequence to the MLSA-concatenated sequence (total sequence, 4,823 bp) had little effect on the

  16. ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS

    Directory of Open Access Journals (Sweden)

    Alves-Ferreira Marcelo

    2008-09-01

    Full Text Available Abstract Background Genome survey sequences (GSS offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454-reads of an Escheria coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties

  17. sequenceMiner algorithm

    Data.gov (United States)

    National Aeronautics and Space Administration — Detecting and describing anomalies in large repositories of discrete symbol sequences. sequenceMiner has been open-sourced! Download the file below to try it out....

  18. Molecular cytogenetic of the Amoy croaker, Argyrosomus amoyensis (Teleostei, Sciaenidae)

    Science.gov (United States)

    Liao, Mengxiang; Zheng, Jiao; Wang, Zhiyong; Wang, Yilei; Zhang, Jing; Cai, Mingyi

    2017-08-01

    The family Sciaenidae is remarkable for its species richness and economic importance. However, the cytogenetic data available in this fish group are still limited, especially those obtained using fluorescence in situ hybridization (FISH). In the present study, the chromosome characteristics of a sciaenid species, Argyrosomus amoyensis, were examined with several cytogenetic methods, including dual-FISH with 18S and 5S rDNA probes, and a self-genomic in situ hybridization procedure (Self-GISH). The karyotype of A. amoyensis comprised 2n=48 acrocentric chromosomes. A single pair of nucleolar organizer regions (NORs) was located at the proximal position of chromosome 1, which was positive for silver nitrate impregnation (AgNO3) staining and denaturation-propidium iodide (DPI) staining but negative for Giemsa staining and 4',6-diamidino-2-phenylindole (DAPI) staining, and was confirmed by FISH with 18S rDNA probes. The 5S rDNA sites were located at the centromeric region of chromosome 3. Telomeric FISH signals were detected at all chromosome ends with different intensities, but internal telomeric sequences (ITSs) were not found. Self-GISH resulted in strong signals distributed at the centromeric regions of all chromosomes. C-banding revealed not only centromeric heterochromatin, but also heterochromatin that located on NORs, in interstitial and distal telomeric regions of specific chromosomes. These results suggest that the karyotype of Amoy croaker was relatively conserved and primitive. By comparison with the reported cytogenetic data of other sciaenids, it can be deduced that although the karyotypic macrostructure and chromosomal localization of 18S rDNA are conserved, the distribution of 5S rDNA varies dynamically among sciaenid species. Thus, the 5S rDNA sites may have different evolutionary dynamics in relation to other chromosomal regions, and have the potential to be effective cytotaxonomic markers in Sciaenidae.

  19. Mammalian ChlR1 has a role in heterochromatin organization

    International Nuclear Information System (INIS)

    Inoue, Akira; Hyle, Judith; Lechner, Mark S.; Lahti, Jill M.

    2011-01-01

    The ChlR1 DNA helicase, encoded by DDX11 gene, which is responsible for Warsaw breakage syndrome (WABS), has a role in sister-chromatid cohesion. In this study, we show that human ChlR1 deficient cells exhibit abnormal heterochromatin organization. While constitutive heterochromatin is discretely localized at perinuclear and perinucleolar regions in control HeLa cells, ChlR1-depleted cells showed dispersed localization of constitutive heterochromatin accompanied by disrupted centromere clustering. Cells isolated from Ddx11 -/- embryos also exhibited diffuse localization of centromeres and heterochromatin foci. Similar abnormalities were found in HeLa cells depleted of combinations of HP1α and HP1β. Immunofluorescence and chromatin immunoprecipitation showed a decreased level of HP1α at pericentric regions in ChlR1-depleted cells. Trimethyl-histone H3 at lysine 9 (H3K9-me3) was also modestly decreased at pericentric sequences. The abnormality in pericentric heterochromatin was further supported by decreased DNA methylation within major satellite repeats of Ddx11 -/- embryos. Furthermore, micrococcal nuclease (MNase) assay revealed a decreased chromatin density at the telomeres. These data suggest that in addition to a role in sister-chromatid cohesion, ChlR1 is also involved in the proper formation of heterochromatin, which in turn contributes to global nuclear organization and pleiotropic effects. -- Highlights: → New role for ChlR1 (DDX11), a cohesinopathy gene, in heterochromatin organization. → Loss of ChlR1 altered heterochromatin localization and centromere clustering. → Reduced ChlR1 levels also reduced HP1α and H3K9-me3 binding to pericentric DNA. → Decreased DNA methylation was found in pericentric repeats of Ddx11 -/- embryos. → These findings will aid in understanding the pathogenesis of Warsaw breakage syndrome.

  20. Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.

    Science.gov (United States)

    Harvey, Michael G; Smith, Brian Tilston; Glenn, Travis C; Faircloth, Brant C; Brumfield, Robb T

    2016-09-01

    Sequence capture and restriction site associated DNA sequencing (RAD-Seq) are two genomic enrichment strategies for applying next-generation sequencing technologies to systematics studies. At shallow timescales, such as within species, RAD-Seq has been widely adopted among researchers, although there has been little discussion of the potential limitations and benefits of RAD-Seq and sequence capture. We discuss a series of issues that may impact the utility of sequence capture and RAD-Seq data for shallow systematics in non-model species. We review prior studies that used both methods, and investigate differences between the methods by re-analyzing existing RAD-Seq and sequence capture data sets from a Neotropical bird (Xenops minutus). We suggest that the strengths of RAD-Seq data sets for shallow systematics are the wide dispersion of markers across the genome, the relative ease and cost of laboratory work, the deep coverage and read overlap at recovered loci, and the high overall information that results. Sequence capture's benefits include flexibility and repeatability in the genomic regions targeted, success using low-quality samples, more straightforward read orthology assessment, and higher per-locus information content. The utility of a method in systematics, however, rests not only on its performance within a study, but on the comparability of data sets and inferences with those of prior work. In RAD-Seq data sets, comparability is compromised by low overlap of orthologous markers across species and the sensitivity of genetic diversity in a data set to an interaction between the level of natural heterozygosity in the samples examined and the parameters used for orthology assessment. In contrast, sequence capture of conserved genomic regions permits interrogation of the same loci across divergent species, which is preferable for maintaining comparability among data sets and studies for the purpose of drawing general conclusions about the impact of

  1. Extensive gene conversion at the PMS2 DNA mismatch repair locus.

    Science.gov (United States)

    Hayward, Bruce E; De Vos, Michel; Valleley, Elizabeth M A; Charlton, Ruth S; Taylor, Graham R; Sheridan, Eamonn; Bonthron, David T

    2007-05-01

    Mutations of the PMS2 DNA repair gene predispose to a characteristic range of malignancies, with either childhood onset (when both alleles are mutated) or a partially penetrant adult onset (if heterozygous). These mutations have been difficult to detect, due to interference from a family of pseudogenes located on chromosome 7. One of these, the PMS2CL pseudogene, lies within a 100-kb inverted duplication (inv dup), 700 kb centromeric to PMS2 itself on 7p22. Here, we show that the reference genomic sequences cannot be relied upon to distinguish PMS2 from PMS2CL, because of sequence transfer between the two loci. The 7p22 inv dup occurred prior to the divergence of modern ape species (15 million years ago [Mya]), but has undergone extensive sequence homogenization. This process appears to be ongoing, since there is considerable allelic diversity within the duplicated region, much of it derived from sequence exchange between PMS2 and PMS2CL. This sequence diversity can result in both false-positive and false-negative mutation analysis at this locus. Great caution is still needed in the design and interpretation of PMS2 mutation screens. 2007 Wiley-Liss, Inc.

  2. Slit-scanning technique using standard cell sorter instruments for analyzing and sorting nonacrocentric human chromosomes, including small ones

    NARCIS (Netherlands)

    Rens, W.; van Oven, C. H.; Stap, J.; Jakobs, M. E.; Aten, J. A.

    1994-01-01

    We have investigated the performance of two types of standard flow cell sorter instruments, a System 50 Cytofluorograph and a FACSTar PLUS cell sorter, for the on-line centromeric index (CI) analysis of human chromosomes. To optimize the results, we improved the detection efficiency for centromeres

  3. A few Smarandache Integer Sequences

    OpenAIRE

    Ibstedt, Henry

    2010-01-01

    This paper deals with the analysis of a few Smarandache Integer Sequences which first appeared in Properties or the Numbers, F. Smarandache, University or Craiova Archives, 1975. The first four sequences are recurrence generated sequences while the last three are concatenation sequences.

  4. Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

    Science.gov (United States)

    2012-01-01

    Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence

  5. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  6. Sequence Matters but How Exactly? A Method for Evaluating Activity Sequences from Data

    Science.gov (United States)

    Doroudi, Shayan; Holstein, Kenneth; Aleven, Vincent; Brunskill, Emma

    2016-01-01

    How should a wide variety of educational activities be sequenced to maximize student learning? Although some experimental studies have addressed this question, educational data mining methods may be able to evaluate a wider range of possibilities and better handle many simultaneous sequencing constraints. We introduce Sequencing Constraint…

  7. A family of DNA repeats in Aspergillus nidulans has assimilated degenerated retrotransposons

    DEFF Research Database (Denmark)

    Nielsen, M.L.; Hermansen, T.D.; Aleksenko, Alexei Y.

    2001-01-01

    In the course of a chromosomal walk towards the centromere of chromosome IV of Aspergillus nidulans, several cross- hybridizing genomic cosmid clones were isolated. Restriction mapping of two such clones revealed that their restriction patterns were similar in a region of at least 15 kb, indicati......) phenomenon, first described in Neurospora crassa, may have operated in A. nidulans. The data indicate that this family of repeats has assimilated mobile elements that subsequently degenerated but then underwent further duplications as a part of the host repeats....... the presence of a large repeat. The nature of the repeat was further investigated by sequencing and Southern analysis. The study revealed a family of long dispersed repeats with a high degree of sequence similarity. The number and location of the repeats vary between wild isolates. Two copies of the repeat...

  8. Role of Securin, Separase and Cohesins in female meiosis and polar body formation in Drosophila.

    Science.gov (United States)

    Guo, Zhihao; Batiha, Osamah; Bourouh, Mohammed; Fifield, Eric; Swan, Andrew

    2016-02-01

    Chromosome segregation in meiosis is controlled by a conserved pathway that culminates in Separase-mediated cleavage of the α-kleisin Rec8, leading to dissolution of cohesin rings. Drosophila has no gene encoding Rec8, and the absence of a known Separase target raises the question of whether Separase and its regulator Securin (Pim in Drosophila) are important in Drosophila meiosis. Here, we investigate the role of Securin, Separase and the cohesin complex in female meiosis using fluorescence in situ hybridization against centromeric and arm-specific sequences to monitor cohesion. We show that Securin destruction and Separase activity are required for timely release of arm cohesion in anaphase I and centromere-proximal cohesion in anaphase II. They are also required for release of arm cohesion on polar body chromosomes. Cohesion on polar body chromosomes depends on the cohesin components SMC3 and the mitotic α-kleisin Rad21 (also called Vtd in Drosophila). We provide cytological evidence that SMC3 is required for arm cohesion in female meiosis, whereas Rad21, in agreement with recent findings, is not. We conclude that in Drosophila meiosis, cohesion is regulated by a conserved Securin-Separase pathway that targets a diverged Separase target, possibly within the cohesin complex. © 2016. Published by The Company of Biologists Ltd.

  9. Mapping sequences by parts

    Directory of Open Access Journals (Sweden)

    Guziolowski Carito

    2007-09-01

    Full Text Available Abstract Background: We present the N-map method, a pairwise and asymmetrical approach which allows us to compare sequences by taking into account evolutionary events that produce shuffled, reversed or repeated elements. Basically, the optimal N-map of a sequence s over a sequence t is the best way of partitioning the first sequence into N parts and placing them, possibly complementary reversed, over the second sequence in order to maximize the sum of their gapless alignment scores. Results: We introduce an algorithm computing an optimal N-map with time complexity O (|s| × |t| × N using O (|s| × |t| × N memory space. Among all the numbers of parts taken in a reasonable range, we select the value N for which the optimal N-map has the most significant score. To evaluate this significance, we study the empirical distributions of the scores of optimal N-maps and show that they can be approximated by normal distributions with a reasonable accuracy. We test the functionality of the approach over random sequences on which we apply artificial evolutionary events. Practical Application: The method is illustrated with four case studies of pairs of sequences involving non-standard evolutionary events.

  10. From Sequence to Morphology - Long-Range Correlations in Complete Sequenced Genomes

    NARCIS (Netherlands)

    T.A. Knoch (Tobias)

    2004-01-01

    textabstractThe largely unresolved sequential organization, i.e. the relations within DNA sequences, and its connection to the three-dimensional organization of genomes was investigated by correlation analyses of completely sequenced chromosomes from Viroids, Archaea, Bacteria, Arabidopsis

  11. Antibodies to SS-A/Ro-52kD and centromere in autoimmune liver disease: a clue to diagnosis and prognosis of primary biliary cirrhosis.

    Science.gov (United States)

    Granito, A; Muratori, P; Muratori, L; Pappas, G; Cassani, F; Worthington, J; Ferri, S; Quarneti, C; Cipriano, V; de Molo, C; Lenzi, M; Chapman, R W; Bianchi, F B

    2007-09-15

    Primary biliary cirrhosis (PBC) may be associated with various rheumatological disorders. To investigate the frequency and significance of 'rheumatological' antinuclear antibodies in the field of autoimmune chronic liver disease, with special regard to PBC. We studied 105 patients with PBC, 162 autoimmune liver disease controls (type 1 and 2 autoimmune hepatitis, primary sclerosing cholangitis), 30 systemic lupus erythematosus and 50 blood donors. Sera were tested for the presence of antibodies to extractable nuclear antigens (anti-ENA) by counterimmunoelectrophoresis, enzyme-linked and immunoblot (IB) assay, and for the presence of anti-centromere antibodies (ACA) by indirect immunofluorescence on HEp-2 cells and IB. The overall prevalence of IB-detected anti-ENA in PBC (30%) was higher than in type 1 autoimmune hepatitis (2.5%, P < 0.0001), type 2 autoimmune hepatitis (0%, P < 0.0001) and primary sclerosing cholangitis (11.5%, P = 0.006) and lower than in systemic lupus erythematosus (53%, P = 0.03). The most frequent anti-ENA reactivity in PBC was anti-SSA/Ro-52kD (28%). ACA were detected by IB in 21% PBC patients and never in the other subjects (P < 0.0001). Anti-SS-A/Ro/52kD positive PBC patients had at the time of diagnosis a more advanced histological stage (P = 0.01) and higher serum levels of bilirubin (P = 0.01) and IgM (P = 0.03) compared with negative ones. In the autoimmune liver disease setting, anti-SS-A/Ro-52kD and ACA have a high specificity for PBC and can thus be of diagnostic relevance in anti-mitochondrial antibodies negative cases. If confirmed in further studies with adequate follow-up, anti-SS-A/Ro-52kD antibodies might identify PBC patients with a more advanced and active disease.

  12. Biomolecule Sequencer: Next-Generation DNA Sequencing Technology for In-Flight Environmental Monitoring, Research, and Beyond

    Science.gov (United States)

    Smith, David J.; Burton, Aaron; Castro-Wallace, Sarah; John, Kristen; Stahl, Sarah E.; Dworkin, Jason Peter; Lupisella, Mark L.

    2016-01-01

    On the International Space Station (ISS), technologies capable of rapid microbial identification and disease diagnostics are not currently available. NASA still relies upon sample return for comprehensive, molecular-based sample characterization. Next-generation DNA sequencing is a powerful approach for identifying microorganisms in air, water, and surfaces onboard spacecraft. The Biomolecule Sequencer payload, manifested to SpaceX-9 and scheduled on the Increment 4748 research plan (June 2016), will assess the functionality of a commercially-available next-generation DNA sequencer in the microgravity environment of ISS. The MinION device from Oxford Nanopore Technologies (Oxford, UK) measures picoamp changes in electrical current dependent on nucleotide sequences of the DNA strand migrating through nanopores in the system. The hardware is exceptionally small (9.5 x 3.2 x 1.6 cm), lightweight (120 grams), and powered only by a USB connection. For the ISS technology demonstration, the Biomolecule Sequencer will be powered by a Microsoft Surface Pro3. Ground-prepared samples containing lambda bacteriophage, Escherichia coli, and mouse genomic DNA, will be launched and stored frozen on the ISS until experiment initiation. Immediately prior to sequencing, a crew member will collect and thaw frozen DNA samples, connect the sequencer to the Surface Pro3, inject thawed samples into a MinION flow cell, and initiate sequencing. At the completion of the sequencing run, data will be downlinked for ground analysis. Identical, synchronous ground controls will be used for data comparisons to determine sequencer functionality, run-time sequence, current dynamics, and overall accuracy. We will present our latest results from the ISS flight experiment the first time DNA has ever been sequenced in space and discuss the many potential applications of the Biomolecule Sequencer for environmental monitoring, medical diagnostics, higher fidelity and more adaptable Space Biology Human

  13. Pairwise Sequence Alignment Library

    Energy Technology Data Exchange (ETDEWEB)

    2015-05-20

    Vector extensions, such as SSE, have been part of the x86 CPU since the 1990s, with applications in graphics, signal processing, and scientific applications. Although many algorithms and applications can naturally benefit from automatic vectorization techniques, there are still many that are difficult to vectorize due to their dependence on irregular data structures, dense branch operations, or data dependencies. Sequence alignment, one of the most widely used operations in bioinformatics workflows, has a computational footprint that features complex data dependencies. The trend of widening vector registers adversely affects the state-of-the-art sequence alignment algorithm based on striped data layouts. Therefore, a novel SIMD implementation of a parallel scan-based sequence alignment algorithm that can better exploit wider SIMD units was implemented as part of the Parallel Sequence Alignment Library (parasail). Parasail features: Reference implementations of all known vectorized sequence alignment approaches. Implementations of Smith Waterman (SW), semi-global (SG), and Needleman Wunsch (NW) sequence alignment algorithms. Implementations across all modern CPU instruction sets including AVX2 and KNC. Language interfaces for C/C++ and Python.

  14. Chromatin preferences of the perichromosomal layer constituent pKi-67.

    Science.gov (United States)

    Traut, Walther; Endl, Elmar; Garagna, Silvia; Scholzen, Thomas; Schwinger, Eberhard; Gerdes, Johannes; Winking, Heinz

    2002-01-01

    The proliferation-associated nuclear protein pKi-67 relocates from the nucleolus to the chromosome surface during the G2/M transition of the cell cycle and contributes to the formation of the 'perichromosomal layer'. We investigated the in-vivo binding preferences of pKi-67 for various chromatin blocks of the mitotic chromosomes from the human and two mouse species, Mus musculus and M. caroli. All chromosomes were decorated with pKi-67 but displayed a gap of pKi-67 decoration in the centromere and NOR regions. pKi-67 distribution in a rearranged mouse chromosome showed that the formation of the centromeric gap was controlled by the specific chromatin in that region. While most chromatin served as a substrate for direct or indirect binding of pKi-67, we identified three types of chromatin that bound less or no pKi-67. These were: (1) the centromeric heterochromatin defined by the alpha satellite DNA in the human, by the mouse minor satellite in M. musculus and the 60- and 79-bp satellites in M. caroli; (2) the pericentromeric heterochromatin in M. musculus defined by the mouse major satellite, and (3) NORs in the human and in M. musculus defined by rDNA repeats. In contrast, the conspicuous blocks of pericentromeric heterochromatin in human chromosomes 1, 9 and 16 containing the 5-bp satellite showed intense pKi-67 decoration. The centromeric gap may have a biological significance for the proper attachment of the chromosomes to the mitotic spindle. In this context, our results suggest a new role for centromeric heterochromatin: the control of the centromeric gap in the perichromosomal layer.

  15. Analysis and Visualization Tool for Targeted Amplicon Bisulfite Sequencing on Ion Torrent Sequencers.

    Directory of Open Access Journals (Sweden)

    Stephan Pabinger

    Full Text Available Targeted sequencing of PCR amplicons generated from bisulfite deaminated DNA is a flexible, cost-effective way to study methylation of a sample at single CpG resolution and perform subsequent multi-target, multi-sample comparisons. Currently, no platform specific protocol, support, or analysis solution is provided to perform targeted bisulfite sequencing on a Personal Genome Machine (PGM. Here, we present a novel tool, called TABSAT, for analyzing targeted bisulfite sequencing data generated on Ion Torrent sequencers. The workflow starts with raw sequencing data, performs quality assessment, and uses a tailored version of Bismark to map the reads to a reference genome. The pipeline visualizes results as lollipop plots and is able to deduce specific methylation-patterns present in a sample. The obtained profiles are then summarized and compared between samples. In order to assess the performance of the targeted bisulfite sequencing workflow, 48 samples were used to generate 53 different Bisulfite-Sequencing PCR amplicons from each sample, resulting in 2,544 amplicon targets. We obtained a mean coverage of 282X using 1,196,822 aligned reads. Next, we compared the sequencing results of these targets to the methylation level of the corresponding sites on an Illumina 450k methylation chip. The calculated average Pearson correlation coefficient of 0.91 confirms the sequencing results with one of the industry-leading CpG methylation platforms and shows that targeted amplicon bisulfite sequencing provides an accurate and cost-efficient method for DNA methylation studies, e.g., to provide platform-independent confirmation of Illumina Infinium 450k methylation data. TABSAT offers a novel way to analyze data generated by Ion Torrent instruments and can also be used with data from the Illumina MiSeq platform. It can be easily accessed via the Platomics platform, which offers a web-based graphical user interface along with sample and parameter storage

  16. DNA Sequencing by Capillary Electrophoresis

    Science.gov (United States)

    Karger, Barry L.; Guttman, Andras

    2009-01-01

    Sequencing of human and other genomes has been at the center of interest in the biomedical field over the past several decades and is now leading toward an era of personalized medicine. During this time, DNA sequencing methods have evolved from the labor intensive slab gel electrophoresis, through automated multicapillary electrophoresis systems using fluorophore labeling with multispectral imaging, to the “next generation” technologies of cyclic array, hybridization based, nanopore and single molecule sequencing. Deciphering the genetic blueprint and follow-up confirmatory sequencing of Homo sapiens and other genomes was only possible by the advent of modern sequencing technologies that was a result of step by step advances with a contribution of academics, medical personnel and instrument companies. While next generation sequencing is moving ahead at break-neck speed, the multicapillary electrophoretic systems played an essential role in the sequencing of the Human Genome, the foundation of the field of genomics. In this prospective, we wish to overview the role of capillary electrophoresis in DNA sequencing based in part of several of our articles in this journal. PMID:19517496

  17. HIV Sequence Compendium 2015

    Energy Technology Data Exchange (ETDEWEB)

    Foley, Brian Thomas [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Leitner, Thomas Kenneth [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Apetrei, Cristian [Univ. of Pittsburgh, PA (United States); Hahn, Beatrice [Univ. of Pennsylvania, Philadelphia, PA (United States); Mizrachi, Ilene [National Center for Biotechnology Information, Bethesda, MD (United States); Mullins, James [Univ. of Washington, Seattle, WA (United States); Rambaut, Andrew [Univ. of Edinburgh, Scotland (United Kingdom); Wolinsky, Steven [Northwestern Univ., Evanston, IL (United States); Korber, Bette Tina Marie [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2015-10-05

    This compendium is an annual printed summary of the data contained in the HIV sequence database. We try to present a judicious selection of the data in such a way that it is of maximum utility to HIV researchers. Each of the alignments attempts to display the genetic variability within the different species, groups and subtypes of the virus. This compendium contains sequences published before January 1, 2015. Hence, though it is published in 2015 and called the 2015 Compendium, its contents correspond to the 2014 curated alignments on our website. The number of sequences in the HIV database is still increasing. In total, at the end of 2014, there were 624,121 sequences in the HIV Sequence Database, an increase of 7% since the previous year. This is the first year that the number of new sequences added to the database has decreased compared to the previous year. The number of near complete genomes (>7000 nucleotides) increased to 5834 by end of 2014. However, as in previous years, the compendium alignments contain only a fraction of these. A more complete version of all alignments is available on our website, http://www.hiv.lanl.gov/ content/sequence/NEWALIGN/align.html As always, we are open to complaints and suggestions for improvement. Inquiries and comments regarding the compendium should be addressed to seq-info@lanl.gov.

  18. PN Sequence Preestimator Scheme for DS-SS Signal Acquisition Using Block Sequence Estimation

    Directory of Open Access Journals (Sweden)

    Sang Kyu Park

    2005-03-01

    Full Text Available An m-sequence (PN sequence preestimator scheme for direct-sequence spread spectrum (DS-SS signal acquisition by using block sequence estimation (BSE is proposed and analyzed. The proposed scheme consists of an estimator and a verifier which work according to the PN sequence chip clock, and provides not only the enhanced chip estimates with a threshold decision logic and one-chip error correction among the first m received chips, but also the reliability check of the estimates with additional decision logic. The probabilities of the estimator and verifier operations are calculated. With these results, the detection, the false alarm, and the missing probabilities of the proposed scheme are derived. In addition, using a signal flow graph, the average acquisition time is calculated. The proposed scheme can be used as a preestimator and easily implemented by changing the internal signal path of a generally used digital matched filter (DMF correlator or any other correlator that has a lot of sampling data memories for sampled PN sequence. The numerical results show rapid acquisition performance in a relatively good CNR.

  19. GuiTope: an application for mapping random-sequence peptides to protein sequences.

    Science.gov (United States)

    Halperin, Rebecca F; Stafford, Phillip; Emery, Jack S; Navalkar, Krupa Arun; Johnston, Stephen Albert

    2012-01-03

    Random-sequence peptide libraries are a commonly used tool to identify novel ligands for binding antibodies, other proteins, and small molecules. It is often of interest to compare the selected peptide sequences to the natural protein binding partners to infer the exact binding site or the importance of particular residues. The ability to search a set of sequences for similarity to a set of peptides may sometimes enable the prediction of an antibody epitope or a novel binding partner. We have developed a software application designed specifically for this task. GuiTope provides a graphical user interface for aligning peptide sequences to protein sequences. All alignment parameters are accessible to the user including the ability to specify the amino acid frequency in the peptide library; these frequencies often differ significantly from those assumed by popular alignment programs. It also includes a novel feature to align di-peptide inversions, which we have found improves the accuracy of antibody epitope prediction from peptide microarray data and shows utility in analyzing phage display datasets. Finally, GuiTope can randomly select peptides from a given library to estimate a null distribution of scores and calculate statistical significance. GuiTope provides a convenient method for comparing selected peptide sequences to protein sequences, including flexible alignment parameters, novel alignment features, ability to search a database, and statistical significance of results. The software is available as an executable (for PC) at http://www.immunosignature.com/software and ongoing updates and source code will be available at sourceforge.net.

  20. GuiTope: an application for mapping random-sequence peptides to protein sequences

    Directory of Open Access Journals (Sweden)

    Halperin Rebecca F

    2012-01-01

    Full Text Available Abstract Background Random-sequence peptide libraries are a commonly used tool to identify novel ligands for binding antibodies, other proteins, and small molecules. It is often of interest to compare the selected peptide sequences to the natural protein binding partners to infer the exact binding site or the importance of particular residues. The ability to search a set of sequences for similarity to a set of peptides may sometimes enable the prediction of an antibody epitope or a novel binding partner. We have developed a software application designed specifically for this task. Results GuiTope provides a graphical user interface for aligning peptide sequences to protein sequences. All alignment parameters are accessible to the user including the ability to specify the amino acid frequency in the peptide library; these frequencies often differ significantly from those assumed by popular alignment programs. It also includes a novel feature to align di-peptide inversions, which we have found improves the accuracy of antibody epitope prediction from peptide microarray data and shows utility in analyzing phage display datasets. Finally, GuiTope can randomly select peptides from a given library to estimate a null distribution of scores and calculate statistical significance. Conclusions GuiTope provides a convenient method for comparing selected peptide sequences to protein sequences, including flexible alignment parameters, novel alignment features, ability to search a database, and statistical significance of results. The software is available as an executable (for PC at http://www.immunosignature.com/software and ongoing updates and source code will be available at sourceforge.net.

  1. Targeted sequencing of plant genomes

    Science.gov (United States)

    Mark D. Huynh

    2014-01-01

    Next-generation sequencing (NGS) has revolutionized the field of genetics by providing a means for fast and relatively affordable sequencing. With the advancement of NGS, wholegenome sequencing (WGS) has become more commonplace. However, sequencing an entire genome is still not cost effective or even beneficial in all cases. In studies that do not require a whole-...

  2. Repdigits in k-Lucas sequences

    Indian Academy of Sciences (India)

    57(2) 2000 243-254) proved that 11 is the largest number with only one distinct digit (the so-called repdigit) in the sequence ( L n ( 2 ) ) n . In this paper, we address a similar problem in the family of -Lucas sequences. We also show that the -Lucas sequences have similar properties to those of -Fibonacci sequences ...

  3. A Limousin specific myostatin allele affects longissimus muscle area and fatty acid profiles in a Wagyu-Limousin F*2* population

    Science.gov (United States)

    A microsatellite-based genome scan of a Wagyu x Limousin F2 cross population previously demonstrated QTL affecting longissimus muscle area (LMA) and fatty acid composition were present in regions near the centromere of BTA 2. In this study we used 70 SNP markers to examine the centromeric 20 megabas...

  4. Enrichment of target sequences for next-generation sequencing applications in research and diagnostics.

    Science.gov (United States)

    Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter

    2014-02-01

    Abstract Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled about the requirement to take decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.

  5. HIV Sequence Compendium 2010

    Energy Technology Data Exchange (ETDEWEB)

    Kuiken, Carla [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Foley, Brian [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Leitner, Thomas [Los Alamos National Lab. (LANL), Los Alamos, NM (United States); Apetrei, Christian [Univ. of Pittsburgh, PA (United States); Hahn, Beatrice [Univ. of Alabama, Tuscaloosa, AL (United States); Mizrachi, Ilene [National Center for Biotechnology Information, Bethesda, MD (United States); Mullins, James [Univ. of Washington, Seattle, WA (United States); Rambaut, Andrew [Univ. of Edinburgh, Scotland (United Kingdom); Wolinsky, Steven [Northwestern Univ., Evanston, IL (United States); Korber, Bette [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)

    2010-12-31

    This compendium is an annual printed summary of the data contained in the HIV sequence database. In these compendia we try to present a judicious selection of the data in such a way that it is of maximum utility to HIV researchers. Each of the alignments attempts to display the genetic variability within the different species, groups and subtypes of the virus. This compendium contains sequences published before January 1, 2010. Hence, though it is called the 2010 Compendium, its contents correspond to the 2009 curated alignments on our website. The number of sequences in the HIV database is still increasing exponentially. In total, at the time of printing, there were 339,306 sequences in the HIV Sequence Database, an increase of 45% since last year. The number of near complete genomes (>7000 nucleotides) increased to 2576 by end of 2009, reflecting a smaller increase than in previous years. However, as in previous years, the compendium alignments contain only a small fraction of these. Included in the alignments are a small number of sequences representing each of the subtypes and the more prevalent circulating recombinant forms (CRFs) such as 01 and 02, as well as a few outgroup sequences (group O and N and SIV-CPZ). Of the rarer CRFs we included one representative each. A more complete version of all alignments is available on our website, http://www.hiv.lanl.gov/content/sequence/NEWALIGN/align.html. Reprints are available from our website in the form of both HTML and PDF files. As always, we are open to complaints and suggestions for improvement. Inquiries and comments regarding the compendium should be addressed to seq-info@lanl.gov.

  6. Almost convergence of triple sequences

    OpenAIRE

    Ayhan Esi; M.Necdet Catalbas

    2013-01-01

    In this paper we introduce and study the concepts of almost convergence and almost Cauchy for triple sequences. Weshow that the set of almost convergent triple sequences of 0's and 1's is of the first category and also almost everytriple sequence of 0's and 1's is not almost convergent.Keywords: almost convergence, P-convergent, triple sequence.

  7. Intermediate filaments and gene regulation.

    Science.gov (United States)

    Traub, P

    1995-01-01

    way specify different chromatin organizations and thus the expression of distinct sets of cell- or tissue-specific proteins. In support of this, different type III IFs have been shown to preferentially interact with guanine-rich, highly repetitive, double-stranded fragments of total genomic DNA, including chromosomal telomere sequences. Surprisingly, they also bound AT-rich, centromeric satellite DNA sequences with high efficiency. Since most of the affinity-isolated, non-telomeric and -centromeric DNA fragments contain regulatory elements that are normally located in 5'/3'-flanking and intron regions of genes, cIFs may activate gene expression or repress it as the result of telomeric and centromeric position effects. However, the nucleotide sequences of the cIF-bound, genomic DNA fragments also predict the involvement of cIF(protein)s in recombination and hence in evolutionary processes. Based on these observations, the initially observed minor effects of cIF protein knock-out mutations on the phenotype of transgenic mice may be interpreted as a redundancy phenomenon operating at the levels of the cytoskeleton and gene expression, whereas the capacity of the mutated animals to adapt to new environments via recombination processes may be severely disturbed and, as such, perceivable only after many generations of less favorable living conditions.

  8. Sequence History Update Tool

    Science.gov (United States)

    Khanampompan, Teerapat; Gladden, Roy; Fisher, Forest; DelGuercio, Chris

    2008-01-01

    The Sequence History Update Tool performs Web-based sequence statistics archiving for Mars Reconnaissance Orbiter (MRO). Using a single UNIX command, the software takes advantage of sequencing conventions to automatically extract the needed statistics from multiple files. This information is then used to populate a PHP database, which is then seamlessly formatted into a dynamic Web page. This tool replaces a previous tedious and error-prone process of manually editing HTML code to construct a Web-based table. Because the tool manages all of the statistics gathering and file delivery to and from multiple data sources spread across multiple servers, there is also a considerable time and effort savings. With the use of The Sequence History Update Tool what previously took minutes is now done in less than 30 seconds, and now provides a more accurate archival record of the sequence commanding for MRO.

  9. GROUPING WEB ACCESS SEQUENCES uSING SEQUENCE ALIGNMENT METHOD

    OpenAIRE

    BHUPENDRA S CHORDIA; KRISHNAKANT P ADHIYA

    2011-01-01

    In web usage mining grouping of web access sequences can be used to determine the behavior or intent of a set of users. Grouping websessions is how to measure the similarity between web sessions. There are many shortcomings in traditional measurement methods. The taskof grouping web sessions based on similarity and consists of maximizing the intra-group similarity while minimizing the inter-groupsimilarity is done using sequence alignment method. This paper introduces a new method to group we...

  10. The sequence specificity of UV-induced DNA damage in a systematically altered DNA sequence.

    Science.gov (United States)

    Khoe, Clairine V; Chung, Long H; Murray, Vincent

    2018-06-01

    The sequence specificity of UV-induced DNA damage was investigated in a specifically designed DNA plasmid using two procedures: end-labelling and linear amplification. Absorption of UV photons by DNA leads to dimerisation of pyrimidine bases and produces two major photoproducts, cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). A previous study had determined that two hexanucleotide sequences, 5'-GCTC*AC and 5'-TATT*AA, were high intensity UV-induced DNA damage sites. The UV clone plasmid was constructed by systematically altering each nucleotide of these two hexanucleotide sequences. One of the main goals of this study was to determine the influence of single nucleotide alterations on the intensity of UV-induced DNA damage. The sequence 5'-GCTC*AC was designed to examine the sequence specificity of 6-4PPs and the highest intensity 6-4PP damage sites were found at 5'-GTTC*CC nucleotides. The sequence 5'-TATT*AA was devised to investigate the sequence specificity of CPDs and the highest intensity CPD damage sites were found at 5'-TTTT*CG nucleotides. It was proposed that the tetranucleotide DNA sequence, 5'-YTC*Y (where Y is T or C), was the consensus sequence for the highest intensity UV-induced 6-4PP adduct sites; while it was 5'-YTT*C for the highest intensity UV-induced CPD damage sites. These consensus tetranucleotides are composed entirely of consecutive pyrimidines and must have a DNA conformation that is highly productive for the absorption of UV photons. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.

  11. "First generation" automated DNA sequencing technology.

    Science.gov (United States)

    Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

    2011-10-01

    Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.

  12. The recurrence sequences via Sylvester matrices

    Science.gov (United States)

    Karaduman, Erdal; Deveci, Ömür

    2017-07-01

    In this work, we define the Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by using the Slyvester matrices which are obtained from the characteristic polynomials of the Pell and Jacobsthal sequences and then, we study the sequences defined modulo m. Also, we obtain the cyclic groups and the semigroups from the generating matrices of these sequences when read modulo m and then, we derive the relationships among the orders of the cyclic groups and the periods of the sequences. Furthermore, we redefine Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by means of the elements of the groups and then, we examine them in the finite groups.

  13. Swab-to-Sequence: Real-time Data Analysis Platform for the Biomolecule Sequencer

    Data.gov (United States)

    National Aeronautics and Space Administration — DNA was successfully sequenced on the ISS in 2016, but the DNA sequenced was prepared on the ground. With FY’16 IRAD funds, the same team developed a...

  14. Quantitative phenotyping via deep barcode sequencing.

    Science.gov (United States)

    Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

    2009-10-01

    Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.

  15. ADDRESS SEQUENCES FOR MULTI RUN RAM TESTING

    Directory of Open Access Journals (Sweden)

    V. N. Yarmolik

    2014-01-01

    Full Text Available A universal approach for generation of address sequences with specified properties is proposed and analyzed. A modified version of the Antonov and Saleev algorithm for Sobol sequences genera-tion is chosen as a mathematical description of the proposed method. Within the framework of the proposed universal approach, the Sobol sequences form a subset of the address sequences. Other sub-sets are also formed, which are Gray sequences, anti-Gray sequences, counter sequences and sequenc-es with specified properties.

  16. The advantages of SMRT sequencing

    OpenAIRE

    Roberts, Richard J; Carneiro, Mauricio O; Schatz, Michael C

    2013-01-01

    Of the current next-generation sequencing technologies, SMRT sequencing is sometimes overlooked. However, attributes such as long reads, modified base detection and high accuracy make SMRT a useful technology and an ideal approach to the complete sequencing of small genomes.

  17. A safe an easy method for building consensus HIV sequences from 454 massively parallel sequencing data.

    Science.gov (United States)

    Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico

    2018-02-01

    To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap-value of 88% (IQR83.5-95.5). Association increased to 36/62 sequences, median bootstrap 94% (IQR85.5-98)], using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.

  18. Rapid Diagnostics of Onboard Sequences

    Science.gov (United States)

    Starbird, Thomas W.; Morris, John R.; Shams, Khawaja S.; Maimone, Mark W.

    2012-01-01

    Keeping track of sequences onboard a spacecraft is challenging. When reviewing Event Verification Records (EVRs) of sequence executions on the Mars Exploration Rover (MER), operators often found themselves wondering which version of a named sequence the EVR corresponded to. The lack of this information drastically impacts the operators diagnostic capabilities as well as their situational awareness with respect to the commands the spacecraft has executed, since the EVRs do not provide argument values or explanatory comments. Having this information immediately available can be instrumental in diagnosing critical events and can significantly enhance the overall safety of the spacecraft. This software provides auditing capability that can eliminate that uncertainty while diagnosing critical conditions. Furthermore, the Restful interface provides a simple way for sequencing tools to automatically retrieve binary compiled sequence SCMFs (Space Command Message Files) on demand. It also enables developers to change the underlying database, while maintaining the same interface to the existing applications. The logging capabilities are also beneficial to operators when they are trying to recall how they solved a similar problem many days ago: this software enables automatic recovery of SCMF and RML (Robot Markup Language) sequence files directly from the command EVRs, eliminating the need for people to find and validate the corresponding sequences. To address the lack of auditing capability for sequences onboard a spacecraft during earlier missions, extensive logging support was added on the Mars Science Laboratory (MSL) sequencing server. This server is responsible for generating all MSL binary SCMFs from RML input sequences. The sequencing server logs every SCMF it generates into a MySQL database, as well as the high-level RML file and dictionary name inputs used to create the SCMF. The SCMF is then indexed by a hash value that is automatically included in all command

  19. MatrixPlot: visualizing sequence constraints

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Stærfeldt, Hans Henrik; Lund, Ole

    1999-01-01

    MatrixPlot: visualizing sequence constraints. Sub-title Abstract Summary : MatrixPlot is a program for making high-quality matrix plots, such as mutual information plots of sequence alignments and distance matrices of sequences with known three-dimensional coordinates. The user can add information...

  20. Anomaly Detection in Sequences

    Data.gov (United States)

    National Aeronautics and Space Administration — We present a set of novel algorithms which we call sequenceMiner, that detect and characterize anomalies in large sets of high-dimensional symbol sequences that...

  1. Direct, rapid RNA sequence analysis

    International Nuclear Information System (INIS)

    Peattie, D.A.

    1987-01-01

    The original methods of RNA sequence analysis were based on enzymatic production and chromatographic separation of overlapping oligonucleotide fragments from within an RNA molecule followed by identification of the mononucleotides comprising the oligomer. Over the past decade the field of nucleic acid sequencing has changed dramatically, however, and RNA molecules now can be sequenced in a variety of more streamlined fashions. Most of the more recent advances in RNA sequencing have involved one-dimensional electrophoretic separation of 32 P-end-labeled oligoribonucleotides on polyacrylamide gels. In this chapter the author discusses two of these methods for determining the nucleotide sequences of RNA molecules rapidly: the chemical method and the enzymatic method. Both methods are direct and degradative, i.e., they rely on fragmatic and chemical approaches should be utilized. The single-strand-specific ribonucleases (A, T 1 , T 2 , and S 1 ) provide an efficient means to locate double-helical regions rapidly, and the chemical reactions provide a means to determine the RNA sequence within these regions. In addition, the chemical reactions allow one to assign interactions to specific atoms and to distinguish secondary interactions from tertiary ones. If the RNA molecule is small enough to be sequenced directly by the enzymatic or chemical method, the probing reactions can be done easily at the same time as sequencing reactions

  2. An evaluation of Comparative Genome Sequencing (CGS by comparing two previously-sequenced bacterial genomes

    Directory of Open Access Journals (Sweden)

    Herring Christopher D

    2007-08-01

    Full Text Available Abstract Background With the development of new technology, it has recently become practical to resequence the genome of a bacterium after experimental manipulation. It is critical though to know the accuracy of the technique used, and to establish confidence that all of the mutations were detected. Results In order to evaluate the accuracy of genome resequencing using the microarray-based Comparative Genome Sequencing service provided by Nimblegen Systems Inc., we resequenced the E. coli strain W3110 Kohara using MG1655 as a reference, both of which have been completely sequenced using traditional sequencing methods. CGS detected 7 of 8 small sequence differences, one large deletion, and 9 of 12 IS element insertions present in W3110, but did not detect a large chromosomal inversion. In addition, we confirmed that CGS also detected 2 SNPs, one deletion and 7 IS element insertions that are not present in the genome sequence, which we attribute to changes that occurred after the creation of the W3110 lambda clone library. The false positive rate for SNPs was one per 244 Kb of genome sequence. Conclusion CGS is an effective way to detect multiple mutations present in one bacterium relative to another, and while highly cost-effective, is prone to certain errors. Mutations occurring in repeated sequences or in sequences with a high degree of secondary structure may go undetected. It is also critical to follow up on regions of interest in which SNPs were not called because they often indicate deletions or IS element insertions.

  3. Short read sequence typing (SRST: multi-locus sequence types from short reads

    Directory of Open Access Journals (Sweden)

    Inouye Michael

    2012-07-01

    Full Text Available Abstract Background Multi-locus sequence typing (MLST has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. Results We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. Conclusions SRST is a novel

  4. Mind the gap; seven reasons to close fragmented genome assemblies.

    Science.gov (United States)

    Thomma, Bart P H J; Seidl, Michael F; Shi-Kunne, Xiaoqian; Cook, David E; Bolton, Melvin D; van Kan, Jan A L; Faino, Luigi

    2016-05-01

    Like other domains of life, research into the biology of filamentous microbes has greatly benefited from the advent of whole-genome sequencing. Next-generation sequencing (NGS) technologies have revolutionized sequencing, making genomic sciences accessible to many academic laboratories including those that study non-model organisms. Thus, hundreds of fungal genomes have been sequenced and are publically available today, although these initiatives have typically yielded considerably fragmented genome assemblies that often lack large contiguous genomic regions. Many important genomic features are contained in intergenic DNA that is often missing in current genome assemblies, and recent studies underscore the significance of non-coding regions and repetitive elements for the life style, adaptability and evolution of many organisms. The study of particular types of genetic elements, such as telomeres, centromeres, repetitive elements, effectors, and clusters of co-regulated genes, but also of phenomena such as structural rearrangements, genome compartmentalization and epigenetics, greatly benefits from having a contiguous and high-quality, preferably even complete and gapless, genome assembly. Here we discuss a number of important reasons to produce gapless, finished, genome assemblies to help answer important biological questions. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Computational analysis of sequence selection mechanisms.

    Science.gov (United States)

    Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

    2004-04-01

    Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.

  6. Putting instruction sequences into effect

    NARCIS (Netherlands)

    Bergstra, J.A.

    2011-01-01

    An attempt is made to define the concept of execution of an instruction sequence. It is found to be a special case of directly putting into effect of an instruction sequence. Directly putting into effect of an instruction sequences comprises interpretation as well as execution. Directly putting into

  7. Repeated DNA sequences in fungi

    Energy Technology Data Exchange (ETDEWEB)

    Dutta, S K

    1974-11-01

    Several fungal species, representatives of all broad groups like basidiomycetes, ascomycetes and phycomycetes, were examined for the nature of repeated DNA sequences by DNA:DNA reassociation studies using hydroxyapatite chromatography. All of the fungal species tested contained 10 to 20 percent repeated DNA sequences. There are approximately 100 to 110 copies of repeated DNA sequences of approximately 4 x 10/sup 7/ daltons piece size of each. Repeated DNA sequence homoduplexes showed on average 5/sup 0/C difference of T/sub e/50 (temperature at which 50 percent duplexes dissociate) values from the corresponding homoduplexes of unfractionated whole DNA. It is suggested that a part of repetitive sequences in fungi constitutes mitochondrial DNA and a part of it constitutes nuclear DNA. (auth)

  8. Centromeric and non-centromeric satellite DNA organisation differs in holocentric Rhynchospora species

    Czech Academy of Sciences Publication Activity Database

    Ribeiro, T.; Marques, A.; Novák, Petr; Schubert, V.; Vanzela, A.L.L.; Macas, Jiří; Houben, A.; Pedrosa-Harand, A.

    2017-01-01

    Roč. 126, September 19 (2017), s. 325-335 ISSN 0009-5915 R&D Projects: GA ČR GBP501/12/G090 Institutional support: RVO:60077344 Keywords : Chromocentre * Cyperaceae * Heterochromatin * Holocentric chromosome Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Genetics and heredity (medical genetics to be 3) Impact factor: 4.414, year: 2016

  9. In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

    Science.gov (United States)

    Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

    2015-10-01

    Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as for example the Fragile-X syndrome. Until now the mechanisms are not clearly understood as to how these expansions develop during cell proliferation. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promotor region was similarly condensed like a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. Due to further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of heredity diseases.

  10. Chameleon sequences in neurodegenerative diseases.

    Science.gov (United States)

    Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. Chameleon sequences in neurodegenerative diseases

    International Nuclear Information System (INIS)

    Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

    2016-01-01

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix to coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.

  12. Chameleon sequences in neurodegenerative diseases

    Energy Technology Data Exchange (ETDEWEB)

    Bahramali, Golnaz [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Goliaei, Bahram, E-mail: goliaei@ut.ac.ir [Institute of Biochemistry and Biophysics, University of Tehran, Tehran (Iran, Islamic Republic of); Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of); Salari, Ali [Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran (Iran, Islamic Republic of)

    2016-03-25

    Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix to coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.

  13. Multineuronal Spike Sequences Repeat with Millisecond Precision

    Directory of Open Access Journals (Sweden)

    Koki eMatsumoto

    2013-06-01

    Full Text Available Cortical microcircuits are nonrandomly wired by neurons. As a natural consequence, spikes emitted by microcircuits are also nonrandomly patterned in time and space. One of the prominent spike organizations is a repetition of fixed patterns of spike series across multiple neurons. However, several questions remain unsolved, including how precisely spike sequences repeat, how the sequences are spatially organized, how many neurons participate in sequences, and how different sequences are functionally linked. To address these questions, we monitored spontaneous spikes of hippocampal CA3 neurons ex vivo using a high-speed functional multineuron calcium imaging technique that allowed us to monitor spikes with millisecond resolution and to record the location of spiking and nonspiking neurons. Multineuronal spike sequences were overrepresented in spontaneous activity compared to the statistical chance level. Approximately 75% of neurons participated in at least one sequence during our observation period. The participants were sparsely dispersed and did not show specific spatial organization. The number of sequences relative to the chance level decreased when larger time frames were used to detect sequences. Thus, sequences were precise at the millisecond level. Sequences often shared common spikes with other sequences; parts of sequences were subsequently relayed by following sequences, generating complex chains of multiple sequences.

  14. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  15. DNA sequencing conference, 2

    Energy Technology Data Exchange (ETDEWEB)

    Cook-Deegan, R.M. [Georgetown Univ., Kennedy Inst. of Ethics, Washington, DC (United States); Venter, J.C. [National Inst. of Neurological Disorders and Strokes, Bethesda, MD (United States); Gilbert, W. [Harvard Univ., Cambridge, MA (United States); Mulligan, J. [Stanford Univ., CA (United States); Mansfield, B.K. [Oak Ridge National Lab., TN (United States)

    1991-06-19

    This conference focused on DNA sequencing, genetic linkage mapping, physical mapping, informatics and bioethics. Several were used to study this sequencing and mapping. This article also discusses computer hardware and software aiding in the mapping of genes.

  16. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

    Science.gov (United States)

    Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

    2015-01-01

    This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030

  17. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

    Directory of Open Access Journals (Sweden)

    Nathan D. Olson

    2015-03-01

    Full Text Available This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1 identity of biologically conserved position, (2 ratio of 16S rRNA gene copies featuring identified variants, and (3 the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies.

  18. Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

    DEFF Research Database (Denmark)

    Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk

    2013-01-01

    to the consensus sequence. Additionally, we got an average sequence depth for the genome of 4000 for the Iontorrent PGM and 400 for the FLX platform making the mapping suitable for single nucleotide variant (SNV) detection. The analysis revealed a single non-silent SNV A10665G leading to the amino acid change D......Next Generation Sequencing (NGS) is becoming more adopted into viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Iontorrent PGM platforms...

  19. Design of Long Period Pseudo-Random Sequences from the Addition of m -Sequences over 𝔽 p

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of m -sequences with pairwise-prime linear spans (AMPLS. Using m -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to 𝔽 2 , a signal set ( ( 2 n − 1 ( 2 m − 1 , ( 2 n + 1 ( 2 m + 1 , ( 2 ( n + 1 / 2 + 1 ( 2 ( m + 1 / 2 + 1 is constructed.

  20. The Colliding Beams Sequencer

    International Nuclear Information System (INIS)

    Johnson, D.E.; Johnson, R.P.

    1989-01-01

    The Colliding Beam Sequencer (CBS) is a computer program used to operate the pbar-p Collider by synchronizing the applications programs and simulating the activities of the accelerator operators during filling and storage. The Sequencer acts as a meta-program, running otherwise stand alone applications programs, to do the set-up, beam transfers, acceleration, low beta turn on, and diagnostics for the transfers and storage. The Sequencer and its operational performance will be described along with its special features which include a periodic scheduler and command logger. 14 refs., 3 figs

  1. Teaching Task Sequencing via Verbal Mediation.

    Science.gov (United States)

    Rusch, Frank R.; And Others

    1987-01-01

    Verbal sequence training was used to teach a moderately mentally retarded woman to sequence job-related tasks. Learning to say the tasks in the proper sequence resulted in the employee performing her tasks in that sequence, and the employee was capable of mediating her own work behavior when scheduled changes occurred. (Author/JDD)

  2. RANDNA: a random DNA sequence generator.

    Science.gov (United States)

    Piva, Francesco; Principato, Giovanni

    2006-01-01

    Monte Carlo simulations are useful to verify the significance of data. Genomic regularities, such as the nucleotide correlations or the not uniform distribution of the motifs throughout genomic or mature mRNA sequences, exist and their significance can be checked by means of the Monte Carlo test. The test needs good quality random sequences in order to work, moreover they should have the same nucleotide distribution as the sequences in which the regularities have been found. Random DNA sequences are also useful to estimate the background score of an alignment, that is a threshold below which the resulting score is merely due to chance. We have developed RANDNA, a free software which allows to produce random DNA or RNA sequences setting both their length and the percentage of nucleotide composition. Sequences having the same nucleotide distribution of exonic, intronic or intergenic sequences can be generated. Its graphic interface makes it possible to easily set the parameters that characterize the sequences being produced and saved in a text format file. The pseudo-random number generator function of Borland Delphi 6 is used, since it guarantees a good randomness, a long cycle length and a high speed. We have checked the quality of sequences generated by the software, by means of well-known tests, both by themselves and versus genuine random sequences. We show the good quality of the generated sequences. The software, complete with examples and documentation, is freely available to users from: http://www.introni.it/en/software.

  3. Dynamic Sequence Assignment.

    Science.gov (United States)

    1983-12-01

    D-136 548 DYNAMIIC SEQUENCE ASSIGNMENT(U) ADVANCED INFORMATION AND 1/2 DECISION SYSTEMS MOUNTAIN YIELW CA C A 0 REILLY ET AL. UNCLSSIIED DEC 83 AI/DS...I ADVANCED INFORMATION & DECISION SYSTEMS Mountain View. CA 94040 84 u ,53 V,..’. Unclassified _____ SCURITY CLASSIFICATION OF THIS PAGE REPORT...reviews some important heuristic algorithms developed for fas- ter solution of the sequence assignment problem. 3.1. DINAMIC MOGRAMUNIG FORMULATION FOR

  4. General LTE Sequence

    OpenAIRE

    Billal, Masum

    2015-01-01

    In this paper,we have characterized sequences which maintain the same property described in Lifting the Exponent Lemma. Lifting the Exponent Lemma is a very powerful tool in olympiad number theory and recently it has become very popular. We generalize it to all sequences that maintain a property like it i.e. if p^{\\alpha}||a_k and p^\\b{eta}||n, then p^{{\\alpha}+\\b{eta}}||a_{nk}.

  5. Decidability of uniform recurrence of morphic sequences

    OpenAIRE

    Durand , Fabien

    2012-01-01

    We prove that the uniform recurrence of morphic sequences is decidable. For this we show that the number of derived sequences of uniformly recurrent morphic sequences is bounded. As a corollary we obtain that uniformly recurrent morphic sequences are primitive substitutive sequences.

  6. Genomic sequencing of Pleistocene cave bears

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Hofreiter, Michael; Smith, Doug; Priest, JamesR.; Rohland, Nadin; Rabeder, Gernot; Krause, Johannes; Detter, J. Chris; Paabo, Svante; Rubin, Edward M.

    2005-04-01

    Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome, the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.

  7. Targeted assembly of short sequence reads.

    Directory of Open Access Journals (Sweden)

    René L Warren

    Full Text Available As next-generation sequence (NGS production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.

  8. Accurate Local-Ancestry Inference in Exome-Sequenced Admixed Individuals via Off-Target Sequence Reads

    Science.gov (United States)

    Hu, Youna; Willer, Cristen; Zhan, Xiaowei; Kang, Hyun Min; Abecasis, Gonçalo R.

    2013-01-01

    Estimates of the ancestry of specific chromosomal regions in admixed individuals are useful for studies of human evolutionary history and for genetic association studies. Previously, this ancestry inference relied on high-quality genotypes from genome-wide association study (GWAS) arrays. These high-quality genotypes are not always available when samples are exome sequenced, and exome sequencing is the strategy of choice for many ongoing genetic studies. Here we show that off-target reads generated during exome-sequencing experiments can be combined with on-target reads to accurately estimate the ancestry of each chromosomal segment in an admixed individual. To reconstruct local ancestry, our method SEQMIX models aligned bases directly instead of relying on hard genotype calls. We evaluate the accuracy of our method through simulations and analysis of samples sequenced by the 1000 Genomes Project and the NHLBI Grand Opportunity Exome Sequencing Project. In African Americans, we show that local-ancestry estimates derived by our method are very similar to those derived with Illumina’s Omni 2.5M genotyping array and much improved in relation to estimates that use only exome genotypes and ignore off-target sequencing reads. Software implementing this method, SEQMIX, can be applied to analysis of human population history or used for genetic association studies in admixed individuals. PMID:24210252

  9. Method and apparatus for biological sequence comparison

    Science.gov (United States)

    Marr, T.G.; Chang, W.I.

    1997-12-23

    A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.

  10. Memory and learning with rapid audiovisual sequences

    Science.gov (United States)

    Keller, Arielle S.; Sekuler, Robert

    2015-01-01

    We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed. PMID:26575193

  11. The RNA world, automatic sequences and oncogenetics

    Energy Technology Data Exchange (ETDEWEB)

    Tahir Shah, K

    1993-04-01

    We construct a model of the RNA world in terms of naturally evolving nucleotide sequences assuming only Crick-Watson base pairing and self-cleaving/splicing capability. These sequences have the following properties. (1) They are recognizable by an automation (or automata). That is, to each k-sequence, there exist a k-automation which accepts, recognizes or generates the k-sequence. These are known as automatic sequences. Fibonacci and Morse-Thue sequences are the most natural outcome of pre-biotic chemical conditions. (2) Infinite (resp. large) sequences are self-similar (resp. nearly self-similar) under certain rewrite rules and consequently give rise to fractal (resp.fractal-like) structures. Computationally, such sequences can also be generated by their corresponding deterministic parallel re-write system, known as a DOL system. The self-similar sequences are fixed points of their respective rewrite rules. Some of these automatic sequences have the capability that they can read or ``accept`` other sequences while others can detect errors and trigger error-correcting mechanisms. They can be enlarged and have block and/or palindrome structure. Linear recurring sequences such as Fibonacci sequence are simply Feed-back Shift Registers, a well know model of information processing machines. We show that a mutation of any rewrite rule can cause a combinatorial explosion of error and relates this to oncogenetical behavior. On the other hand, a mutation of sequences that are not rewrite rules, leads to normal evolutionary change. Known experimental results support our hypothesis. (author). Refs.

  12. Memory and learning with rapid audiovisual sequences.

    Science.gov (United States)

    Keller, Arielle S; Sekuler, Robert

    2015-01-01

    We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed.

  13. The RNA world, automatic sequences and oncogenetics

    International Nuclear Information System (INIS)

    Tahir Shah, K.

    1993-04-01

    We construct a model of the RNA world in terms of naturally evolving nucleotide sequences assuming only Crick-Watson base pairing and self-cleaving/splicing capability. These sequences have the following properties. 1) They are recognizable by an automation (or automata). That is, to each k-sequence, there exist a k-automation which accepts, recognizes or generates the k-sequence. These are known as automatic sequences. Fibonacci and Morse-Thue sequences are the most natural outcome of pre-biotic chemical conditions. 2) Infinite (resp. large) sequences are self-similar (resp. nearly self-similar) under certain rewrite rules and consequently give rise to fractal (resp.fractal-like) structures. Computationally, such sequences can also be generated by their corresponding deterministic parallel re-write system, known as a DOL system. The self-similar sequences are fixed points of their respective rewrite rules. Some of these automatic sequences have the capability that they can read or 'accept' other sequences while others can detect errors and trigger error-correcting mechanisms. They can be enlarged and have block and/or palindrome structure. Linear recurring sequences such as Fibonacci sequence are simply Feed-back Shift Registers, a well know model of information processing machines. We show that a mutation of any rewrite rule can cause a combinatorial explosion of error and relates this to oncogenetical behavior. On the other hand, a mutation of sequences that are not rewrite rules, leads to normal evolutionary change. Known experimental results support our hypothesis. (author). Refs

  14. Comprehensive cytological characterization of the Gossypium hirsutum genome based on the development of a set of chromosome cytological markers

    Institute of Scientific and Technical Information of China (English)

    Wenbo; Shan; Yanqin; Jiang; Jinlei; Han; Kai; Wang

    2016-01-01

    Cotton is the world’s most important natural fiber crop. It is also a model system for studying polyploidization, genomic organization, and genome-size variation. Integrating the cytological characterization of cotton with its genetic map will be essential for understanding its genome structure and evolution, as well as for performing further genetic-map based mapping and cloning. In this study, we isolated a complete set of bacterial artificial chromosome clones anchored to each of the 52 chromosome arms of the tetraploid cotton Gossypium hirsutum. Combining these with telomere and centromere markers, we constructed a standard karyotype for the G. hirsutum inbred line TM-1. We dissected the chromosome arm localizations of the 45 S and 5S r DNA and suggest a centromere repositioning event in the homoeologous chromosomes AT09 and DT09. By integrating a systematic karyotype analysis with the genetic linkage map, we observed different genome sizes and chromosomal structures between the subgenomes of the tetraploid cotton and those of its diploid ancestors. Using evidence of conserved coding sequences, we suggest that the different evolutionary paths of non-coding retrotransposons account for most of the variation in size between the subgenomes of tetraploid cotton and its diploid ancestors. These results provide insights into the cotton genome and will facilitate further genome studies in G. hirsutum.

  15. Comprehensive cytological characterization of the Gossypium hirsutum genome based on the development of a set of chromosome cytological markers

    Directory of Open Access Journals (Sweden)

    Wenbo Shan

    2016-08-01

    Full Text Available Cotton is the world's most important natural fiber crop. It is also a model system for studying polyploidization, genomic organization, and genome-size variation. Integrating the cytological characterization of cotton with its genetic map will be essential for understanding its genome structure and evolution, as well as for performing further genetic-map based mapping and cloning. In this study, we isolated a complete set of bacterial artificial chromosome clones anchored to each of the 52 chromosome arms of the tetraploid cotton Gossypium hirsutum. Combining these with telomere and centromere markers, we constructed a standard karyotype for the G. hirsutum inbred line TM-1. We dissected the chromosome arm localizations of the 45S and 5S rDNA and suggest a centromere repositioning event in the homoeologous chromosomes AT09 and DT09. By integrating a systematic karyotype analysis with the genetic linkage map, we observed different genome sizes and chromosomal structures between the subgenomes of the tetraploid cotton and those of its diploid ancestors. Using evidence of conserved coding sequences, we suggest that the different evolutionary paths of non-coding retrotransposons account for most of the variation in size between the subgenomes of tetraploid cotton and its diploid ancestors. These results provide insights into the cotton genome and will facilitate further genome studies in G. hirsutum.

  16. Phylogenomics of African guenons.

    Science.gov (United States)

    Moulin, Sibyle; Gerbault-Seureau, Michèle; Dutrillaux, Bernard; Richard, Florence Anne

    2008-01-01

    The karyotypes of 28 specimens belonging to 26 species of Cercopithecinae have been compared with each other and with human karyotype by chromosome banding and, for some of them, by Zoo-FISH (human painting probes) techniques. The study includes the first description of the karyotypes of four species and a synonym of Cercopithecus nictitans. The chromosomal homologies obtained provide us with new data on a large number of rearrangements. This allows us to code chromosomal characters to draw Cercopithecini phylogenetic trees, which are compared to phylogenetic data based on DNA sequences. Our findings show that some of the superspecies proposed by Kingdon (1997 The Kingdon Field Guide to African Mammals, Academic Press.) and Groves (2001 Primates Taxonomy, Smithsonian Institution Press) do not form homogeneous groups and that the genus Cercopithecus is paraphyletic, in agreement with previous molecular analyses. The evolution of Cercopithecini karyotypes is mainly due to non-centromeric chromosome fissions and centromeric shifts or inversions. Non-Robertsonian translocations occurred in C. hamlyni and C. neglectus. The position of chromosomal rearrangements in the phylogenetic tree leads us to propose that the Cercopithecini evolution proceeded by either repeated fission events facilitated by peculiar genomic structures or successive reticulate phases, in which heterozygous populations for few rearranged chromosomes were present, allowing the spreading of chromosomal forms in various combinations, before the speciation process.

  17. Compressing DNA sequence databases with coil

    Directory of Open Access Journals (Sweden)

    Hendy Michael D

    2008-05-01

    Full Text Available Abstract Background Publicly available DNA sequence databases such as GenBank are large, and are growing at an exponential rate. The sheer volume of data being dealt with presents serious storage and data communications problems. Currently, sequence data is usually kept in large "flat files," which are then compressed using standard Lempel-Ziv (gzip compression – an approach which rarely achieves good compression ratios. While much research has been done on compressing individual DNA sequences, surprisingly little has focused on the compression of entire databases of such sequences. In this study we introduce the sequence database compression software coil. Results We have designed and implemented a portable software package, coil, for compressing and decompressing DNA sequence databases based on the idea of edit-tree coding. coil is geared towards achieving high compression ratios at the expense of execution time and memory usage during compression – the compression time represents a "one-off investment" whose cost is quickly amortised if the resulting compressed file is transmitted many times. Decompression requires little memory and is extremely fast. We demonstrate a 5% improvement in compression ratio over state-of-the-art general-purpose compression tools for a large GenBank database file containing Expressed Sequence Tag (EST data. Finally, coil can efficiently encode incremental additions to a sequence database. Conclusion coil presents a compelling alternative to conventional compression of flat files for the storage and distribution of DNA sequence databases having a narrow distribution of sequence lengths, such as EST data. Increasing compression levels for databases having a wide distribution of sequence lengths is a direction for future work.

  18. Dynamic distribution patterns of ribosomal DNA and chromosomal evolution in Paphiopedilum, a lady's slipper orchid

    Directory of Open Access Journals (Sweden)

    Albert Victor A

    2011-09-01

    Full Text Available Abstract Background Paphiopedilum is a horticulturally and ecologically important genus of ca. 80 species of lady's slipper orchids native to Southeast Asia. These plants have long been of interest regarding their chromosomal evolution, which involves a progressive aneuploid series based on either fission or fusion of centromeres. Chromosome number is positively correlated with genome size, so rearrangement processes must include either insertion or deletion of DNA segments. We have conducted Fluorescence In Situ Hybridization (FISH studies using 5S and 25S ribosomal DNA (rDNA probes to survey for rearrangements, duplications, and phylogenetically-correlated variation within Paphiopedilum. We further studied sequence variation of the non-transcribed spacers of 5S rDNA (5S-NTS to examine their complex duplication history, including the possibility that concerted evolutionary forces may homogenize diversity. Results 5S and 25S rDNA loci among Paphiopedilum species, representing all key phylogenetic lineages, exhibit a considerable diversity that correlates well with recognized evolutionary groups. 25S rDNA signals range from 2 (representing 1 locus to 9, the latter representing hemizygosity. 5S loci display extensive structural variation, and show from 2 specific signals to many, both major and minor and highly dispersed. The dispersed signals mainly occur at centromeric and subtelomeric positions, which are hotspots for chromosomal breakpoints. Phylogenetic analysis of cloned 5S rDNA non-transcribed spacer (5S-NTS sequences showed evidence for both ancient and recent post-speciation duplication events, as well as interlocus and intralocus diversity. Conclusions Paphiopedilum species display many chromosomal rearrangements - for example, duplications, translocations, and inversions - but only weak concerted evolutionary forces among highly duplicated 5S arrays, which suggests that double-strand break repair processes are dynamic and ongoing. These

  19. Region segmentation along image sequence

    International Nuclear Information System (INIS)

    Monchal, L.; Aubry, P.

    1995-01-01

    A method to extract regions in sequence of images is proposed. Regions are not matched from one image to the following one. The result of a region segmentation is used as an initialization to segment the following and image to track the region along the sequence. The image sequence is exploited as a spatio-temporal event. (authors). 12 refs., 8 figs

  20. Phylogenetic Trees From Sequences

    Science.gov (United States)

    Ryvkin, Paul; Wang, Li-San

    In this chapter, we review important concepts and approaches for phylogeny reconstruction from sequence data.We first cover some basic definitions and properties of phylogenetics, and briefly explain how scientists model sequence evolution and measure sequence divergence. We then discuss three major approaches for phylogenetic reconstruction: distance-based phylogenetic reconstruction, maximum parsimony, and maximum likelihood. In the third part of the chapter, we review how multiple phylogenies are compared by consensus methods and how to assess confidence using bootstrapping. At the end of the chapter are two sections that list popular software packages and additional reading.

  1. Fast and secure retrieval of DNA sequences

    NARCIS (Netherlands)

    2014-01-01

    Sequence models are retrieved from a sequences index. The sequence models model DNA or RNA sequences stored in a database, and each comprises a finite memory tree source model and parameters for the finite memory tree source model. One or more DNA or RNA sequences stored in the database are

  2. Robustness analysis of chiller sequencing control

    International Nuclear Information System (INIS)

    Liao, Yundan; Sun, Yongjun; Huang, Gongsheng

    2015-01-01

    Highlights: • Uncertainties with chiller sequencing control were systematically quantified. • Robustness of chiller sequencing control was systematically analyzed. • Different sequencing control strategies were sensitive to different uncertainties. • A numerical method was developed for easy selection of chiller sequencing control. - Abstract: Multiple-chiller plant is commonly employed in the heating, ventilating and air-conditioning system to increase operational feasibility and energy-efficiency under part load condition. In a multiple-chiller plant, chiller sequencing control plays a key role in achieving overall energy efficiency while not sacrifices the cooling sufficiency for indoor thermal comfort. Various sequencing control strategies have been developed and implemented in practice. Based on the observation that (i) uncertainty, which cannot be avoided in chiller sequencing control, has a significant impact on the control performance and may cause the control fail to achieve the expected control and/or energy performance; and (ii) in current literature few studies have systematically addressed this issue, this paper therefore presents a study on robustness analysis of chiller sequencing control in order to understand the robustness of various chiller sequencing control strategies under different types of uncertainty. Based on the robustness analysis, a simple and applicable method is developed to select the most robust control strategy for a given chiller plant in the presence of uncertainties, which will be verified using case studies

  3. ABS: Sequence alignment by scanning

    KAUST Repository

    Bonny, Mohamed Talal; Salama, Khaled N.

    2011-01-01

    Sequence alignment is an essential tool in almost any computational biology research. It processes large database sequences and considered to be high consumers of computation time. Heuristic algorithms are used to get approximate but fast results. We introduce fast alignment algorithm, called Alignment By Scanning (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the well-known alignment algorithms, the FASTA (which is heuristic) and the 'Needleman-Wunsch' (which is optimal). The proposed algorithm achieves up to 76% enhancement in alignment score when it is compared with the FASTA Algorithm. The evaluations are conducted using different lengths of DNA sequences. © 2011 IEEE.

  4. ABS: Sequence alignment by scanning

    KAUST Repository

    Bonny, Mohamed Talal

    2011-08-01

    Sequence alignment is an essential tool in almost any computational biology research. It processes large database sequences and considered to be high consumers of computation time. Heuristic algorithms are used to get approximate but fast results. We introduce fast alignment algorithm, called Alignment By Scanning (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the well-known alignment algorithms, the FASTA (which is heuristic) and the \\'Needleman-Wunsch\\' (which is optimal). The proposed algorithm achieves up to 76% enhancement in alignment score when it is compared with the FASTA Algorithm. The evaluations are conducted using different lengths of DNA sequences. © 2011 IEEE.

  5. Fast global sequence alignment technique

    KAUST Repository

    Bonny, Mohamed Talal

    2011-11-01

    Bioinformatics database is growing exponentially in size. Processing these large amount of data may take hours of time even if super computers are used. One of the most important processing tool in Bioinformatics is sequence alignment. We introduce fast alignment algorithm, called \\'Alignment By Scanning\\' (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the wellknown sequence alignment algorithms, the \\'GAP\\' (which is heuristic) and the \\'Needleman-Wunsch\\' (which is optimal). The proposed algorithm achieves up to 51% enhancement in alignment score when it is compared with the GAP Algorithm. The evaluations are conducted using different lengths of DNA sequences. © 2011 IEEE.

  6. Biosensors for DNA sequence detection

    Science.gov (United States)

    Vercoutere, Wenonah; Akeson, Mark

    2002-01-01

    DNA biosensors are being developed as alternatives to conventional DNA microarrays. These devices couple signal transduction directly to sequence recognition. Some of the most sensitive and functional technologies use fibre optics or electrochemical sensors in combination with DNA hybridization. In a shift from sequence recognition by hybridization, two emerging single-molecule techniques read sequence composition using zero-mode waveguides or electrical impedance in nanoscale pores.

  7. Comparison of cytogenetic effects after occupational exposure to X-rays with those after foetal pelvimetric exposure

    International Nuclear Information System (INIS)

    Kirsch-Volders, M.; Poma, K.; Verschaeve, L.; Hens, L.; Susanne, C.; Elegem, P. van

    1978-01-01

    In utero acute low-level exposure to X-rays (300 to 350 mrads) induces a significant increase of band-loss in G-trypsin banded chromosomes of umbilical lymphocytes. Results, however, have to be confirmed by dose-effect relation studies and analysis with spectro-photometric scanning of deleted chromosomes. The same in utero exposure induces a significant dissociaton of chromosome pair 13 as revealed by centromere-centromere, angle and association tendency analysis of chromosome distribution in comparison with a control group. Occupational chronic low-level exposure to ionizing radiation does not modify significantly the amount of SCE in lymphocytes of peripheral blood. However, an analysis of centromere-centromere distances, angle values and associaton tendencies of the different chromosome combinations clearly shows an association of chromosome pair 12-16 after exposure to ionizing radiation. It is difficult to assess the exact biological importance of the observed chromosome modifications. However, referring to the already described dissociation of human acrocentric chromosomes after in vivo exposure to low levels of phenyl Hg acetate or inorganic Pb, the chromosome distribution seems not to be significantly disturbed by chronic or acute low-level exposure to ionizing radiation. (author)

  8. Transformed composite sequences for improved qubit addressing

    Science.gov (United States)

    Merrill, J. True; Doret, S. Charles; Vittorini, Grahame; Addison, J. P.; Brown, Kenneth R.

    2014-10-01

    Selective laser addressing of a single atom or atomic ion qubit can be improved using narrow-band composite pulse sequences. We describe a Lie-algebraic technique to generalize known narrow-band sequences and introduce sequences related by dilation and rotation of sequence generators. Our method improves known narrow-band sequences by decreasing both the pulse time and the residual error. Finally, we experimentally demonstrate these composite sequences using 40Ca+ ions trapped in a surface-electrode ion trap.

  9. Analysis of Pteridium ribosomal RNA sequences by rapid direct sequencing.

    Science.gov (United States)

    Tan, M K

    1991-08-01

    A total of 864 bases from 5 regions interspersed in the 18S and 26S rRNA molecules from various clones of Pteridium covering the general geographical distribution of the genus was analysed using a rapid rRNA sequencing technique. No base difference has been detected amongst the three major lineages, two of which apparently separated before the breakup of the ancient supercontinent, Pangaea. These regions of the rRNA sequences have thus been conserved for at least 160 million years and are here compared with other eukaryotic, especially plant rRNAs.

  10. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Jonas Binladen

    2007-02-01

    Full Text Available The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources.We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences. Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis.We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%. Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial

  11. Accident sequence quantification with KIRAP

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Tae Un; Han, Sang Hoon; Kim, Kil You; Yang, Jun Eon; Jeong, Won Dae; Chang, Seung Cheol; Sung, Tae Yong; Kang, Dae Il; Park, Jin Hee; Lee, Yoon Hwan; Hwang, Mi Jeong

    1997-01-01

    The tasks of probabilistic safety assessment(PSA) consists of the identification of initiating events, the construction of event tree for each initiating event, construction of fault trees for event tree logics, the analysis of reliability data and finally the accident sequence quantification. In the PSA, the accident sequence quantification is to calculate the core damage frequency, importance analysis and uncertainty analysis. Accident sequence quantification requires to understand the whole model of the PSA because it has to combine all event tree and fault tree models, and requires the excellent computer code because it takes long computation time. Advanced Research Group of Korea Atomic Energy Research Institute(KAERI) has developed PSA workstation KIRAP(Korea Integrated Reliability Analysis Code Package) for the PSA work. This report describes the procedures to perform accident sequence quantification, the method to use KIRAP`s cut set generator, and method to perform the accident sequence quantification with KIRAP. (author). 6 refs.

  12. Accident sequence quantification with KIRAP

    International Nuclear Information System (INIS)

    Kim, Tae Un; Han, Sang Hoon; Kim, Kil You; Yang, Jun Eon; Jeong, Won Dae; Chang, Seung Cheol; Sung, Tae Yong; Kang, Dae Il; Park, Jin Hee; Lee, Yoon Hwan; Hwang, Mi Jeong.

    1997-01-01

    The tasks of probabilistic safety assessment(PSA) consists of the identification of initiating events, the construction of event tree for each initiating event, construction of fault trees for event tree logics, the analysis of reliability data and finally the accident sequence quantification. In the PSA, the accident sequence quantification is to calculate the core damage frequency, importance analysis and uncertainty analysis. Accident sequence quantification requires to understand the whole model of the PSA because it has to combine all event tree and fault tree models, and requires the excellent computer code because it takes long computation time. Advanced Research Group of Korea Atomic Energy Research Institute(KAERI) has developed PSA workstation KIRAP(Korea Integrated Reliability Analysis Code Package) for the PSA work. This report describes the procedures to perform accident sequence quantification, the method to use KIRAP's cut set generator, and method to perform the accident sequence quantification with KIRAP. (author). 6 refs

  13. A measurement of disorder in binary sequences

    Science.gov (United States)

    Gong, Longyan; Wang, Haihong; Cheng, Weiwen; Zhao, Shengmei

    2015-03-01

    We propose a complex quantity, AL, to characterize the degree of disorder of L-length binary symbolic sequences. As examples, we respectively apply it to typical random and deterministic sequences. One kind of random sequences is generated from a periodic binary sequence and the other is generated from the logistic map. The deterministic sequences are the Fibonacci and Thue-Morse sequences. In these analyzed sequences, we find that the modulus of AL, denoted by |AL | , is a (statistically) equivalent quantity to the Boltzmann entropy, the metric entropy, the conditional block entropy and/or other quantities, so it is a useful quantitative measure of disorder. It can be as a fruitful index to discern which sequence is more disordered. Moreover, there is one and only one value of |AL | for the overall disorder characteristics. It needs extremely low computational costs. It can be easily experimentally realized. From all these mentioned, we believe that the proposed measure of disorder is a valuable complement to existing ones in symbolic sequences.

  14. Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

    Directory of Open Access Journals (Sweden)

    Jason D Thompson

    Full Text Available Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.

  15. Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

    Science.gov (United States)

    Thompson, Jason D; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

    2012-01-01

    Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.

  16. Static multiplicities in heterogeneous azeotropic distillation sequences

    DEFF Research Database (Denmark)

    Esbjerg, Klavs; Andersen, Torben Ravn; Jørgensen, Sten Bay

    1998-01-01

    In this paper the results of a bifurcation analysis on heterogeneous azeotropic distillation sequences are given. Two sequences suitable for ethanol dehydration are compared: The 'direct' and the 'indirect' sequence. It is shown, that the two sequences, despite their similarities, exhibit very...... different static behavior. The method of Petlyuk and Avet'yan (1971), Bekiaris et al. (1993), which assumes infinite reflux and infinite number of stages, is extended to and applied on heterogeneous azeotropic distillation sequences. The predictions are substantiated through simulations. The static sequence...

  17. Permutation Entropy for Random Binary Sequences

    Directory of Open Access Journals (Sweden)

    Lingfeng Liu

    2015-12-01

    Full Text Available In this paper, we generalize the permutation entropy (PE measure to binary sequences, which is based on Shannon’s entropy, and theoretically analyze this measure for random binary sequences. We deduce the theoretical value of PE for random binary sequences, which can be used to measure the randomness of binary sequences. We also reveal the relationship between this PE measure with other randomness measures, such as Shannon’s entropy and Lempel–Ziv complexity. The results show that PE is consistent with these two measures. Furthermore, we use PE as one of the randomness measures to evaluate the randomness of chaotic binary sequences.

  18. Perfect sequences over the real quaternions

    OpenAIRE

    Kuznetsov, Oleg

    2017-01-01

    In this Thesis, perfect sequences over the real quaternions are first considered. Definitions for the right and left periodic autocorrelation functions are given, and right and left perfect sequences introduced. It is shown that the right (left) perfection of any sequence implies the left (right) perfection, so concepts of right and left perfect sequences over the real quaternions are equivalent. Unitary transformations of the quaternion space ℍ are then considered. Using the equivalence of t...

  19. Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

    Science.gov (United States)

    Schirmer, Melanie; Ijaz, Umer Z; D'Amore, Rosalinda; Hall, Neil; Sloan, William T; Quince, Christopher

    2015-03-31

    With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Sequences for Student Investigation

    Science.gov (United States)

    Barton, Jeffrey; Feil, David; Lartigue, David; Mullins, Bernadette

    2004-01-01

    We describe two classes of sequences that give rise to accessible problems for undergraduate research. These problems may be understood with virtually no prerequisites and are well suited for computer-aided investigation. The first sequence is a variation of one introduced by Stephen Wolfram in connection with his study of cellular automata. The…

  1. Image sequence analysis

    CERN Document Server

    1981-01-01

    The processing of image sequences has a broad spectrum of important applica­ tions including target tracking, robot navigation, bandwidth compression of TV conferencing video signals, studying the motion of biological cells using microcinematography, cloud tracking, and highway traffic monitoring. Image sequence processing involves a large amount of data. However, because of the progress in computer, LSI, and VLSI technologies, we have now reached a stage when many useful processing tasks can be done in a reasonable amount of time. As a result, research and development activities in image sequence analysis have recently been growing at a rapid pace. An IEEE Computer Society Workshop on Computer Analysis of Time-Varying Imagery was held in Philadelphia, April 5-6, 1979. A related special issue of the IEEE Transactions on Pattern Anal­ ysis and Machine Intelligence was published in November 1980. The IEEE Com­ puter magazine has also published a special issue on the subject in 1981. The purpose of this book ...

  2. Alu-mediated large deletion of the CDSN gene as a cause of peeling skin disease.

    Science.gov (United States)

    Wada, T; Matsuda, Y; Muraoka, M; Toma, T; Takehara, K; Fujimoto, M; Yachie, A

    2014-10-01

    Peeling skin disease (PSD) is an autosomal recessive skin disorder caused by mutations in CDSN and is characterized by superficial peeling of the upper epidermis. Corneodesmosin (CDSN) is a major component of corneodesmosomes that plays an important role in maintaining epidermis integrity. Herein, we report a patient with PSD caused by a novel homozygous large deletion in the 6p21.3 region encompassing the CDSN gene, which abrogates CDSN expression. Several genes including C6orf15, PSORS1C1, PSORS1C2, CCHCR1, and TCF19 were also deleted, however, the patient showed only clinical features typical of PSD. The deletion size was 59.1 kb. Analysis of the sequence surrounding the breakpoint showed that both telomeric and centromeric breakpoints existed within Alu-S sequences that were oriented in opposite directions. These results suggest an Alu-mediated recombination event as the mechanism underlying the deletion in our patient. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. SAAS: Short Amino Acid Sequence - A Promising Protein Secondary Structure Prediction Method of Single Sequence

    Directory of Open Access Journals (Sweden)

    Zhou Yuan Wu

    2013-07-01

    Full Text Available In statistical methods of predicting protein secondary structure, many researchers focus on single amino acid frequencies in α-helices, β-sheets, and so on, or the impact near amino acids on an amino acid forming a secondary structure. But the paper considers a short sequence of amino acids (3, 4, 5 or 6 amino acids as integer, and statistics short sequence's probability forming secondary structure. Also, many researchers select low homologous sequences as statistical database. But this paper select whole PDB database. In this paper we propose a strategy to predict protein secondary structure using simple statistical method. Numerical computation shows that, short amino acids sequence as integer to statistics, which can easy see trend of short sequence forming secondary structure, and it will work well to select large statistical database (whole PDB database without considering homologous, and Q3 accuracy is ca. 74% using this paper proposed simple statistical method, but accuracy of others statistical methods is less than 70%.

  4. Transcriptome sequencing of the Microarray Quality Control (MAQC RNA reference samples using next generation sequencing

    Directory of Open Access Journals (Sweden)

    Thierry-Mieg Danielle

    2009-06-01

    Full Text Available Abstract Background Transcriptome sequencing using next-generation sequencing platforms will soon be competing with DNA microarray technologies for global gene expression analysis. As a preliminary evaluation of these promising technologies, we performed deep sequencing of cDNA synthesized from the Microarray Quality Control (MAQC reference RNA samples using Roche's 454 Genome Sequencer FLX. Results We generated more that 3.6 million sequence reads of average length 250 bp for the MAQC A and B samples and introduced a data analysis pipeline for translating cDNA read counts into gene expression levels. Using BLAST, 90% of the reads mapped to the human genome and 64% of the reads mapped to the RefSeq database of well annotated genes with e-values ≤ 10-20. We measured gene expression levels in the A and B samples by counting the numbers of reads that mapped to individual RefSeq genes in multiple sequencing runs to evaluate the MAQC quality metrics for reproducibility, sensitivity, specificity, and accuracy and compared the results with DNA microarrays and Quantitative RT-PCR (QRTPCR from the MAQC studies. In addition, 88% of the reads were successfully aligned directly to the human genome using the AceView alignment programs with an average 90% sequence similarity to identify 137,899 unique exon junctions, including 22,193 new exon junctions not yet contained in the RefSeq database. Conclusion Using the MAQC metrics for evaluating the performance of gene expression platforms, the ExpressSeq results for gene expression levels showed excellent reproducibility, sensitivity, and specificity that improved systematically with increasing shotgun sequencing depth, and quantitative accuracy that was comparable to DNA microarrays and QRTPCR. In addition, a careful mapping of the reads to the genome using the AceView alignment programs shed new light on the complexity of the human transcriptome including the discovery of thousands of new splice variants.

  5. Comparative genomics beyond sequence-based alignments

    DEFF Research Database (Denmark)

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...

  6. ASAP: Amplification, sequencing & annotation of plastomes

    Directory of Open Access Journals (Sweden)

    Folta Kevin M

    2005-12-01

    Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and

  7. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing

    DEFF Research Database (Denmark)

    Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P

    2007-01-01

    BACKGROUND: The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine...... primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution...

  8. cDNA sequence quality data - Budding yeast cDNA sequencing project | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Budding yeast cDNA sequencing project cDNA sequence quality data Data detail Data name cDNA sequence quality... data DOI 10.18908/lsdba.nbdc00838-003 Description of data contents Phred's quality score. P...tion Download License Update History of This Database Site Policy | Contact Us cDNA sequence quality

  9. Spreading Sequences Generated Using Asymmetrical Integer-Number Maps

    Directory of Open Access Journals (Sweden)

    V. Sebesta

    2007-09-01

    Full Text Available Chaotic sequences produced by piecewise linear maps can be transformed to binary sequences. The binary sequences are optimal for the asynchronous DS/CDMA systems in case of certain shapes of the maps. This paper is devoted to the one-to-one integer-number maps derived from the suitable asymmetrical piecewise linear maps. Such maps give periodic integer-number sequences, which can be transformed to the binary sequences. The binary sequences produced via proposed modified integer-number maps are perfectly balanced and embody good autocorrelation and crosscorrelation properties. The number of different binary sequences is sizable. The sequences are suitable as spreading sequences in DS/CDMA systems.

  10. Genomic sequencing in clinical trials

    OpenAIRE

    Mestan, Karen K; Ilkhanoff, Leonard; Mouli, Samdeep; Lin, Simon

    2011-01-01

    Abstract Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to fin...

  11. Compact flow diagrams for state sequences

    NARCIS (Netherlands)

    Buchin, K.A.; Buchin, M.E.; Gudmundsson, J.; Horton, M.J.; Sijben, S.

    2016-01-01

    We introduce the concept of compactly representing a large number of state sequences, e.g., sequences of activities, as a flow diagram. We argue that the flow diagram representation gives an intuitive summary that allows the user to detect patterns among large sets of state sequences. Simplified,

  12. The presence of five nifH-like sequences in Clostridium pasteurianum: sequence divergence and transcription properties.

    OpenAIRE

    Wang, S Z; Chen, J S; Johnson, J L

    1988-01-01

    The nifH gene encodes the iron protein (component II) of the nitrogenase complex. We have previously shown the presence in Clostridium pasteurianum of two nifH-like sequences in addition to the nifH1 gene which codes for a protein identical to the isolated iron protein. In the present study, we report that there are at least five nifH-like sequences in C. pasteurianum. DNA sequencing data indicate that the six nifH (nifH1) and nifH-like (nifH2, nifH3, nifH4, nifH5 and nifH6) sequences are not...

  13. Experimental design-based functional mining and characterization of high-throughput sequencing data in the sequence read archive.

    Directory of Open Access Journals (Sweden)

    Takeru Nakazato

    Full Text Available High-throughput sequencing technology, also called next-generation sequencing (NGS, has the potential to revolutionize the whole process of genome sequencing, transcriptomics, and epigenetics. Sequencing data is captured in a public primary data archive, the Sequence Read Archive (SRA. As of January 2013, data from more than 14,000 projects have been submitted to SRA, which is double that of the previous year. Researchers can download raw sequence data from SRA website to perform further analyses and to compare with their own data. However, it is extremely difficult to search entries and download raw sequences of interests with SRA because the data structure is complicated, and experimental conditions along with raw sequences are partly described in natural language. Additionally, some sequences are of inconsistent quality because anyone can submit sequencing data to SRA with no quality check. Therefore, as a criterion of data quality, we focused on SRA entries that were cited in journal articles. We extracted SRA IDs and PubMed IDs (PMIDs from SRA and full-text versions of journal articles and retrieved 2748 SRA ID-PMID pairs. We constructed a publication list referring to SRA entries. Since, one of the main themes of -omics analyses is clarification of disease mechanisms, we also characterized SRA entries by disease keywords, according to the Medical Subject Headings (MeSH extracted from articles assigned to each SRA entry. We obtained 989 SRA ID-MeSH disease term pairs, and constructed a disease list referring to SRA data. We previously developed feature profiles of diseases in a system called "Gendoo". We generated hyperlinks between diseases extracted from SRA and the feature profiles of it. The developed project, publication and disease lists resulting from this study are available at our web service, called "DBCLS SRA" (http://sra.dbcls.jp/. This service will improve accessibility to high-quality data from SRA.

  14. Next-generation sequencing of multiple individuals per barcoded library by deconvolution of sequenced amplicons using endonuclease fragment analysis

    DEFF Research Database (Denmark)

    Andersen, Jeppe D; Pereira, Vania; Pietroni, Carlotta

    2014-01-01

    The simultaneous sequencing of samples from multiple individuals increases the efficiency of next-generation sequencing (NGS) while also reducing costs. Here we describe a novel and simple approach for sequencing DNA from multiple individuals per barcode. Our strategy relies on the endonuclease...... digestion of PCR amplicons prior to library preparation, creating a specific fragment pattern for each individual that can be resolved after sequencing. By using both barcodes and restriction fragment patterns, we demonstrate the ability to sequence the human melanocortin 1 receptor (MC1R) genes from 72...... individuals using only 24 barcoded libraries....

  15. Comparative analysis of sequences from PT 2013

    DEFF Research Database (Denmark)

    Mikkelsen, Susie Sommer

    Sheatfish and not EHNV. Generally, mistakes occurred at the ends of the sequences. This can be due to several factors. One is that the sequence has not been trimmed of the sequence primer sites. Another is the lack of quality control of the chromatogram. Finally, sequencing in just one direction can result...... diseases in Europe. As part of the EURL proficiency test for fish diseases it is required to sequence any RANA virus isolates found in any of the samples. It is also highly recommended to sequence the ISA virus to determine whether it be HPRΔ or HPR0. Furthermore, it is recommended that any VHSV and IHNV...... isolates be genotyped. As part of the evaluation of the proficiency results it was decided this year to look into the quality and similarity of the sequence results for selected viruses. Ampoule III in the proficiency test 2013 contained an EHNV isolate. The EURL received 43 sequences from 41 laboratories...

  16. Meiosis-specific cohesin component, Stag3 is essential for maintaining centromere chromatid cohesion, and required for DNA repair and synapsis between homologous chromosomes.

    Science.gov (United States)

    Hopkins, Jessica; Hwang, Grace; Jacob, Justin; Sapp, Nicklas; Bedigian, Rick; Oka, Kazuhiro; Overbeek, Paul; Murray, Steve; Jordan, Philip W

    2014-07-01

    Cohesins are important for chromosome structure and chromosome segregation during mitosis and meiosis. Cohesins are composed of two structural maintenance of chromosomes (SMC1-SMC3) proteins that form a V-shaped heterodimer structure, which is bridged by a α-kleisin protein and a stromal antigen (STAG) protein. Previous studies in mouse have shown that there is one SMC1 protein (SMC1β), two α-kleisins (RAD21L and REC8) and one STAG protein (STAG3) that are meiosis-specific. During meiosis, homologous chromosomes must recombine with one another in the context of a tripartite structure known as the synaptonemal complex (SC). From interaction studies, it has been shown that there are at least four meiosis-specific forms of cohesin, which together with the mitotic cohesin complex, are lateral components of the SC. STAG3 is the only meiosis-specific subunit that is represented within all four meiosis-specific cohesin complexes. In Stag3 mutant germ cells, the protein level of other meiosis-specific cohesin subunits (SMC1β, RAD21L and REC8) is reduced, and their localization to chromosome axes is disrupted. In contrast, the mitotic cohesin complex remains intact and localizes robustly to the meiotic chromosome axes. The instability of meiosis-specific cohesins observed in Stag3 mutants results in aberrant DNA repair processes, and disruption of synapsis between homologous chromosomes. Furthermore, mutation of Stag3 results in perturbation of pericentromeric heterochromatin clustering, and disruption of centromere cohesion between sister chromatids during meiotic prophase. These defects result in early prophase I arrest and apoptosis in both male and female germ cells. The meiotic defects observed in Stag3 mutants are more severe when compared to single mutants for Smc1β, Rec8 and Rad21l, however they are not as severe as the Rec8, Rad21l double mutants. Taken together, our study demonstrates that STAG3 is required for the stability of all meiosis-specific cohesin

  17. Meiosis-specific cohesin component, Stag3 is essential for maintaining centromere chromatid cohesion, and required for DNA repair and synapsis between homologous chromosomes.

    Directory of Open Access Journals (Sweden)

    Jessica Hopkins

    2014-07-01

    Full Text Available Cohesins are important for chromosome structure and chromosome segregation during mitosis and meiosis. Cohesins are composed of two structural maintenance of chromosomes (SMC1-SMC3 proteins that form a V-shaped heterodimer structure, which is bridged by a α-kleisin protein and a stromal antigen (STAG protein. Previous studies in mouse have shown that there is one SMC1 protein (SMC1β, two α-kleisins (RAD21L and REC8 and one STAG protein (STAG3 that are meiosis-specific. During meiosis, homologous chromosomes must recombine with one another in the context of a tripartite structure known as the synaptonemal complex (SC. From interaction studies, it has been shown that there are at least four meiosis-specific forms of cohesin, which together with the mitotic cohesin complex, are lateral components of the SC. STAG3 is the only meiosis-specific subunit that is represented within all four meiosis-specific cohesin complexes. In Stag3 mutant germ cells, the protein level of other meiosis-specific cohesin subunits (SMC1β, RAD21L and REC8 is reduced, and their localization to chromosome axes is disrupted. In contrast, the mitotic cohesin complex remains intact and localizes robustly to the meiotic chromosome axes. The instability of meiosis-specific cohesins observed in Stag3 mutants results in aberrant DNA repair processes, and disruption of synapsis between homologous chromosomes. Furthermore, mutation of Stag3 results in perturbation of pericentromeric heterochromatin clustering, and disruption of centromere cohesion between sister chromatids during meiotic prophase. These defects result in early prophase I arrest and apoptosis in both male and female germ cells. The meiotic defects observed in Stag3 mutants are more severe when compared to single mutants for Smc1β, Rec8 and Rad21l, however they are not as severe as the Rec8, Rad21l double mutants. Taken together, our study demonstrates that STAG3 is required for the stability of all meiosis

  18. Next-generation sequencing

    DEFF Research Database (Denmark)

    Rieneck, Klaus; Bak, Mads; Jønson, Lars

    2013-01-01

    , Illumina); several millions of PCR sequences were analyzed. RESULTS: The results demonstrated the feasibility of diagnosing the fetal KEL1 or KEL2 blood group from cell-free DNA purified from maternal plasma. CONCLUSION: This method requires only one primer pair, and the large amount of sequence...... information obtained allows well for statistical analysis of the data. This general approach can be integrated into current laboratory practice and has numerous applications. Besides DNA-based predictions of blood group phenotypes, platelet phenotypes, or sickle cell anemia, and the determination of zygosity...

  19. Roche genome sequencer FLX based high-throughput sequencing of ancient DNA

    DEFF Research Database (Denmark)

    Alquezar-Planas, David E; Fordyce, Sarah Louise

    2012-01-01

    Since the development of so-called "next generation" high-throughput sequencing in 2005, this technology has been applied to a variety of fields. Such applications include disease studies, evolutionary investigations, and ancient DNA. Each application requires a specialized protocol to ensure...... that the data produced is optimal. Although much of the procedure can be followed directly from the manufacturer's protocols, the key differences lie in the library preparation steps. This chapter presents an optimized protocol for the sequencing of fossil remains and museum specimens, commonly referred...

  20. EGNAS: an exhaustive DNA sequence design algorithm

    Directory of Open Access Journals (Sweden)

    Kick Alfred

    2012-06-01

    Full Text Available Abstract Background The molecular recognition based on the complementary base pairing of deoxyribonucleic acid (DNA is the fundamental principle in the fields of genetics, DNA nanotechnology and DNA computing. We present an exhaustive DNA sequence design algorithm that allows to generate sets containing a maximum number of sequences with defined properties. EGNAS (Exhaustive Generation of Nucleic Acid Sequences offers the possibility of controlling both interstrand and intrastrand properties. The guanine-cytosine content can be adjusted. Sequences can be forced to start and end with guanine or cytosine. This option reduces the risk of “fraying” of DNA strands. It is possible to limit cross hybridizations of a defined length, and to adjust the uniqueness of sequences. Self-complementarity and hairpin structures of certain length can be avoided. Sequences and subsequences can optionally be forbidden. Furthermore, sequences can be designed to have minimum interactions with predefined strands and neighboring sequences. Results The algorithm is realized in a C++ program. TAG sequences can be generated and combined with primers for single-base extension reactions, which were described for multiplexed genotyping of single nucleotide polymorphisms. Thereby, possible foldback through intrastrand interaction of TAG-primer pairs can be limited. The design of sequences for specific attachment of molecular constructs to DNA origami is presented. Conclusions We developed a new software tool called EGNAS for the design of unique nucleic acid sequences. The presented exhaustive algorithm allows to generate greater sets of sequences than with previous software and equal constraints. EGNAS is freely available for noncommercial use at http://www.chm.tu-dresden.de/pc6/EGNAS.

  1. Integrated sequence analysis. Final report

    International Nuclear Information System (INIS)

    Andersson, K.; Pyy, P.

    1998-02-01

    The NKS/RAK subprojet 3 'integrated sequence analysis' (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term 'methodology' denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA,HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally follows a discussion about the project and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply the in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  2. Integrated sequence analysis. Final report

    Energy Technology Data Exchange (ETDEWEB)

    Andersson, K.; Pyy, P

    1998-02-01

    The NKS/RAK subprojet 3 `integrated sequence analysis` (ISA) was formulated with the overall objective to develop and to test integrated methodologies in order to evaluate event sequences with significant human action contribution. The term `methodology` denotes not only technical tools but also methods for integration of different scientific disciplines. In this report, we first discuss the background of ISA and the surveys made to map methods in different application fields, such as man machine system simulation software, human reliability analysis (HRA) and expert judgement. Specific event sequences were, after the surveys, selected for application and testing of a number of ISA methods. The event sequences discussed in the report were cold overpressure of BWR, shutdown LOCA of BWR, steam generator tube rupture of a PWR and BWR disturbed signal view in the control room after an external event. Different teams analysed these sequences by using different ISA and HRA methods. Two kinds of results were obtained from the ISA project: sequence specific and more general findings. The sequence specific results are discussed together with each sequence description. The general lessons are discussed under a separate chapter by using comparisons of different case studies. These lessons include areas ranging from plant safety management (design, procedures, instrumentation, operations, maintenance and safety practices) to methodological findings (ISA methodology, PSA,HRA, physical analyses, behavioural analyses and uncertainty assessment). Finally follows a discussion about the project and conclusions are presented. An interdisciplinary study of complex phenomena is a natural way to produce valuable and innovative results. This project came up with structured ways to perform ISA and managed to apply the in practice. The project also highlighted some areas where more work is needed. In the HRA work, development is required for the use of simulators and expert judgement as

  3. Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS): from manual to automatic application of a 'de novo sequencing' approach.

    Science.gov (United States)

    Pascale, Raffaella; Grossi, Gerarda; Cruciani, Gabriele; Mecca, Giansalvatore; Santoro, Donatello; Sarli Calace, Renzo; Falabella, Patrizia; Bianco, Giuliana

    Sequence protein identification by a randomized sequence database and transcriptome mass spectrometry software package has been developed at the University of Basilicata in Potenza (Italy) and designed to facilitate the determination of the amino acid sequence of a peptide as well as an unequivocal identification of proteins in a high-throughput manner with enormous advantages of time, economical resource and expertise. The software package is a valid tool for the automation of a de novo sequencing approach, overcoming the main limits and a versatile platform useful in the proteomic field for an unequivocal identification of proteins, starting from tandem mass spectrometry data. The strength of this software is that it is a user-friendly and non-statistical approach, so protein identification can be considered unambiguous.

  4. Fast global sequence alignment technique

    KAUST Repository

    Bonny, Mohamed Talal; Salama, Khaled N.

    2011-01-01

    fast alignment algorithm, called 'Alignment By Scanning' (ABS), to provide an approximate alignment of two DNA sequences. We compare our algorithm with the wellknown sequence alignment algorithms, the 'GAP' (which is heuristic) and the 'Needleman

  5. Ancestral sequence alignment under optimal conditions

    Directory of Open Access Journals (Sweden)

    Brown Daniel G

    2005-11-01

    Full Text Available Abstract Background Multiple genome alignment is an important problem in bioinformatics. An important subproblem used by many multiple alignment approaches is that of aligning two multiple alignments. Many popular alignment algorithms for DNA use the sum-of-pairs heuristic, where the score of a multiple alignment is the sum of its induced pairwise alignment scores. However, the biological meaning of the sum-of-pairs of pairs heuristic is not obvious. Additionally, many algorithms based on the sum-of-pairs heuristic are complicated and slow, compared to pairwise alignment algorithms. An alternative approach to aligning alignments is to first infer ancestral sequences for each alignment, and then align the two ancestral sequences. In addition to being fast, this method has a clear biological basis that takes into account the evolution implied by an underlying phylogenetic tree. In this study we explore the accuracy of aligning alignments by ancestral sequence alignment. We examine the use of both maximum likelihood and parsimony to infer ancestral sequences. Additionally, we investigate the effect on accuracy of allowing ambiguity in our ancestral sequences. Results We use synthetic sequence data that we generate by simulating evolution on a phylogenetic tree. We use two different types of phylogenetic trees: trees with a period of rapid growth followed by a period of slow growth, and trees with a period of slow growth followed by a period of rapid growth. We examine the alignment accuracy of four ancestral sequence reconstruction and alignment methods: parsimony, maximum likelihood, ambiguous parsimony, and ambiguous maximum likelihood. Additionally, we compare against the alignment accuracy of two sum-of-pairs algorithms: ClustalW and the heuristic of Ma, Zhang, and Wang. Conclusion We find that allowing ambiguity in ancestral sequences does not lead to better multiple alignments. Regardless of whether we use parsimony or maximum likelihood, the

  6. Recursive sequences in first-year calculus

    Science.gov (United States)

    Krainer, Thomas

    2016-02-01

    This article provides ready-to-use supplementary material on recursive sequences for a second-semester calculus class. It equips first-year calculus students with a basic methodical procedure based on which they can conduct a rigorous convergence or divergence analysis of many simple recursive sequences on their own without the need to invoke inductive arguments as is typically required in calculus textbooks. The sequences that are accessible to this kind of analysis are predominantly (eventually) monotonic, but also certain recursive sequences that alternate around their limit point as they converge can be considered.

  7. Hardware Accelerated Sequence Alignment with Traceback

    Directory of Open Access Journals (Sweden)

    Scott Lloyd

    2009-01-01

    in a timely manner. Known methods to accelerate alignment on reconfigurable hardware only address sequence comparison, limit the sequence length, or exhibit memory and I/O bottlenecks. A space-efficient, global sequence alignment algorithm and architecture is presented that accelerates the forward scan and traceback in hardware without memory and I/O limitations. With 256 processing elements in FPGA technology, a performance gain over 300 times that of a desktop computer is demonstrated on sequence lengths of 16000. For greater performance, the architecture is scalable to more processing elements.

  8. Transcription blockage by homopurine DNA sequences: role of sequence composition and single-strand breaks

    Science.gov (United States)

    Belotserkovskii, Boris P.; Neil, Alexander J.; Saleh, Syed Shayon; Shin, Jane Hae Soo; Mirkin, Sergei M.; Hanawalt, Philip C.

    2013-01-01

    The ability of DNA to adopt non-canonical structures can affect transcription and has broad implications for genome functioning. We have recently reported that guanine-rich (G-rich) homopurine-homopyrimidine sequences cause significant blockage of transcription in vitro in a strictly orientation-dependent manner: when the G-rich strand serves as the non-template strand [Belotserkovskii et al. (2010) Mechanisms and implications of transcription blockage by guanine-rich DNA sequences., Proc. Natl Acad. Sci. USA, 107, 12816–12821]. We have now systematically studied the effect of the sequence composition and single-stranded breaks on this blockage. Although substitution of guanine by any other base reduced the blockage, cytosine and thymine reduced the blockage more significantly than adenine substitutions, affirming the importance of both G-richness and the homopurine-homopyrimidine character of the sequence for this effect. A single-strand break in the non-template strand adjacent to the G-rich stretch dramatically increased the blockage. Breaks in the non-template strand result in much weaker blockage signals extending downstream from the break even in the absence of the G-rich stretch. Our combined data support the notion that transcription blockage at homopurine-homopyrimidine sequences is caused by R-loop formation. PMID:23275544

  9. SeqLib: a C ++ API for rapid BAM manipulation, sequence alignment and sequence assembly.

    Science.gov (United States)

    Wala, Jeremiah; Beroukhim, Rameen

    2017-03-01

    We present SeqLib, a C ++ API and command line tool that provides a rapid and user-friendly interface to BAM/SAM/CRAM files, global sequence alignment operations and sequence assembly. Four C libraries perform core operations in SeqLib: HTSlib for BAM access, BWA-MEM and BLAT for sequence alignment and Fermi for error correction and sequence assembly. Benchmarking indicates that SeqLib has lower CPU and memory requirements than leading C ++ sequence analysis APIs. We demonstrate an example of how minimal SeqLib code can extract, error-correct and assemble reads from a CRAM file and then align with BWA-MEM. SeqLib also provides additional capabilities, including chromosome-aware interval queries and read plotting. Command line tools are available for performing integrated error correction, micro-assemblies and alignment. SeqLib is available on Linux and OSX for the C ++98 standard and later at github.com/walaj/SeqLib. SeqLib is released under the Apache2 license. Additional capabilities for BLAT alignment are available under the BLAT license. jwala@broadinstitue.org ; rameen@broadinstitute.org. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  10. Exploration of noncoding sequences in metagenomes.

    Directory of Open Access Journals (Sweden)

    Fabián Tobar-Tosse

    Full Text Available Environment-dependent genomic features have been defined for different metagenomes, whose genes and their associated processes are related to specific environments. Identification of ORFs and their functional categories are the most common methods for association between functional and environmental features. However, this analysis based on finding ORFs misses noncoding sequences and, therefore, some metagenome regulatory or structural information could be discarded. In this work we analyzed 23 whole metagenomes, including coding and noncoding sequences using the following sequence patterns: (G+C content, Codon Usage (Cd, Trinucleotide Usage (Tn, and functional assignments for ORF prediction. Herein, we present evidence of a high proportion of noncoding sequences discarded in common similarity-based methods in metagenomics, and the kind of relevant information present in those. We found a high density of trinucleotide repeat sequences (TRS in noncoding sequences, with a regulatory and adaptive function for metagenome communities. We present associations between trinucleotide values and gene function, where metagenome clustering correlate with microorganism adaptations and kinds of metagenomes. We propose here that noncoding sequences have relevant information to describe metagenomes that could be considered in a whole metagenome analysis in order to improve their organization, classification protocols, and their relation with the environment.

  11. Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture

    OpenAIRE

    Park, Seong Hyeon; Kim, ByeongDo; Kang, Chang Mook; Chung, Chung Choo; Choi, Jun Won

    2018-01-01

    In this paper, we propose a deep learning based vehicle trajectory prediction technique which can generate the future trajectory sequence of surrounding vehicles in real time. We employ the encoder-decoder architecture which analyzes the pattern underlying in the past trajectory using the long short-term memory (LSTM) based encoder and generates the future trajectory sequence using the LSTM based decoder. This structure produces the $K$ most likely trajectory candidates over occupancy grid ma...

  12. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Directory of Open Access Journals (Sweden)

    Graner Andreas

    2008-10-01

    Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular

  13. Third-Generation Sequencing and Analysis of Four Complete Pig Liver Esterase Gene Sequences in Clones Identified by Screening BAC Library.

    Science.gov (United States)

    Zhou, Qiongqiong; Sun, Wenjuan; Liu, Xiyan; Wang, Xiliang; Xiao, Yuncai; Bi, Dingren; Yin, Jingdong; Shi, Deshi

    2016-01-01

    Pig liver carboxylesterase (PLE) gene sequences in GenBank are incomplete, which has led to difficulties in studying the genetic structure and regulation mechanisms of gene expression of PLE family genes. The aim of this study was to obtain and analysis of complete gene sequences of PLE family by screening from a Rongchang pig BAC library and third-generation PacBio gene sequencing. After a number of existing incomplete PLE isoform gene sequences were analysed, primers were designed based on conserved regions in PLE exons, and the whole pig genome used as a template for Polymerase chain reaction (PCR) amplification. Specific primers were then selected based on the PCR amplification results. A three-step PCR screening method was used to identify PLE-positive clones by screening a Rongchang pig BAC library and PacBio third-generation sequencing was performed. BLAST comparisons and other bioinformatics methods were applied for sequence analysis. Five PLE-positive BAC clones, designated BAC-10, BAC-70, BAC-75, BAC-119 and BAC-206, were identified. Sequence analysis yielded the complete sequences of four PLE genes, PLE1, PLE-B9, PLE-C4, and PLE-G2. Complete PLE gene sequences were defined as those containing regulatory sequences, exons, and introns. It was found that, not only did the PLE exon sequences of the four genes show a high degree of homology, but also that the intron sequences were highly similar. Additionally, the regulatory region of the genes contained two 720bps reverse complement sequences that may have an important function in the regulation of PLE gene expression. This is the first report to confirm the complete sequences of four PLE genes. In addition, the study demonstrates that each PLE isoform is encoded by a single gene and that the various genes exhibit a high degree of sequence homology, suggesting that the PLE family evolved from a single ancestral gene. Obtaining the complete sequences of these PLE genes provides the necessary foundation for

  14. Exome sequencing and genetic testing for MODY.

    Directory of Open Access Journals (Sweden)

    Stefan Johansson

    Full Text Available Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic potential when standard Sanger sequencing-based diagnostics is inconclusive.The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results.We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism.On average, we obtained 45 X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively, thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes.Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.

  15. FRESCO: Referential compression of highly similar sequences.

    Science.gov (United States)

    Wandelt, Sebastian; Leser, Ulf

    2013-01-01

    In many applications, sets of similar texts or sequences are of high importance. Prominent examples are revision histories of documents or genomic sequences. Modern high-throughput sequencing technologies are able to generate DNA sequences at an ever-increasing rate. In parallel to the decreasing experimental time and cost necessary to produce DNA sequences, computational requirements for analysis and storage of the sequences are steeply increasing. Compression is a key technology to deal with this challenge. Recently, referential compression schemes, storing only the differences between a to-be-compressed input and a known reference sequence, gained a lot of interest in this field. In this paper, we propose a general open-source framework to compress large amounts of biological sequence data called Framework for REferential Sequence COmpression (FRESCO). Our basic compression algorithm is shown to be one to two orders of magnitudes faster than comparable related work, while achieving similar compression ratios. We also propose several techniques to further increase compression ratios, while still retaining the advantage in speed: 1) selecting a good reference sequence; and 2) rewriting a reference sequence to allow for better compression. In addition,we propose a new way of further boosting the compression ratios by applying referential compression to already referentially compressed files (second-order compression). This technique allows for compression ratios way beyond state of the art, for instance,4,000:1 and higher for human genomes. We evaluate our algorithms on a large data set from three different species (more than 1,000 genomes, more than 3 TB) and on a collection of versions of Wikipedia pages. Our results show that real-time compression of highly similar sequences at high compression ratios is possible on modern hardware.

  16. Genome Sequence of Australian Indigenous Wine Yeast Torulaspora delbrueckii COFT1 Using Nanopore Sequencing.

    Science.gov (United States)

    Tondini, Federico; Jiranek, Vladimir; Grbin, Paul R; Onetto, Cristobal A

    2018-04-26

    Here, we report the first sequenced genome of an indigenous Australian wine isolate of Torulaspora delbrueckii using the Oxford Nanopore MinION and Illumina HiSeq sequencing platforms. The genome size is 9.4 Mb and contains 4,831 genes. Copyright © 2018 Tondini et al.

  17. Targeted Gene Sequencing and Whole-Exome Sequencing in Autopsied Fetuses with Prenatally Diagnosed Kidney Anomalies

    DEFF Research Database (Denmark)

    Rasmussen, M; Sunde, L; Nielsen, M L

    2018-01-01

    Identification of fetal kidney anomalies invites questions about underlying causes and recurrence risk in future pregnancies. We therefore investigated the diagnostic yield of next-generation sequencing in fetuses with bilateral kidney anomalies and the correlation between disrupted genes and fetal...... phenotypes. Fetuses with bilateral kidney anomalies were screened using an in-house-designed kidney-gene panel. In families where candidate variants were not identified, whole-exome sequencing was performed. Genes uncovered by this analysis were added to our kidney-panel. We identified likely deleterious...... of nephronophthisis. Exome sequencing identified ROBO1 variants in one family and a GREB1L variant in another family. GREB1L and ROBO1 were added to our kidney-gene panel and additional variants were identified. Next-generation sequencing substantially contributes to identifying causes of fetal kidney anomalies...

  18. Sequence Factorization with Multiple References.

    Directory of Open Access Journals (Sweden)

    Sebastian Wandelt

    Full Text Available The success of high-throughput sequencing has lead to an increasing number of projects which sequence large populations of a species. Storage and analysis of sequence data is a key challenge in these projects, because of the sheer size of the datasets. Compression is one simple technology to deal with this challenge. Referential factorization and compression schemes, which store only the differences between input sequence and a reference sequence, gained lots of interest in this field. Highly-similar sequences, e.g., Human genomes, can be compressed with a compression ratio of 1,000:1 and more, up to two orders of magnitude better than with standard compression techniques. Recently, it was shown that the compression against multiple references from the same species can boost the compression ratio up to 4,000:1. However, a detailed analysis of using multiple references is lacking, e.g., for main memory consumption and optimality. In this paper, we describe one key technique for the referential compression against multiple references: The factorization of sequences. Based on the notion of an optimal factorization, we propose optimization heuristics and identify parameter settings which greatly influence 1 the size of the factorization, 2 the time for factorization, and 3 the required amount of main memory. We evaluate a total of 30 setups with a varying number of references on data from three different species. Our results show a wide range of factorization sizes (optimal to an overhead of up to 300%, factorization speed (0.01 MB/s to more than 600 MB/s, and main memory usage (few dozen MB to dozens of GB. Based on our evaluation, we identify the best configurations for common use cases. Our evaluation shows that multi-reference factorization is much better than single-reference factorization.

  19. Multiplexed microsatellite recovery using massively parallel sequencing

    Science.gov (United States)

    Jennings, T.N.; Knaus, B.J.; Mullins, T.D.; Haig, S.M.; Cronn, R.C.

    2011-01-01

    Conservation and management of natural populations requires accurate and inexpensive genotyping methods. Traditional microsatellite, or simple sequence repeat (SSR), marker analysis remains a popular genotyping method because of the comparatively low cost of marker development, ease of analysis and high power of genotype discrimination. With the availability of massively parallel sequencing (MPS), it is now possible to sequence microsatellite-enriched genomic libraries in multiplex pools. To test this approach, we prepared seven microsatellite-enriched, barcoded genomic libraries from diverse taxa (two conifer trees, five birds) and sequenced these on one lane of the Illumina Genome Analyzer using paired-end 80-bp reads. In this experiment, we screened 6.1 million sequences and identified 356958 unique microreads that contained di- or trinucleotide microsatellites. Examination of four species shows that our conversion rate from raw sequences to polymorphic markers compares favourably to Sanger- and 454-based methods. The advantage of multiplexed MPS is that the staggering capacity of modern microread sequencing is spread across many libraries; this reduces sample preparation and sequencing costs to less than $400 (USD) per species. This price is sufficiently low that microsatellite libraries could be prepared and sequenced for all 1373 organisms listed as 'threatened' and 'endangered' in the United States for under $0.5M (USD).

  20. Automated degenerate PCR primer design for high-throughput sequencing improves efficiency of viral sequencing

    Directory of Open Access Journals (Sweden)

    Li Kelvin

    2012-11-01

    Full Text Available Abstract Background In a high-throughput environment, to PCR amplify and sequence a large set of viral isolates from populations that are potentially heterogeneous and continuously evolving, the use of degenerate PCR primers is an important strategy. Degenerate primers allow for the PCR amplification of a wider range of viral isolates with only one set of pre-mixed primers, thus increasing amplification success rates and minimizing the necessity for genome finishing activities. To successfully select a large set of degenerate PCR primers necessary to tile across an entire viral genome and maximize their success, this process is best performed computationally. Results We have developed a fully automated degenerate PCR primer design system that plays a key role in the J. Craig Venter Institute’s (JCVI high-throughput viral sequencing pipeline. A consensus viral genome, or a set of consensus segment sequences in the case of a segmented virus, is specified using IUPAC ambiguity codes in the consensus template sequence to represent the allelic diversity of the target population. PCR primer pairs are then selected computationally to produce a minimal amplicon set capable of tiling across the full length of the specified target region. As part of the tiling process, primer pairs are computationally screened to meet the criteria for successful PCR with one of two described amplification protocols. The actual sequencing success rates for designed primers for measles virus, mumps virus, human parainfluenza virus 1 and 3, human respiratory syncytial virus A and B and human metapneumovirus are described, where >90% of designed primer pairs were able to consistently successfully amplify >75% of the isolates. Conclusions Augmenting our previously developed and published JCVI Primer Design Pipeline, we achieved similarly high sequencing success rates with only minor software modifications. The recommended methodology for the construction of the consensus

  1. Sequence analysis of Leukemia DNA

    Science.gov (United States)

    Nacong, Nasria; Lusiyanti, Desy; Irawan, Muhammad. Isa

    2018-03-01

    Cancer is a very deadly disease, one of which is leukemia disease or better known as blood cancer. The cancer cell can be detected by taking DNA in laboratory test. This study focused on local alignment of leukemia and non leukemia data resulting from NCBI in the form of DNA sequences by using Smith-Waterman algorithm. SmithWaterman algorithm was invented by TF Smith and MS Waterman in 1981. These algorithms try to find as much as possible similarity of a pair of sequences, by giving a negative value to the unequal base pair (mismatch), and positive values on the same base pair (match). So that will obtain the maximum positive value as the end of the alignment, and the minimum value as the initial alignment. This study will use sequences of leukemia and 3 sequences of non leukemia.

  2. Locomotor sequence learning in visually guided walking

    DEFF Research Database (Denmark)

    Choi, Julia T; Jensen, Peter; Nielsen, Jens Bo

    2016-01-01

    walking. In addition, we determined how age (i.e., healthy young adults vs. children) and biomechanical factors (i.e., walking speed) affected the rate and magnitude of locomotor sequence learning. The results showed that healthy young adults (age 24 ± 5 years, N = 20) could learn a specific sequence...... of step lengths over 300 training steps. Younger children (age 6-10 years, N = 8) have lower baseline performance, but their magnitude and rate of sequence learning was the same compared to older children (11-16 years, N = 10) and healthy adults. In addition, learning capacity may be more limited...... to modify step length from one trial to the next. Our sequence learning paradigm is derived from the serial reaction-time (SRT) task that has been used in upper limb studies. Both random and ordered sequences of step lengths were used to measure sequence-specific and sequence non-specific learning during...

  3. Cytogenetic and molecular markers for detecting Aegilops uniaristata chromosomes in a wheat background.

    Science.gov (United States)

    Gong, Wenping; Li, Guangrong; Zhou, Jianping; Li, Genying; Liu, Cheng; Huang, Chengyan; Zhao, Zhendong; Yang, Zujun

    2014-09-01

    Aegilops uniaristata has many agronomically useful traits that can be used for wheat breeding. So far, a Triticum turgidum - Ae. uniaristata amphiploid and one set of Chinese Spring (CS) - Ae. uniaristata addition lines have been produced. To guide Ae. uniaristata chromatin transformation from these lines into cultivated wheat through chromosome engineering, reliable cytogenetic and molecular markers specific for Ae. uniaristata chromosomes need to be developed. Standard C-banding shows that C-bands mainly exist in the centromeric regions of Ae. uniaristata but rarely at the distal ends. Fluorescence in situ hybridization (FISH) using (GAA)8 as a probe showed that the hybridization signal of chromosomes 1N-7N are different, thus (GAA)8 can be used to identify all Ae. uniaristata chromosomes in wheat background simultaneously. Moreover, a total of 42 molecular markers specific for Ae. uniaristata chromosomes were developed by screening expressed sequence tag - sequence tagged site (EST-STS), expressed sequence tag - simple sequence repeat (EST-SSR), and PCR-based landmark unique gene (PLUG) primers. The markers were subsequently localized using the CS - Ae. uniaristata addition lines and different wheat cultivars as controls. The cytogenetic and molecular markers developed herein will be helpful for screening and identifying wheat - Ae. uniaristata progeny.

  4. Identification and chromosome mapping of repetitive elements in the Astyanax scabripinnis (Teleostei: Characidae) species complex.

    Science.gov (United States)

    Barbosa, Patrícia; de Oliveira, Luiz Antonio; Pucci, Marcela Baer; Santos, Mateus Henrique; Moreira-Filho, Orlando; Vicari, Marcelo Ricardo; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira

    2015-02-01

    Most part of the eukaryotic genome is composed of repeated sequences or multiple copies of DNA, which were considered as "junk DNA", and may be associated to the heterochromatin. In this study, three populations of Astyanax aff. scabripinnis from Brazilian rivers of Guaratinguetá and Pindamonhangaba (São Paulo) and a population from Maringá (Paraná) were analyzed concerning the localization of the nucleolar organizer regions (Ag-NORs), the As51 satellite DNA, the 18S ribosomal DNA (rDNA), and the 5S rDNA. Repeated sequences were also isolated and identified by the Cot - 1 method, which indicated similarity (90%) with the LINE UnaL2 retrotransposon. The fluorescence in situ hybridization (FISH) showed the retrotransposon dispersed and more concentrated markers in centromeric and telomeric chromosomal regions. These sequences were co-localized and interspaced with 18S and 5S rDNA and As51, confirmed by fiber-FISH essay. The B chromosome found in these populations pointed to a conspicuous hybridization with LINE probe, which is also co-located in As51 sequences. The NORs were active at unique sites of a homologous pair in the three populations. There were no evidences that transposable elements and repetitive DNA had influence in the transcriptional regulation of ribosomal genes in our analyses.

  5. Heterogeneous breakpoints in patients with acute lymphoblastic leukemia and the dic(9;20)(p11-13;q11) show recurrent involvement of genes at 20q11.21.

    Science.gov (United States)

    An, Qian; Wright, Sarah L; Moorman, Anthony V; Parker, Helen; Griffiths, Mike; Ross, Fiona M; Davies, Teresa; Harrison, Christine J; Strefford, Jon C

    2009-08-01

    The dic(9;20)(p11-13;q11) is a recurrent chromosomal abnormality in patients with acute lymphoblastic leukemia. Although it results in loss of material from 9p and 20q, the molecular targets on both chromosomes have not been fully elucidated. From an initial cohort of 58 with acute lymphoblastic leukemia patients with this translocation, breakpoint mapping with fluorescence in situ hybridization on 26 of them revealed breakpoint heterogeneity of both chromosomes. PAX5 has been proposed to be the target gene on 9p, while for 20q, FISH analysis implicated the involvement of the ASXL1 gene, either by a breakpoint within (n=4) or centromeric (deletion, n=12) of the gene. Molecular copy-number counting, long-distance inverse PCR and direct sequence analysis identified six dic(9;20) breakpoint sequences. In addition to the three previously reported: PAX5-ASXL1, PAX5-C20ORF112 and PAX5-KIF3B; we identified three new ones in this study: sequences 3' of PAX5 disrupting ASXL1, and ZCCHC7 disrupted by sequences 3' of FRG1B and LOC1499503. This study provides insight into the breakpoint complexity underlying dicentric chromosomal formation in acute lymphoblastic leukemia and highlights putative target gene loci.

  6. Design of Protein Multi-specificity Using an Independent Sequence Search Reduces the Barrier to Low Energy Sequences.

    Directory of Open Access Journals (Sweden)

    Alexander M Sevy

    2015-07-01

    Full Text Available Computational protein design has found great success in engineering proteins for thermodynamic stability, binding specificity, or enzymatic activity in a 'single state' design (SSD paradigm. Multi-specificity design (MSD, on the other hand, involves considering the stability of multiple protein states simultaneously. We have developed a novel MSD algorithm, which we refer to as REstrained CONvergence in multi-specificity design (RECON. The algorithm allows each state to adopt its own sequence throughout the design process rather than enforcing a single sequence on all states. Convergence to a single sequence is encouraged through an incrementally increasing convergence restraint for corresponding positions. Compared to MSD algorithms that enforce (constrain an identical sequence on all states the energy landscape is simplified, which accelerates the search drastically. As a result, RECON can readily be used in simulations with a flexible protein backbone. We have benchmarked RECON on two design tasks. First, we designed antibodies derived from a common germline gene against their diverse targets to assess recovery of the germline, polyspecific sequence. Second, we design "promiscuous", polyspecific proteins against all binding partners and measure recovery of the native sequence. We show that RECON is able to efficiently recover native-like, biologically relevant sequences in this diverse set of protein complexes.

  7. Assessing Telomere Length Using Surface Enhanced Raman Scattering

    Science.gov (United States)

    Zong, Shenfei; Wang, Zhuyuan; Chen, Hui; Cui, Yiping

    2014-11-01

    Telomere length can provide valuable insight into telomeres and telomerase related diseases, including cancer. Here, we present a brand-new optical telomere length measurement protocol using surface enhanced Raman scattering (SERS). In this protocol, two single strand DNA are used as SERS probes. They are labeled with two different Raman molecules and can specifically hybridize with telomeres and centromere, respectively. First, genome DNA is extracted from cells. Then the telomere and centromere SERS probes are added into the genome DNA. After hybridization with genome DNA, excess SERS probes are removed by magnetic capturing nanoparticles. Finally, the genome DNA with SERS probes attached is dropped onto a SERS substrate and subjected to SERS measurement. Longer telomeres result in more attached telomere probes, thus a stronger SERS signal. Consequently, SERS signal can be used as an indicator of telomere length. Centromere is used as the inner control. By calibrating the SERS intensity of telomere probe with that of the centromere probe, SERS based telomere measurement is realized. This protocol does not require polymerase chain reaction (PCR) or electrophoresis procedures, which greatly simplifies the detection process. We anticipate that this easy-operation and cost-effective protocol is a fine alternative for the assessment of telomere length.

  8. Meikin-associated polo-like kinase specifies Bub1 distribution in meiosis I.

    Science.gov (United States)

    Miyazaki, Seira; Kim, Jihye; Yamagishi, Yuya; Ishiguro, Tadashi; Okada, Yuki; Tanno, Yuji; Sakuno, Takeshi; Watanabe, Yoshinori

    2017-06-01

    In meiosis I, sister chromatids are captured by microtubules emanating from the same pole (mono-orientation), and centromeric cohesion is protected throughout anaphase. Shugoshin, which is localized to centromeres depending on the phosphorylation of histone H2A by Bub1 kinase, plays a central role in protecting meiotic cohesin Rec8 from separase cleavage. Another key meiotic kinetochore factor, meikin, may regulate cohesion protection, although the underlying molecular mechanisms remain elusive. Here, we show that fission yeast Moa1 (meikin), which associates stably with CENP-C during meiosis I, recruits Plo1 (polo-like kinase) to the kinetochores and phosphorylates Spc7 (KNL1) to accumulate Bub1. Consequently, in contrast to the transient kinetochore localization of mitotic Bub1, meiotic Bub1 persists at kinetochores until anaphase I. The meiotic Bub1 pool ensures robust Sgo1 (shugoshin) localization and cohesion protection at centromeres by cooperating with heterochromatin protein Swi6, which binds and stabilizes Sgo1. Furthermore, molecular genetic analyses show a hierarchical regulation of centromeric cohesion protection by meikin and shugoshin that is important for establishing meiosis-specific chromosome segregation. We provide evidence that the meiosis-specific Bub1 regulation is conserved in mouse. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.

  9. Quantitative comparison between a multiecho sequence and a single-echo sequence for susceptibility-weighted phase imaging.

    Science.gov (United States)

    Gilbert, Guillaume; Savard, Geneviève; Bard, Céline; Beaudoin, Gilles

    2012-06-01

    The aim of this study was to investigate the benefits arising from the use of a multiecho sequence for susceptibility-weighted phase imaging using a quantitative comparison with a standard single-echo acquisition. Four healthy adult volunteers were imaged on a clinical 3-T system using a protocol comprising two different three-dimensional susceptibility-weighted gradient-echo sequences: a standard single-echo sequence and a multiecho sequence. Both sequences were repeated twice in order to evaluate the local noise contribution by a subtraction of the two acquisitions. For the multiecho sequence, the phase information from each echo was independently unwrapped, and the background field contribution was removed using either homodyne filtering or the projection onto dipole fields method. The phase information from all echoes was then combined using a weighted linear regression. R2 maps were also calculated from the multiecho acquisitions. The noise standard deviation in the reconstructed phase images was evaluated for six manually segmented regions of interest (frontal white matter, posterior white matter, globus pallidus, putamen, caudate nucleus and lateral ventricle). The use of the multiecho sequence for susceptibility-weighted phase imaging led to a reduction of the noise standard deviation for all subjects and all regions of interest investigated in comparison to the reference single-echo acquisition. On average, the noise reduction ranged from 18.4% for the globus pallidus to 47.9% for the lateral ventricle. In addition, the amount of noise reduction was found to be strongly inversely correlated to the estimated R2 value (R=-0.92). In conclusion, the use of a multiecho sequence is an effective way to decrease the noise contribution in susceptibility-weighted phase images, while preserving both contrast and acquisition time. The proposed approach additionally permits the calculation of R2 maps. Copyright © 2012 Elsevier Inc. All rights reserved.

  10. Defining reference sequences for Nocardia species by similarity and clustering analyses of 16S rRNA gene sequence data.

    Directory of Open Access Journals (Sweden)

    Manal Helal

    Full Text Available BACKGROUND: The intra- and inter-species genetic diversity of bacteria and the absence of 'reference', or the most representative, sequences of individual species present a significant challenge for sequence-based identification. The aims of this study were to determine the utility, and compare the performance of several clustering and classification algorithms to identify the species of 364 sequences of 16S rRNA gene with a defined species in GenBank, and 110 sequences of 16S rRNA gene with no defined species, all within the genus Nocardia. METHODS: A total of 364 16S rRNA gene sequences of Nocardia species were studied. In addition, 110 16S rRNA gene sequences assigned only to the Nocardia genus level at the time of submission to GenBank were used for machine learning classification experiments. Different clustering algorithms were compared with a novel algorithm or the linear mapping (LM of the distance matrix. Principal Components Analysis was used for the dimensionality reduction and visualization. RESULTS: The LM algorithm achieved the highest performance and classified the set of 364 16S rRNA sequences into 80 clusters, the majority of which (83.52% corresponded with the original species. The most representative 16S rRNA sequences for individual Nocardia species have been identified as 'centroids' in respective clusters from which the distances to all other sequences were minimized; 110 16S rRNA gene sequences with identifications recorded only at the genus level were classified using machine learning methods. Simple kNN machine learning demonstrated the highest performance and classified Nocardia species sequences with an accuracy of 92.7% and a mean frequency of 0.578. CONCLUSION: The identification of centroids of 16S rRNA gene sequence clusters using novel distance matrix clustering enables the identification of the most representative sequences for each individual species of Nocardia and allows the quantitation of inter- and intra

  11. Farey sequences and resistor networks

    Indian Academy of Sciences (India)

    Green's function, while the perturbation of a network is investigated in [3]. ... In Theorem 1 below, we employ the Farey sequence to establish a strict .... We next show that the Farey sequence method is applicable for circuits with n or fewer.

  12. The Biomolecule Sequencer Project: Nanopore Sequencing as a Dual-Use Tool for Crew Health and Astrobiology Investigations

    Science.gov (United States)

    John, K. K.; Botkin, D. S.; Burton, A. S.; Castro-Wallace, S. L.; Chaput, J. D.; Dworkin, J. P.; Lehman, N.; Lupisella, M. L.; Mason, C. E.; Smith, D. J.; hide

    2016-01-01

    Human missions to Mars will fundamentally transform how the planet is explored, enabling new scientific discoveries through more sophisticated sample acquisition and processing than can currently be implemented in robotic exploration. The presence of humans also poses new challenges, including ensuring astronaut safety and health and monitoring contamination. Because the capability to transfer materials to Earth will be extremely limited, there is a strong need for in situ diagnostic capabilities. Nucleotide sequencing is a particularly powerful tool because it can be used to: (1) mitigate microbial risks to crew by allowing identification of microbes in water, in air, and on surfaces; (2) identify optimal treatment strategies for infections that arise in crew members; and (3) track how crew members, microbes, and mission-relevant organisms (e.g., farmed plants) respond to conditions on Mars through transcriptomic and genomic changes. Sequencing would also offer benefits for science investigations occurring on the surface of Mars by permitting identification of Earth-derived contamination in samples. If Mars contains indigenous life, and that life is based on nucleic acids or other closely related molecules, sequencing would serve as a critical tool for the characterization of those molecules. Therefore, spaceflight-compatible nucleic acid sequencing would be an important capability for both crew health and astrobiology exploration. Advances in sequencing technology on Earth have been driven largely by needs for higher throughput and read accuracy. Although some reduction in size has been achieved, nearly all commercially available sequencers are not compatible with spaceflight due to size, power, and operational requirements. Exceptions are nanopore-based sequencers that measure changes in current caused by DNA passing through pores; these devices are inherently much smaller and require significantly less power than sequencers using other detection methods

  13. mirVAFC: A Web Server for Prioritizations of Pathogenic Sequence Variants from Exome Sequencing Data via Classifications.

    Science.gov (United States)

    Li, Zhongshan; Liu, Zhenwei; Jiang, Yi; Chen, Denghui; Ran, Xia; Sun, Zhong Sheng; Wu, Jinyu

    2017-01-01

    Exome sequencing has been widely used to identify the genetic variants underlying human genetic disorders for clinical diagnoses, but the identification of pathogenic sequence variants among the huge amounts of benign ones is complicated and challenging. Here, we describe a new Web server named mirVAFC for pathogenic sequence variants prioritizations from clinical exome sequencing (CES) variant data of single individual or family. The mirVAFC is able to comprehensively annotate sequence variants, filter out most irrelevant variants using custom criteria, classify variants into different categories as for estimated pathogenicity, and lastly provide pathogenic variants prioritizations based on classifications and mutation effects. Case studies using different types of datasets for different diseases from publication and our in-house data have revealed that mirVAFC can efficiently identify the right pathogenic candidates as in original work in each case. Overall, the Web server mirVAFC is specifically developed for pathogenic sequence variant identifications from family-based CES variants using classification-based prioritizations. The mirVAFC Web server is freely accessible at https://www.wzgenomics.cn/mirVAFC/. © 2016 WILEY PERIODICALS, INC.

  14. Phylogenomics of Phrynosomatid Lizards: Conflicting Signals from Sequence Capture versus Restriction Site Associated DNA Sequencing

    Science.gov (United States)

    Leaché, Adam D.; Chavez, Andreas S.; Jones, Leonard N.; Grummer, Jared A.; Gottscho, Andrew D.; Linkem, Charles W.

    2015-01-01

    Sequence capture and restriction site associated DNA sequencing (RADseq) are popular methods for obtaining large numbers of loci for phylogenetic analysis. These methods are typically used to collect data at different evolutionary timescales; sequence capture is primarily used for obtaining conserved loci, whereas RADseq is designed for discovering single nucleotide polymorphisms (SNPs) suitable for population genetic or phylogeographic analyses. Phylogenetic questions that span both “recent” and “deep” timescales could benefit from either type of data, but studies that directly compare the two approaches are lacking. We compared phylogenies estimated from sequence capture and double digest RADseq (ddRADseq) data for North American phrynosomatid lizards, a species-rich and diverse group containing nine genera that began diversifying approximately 55 Ma. Sequence capture resulted in 584 loci that provided a consistent and strong phylogeny using concatenation and species tree inference. However, the phylogeny estimated from the ddRADseq data was sensitive to the bioinformatics steps used for determining homology, detecting paralogs, and filtering missing data. The topological conflicts among the SNP trees were not restricted to any particular timescale, but instead were associated with short internal branches. Species tree analysis of the largest SNP assembly, which also included the most missing data, supported a topology that matched the sequence capture tree. This preferred phylogeny provides strong support for the paraphyly of the earless lizard genera Holbrookia and Cophosaurus, suggesting that the earless morphology either evolved twice or evolved once and was subsequently lost in Callisaurus. PMID:25663487

  15. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    Science.gov (United States)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.

  16. Diagnostic accuracy of unenhanced, contrast-enhanced perfusion and angiographic MRI sequences for pulmonary embolism diagnosis: results of independent sequence readings

    Energy Technology Data Exchange (ETDEWEB)

    Revel, Marie Pierre [Hopital Europeen Georges Pompidou, APHP, Departments of Radiology, Paris (France); Universite Paris Descartes Sorbonne Paris Cite, Paris (France); Hotel-Dieu, Service de Radiologie, Paris (France); Sanchez, Olivier; Meyer, Guy [Hopital Europeen Georges Pompidou, APHP, Respiratory and intensive care and, Paris (France); Universite Paris Descartes Sorbonne Paris Cite, Paris (France); INSERM Unite 765, Paris (France); Lefort, Catherine; Couchon, Sophie; Hernigou, Anne; Frija, Guy [Hopital Europeen Georges Pompidou, APHP, Departments of Radiology, Paris (France); Niarra, Ralph [Hopital Europeen Georges Pompidou, APHP, Clinical Epidemiology, Paris (France); Universite Paris Descartes Sorbonne Paris Cite, Paris (France); Chatellier, Gilles [Hopital Europeen Georges Pompidou, APHP, Clinical Epidemiology, Paris (France); Universite Paris Descartes Sorbonne Paris Cite, Paris (France); INSERM CIC-EC E4, Paris (France)

    2013-09-15

    To independently evaluate unenhanced, contrast-enhanced perfusion and angiographic MR sequences for pulmonary embolism (PE) diagnosis. Prospective investigation, including 274 patients who underwent perfusion, unenhanced 2D steady-state-free-precession (SSFP) and contrast-enhanced 3D angiographic MR sequences on a 1.5-T unit, in addition to CTA (CT angiography). Two independent readers evaluated each sequence independently in random order. Sensitivity, specificity, predictive values and inter-reader agreement were calculated for each sequence, excluding sequences judged inconclusive. Sensitivity was also calculated according to PE location. Contrast-enhanced angiographic sequences showed the highest sensitivity (82.9 and 89.7 %, reader 1 and reader 2, respectively), specificity (98.5 and 100 %) and agreement (kappa value 0.77). Unenhanced angiographic sequences, although less sensitive overall (68.7 and 76.4 %), were sensitive for the detection of proximal PE (92.7 and 100 %) and showed high specificity (96.1 and 99.1 %) and good agreement (kappa value 0.62). Perfusion sequences showed lower sensitivity (75.0 and 79.3 %), specificity (84.8 and 89.7 %) and agreement (kappa value 0.51), and a negative predictive value of 84.8 % at best. Compared with contrast-enhanced angiographic sequences, unenhanced sequences demonstrate lower sensitivity, except for proximal PE, but high specificity and agreement. The negative predictive value of perfusion sequences was insufficient to safely rule out PE. (orig.)

  17. Enhanced throughput for infrared automated DNA sequencing

    Science.gov (United States)

    Middendorf, Lyle R.; Gartside, Bill O.; Humphrey, Pat G.; Roemer, Stephen C.; Sorensen, David R.; Steffens, David L.; Sutter, Scott L.

    1995-04-01

    Several enhancements have been developed and applied to infrared automated DNA sequencing resulting in significantly higher throughput. A 41 cm sequencing gel (31 cm well- to-read distance) combines high resolution of DNA sequencing fragments with optimized run times yielding two runs per day of 500 bases per sample. A 66 cm sequencing gel (56 cm well-to-read distance) produces sequence read lengths of up to 1000 bases for ds and ss templates using either T7 polymerase or cycle-sequencing protocols. Using a multichannel syringe to load 64 lanes allows 16 samples (compatible with 96-well format) to be visualized for each run. The 41 cm gel configuration allows 16,000 bases per day (16 samples X 500 bases/sample X 2 ten hour runs/day) to be sequenced with the advantages of infrared technology. Enhancements to internal labeling techniques using an infrared-labeled dATP molecule (Boehringer Mannheim GmbH, Penzberg, Germany; Sequenase (U.S. Biochemical) have also been made. The inclusion of glycerol in the sequencing reactions yields greatly improved results for some primer and template combinations. The inclusion of (alpha) -Thio-dNTP's in the labeling reaction increases signal intensity two- to three-fold.

  18. DSAP: deep-sequencing small RNA analysis pipeline.

    Science.gov (United States)

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

  19. Universal sequence replication, reversible polymerization and early functional biopolymers: a model for the initiation of prebiotic sequence evolution.

    Directory of Open Access Journals (Sweden)

    Sara Imari Walker

    Full Text Available Many models for the origin of life have focused on understanding how evolution can drive the refinement of a preexisting enzyme, such as the evolution of efficient replicase activity. Here we present a model for what was, arguably, an even earlier stage of chemical evolution, when polymer sequence diversity was generated and sustained before, and during, the onset of functional selection. The model includes regular environmental cycles (e.g. hydration-dehydration cycles that drive polymers between times of replication and functional activity, which coincide with times of different monomer and polymer diffusivity. Template-directed replication of informational polymers, which takes place during the dehydration stage of each cycle, is considered to be sequence-independent. New sequences are generated by spontaneous polymer formation, and all sequences compete for a finite monomer resource that is recycled via reversible polymerization. Kinetic Monte Carlo simulations demonstrate that this proposed prebiotic scenario provides a robust mechanism for the exploration of sequence space. Introduction of a polymer sequence with monomer synthetase activity illustrates that functional sequences can become established in a preexisting pool of otherwise non-functional sequences. Functional selection does not dominate system dynamics and sequence diversity remains high, permitting the emergence and spread of more than one functional sequence. It is also observed that polymers spontaneously form clusters in simulations where polymers diffuse more slowly than monomers, a feature that is reminiscent of a previous proposal that the earliest stages of life could have been defined by the collective evolution of a system-wide cooperation of polymer aggregates. Overall, the results presented demonstrate the merits of considering plausible prebiotic polymer chemistries and environments that would have allowed for the rapid turnover of monomer resources and for

  20. Sequence dependent aggregation of peptides and fibril formation

    Science.gov (United States)

    Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

    2017-09-01

    Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.

  1. The sequence of spacers between the consensus sequences modulates the strength of procaryotic promoters

    DEFF Research Database (Denmark)

    Jensen, Peter Ruhdal; Hammer, Karin

    1998-01-01

    A library of synthetic promoters for Lactococcus lactis was constructed, in which the known consensus sequences were kept constant while the sequences of the separating spacers were randomized. The library consists of 38 promoters which differ in strength from 0.3 relative units, and up to more t......-reactors and cell factories....

  2. Graphene nanodevices for DNA sequencing

    NARCIS (Netherlands)

    Heerema, S.J.; Dekker, C.

    2016-01-01

    Fast, cheap, and reliable DNA sequencing could be one of the most disruptive innovations of this decade, as it will pave the way for personalized medicine. In pursuit of such technology, a variety of nanotechnology-based approaches have been explored and established, including sequencing with

  3. WebPrInSeS: automated full-length clone sequence identification and verification using high-throughput sequencing data.

    Science.gov (United States)

    Massouras, Andreas; Decouttere, Frederik; Hens, Korneel; Deplancke, Bart

    2010-07-01

    High-throughput sequencing (HTS) is revolutionizing our ability to obtain cheap, fast and reliable sequence information. Many experimental approaches are expected to benefit from the incorporation of such sequencing features in their pipeline. Consequently, software tools that facilitate such an incorporation should be of great interest. In this context, we developed WebPrInSeS, a web server tool allowing automated full-length clone sequence identification and verification using HTS data. WebPrInSeS encompasses two separate software applications. The first is WebPrInSeS-C which performs automated sequence verification of user-defined open-reading frame (ORF) clone libraries. The second is WebPrInSeS-E, which identifies positive hits in cDNA or ORF-based library screening experiments such as yeast one- or two-hybrid assays. Both tools perform de novo assembly using HTS data from any of the three major sequencing platforms. Thus, WebPrInSeS provides a highly integrated, cost-effective and efficient way to sequence-verify or identify clones of interest. WebPrInSeS is available at http://webprinses.epfl.ch/ and is open to all users.

  4. Weak disorder in Fibonacci sequences

    Energy Technology Data Exchange (ETDEWEB)

    Ben-Naim, E [Theoretical Division and Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545 (United States); Krapivsky, P L [Department of Physics and Center for Molecular Cybernetics, Boston University, Boston, MA 02215 (United States)

    2006-05-19

    We study how weak disorder affects the growth of the Fibonacci series. We introduce a family of stochastic sequences that grow by the normal Fibonacci recursion with probability 1 - {epsilon}, but follow a different recursion rule with a small probability {epsilon}. We focus on the weak disorder limit and obtain the Lyapunov exponent that characterizes the typical growth of the sequence elements, using perturbation theory. The limiting distribution for the ratio of consecutive sequence elements is obtained as well. A number of variations to the basic Fibonacci recursion including shift, doubling and copying are considered. (letter to the editor)

  5. Sequencing at sea: challenges and experiences in Ion Torrent PGM sequencing during the 2013 Southern Line Islands Research Expedition

    Directory of Open Access Journals (Sweden)

    Yan Wei Lim

    2014-08-01

    Full Text Available Genomics and metagenomics have revolutionized our understanding of marine microbial ecology and the importance of microbes in global geochemical cycles. However, the process of DNA sequencing has always been an abstract extension of the research expedition, completed once the samples were returned to the laboratory. During the 2013 Southern Line Islands Research Expedition, we started the first effort to bring next generation sequencing to some of the most remote locations on our planet. We successfully sequenced twenty six marine microbial genomes, and two marine microbial metagenomes using the Ion Torrent PGM platform on the Merchant Yacht Hanse Explorer. Onboard sequence assembly, annotation, and analysis enabled us to investigate the role of the microbes in the coral reef ecology of these islands and atolls. This analysis identified phosphonate as an important phosphorous source for microbes growing in the Line Islands and reinforced the importance of L-serine in marine microbial ecosystems. Sequencing in the field allowed us to propose hypotheses and conduct experiments and further sampling based on the sequences generated. By eliminating the delay between sampling and sequencing, we enhanced the productivity of the research expedition. By overcoming the hurdles associated with sequencing on a boat in the middle of the Pacific Ocean we proved the flexibility of the sequencing, annotation, and analysis pipelines.

  6. Genome-wide SNP identification by high-throughput sequencing and selective mapping allows sequence assembly positioning using a framework genetic linkage map

    Directory of Open Access Journals (Sweden)

    Xu Xiangming

    2010-12-01

    Full Text Available Abstract Background Determining the position and order of contigs and scaffolds from a genome assembly within an organism's genome remains a technical challenge in a majority of sequencing projects. In order to exploit contemporary technologies for DNA sequencing, we developed a strategy for whole genome single nucleotide polymorphism sequencing allowing the positioning of sequence contigs onto a linkage map using the bin mapping method. Results The strategy was tested on a draft genome of the fungal pathogen Venturia inaequalis, the causal agent of apple scab, and further validated using sequence contigs derived from the diploid plant genome Fragaria vesca. Using our novel method we were able to anchor 70% and 92% of sequences assemblies for V. inaequalis and F. vesca, respectively, to genetic linkage maps. Conclusions We demonstrated the utility of this approach by accurately determining the bin map positions of the majority of the large sequence contigs from each genome sequence and validated our method by mapping single sequence repeat markers derived from sequence contigs on a full mapping population.

  7. Next-Generation Sequencing in the Mycology Lab.

    Science.gov (United States)

    Zoll, Jan; Snelders, Eveline; Verweij, Paul E; Melchers, Willem J G

    New state-of-the-art techniques in sequencing offer valuable tools in both detection of mycobiota and in understanding of the molecular mechanisms of resistance against antifungal compounds and virulence. Introduction of new sequencing platform with enhanced capacity and a reduction in costs for sequence analysis provides a potential powerful tool in mycological diagnosis and research. In this review, we summarize the applications of next-generation sequencing techniques in mycology.

  8. Finding the most significant common sequence and structure motifs in a set of RNA sequences

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Heyer, L.J.; Stormo, G.D.

    1997-01-01

    We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints, In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections, The first part utilizes a simplified...

  9. Comparing Whole-Genome Sequencing with Sanger Sequencing for spa Typing of Methicillin-Resistant Staphylococcus aureus

    DEFF Research Database (Denmark)

    Bartels, Mette Damkjaer; Petersen, Andreas; Worning, Peder

    2014-01-01

    spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and ...

  10. Single-copy genes define a conserved order between rice and wheat for understanding differences caused by duplication, deletion, and transposition of genes.

    Science.gov (United States)

    Singh, Nagendra K; Dalal, Vivek; Batra, Kamlesh; Singh, Binay K; Chitra, G; Singh, Archana; Ghazi, Irfan A; Yadav, Mahavir; Pandit, Awadhesh; Dixit, Rekha; Singh, Pradeep K; Singh, Harvinder; Koundal, Kirpa R; Gaikwad, Kishor; Mohapatra, Trilochan; Sharma, Tilak R

    2007-01-01

    The high-quality rice genome sequence is serving as a reference for comparative genome analysis in crop plants, especially cereals. However, early comparisons with bread wheat showed complex patterns of conserved synteny (gene content) and colinearity (gene order). Here, we show the presence of ancient duplicated segments in the progenitor of wheat, which were first identified in the rice genome. We also show that single-copy (SC) rice genes, those representing unique matches with wheat expressed sequence tag (EST) unigene contigs in the whole rice genome, show more than twice the proportion of genes mapping to syntenic wheat chromosome as compared to the multicopy (MC) or duplicated rice genes. While 58.7% of the 1,244 mapped SC rice genes were located in single syntenic wheat chromosome groups, the remaining 41.3% were distributed randomly to the other six non-syntenic wheat groups. This could only be explained by a background dispersal of genes in the genome through transposition or other unknown mechanism. The breakdown of rice-wheat synteny due to such transpositions was much greater near the wheat centromeres. Furthermore, the SC rice genes revealed a conserved primordial gene order that gives clues to the origin of rice and wheat chromosomes from a common ancestor through polyploidy, aneuploidy, centromeric fusions, and translocations. Apart from the bin-mapped wheat EST contigs, we also compared 56,298 predicted rice genes with 39,813 wheat EST contigs assembled from 409,765 EST sequences and identified 7,241 SC rice gene homologs of wheat. Based on the conserved colinearity of 1,063 mapped SC rice genes across the bins of individual wheat chromosomes, we predicted the wheat bin location of 6,178 unmapped SC rice gene homologs and validated the location of 213 of these in the telomeric bins of 21 wheat chromosomes with 35.4% initial success. This opens up the possibility of directed mapping of a large number of conserved SC rice gene homologs in wheat

  11. Electricity sequence control

    International Nuclear Information System (INIS)

    Shin, Heung Ryeol

    2010-03-01

    The contents of the book are introduction of control system, like classification and control signal, introduction of electricity power switch, such as push-button and detection switch sensor for induction type and capacitance type machinery for control, solenoid valve, expression of sequence and type of electricity circuit about using diagram, time chart, marking and term, logic circuit like Yes, No, and, or and equivalence logic, basic electricity circuit, electricity sequence control, added condition, special program control about choice and jump of program, motor control, extra circuit on repeat circuit, pause circuit in a conveyer, safety regulations and rule about classification of electricity disaster and protective device for insulation.

  12. High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform

    DEFF Research Database (Denmark)

    Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen; Rockenbauer, Eszter

    2011-01-01

    repeat units. These methods do not allow for the full resolution of STR base composition that sequencing approaches could provide. Here we present an STR profiling method based on the use of the Roche Genome Sequencer (GS) FLX to simultaneously sequence multiple core STR loci. Using this method...

  13. A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

    Science.gov (United States)

    Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

    2012-10-01

    One application of next-generation sequencing (NGS) is the targeted resequencing of interested genes which has not been used in viral integration site analysis of gene therapy applications. Here, we combined targeted sequence capture array and next generation sequencing to address the whole genome profiling of viral integration sites. Human 293T and K562 cells were transduced with a HIV-1 derived vector. A custom made DNA probe sets targeted pLVTHM vector used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using GS FLX platform. Seven thousand four hundred and eighty four human genome sequences flanking the long terminal repeats (LTR) of pLVTHM fragment sequences matched with an identity of at least 98% and minimum 50 bp criteria in both cells. In total, 203 unique integration sites were identified. The integrations in both cell lines were totally distant from the CpG islands and from the transcription start sites and preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two different cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.

  14. ϕ-statistically quasi Cauchy sequences

    Directory of Open Access Journals (Sweden)

    Bipan Hazarika

    2016-04-01

    Full Text Available Let P denote the space whose elements are finite sets of distinct positive integers. Given any element σ of P, we denote by p(σ the sequence {pn(σ} such that pn(σ=1 for n ∈ σ and pn(σ=0 otherwise. Further Ps={σ∈P:∑n=1∞pn(σ≤s}, i.e. Ps is the set of those σ whose support has cardinality at most s. Let (ϕn be a non-decreasing sequence of positive integers such that nϕn+1≤(n+1ϕn for all n∈N and the class of all sequences (ϕn is denoted by Φ. Let E⊆N. The number δϕ(E=lims→∞1ϕs|{k∈σ,σ∈Ps:k∈E}| is said to be the ϕ-density of E. A sequence (xn of points in R is ϕ-statistically convergent (or Sϕ-convergent to a real number ℓ for every ε > 0 if the set {n∈N:|xn−ℓ|≥ɛ} has ϕ-density zero. We introduce ϕ-statistically ward continuity of a real function. A real function is ϕ-statistically ward continuous if it preserves ϕ-statistically quasi Cauchy sequences where a sequence (xn is called to be ϕ-statistically quasi Cauchy (or Sϕ-quasi Cauchy when (Δxn=(xn+1−xn is ϕ-statistically convergent to 0. i.e. a sequence (xn of points in R is called ϕ-statistically quasi Cauchy (or Sϕ-quasi Cauchy for every ε > 0 if {n∈N:|xn+1−xn|≥ɛ} has ϕ-density zero. Also we introduce the concept of ϕ-statistically ward compactness and obtain results related to ϕ-statistically ward continuity, ϕ-statistically ward compactness, statistically ward continuity, ward continuity, ward compactness, ordinary compactness, uniform continuity, ordinary continuity, δ-ward continuity, and slowly oscillating continuity.

  15. FISH as A method for detection of radiation Induced genetic damage

    International Nuclear Information System (INIS)

    Lakatosova, M.; Holeckova, B.

    2006-01-01

    Fluorescence in situ hybridization (FISH) has been considered as a suitable method for rapid and easy detection of chromosome aberrations. In contrast to the standard conventional staining procedure, this technique enables the detection and specification of stable chromosomal re-arrangements, which are compatible with cellular division and thus, they could be transmitted from common ancestral to next cell generations. FISH chromosome - specific painting probes have been effectively applied for the detection of chromosomal damage after exposure to radiation. During last years, several specific fluorescent labeled probes were performed that allowed precise detection of centromeres, sub-telomeres or other regions (sequences) in genome. Our paper deals with describing of different types of FISH probes and their possibilities for application in radiobiology. (authors)

  16. In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

    Science.gov (United States)

    Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

    2011-01-01

    To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533

  17. Very high resolution single pass HLA genotyping using amplicon sequencing on the 454 next generation DNA sequencers: Comparison with Sanger sequencing.

    Science.gov (United States)

    Yamamoto, F; Höglund, B; Fernandez-Vina, M; Tyan, D; Rastrou, M; Williams, T; Moonsamy, P; Goodridge, D; Anderson, M; Erlich, H A; Holcomb, C L

    2015-12-01

    Compared to Sanger sequencing, next-generation sequencing offers advantages for high resolution HLA genotyping including increased throughput, lower cost, and reduced genotype ambiguity. Here we describe an enhancement of the Roche 454 GS GType HLA genotyping assay to provide very high resolution (VHR) typing, by the addition of 8 primer pairs to the original 14, to genotype 11 HLA loci. These additional amplicons help resolve common and well-documented alleles and exclude commonly found null alleles in genotype ambiguity strings. Simplification of workflow to reduce the initial preparation effort using early pooling of amplicons or the Fluidigm Access Array™ is also described. Performance of the VHR assay was evaluated on 28 well characterized cell lines using Conexio Assign MPS software which uses genomic, rather than cDNA, reference sequence. Concordance was 98.4%; 1.6% had no genotype assignment. Of concordant calls, 53% were unambiguous. To further assess the assay, 59 clinical samples were genotyped and results compared to unambiguous allele assignments obtained by prior sequence-based typing supplemented with SSO and/or SSP. Concordance was 98.7% with 58.2% as unambiguous calls; 1.3% could not be assigned. Our results show that the amplicon-based VHR assay is robust and can replace current Sanger methodology. Together with software enhancements, it has the potential to provide even higher resolution HLA typing. Copyright © 2015. Published by Elsevier Inc.

  18. Tournaments, oriented graphs and football sequences

    Directory of Open Access Journals (Sweden)

    Pirzada S.

    2017-08-01

    Full Text Available Consider the result of a soccer league competition where n teams play each other exactly once. A team gets three points for each win and one point for each draw. The total score obtained by each team vi is called the f-score of vi and is denoted by fi. The sequences of all f-scores [fi]i=1n$\\left[ {{\\rm{f}}_{\\rm{i}} } \\right]_{{\\rm{i}} = 1}^{\\rm{n}} $ arranged in non-decreasing order is called the f-score sequence of the competition. We raise the following problem: Which sequences of non-negative integers in non-decreasing order is a football sequence, that is the outcome of a soccer league competition. We model such a competition by an oriented graph with teams represented by vertices in which the teams play each other once, with an arc from team u to team v if and only if u defeats v. We obtain some necessary conditions for football sequences and some characterizations under restrictions.

  19. Geoseq: a tool for dissecting deep-sequencing datasets

    Directory of Open Access Journals (Sweden)

    Homann Robert

    2010-10-01

    Full Text Available Abstract Background Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO, Sequence Read Archive (SRA hosted by the NCBI, or the DNA Data Bank of Japan (ddbj. Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Results Geoseq http://geoseq.mssm.edu provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Conclusions Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to, a identify differential isoform expression in mRNA-seq datasets, b identify miRNAs (microRNAs in libraries, and identify mature and star sequences in miRNAS and c to identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.

  20. DNA sequence modeling based on context trees

    NARCIS (Netherlands)

    Kusters, C.J.; Ignatenko, T.; Roland, J.; Horlin, F.

    2015-01-01

    Genomic sequences contain instructions for protein and cell production. Therefore understanding and identification of biologically and functionally meaningful patterns in DNA sequences is of paramount importance. Modeling of DNA sequences in its turn can help to better understand and identify such

  1. A comparison of 454 sequencing and clonal sequencing for the characterization of hepatitis C virus NS3 variants

    NARCIS (Netherlands)

    Ho, Cynthia K. Y.; Welkers, Matthijs R. A.; Thomas, Xiomara V.; Sullivan, James C.; Kieffer, Tara L.; Reesink, Henk W.; Rebers, Sjoerd P. H.; de Jong, Menno D.; Schinkel, Janke; Molenkamp, Richard

    2015-01-01

    We compared 454 amplicon sequencing with clonal sequencing for the characterization of intra-host hepatitis C virus (HCV) NS3 variants. Clonal and 454 sequences were obtained from 12 patients enrolled in a clinical phase I study for telaprevir, an NS3-4a protease inhibitor. Thirty-nine datasets were

  2. Pairwise local structural alignment of RNA sequences with sequence similarity less than 40%

    DEFF Research Database (Denmark)

    Havgaard, Jakob Hull; Lyngsø, Rune B.; Stormo, Gary D.

    2005-01-01

    detect two genes with low sequence similarity, where the genes are part of a larger genomic region. Results: Here we present such an approach for pairwise local alignment which is based on FILDALIGN and the Sankoff algorithm for simultaneous structural alignment of multiple sequences. We include...... the ability to conduct mutual scans of two sequences of arbitrary length while searching for common local structural motifs of some maximum length. This drastically reduces the complexity of the algorithm. The scoring scheme includes structural parameters corresponding to those available for free energy....... The structure prediction performance for a family is typically around 0.7 using Matthews correlation coefficient. In case (2), the algorithm is successful at locating RNA families with an average sensitivity of 0.8 and a positive predictive value of 0.9 using a BLAST-like hit selection scheme. Availability...

  3. Low-Energy Electron-Induced Strand Breaks in Telomere-Derived DNA Sequences-Influence of DNA Sequence and Topology.

    Science.gov (United States)

    Rackwitz, Jenny; Bald, Ilko

    2018-03-26

    During cancer radiation therapy high-energy radiation is used to reduce tumour tissue. The irradiation produces a shower of secondary low-energy (DNA very efficiently by dissociative electron attachment. Recently, it was suggested that low-energy electron-induced DNA strand breaks strongly depend on the specific DNA sequence with a high sensitivity of G-rich sequences. Here, we use DNA origami platforms to expose G-rich telomere sequences to low-energy (8.8 eV) electrons to determine absolute cross sections for strand breakage and to study the influence of sequence modifications and topology of telomeric DNA on the strand breakage. We find that the telomeric DNA 5'-(TTA GGG) 2 is more sensitive to low-energy electrons than an intermixed sequence 5'-(TGT GTG A) 2 confirming the unique electronic properties resulting from G-stacking. With increasing length of the oligonucleotide (i.e., going from 5'-(GGG ATT) 2 to 5'-(GGG ATT) 4 ), both the variety of topology and the electron-induced strand break cross sections increase. Addition of K + ions decreases the strand break cross section for all sequences that are able to fold G-quadruplexes or G-intermediates, whereas the strand break cross section for the intermixed sequence remains unchanged. These results indicate that telomeric DNA is rather sensitive towards low-energy electron-induced strand breakage suggesting significant telomere shortening that can also occur during cancer radiation therapy. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Identification of 10 882 porcine microsatellite sequences and virtual mapping of 4528 of these sequences

    DEFF Research Database (Denmark)

    Karlskov-Mortensen, Peter; Hu, Z.L.; Gorodkin, Jan

    2007-01-01

    the human genome (BLAST cut-off threshold = 1 x 10-5). All microsatellite sequences placed on the comparative map are accessible at http://www.animalgenome.org/QTLdb/pig.html . These sequences increase the number of identified microsatellites in the porcine genome by several orders of magnitude...

  5. LPTAU, Quasi Random Sequence Generator

    International Nuclear Information System (INIS)

    Sobol, Ilya M.

    1993-01-01

    1 - Description of program or function: LPTAU generates quasi random sequences. These are uniformly distributed sets of L=M N points in the N-dimensional unit cube: I N =[0,1]x...x[0,1]. These sequences are used as nodes for multidimensional integration; as searching points in global optimization; as trial points in multi-criteria decision making; as quasi-random points for quasi Monte Carlo algorithms. 2 - Method of solution: Uses LP-TAU sequence generation (see references). 3 - Restrictions on the complexity of the problem: The number of points that can be generated is L 30 . The dimension of the space cannot exceed 51

  6. Main sequence mass loss

    International Nuclear Information System (INIS)

    Brunish, W.M.; Guzik, J.A.; Willson, L.A.; Bowen, G.

    1987-01-01

    It has been hypothesized that variable stars may experience mass loss, driven, at least in part, by oscillations. The class of stars we are discussing here are the δ Scuti variables. These are variable stars with masses between about 1.2 and 2.25 M/sub θ/, lying on or very near the main sequence. According to this theory, high rotation rates enhance the rate of mass loss, so main sequence stars born in this mass range would have a range of mass loss rates, depending on their initial rotation velocity and the amplitude of the oscillations. The stars would evolve rapidly down the main sequence until (at about 1.25 M/sub θ/) a surface convection zone began to form. The presence of this convective region would slow the rotation, perhaps allowing magnetic braking to occur, and thus sharply reduce the mass loss rate. 7 refs

  7. High-throughput sequence alignment using Graphics Processing Units

    Directory of Open Access Journals (Sweden)

    Trapnell Cole

    2007-12-01

    Full Text Available Abstract Background The recent availability of new, less expensive high-throughput DNA sequencing technologies has yielded a dramatic increase in the volume of sequence data that must be analyzed. These data are being generated for several purposes, including genotyping, genome resequencing, metagenomics, and de novo genome assembly projects. Sequence alignment programs such as MUMmer have proven essential for analysis of these data, but researchers will need ever faster, high-throughput alignment tools running on inexpensive hardware to keep up with new sequence technologies. Results This paper describes MUMmerGPU, an open-source high-throughput parallel pairwise local sequence alignment program that runs on commodity Graphics Processing Units (GPUs in common workstations. MUMmerGPU uses the new Compute Unified Device Architecture (CUDA from nVidia to align multiple query sequences against a single reference sequence stored as a suffix tree. By processing the queries in parallel on the highly parallel graphics card, MUMmerGPU achieves more than a 10-fold speedup over a serial CPU version of the sequence alignment kernel, and outperforms the exact alignment component of MUMmer on a high end CPU by 3.5-fold in total application time when aligning reads from recent sequencing projects using Solexa/Illumina, 454, and Sanger sequencing technologies. Conclusion MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies. MUMmerGPU demonstrates that even memory-intensive applications can run significantly faster on the relatively low-cost GPU than on the CPU.

  8. Sequencing intractable DNA to close microbial genomes.

    Directory of Open Access Journals (Sweden)

    Richard A Hurt

    Full Text Available Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps and the Desulfovibrio africanus genome (1 intractable gap. The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  9. Sequencing Intractable DNA to Close Microbial Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Hurt, Jr., Richard Ashley [ORNL; Brown, Steven D [ORNL; Podar, Mircea [ORNL; Palumbo, Anthony Vito [ORNL; Elias, Dwayne A [ORNL

    2012-01-01

    Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled intractable resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such difficult regions in the non-contiguous finished Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. These developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.

  10. Commercial Art: Scope and Sequence.

    Science.gov (United States)

    Nashville - Davidson County Metropolitan Public Schools, TN.

    This scope and sequence guide, developed for a commercial art vocational education program, represents an initial step in the development of a systemwide articulated curriculum sequence for all vocational programs within the Metropolitan Nashville Public School System. It was developed as a result of needs expressed by teachers, parents, and the…

  11. Divide and conquer: enriching environmental sequencing data.

    Directory of Open Access Journals (Sweden)

    Anne Bergeron

    2007-09-01

    Full Text Available In environmental sequencing projects, a mix of DNA from a whole microbial community is fragmented and sequenced, with one of the possible goals being to reconstruct partial or complete genomes of members of the community. In communities with high diversity of species, a significant proportion of the sequences do not overlap any other fragment in the sample. This problem will arise not only in situations with a relatively even distribution of many species, but also when the community in a particular environment is routinely dominated by the same few species. In the former case, no genomes may be assembled at all, while in the latter case a few dominant species in an environment will always be sequenced at high coverage to the detriment of coverage of the greater number of sparse species.Here we show that, with the same global sequencing effort, separating the species into two or more sub-communities prior to sequencing can yield a much higher proportion of sequences that can be assembled. We first use the Lander-Waterman model to show that, if the expected percentage of singleton sequences is higher than 25%, then, under the uniform distribution hypothesis, splitting the community is always a wise choice. We then construct simulated microbial communities to show that the results hold for highly non-uniform distributions. We also show that, for the distributions considered in the experiments, it is possible to estimate quite accurately the relative diversity of the two sub-communities.Given the fact that several methods exist to split microbial communities based on physical properties such as size, density, surface biochemistry, or optical properties, we strongly suggest that groups involved in environmental sequencing, and expecting high diversity, consider splitting their communities in order to maximize the information content of their sequencing effort.

  12. How Next-Generation Sequencing Has Aided Our Understanding of the Sequence Composition and Origin of B Chromosomes

    Directory of Open Access Journals (Sweden)

    Alevtina Ruban

    2017-10-01

    Full Text Available Accessory, supernumerary, or—most simply—B chromosomes, are found in many eukaryotic karyotypes. These small chromosomes do not follow the usual pattern of segregation, but rather are transmitted in a higher than expected frequency. As increasingly being demonstrated by next-generation sequencing (NGS, their structure comprises fragments of standard (A chromosomes, although in some plant species, their sequence also includes contributions from organellar genomes. Transcriptomic analyses of various animal and plant species have revealed that, contrary to what used to be the common belief, some of the B chromosome DNA is protein-encoding. This review summarizes the progress in understanding B chromosome biology enabled by the application of next-generation sequencing technology and state-of-the-art bioinformatics. In particular, a contrast is drawn between a direct sequencing approach and a strategy based on a comparative genomics as alternative routes that can be taken towards the identification of B chromosome sequences.

  13. Management of High-Throughput DNA Sequencing Projects: Alpheus.

    Science.gov (United States)

    Miller, Neil A; Kingsmore, Stephen F; Farmer, Andrew; Langley, Raymond J; Mudge, Joann; Crow, John A; Gonzalez, Alvaro J; Schilkey, Faye D; Kim, Ryan J; van Velkinburgh, Jennifer; May, Gregory D; Black, C Forrest; Myers, M Kathy; Utsey, John P; Frost, Nicholas S; Sugarbaker, David J; Bueno, Raphael; Gullans, Stephen R; Baxter, Susan M; Day, Steve W; Retzel, Ernest F

    2008-12-26

    High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multi-gigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.

  14. Sequencing and comparing whole mitochondrial genomes ofanimals

    Energy Technology Data Exchange (ETDEWEB)

    Boore, Jeffrey L.; Macey, J. Robert; Medina, Monica

    2005-04-22

    Comparing complete animal mitochondrial genome sequences is becoming increasingly common for phylogenetic reconstruction and as a model for genome evolution. Not only are they much more informative than shorter sequences of individual genes for inferring evolutionary relatedness, but these data also provide sets of genome-level characters, such as the relative arrangements of genes, that can be especially powerful. We describe here the protocols commonly used for physically isolating mtDNA, for amplifying these by PCR or RCA, for cloning,sequencing, assembly, validation, and gene annotation, and for comparing both sequences and gene arrangements. On several topics, we offer general observations based on our experiences to date with determining and comparing complete mtDNA sequences.

  15. Comparison of next generation sequencing technologies for transcriptome characterization

    Directory of Open Access Journals (Sweden)

    Soltis Douglas E

    2009-08-01

    Full Text Available Abstract Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19. We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica and the magnoliid avocado (Persea americana using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB, 119,518 (88.7% mapped exactly to known exons, while 1,117 (0.8% mapped to introns, 11,524 (8.6% spanned annotated intron/exon boundaries, and 3,066 (2.3% extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance

  16. Spatio-temporal alignment of pedobarographic image sequences.

    Science.gov (United States)

    Oliveira, Francisco P M; Sousa, Andreia; Santos, Rubim; Tavares, João Manuel R S

    2011-07-01

    This article presents a methodology to align plantar pressure image sequences simultaneously in time and space. The spatial position and orientation of a foot in a sequence are changed to match the foot represented in a second sequence. Simultaneously with the spatial alignment, the temporal scale of the first sequence is transformed with the aim of synchronizing the two input footsteps. Consequently, the spatial correspondence of the foot regions along the sequences as well as the temporal synchronizing is automatically attained, making the study easier and more straightforward. In terms of spatial alignment, the methodology can use one of four possible geometric transformation models: rigid, similarity, affine, or projective. In the temporal alignment, a polynomial transformation up to the 4th degree can be adopted in order to model linear and curved time behaviors. Suitable geometric and temporal transformations are found by minimizing the mean squared error (MSE) between the input sequences. The methodology was tested on a set of real image sequences acquired from a common pedobarographic device. When used in experimental cases generated by applying geometric and temporal control transformations, the methodology revealed high accuracy. In addition, the intra-subject alignment tests from real plantar pressure image sequences showed that the curved temporal models produced better MSE results (P alignment of pedobarographic image data, since previous methods can only be applied on static images.

  17. Intra-species sequence comparisons for annotating genomes

    Energy Technology Data Exchange (ETDEWEB)

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  18. Flow cytometry measurements of human chromosome kinetochore labeling

    International Nuclear Information System (INIS)

    Fantes, J.A.; Green, D.K.; Malloy, P.; Sumner, A.T.

    1989-01-01

    A method for the preparation and measurement of immunofluorescent human chromosome centromeres in suspension is described using CREST antibodies, which bind to the centromeric region of chromosomes. Fluorescein isothiocyanate (FITC)-conjugated antihuman antibodies provide the fluorescent label. Labeled chromosomes are examined on microscope slides and by flow cytometry. In both cases a dye which binds to DNA is added to provide identification of the chromosome groups. Sera from different CREST patients vary in their ability to bind to chromosome arms in addition to the centromeric region. Flow cytometry and microfluorimetry measurements have shown that with a given CREST serum the differences in kinetochore fluorescence between chromosomes are only minor. Flow cytometry experiments to relate the number of dicentric chromosomes, induced by in vitro radiation of peripheral blood cells to the slightly increased number of chromosomes with above-average kinetochore fluorescence did not produce decisive radiation dosimetry results

  19. Cytogenetic analysis of some Brazilian marsupials (Didelphidae

    DEFF Research Database (Denmark)

    Casartelli, C; Rogatto, S R; Ferrari, I

    1986-01-01

    Three species of marsupials from the Amazon region (Marmosa cinerea, Caluromys lanatus, and Didelphis marsupialis) and two from the region of São Paulo (Didelphis marsupialis and Didelphis albiventris) were studied. The G-banding pattern of the species with 2n = 14 (M. cinerea and C. lanatus......, being located in the telomeric region of one pair and in the centromeric region of the other pair. Caluromys lanatus has centromeric Ag-NORs in one acrocentric and in one submetacentric chromosome pairs. Didelphis marsupialis has three chromosome pairs with telomeric Ag-NORs. In D. albiventris the Ag......) was very similar, as well as the pattern of G-bands in the species with 22 chromosomes (Didelphis). All of the autosomes of M. cinerea and D. albiventris have centromeric C-bands and the Y chromosome is totally C-band positive. The long arm of the M. cinerea X chromosome is completely C-band positive...

  20. Information decomposition method to analyze symbolical sequences

    International Nuclear Information System (INIS)

    Korotkov, E.V.; Korotkova, M.A.; Kudryashov, N.A.

    2003-01-01

    The information decomposition (ID) method to analyze symbolical sequences is presented. This method allows us to reveal a latent periodicity of any symbolical sequence. The ID method is shown to have advantages in comparison with application of the Fourier transformation, the wavelet transform and the dynamic programming method to look for latent periodicity. Examples of the latent periods for poetic texts, DNA sequences and amino acids are presented. Possible origin of a latent periodicity for different symbolical sequences is discussed

  1. Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

    Science.gov (United States)

    Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

    2012-08-01

    Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.

  2. A 28,000 Years Old Cro-Magnon mtDNA Sequence Differs from All Potentially Contaminating Modern Sequences

    Science.gov (United States)

    Caramelli, David; Milani, Lucio; Vai, Stefania; Modi, Alessandra; Pecchioli, Elena; Girardi, Matteo; Pilli, Elena; Lari, Martina; Lippi, Barbara; Ronchitelli, Annamaria; Mallegni, Francesco; Casoli, Antonella; Bertorelle, Giorgio; Barbujani, Guido

    2008-01-01

    Background DNA sequences from ancient speciments may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal) and early modern (Cro-Magnoid) Europeans. Methodology/Principal Findings We typed the mitochondrial DNA (mtDNA) hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23) and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. Conclusions/Significance: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans. PMID:18628960

  3. A 28,000 years old Cro-Magnon mtDNA sequence differs from all potentially contaminating modern sequences.

    Directory of Open Access Journals (Sweden)

    David Caramelli

    Full Text Available BACKGROUND: DNA sequences from ancient specimens may in fact result from undetected contamination of the ancient specimens by modern DNA, and the problem is particularly challenging in studies of human fossils. Doubts on the authenticity of the available sequences have so far hampered genetic comparisons between anatomically archaic (Neandertal and early modern (Cro-Magnoid Europeans. METHODOLOGY/PRINCIPAL FINDINGS: We typed the mitochondrial DNA (mtDNA hypervariable region I in a 28,000 years old Cro-Magnoid individual from the Paglicci cave, in Italy (Paglicci 23 and in all the people who had contact with the sample since its discovery in 2003. The Paglicci 23 sequence, determined through the analysis of 152 clones, is the Cambridge reference sequence, and cannot possibly reflect contamination because it differs from all potentially contaminating modern sequences. CONCLUSIONS/SIGNIFICANCE: The Paglicci 23 individual carried a mtDNA sequence that is still common in Europe, and which radically differs from those of the almost contemporary Neandertals, demonstrating a genealogical continuity across 28,000 years, from Cro-Magnoid to modern Europeans. Because all potential sources of modern DNA contamination are known, the Paglicci 23 sample will offer a unique opportunity to get insight for the first time into the nuclear genes of early modern Europeans.

  4. Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.

    Science.gov (United States)

    Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay

    2013-01-01

    Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing has encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms, however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage which are known to impact genome assembly are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E.coli (4.6 MB) S.kudriavzevii (11.18 MB) and C.elegans (100 MB) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing which will enable optimum utilization of sequencing as well as analysis resources.

  5. Identification of genomic insertion and flanking sequence of G2-EPSPS and GAT transgenes in soybean using whole genome sequencing method

    Directory of Open Access Journals (Sweden)

    Bingfu Guo

    2016-07-01

    Full Text Available Molecular characterization of sequences flanking exogenous fragment insertions is essential for safety assessment and labeling of genetically modified organisms (GMO. In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans GE-J16 and ZH10-6 based on whole genome sequencing (WGS method. About 21 Gb sequence data (~21× coverage for each line was generated on Illumina HiSeq 2500 platform. The junction reads mapped to boundary of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with soybean reference genome and sequence of transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of the genomic insertion site of the G2-EPSPS and GAT transgenes will facilitate the use of their glyphosate-tolerant traits in soybean breeding program. These results also demonstrated that WGS is a cost-effective and rapid method of identifying sites of T-DNA insertions and flanking sequences in soybean.

  6. Leaf sequencing algorithms for segmented multileaf collimation

    International Nuclear Information System (INIS)

    Kamath, Srijit; Sahni, Sartaj; Li, Jonathan; Palta, Jatinder; Ranka, Sanjay

    2003-01-01

    The delivery of intensity-modulated radiation therapy (IMRT) with a multileaf collimator (MLC) requires the conversion of a radiation fluence map into a leaf sequence file that controls the movement of the MLC during radiation delivery. It is imperative that the fluence map delivered using the leaf sequence file is as close as possible to the fluence map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf sequencing algorithms for segmental multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under most common leaf movement constraints that include minimum leaf separation constraint and leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bidirectional movement of the MLC leaves

  7. Leaf sequencing algorithms for segmented multileaf collimation

    Energy Technology Data Exchange (ETDEWEB)

    Kamath, Srijit [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States); Sahni, Sartaj [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States); Li, Jonathan [Department of Radiation Oncology, University of Florida, Gainesville, FL (United States); Palta, Jatinder [Department of Radiation Oncology, University of Florida, Gainesville, FL (United States); Ranka, Sanjay [Department of Computer and Information Science and Engineering, University of Florida, Gainesville, FL (United States)

    2003-02-07

    The delivery of intensity-modulated radiation therapy (IMRT) with a multileaf collimator (MLC) requires the conversion of a radiation fluence map into a leaf sequence file that controls the movement of the MLC during radiation delivery. It is imperative that the fluence map delivered using the leaf sequence file is as close as possible to the fluence map generated by the dose optimization algorithm, while satisfying hardware constraints of the delivery system. Optimization of the leaf sequencing algorithm has been the subject of several recent investigations. In this work, we present a systematic study of the optimization of leaf sequencing algorithms for segmental multileaf collimator beam delivery and provide rigorous mathematical proofs of optimized leaf sequence settings in terms of monitor unit (MU) efficiency under most common leaf movement constraints that include minimum leaf separation constraint and leaf interdigitation constraint. Our analytical analysis shows that leaf sequencing based on unidirectional movement of the MLC leaves is as MU efficient as bidirectional movement of the MLC leaves.

  8. Genome Sequencing and Analysis Conference IV

    Energy Technology Data Exchange (ETDEWEB)

    1993-12-31

    J. Craig Venter and C. Thomas Caskey co-chaired Genome Sequencing and Analysis Conference IV held at Hilton Head, South Carolina from September 26--30, 1992. Venter opened the conference by noting that approximately 400 researchers from 16 nations were present four times as many participants as at Genome Sequencing Conference I in 1989. Venter also introduced the Data Fair, a new component of the conference allowing exchange and on-site computer analysis of unpublished sequence data.

  9. Sequencing BPS spectra

    Energy Technology Data Exchange (ETDEWEB)

    Gukov, Sergei [Walter Burke Institute for Theoretical Physics, California Institute of Technology,1200 E California Blvd, Pasadena, CA 91125 (United States); Max-Planck-Institut für Mathematik,Vivatsgasse 7, D-53111 Bonn (Germany); Nawata, Satoshi [Walter Burke Institute for Theoretical Physics, California Institute of Technology,1200 E California Blvd, Pasadena, CA 91125 (United States); Centre for Quantum Geometry of Moduli Spaces, University of Aarhus,Nordre Ringgade 1, DK-8000 (Denmark); Saberi, Ingmar [Walter Burke Institute for Theoretical Physics, California Institute of Technology,1200 E California Blvd, Pasadena, CA 91125 (United States); Stošić, Marko [CAMGSD, Departamento de Matemática, Instituto Superior Técnico,Av. Rovisco Pais, 1049-001 Lisbon (Portugal); Mathematical Institute SANU,Knez Mihajlova 36, 11000 Belgrade (Serbia); Sułkowski, Piotr [Walter Burke Institute for Theoretical Physics, California Institute of Technology,1200 E California Blvd, Pasadena, CA 91125 (United States); Faculty of Physics, University of Warsaw,ul. Pasteura 5, 02-093 Warsaw (Poland)

    2016-03-02

    This paper provides both a detailed study of color-dependence of link homologies, as realized in physics as certain spaces of BPS states, and a broad study of the behavior of BPS states in general. We consider how the spectrum of BPS states varies as continuous parameters of a theory are perturbed. This question can be posed in a wide variety of physical contexts, and we answer it by proposing that the relationship between unperturbed and perturbed BPS spectra is described by a spectral sequence. These general considerations unify previous applications of spectral sequence techniques to physics, and explain from a physical standpoint the appearance of many spectral sequences relating various link homology theories to one another. We also study structural properties of colored HOMFLY homology for links and evaluate Poincaré polynomials in numerous examples. Among these structural properties is a novel “sliding” property, which can be explained by using (refined) modular S-matrix. This leads to the identification of modular transformations in Chern-Simons theory and 3d N=2 theory via the 3d/3d correspondence. Lastly, we introduce the notion of associated varieties as classical limits of recursion relations of colored superpolynomials of links, and study their properties.

  10. Sequencing BPS spectra

    International Nuclear Information System (INIS)

    Gukov, Sergei; Nawata, Satoshi; Saberi, Ingmar; Stošić, Marko; Sułkowski, Piotr

    2016-01-01

    This paper provides both a detailed study of color-dependence of link homologies, as realized in physics as certain spaces of BPS states, and a broad study of the behavior of BPS states in general. We consider how the spectrum of BPS states varies as continuous parameters of a theory are perturbed. This question can be posed in a wide variety of physical contexts, and we answer it by proposing that the relationship between unperturbed and perturbed BPS spectra is described by a spectral sequence. These general considerations unify previous applications of spectral sequence techniques to physics, and explain from a physical standpoint the appearance of many spectral sequences relating various link homology theories to one another. We also study structural properties of colored HOMFLY homology for links and evaluate Poincaré polynomials in numerous examples. Among these structural properties is a novel “sliding” property, which can be explained by using (refined) modular S-matrix. This leads to the identification of modular transformations in Chern-Simons theory and 3d N=2 theory via the 3d/3d correspondence. Lastly, we introduce the notion of associated varieties as classical limits of recursion relations of colored superpolynomials of links, and study their properties.

  11. On peculiar Šindel sequences

    Czech Academy of Sciences Publication Activity Database

    Křížek, Michal; Somer, L.

    2010-01-01

    Roč. 17, č. 2 (2010), s. 129-140 ISSN 0972-5555 R&D Projects: GA AV ČR(CZ) IAA100190803 Institutional research plan: CEZ:AV0Z10190503 Keywords : quadratic residue * Chinese remainder theorem * primitive Šindel sequences * Prague clock sequence Subject RIV: BA - General Mathematics http://www.pphmj.com/abstract/5095.htm

  12. Fractals in DNA sequence analysis

    Institute of Scientific and Technical Information of China (English)

    Yu Zu-Guo(喻祖国); Vo Anh; Gong Zhi-Min(龚志民); Long Shun-Chao(龙顺潮)

    2002-01-01

    Fractal methods have been successfully used to study many problems in physics, mathematics, engineering, finance,and even in biology. There has been an increasing interest in unravelling the mysteries of DNA; for example, how can we distinguish coding and noncoding sequences, and the problems of classification and evolution relationship of organisms are key problems in bioinformatics. Although much research has been carried out by taking into consideration the long-range correlations in DNA sequences, and the global fractal dimension has been used in these works by other people, the models and methods are somewhat rough and the results are not satisfactory. In recent years, our group has introduced a time series model (statistical point of view) and a visual representation (geometrical point of view)to DNA sequence analysis. We have also used fractal dimension, correlation dimension, the Hurst exponent and the dimension spectrum (multifractal analysis) to discuss problems in this field. In this paper, we introduce these fractal models and methods and the results of DNA sequence analysis.

  13. Attentional load and implicit sequence learning.

    Science.gov (United States)

    Shanks, David R; Rowland, Lee A; Ranger, Mandeep S

    2005-06-01

    A widely employed conceptualization of implicit learning hypothesizes that it makes minimal demands on attentional resources. This conjecture was investigated by comparing learning under single-task and dual-task conditions in the sequential reaction time (SRT) task. Participants learned probabilistic sequences, with dual-task participants additionally having to perform a counting task using stimuli that were targets in the SRT display. Both groups were then tested for sequence knowledge under single-task (Experiments 1 and 2) or dual-task (Experiment 3) conditions. Participants also completed a free generation task (Experiments 2 and 3) under inclusion or exclusion conditions to determine if sequence knowledge was conscious or unconscious in terms of its access to intentional control. The experiments revealed that the secondary task impaired sequence learning and that sequence knowledge was consciously accessible. These findings disconfirm both the notion that implicit learning is able to proceed normally under conditions of divided attention, and that the acquired knowledge is inaccessible to consciousness. A unitary framework for conceptualizing implicit and explicit learning is proposed.

  14. Sequence of human protamine 2 cDNA

    Energy Technology Data Exchange (ETDEWEB)

    Domenjoud, L; Fronia, C; Uhde, F; Engel, W [Universitaet Goettingen (West Germany)

    1988-08-11

    The authors report the cloning and sequencing of a cDNA clone for human protamine 2 (hp2), isolated from a human testis cDNA library cloned in the vector {lambda}-gt11. A 66mer oligonucleotide, that corresponds to an amino acid sequence which is highly conserved between hp2 and mouse protamine 2 (mp2) served as hybridization probe. The homology between the amino acid sequence deduced from our cDNA and the published amino acid sequence for hp2 is 100%.

  15. Snake Genome Sequencing: Results and Future Prospects.

    Science.gov (United States)

    Kerkkamp, Harald M I; Kini, R Manjunatha; Pospelov, Alexey S; Vonk, Freek J; Henkel, Christiaan V; Richardson, Michael K

    2016-12-01

    Snake genome sequencing is in its infancy-very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.

  16. Snake Genome Sequencing: Results and Future Prospects

    Directory of Open Access Journals (Sweden)

    Harald M. I. Kerkkamp

    2016-12-01

    Full Text Available Snake genome sequencing is in its infancy—very much behind the progress made in sequencing the genomes of humans, model organisms and pathogens relevant to biomedical research, and agricultural species. We provide here an overview of some of the snake genome projects in progress, and discuss the biological findings, with special emphasis on toxinology, from the small number of draft snake genomes already published. We discuss the future of snake genomics, pointing out that new sequencing technologies will help overcome the problem of repetitive sequences in assembling snake genomes. Genome sequences are also likely to be valuable in examining the clustering of toxin genes on the chromosomes, in designing recombinant antivenoms and in studying the epigenetic regulation of toxin gene expression.

  17. High-Throughput Next-Generation Sequencing of Polioviruses

    Science.gov (United States)

    Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.

    2016-01-01

    ABSTRACT The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929

  18. Karyotypic diversity in four species of the genus Gymnotus Linnaeus, 1758 (Teleostei, Gymnotiformes, Gymnotidae): physical mapping of ribosomal genes and telomeric sequences

    Science.gov (United States)

    Scacchetti, Priscilla Cardim; Pansonato-Alves, José Carlos; Utsunomia, Ricardo; Oliveira, Claudio; Foresti, Fausto

    2011-01-01

    Abstract Conventional (Giemsa, C-Banding, Ag-NORs, CMA3) and molecular (5S rDNA, 18S rDNA, telomeric sequences) cytogenetic studies were carried out in specimens of ten distinct fish populations of the genus Gymnotus (Gymnotus sylvius Albert and Fernandes-Matioli, 1999, Gymnotus inaequilabiatus Valenciennes, 1839, Gymnotus pantherinus Steindachner, 1908, and G. cf. carapo Linnaeus, 1758) from different Brazilian hydrographic basins. Gymnotus sylvius presented a diploid number of 40 chromosomes (22m+12sm+6st), Gymnotus pantherinus presented 52 chromosomes (32m+18sm+2st), while Gymnotus inaequilabiatus (42m+10sm+2a)and Gymnotus cf. carapo (38m+12sm+4st) presented 54 chromosomes. The C-banding technique revealed centromeric marks in all chromosomes of all species. Besides that, conspicuous blocks of heterochromatin were found interstitially on the chromosomes of Gymnotus inaequilabiatus, Gymnotus cf. carapo,and Gymnotus pantherinus. All four species showed single nucleolus organizing regions confirmed by results obtained through Ag-NORs and FISH experiments using 18S rDNA probes, which showed the NORs localized on the first chromosome pair in Gymnotus inaequilabiatus, Gymnotus cf. carapo,and Gymnotus pantherinus, and on pair 2 in Gymnotus sylvius. CMA3 staining revealed additional unrelated NORs marks in Gymnotus sylvius and Gymnotus pantherinus. The 5S rDNA probes revealed signals on one pair in Gymnotus sylvius and two pairs in Gymnotus pantherinus; Gymnotus inaequilabiatus had about seventeen pairs marked, and Gymnotus cf. carapo had about fifteen pairs marked. It is considered that the high amount of heterochromatin identified in the chromosomes of Gymnotus inaequilabiatus and Gymnotus cf. carapo could have facilitated the dispersion of 5S rDNA in these species. Interstitial signals were detected on the first metacentric pair of Gymnotus sylvius by telomeric probes (TTAGGG)n indicating the possible occurrence of chromosomal fusions in this species. The present

  19. Karyotypic diversity in four species of the genus Gymnotus Linnaeus, 1758 (Teleostei, Gymnotiformes, Gymnotidae: physical mapping of ribosomal genes and telomeric sequences

    Directory of Open Access Journals (Sweden)

    Priscilla Scacchetti

    2011-08-01

    Full Text Available Conventional (Giemsa, C-Banding, Ag-NORs, CMA3 and molecular (5S rDNA, 18S rDNA, telomeric sequences cytogenetic studies were carried out in specimens of ten distinct fish populations of the genus Gymnotus (G. sylvius Albert and Fernandes-Matioli, 1999, G. inaequilabiatus Valenciennes, 1839, G. pantherinus Steindachner, 1908, and G. cf. carapo Linnaeus, 1758 from different Brazilian hydrographic basins. G. sylvius presented a diploid number of 40 chromosomes (22m+12sm+6st, G. pantherinus presented 52 chromosomes (32m+18sm+2st, while G. inaequilabiatus (42m+10sm+2a and G. cf. carapo (38m+12sm+4st presented 54 chromosomes. The C-banding technique revealed centromeric marks in all chromosomes of all species. Besides that, conspicuous blocks of heterochromatin were found interstitially on the chromosomes of G. inaequilabiatus, G. cf. carapo, and G. pantherinus. All four species showed single nucleolus organizing regions confirmed by results obtained through Ag-NORs and FISH experiments using 18S rDNA probes, which showed the NORs localized on the first chromosome pair in G. inaequilabiatus, G. cf. carapo, and G. pantherinus, and on pair 2 in G. sylvius. CMA3 staining revealed additional unrelated NORs marks in G. sylvius and G. pantherinus. The 5S rDNA probes revealed signals on one pair in G. sylvius and two pairs in G. pantherinus; G. inaequilabiatus had about seventeen pairs marked, and G. cf. carapo had about fifteen pairs marked. It is considered that the high amount of heterochromatin identified in the chromosomes of G. inaequilabiatus and G. cf. carapo could have facilitated the dispersion of 5S rDNA in these species. Interstitial signals were detected on the first metacentric pair of G. sylvius by telomeric probes (TTAGGGn indicating the possible occurrence of chromosomal fusions in this species. The present study reveals valuable cytotaxonomic markers for this group and allows a more precise evaluation of the processes involved in the

  20. Sequence determinants of human microsatellite variability

    Directory of Open Access Journals (Sweden)

    Jakobsson Mattias

    2009-12-01

    Full Text Available Abstract Background Microsatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database. Results Calibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length, under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity. Conclusions These results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.

  1. Sequence comparison and phylogenetic analysis of core gene of ...

    African Journals Online (AJOL)

    Phylogenetic analysis suggests that our sequences are clustered with sequences reported from Japan. This is the first phylogenetic analysis of HCV core gene from Pakistani population. Our sequences and sequences from Japan are grouped into same cluster in the phylogenetic tree. Sequence comparison and ...

  2. Application of genotyping-by-sequencing on semiconductor sequencing platforms: a comparison of genetic and reference-based marker ordering in barley.

    Directory of Open Access Journals (Sweden)

    Martin Mascher

    Full Text Available The rapid development of next-generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS, a low-cost, reduced representation sequencing method, is becoming a common approach for whole-genome marker profiling in many species. With quickly developing sequencing technologies, adapting current GBS methodologies to new platforms will leverage these advancements for future studies. To test new semiconductor sequencing platforms for GBS, we genotyped a barley recombinant inbred line (RIL population. Based on a previous GBS approach, we designed bar code and adapter sets for the Ion Torrent platforms. Four sets of 24-plex libraries were constructed consisting of 94 RILs and the two parents and sequenced on two Ion platforms. In parallel, a 96-plex library of the same RILs was sequenced on the Illumina HiSeq 2000. We applied two different computational pipelines to analyze sequencing data; the reference-independent TASSEL pipeline and a reference-based pipeline using SAMtools. Sequence contigs positioned on the integrated physical and genetic map were used for read mapping and variant calling. We found high agreement in genotype calls between the different platforms and high concordance between genetic and reference-based marker order. There was, however, paucity in the number of SNP that were jointly discovered by the different pipelines indicating a strong effect of alignment and filtering parameters on SNP discovery. We show the utility of the current barley genome assembly as a framework for developing very low-cost genetic maps, facilitating high resolution genetic mapping and negating the need for developing de novo genetic maps for future studies in barley. Through demonstration of GBS on semiconductor sequencing platforms, we conclude that the GBS approach is amenable to a range of platforms and can easily be modified as new

  3. HPV-QUEST: A highly customized system for automated HPV sequence analysis capable of processing Next Generation sequencing data set.

    Science.gov (United States)

    Yin, Li; Yao, Jiqiang; Gardner, Brent P; Chang, Kaifen; Yu, Fahong; Goodenow, Maureen M

    2012-01-01

    Next Generation sequencing (NGS) applied to human papilloma viruses (HPV) can provide sensitive methods to investigate the molecular epidemiology of multiple type HPV infection. Currently a genotyping system with a comprehensive collection of updated HPV reference sequences and a capacity to handle NGS data sets is lacking. HPV-QUEST was developed as an automated and rapid HPV genotyping system. The web-based HPV-QUEST subtyping algorithm was developed using HTML, PHP, Perl scripting language, and MYSQL as the database backend. HPV-QUEST includes a database of annotated HPV reference sequences with updated nomenclature covering 5 genuses, 14 species and 150 mucosal and cutaneous types to genotype blasted query sequences. HPV-QUEST processes up to 10 megabases of sequences within 1 to 2 minutes. Results are reported in html, text and excel formats and display e-value, blast score, and local and coverage identities; provide genus, species, type, infection site and risk for the best matched reference HPV sequence; and produce results ready for additional analyses.

  4. ON SOME RECURRENCE TYPE SMARANDACHE SEQUENCES

    OpenAIRE

    MAJUMDAR, A.A.K.; GUNARTO, H.

    2000-01-01

    In this paper, we study some properties of ten recurrence type Smarandache sequences, namely, the Smarandache odd, even, prime product, square product, higher-power product, permutation, consecutive, reverse, symmetric, and pierced chain sequences.

  5. Protecting genomic sequence anonymity with generalization lattices.

    Science.gov (United States)

    Malin, B A

    2005-01-01

    Current genomic privacy technologies assume the identity of genomic sequence data is protected if personal information, such as demographics, are obscured, removed, or encrypted. While demographic features can directly compromise an individual's identity, recent research demonstrates such protections are insufficient because sequence data itself is susceptible to re-identification. To counteract this problem, we introduce an algorithm for anonymizing a collection of person-specific DNA sequences. The technique is termed DNA lattice anonymization (DNALA), and is based upon the formal privacy protection schema of k -anonymity. Under this model, it is impossible to observe or learn features that distinguish one genetic sequence from k-1 other entries in a collection. To maximize information retained in protected sequences, we incorporate a concept generalization lattice to learn the distance between two residues in a single nucleotide region. The lattice provides the most similar generalized concept for two residues (e.g. adenine and guanine are both purines). The method is tested and evaluated with several publicly available human population datasets ranging in size from 30 to 400 sequences. Our findings imply the anonymization schema is feasible for the protection of sequences privacy. The DNALA method is the first computational disclosure control technique for general DNA sequences. Given the computational nature of the method, guarantees of anonymity can be formally proven. There is room for improvement and validation, though this research provides the groundwork from which future researchers can construct genomics anonymization schemas tailored to specific datasharing scenarios.

  6. Tracking TCRβ sequence clonotype expansions during antiviral therapy using high-throughput sequencing of the hypervariable region

    Directory of Open Access Journals (Sweden)

    Mark W Robinson

    2016-04-01

    Full Text Available To maintain a persistent infection viruses such as hepatitis C virus (HCV employ a range of mechanisms that subvert protective T cell responses. The suppression of antigen-specific T cell responses by HCV hinders efforts to profile T cell responses during chronic infection and antiviral therapy. Conventional methods of detecting antigen-specific T cells utilise either antigen stimulation (e.g. ELISpot, proliferation assays, cytokine production or antigen-loaded tetramer staining. This limits the ability to profile T cell responses during chronic infection due to suppressed effector function and the requirement for prior knowledge of antigenic viral peptide sequences. Recently high-throughput sequencing (HTS technologies have been developed for the analysis of T cell repertoires. In the present study we have assessed the feasibility of HTS of the TCRβ complementarity determining region (CDR3 to track T cell expansions in an antigen-independent manner. Using sequential blood samples from HCV-infected individuals undergoing anti-viral therapy we were able to measure the population frequencies of >35,000 TCRβ sequence clonotypes in each individual over the course of 12 weeks. TRBV/TRBJ gene segment usage varied markedly between individuals but remained relatively constant within individuals across the course of therapy. Despite this stable TRBV/TRBJ gene segment usage, a number of TCRβ sequence clonotypes showed dramatic changes in read frequency. These changes could not be linked to therapy outcomes in the present study however the TCRβ CDR3 sequences with the largest fold changes did include sequences with identical TRBV/TRBJ gene segment usage and high joining region homology to previously published CDR3 sequences from HCV-specific T cells targeting the HLA-B*0801-restricted 1395HSKKKCDEL1403 and HLA-A*0101–restricted 1435ATDALMTGY1443 epitopes. The pipeline developed in this proof of concept study provides a platform for the design of

  7. Sequencing genes in silico using single nucleotide polymorphisms

    Directory of Open Access Journals (Sweden)

    Zhang Xinyi

    2012-01-01

    Full Text Available Abstract Background The advent of high throughput sequencing technology has enabled the 1000 Genomes Project Pilot 3 to generate complete sequence data for more than 906 genes and 8,140 exons representing 697 subjects. The 1000 Genomes database provides a critical opportunity for further interpreting disease associations with single nucleotide polymorphisms (SNPs discovered from genetic association studies. Currently, direct sequencing of candidate genes or regions on a large number of subjects remains both cost- and time-prohibitive. Results To accelerate the translation from discovery to functional studies, we propose an in silico gene sequencing method (ISS, which predicts phased sequences of intragenic regions, using SNPs. The key underlying idea of our method is to infer diploid sequences (a pair of phased sequences/alleles at every functional locus utilizing the deep sequencing data from the 1000 Genomes Project and SNP data from the HapMap Project, and to build prediction models using flanking SNPs. Using this method, we have developed a database of prediction models for 611 known genes. Sequence prediction accuracy for these genes is 96.26% on average (ranges 79%-100%. This database of prediction models can be enhanced and scaled up to include new genes as the 1000 Genomes Project sequences additional genes on additional individuals. Applying our predictive model for the KCNJ11 gene to the Wellcome Trust Case Control Consortium (WTCCC Type 2 diabetes cohort, we demonstrate how the prediction of phased sequences inferred from GWAS SNP genotype data can be used to facilitate interpretation and identify a probable functional mechanism such as protein changes. Conclusions Prior to the general availability of routine sequencing of all subjects, the ISS method proposed here provides a time- and cost-effective approach to broadening the characterization of disease associated SNPs and regions, and facilitating the prioritization of candidate

  8. Statistical approaches to use a model organism for regulatory sequences annotation of newly sequenced species.

    Directory of Open Access Journals (Sweden)

    Pietro Liò

    Full Text Available A major goal of bioinformatics is the characterization of transcription factors and the transcriptional programs they regulate. Given the speed of genome sequencing, we would like to quickly annotate regulatory sequences in newly-sequenced genomes. In such cases, it would be helpful to predict sequence motifs by using experimental data from closely related model organism. Here we present a general algorithm that allow to identify transcription factor binding sites in one newly sequenced species by performing Bayesian regression on the annotated species. First we set the rationale of our method by applying it within the same species, then we extend it to use data available in closely related species. Finally, we generalise the method to handle the case when a certain number of experiments, from several species close to the species on which to make inference, are available. In order to show the performance of the method, we analyse three functionally related networks in the Ascomycota. Two gene network case studies are related to the G2/M phase of the Ascomycota cell cycle; the third is related to morphogenesis. We also compared the method with MatrixReduce and discuss other types of validation and tests. The first network is well known and provides a biological validation test of the method. The two cell cycle case studies, where the gene network size is conserved, demonstrate an effective utility in annotating new species sequences using all the available replicas from model species. The third case, where the gene network size varies among species, shows that the combination of information is less powerful but is still informative. Our methodology is quite general and could be extended to integrate other high-throughput data from model organisms.

  9. Genome Sequences of Oryza Species

    KAUST Repository

    Kumagai, Masahiko

    2018-02-14

    This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.

  10. Genome Sequences of Oryza Species

    KAUST Repository

    Kumagai, Masahiko; Tanaka, Tsuyoshi; Ohyanagi, Hajime; Hsing, Yue-Ie C.; Itoh, Takeshi

    2018-01-01

    This chapter summarizes recent data obtained from genome sequencing, annotation projects, and studies on the genome diversity of Oryza sativa and related Oryza species. O. sativa, commonly known as Asian rice, is the first monocot species whose complete genome sequence was deciphered based on physical mapping by an international collaborative effort. This genome, along with its accurate and comprehensive annotation, has become an indispensable foundation for crop genomics and breeding. With the development of innovative sequencing technologies, genomic studies of O. sativa have dramatically increased; in particular, a large number of cultivars and wild accessions have been sequenced and compared with the reference rice genome. Since de novo genome sequencing has become cost-effective, the genome of African cultivated rice, O. glaberrima, has also been determined. Comparative genomic studies have highlighted the independent domestication processes of different rice species, but it also turned out that Asian and African rice share a common gene set that has experienced similar artificial selection. An international project aimed at constructing reference genomes and examining the genome diversity of wild Oryza species is currently underway, and the genomes of some species are publicly available. This project provides a platform for investigations such as the evolution, development, polyploidization, and improvement of crops. Studies on the genomic diversity of Oryza species, including wild species, should provide new insights to solve the problem of growing food demands in the face of rapid climatic changes.

  11. A comparative evaluation of sequence classification programs

    Directory of Open Access Journals (Sweden)

    Bazinet Adam L

    2012-05-01

    Full Text Available Abstract Background A fundamental problem in modern genomics is to taxonomically or functionally classify DNA sequence fragments derived from environmental sampling (i.e., metagenomics. Several different methods have been proposed for doing this effectively and efficiently, and many have been implemented in software. In addition to varying their basic algorithmic approach to classification, some methods screen sequence reads for ’barcoding genes’ like 16S rRNA, or various types of protein-coding genes. Due to the sheer number and complexity of methods, it can be difficult for a researcher to choose one that is well-suited for a particular analysis. Results We divided the very large number of programs that have been released in recent years for solving the sequence classification problem into three main categories based on the general algorithm they use to compare a query sequence against a database of sequences. We also evaluated the performance of the leading programs in each category on data sets whose taxonomic and functional composition is known. Conclusions We found significant variability in classification accuracy, precision, and resource consumption of sequence classification programs when used to analyze various metagenomics data sets. However, we observe some general trends and patterns that will be useful to researchers who use sequence classification programs.

  12. Blazar Sequence in Fermi Era Liang Chen

    Indian Academy of Sciences (India)

    Abstract. In this paper, we review the latest research results on the topic of blazar sequence. It seems that the blazar sequence is phenomenally ruled out, while the theoretical blazar sequence still holds. We point out that black hole mass is a dominated parameter accounting for high-power- high-synchrotron-peaked and ...

  13. Synaptotagmin gene content of the sequenced genomes

    Directory of Open Access Journals (Sweden)

    Craxton Molly

    2004-07-01

    Full Text Available Abstract Background Synaptotagmins exist as a large gene family in mammals. There is much interest in the function of certain family members which act crucially in the regulated synaptic vesicle exocytosis required for efficient neurotransmission. Knowledge of the functions of other family members is relatively poor and the presence of Synaptotagmin genes in plants indicates a role for the family as a whole which is wider than neurotransmission. Identification of the Synaptotagmin genes within completely sequenced genomes can provide the entire Synaptotagmin gene complement of each sequenced organism. Defining the detailed structures of all the Synaptotagmin genes and their encoded products can provide a useful resource for functional studies and a deeper understanding of the evolution of the gene family. The current rapid increase in the number of sequenced genomes from different branches of the tree of life, together with the public deposition of evolutionarily diverse transcript sequences make such studies worthwhile. Results I have compiled a detailed list of the Synaptotagmin genes of Caenorhabditis, Anopheles, Drosophila, Ciona, Danio, Fugu, Mus, Homo, Arabidopsis and Oryza by examining genomic and transcript sequences from public sequence databases together with some transcript sequences obtained by cDNA library screening and RT-PCR. I have compared all of the genes and investigated the relationship between plant Synaptotagmins and their non-Synaptotagmin counterparts. Conclusions I have identified and compared 98 Synaptotagmin genes from 10 sequenced genomes. Detailed comparison of transcript sequences reveals abundant and complex variation in Synaptotagmin gene expression and indicates the presence of Synaptotagmin genes in all animals and land plants. Amino acid sequence comparisons indicate patterns of conservation and diversity in function. Phylogenetic analysis shows the origin of Synaptotagmins in multicellular eukaryotes and their

  14. Unified Deep Learning Architecture for Modeling Biology Sequence.

    Science.gov (United States)

    Wu, Hongjie; Cao, Chengyuan; Xia, Xiaoyan; Lu, Qiang

    2017-10-09

    Prediction of the spatial structure or function of biological macromolecules based on their sequence remains an important challenge in bioinformatics. When modeling biological sequences using traditional sequencing models, characteristics, such as long-range interactions between basic units, the complicated and variable output of labeled structures, and the variable length of biological sequences, usually lead to different solutions on a case-by-case basis. This study proposed the use of bidirectional recurrent neural networks based on long short-term memory or a gated recurrent unit to capture long-range interactions by designing the optional reshape operator to adapt to the diversity of the output labels and implementing a training algorithm to support the training of sequence models capable of processing variable-length sequences. Additionally, the merge and pooling operators enhanced the ability to capture short-range interactions between basic units of biological sequences. The proposed deep-learning model and its training algorithm might be capable of solving currently known biological sequence-modeling problems through the use of a unified framework. We validated our model on one of the most difficult biological sequence-modeling problems currently known, with our results indicating the ability of the model to obtain predictions of protein residue interactions that exceeded the accuracy of current popular approaches by 10% based on multiple benchmarks.

  15. Safety Assessment of Advanced Imaging Sequences I: Measurements

    DEFF Research Database (Denmark)

    Jensen, Jørgen Arendt; Rasmussen, Morten Fischer; Pihl, Michael Johannes

    2016-01-01

    intensity measurement program. The approach can measure and store data for a full imaging sequence in 3.8 to 8.2 s per spatial position. Based on Ispta, MI, and probe surface temperature, the method gives the ability to determine whether a sequence is within US FDA limits, or alternatively indicate how......A method for rapid measurement of intensities (Ispta), mechanical index (MI), and probe surface temperature for any ultrasound scanning sequence is presented. It uses the scanner’s sampling capability to give an accurate measurement of the whole imaging sequence for all emissions to yield the true...... measurement system (Onda Corporation, Sunnyvale, CA, USA). Four different sequences have been measured: a fixed focus emission, a duplex sequence containing B-mode and flow emissions, a vector flow sequence with B-mode and flow emissions in 17 directions, and finally a synthetic aperture (SA) duplex flow...

  16. New Sequences with Low Correlation and Large Family Size

    Science.gov (United States)

    Zeng, Fanxin

    In direct-sequence code-division multiple-access (DS-CDMA) communication systems and direct-sequence ultra wideband (DS-UWB) radios, sequences with low correlation and large family size are important for reducing multiple access interference (MAI) and accepting more active users, respectively. In this paper, a new collection of families of sequences of length pn-1, which includes three constructions, is proposed. The maximum number of cyclically distinct families without GMW sequences in each construction is φ(pn-1)/n·φ(pm-1)/m, where p is a prime number, n is an even number, and n=2m, and these sequences can be binary or polyphase depending upon choice of the parameter p. In Construction I, there are pn distinct sequences within each family and the new sequences have at most d+2 nontrivial periodic correlation {-pm-1, -1, pm-1, 2pm-1,…,dpm-1}. In Construction II, the new sequences have large family size p2n and possibly take the nontrivial correlation values in {-pm-1, -1, pm-1, 2pm-1,…,(3d-4)pm-1}. In Construction III, the new sequences possess the largest family size p(d-1)n and have at most 2d correlation levels {-pm-1, -1,pm-1, 2pm-1,…,(2d-2)pm-1}. Three constructions are near-optimal with respect to the Welch bound because the values of their Welch-Ratios are moderate, WR_??_d, WR_??_3d-4 and WR_??_2d-2, respectively. Each family in Constructions I, II and III contains a GMW sequence. In addition, Helleseth sequences and Niho sequences are special cases in Constructions I and III, and their restriction conditions to the integers m and n, pm≠2 (mod 3) and n≅0 (mod 4), respectively, are removed in our sequences. Our sequences in Construction III include the sequences with Niho type decimation 3·2m-2, too. Finally, some open questions are pointed out and an example that illustrates the performance of these sequences is given.

  17. SequenceCEROSENE: a computational method and web server to visualize spatial residue neighborhoods at the sequence level.

    Science.gov (United States)

    Heinke, Florian; Bittrich, Sebastian; Kaiser, Florian; Labudde, Dirk

    2016-01-01

    To understand the molecular function of biopolymers, studying their structural characteristics is of central importance. Graphics programs are often utilized to conceive these properties, but with the increasing number of available structures in databases or structure models produced by automated modeling frameworks this process requires assistance from tools that allow automated structure visualization. In this paper a web server and its underlying method for generating graphical sequence representations of molecular structures is presented. The method, called SequenceCEROSENE (color encoding of residues obtained by spatial neighborhood embedding), retrieves the sequence of each amino acid or nucleotide chain in a given structure and produces a color coding for each residue based on three-dimensional structure information. From this, color-highlighted sequences are obtained, where residue coloring represent three-dimensional residue locations in the structure. This color encoding thus provides a one-dimensional representation, from which spatial interactions, proximity and relations between residues or entire chains can be deduced quickly and solely from color similarity. Furthermore, additional heteroatoms and chemical compounds bound to the structure, like ligands or coenzymes, are processed and reported as well. To provide free access to SequenceCEROSENE, a web server has been implemented that allows generating color codings for structures deposited in the Protein Data Bank or structure models uploaded by the user. Besides retrieving visualizations in popular graphic formats, underlying raw data can be downloaded as well. In addition, the server provides user interactivity with generated visualizations and the three-dimensional structure in question. Color encoded sequences generated by SequenceCEROSENE can aid to quickly perceive the general characteristics of a structure of interest (or entire sets of complexes), thus supporting the researcher in the initial

  18. Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

    Directory of Open Access Journals (Sweden)

    Yunyun Liang

    2015-01-01

    Full Text Available Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM. Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS, segmented PsePSSM, and segmented autocovariance transformation (ACT based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640 are adopted in this paper. Then a 700-dimensional (700D feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA. To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences.

  19. Whole Genome Sequencing of Enterovirus species C Isolates by High-throughput Sequencing: Development of Generic Primers

    Directory of Open Access Journals (Sweden)

    Maël Bessaud

    2016-08-01

    Full Text Available Enteroviruses are among the most common viruses infecting humans and can cause diverse clinical syndromes ranging from minor febrile illness to severe and potentially fatal diseases. Enterovirus species C (EV-C consists of more than 20 types, among which the 3 serotypes of polioviruses, the etiological agents of poliomyelitis, are included. Biodiversity and evolution of EV-C genomes are shaped by frequent recombination events. Therefore, identification and characterization of circulating EV-C strains require the sequencing of different genomic regions.A simple method was developed to sequence quickly the entire genome of EV-C isolates. Four overlapping fragments were produced separately by RT-PCR performed with generic primers. The four amplicons were then pooled and purified prior to be sequenced by high-throughput technique.The method was assessed on a panel of EV-Cs belonging to a wide-range of types. It can be used to determine full-length genome sequences through de novo assembly of thousands of reads. It was also able to discriminate reads from closely related viruses in mixtures.By decreasing the workload compared to classical Sanger-based techniques, this method will serve as a precious tool for sequencing large panels of EV-Cs isolated in cell cultures during environmental surveillance or from patients, including vaccine-derived polioviruses.

  20. The fission yeast heterochromatin protein Rik1 is required for telomere clustering during meiosis

    DEFF Research Database (Denmark)

    Tuzon, Creighton T; Borgstrøm, Britta; Weilguny, Dietmar

    2004-01-01

    Telomeres share the ability to silence nearby transcription with heterochromatin, but the requirement of heterochromatin proteins for most telomere functions is unknown. The fission yeast Rik1 protein is required for heterochromatin formation at centromeres and the mating-type locus, as it recrui...... meiosis. However, Rik1 is dispensable for the protective roles of telomeres in preventing chromosome end-fusion. Thus, a Swi6-independent heterochromatin function distinct from that at centromeres and the mating-type locus operates at telomeres during sexual differentiation....