WorldWideScience

Sample records for tandem repeat sequence

  1. Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

    1997-12-01

    Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 × 10^4, 6 × 10^3 and 1.5 × 10^6 copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
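    As a rough worked example of what these figures imply, the short Python sketch below multiplies the quoted unit lengths by the quoted copy numbers. It assumes every copy is a full-length unit, which the abstract does not state; the helper name `total_span_nt` is hypothetical, and the telomeric repeats are left out because their unit length is not given in the record.

```python
# Values quoted in the abstract: repeat-unit length (nt) and copy number
# per haploid genome. Assumption: every copy is a full-length unit.
REPEATS = {
    "NP3R": {"unit_nt": 165, "copies": 84_000},  # 8.4 x 10^4 copies
    "NP4R": {"unit_nt": 180, "copies": 6_000},   # 6 x 10^3 copies
}


def total_span_nt(unit_nt, copies):
    """Return the total number of nucleotides occupied by a repeat family."""
    return unit_nt * copies


for name, rec in REPEATS.items():
    span = total_span_nt(rec["unit_nt"], rec["copies"])
    print(f"{name}: about {span / 1e6:.2f} Mb per haploid genome")
```

    Under that assumption, NP3R would occupy roughly an order of magnitude more DNA than NP4R.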

  2. Tandemly repeated sequence in 5' end of mtDNA control region of ...

    African Journals Online (AJOL)

    Extensive length variability was observed in 5' end sequence of the mitochondrial DNA control region of the Japanese Spanish mackerel (Scomberomorus niphonius). This length variability was due to the presence of varying numbers of a 56-bp tandemly repeated sequence and a 46-bp insertion/deletion (indel).

  3. In situ detection of tandem DNA repeat length

    Energy Technology Data Exchange (ETDEWEB)

    Yaar, R.; Szafranski, P.; Cantor, C.R.; Smith, C.L. [Boston Univ., MA (United States)]

    1996-11-01

    A simple method for scoring short tandem DNA repeats is presented. An oligonucleotide target, containing tandem repeats embedded in a unique sequence, was hybridized to a set of complementary probes, containing tandem repeats of known lengths. Single-stranded loop structures formed on duplexes containing a mismatched (different) number of tandem repeats. No loop structure formed on duplexes containing a matched (identical) number of tandem repeats. The matched and mismatched loop structures were enzymatically distinguished and differentially labeled by treatment with S1 nuclease and the Klenow fragment of DNA polymerase. 7 refs., 4 figs.

  4. Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats

    OpenAIRE

    Gymrek, Melissa

    2016-01-01

    This was presented as a BitesizeBio Webinar entitled "Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats". Accompanying scripts can be accessed on GitHub: https://github.com/mgymrek/mgymrek-bitesizebio-webinar

  5. Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    Science.gov (United States)

    Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

    2015-05-01

    Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
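    The flank-based idea named in this record can be illustrated with a minimal Python sketch. It is not the STR-FM pipeline: the function name `count_repeats_between_flanks`, the exact-match flank search, and the toy sequences are all assumptions made for illustration, and no error correction is attempted.

```python
def count_repeats_between_flanks(read, left_flank, right_flank, motif):
    """Return the number of motif copies between two flanks, or None.

    Hypothetical helper: a real pipeline would place the flanks with a
    read mapper and tolerate mismatches; here both flanks must match
    exactly and the repeat tract must be a perfect run of the motif.
    """
    start = read.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)
    end = read.find(right_flank, start)
    if end == -1:
        return None
    tract = read[start:end]
    copies, remainder = divmod(len(tract), len(motif))
    # Reject tracts that are interrupted or not a whole number of units.
    if remainder != 0 or tract != motif * copies:
        return None
    return copies


# Toy read: seven CAG units between two made-up unique flanks.
read = "ACGTTGCA" + "CAG" * 7 + "TTGACCGT"
print(count_repeats_between_flanks(read, "ACGTTGCA", "TTGACCGT", "CAG"))
```

    Anchoring on unique flanks rather than on the repeat itself means only reads that span the whole tract are counted, which is also why the required read length grows with the length of the STR.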

  6. APE1 incision activity at abasic sites in tandem repeat sequences.

    Science.gov (United States)

    Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

    2014-05-29

    Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.

  7. Use of short tandem repeat sequences to study Mycobacterium leprae in leprosy patients in Malawi and India.

    Directory of Open Access Journals (Sweden)

    Saroj K Young

    2008-04-01

    Inadequate understanding of the transmission of Mycobacterium leprae makes it difficult to predict the impact of leprosy control interventions. Genotypic tests that allow tracking of individual bacterial strains would strengthen epidemiological studies and contribute to our understanding of the disease. Genotyping assays based on variation in the copy number of short tandem repeat sequences were applied to biopsies collected in population-based epidemiological studies of leprosy in northern Malawi, and from members of multi-case households in Hyderabad, India. In the Malawi series, considerable genotypic variability was observed between patients, and also within patients, when isolates were collected at different times or from different tissues. Less within-patient variability was observed when isolates were collected from similar tissues at the same time. Less genotypic variability was noted amongst the closely related Indian patients than in the Malawi series. Lineages of M. leprae undergo changes in their pattern of short tandem repeat sequences over time. Genetic divergence is particularly likely between bacilli inhabiting different (e.g., skin and nerve) tissues. Such variability makes short tandem repeat sequences unsuitable as a general tool for population-based strain typing of M. leprae, or for distinguishing relapse from reinfection. Careful use of these markers may provide insights into the development of disease within individuals and for tracking of short transmission chains.

  8. A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

    Science.gov (United States)

    Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.

  9. TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

    Science.gov (United States)

    Richard, François D; Kajava, Andrey V

    2014-06-01

    The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. Optimization of sequence alignment for simple sequence repeat regions

    Directory of Open Access Journals (Sweden)

    Ogbonnaya Francis C

    2011-07-01

    Background: Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSRs have been used as molecular markers because they are easy to detect and are used in a range of applications, including genetic diversity, genome mapping, and marker-assisted selection. They are also very mutable because of slipping of the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDEL) mutation frequency to a high ratio, more than other types of molecular markers such as single nucleotide polymorphisms (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units, which result in false evolutionary relationships. Findings: To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using a PERL script with a Tk graphical interface. This program is based on aligning sequences after first determining the repeated units and the positions of the last SSR nucleotides. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenetic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different lineages was observed after applying the new algorithm. Conclusions: The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different lineages. It reduces overlapping during SSR alignment, which results in a more realistic

  11. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

    Science.gov (United States)

    Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

    2012-11-07

    Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: (I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. (II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences cluster according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40

  12. Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species

    DEFF Research Database (Denmark)

    Larsen, Svend Arild; Mogensen, Line; Dietz, Rune

    2005-01-01

    repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp...

  13. Generating markers based on biotic stress of protein systemin and tandem repeats sequence for Aquilaria sp

    International Nuclear Information System (INIS)

    Azhar Mohamad; Muhammad Hanif Azhari N; Siti Norhayati Ismail

    2014-01-01

    Aquilaria sp. belongs to the Thymelaeaceae family and is well distributed in the Asia region. The species has multipurpose uses from root to shoot and is an economically important crop, which generates wide interest in understanding the genetic diversity of the species. Knowledge of DNA-based markers has become a prerequisite for more effective application of molecular marker techniques in breeding and mapping programs. In this work, both targeted genes and tandem repeat sequences were used for DNA fingerprinting in Aquilaria sp. A total of 100 ISSR (inter simple sequence repeat) primers and 50 combination pairs of specific primers derived from a conserved region of a specific protein known as systemin were optimized. Thirty-eight ISSR primers, generated from both specific and degenerate ISSR primers, were found suitable for polymorphism evaluation. One combination of systemin primers showed significant results in distinguishing the Aquilaria sp. In conclusion, polymorphism derived from ISSR profiling and targeted stress genes of the protein systemin proved to be a powerful approach for identification and molecular classification of Aquilaria sp., which will be useful for diversification in identifying any mutant lines derived from nature. (author)

  14. Genome-wide analysis of tandem repeats in plants and green algae

    Science.gov (United States)

    Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

    2014-01-01

    Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...

  15. Multiple-locus variable-number tandem repeat analysis of Neisseria meningitidis yields groupings similar to those obtained by multilocus sequence typing.

    NARCIS (Netherlands)

    Schouls, Leo M; Ende, Arie van der; Damen, Marjolein; Pol, Ingrid van de

    2006-01-01

    We identified many variable-number tandem repeat (VNTR) loci in the genomes of Neisseria meningitidis serogroups A, B, and C and utilized a number of these loci to develop a multiple-locus variable-number tandem repeat analysis (MLVA). Eighty-five N. meningitidis serogroup B and C isolates obtained

  16. Identification and Characterization of Tandem Repeats in Exon III of Dopamine Receptor D4 (DRD4) Genes from Different Mammalian Species

    DEFF Research Database (Denmark)

    Larsen, S. A.; Mogensen, L.; Dietz, R.

    2005-01-01

    composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem...

  17. Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species

    DEFF Research Database (Denmark)

    Larsen, Svend Arild; Mogensen, Line; Dietz, Rune

    2005-01-01

    composed of 15- and 12- bp modules. Tandem repeats composed of 18-bp modules were found in sequences from the horse, zebra, onager, and donkey, Asiatic bear, polar bear, common raccoon, dolphin, harbor porpoise, and domestic cat. Several of these sequences have been analyzed previously without a tandem...

  18. Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.

    Science.gov (United States)

    Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T

    1997-12-01

    The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.

  19. Inter- and intra-strain variability of tandem repeats in Mycoplasma pneumoniae based on next-generation sequencing data.

    Science.gov (United States)

    Zhang, Jing; Song, Xiaohong; Ma, Marella J; Xiao, Li; Kenri, Tsuyoshi; Sun, Hongmei; Ptacek, Travis; Li, Shaoli; Waites, Ken B; Atkinson, T Prescott; Shibayama, Keigo; Dybvig, Kevin; Feng, Yanmei

    2017-02-01

    To characterize inter- and intra-strain variability of variable-number tandem repeats (VNTRs) in Mycoplasma pneumoniae to determine the optimal multilocus VNTR analysis scheme for improved strain typing. Whole genome assemblies and next-generation sequencing data from diverse M. pneumoniae isolates were used to characterize VNTRs and their variability, and to compare the strain discriminability of new VNTR and existing markers. We identified 13 VNTRs including five reported previously. These VNTRs displayed different levels of inter- and intra-strain copy number variations. All new markers showed similar or higher discriminability compared with existing VNTR markers and the P1 typing system. Our study provides novel insights into VNTR variations and potential new multilocus VNTR analysis schemes for improved genotyping of M. pneumoniae.

  20. Analysis of genetic polymorphism of nine short tandem repeat loci in ...

    African Journals Online (AJOL)

    Yomi

    2012-03-15

    Mar 15, 2012 ... Key words: short tandem repeat, repeat motif, genetic polymorphism, Han population, forensic genetics. INTRODUCTION. Short tandem repeat (STR) is widely .... Data analysis. The exact test of Hardy-Weinberg equilibrium was conducted with. Arlequin version 3.5 software (Computational and Molecular.

  1. Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

    Energy Technology Data Exchange (ETDEWEB)

    Jackson, P.J.; Walthers, E.A.; Richmond, K.L. [Los Alamos National Lab., NM (United States)] [and others]

    1997-04-01

    PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five polymorphisms differed by the presence of two to six copies of the 12-bp tandem repeat 5'-CAATATCAACAA-3'. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats are generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.
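    The statement that length variation leaves the reading frame intact can be checked with a small Python sketch. It assumes the quoted 12-bp unit begins on a codon boundary (consistent with the QYQQ peptide given in the abstract, but not stated there); the codon table is deliberately limited to the two codons that occur in the unit, and the `translate` helper is illustrative only.

```python
# Minimal codon table covering only the codons present in this repeat unit
# (standard genetic code: CAA = Gln/Q, TAT = Tyr/Y).
CODON_TO_AA = {"CAA": "Q", "TAT": "Y"}

REPEAT_UNIT = "CAATATCAACAA"  # 12-bp unit quoted in the abstract


def translate(dna):
    """Translate an in-frame DNA string using the small table above."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


for copies in range(2, 7):  # the five alleles carry two to six copies
    tract = REPEAT_UNIT * copies
    # Each unit is 12 bp, a multiple of 3, so adding or removing whole
    # units never shifts the reading frame.
    print(copies, len(tract), translate(tract))
```

    Each additional unit adds exactly four residues (QYQQ) to the encoded tract, so the alleles differ in protein length but not in frame.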

  2. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    Directory of Open Access Journals (Sweden)

    Glass John I

    2010-07-01

    Background: Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results: Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the

  3. Human β satellite DNA: Genomic organization and sequence definition of a class of highly repetitive tandem DNA

    International Nuclear Information System (INIS)

    Waye, J.S.; Willard, H.F.

    1989-01-01

    The authors describe a class of human repetitive DNA, called β satellite, that, at a most fundamental level, exists as tandem arrays of diverged ∼68-base-pair monomer repeat units. The monomer units are organized as distinct subsets, each characterized by a multimeric higher-order repeat unit that is tandemly reiterated and represents a recent unit of amplification. They have cloned, characterized, and determined the sequence of two β satellite higher-order repeat units: one located on chromosome 9, the other on the acrocentric chromosomes (13, 14, 15, 21, and 22) and perhaps other sites in the genome. Analysis by pulsed-field gel electrophoresis reveals that these tandem arrays are localized in large domains that are marked by restriction fragment length polymorphisms. In total, β-satellite sequences comprise several million base pairs of DNA in the human genome. Analysis of this DNA family should permit insights into the nature of chromosome-specific and nonspecific modes of satellite DNA evolution and provide useful tools for probing the molecular organization and concerted evolution of the acrocentric chromosomes

  4. 5meCpG epigenetic marks neighboring a primate-conserved core promoter short tandem repeat indicate X-chromosome inactivation.

    Science.gov (United States)

    Machado, Filipe Brum; Machado, Fabricio Brum; Faria, Milena Amendro; Lovatel, Viviane Lamim; Alves da Silva, Antonio Francisco; Radic, Claudia Pamela; De Brasi, Carlos Daniel; Rios, Álvaro Fabricio Lopes; de Sousa Lopes, Susana Marina Chuva; da Silveira, Leonardo Serafim; Ruiz-Miranda, Carlos Ramon; Ramos, Ester Silveira; Medina-Acosta, Enrique

    2014-01-01

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic

  5. 5meCpG epigenetic marks neighboring a primate-conserved core promoter short tandem repeat indicate X-chromosome inactivation.

    Directory of Open Access Journals (Sweden)

    Filipe Brum Machado

    X-chromosome inactivation (XCI) is the epigenetic transcriptional silencing of an X-chromosome during the early stages of embryonic development in female eutherian mammals. XCI assures monoallelic expression in each cell and compensation for dosage-sensitive X-linked genes between females (XX) and males (XY). DNA methylation at the carbon-5 position of the cytosine pyrimidine ring in the context of a CpG dinucleotide sequence (5meCpG) in promoter regions is a key epigenetic marker for transcriptional gene silencing. Using computational analysis, we revealed an extragenic tandem GAAA repeat 230-bp from the landmark CpG island of the human X-linked retinitis pigmentosa 2 RP2 promoter whose 5meCpG status correlates with XCI. We used this RP2 onshore tandem GAAA repeat to develop an allele-specific 5meCpG-based PCR assay that is highly concordant with the human androgen receptor (AR) exonic tandem CAG repeat-based standard HUMARA assay in discriminating active (Xa) from inactive (Xi) X-chromosomes. The RP2 onshore tandem GAAA repeat contains neutral features that are lacking in the AR disease-linked tandem CAG repeat, is highly polymorphic (heterozygosity rates approximately 0.8) and shows minimal variation in the Xa/Xi ratio. The combined informativeness of RP2/AR is approximately 0.97, and this assay excels at determining the 5meCpG status of alleles at the Xp (RP2) and Xq (AR) chromosome arms in a single reaction. These findings are relevant and directly translatable to nonhuman primate models of XCI in which the AR CAG-repeat is monomorphic. We conducted the RP2 onshore tandem GAAA repeat assay in the naturally occurring chimeric New World monkey marmoset (Callitrichidae) and found it to be informative. The RP2 onshore tandem GAAA repeat will facilitate studies on the variable phenotypic expression of dominant and recessive X-linked diseases, epigenetic changes in twins, the physiology of aging hematopoiesis, the pathogenesis of age-related hematopoietic

  6. Brucella 'HOOF-Prints': strain typing by multi-locus analysis of variable number tandem repeats (VNTRs)

    Directory of Open Access Journals (Sweden)

    Halling Shirley M

    2003-07-01

    Background: Currently, there are very few tools available for subtyping Brucella isolates for epidemiological trace-back. Subtyping is difficult because of the genetic homogeneity within the genus. Sequencing of the genomes from three Brucella species has facilitated the search for DNA sequence variability. Recently, hypervariability among short tandem repeat sequences has been exploited for strain-typing of several bacterial pathogens. Results: An eight-base pair tandem repeat sequence was discovered in nine genomic loci of the B. abortus genome. Eight loci were hypervariable among the three Brucella species. A PCR-based method was developed to identify the number of repeat units (alleles) at each locus, generating strain-specific fingerprints. None of the loci exhibited species- or biovar-specific alleles. Sometimes, a species or biovar contained a specific allele at one or more loci, but the allele also occurred in other species or biovars. The technique successfully differentiated the type strains for all Brucella species and biovars, among unrelated B. abortus biovar 1 field isolates in cattle, and among B. abortus strains isolated from bison and elk. Isolates from the same herd or from short-term in vitro passage exhibited little or no variability in fingerprint pattern. Sometimes, isolates from an animal would have multiple alleles at a locus, possibly from mixed infections in enzootic areas, residual disease from incomplete depopulation of an infected herd or molecular evolution within the strain. Therefore, a mixed population or a pool of colonies from each animal and/or tissue was tested. Conclusion: This paper describes a new method for fingerprinting Brucella isolates based on multi-locus characterization of a variable number, eight-base pair, tandem repeat. We have named this technique "HOOF-Prints" for Hypervariable Octameric Oligonucleotide Finger-Prints. The technique is highly discriminatory among Brucella species, among
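
    As an illustration of how a PCR product size can be turned into a repeat-unit count and then into a multi-locus fingerprint, the following is a minimal sketch. It assumes the amplicon consists of a fixed flanking length plus a whole number of 8-bp units; the locus names and flank sizes are hypothetical and are not taken from the paper.

```python
# Sketch only: locus names and flank sizes below are hypothetical.
UNIT_BP = 8  # the abstract describes an eight-base pair repeat


def repeat_units(amplicon_bp, flank_bp, unit_bp=UNIT_BP):
    """Return the number of whole repeat units in a PCR product."""
    repeat_region = amplicon_bp - flank_bp
    if repeat_region < 0 or repeat_region % unit_bp != 0:
        raise ValueError("amplicon size is not flank + whole repeat units")
    return repeat_region // unit_bp


def fingerprint(amplicons, flanks):
    """Build an ordered tuple of allele calls, one per locus (sorted by name)."""
    return tuple(repeat_units(amplicons[locus], flanks[locus])
                 for locus in sorted(amplicons))


# Hypothetical measurements for two loci of one isolate.
flanks = {"locus1": 100, "locus2": 100}
amplicons = {"locus1": 140, "locus2": 116}
print(fingerprint(amplicons, flanks))
```

    Two isolates can then be compared locus by locus by comparing their tuples.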

  7. Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme

    Science.gov (United States)

    Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

    2011-01-01

    Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956
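
    The abstract describes an MLVA profile as an ordered string of allele numbers, one per VNTR locus. A minimal sketch of how two independently generated profiles might be compared follows; the hyphen-separated text format and the example allele numbers are assumptions made for illustration, not the paper's data format.

```python
# Sketch only: the "7-3-5-..." text format and the values are hypothetical.

def parse_profile(text):
    """Parse a hyphen-separated MLVA profile into a tuple of allele numbers."""
    return tuple(int(field) for field in text.split("-"))


def differing_loci(profile_a, profile_b):
    """Count loci at which two profiles carry different allele numbers."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same number of loci")
    return sum(a != b for a, b in zip(profile_a, profile_b))


# Hypothetical 8-locus profiles for the same isolate typed at two centers,
# plus a second, unrelated isolate.
center_1 = parse_profile("7-3-5-2-9-4-6-1")
center_2 = parse_profile("7-3-5-2-9-4-6-1")
other = parse_profile("7-3-6-2-9-4-8-1")

print(differing_loci(center_1, center_2))  # concordant profiles differ at 0 loci
print(differing_loci(center_1, other))
```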

  8. TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

    Science.gov (United States)

    Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

    2010-06-15

    Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.

  9. Evaluation of tandem repeats for MLVA typing of Streptococcus uberis isolated from bovine mastitis

    Directory of Open Access Journals (Sweden)

    Lamoureux Jérémy

    2006-11-01

    Background: Streptococcus uberis is a common cause of bovine mastitis and recommended control measures, based on improved milking practice, teat dipping and antibiotic treatment at drying-off, are poorly efficient against this environmental pathogen. A simple and efficient typing method would be helpful in identifying S. uberis sources, virulent strains and cow to cow transmission. The potential of MLVA (Multiple Loci VNTR Analysis; VNTR, Variable Number of Tandem Repeats) for S. uberis mastitis isolates genotyping was investigated. Results: The genomic sequence of Streptococcus uberis (strain 0104J) was analyzed for potential variable number tandem repeats (VNTRs). Twenty-five tandem repeats were identified and amplified by PCR with DNA samples from 24 S. uberis strains. A set of seven TRs were found to be polymorphic and used for MLVA typing of 88 S. uberis isolates. A total of 82 MLVA types were obtained with 22 types among 26 strains isolated from the milk of mastitic cows belonging to our experimental herd, and 61 types for 62 epidemiologically unrelated strains, i.e. collected in different herds and areas. Conclusion: The MLVA method can be applied to S. uberis genotyping and constitutes an interesting complement to existing typing methods. This method, which is easy to perform, low cost and can be used in routine, could facilitate investigations of the epidemiology of S. uberis mastitis in dairy cows.

  10. Tandemly repeated sequence in 5'end of mtDNA control region of ...

    African Journals Online (AJOL)

    2008-12-17

    Dec 17, 2008 ... chain reaction (PCR). Japanese Spanish ... mainly covered general ecology and fishery biology. No study concerning the ... Conserved sequence blocks and the repeat units are indicated by boxes. performed using the exact ...

  11. Ten tandem repeats of β-hCG 109-118 enhance immunogenicity and anti-tumor effects of β-hCG C-terminal peptide carried by mycobacterial heat-shock protein HSP65

    International Nuclear Information System (INIS)

    Zhang Yankai; Yan Rong; He Yi; Liu Wentao; Cao Rongyue; Yan Ming; Li Taiming; Liu Jingjing; Wu Jie

    2006-01-01

    The β-subunit of human chorionic gonadotropin (β-hCG) is secreted by many kinds of tumors and it has been used as an ideal target antigen to develop vaccines against tumors. In view of the low immunogenicity of this self-peptide, we designed a method based on isocaudamer technique to repeat tandemly the 10-residue sequence X of β-hCG (109-118), then 10 tandemly repeated copies of the 10-residue sequence combined with β-hCG C-terminal 37 peptides were fused to mycobacterial heat-shock protein 65 to construct a fusion protein HSP65-X10-βhCGCTP37 as an immunogen. In this study, we examined the effect of the tandem repeats of this 10-residue sequence in eliciting an immune response by comparing the immunogenicity and anti-tumor effects of the two immunogens, HSP65-X10-βhCGCTP37 and HSP65-βhCGCTP37 (without the 10 tandem repeats). Immunization of mice with the fusion protein HSP65-X10-βhCGCTP37 elicited much higher levels of specific anti-β-hCG antibodies and more effectively inhibited the growth of Lewis lung carcinoma (LLC) in vivo than with HSP65-βhCGCTP37, which suggests that HSP65-X10-βhCGCTP37 may be an effective protein vaccine for the treatment of β-hCG-dependent tumors and that multiple tandem repeats of a certain epitope are an efficient method to overcome the low immunogenicity of self-peptide antigens.

  12. PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes.

    Science.gov (United States)

    Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A

    2011-01-01

    PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variation have been well studied in some bacteria, many other prokaryotic species have yet to be investigated for SSR-mediated adaptive and other evolutionary advantages. As a part of our ongoing studies on SSR polymorphism in prokaryotes, we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes, extracted a number of SSRs showing length variations, and created a relational database called PSSRdb. This database gives useful information such as the location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
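
    The definition of an SSR given above (tandem repeats of a 1-6 bp motif) can be sketched with a regular expression. This is an illustrative scan only, not the extraction pipeline used to build PSSRdb; the function name, the minimum copy number and the example sequences are assumptions.

```python
import re


def find_ssrs(sequence, min_copies=3):
    """Return (start, motif, copies) for perfect tandem repeats of 1-6 bp motifs.

    The lazy quantifier makes the regex try the shortest motif first, so a run
    such as AAAAAA is reported with motif A rather than AA or AAA.
    """
    pattern = re.compile(r"([ACGT]{1,6}?)\1{%d,}" % (min_copies - 1))
    hits = []
    for match in pattern.finditer(sequence.upper()):
        motif = match.group(1)
        copies = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, copies))
    return hits


# Hypothetical example sequences.
print(find_ssrs("TTGACACACACAGG"))
print(find_ssrs("GAAAGAAAGAAAGAAA"))
```

    Comparing the copy number reported at the same locus across strains would then flag length-polymorphic SSRs; only perfect repeats are detected by this sketch.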

  13. Y-Chromosome short tandem repeat, typing technology, locus ...

    African Journals Online (AJOL)

    Aghomotsegin

    2015-07-08

    Jul 8, 2015 ... Y-Chromosome short tandem repeat, typing technology, locus information and allele frequency in different population: A review. Muhanned Abdulhasan Kareem1, Ameera Omran Hussein2 and Imad Hadi Hameed2*. 1Babylon University, Centre of Environmental Research, Hilla City, Iraq. 2Department of ...

  14. X-Chromosome short tandem repeat, advantages and typing ...

    African Journals Online (AJOL)

    Microsatellites of the X-chromosome have been increasingly studied in recent years as a useful tool in forensic analysis. This review describes some details of X-chromosomal short tandem repeat (STR) analysis. Among them are: microsatellites, amplification using polymerase chain reaction (PCR) of STRs, PCR product ...

  15. Sequence-specific DNA alkylation by tandem Py-Im polyamide conjugates.

    Science.gov (United States)

    Taylor, Rhys Dylan; Kawamoto, Yusuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

    2014-09-01

    Tandem N-methylpyrrole-N-methylimidazole (Py-Im) polyamides with good sequence-specific DNA-alkylating activities have been designed and synthesized. Three alkylating tandem Py-Im polyamides with different linkers, which each contained the same moiety for the recognition of a 10 bp DNA sequence, were evaluated for their reactivity and selectivity by DNA alkylation, using high-resolution denaturing gel electrophoresis. All three conjugates displayed high reactivities for the target sequence. In particular, polyamide 1, which contained a β-alanine linker, displayed the most-selective sequence-specific alkylation towards the target 10 bp DNA sequence. The tandem Py-Im polyamide conjugates displayed greater sequence-specific DNA alkylation than conventional hairpin Py-Im polyamide conjugates (4 and 5). For further research, the design of tandem Py-Im polyamide conjugates could play an important role in targeting specific gene sequences. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Novel expressed sequence tag- simple sequence repeats (EST ...

    African Journals Online (AJOL)

    Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...

  17. Characterization of the major formamidopyrimidine-DNA glycosylase homolog in Mycobacterium tuberculosis and its linkage to variable tandem repeats.

    Science.gov (United States)

    Olsen, Ingrid; Balasingham, Seetha V; Davidsen, Tonje; Debebe, Ephrem; Rødland, Einar A; van Soolingen, Dick; Kremer, Kristin; Alseth, Ingrun; Tønjum, Tone

    2009-07-01

    The ability to repair DNA damage is likely to play an important role in the survival of facultative intracellular parasites because they are exposed to high levels of reactive oxygen species and nitrogen intermediates inside phagocytes. Correcting oxidative damage in purines and pyrimidines is the primary function of the enzymes formamidopyrimidine (faPy)-DNA glycosylase (Fpg) and endonuclease VIII (Nei) of the base excision repair pathway, respectively. Four gene homologs, belonging to the fpg/nei family, have been identified in Mycobacterium tuberculosis H37Rv. The recombinant protein encoded by M. tuberculosis Rv2924c, termed Mtb-Fpg1, was overexpressed, purified and biochemically characterized. The enzyme removed faPy and 5-hydroxycytosine lesions, as well as 8-oxo-7,8-dihydroguanine (8oxoG) opposite to C, T and G. Mtb-Fpg1 thus exhibited substrate specificities typical for Fpg enzymes. Although Mtb-fpg1 showed nearly complete nucleotide sequence conservation in 32 M. tuberculosis isolates, the region upstream of Mtb-fpg1 in these strains contained tandem repeat motifs of variable length. A relationship between repeat length and Mtb-fpg1 expression level was demonstrated in M. tuberculosis strains, indicating that an increased length of the tandem repeats positively influenced the expression levels of Mtb-fpg1. This is the first example of such a tandem repeat region of variable length being linked to the expression level of a bacterial gene.

  18. Potentials and limitations of histone repeat sequences for phylogenetic reconstruction of Sophophora.

    Science.gov (United States)

    Baldo, A M; Les, D H; Strausbaugh, L D

    1999-11-01

    Simplified DNA sequence acquisition has provided many new data sets that are useful for phylogenetic reconstruction, including single- and multiple-copy nuclear and organellar genes. Although transcribed regions receive much attention, nontranscribed regions have recently been added to the repertoire of sequences suitable for phylogenetic studies, especially for closely related taxa. We evaluated the efficacy of a small portion of the histone repeat for phylogenetic reconstruction among Drosophila species. Histone repeats in invertebrates offer distinct advantages similar to those of widely used ribosomal repeats. First, the units are tandemly repeated and undergo concerted evolution. Second, histone repeats include both highly conserved coding and variable intergenic regions. This composition facilitates application of "universal" primers spanning potentially informative sites. We examined a small region of the histone repeat, including the intergenic spacer segments of coding regions from the divergently transcribed H2A and H2B histone genes. The spacer (about 230 bp) exists as a mosaic with highly conserved functional motifs interspersed with rapidly diverging regions; the former aid in alignment of the spacer. There are no ambiguities in alignment of coding regions. Coding and noncoding regions were analyzed together and separately for phylogenetic information. Parsimony, distance, and maximum-likelihood methods successfully retrieve the corroborated phylogeny for the taxa examined. This study demonstrates the resolving power of a small histone region which may now be added to the growing collection of phylogenetically useful DNA sequences.

  19. Repeated DNA sequences in fungi

    Energy Technology Data Exchange (ETDEWEB)

    Dutta, S K

    1974-11-01

    Several fungal species, representatives of all broad groups like basidiomycetes, ascomycetes and phycomycetes, were examined for the nature of repeated DNA sequences by DNA:DNA reassociation studies using hydroxyapatite chromatography. All of the fungal species tested contained 10 to 20 percent repeated DNA sequences. There are approximately 100 to 110 copies of repeated DNA sequences of approximately 4 x 10^7 daltons piece size of each. Repeated DNA sequence homoduplexes showed on average 5 °C difference of T_e50 (temperature at which 50 percent duplexes dissociate) values from the corresponding homoduplexes of unfractionated whole DNA. It is suggested that a part of repetitive sequences in fungi constitutes mitochondrial DNA and a part of it constitutes nuclear DNA. (auth)

  20. New polymorphisms within the variable number tandem repeat (VNTR) 7 locus of Mycobacterium avium subsp. paratuberculosis.

    Science.gov (United States)

    Fawzy, Ahmad; Zschöck, Michael; Ewers, Christa; Eisenberg, Tobias

    2016-06-01

    Variable number tandem repeat (VNTR) is a frequently employed typing method of Mycobacterium avium paratuberculosis (MAP) isolates. Based on whole genome sequencing in a previous study, allelic diversity at some VNTR loci seems to over- or under-estimate the actual phylogenetic variance among isolates. Interestingly, two closely related isolates on one farm showed polymorphism at the VNTR 7 locus, raising concerns about the misleading role that it might play in genotyping. We aimed to investigate the underlying basis of VNTR 7-polymorphism by analyzing sequence data for published genomes and field isolates of MAP and other M. avium complex (MAC) members. In contrast to MAP strains from cattle, strains from sheep displayed an "imperfect" repeat within VNTR 7, which was identical to respective allele types in other MAC genomes. Subspecies- and strain-specific single nucleotide polymorphisms (SNPs) and two novel (16 and 56 bp) repeats were detected. Given the combination of the three existing repeats, there are at least five different patterns for VNTR 7. The present findings highlight a higher polymorphism and probable instability of VNTR 7 locus that needs to be considered and challenged in future studies. Until then, sequencing of this locus in future studies is important to correctly assign the underlying allele types. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

    Energy Technology Data Exchange (ETDEWEB)

    Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

    1995-01-20

    The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination (θ = 0.02, Z_max = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.

  2. The DUB/USP17 deubiquitinating enzymes: A gene family within a tandemly repeated sequence, is also embedded within the copy number variable Beta-defensin cluster

    Directory of Open Access Journals (Sweden)

    Scott Christopher J

    2010-04-01

    Background: The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A). Subsequently we have identified a number of human family members and shown that one of these (DUB-3) is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1. Results: Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable beta-defensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating. Conclusions: The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.

  3. Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping

    Directory of Open Access Journals (Sweden)

    Harvey Steven P

    2007-03-01

    Background: The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. Results: B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominately located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B. mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 to 0.94. Mutation rates for these loci are comparable (>10^-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. Conclusion: The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were
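
    The abstract reports Nei's diversity values per VNTR locus without saying which estimator was used. A minimal sketch follows, assuming the common form D = 1 - Σ p_i², where p_i is the frequency of allele i at the locus, with an optional n/(n-1) small-sample correction; the allele calls in the example are hypothetical.

```python
from collections import Counter


def nei_diversity(alleles, unbiased=False):
    """Nei's diversity for one locus: 1 - sum of squared allele frequencies.

    With unbiased=True the value is scaled by n / (n - 1). Which estimator the
    paper used is not stated in the abstract, so both are offered here.
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two allele calls")
    counts = Counter(alleles)
    diversity = 1.0 - sum((c / n) ** 2 for c in counts.values())
    if unbiased:
        diversity *= n / (n - 1)
    return diversity


# Hypothetical repeat counts observed at one locus in four isolates.
calls = [7, 7, 8, 9]
print(nei_diversity(calls))                  # 1 - (0.5^2 + 0.25^2 + 0.25^2)
print(nei_diversity(calls, unbiased=True))
```

    A locus where every isolate carries the same allele scores 0, and the value approaches 1 as alleles become more numerous and more evenly distributed.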

  4. Development and validation of a single-tube multiple-locus variable number tandem repeat analysis for Klebsiella pneumoniae.

    Directory of Open Access Journals (Sweden)

    Antoinette A T P Brink

    Genotyping of Klebsiella pneumoniae is indispensable for management of nosocomial infections, monitoring of emerging strains, including extended-spectrum beta-lactamase (ESBL) producers, and general epidemiology. Such objectives require a high-resolution genotyping method with a fixed scheme that allows (1) long-term retrospective and prospective assessment, (2) objective result readout and (3) library storage for database development and exchangeable results. We have developed a multiple-locus variable number tandem repeat analysis (MLVA) using a single-tube fluorescently primed multiplex PCR for 8 Variable Number Tandem Repeats (VNTRs) and automated fragment size analysis. The type allocation scheme was optimized using 224 K. pneumoniae clinical isolates, which yielded 101 MLVA types. The method was compared to the gold standard multilocus sequence typing (MLST) using a subset of these clinical isolates (n = 95) and found to be highly concordant, with at least as high a resolution but with considerably less hands-on time. Our results position this MLVA scheme as an appropriate, high-throughput and relatively low-cost tool for K. pneumoniae epidemiology.

  5. Evaluation and selection of tandem repeat loci for a Brucella MLVA typing assay

    Directory of Open Access Journals (Sweden)

    Denoeud France

    2006-02-01

    Background: The classification of Brucella into species and biovars relies on phenotypic characteristics and sometimes raises difficulties in the interpretation of the results due to an absence of standardization of the typing reagents. In addition, the resolution of this biotyping is moderate and requires the manipulation of the living agent. More efficient DNA-based methods are needed, and this work explores the suitability of multiple locus variable number tandem repeats analysis (MLVA) for both typing and species identification. Results: Eighty tandem repeat loci predicted to be polymorphic by genome sequence analysis of three available Brucella genome sequences were tested for polymorphism by genotyping 21 Brucella strains (18 reference strains representing the six 'classical' species and all biovars as well as 3 marine mammal strains currently recognized as members of two new species). The MLVA data efficiently cluster the strains as expected according to their species and biovar. For practical use, a subset of 15 loci preserving this clustering was selected and applied to the typing of 236 isolates. Using this MLVA-15 assay, the clusters generated correspond to the classical biotyping scheme of Brucella spp. The 15 markers have been divided into two groups, one comprising 8 user-friendly minisatellite markers with a good species identification capability (panel 1) and another complementary group of 7 microsatellite markers with higher discriminatory power (panel 2). Conclusion: The MLVA-15 assay can be applied to large collections of Brucella strains with automated or manual procedures, and can be proposed as a complement, or even a substitute, of classical biotyping methods. This is facilitated by the fact that MLVA is based on non-infectious material (DNA) whereas the biotyping procedure itself requires the manipulation of the living agent. The data produced can be queried on a dedicated MLVA web service site.

  6. Analysis of an "off-ladder" allele at the Penta D short tandem repeat locus.

    Science.gov (United States)

    Yang, Y L; Wang, J G; Wang, D X; Zhang, W Y; Liu, X J; Cao, J; Yang, S L

    2015-11-25

    Kinship testing of a father and his son from Guangxi, China, the location of the Zhuang minority people, was performed using the PowerPlex® 18D System with a short tandem repeat typing kit. The results indicated that both the father and his son had an off-ladder allele at the Penta D locus, with a genetic size larger than that of the maximal standard allelic ladder. To further identify this locus, monogenic amplification, gene cloning, and genetic sequencing were performed. Sequencing analysis demonstrated that the fragment size of the Penta D-OL locus was 469 bp and the core sequence was [AAAGA]21, also called Penta D-21. The rare Penta D-21 allele was found to be distributed among the Zhuang population from the Guangxi Zhuang Autonomous Region of China; therefore, this study improved the range of DNA data available for this locus and enhanced our ability for individual identification of gene loci.

  7. Enhanced antibody-dependent cellular phagocytosis by chimeric monoclonal antibodies with tandemly repeated Fc domains.

    Science.gov (United States)

    Nagashima, Hiroaki; Ootsubo, Michiko; Fukazawa, Mizuki; Motoi, Sotaro; Konakahara, Shu; Masuho, Yasuhiko

    2011-04-01

    We previously reported that chimeric monoclonal antibodies (mAbs) with tandemly repeated Fc domains, which were developed by introducing tandem repeats of Fc domains downstream of 2 Fab domains, augmented binding avidities for all Fcγ receptors, resulting in enhanced antibody (Ab)-dependent cellular cytotoxicity (ADCC). Here we investigated Ab-dependent cellular phagocytosis (ADCP) mediated by these chimeric mAbs, which is considered one of the most important mechanisms for killing tumor cells, using two-color flow cytometric methods. ADCP mediated by T3-Ab, a chimeric mAb with 3 tandemly repeated Fc domains, was 5 times more potent than that by native anti-CD20 M-Ab (M-Ab hereafter). Furthermore, T3-Ab-mediated ADCP was resistant to competitive inhibition by intravenous Ig (IVIG), although M-Ab-mediated ADCP decreased in the presence of IVIG. An Fcγ receptor-blocking study demonstrated that T3-Ab mediated ADCP via both FcγRIA and FcγRIIA, whereas M-Ab mediated ADCP exclusively via FcγRIA. These results suggest that chimeric mAbs with tandemly repeated Fc domains enhance ADCP as well as ADCC, and that Fc multimerization may significantly enhance the efficacy of therapeutic Abs. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  8. MSDB: A Comprehensive Database of Simple Sequence Repeats.

    Science.gov (United States)

    Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar

    2017-06-01

    Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
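
    The definition above (perfect tandem repeats of 1-6 nt motifs) can be illustrated with a small regular-expression scan. This is only a sketch under stated assumptions, not the method MSDB uses: the function name, the 12-nt minimum tract length, and the rule of reporting only the shortest motif are all choices made here for illustration.

```python
import re

def find_ssrs(seq, min_len=12, max_motif=6):
    """Return sorted (start, motif, copies) tuples for perfect repeats of 1-6 nt motifs."""
    seq = seq.upper()
    hits = []
    for k in range(1, max_motif + 1):
        # Copies needed to reach min_len nucleotides (ceiling division), at least 2.
        min_copies = max(2, -(-min_len // k))
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_copies - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are repeats of a shorter unit, e.g. "AA" or "ATAT",
            # so each tract is reported once under its shortest motif.
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)
```

    On the made-up input `"GGC" + "AT" * 7 + "CCG" + "A" * 12 + "T"`, the scan reports a 7-copy AT tract at position 3 and a 12-copy A tract at position 20 (0-based).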

  9. Genetic Analysis of Eight X-Chromosomal Short Tandem Repeat ...

    African Journals Online (AJOL)

    X-Chromosome short tandem repeat (STR) typing can complement existing DNA profiling protocols and can also offer useful information in cases of complex kinship analysis. This is the first population study of 8 X-linked STRs in Iraq. The purpose of this work was to provide a basic data of allele and haplotype frequency for ...

  10. Changes in Variable Number of Tandem Repeats in 'Candidatus Liberibacter asiaticus' through Insect Transmission.

    Directory of Open Access Journals (Sweden)

    Hiroshi Katoh

    Citrus greening (huanglongbing) is the most destructive citrus disease worldwide. The disease is associated with three species of 'Candidatus Liberibacter' among which 'Ca. Liberibacter asiaticus' has the widest distribution. 'Ca. L. asiaticus' is commonly transmitted by a phloem-feeding insect vector, the Asian citrus psyllid Diaphorina citri. A previous study showed that isolates of 'Ca. L. asiaticus' were clearly differentiated by variable number of tandem repeat (VNTR) profiles at four loci in the genome. In this study, the VNTR analysis was further validated by assessing the stability of these repeats after multiplication of the pathogen upon host-to-host transmission using a 'Ca. L. asiaticus' strain from Japan. The results showed that some tandem repeats showed detectable changes after insect transmission. To our knowledge, this is the first report to demonstrate that the repeat numbers VNTR 002 and 077 of 'Ca. L. asiaticus' change through psyllid transmission. VNTRs in the recipient plant were apparently unrelated to the growing phase of the vector. In contrast, changes in the number of tandem repeats increased with longer acquisition and inoculation access periods, whereas changes were not observed through psyllid transmission after relatively short acquisition and inoculation access periods, up to 20 and 19 days, respectively.

  11. The proliferation marker pKi-67 becomes masked to MIB-1 staining after expression of its tandem repeats.

    Science.gov (United States)

    Schmidt, Mirko H H; Broll, Rainer; Bruch, Hans-Peter; Duchrow, Michael

    2002-11-01

    The Ki-67 antigen, pKi-67, is one of the most commonly used markers of proliferating cells. The protein can only be detected in dividing cells (G(1)-, S-, G(2)-, and M-phase) but not in quiescent cells (G(0)). The standard antibody to detect pKi-67 is MIB-1, which detects the so-called 'Ki-67 motif' FKELF in 9 of the protein's 16 tandem repeats. To investigate the function of these repeats we expressed three of them in an inducible gene expression system in HeLa cells. Surprisingly, addition of a nuclear localization sequence led to a complete absence of signal in the nuclei of MIB-1-stained cells. At the same time antibodies directed against different epitopes of pKi-67 did not fail to detect the protein. We conclude that the overexpression of the 'Ki-67 motif', which is present in the repeats, can lead to inability of MIB-1 to detect its antigen as demonstrated in adenocarcinoma tissue samples. Thereafter, in order to prevent the underestimation of Ki-67 proliferation indices in MIB-1-labeled preparations, additional antibodies (for example, MIB-21) should be used. Additionally, we could show in a mammalian two-hybrid assay that recombinant pKi-67 repeats are capable of self-associating with endogenous pKi-67. Speculating that the tandem repeats are intimately involved in its protein-protein interactions, this offers new insights in how access to these repeats is regulated by pKi-67 itself.

  12. A Predominant Variable-Number Tandem-Repeat Cluster of Mycobacterium tuberculosis Isolates among Asylum Seekers in the Netherlands and Denmark, Deciphered by Whole-Genome Sequencing.

    NARCIS (Netherlands)

    Jajou, Rana; de Neeling, Albert; Rasmussen, Erik Michael; Norman, Anders; Mulder, Arnout; van Hunen, Rianne; de Vries, Gerard; Haddad, Walid; Anthony, Richard; Lillebaek, Troels; van der Hoek, Wim; van Soolingen, Dick

    In many countries, Mycobacterium tuberculosis isolates are routinely subjected to variable-number tandem-repeat (VNTR) typing to investigate M. tuberculosis transmission. Unexpectedly, cross-border clusters were identified among African refugees in the Netherlands and Denmark, although transmission in

  13. X-Chromosomal short tandem repeat loci in the Turkish population ...

    African Journals Online (AJOL)

    In this study, we aimed to demonstrate the importance and utility of polymorphic short tandem repeat (STR) found on the human X chromosome and to provide the first allelic frequency data of X-STR (X chromosomal) loci in the Turkish population. Blood samples were taken from unrelated individuals (135 males and 129 ...

  14. Toward Male Individualization with Rapidly Mutating Y-Chromosomal Short Tandem Repeats

    NARCIS (Netherlands)

    K. Ballantyne (Kaye); A. Ralf (Arwin); R. Aboukhalid (Rachid); N.M. Achakzai (Niaz); T. Anjos (Tania); Q. Ayub (Qasim); J. Balažic (Jože); J. Ballantyne (Jack); D.J. Ballard (David); B. Berger (Burkhard); C. Bobillo (Cecilia); M. Bouabdellah (Mehdi); H. Burri (Helen); T. Capal (Tomas); S. Caratti (Stefano); J. Cárdenas (Jorge); F. Cartault (François); E.F. Carvalho (Elizeu); M. de Carvalho (Margarete); B. Cheng (Baowen); M.D. Coble (Michael); D. Comas (David); D. Corach (Daniel); M. D'Amato (Mauro); S. Davison (Sean); P. de Knijff (Peter); M.C.A. de Ungria (Maria Corazon); R. Decorte (Ronny); T. Dobosz (Tadeusz); B.M. Dupuy (Berit); S. Elmrghni (Samir); M. Gliwiński (Mateusz); S.C. Gomes (Sara); L. Grol (Laurens); C. Haas (Cordula); E. Hanson (Erin); J. Henke (Jürgen); L. Henke (Lotte); F. Herrera-Rodríguez (Fabiola); C.R. Hill (Carolyn); G. Holmlund (Gunilla); K. Honda (Katsuya); U.-D. Immel (Uta-Dorothee); S. Inokuchi (Shota); R. Jobling; M. Kaddura (Mahmoud); J.S. Kim (Jong); S.H. Kim (Soon); W. Kim (Wook); T.E. King (Turi); E. Klausriegler (Eva); D. Kling (Daniel); L. Kovačević (Lejla); L. Kovatsi (Leda); P. Krajewski (Paweł); S. Kravchenko (Sergey); M.H.D. Larmuseau (Maarten); E.Y. Lee (Eun Young); R. Lessig (Rüdiger); L.A. Livshits (Ludmila); D. Marjanović (Damir); M. Minarik (Marek); N. Mizuno (Natsuko); H. Moreira (Helena); N. Morling (Niels); M. Mukherjee (Meeta); P. Munier (Patrick); J. Nagaraju (Javaregowda); F. Neuhuber (Franz); S. Nie (Shengjie); P. Nilasitsataporn (Premlaphat); T. Nishi (Takeki); H.H. Oh (Hye); S. Olofsson (Sylvia); V. Onofri (Valerio); J. Palo (Jukka); H. Pamjav (Horolma); W. Parson (Walther); M. Petlach (Michal); C. Phillips (Christopher); R. Ploski (Rafal); S.P.R. Prasad (Samayamantri P.); D. Primorac (Dragan); G.A. Purnomo (Gludhug); J. Purps (Josephine); H. Rangel-Villalobos (Hector); K. Rębała (Krzysztof); B. Rerkamnuaychoke (Budsaba); D.R. Gonzalez (Danel Rey); C. Robino (Carlo); L. Roewer (Lutz); A. de Rosa (Anna); A. Sajantila (Antti); A. Sala (Andrea); J.M. Salvador (Jazelyn); P. Sanz (Paula); C. Schmitt (Christian); A.K. Sharma (Anisha K.); D.A. Silva (Dayse); K.-J. Shin (Kyoung-Jin); T. Sijen (Titia); M. Sirker (Miriam); D. Siváková (Daniela); V. Škaro (Vedrana); C. Solano-Matamoros (Carlos); L. Souto (L.); V. Stenzl (Vlastimil); H. Sudoyo (Herawati); D. Syndercombe-Court (Denise); A. Tagliabracci (Adriano); D. Taylor (Duncan); A. Tillmar (Andreas); I.S. Tsybovsky (Iosif); C. Tyler-Smith (Chris); K. van der Gaag (Kristiaan); D. Vanek (Daniel); A. Völgyi (Antónia); D. Ward (Denise); P. Willemse (Patricia); E.P.H. Yap (Eric); Z-Y. Yong (Ze-Yie); I.Z. Pajnič (Irena Zupanič); M.H. Kayser (Manfred)

    2014-01-01

    Relevant for various areas of human genetics, Y-chromosomal short tandem repeats (Y-STRs) are commonly used for testing close paternal relationships among individuals and populations, and for male lineage identification. However, even the widely used 17-loci Yfiler set cannot resolve

  15. Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.

    Science.gov (United States)

    Fungtammasan, Arkarachai; Tomaszkiewicz, Marta; Campos-Sánchez, Rebeca; Eckert, Kristin A; DeGiorgio, Michael; Makova, Kateryna D

    2016-10-01

    Transcript variation has important implications for organismal function in health and disease. Most transcriptome studies focus on assessing variation in gene expression levels and isoform representation. Variation at the level of transcript sequence is caused by RNA editing and transcription errors, and leads to nongenetically encoded transcript variants, or RNA-DNA differences (RDDs). Such variation has been understudied, in part because its detection is obscured by reverse transcription (RT) and sequencing errors. It has only been evaluated for intertranscript base substitution differences. Here, we investigated transcript sequence variation for short tandem repeats (STRs). We developed the first maximum-likelihood estimator (MLE) to infer RT error and RDD rates, taking next generation sequencing error rates into account. Using the MLE, we empirically evaluated RT error and RDD rates for STRs in a large-scale DNA and RNA replicated sequencing experiment conducted in a primate species. The RT error rates increased exponentially with STR length and were biased toward expansions. The RDD rates were approximately 1 order of magnitude lower than the RT error rates. The RT error rates estimated with the MLE from a primate data set were concordant with those estimated with an independent method, barcoded RNA sequencing, from a Caenorhabditis elegans data set. Our results have important implications for medical genomics, as STR allelic variation is associated with >40 diseases. STR nonallelic transcript variation can also contribute to disease phenotype. The MLE and empirical rates presented here can be used to evaluate the probability of disease-associated transcripts arising due to RDD. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  16. Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

    Science.gov (United States)

    Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

    2018-07-18

    The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (Pgenome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for screening polymorphic STRs. Our results also lay a foundation for future genetic variation studies of macaques.

  17. Short tandem repeat analysis in Japanese population.

    Science.gov (United States)

    Hashiyada, M

    2000-01-01

    Short tandem repeats (STRs), known as microsatellites, are one of the most informative genetic markers for characterizing biological materials. Because of the relatively small size of STR alleles (generally 100-350 nucleotides), amplification by polymerase chain reaction (PCR) is relatively easy, affording a high sensitivity of detection. In addition, STR loci can be amplified simultaneously in a multiplex PCR. Thus, substantial information can be obtained in a single analysis with the benefits of using less template DNA, reducing labor, and reducing contamination. We investigated 14 STR loci in a Japanese population living in Sendai using three multiplex PCR kits: GenePrint PowerPlex 1.1 and 2.2 Fluorescent STR Systems (Promega, Madison, WI, USA) and AmpFlSTR Profiler (Perkin-Elmer, Norwalk, CT, USA). Genomic DNA was extracted using sodium dodecyl sulfate (SDS) proteinase K or Chelex 100 treatment followed by phenol/chloroform extraction. PCR was performed according to the manufacturer's protocols. Electrophoresis was carried out on an ABI 377 sequencer and the alleles were determined by GeneScan 2.0.2 software (Perkin-Elmer). For the 14 STR loci, statistical parameters indicated a relatively high rate, and no significant deviation from Hardy-Weinberg equilibrium was detected. We apply this STR system to paternity testing and forensic casework, e.g., personal identification in rape cases. This system is an effective tool in the forensic sciences to obtain information on individual identification.

  18. Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter pittii and Development of an Optimized Multiple-Locus VNTR Analysis Typing Scheme.

    Science.gov (United States)

    Hu, Yuan; Li, Bo Qing; Jin, Da Zhi; He, Li Hua; Tao, Xiao Xia; Zhang, Jian Zhong

    2015-12-01

    To develop a multiple-locus variable-number tandem-repeat (VNTR) analysis (MLVA) assay for Acinetobacter pittii typing. Polymorphic VNTRs were searched by Tandem Repeats Finder. The distribution and polymorphism of each VNTR locus were analyzed in all the A. pittii genomes deposited in the NCBI genome database by BLAST and were evaluated with a collection of 20 well-characterized clinical A. pittii strains and one reference strain. The MLVA assay was compared with pulsed-field gel electrophoresis (PFGE) for discriminating A. pittii isolates. Ten VNTR loci were identified upon bioinformatic screening of A. pittii genomes, but only five of them showed full amplifiability and good polymorphism. Therefore, an MLVA assay composed of five VNTR loci was developed. The typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. Compared with PFGE, the new optimized MLVA typing scheme provided the same and even greater discrimination. Compared with PFGE, MLVA typing is a faster and more standardized alternative for studying the genetic relatedness of A. pittii isolates in disease surveillance and outbreak investigation. Copyright © 2015 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.

  19. A novel multiple locus variable number of tandem repeat (VNTR) analysis (MLVA) method for Propionibacterium acnes.

    Science.gov (United States)

    Hauck, Yolande; Soler, Charles; Gérôme, Patrick; Vong, Rithy; Macnab, Christine; Appere, Géraldine; Vergnaud, Gilles; Pourcel, Christine

    2015-07-01

    Propionibacterium acnes plays a central role in the pathogenesis of acne and is responsible for severe opportunistic infections. Numerous typing schemes have been developed that allow the identification of phylotypes, but they are often insufficient to differentiate subtypes. To better understand the genetic diversity of this species and to perform epidemiological analyses, high throughput discriminant genotyping techniques are needed. Here we describe the development of a multiple locus variable number of tandem repeats (VNTR) analysis (MLVA) method. Thirteen VNTRs were identified in the genome of P. acnes and were used to genotype a collection of clinical isolates. In addition, publicly available sequencing data for 102 genomes were analyzed in silico, providing an MLVA genotype. The clustering of MLVA data was in perfect congruence with whole genome based clustering. Analysis of the clustered regularly interspaced short palindromic repeat (CRISPR) element uncovered new spacers, a supplementary source of genotypic information. The present MLVA13 scheme and associated internet database represent a first line genotyping assay to investigate large numbers of isolates. Particular strains may then be submitted to full genome sequencing in order to better analyze their pathogenic potential. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. Comparative study of IS6110 restriction fragment length polymorphism and variable-number tandem-repeat typing of Mycobacterium tuberculosis isolates in the Netherlands, based on a 5-year nationwide survey

    NARCIS (Netherlands)

    de Beer, Jessica L.; van Ingen, Jakko; de Vries, Gerard; Erkens, Connie; Sebek, Maruschka; Mulder, Arnout; Sloot, Rosa; van den Brandt, Anne-Marie; Enaimi, Mimount; Kremer, Kristin; Supply, Philip; van Soolingen, Dick

    2013-01-01

    In order to switch from IS6110 and polymorphic GC-rich repetitive sequence (PGRS) restriction fragment length polymorphism (RFLP) to 24-locus variable-number tandem-repeat (VNTR) typing of Mycobacterium tuberculosis complex isolates in the national tuberculosis control program in The Netherlands, a

  1. Comparative Study of IS6110 Restriction Fragment Length Polymorphism and Variable-Number Tandem-Repeat Typing of Mycobacterium tuberculosis Isolates in the Netherlands, Based on a 5-Year Nationwide Survey

    NARCIS (Netherlands)

    Beer, J.L. de; Ingen, J. van; Vries, G. de; Erkens, C.; Sebek, M.; Mulder, A.; Sloot, R.; Brandt, A.M. van den; Enaimi, M.; Kremer, K.; Supply, P.; Soolingen, D. van

    2013-01-01

    In order to switch from IS6110 and polymorphic GC-rich repetitive sequence (PGRS) restriction fragment length polymorphism (RFLP) to 24-locus variable-number tandem-repeat (VNTR) typing of Mycobacterium tuberculosis complex isolates in the national tuberculosis control program in The Netherlands, a

  2. The Pentapeptide Repeat Proteins

    OpenAIRE

    Vetting, Matthew W.; Hegde, Subray S.; Fajardo, J. Eduardo; Fiser, Andras; Roderick, Steven L.; Takiff, Howard E.; Blanchard, John S.

    2006-01-01

    The Pentapeptide Repeat Protein (PRP) family has over 500 members in the prokaryotic and eukaryotic kingdoms. These proteins are composed of, or contain domains composed of, tandemly repeated amino acid sequences with a consensus sequence of [S,T,A,V][D,N][L,F]-[S,T,R][G]. The biochemical function of the vast majority of PRP family members is unknown. The three-dimensional structure of the first member of the PRP family was determined for the fluoroquinolone resistance protein (MfpA) from Myc...
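
    The consensus quoted above maps directly onto a regular expression, which makes it easy to look for tandem runs of matching pentapeptides. The sketch below assumes the hyphen in the printed consensus is only a separator; the function name and the test sequence in the usage note are hypothetical and do not come from the record.

```python
import re

# Consensus from the abstract written as a regular expression.
# Assumption: the hyphen in the printed consensus is only a separator.
PENTAPEPTIDE = re.compile(r"[STAV][DN][LF][STR]G")

def longest_tandem_run(protein):
    """Return (start, copies) of the longest run of back-to-back consensus pentapeptides."""
    best = (0, 0)
    for start in range(len(protein)):
        copies = 0
        pos = start
        # Extend the run five residues at a time while each window fits the consensus.
        while PENTAPEPTIDE.fullmatch(protein, pos, pos + 5):
            copies += 1
            pos += 5
        if copies > best[1]:
            best = (start, copies)
    return best
```

    With the made-up sequence `"MKADLSGTNFRGSDLTGPPP"`, the function finds three consecutive pentapeptides starting at 0-based position 2.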

  3. GENETIC DIVERSITY OF TYPHA LATIFOLIA (TYPHACEAE) AND THE IMPACT OF POLLUTANTS EXAMINED WITH TANDEM-REPETITIVE DNA PROBES

    Science.gov (United States)

    Genetic diversity at variable-number-tandem-repeat (VNTR) loci was examined in the common cattail, Typha latifolia (Typhaceae), using three synthetic DNA probes composed of tandemly repeated "core" sequences (GACA, GATA, and GCAC). The principal objectives of this investigation w...

  4. Tandem Reaction of Cationic Copolymerization and Concertedly Induced Hetero-Diels-Alder Reaction Preparing Sequence-Regulated Polymers.

    Science.gov (United States)

    Matsumoto, Suzuka; Kanazawa, Arihiro; Kanaoka, Shokyoku; Aoshima, Sadahito

    2017-06-14

    A unique tandem reaction of sequence-controlled cationic copolymerization and site-specific hetero-Diels-Alder (DA) reaction is demonstrated. In the controlled cationic copolymerization of furfural and 2-acetoxyethyl vinyl ether (AcOVE), only the furan ring adjacent to the propagating carbocation underwent the hetero-DA reaction with the aldehyde moiety of another furfural molecule. A further and equally important feature of the copolymerization is that the obtained copolymers had unprecedented 2:(1 + 1)-type alternating structures of repeating sequences of two VE and one furfural units in the main chain and one furfural unit in the side chain. The specific DA reaction is attributed to the delocalization of the positive charge to the side furan ring.

  5. Identification and Mapping of Simple Sequence Repeat Markers from Common Bean (Phaseolus vulgaris L.) Bacterial Artificial Chromosome End Sequences for Genome Characterization and Genetic–Physical Map Integration

    Directory of Open Access Journals (Sweden)

    Juana M. Córdoba

    2010-11-01

    Microsatellite markers or simple sequence repeat (SSR) loci are useful for diversity characterization and genetic–physical mapping. Different in silico microsatellite search methods have been developed for mining bacterial artificial chromosome (BAC) end sequences for SSRs. The overall goal of this study was genome characterization based on SSRs in 89,017 BAC end sequences (BESs) from the G19833 common bean (Phaseolus vulgaris L.) library. Another objective was to identify new SSRs taking into account three tandem motif identification programs (Automated Microsatellite Marker Development [AMMD], Tandem Repeats Finder [TRF], and SSRLocator [SSRL]). Among the microsatellite search engines, SSRL identified the highest number of SSRs; however, when primer design was attempted, the number dropped due to poor primer design regions. Automated Microsatellite Marker Development software identified many SSRs with valuable AT/TA or AG/TC motifs, while TRF found fewer SSRs and produced no primers. A subgroup of 323 AT-rich, di-, and trinucleotide SSRs were selected from the AMMD results and used in a parental survey with DOR364 and G19833, of which 75 could be mapped in the corresponding population; these represented 4052 BAC clones. Together with 92 previously mapped BES- and 114 non-BES-derived markers, a total of 280 SSRs were included in the polymerase chain reaction (PCR)-based map, integrating a total of 8232 BAC clones in 162 contigs from the physical map.

  6. RUNX2 tandem repeats and the evolution of facial length in placental mammals

    Directory of Open Access Journals (Sweden)

    Pointer Marie A

    2012-06-01

    Background: When simple sequence repeats are integrated into functional genes, they can potentially act as evolutionary ‘tuning knobs’, supplying abundant genetic variation with minimal risk of pleiotropic deleterious effects. The genetic basis of variation in facial shape and length represents a possible example of this phenomenon. Runt-related transcription factor 2 (RUNX2), which is involved in osteoblast differentiation, contains a functionally-important tandem repeat of glutamine and alanine amino acids. The ratio of glutamines to alanines (the QA ratio) in this protein seemingly influences the regulation of bone development. Notably, in domestic breeds of dog, and in carnivorans in general, the ratio of glutamines to alanines is strongly correlated with facial length. Results: In this study we examine whether this correlation holds true across placental mammals, particularly those mammals for which facial length is highly variable and related to adaptive behavior and lifestyle (e.g., primates, afrotherians, xenarthrans). We obtained relative facial length measurements and RUNX2 sequences for 41 mammalian species representing 12 orders. Using both a phylogenetic generalized least squares model and a recently-developed Bayesian comparative method, we tested for a correlation between genetic and morphometric data while controlling for phylogeny, evolutionary rates, and divergence times. Non-carnivoran taxa generally had substantially lower glutamine-alanine ratios than carnivorans (primates and xenarthrans with means of 1.34 and 1.25, respectively, compared to a mean of 3.1 for carnivorans), and we found no correlation between RUNX2 sequence and face length across placental mammals. Conclusions: Results of our diverse comparative phylogenetic analyses indicate that QA ratio does not consistently correlate with face length across the 41 mammalian taxa considered. Thus, although RUNX2 might function as a ‘tuning knob’ modifying face

  7. Structure, organization, and sequence of alpha satellite DNA from human chromosome 17: evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome.

    Science.gov (United States)

    Waye, J S; Willard, H F

    1986-09-01

    The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.

  8. A multi locus variable number of tandem repeat analysis (MLVA) scheme for Streptococcus agalactiae genotyping

    Directory of Open Access Journals (Sweden)

    Mereghetti Laurent

    2011-07-01

    Background: Multilocus sequence typing (MLST) is currently the reference method for genotyping Streptococcus agalactiae strains, the leading cause of infectious disease in newborns and a major cause of disease in immunocompromised children and adults. We describe here a genotyping method based on multiple locus variable number of tandem repeat (VNTR) analysis (MLVA) applied to a population of S. agalactiae strains of various origins characterized by MLST and serotyping. Results: We studied a collection of 186 strains isolated from humans and cattle and three reference strains (A909, NEM316 and 2603 V/R). Among 34 VNTRs, 6 polymorphic VNTR loci were selected for use in genotyping of the bacterial population. The MLVA profile consists of a series of allele numbers, corresponding to the number of repeats at each VNTR locus. 98 MLVA genotypes were obtained compared to 51 sequence types generated by MLST. The MLVA scheme generated clusters which corresponded well to the main clonal complexes obtained by MLST. However, it provided a higher discriminatory power. The diversity index obtained with MLVA was 0.960 compared to 0.881 with MLST for this population of strains. Conclusions: The MLVA scheme proposed here is a rapid, cheap and easy genotyping method generating results suitable for exchange and comparison between different laboratories and for the epidemiologic surveillance of S. agalactiae and analyses of outbreaks.
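
    To make the comparison of discriminatory power concrete, the sketch below treats each MLVA profile as a tuple of repeat counts and computes a diversity index over a set of strains. The abstract does not say which formula was used; this example assumes Simpson's index of diversity in the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), and the four profiles are invented for illustration.

```python
from collections import Counter

def diversity_index(profiles):
    """Simpson's index of diversity (Hunter-Gaston form) over a list of genotypes."""
    n = len(profiles)
    if n < 2:
        raise ValueError("need at least two strains")
    counts = Counter(profiles)
    # Ordered pairs of strains that share a genotype.
    same_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_pairs / (n * (n - 1))

# Hypothetical MLVA profiles: repeat counts at six VNTR loci, one tuple per strain.
strains = [
    (3, 5, 2, 7, 4, 1),
    (3, 5, 2, 7, 4, 1),
    (3, 6, 2, 7, 4, 1),
    (2, 5, 2, 8, 4, 1),
]
print(round(diversity_index(strains), 3))
```

    The index is the probability that two strains drawn at random without replacement have different genotypes, so a typing scheme that splits the same strains into more genotypes scores closer to 1.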

  9. Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

    Science.gov (United States)

    Gibbons, John G; Rokas, Antonis

    2009-03-01

    Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared to be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.

  10. MULTIPLE-LOCUS VARIABLE-NUMBER TANDEM REPEAT ANALYSIS OF BRUCELLA ISOLATES FROM THAILAND.

    Science.gov (United States)

    Kumkrong, Khurawan; Chankate, Phanita; Tonyoung, Wittawat; Intarapuk, Apiradee; Kerdsin, Anusak; Kalambaheti, Thareerat

    2017-01-01

    Brucellosis-induced abortion can result in significant economic loss to farm animals. Brucellosis can be transmitted to humans during slaughter of infected animals or via consumption of contaminated food products. Strain identification of Brucella isolates can reveal the route of transmission. Brucella strains were isolated from vaginal swabs of farm animals, cow milk and human blood cultures. Multiplex PCR was used to identify Brucella species, and owing to high DNA homology among Brucella isolates, multiple-locus variable-number tandem repeat analysis (MLVA) based on the number of tandem repeats at 16 different genomic loci was used for strain identification. Multiplex PCR categorized the isolates into B. abortus (n = 7), B. melitensis (n = 37), B. suis (n = 3), and 5 of unknown Brucella spp. MLVA-16 clustering analysis differentiated the strains into various genotypes, with Brucella isolates from the same geographic region being closely related, and revealed that the Thai isolates were phylogenetically distinct from those in other countries, including within the Southeast Asian region. Thus, MLVA-16 typing has utility in epidemiological studies.

  11. Identification and characterization of a tandem repeat in exon III of the dopamine receptor D4 (DRD4) gene in cetaceans

    DEFF Research Database (Denmark)

    Mogensen, Line; Kinze, Carl Christian; Werge, Thomas

    2006-01-01

    A large number of mammalian species harbor a tandem repeat in exon III of the gene encoding dopamine receptor D4 (DRD4), a receptor associated with cognitive functions. In this study, a DRD4 gene exon III tandem repeat from the order Cetacea was identified and characterized. Included in our study...

  12. Amyloid formation and disaggregation of α-synuclein and its tandem repeat (α-TR)

    International Nuclear Information System (INIS)

    Bae, Song Yi; Kim, Seulgi; Hwang, Heejin; Kim, Hyun-Kyung; Yoon, Hyun C.; Kim, Jae Ho; Lee, SangYoon; Kim, T. Doohun

    2010-01-01

    Research highlights: → Formation of the α-synuclein amyloid fibrils by [BIMbF3Im]. → Disaggregation of amyloid fibrils by epigallocatechin gallate (EGCG) and baicalein. → Amyloid formation of α-synuclein tandem repeat (α-TR). -- Abstract: The aggregation of α-synuclein is clearly related to the pathogenesis of Parkinson's disease. Therefore, detailed understanding of the mechanism of fibril formation is highly valuable for the development of clinical treatment and also of the diagnostic tools. Here, we have investigated the interaction of α-synuclein with ionic liquids by using several biochemical techniques including Thioflavin T assays and transmission electron microscopy (TEM). Our data show that a rapid formation of α-synuclein amyloid fibrils was stimulated by 1-butyl-3-methylimidazolium bis(trifluoromethylsulfonyl)imide [BIMbF3Im], and these fibrils could be disaggregated by polyphenols such as epigallocatechin gallate (EGCG) and baicalein. Furthermore, the effect of [BIMbF3Im] on the α-synuclein tandem repeat (α-TR) in the aggregation process was studied.

  13. A novel typing method for Listeria monocytogenes using high-resolution melting analysis (HRMA) of tandem repeat regions.

    Science.gov (United States)

    Ohshima, Chihiro; Takahashi, Hajime; Iwakawa, Ai; Kuda, Takashi; Kimura, Bon

    2017-07-17

    Listeria monocytogenes, which is responsible for causing food poisoning known as listeriosis, infects humans and animals. Widely distributed in the environment, this bacterium is known to contaminate food products after being transmitted to factories via raw materials. To minimize the contamination of products by food pathogens, it is critical to identify and eliminate factory entry routes and pathways for the causative bacteria. High resolution melting analysis (HRMA) is a method that takes advantage of differences in DNA sequences and PCR product lengths that are reflected by the disassociation temperature. Through our research, we have developed a multiple locus variable-number tandem repeat analysis (MLVA) using HRMA as a simple and rapid method to differentiate L. monocytogenes isolates. While evaluating our developed method, we compared the ability of MLVA-HRMA, MLVA using capillary electrophoresis, and multilocus sequence typing (MLST) to discriminate between strains. The MLVA-HRMA method displayed greater discriminatory ability than MLST and MLVA using capillary electrophoresis, suggesting that the variation in the number of repeat units, along with mutations within the DNA sequence, was accurately reflected by the melting curve of HRMA. Rather than relying on DNA sequence analysis or high-resolution electrophoresis, the MLVA-HRMA method employs the same process as PCR until the analysis step, suggesting a combination of speed and simplicity. The results of the MLVA-HRMA method can be shared between different laboratories. There are high expectations that this method will be adopted for regular inspections at food processing facilities in the near future. Copyright © 2017. Published by Elsevier B.V.

  14. The leucine-rich repeat structure.

    Science.gov (United States)

    Bella, J; Hindle, K L; McEwan, P A; Lovell, S C

    2008-08-01

    The leucine-rich repeat is a widespread structural motif of 20-30 amino acids with a characteristic repetitive sequence pattern rich in leucines. Leucine-rich repeat domains are built from tandems of two or more repeats and form curved solenoid structures that are particularly suitable for protein-protein interactions. Thousands of protein sequences containing leucine-rich repeats have been identified by automatic annotation methods. Three-dimensional structures of leucine-rich repeat domains determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. As the essential structural principles become well established, the leucine-rich repeat architecture is emerging as an attractive framework for structural prediction and protein engineering. This review presents an update of the current understanding of leucine-rich repeat structure at the primary, secondary, tertiary and quaternary levels and discusses specific examples from recently determined three-dimensional structures.

  15. The Asian Rice Gall Midge (Orseolia oryzae) Mitogenome Has Evolved Novel Gene Boundaries and Tandem Repeats That Distinguish Its Biotypes.

    Directory of Open Access Journals (Sweden)

    Isha Atray

    Full Text Available The complete mitochondrial genome of the Asian rice gall midge, Orseolia oryzae (Diptera; Cecidomyiidae) was sequenced, annotated and analysed in the present study. The circular genome is 15,286 bp with 13 protein-coding genes, 22 tRNAs and 2 ribosomal RNA genes, and a 578 bp non-coding control region. All protein coding genes used conventional start codons and terminated with a complete stop codon. The genome presented many unusual features: (1) rearrangement in the order of tRNAs as well as protein coding genes; (2) truncation and unusual secondary structures of tRNAs; (3) presence of two different repeat elements in separate non-coding regions; (4) presence of one pseudo-tRNA gene; (5) inversion of the rRNA genes; (6) higher percentage of non-coding regions when compared with other insect mitogenomes. Rearrangements of the tRNAs and protein coding genes are explained on the basis of the tandem duplication and random loss model, and why intramitochondrial recombination is a better model for explaining rearrangements in the O. oryzae mitochondrial genome is discussed. Furthermore, we evaluated the number of iterations of the tandem repeat elements found in the mitogenome. This led to the identification of genetic markers capable of differentiating rice gall midge biotypes and the two Orseolia species investigated.

  16. Analysis of genetic polymorphism of nine short tandem repeat loci in ...

    African Journals Online (AJOL)

    This study was carried out to investigate the genetic polymorphism of nine short tandem repeat (STR) loci including D2S1772, D6S1043, D7S3048, D8S1132, D11S2368, D12S391, D13S325, D18S1364 and D22GATA198B05 in Chinese Han population of Henan province and to assess its value in forensic science.

  17. Characterization of Dutch Staphylococcus aureus from bovine mastitis using a Multiple Locus Variable Number Tandem Repeat Analysis

    NARCIS (Netherlands)

    Ikawaty, R.; Brouwer, E.C.; Jansen, M.D.; Duijkeren, van E.; Mevius, D.J.; Verhoef, J.; Fluit, A.C.

    2009-01-01

    Current typing methods for Staphylococcus aureus have important drawbacks. We evaluated a Multiple Locus Variable Number Tandem Repeat Analysis (MLVA) scheme with 6 loci which lacks most drawbacks on 85 bovine mastitis isolates from The Netherlands. For each locus the number of repeat units (RU) was

  18. Low numbers of repeat units in variable number of tandem repeats (VNTR) regions of white spot syndrome virus are correlated with disease outbreaks.

    Science.gov (United States)

    Hoa, T T T; Zwart, M P; Phuong, N T; de Jong, M C M; Vlak, J M

    2012-11-01

    White spot syndrome virus (WSSV) is the most important pathogen in shrimp farming systems worldwide including the Mekong Delta, Vietnam. The genome of WSSV is characterized by the presence of two major 'indel regions' found at ORF14/15 and ORF23/24 (WSSV-Thailand) and three regions with variable number tandem repeats (VNTR) located in ORF75, ORF94 and ORF125. In the current study, we investigated whether or not the number of repeat units in the VNTRs correlates with virus outbreak status and/or shrimp farming practice. We analysed 662 WSSV samples from individual WSSV-infected Penaeus monodon shrimp from 104 ponds collected from two important shrimp farming regions of the Mekong Delta: Ca Mau and Bac Lieu. Using this large data set and statistical analysis, we found that for ORF94 and ORF125, the mean number of repeat units (RUs) in VNTRs was significantly lower in disease outbreak ponds than in non-outbreak ponds. Although a higher mean RU number was observed in the improved-extensive system than in the rice-shrimp or semi-intensive systems, these differences were not significant. VNTR sequences are thus not only useful markers for studying WSSV genotypes and populations, but specific VNTR variants also correlate with disease outbreaks in shrimp farming systems. © 2012 Blackwell Publishing Ltd.

  19. Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Legionella pneumophila and Development of an Optimized Multiple-Locus VNTR Analysis Typing Scheme

    Science.gov (United States)

    Pourcel, Christine; Visca, Paolo; Afshar, Baharak; D'Arezzo, Silvia; Vergnaud, Gilles; Fry, Norman K.

    2007-01-01

    The utility of a genotypic typing assay for Legionella pneumophila was investigated. A multiple-locus variable number of tandem repeats (VNTR) analysis (MLVA) scheme using PCR and agarose gel electrophoresis is proposed based on eight minisatellite markers. Panels of well-characterized strains were examined in a multicenter analysis to validate the assay and to compare its performance to that of other genotyping assays. Excellent typeability, reproducibility, stability, and epidemiological concordance were observed. The MLVA type or profile is composed of a string of allele numbers, corresponding to the number of repeats at each VNTR locus, separated by commas, in a predetermined order. A database containing information from 99 L. pneumophila serogroup 1 strains and four strains of other serogroups and their MLVA profiles, which can be queried online, is available from http://bacterial-genotyping.igmors.u-psud.fr/. PMID:17251393
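
    The profile format described above (allele numbers at each VNTR locus, comma-separated, in a predetermined order) can be sketched as follows. This is an illustration only, not the authors' software: the locus names, flanking-region sizes, repeat-unit sizes, and amplicon sizes are hypothetical, and the conversion from amplicon size to repeat number assumes the usual relation (amplicon size minus flanking size) divided by repeat-unit size.

```python
def repeat_count(amplicon_bp, flank_bp, unit_bp):
    """Estimate the number of repeat units from a PCR amplicon size."""
    return round((amplicon_bp - flank_bp) / unit_bp)

def mlva_profile(amplicons, loci):
    """Build a comma-separated MLVA profile.

    amplicons: dict mapping locus name -> measured amplicon size in bp
    loci: ordered list of (name, flank_bp, unit_bp); the list order fixes
          the predetermined order of allele numbers in the profile.
    """
    return ",".join(
        str(repeat_count(amplicons[name], flank_bp, unit_bp))
        for name, flank_bp, unit_bp in loci
    )

# Hypothetical three-locus scheme (the published scheme uses eight markers).
LOCI = [("locusA", 100, 24), ("locusB", 150, 45), ("locusC", 90, 18)]
sizes = {"locusA": 268, "locusB": 330, "locusC": 144}
profile = mlva_profile(sizes, LOCI)
```

    With these made-up sizes the profile is "7,4,3". Because the order of loci is fixed, two profiles can be compared position by position, which is what makes such strings convenient to store in and query from a shared database.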

  20. simple sequence repeat (SSR)

    African Journals Online (AJOL)

    In the present study, 78 mapped simple sequence repeat (SSR) markers representing 11 linkage groups of adzuki bean were evaluated for transferability to mungbean and related Vigna spp. 41 markers amplified characteristic bands in at least one Vigna species. The transferability percentage across the genotypes ranged ...

  1. Filipino DNA variation at 12 X-chromosome short tandem repeat markers.

    Science.gov (United States)

    Salvador, Jazelyn M; Apaga, Dame Loveliness T; Delfin, Frederick C; Calacal, Gayvelline C; Dennis, Sheila Estacio; De Ungria, Maria Corazon A

    2018-06-08

    Demands for solving complex kinship scenarios where only distant relatives are available for testing have risen in the past years. In these instances, other genetic markers such as X-chromosome short tandem repeat (X-STR) markers are employed to supplement autosomal and Y-chromosomal STR DNA typing. However, prior to use, the degree of STR polymorphism in the population requires evaluation through generation of an allele or haplotype frequency population database. This population database is also used for statistical evaluation of DNA typing results. Here, we report X-STR data from 143 unrelated Filipino male individuals who were genotyped via conventional polymerase chain reaction-capillary electrophoresis (PCR-CE) using the 12 X-STR loci included in the Investigator® Argus X-12 kit (Qiagen) and via massively parallel sequencing (MPS) of seven X-STR loci included in the ForenSeq™ DNA Signature Prep kit of the MiSeq® FGx™ Forensic Genomics System (Illumina). Allele calls between PCR-CE and MPS systems were consistent (100% concordance) across seven overlapping X-STRs. Allele and haplotype frequencies and other parameters of forensic interest were calculated based on length (PCR-CE, 12 X-STRs) and sequence (MPS, seven X-STRs) variations observed in the population. Results of our study indicate that the 12 X-STRs in the PCR-CE system are highly informative for the Filipino population. MPS of seven X-STR loci identified 73 X-STR alleles compared with 55 X-STR alleles that were identified solely by length via PCR-CE. Of the 73 sequence-based alleles observed, six alleles have not been reported in the literature. The population data presented here may serve as a reference Philippine frequency database of X-STRs for forensic casework applications. Copyright © 2018 Elsevier B.V. All rights reserved.
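
    Why sequencing can resolve more alleles than length-based typing (73 versus 55 in the abstract above) can be shown with a small sketch. The repeat regions below are invented for illustration and do not come from the study; the helper names and the 4-bp repeat unit are likewise assumptions. Two alleles with the same number of repeat units but different internal sequence collapse into one allele when only length is measured.

```python
from collections import Counter

def allele_frequencies(alleles):
    """Relative frequency of each distinct allele among the observations."""
    counts = Counter(alleles)
    total = len(alleles)
    return {allele: n / total for allele, n in counts.items()}

def length_based_call(repeat_region, unit_bp=4):
    """Length-based (CE-style) call: number of repeat units, sequence ignored."""
    return len(repeat_region) // unit_bp

# Hypothetical repeat regions from four individuals at one locus.
observed = [
    "TCTA" * 10,
    "TCTA" * 8 + "TCTG" * 2,  # same length as allele 10, different sequence
    "TCTA" * 11,
    "TCTA" * 10,
]

length_freqs = allele_frequencies([length_based_call(s) for s in observed])
sequence_freqs = allele_frequencies(observed)
```

    Here the length-based view yields two alleles (10 and 11 repeats, at frequencies 0.75 and 0.25), while the sequence-based view yields three, because the second individual carries a sequence variant of the 10-repeat allele.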

  2. Typing Method for the QUB11a Locus of Mycobacterium tuberculosis: IS6110 Insertions and Tandem Repeat Analysis

    Directory of Open Access Journals (Sweden)

    Eriko Maeda-Mitani

    2016-01-01

    Full Text Available QUB11a is used as a locus for variable number of tandem repeats (VNTR) analysis of Mycobacterium tuberculosis Beijing lineage. However, amplification of QUB11a occasionally produces large fragments (>1,400 bp) that are not easily measured by capillary electrophoresis because of a lack of the typical stutter peak patterns that are used for counting repeat numbers. IS6110 insertion may complicate VNTR analysis of large QUB11a fragments in M. tuberculosis. We established a method for determining both tandem repeat numbers and IS6110 insertion in the QUB11a locus of M. tuberculosis using capillary electrophoresis analysis and BsmBI digestion. All 29 large QUB11a fragments (>1,200 bp) investigated contained IS6110 insertions and varied in the number of repeats (18 patterns) and location of IS6110 insertions. This method allows VNTR analysis with high discrimination.

  3. First worldwide proficiency study on variable-number tandem-repeat typing of Mycobacterium tuberculosis complex strains.

    NARCIS (Netherlands)

    Beer, J.L. de; Kremer, K.; Kodmon, C.; Supply, P.; Soolingen, D. van

    2012-01-01

    Although variable-number tandem-repeat (VNTR) typing has gained recognition as the new standard for the DNA fingerprinting of Mycobacterium tuberculosis complex (MTBC) isolates, external quality control programs have not yet been developed. Therefore, we organized the first multicenter proficiency

  4. Linking Y‐chromosomal short tandem repeat loci to human male impulsive aggression

    OpenAIRE

    Yang, Chun; Ba, Huajie; Cao, Yin; Dong, Guoying; Zhang, Shuyou; Gao, Zhiqin; Zhao, Hanqing; Zhou, Xianju

    2017-01-01

    Introduction: Men are more susceptible to impulsive behavior than women. Epidemiological studies revealed that the impulsive aggressive behavior is affected by genetic factors, and the male‐specific Y chromosome plays an important role in this behavior. In this study, we investigated the association between the impulsive aggressive behavior and Y‐chromosomal short tandem repeats (Y‐STRs) loci. Methods: The collected biologic samples from 271 offenders with impulsive aggressive behavior...

  5. Expansion of protein domain repeats.

    Directory of Open Access Journals (Sweden)

    Asa K Björklund

    2006-08-01

    Full Text Available Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein-protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.

  6. Allele Frequency Data for 17 Short Tandem Repeats in a Czech Population Sample

    Czech Academy of Sciences Publication Activity Database

    Šimková, H.; Faltus, Václav; Marván, Richard; Pexa, T.; Stenzl, V.; Brouček, J.; Hořínek, A.; Mazura, Ivan; Zvárová, Jana

    2009-01-01

    Vol. 4, No. 1 (2009), e15-e17 ISSN 1872-4973 R&D Projects: GA MŠk(CZ) 1M06014 Institutional research plan: CEZ:AV0Z10300504 Keywords: short tandem repeat (STR) * allelic frequency * PowerPlex 16 System * AmpflSTR Identifiler * population genetics * Czech Republic Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.421, year: 2009

  7. Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

    Science.gov (United States)

    Trofimova, Irina; Krasikova, Alla

    2016-12-01

    Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with the biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeat transcription in amphibian and avian model organisms are fragmentary despite their genomes being thoroughly characterized. This review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis, when chromosomes acquire the special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in the interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes, combined with novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, and dynamic observation in life-like conditions, provide amazing opportunities for investigating the mechanisms of satellite DNA transcription.

  8. Exact Tandem Repeats Analyzer (E-TRA): A new program for DNA ...

    Indian Academy of Sciences (India)

    Unknown

    Advanced user defined parameters/options let the researchers use different minimum motif repeats ... E-TRA, we used 5,465,605 human EST sequences derived from 18,814,550 ..... repeat rates of T-cells, embryo and testis were higher.

  9. Variable-number tandem repeats as molecular markers for biotypes of Pasteuria ramosa in Daphnia spp.

    Science.gov (United States)

    Mouton, Laurence; Nong, Guang; Preston, James F; Ebert, Dieter

    2007-06-01

    Variable-number tandem repeats (VNTRs) have been identified in populations of Pasteuria ramosa, a castrating endobacterium of Daphnia species. The allelic polymorphisms at 14 loci in laboratory and geographically diverse soil samples showed that VNTRs may serve as biomarkers for the genetic characterization of P. ramosa isolates.

  10. Epitopes of MUC1 Tandem Repeats in Cancer as Revealed by Antibody Crystallography: Toward Glycopeptide Signature-Guided Therapy

    Directory of Open Access Journals (Sweden)

    Dapeng Zhou

    2018-05-01

    Full Text Available Abnormally O-glycosylated MUC1 tandem repeat glycopeptide epitopes expressed by multiple types of cancer have long been attractive targets for therapy in the race against genetic mutations of tumor cells. Glycopeptide signature-guided therapy might be a more promising avenue than mutation signature-guided therapy. Three O-glycosylated peptide motifs, PDTR, GSTA, and GVTS, exist in a tandem repeat HGVTSAPDTRPAPGSTAPPA, containing five O-glycosylation sites. The exact peptide and sugar residues involved in antibody binding are poorly defined. Co-crystal structures of glycopeptides and respective monoclonal antibodies are very few. Here we review 3 groups of monoclonal antibodies: antibodies which only bind to peptide portion, antibodies which only bind to sugar portion, and antibodies which bind to both peptide and sugar portions. The antigenicity of peptide and sugar portions of glyco-MUC1 tandem repeat were analyzed according to available biochemical and structural data, especially the GSTA and GVTS motifs independent from the most studied PDTR. Tn is focused as a peptide-modifying residue in vaccine design, to induce glycopeptide-binding antibodies with cross reactivity to Tn-related tumor glycans, but not glycans of healthy cells. The unique requirement for the designs of antibody in antibody-drug conjugate, bi-specific antibodies, and chimeric antigen receptors are also discussed.
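
    The repeat unit quoted above can be checked directly: a short sketch, using only the 20-residue sequence and the three motif names given in the abstract, locates the motifs and lists the serine and threonine residues, which are the residues that can carry O-linked glycans. The function names are hypothetical, and treating every Ser/Thr as a candidate site is a simplification; the count nevertheless agrees with the five O-glycosylation sites stated in the abstract.

```python
MUC1_REPEAT = "HGVTSAPDTRPAPGSTAPPA"   # tandem repeat unit quoted in the abstract
MOTIFS = ("PDTR", "GSTA", "GVTS")      # motifs named in the abstract

def ser_thr_positions(peptide):
    """1-based positions of Ser (S) and Thr (T) residues in the peptide."""
    return [i + 1 for i, residue in enumerate(peptide) if residue in "ST"]

def motif_positions(peptide, motifs=MOTIFS):
    """1-based start position of each motif (0 if the motif is absent)."""
    return {motif: peptide.find(motif) + 1 for motif in motifs}

sites = ser_thr_positions(MUC1_REPEAT)
starts = motif_positions(MUC1_REPEAT)
```

    The scan gives Ser/Thr at positions 4, 5, 9, 15 and 16 of the repeat, with GVTS starting at position 2, PDTR at position 7 and GSTA at position 14, so each motif contains at least one candidate site.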

  11. Variable number of tandem repeats of 9 Plasmodium vivax genes among Southeast Asian isolates.

    Science.gov (United States)

    Wang, Bo; Nyunt, Myat Htut; Yun, Seung-Gyu; Lu, Feng; Cheng, Yang; Han, Jin-Hee; Ha, Kwon-Soo; Park, Won Sun; Hong, Seok-Ho; Lim, Chae-Seung; Cao, Jun; Sattabongkot, Jetsumon; Kyaw, Myat Phone; Cui, Liwang; Han, Eun-Taek

    2017-06-01

    The variable number of tandem repeats (VNTRs) provides valuable information about both the functional and evolutionary aspects of genetic diversity. Comparative analysis of 3 Plasmodium falciparum genomes has shown that more than 9% of its open reading frames (ORFs) harbor VNTRs. Although microsatellites and VNTR genes of P. vivax were reported, the VNTR polymorphism of genes has not been examined widely. In this study, 230 P. vivax genes were analyzed for VNTRs by SERV, and 33 kinds of TR deletions or insertions from 29 P. vivax genes (12.6%) were found. Of these, 9 VNTR fragments from 8 P. vivax genes were used for PCR amplification and sequence analysis to examine the genetic diversity among 134 isolates from four Southeast Asian countries (China, Republic of Korea, Thailand, and Myanmar) with different malaria endemicity. We confirmed the existence of extensive polymorphism of VNTR fragments in field isolates. This detection provides several suitable markers for analysis of the molecular epidemiology of P. vivax field isolates. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. [Reticulate evolution of parthenogenetic species of the Lacertidae rock lizards: inheritance of CLsat tandem repeats and anonymous RAPD markers].

    Science.gov (United States)

    Chobanu, D; Rudykh, I A; Riabinina, N L; Grechko, V V; Kramerov, D A; Darevskiĭ, I S

    2002-01-01

    The genetic relatedness of several bisexual and of four unisexual "Lacerta saxicola complex" lizards was studied, using monomer sequences of the complex-specific CLsat tandem repeats and anonymous RAPD markers. Genomes of parthenospecies were shown to include different satellite monomers. The structure of each such monomer is specific for a certain pair of bisexual species. This fact might be interpreted in favor of co-dominant inheritance of these markers in bisexual species hybridogenesis. This idea is supported by the results obtained with RAPD markers; i.e., unisexual species genomes include only the loci characteristic of certain bisexual species. At the same time, in neither case parthenospecies possess specific, autoapomorphic loci that were not present in this or that bisexual species.

  13. Large scale analysis of small repeats via mining of the human genome

    NARCIS (Netherlands)

    van den Berg, I.; Bosnacki, D.; Hilbers, P.A.J.

    2009-01-01

    Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, both in coding and in non-coding regions. Their role is still mostly unknown, but at least 20 of those repetitive sequences have been related to neurodegenerative disorders. The mutational process that is

  14. Roles of repetitive sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bell, G.I.

    1991-12-31

    The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.

  15. Massively parallel sequencing of forensic STRs

    DEFF Research Database (Denmark)

    Parson, Walther; Ballard, David; Budowle, Bruce

    2016-01-01

    The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that...

  16. Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT)n·(GA)n microsatellite embedded within the U2 repeat unit

    Energy Technology Data Exchange (ETDEWEB)

    Liao, D.; Weiner, A.M. [Yale Univ., New Haven, CT (United States)]

    1995-12-10

    The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT)n·(GA)n dinucleotide repeat (n ≈ 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to study the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT)n tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, orangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.

  17. GENETIC VARIATION IN RED RASPBERRIES (RUBUS IDAEUS L.; ROSACEAE) FROM SITES DIFFERING IN ORGANIC POLLUTANTS COMPARED WITH SYNTHETIC TANDEM REPEAT DNA PROBES

    Science.gov (United States)

    Two synthetic tandem repetitive DNA probes were used to compare genetic variation at variable-number-tandem-repeat (VNTR) loci among Rubus idaeus L. var. strigosus (Michx.) Maxim. (Rosaceae) individuals sampled at eight sites contaminated by pollutants (N = 39) and eight adjacent...

  18. Comparison of Variable Number Tandem Repeat and Short Tandem Repeat Genetic Markers for Qualitative and Quantitative Chimerism Analysis Post Allogeneic Stem Cell Transplantation

    International Nuclear Information System (INIS)

    Mossallam, G.I.; Smith, A.G.; Mcfarland, C.

    2005-01-01

    Analysis of donor chimerism has become a routine procedure for the documentation of engraftment after allogeneic hematopoietic stem cell transplantation. Quantitative analysis of chimerism kinetics has been shown to predict graft failure or relapse. In this study, we compared the use of variable number tandem repeats (VNTR) and short tandem repeats (STR) as polymorphic genetic markers in chimerism analysis. This study included qualitative and quantitative assessment of both techniques to assess informative yield and sensitivity. Patients and Methods: We analyzed 206 samples representing 40 transplant recipients and their HLA identical sibling donors. A panel of six VNTR loci, 15 STR loci and 1 sex chromosome locus was used. Amplified VNTR products were visualized in an ethidium bromide stained gel. STR loci were amplified using fluorescent primers, and the products were analyzed by capillary electrophoresis. VNTR and STR analysis gave comparable qualitative results in the majority of cases. The incidence of mixed chimerism (MC) by STR analysis was 45% compared to 32% in cases evaluated by VNTR analysis. STR markers were more informative; several informative loci could be identified in all patients. Unique alleles for both patient and donor could be identified in all patients by STR versus 32/40 by VNTR analysis. The STR markers were also more sensitive in the detection of chimerism. The size of VNTR alleles and differences between the size of donor and recipient VNTR alleles affected the sensitivity of detection. With both techniques, quantitative assessment of chimerism showed some discrepancies between the estimated and the calculated percentage of donor DNA. Discordance between the two estimates was observed in 8/19 patients with MC. However, sequential monitoring of the relative band intensity of VNTR alleles offered some insight into the direction of change in engraftment over time.
The higher yield of informative loci with STR and the automated measurement of

  19. Tandem repeats, high copy number and remarkable diel expression rhythm of form II RuBisCO in Prorocentrum donghaiense (Dinophyceae).

    Directory of Open Access Journals (Sweden)

    Xinguo Shi

    Gene structure and expression regulation of form II RuBisCO (rbcII) in dinoflagellates are still poorly understood. Here we isolated this gene (Pdrbc) and investigated its diel expression pattern in a harmful algal bloom forming dinoflagellate, Prorocentrum donghaiense. We obtained cDNA sequences with triple tandem repeats of the coding unit (CU); the 5' region has the sequence of a typical dinoflagellate plastid gene, encoding an N-terminus with two transmembrane regions separated by a plastid transit peptide. The CUs (1,455 bp, except 1,464 bp in the last CU) are connected through a 63 bp spacer. Phylogenetic analysis showed that rbcII CUs within species formed monophyletic clusters, indicative of intraspecific gene duplication or purifying evolution. Using quantitative PCR (qPCR) we estimated 117±40 CUs of Pdrbc in the P. donghaiense genome. Although it is commonly believed that most dinoflagellate genes lack transcriptional regulation, our RT-qPCR analysis on synchronized cultures revealed a remarkable diel rhythm of Pdrbc expression, showing significant correlations of transcript abundance with the timing of the dark-to-light transition and cell cycle G2M-phase. When the cultures were shifted to continuous light, Pdrbc expression remained significantly correlated with the G2M-phase. Under continuous darkness the cell cycle was arrested at the G1 phase, and the rhythm of Pdrbc transcription disappeared. Our results suggest that dinoflagellate rbcII (1) undergoes duplication or sequence purification within species, (2) is organized in tandem arrays in most species, probably to facilitate efficient translation and import of the encoded enzyme, and (3) is regulated transcriptionally in a cell cycle-dependent fashion, at least in some dinoflagellates.

  20. [Identification of novel variable number tandem repeat (VNTR) loci in Mycobacterium avium and development of an effective means of VNTR typing].

    Science.gov (United States)

    Kurokawa, Kazuhiro; Uchiya, Kei-Ichi; Yagi, Tetsuya; Takahashi, Hiroyasu; Niimi, Masaki; Ichikawa, Kazuya; Inagaki, Takayuki; Moriyama, Makoto; Nikai, Toshiaki; Hayashi, Yuta; Nakagawa, Taku; Ogawa, Kenji

    2012-07-01

    To make more effective use of variable number tandem repeat (VNTR) typing, we identified novel VNTR loci in Mycobacterium avium and used them for modified M. avium tandem repeat-VNTR (MATR-VNTR) typing. Analysis of a DNA sample extracted from a clinical isolate (strain HN135) with the FLX system genome sequencer (Roche Diagnostic System) led to discovery of several novel VNTR loci. The allelic diversity of the novel VNTR loci was evaluated for 71 clinical isolates and compared with the diversity of the MATR-VNTR loci. To improve efficacy of MATR-VNTR typing, we tested typing using 2 sets of loci selected from the newly identified loci and the MATR loci, i.e., one set containing 7 and another 16 loci. Hunter Gaston's discriminatory index (HGDI) was calculated for these sets. Six VNTR loci were newly identified, of which 5 showed a high diversity. The HGDI was 0.980 for the improved new typing using a set of 7 loci, and 0.995 for another set of 16 loci, while it was 0.992 for the conventional MATR-VNTR typing. VNTR typing with the set of the 7 loci enabled a rapid analysis, and another set of 16 loci enabled a precise analysis, as compared with conventional MATR-VNTR typing. A method that uses only VNTR loci with relatively high allelic diversity is considered to be a useful tool for VNTR typing of MAC isolates.
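
The Hunter-Gaston discriminatory index (HGDI) reported above can be illustrated with a minimal Python sketch. It uses the standard definition, one minus the probability that two isolates drawn without replacement share the same type. The function name and the example profiles are hypothetical and are not data from the study.

```python
from collections import Counter


def hunter_gaston_index(type_labels):
    """Return the Hunter-Gaston discriminatory index for per-isolate type labels.

    HGDI = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)),
    where N is the number of isolates and n_j the number of isolates of type j.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical VNTR profiles: one tuple of repeat counts per isolate.
# These are illustrative values only, not results from the abstract.
profiles = [(2, 3, 1), (2, 3, 1), (2, 4, 1), (3, 3, 2), (1, 5, 2)]
print(round(hunter_gaston_index(profiles), 3))
```

An index of 1.0 means every isolate has a distinct profile, and 0.0 means all isolates are indistinguishable, which is why a locus set with a higher HGDI is described as more discriminating.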

  1. Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells

    DEFF Research Database (Denmark)

    Rodriguez, Jairo; Vives, Laura; Jordà, Mireia

    2008-01-01

    Methylation of cytosine is the most frequent epigenetic modification of DNA in mammalian cells. In humans, most of the methylated cytosines are found in CpG-rich sequences within tandem and interspersed repeats that make up to 45% of the human genome, with Alu repeats being the most common family....

  2. Simple sequence repeat marker development and genetic mapping ...

    Indian Academy of Sciences (India)

    polymorphic SSR (simple sequence repeats) markers from libraries enriched for GA, CAA and AAT repeats, as well as 6 ... ers for quinoa was the development of a genetic linkage map ...... Weber J. L. 1990 Informativeness of human (dC-dA)n.

  3. Structural basis for sequence-specific recognition of DNA by TAL effectors

    KAUST Repository

    Deng, Dong; Yan, Chuangye; Pan, Xiaojing; Mahfouz, Magdy M.; Wang, Jiawei; Zhu, Jiankang; Shi, Yi Gong; Yan, Nieng

    2012-01-01

    TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat comprises 33 to 35 conserved amino acids and targets a specific base pair

  4. DNA fingerprinting of Mycobacterium leprae strains using variable number tandem repeat (VNTR) - fragment length analysis (FLA).

    Science.gov (United States)

    Jensen, Ronald W; Rivest, Jason; Li, Wei; Vissa, Varalakshmi

    2011-07-15

    The study of the transmission of leprosy is particularly difficult since the causative agent, Mycobacterium leprae, cannot be cultured in the laboratory. The only sources of the bacteria are leprosy patients, and experimentally infected armadillos and nude mice. Thus, many of the methods used in modern epidemiology are not available for the study of leprosy. Despite an extensive global drug treatment program for leprosy implemented by the WHO, leprosy remains endemic in many countries with approximately 250,000 new cases each year. The entire M. leprae genome has been mapped and many loci have been identified that have repeated segments of 2 or more base pairs (called micro- and minisatellites). Clinical strains of M. leprae may vary in the number of tandem repeated segments (short tandem repeats, STR) at many of these loci. Variable number tandem repeat (VNTR) analysis has been used to distinguish different strains of the leprosy bacilli. Some of the loci appear to be more stable than others, showing less variation in repeat numbers, while others seem to change more rapidly, sometimes in the same patient. While the variability of certain VNTRs has brought up questions regarding their suitability for strain typing, the emerging data suggest that analyzing multiple loci, which are diverse in their stability, can be used as a valuable epidemiological tool. Multiple locus VNTR analysis (MLVA) has been used to study leprosy evolution and transmission in several countries including China, Malawi, the Philippines, and Brazil. MLVA involves multiple steps. First, bacterial DNA is extracted along with host tissue DNA from clinical biopsies or slit skin smears (SSS). The desired loci are then amplified from the extracted DNA via polymerase chain reaction (PCR). Fluorescently-labeled primers for 4-5 different loci are used per reaction, with 18 loci being amplified in a total of four reactions. The PCR products may be subjected to agarose gel electrophoresis to verify the

  5. Detection and quantitative characterization of artificial extra peaks following polymerase chain reaction amplification of 14 short tandem repeat systems used in forensic investigations

    DEFF Research Database (Denmark)

    Meldgaard, Michael; Morling, N

    1997-01-01

    Detection on automated DNA sequencers of polymerase chain reaction (PCR) products of tetra- and penta-nucleotide short tandem repeat (STR) loci frequently reveals one or more extra peaks along with the true, major allele peak. The most frequent extra peak pattern is a single smaller peak which...... is one repeat unit shorter than the true allele peak. The existence of such artificial peaks is of special importance when the methods are used for forensic investigations because the artificial extra peaks may simulate true alleles when samples containing mixtures of DNA from different individuals...... are analyzed. We have investigated the relative levels of formation of extra peaks in 14 STR marker systems. We found that not only the parameters of the PCR but also factors determining the stringency during the post-PCR and pre-electrophoresis handling of samples were of importance for the formation of extra...

  6. Application of Variable-Number Tandem-Repeat Typing To Discriminate Ralstonia solanacearum Strains Associated with English Watercourses and Disease Outbreaks

    Science.gov (United States)

    Bryant, Ruth; Bew, Janice; Conyers, Christine; Stones, Robert; Alcock, Michael; Elphinstone, John

    2013-01-01

    Variable-number tandem-repeat (VNTR) analysis was used for high-resolution discrimination among Ralstonia solanacearum phylotype IIB sequevar 1 (PIIB-1) isolates and further evaluated for use in source tracing. Five tandem-repeat-containing loci (comprising six tandem repeats) discriminated 17 different VNTR profiles among 75 isolates from potato, geranium, bittersweet (Solanum dulcamara), tomato, and the environment. R. solanacearum isolates from crops at three unrelated outbreak sites where river water had been used for irrigation had distinct VNTR profiles that were shared with PIIB-1 isolates from infected bittersweet growing upriver of each site. The VNTR profiling results supported the implication that the source of R. solanacearum at each outbreak was contaminated river water. Analysis of 51 isolates from bittersweet growing in river water at different locations provided a means to evaluate the technique for studying the epidemiology of the pathogen in the environment. Ten different VNTR profiles were identified among bittersweet PIIB-1 isolates from the River Thames. Repeated findings of contiguous river stretches that produced isolates that shared single VNTR profiles supported the hypothesis that the pathogen had disseminated from infected bittersweet plants located upriver. VNTR profiles shared between bittersweet isolates from two widely separated Thames tributaries (River Ray and River Colne) suggested they were independently contaminated with the same clonal type. Some bittersweet isolates had VNTR profiles that were shared with potato isolates collected outside the United Kingdom. It was concluded that VNTR profiling could contribute to further understanding of R. solanacearum epidemiology and assist in control of future disease outbreaks. PMID:23892739

  7. Genotyping of Bacillus anthracis strains based on automated capillary 25-loci Multiple Locus Variable-Number Tandem Repeats Analysis

    Directory of Open Access Journals (Sweden)

    Ciervo Alessandra

    2006-04-01

    Background: The genome of Bacillus anthracis, the etiological agent of anthrax, is highly monomorphic, which makes differentiation between strains difficult. A Multiple Locus Variable-number tandem repeats (VNTR) Analysis (MLVA) assay based on 20 markers was previously described. It has considerable discrimination power, reproducibility, and low cost, especially since the markers proposed can be typed by agarose-gel electrophoresis. However, in an emergency situation, faster genotyping and access to representative databases are necessary. Results: Genotyping of B. anthracis reference strains and isolates from France and Italy was done using a 25 loci MLVA assay combining 21 previously described loci and 4 new ones. DNA was amplified in 4 multiplex PCR reactions and the length of the resulting 25 amplicons was estimated by automated capillary electrophoresis. The results were reproducible and the data were consistent with other gel based methods once differences in mobility patterns were taken into account. Some alleles previously unresolved by agarose gel electrophoresis could be resolved by capillary electrophoresis, thus further increasing the assay resolution. One particular locus, Bams30, is the result of a recombination between a 27 bp tandem repeat and a 9 bp tandem repeat. The analysis of the array illustrates the evolution process of tandem repeats. Conclusion: In a crisis situation of suspected bioterrorism, standardization, speed and accuracy, together with the availability of reference typing data, are important issues, as illustrated by the 2001 anthrax letters event. In this report we describe an upgrade of the previously published MLVA method for genotyping of B. anthracis and apply the method to the typing of French and Italian B. anthracis strain collections. The increased number of markers studied compared to reports using only 8 loci greatly improves the discrimination power of the technique. 
An Italian strain belonging to the

  8. The expansion of heterochromatin blocks in rye reflects the co-amplification of tandem repeats and adjacent transposable elements

    Czech Academy of Sciences Publication Activity Database

    Evtushenko, E.V.; Levitsky, V.G.; Elisafenko, E.A.; Gunbin, K.V.; Belousov, A.I.; Šafář, Jan; Doležel, Jaroslav; Vershinin, A.V.

    2016-01-01

    Vol. 17, May 4 (2016), p. 337. ISSN 1471-2164. R&D Projects: GA MŠk(CZ) LO1204. Institutional support: RVO:61389030. Keywords: Tandem repeats; Transposable elements; Subtelomeric heterochromatin. Subject RIV: EB - Genetics; Molecular Biology. Impact factor: 3.729, year: 2016

  9. ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

    Science.gov (United States)

    Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

    2006-06-16

    The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and give the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependent protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate, among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.

  10. Tandem repeat variation near the HIC1 (hypermethylated in cancer 1) promoter predicts outcome of oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer.

    Science.gov (United States)

    Okazaki, Satoshi; Schirripa, Marta; Loupakis, Fotios; Cao, Shu; Zhang, Wu; Yang, Dongyun; Ning, Yan; Berger, Martin D; Miyamoto, Yuji; Suenaga, Mitsukuni; Iqubal, Syma; Barzi, Afsaneh; Cremolini, Chiara; Falcone, Alfredo; Battaglin, Francesca; Salvatore, Lisa; Borelli, Beatrice; Helentjaris, Timothy G; Lenz, Heinz-Josef

    2017-11-15

    The hypermethylated in cancer 1/sirtuin 1 (HIC1/SIRT1) axis plays an important role in regulating the nucleotide excision repair pathway, which is the main oxaliplatin-induced damage-repair system. On the basis of prior evidence that the variable number of tandem repeat (VNTR) sequence located near the promoter lesion of HIC1 is associated with HIC1 gene expression, the authors tested the hypothesis that this VNTR is associated with clinical outcome in patients with metastatic colorectal cancer who receive oxaliplatin-based chemotherapy. Four independent cohorts were tested. Patients who received oxaliplatin-based chemotherapy served as the training cohort (n = 218), and those who received treatment without oxaliplatin served as the control cohort (n = 215). Two cohorts of patients who received oxaliplatin-based chemotherapy were used for validation studies (n = 176 and n = 73). The VNTR sequence near HIC1 was analyzed by polymerase chain reaction analysis and gel electrophoresis and was tested for associations with the response rate, progression-free survival (PFS), and overall survival. In the training cohort, patients who harbored at least 5 tandem repeats (TRs) in both alleles had a significantly shorter PFS compared with those who had fewer than 4 TRs in at least 1 allele (9.5 vs 11.6 months; hazard ratio, 1.93; P = .012), and these findings remained statistically significant after multivariate analysis (hazard ratio, 2.00; 95% confidence interval, 1.13-3.54; P = .018). This preliminary association was confirmed in the validation cohort, and patients who had at least 5 TRs in both alleles had a worse PFS compared with the other cohort (7.9 vs 9.8 months; hazard ratio, 1.85; P = .044). The current findings suggest that the VNTR sequence near HIC1 could be a predictive marker for oxaliplatin-based chemotherapy in patients with metastatic colorectal cancer. Cancer 2017;123:4506-14. © 2017 American Cancer Society.

  11. Stress-induced rearrangement of Fusarium retrotransposon sequences.

    Science.gov (United States)

    Anaya, N; Roncero, M I

    1996-11-27

    Rearrangement of the Fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.

  12. DNA Fingerprint Analysis of Three Short Tandem Repeat (STR) Loci for Biochemistry and Forensic Science Laboratory Courses

    Science.gov (United States)

    McNamara-Schroeder, Kathleen; Olonan, Cheryl; Chu, Simon; Montoya, Maria C.; Alviri, Mahta; Ginty, Shannon; Love, John J.

    2006-01-01

    We have devised and implemented a DNA fingerprinting module for an upper division undergraduate laboratory based on the amplification and analysis of three of the 13 short tandem repeat loci that are required by the Federal Bureau of Investigation Combined DNA Index System (FBI CODIS) database. Students first collect human epithelial (cheek)…

  13. Identification, variation and transcription of pneumococcal repeat sequences

    Science.gov (United States)

    2011-01-01

    Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003

  14. Visualization of tandem repeat mutagenesis in Bacillus subtilis.

    Science.gov (United States)

    Dormeyer, Miriam; Lentes, Sabine; Ballin, Patrick; Wilkens, Markus; Klumpp, Stefan; Kohlheyer, Dietrich; Stannek, Lorena; Grünberger, Alexander; Commichau, Fabian M

    2018-03-01

    Mutations are crucial for the emergence and evolution of proteins with novel functions, and thus for the diversity of life. Tandem repeats (TRs) are mutational hot spots that are present in the genomes of all organisms. Understanding the molecular mechanism underlying TR mutagenesis at the level of single cells requires the development of mutation reporter systems. Here, we present a mutation reporter system that is suitable to visualize mutagenesis of TRs occurring in single cells of the Gram-positive model bacterium Bacillus subtilis using microfluidic single-cell cultivation. The system allows measuring the elimination of TR units due to growth rate recovery. The cultivation of bacteria carrying the mutation reporter system in microfluidic chambers allowed us for the first time to visualize the emergence of a specific mutation at the level of single cells. The application of the mutation reporter system in combination with microfluidics might be helpful to elucidate the molecular mechanism underlying TR (in)stability in bacteria. Moreover, the mutation reporter system might be useful to assess whether mutations occur in response to nutrient starvation. Copyright © 2018 Elsevier B.V. All rights reserved.

  15. A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

    Science.gov (United States)

    Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

    2007-01-01

    We obtained the complete mitochondrial genome of U. thibetanus mupinensis by DNA sequencing based on the PCR fragments of 18 primers we designed. The results indicate that the mtDNA is 16,868 bp in size and encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The control region (CR), located between tRNA-Pro and tRNA-Phe, is 1,422 bp in size, constitutes 8.43% of the whole genome, has a GC content of 51.9%, and contains one 6 bp tandem repeat and two 10 bp tandem repeats identified using the Tandem Repeats Finder. The U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108
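
The summary figures in this record can be cross-checked with a few lines of Python. This is only an arithmetic sketch using the numbers quoted in the abstract; the function and variable names are hypothetical.

```python
def percent_of_genome(region_bp, genome_bp):
    """Return the share of the genome occupied by a region, in percent."""
    return region_bp / genome_bp * 100


# Values quoted in the abstract above.
genome_bp = 16868          # complete mtDNA length
control_region_bp = 1422   # control region length
h_strand_composition = {"A": 31.2, "C": 25.4, "G": 15.5, "T": 27.9}

# Control region share of the genome, rounded to two decimals as in the abstract.
print(round(percent_of_genome(control_region_bp, genome_bp), 2))

# The four base percentages should add up to 100 (rounded to avoid float noise).
print(round(sum(h_strand_composition.values()), 1))
```

Both checks agree with the abstract: the control region share rounds to 8.43% and the base composition sums to 100%.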

  16. Exceptionally long 5' UTR short tandem repeats specifically linked to primates.

    Science.gov (United States)

    Namdar-Aligoodarzi, P; Mohammadparast, S; Zaker-Kandjani, B; Talebi Kakroodi, S; Jafari Vesiehsari, M; Ohadi, M

    2015-09-10

    We have previously reported genome-scale short tandem repeats (STRs) in the core promoter interval (i.e. -120 to +1 to the transcription start site) of protein-coding genes that have evolved identically in primates vs. non-primates. Those STRs may function as evolutionary switch codes for primate speciation. In the current study, we used the Ensembl database to analyze the 5' untranslated region (5' UTR) between +1 and +60 of the transcription start site of the entire human protein-coding genes annotated in the GeneCards database, in order to identify "exceptionally long" STRs (≥5-repeats), which may be of selective/adaptive advantage. The importance of this critical interval is its function as core promoter, and its effect on transcription and translation. In order to minimize ascertainment bias, we analyzed the evolutionary status of the human 5' UTR STRs of ≥5-repeats in several species encompassing six major orders and superorders across mammals, including primates, rodents, Scandentia, Laurasiatheria, Afrotheria, and Xenarthra. We introduce primate-specific STRs, and STRs which have expanded from mouse to primates. Identical co-occurrence of the identified STRs of rare average frequency between 0.006 and 0.0001 in primates supports a role for those motifs in processes that diverged primates from other mammals, such as neuronal differentiation (e.g. APOD and FGF4), and craniofacial development (e.g. FILIP1L). A number of the identified STRs of ≥5-repeats may be human-specific (e.g. ZMYM3 and DAZAP1). Future work is warranted to examine the importance of the listed genes in primate/human evolution, development, and disease. Copyright © 2015 Elsevier B.V. All rights reserved.
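
A simple way to picture the "≥5-repeats" criterion is a regular-expression scan for a short motif followed by at least four further copies of itself. The Python sketch below is an assumption-laden illustration, not the authors' pipeline: the 1-6 bp motif range, the function name and the example sequence are all hypothetical.

```python
import re


def find_strs(seq, min_repeats=5, max_motif_len=6):
    """Find perfect short tandem repeats in a DNA sequence.

    Returns a list of (start, motif, repeat_count) tuples, with 0-based start.
    The lazy quantifier makes the regex prefer the shortest motif at each start.
    """
    seq = seq.upper()
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif_len, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


# Hypothetical stretch of sequence containing (GCA)x5; not taken from any gene.
example = "TTAGG" + "GCA" * 5 + "TCT"
print(find_strs(example))
```

The scan only reports perfect repeats; interrupted or imperfect tracts would need a dedicated tool such as the Tandem Repeats Finder mentioned elsewhere in this listing.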

  17. The polymorphic integumentary mucin B.1 from Xenopus laevis contains the short consensus repeat.

    Science.gov (United States)

    Probst, J C; Hauser, F; Joba, W; Hoffmann, W

    1992-03-25

    The frog integumentary mucin B.1 (FIM-B.1), discovered by molecular cloning, contains a cysteine-rich C-terminal domain which is homologous with von Willebrand factor. With the help of the polymerase chain reaction, we now characterize a contiguous region 5' to the von Willebrand factor domain containing the short consensus repeat typical of many proteins from the complement system. Multiple transcripts have been cloned, which originate from a single animal and differ by a variable number of tandem repeats (rep-33 sequences). These different transcripts probably originate solely from two genes and are presumably generated by alternative splicing of a huge array of functional cassettes. This model is supported by analysis of genomic FIM-B.1 sequences from Xenopus laevis. Here, rep-33 sequences are arranged in an interrupted array of individual units. Additionally, results of Southern analysis revealed genetic polymorphism between different animals which is predicted to be within the tandem repeats. A first investigation of the predicted mucins with the help of a specific antibody against a synthetic peptide determined the molecular mass of FIM-B.1 to be greater than 200 kDa. Here again, genetic polymorphism between different animals is detected.

  18. Isolation of human simple repeat loci by hybridization selection.

    Science.gov (United States)

    Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

    1994-04-01

    We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.

  19. High quality maize centromere 10 sequence reveals evidence of frequent recombination events

    Directory of Open Access Journals (Sweden)

    Thomas Kai Wolfgruber

    2016-03-01

    The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR) has presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 x 10^-6 and 5 x 10^-5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb of the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length centromeric retrotransposons from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB) repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. This repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to facilitate the repair of frequent DSBs in centromeres.

  20. Determination of allele frequencies in nine short tandem repeat loci ...

    African Journals Online (AJOL)

    2008-04-17

    Apr 17, 2008 ... out the human genome. These loci are a rich source of highly polymorphic markers that may be detected using the polymerase chain reaction (PCR). PCR is a mimic of the normal cellular process of replication of DNA molecules. Each STR is distinguished by the number of times a sequence is repeated, ...

  1. Always look on both sides: phylogenetic information conveyed by simple sequence repeat allele sequences.

    Directory of Open Access Journals (Sweden)

    Stéphanie Barthe

    Simple sequence repeat (SSR) markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily, mutations in the target sequences follow the stepwise mutation model (SMM). Generally speaking, PCR amplicon sizes are used as direct indicators of the number of SSR repeats composing an allele, with the data analysis either ignoring the extent of allele size differences or assuming that there is a direct correlation between differences in amplicon size and evolutionary distance. However, without precisely knowing the kind and distribution of polymorphism within an allele (SSR) and the associated flanking region (FR) sequences, it is hard to say what kind of evolutionary message is conveyed by such a synthetic descriptor of polymorphism as DNA amplicon size. In this study, we sequenced several SSR alleles in multiple populations of three divergent tree genera and disentangled the types of polymorphisms contained in each portion of the DNA amplicon containing an SSR. The patterns of diversity provided by amplicon size variation, SSR variation itself, insertions/deletions (indels), and single nucleotide polymorphisms (SNPs) observed in the FRs were compared. Amplicon size variation largely reflected SSR repeat number. The amount of variation was as large in FRs as in the SSR itself. The former contributed significantly to the phylogenetic information and sometimes was the main source of differentiation among individuals and populations contained by FR and SSR regions of SSR markers. 
The presence of mutations occurring at different rates within a marker's sequence offers the opportunity to analyse evolutionary events occurring on various timescales, but at the same time calls for caution in the interpretation of SSR marker data when the distribution of within

  2. De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

    Science.gov (United States)

    Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

    2013-01-01

    Next generation sequencing has an advantage on transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out de novo sequencing using Illumina HiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using the Trinity program, generated 223,386 contigs and 128,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of the transcriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176

  3. Repeat-containing protein effectors of plant-associated organisms

    Directory of Open Access Journals (Sweden)

    Carl H. Mesarich

    2015-10-01

    Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors, however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regard to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.

  4. Analysis of tandem repeat units of the promoter of capsanthin/capsorubin synthase (Ccs) gene in pepper fruit.

    Science.gov (United States)

    Tian, Shi-Lin; Li, Zheng; Li, Li; Shah, S N M; Gong, Zhen-Hui

    2017-07-01

    The capsanthin/capsorubin synthase (Ccs) gene is a key gene that regulates the synthesis of capsanthin and the development of red coloration in pepper fruits. There are three tandem repeat units in the promoter region of Ccs, but the potential effects of the number of repetitive units on the transcriptional regulation of Ccs have been unclear. In the present study, expression vectors carrying different numbers of repeat units of the Ccs promoter were constructed, and the transient expression of the β-glucuronidase (GUS) gene was used to detect differences in expression levels associated with the promoter fragments. These repeat fragments and the plant expression vector PBI121 containing the CaMV 35S promoter were ligated to form recombinant vectors that were transfected into Agrobacterium tumefaciens GV3101. A fluorescence spectrophotometer was used to analyze the expression associated with the various repeat units. It was concluded that the constructs containing at least one repeat were associated with GUS expression, though they did not differ from one another. This repeating unit likely plays a role in transcription and regulation of Ccs expression.

  5. Multineuronal Spike Sequences Repeat with Millisecond Precision

    Directory of Open Access Journals (Sweden)

    Koki Matsumoto

    2013-06-01

    Cortical microcircuits are nonrandomly wired by neurons. As a natural consequence, spikes emitted by microcircuits are also nonrandomly patterned in time and space. One of the prominent spike organizations is a repetition of fixed patterns of spike series across multiple neurons. However, several questions remain unsolved, including how precisely spike sequences repeat, how the sequences are spatially organized, how many neurons participate in sequences, and how different sequences are functionally linked. To address these questions, we monitored spontaneous spikes of hippocampal CA3 neurons ex vivo using a high-speed functional multineuron calcium imaging technique that allowed us to monitor spikes with millisecond resolution and to record the location of spiking and nonspiking neurons. Multineuronal spike sequences were overrepresented in spontaneous activity compared to the statistical chance level. Approximately 75% of neurons participated in at least one sequence during our observation period. The participants were sparsely dispersed and did not show specific spatial organization. The number of sequences relative to the chance level decreased when larger time frames were used to detect sequences. Thus, sequences were precise at the millisecond level. Sequences often shared common spikes with other sequences; parts of sequences were subsequently relayed by following sequences, generating complex chains of multiple sequences.

  6. Mycobacterial Interspersed Repetitive-Unit–Variable-Number Tandem-Repeat (MIRU-VNTR) Genotyping of Mycobacterium intracellulare for Strain Comparison with Establishment of a PCR-Based Database

    Science.gov (United States)

    Iakhiaeva, Elena; McNulty, Steven; Brown Elliott, Barbara A.; Falkinham, Joseph O.; Williams, Myra D.; Vasireddy, Ravikiran; Wilson, Rebecca W.; Turenne, Christine

    2013-01-01

    Strain comparison is important to population genetics and to evaluate relapses in patients with Mycobacterium avium complex (MAC) lung disease, but the “gold standard” of pulsed-field gel electrophoresis (PFGE) is time-consuming and complex. We used variable-number tandem repeats (VNTR) for fingerprinting of respiratory isolates of M. intracellulare from patients with underlying bronchiectasis, to establish a nonsequence-based database for population analysis. Different genotypes identified by PFGE underwent species identification using a 16S rRNA gene multiplex PCR. Genotypes of M. intracellulare were confirmed by internal transcribed spacer 1 (ITS1) sequencing and characterized using seven VNTR primers. The pattern of VNTR amplicon sizes and repeat number defined each specific VNTR type. Forty-two VNTR types were identified among 84 genotypes. PFGE revealed most isolates with the same VNTR type to be clonal or exhibit similar grouping of bands. Repetitive sequence-based PCR (rep-PCR) showed minimal pattern diversity between VNTR types compared to PFGE. Fingerprinting of relapse isolates from 31 treated patients using VNTR combined with 16S multiplex PCR unambiguously and reliably distinguished different genotypes from the same patient, with results comparable to those of PFGE. VNTR for strain comparison is easier and faster than PFGE, is as accurate as PFGE, and does not require sequencing. Starting with a collection of 167 M. intracellulare isolates, VNTR distinguished M. intracellulare into 42 clonal groups. Comparison of isolates from different geographic areas, habitats, and clinical settings is now possible. PMID:23175249

  7. Mycobacterial interspersed repetitive-unit-variable-number tandem-repeat (MIRU-VNTR) genotyping of mycobacterium intracellulare for strain comparison with establishment of a PCR-based database.

    Science.gov (United States)

    Iakhiaeva, Elena; McNulty, Steven; Brown Elliott, Barbara A; Falkinham, Joseph O; Williams, Myra D; Vasireddy, Ravikiran; Wilson, Rebecca W; Turenne, Christine; Wallace, Richard J

    2013-02-01

    Strain comparison is important to population genetics and to evaluate relapses in patients with Mycobacterium avium complex (MAC) lung disease, but the "gold standard" of pulsed-field gel electrophoresis (PFGE) is time-consuming and complex. We used variable-number tandem repeats (VNTR) for fingerprinting of respiratory isolates of M. intracellulare from patients with underlying bronchiectasis, to establish a nonsequence-based database for population analysis. Different genotypes identified by PFGE underwent species identification using a 16S rRNA gene multiplex PCR. Genotypes of M. intracellulare were confirmed by internal transcribed spacer 1 (ITS1) sequencing and characterized using seven VNTR primers. The pattern of VNTR amplicon sizes and repeat number defined each specific VNTR type. Forty-two VNTR types were identified among 84 genotypes. PFGE revealed most isolates with the same VNTR type to be clonal or exhibit similar grouping of bands. Repetitive sequence-based PCR (rep-PCR) showed minimal pattern diversity between VNTR types compared to PFGE. Fingerprinting of relapse isolates from 31 treated patients using VNTR combined with 16S multiplex PCR unambiguously and reliably distinguished different genotypes from the same patient, with results comparable to those of PFGE. VNTR for strain comparison is easier and faster than PFGE, is as accurate as PFGE, and does not require sequencing. Starting with a collection of 167 M. intracellulare isolates, VNTR distinguished M. intracellulare into 42 clonal groups. Comparison of isolates from different geographic areas, habitats, and clinical settings is now possible.
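    The abstract above defines each VNTR type by the pattern of amplicon sizes and repeat numbers across loci. The usual arithmetic behind that conversion can be sketched as follows. This is a minimal illustration, not the authors' procedure: the flanking lengths, unit lengths and amplicon sizes are hypothetical values chosen for the example, and the function names are invented here.

```python
from typing import Sequence, Tuple


def repeat_count(amplicon_bp: int, flank_bp: int, unit_bp: int) -> float:
    """Estimate repeat copies as (amplicon - flanking sequence) / unit length."""
    if unit_bp <= 0:
        raise ValueError("unit_bp must be positive")
    return (amplicon_bp - flank_bp) / unit_bp


def vntr_profile(amplicons: Sequence[int],
                 loci: Sequence[Tuple[int, int]]) -> Tuple[int, ...]:
    """Build a per-locus repeat-number profile.

    loci holds one (flank_bp, unit_bp) pair per locus; these values are
    assumptions supplied by the caller, not taken from the cited study.
    """
    return tuple(round(repeat_count(a, f, u))
                 for a, (f, u) in zip(amplicons, loci))


# Hypothetical two-locus example.
example_loci = [(120, 53), (100, 57)]
example_profile = vntr_profile([279, 214], example_loci)
```

    Two isolates would then be assigned to the same type when their profiles are equal. Real assays also have to handle partial repeats and sizing error, which this sketch ignores by rounding.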

  8. Genome-scale portrait and evolutionary significance of human-specific core promoter tri- and tetranucleotide short tandem repeats.

    Science.gov (United States)

    Nazaripanah, N; Adelirad, F; Delbari, A; Sahaf, R; Abbasi-Asl, T; Ohadi, M

    2018-04-05

    While there is an ongoing trend to identify single nucleotide substitutions (SNSs) that are linked to inter/intra-species differences and disease phenotypes, short tandem repeats (STRs)/microsatellites may be of equal (if not more) importance in the above processes. Genes that contain STRs in their promoters have higher expression divergence compared to genes with fixed or no STRs in the gene promoters. In line with the above, recent reports indicate a role of repetitive sequences in the rise of young transcription start sites (TSSs) in human evolution. Following a comparative genomics study of all human protein-coding genes annotated in the GeneCards database, here we provide a genome-scale portrait of human-specific short- and medium-size (≥ 3-repeats) tri- and tetranucleotide STRs and STR motifs in the critical core promoter region between - 120 and + 1 to the TSS and evidence of skewing of this compartment in reference to the STRs that are not human-specific (Levene's test p human-specific transcripts was detected in the tri and tetra human-specific compartments (mid-p genome-scale skewing of STRs at a specific region of the human genome and a link between a number of these STRs and TSS selection/transcript specificity. The STRs and genes listed here may have a role in the evolution and development of characteristics and phenotypes that are unique to the human species.

  9. Comparison of a Variable-Number Tandem-Repeat (VNTR) Method for Typing Mycobacterium avium with Mycobacterial Interspersed Repetitive-Unit-VNTR and IS1245 Restriction Fragment Length Polymorphism Typing

    OpenAIRE

    Inagaki, Takayuki; Nishimori, Kei; Yagi, Tetsuya; Ichikawa, Kazuya; Moriyama, Makoto; Nakagawa, Taku; Shibayama, Takami; Uchiya, Kei-ichi; Nikai, Toshiaki; Ogawa, Kenji

    2009-01-01

    Mycobacterium avium complex (MAC) infections are increasing annually in various countries, including Japan, but the route of transmission and pathophysiology of the infection remain unclear. Currently, a variable-number tandem-repeat (VNTR) typing method using the Mycobacterium avium tandem repeat (MATR) loci (MATR-VNTR) is employed in Japan for epidemiological studies using clinical isolates of M. avium. In this study, the usefulness of this MATR-VNTR typing method was compared with that of ...

  10. Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis

    DEFF Research Database (Denmark)

    Nosek, J.; Novotna, M.; Hlavatovicova, Z.

    2004-01-01

    The complete sequence of the mitochondrial DNA of the opportunistic yeast pathogen Candida parapsilosis was determined. The mitochondrial genome is represented by linear DNA molecules terminating with tandem repeats of a 738-bp unit. The number of repeats varies, thus generating a population...

  11. Development of simple sequence repeat (SSR) markers that are ...

    African Journals Online (AJOL)

    Simple sequence repeats (SSRs) markers were developed through data mining of 3,803 expressed sequence tags (ESTs) previously published. A total of 144 di- to penta-type SSRs were identified and they were screened for polymorphism between two turnip cultivars, 'Tsuda' and 'Yurugi Akamaru'. Out of 90 EST-SSRs for ...

  12. Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

    Directory of Open Access Journals (Sweden)

    Charlotte Rehm

    In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.

  13. Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

    Science.gov (United States)

    Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

    2015-01-01

    In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
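    The GGGNATC heptamer and the reported range of two to 26 tandem units lend themselves to a simple pattern search. The sketch below is only an illustration of that idea, not the in silico method used in the study: it scans one strand of a sequence string with a regular expression, and the function name and the minimum of two units are assumptions made here.

```python
import re
from typing import List, Tuple

# One repeat unit: GGG, any base (N), then ATC -> 7 nt in total.
UNIT = "GGG[ACGT]ATC"
UNIT_LEN = 7
# Two or more directly adjacent units (assumed threshold).
TANDEM = re.compile(f"(?:{UNIT}){{2,}}")


def find_g_rich_repeats(seq: str) -> List[Tuple[int, int]]:
    """Return (start, unit_count) for each tandem GGGNATC run on this strand."""
    seq = seq.upper()
    return [(m.start(), len(m.group()) // UNIT_LEN)
            for m in TANDEM.finditer(seq)]


# Four units, the copy number the abstract reports as preferred.
demo = "TT" + "GGGAATC" + "GGGTATC" + "GGGCATC" + "GGGGATC" + "AA"
demo_hits = find_g_rich_repeats(demo)
```

    To cover the opposite strand, the same function could be run on the reverse complement of the sequence.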

  14. Large-scale studies of the HphI insulin gene variable-number-of-tandem-repeats polymorphism in relation to Type 2 diabetes mellitus and insulin release

    DEFF Research Database (Denmark)

    Hansen, S K; Gjesing, A P; Rasmussen, S K

    2004-01-01

    The class III allele of the variable-number-of-tandem-repeats polymorphism located 5' of the insulin gene (INS-VNTR) has been associated with Type 2 diabetes and altered birthweight. It has also been suggested, although inconsistently, that the class III allele plays a role in glucose-induced ins...

  15. Imported brucellosis in Denmark: Molecular identification and multiple-locus variable number tandem repeat analysis (MLVA) genotyping of the bacteria

    DEFF Research Database (Denmark)

    Aftab, H.; Dargis, R.; Christensen, J. J.

    2011-01-01

    A polymerase chain reaction was used to identify Brucella species isolated from humans in Denmark. Consecutive analysis of referred bacteria and re-examination of historical isolates identified all as Brucella melitensis. Multiple-locus variable number tandem repeat analysis (MLVA) placed the isolates in the previously defined 'East Mediterranean' B. melitensis group.

  16. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-11-16

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Use of multiple-locus variable-number tandem-repeats analysis (MLVA) typing to characterize Salmonella Typhimurium DT41 broiler breeder infections

    DEFF Research Database (Denmark)

    Litrup, E.; Christensen, H.; Nordentoft, Steen

    2010-01-01

    To characterize isolates of Salmonella Typhimurium DT41 obtained from infected flocks of broiler breeders by multiple-locus variable-number tandem-repeats analysis (MLVA) and compare results with a diverse strain collection from Germany and United Kingdom and isolates from Danish patients. A total...

  18. Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

    Science.gov (United States)

    Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

    2017-04-01

    There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

  19. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    Science.gov (United States)

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  20. Tandem Mass Spectrum Sequencing: An Alternative to Database Search Engines in Shotgun Proteomics.

    Science.gov (United States)

    Muth, Thilo; Rapp, Erdmann; Berven, Frode S; Barsnes, Harald; Vaudel, Marc

    2016-01-01

    Protein identification via database searches has become the gold standard in mass spectrometry based shotgun proteomics. However, as the quality of tandem mass spectra improves, direct mass spectrum sequencing gains interest as a database-independent alternative. In this chapter, the general principle of this so-called de novo sequencing is introduced along with pitfalls and challenges of the technique. The main tools available are presented with a focus on user friendly open source software which can be directly applied in everyday proteomic workflows.

  1. DNA fingerprinting of Shiga-toxin producing Escherichia coli O157 based on Multiple-Locus Variable-Number Tandem-Repeats Analysis (MLVA)

    Directory of Open Access Journals (Sweden)

    Vardund Traute

    2003-12-01

    Background: The ability to react early to possible outbreaks of Escherichia coli O157:H7 and to trace possible sources relies on the availability of highly discriminatory and reliable techniques. The development of methods that are fast and have the potential for complete automation is needed for this important pathogen. Methods: In all, 73 isolates of Shiga-toxin producing E. coli O157 (STEC) were used in this study. The two available fully sequenced STEC genomes were scanned for tandem repeated stretches of DNA, which were evaluated as polymorphic markers for isolate identification. Results: The 73 E. coli isolates displayed 47 distinct patterns and the MLVA assay was capable of high discrimination between the E. coli O157 strains. The assay was fast and all the steps can be automated. Conclusion: The findings demonstrate a novel, highly discriminatory molecular typing method for the important pathogen E. coli O157 that is fast, robust and offers many advantages compared to current methods.

  2. Simple sequence repeat marker loci discovery using SSR primer.

    Science.gov (United States)

    Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

    2004-06-12

    Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
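    The first stage of the pipeline described above, screening each sequence for SSRs, can be illustrated with a short regular-expression scan. This is a hedged sketch of the general idea only: it is not SPUTNIK's algorithm, it finds perfect repeats only, and the function name and the default thresholds (units of 2 to 5 nt, at least four copies) are assumptions chosen for the example.

```python
import re
from typing import List, Tuple


def find_ssrs(seq: str, min_unit: int = 2, max_unit: int = 5,
              min_repeats: int = 4) -> List[Tuple[int, str, int]]:
    """Return (start, unit, copies) for perfect tandem repeats in seq.

    The lazy quantifier makes the regex try the shortest unit first, so
    ACACACAC is reported with unit AC rather than ACAC.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        if len(set(unit)) == 1:
            # Skip homopolymer runs such as AAAAAAAA reported as unit AA.
            continue
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits


# A (CA)6 microsatellite embedded in short flanking sequence.
ssr_hits = find_ssrs("GGTCACACACACACATTG")
```

    In a pipeline like the one described, the start position and repeat length of each hit would then define the target region handed to a primer design step.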

  3. Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

    Science.gov (United States)

    Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

    2002-07-01

    Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not present on any known plasmids but are conserved in Salmonella and other enterobacterial species.

  4. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Directory of Open Access Journals (Sweden)

    Graner Andreas

    2008-10-01

    Background: Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR) index can be generated to map repetitive regions in genomic sequences. Results: We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation, though, allowed for the identification of dozens of novel repeat sequences which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC) clones. For half of the identified candidate gene islands, gene sequences could indeed be identified. MDR data were only of limited use when mapped on genomic sequences from the closely related species Triticum monococcum, as only a fraction of the repetitive sequences was recognised. Conclusion: An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing) sequence regions in uncharacterised genomic sequences. The restriction that a particular

  5. Computational study of the human dystrophin repeats: interaction properties and molecular dynamics.

    Directory of Open Access Journals (Sweden)

    Baptiste Legrand

    Full Text Available Dystrophin is a large protein involved in the rare genetic disease Duchenne muscular dystrophy (DMD). It functions as a mechanical linker between the cytoskeleton and the sarcolemma, and is able to resist shear stresses during muscle activity. In all, 75% of the dystrophin molecule consists of a large central rod domain made up of 24 repeat units that share high structural homology with spectrin-like repeats. However, in the absence of any high-resolution structure of these repeats, the molecular basis of dystrophin central domain's functions has not yet been deciphered. In this context, we have performed a computational study of the whole dystrophin central rod domain based on the rational homology modeling of successive and overlapping tandem repeats and the analysis of their surface properties. Each tandem repeat has very specific surface properties that make it unique. However, the repeats share enough electrostatic-surface similarities to be grouped into four separate clusters. Molecular dynamics simulations of four representative tandem repeats reveal specific flexibility or bending properties depending on the repeat sequence. We thus suggest that the dystrophin central rod domain is constituted of seven biologically relevant sub-domains. Our results provide evidence for the role of the dystrophin central rod domain as a scaffold platform with a wide range of surface features and biophysical properties allowing it to interact with its various known partners such as proteins and membrane lipids. This new integrative view is strongly supported by the previous experimental works that investigated the isolated domains and the observed heterogeneity of the severity of dystrophin related pathologies, especially Becker muscular dystrophy.

  6. SeqEntropy: genome-wide assessment of repeats for short read sequencing.

    Directory of Open Access Journals (Sweden)

    Hsueh-Ting Chu

    Full Text Available BACKGROUND: Recent studies on genome assembly from short-read sequencing data reported the limitation of this technology to reconstruct the entire genome even at very high depth coverage. We investigated the limitation from the perspective of information theory to evaluate the effect of repeats on short-read genome assembly using idealized (error-free) reads at different lengths. METHODOLOGY/PRINCIPAL FINDINGS: We define a metric H(k) to be the entropy of sequencing reads at a read length k and use the relative loss of entropy ΔH(k) to measure the impact of repeats for the reconstruction of whole-genome from sequences of length k. In our experiments, we found that entropy loss correlates well with de-novo assembly coverage of a genome, and a score of ΔH(k)>1% indicates a severe loss in genome reconstruction fidelity. The minimal read lengths to achieve ΔH(k)<1% are different for various organisms and are independent of the genome size. For example, in order to meet the threshold of ΔH(k)<1%, a read length of 60 bp is needed for the sequencing of human genome (3.2×10(9) bp) and 320 bp for the sequencing of fruit fly (1.8×10(8) bp). We also calculated the ΔH(k) scores for 2725 prokaryotic chromosomes and plasmids at several read lengths. Our results indicate that the levels of repeats in different genomes are diverse and the entropy of sequencing reads provides a measurement for the repeat structures. CONCLUSIONS/SIGNIFICANCE: The proposed entropy-based measurement, which can be calculated in seconds to minutes in most cases, provides a rapid quantitative evaluation on the limitation of idealized short-read genome sequencing. Moreover, the calculation can be parallelized to scale up to large eukaryotic genomes. This approach may be useful to tune the sequencing parameters to achieve better genome assemblies when a closely related genome is already available.
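
    The entropy idea in this abstract can be illustrated with a minimal sketch. It assumes that H(k) is the Shannon entropy of all error-free reads of length k taken from every start position of one strand, and that the relative loss is measured against the maximum log2(n) reached when all n reads are distinct. The abstract does not give the exact definitions used by SeqEntropy, so these choices and the function names below are assumptions for illustration only.

```python
import math
from collections import Counter


def read_entropy(genome: str, k: int) -> float:
    """Shannon entropy (in bits) of all length-k substrings of `genome`.

    Assumption: one idealized read per start position, single strand.
    """
    n = len(genome) - k + 1
    if n <= 0:
        raise ValueError("k must not exceed the genome length")
    counts = Counter(genome[i:i + k] for i in range(n))
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def relative_entropy_loss(genome: str, k: int) -> float:
    """Fraction of the maximum entropy log2(n) lost because reads repeat.

    Returns 0.0 when every read is unique (no repeats at this read length).
    """
    h = read_entropy(genome, k)  # also validates k
    n = len(genome) - k + 1
    h_max = math.log2(n)
    if h_max == 0:
        return 0.0
    return 1.0 - h / h_max
```

    On a toy periodic sequence such as "ACGTACGTACGT", the loss is positive at k = 2 and drops to zero once k is long enough to span the repeat, which mirrors the abstract's point that a minimal read length is needed to keep the loss under a threshold.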

  7. The association of 22 Y chromosome short tandem repeat loci with initiative-aggressive behavior.

    Science.gov (United States)

    Yang, Chun; Ba, Huajie; Zhang, Wei; Zhang, Shuyou; Zhao, Hanqing; Yu, Haiying; Gao, Zhiqin; Wang, Binbin

    2018-05-15

    Aggressive behavior represents an important public concern and a clinical challenge to behaviorists and psychiatrists. Aggression in humans is known to have an important genetic basis, so to investigate the association of Y chromosome short tandem repeat (Y-STR) loci with initiative-aggressive behavior, we compared allelic and haplotypic distributions of 22 Y-STRs in a group of Chinese males convicted of premeditated extremely violent crimes (n = 271) with a normal control group (n = 492). Allelic distributions of DYS533 and DYS437 loci differed significantly between the two groups (P initiative aggression in non-psychiatric subjects. Copyright © 2018 Elsevier B.V. All rights reserved.

  8. Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

    Science.gov (United States)

    Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

    2017-07-01

    PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.

  9. Identification of cis-regulatory sequences that activate transcription in the suspensor of plant embryos.

    Science.gov (United States)

    Kawashima, Tomokazu; Wang, Xingjun; Henry, Kelli F; Bi, Yuping; Weterings, Koen; Goldberg, Robert B

    2009-03-03

    Little is known about the molecular mechanisms by which the embryo proper and suspensor of plant embryos activate specific gene sets shortly after fertilization. We analyzed the upstream region of the scarlet runner bean (Phaseolus coccineus) G564 gene to understand how genes are activated specifically within the suspensor during early embryo development. Previously, we showed that the G564 upstream region has a block of tandem repeats, which contain a conserved 10-bp motif (GAAAAG(C)/(T)GAA), and that deletion of these repeats results in a loss of suspensor transcription. Here, we use gain-of-function (GOF) experiments with transgenic globular-stage tobacco embryos to show that only 1 of the 5 tandem repeats is required to drive suspensor-specific transcription. Fine-scale deletion and scanning mutagenesis experiments with 1 tandem repeat uncovered a 54-bp region that contains all of the sequences required to activate transcription in the suspensor, including the 10-bp motif (GAAAAGCGAA) and a similar 10-bp-like motif (GAAAAACGAA). Site-directed mutagenesis and GOF experiments indicated that both the 10-bp and 10-bp-like motifs are necessary, but not sufficient to activate transcription in the suspensor, and that a sequence (TTGGT) between the 10-bp and the 10-bp-like motifs is also necessary for suspensor transcription. Together, these data identify sequences that are required to activate transcription in the suspensor of a plant embryo after fertilization.
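
    The two motifs named in this abstract can be located in a sequence with a simple pattern scan. The sketch below is an illustration only: it reads the 10-bp motif GAAAAG(C)/(T)GAA as the pattern GAAAAG[CT]GAA, searches the forward strand only, and uses a hypothetical function name; it is not the authors' analysis pipeline.

```python
import re

# Lookahead patterns so that overlapping occurrences are also reported.
MOTIFS = {
    "10-bp": re.compile(r"(?=(GAAAAG[CT]GAA))"),
    "10-bp-like": re.compile(r"(?=(GAAAAACGAA))"),
}


def find_motifs(seq: str):
    """Return sorted (1-based start, motif name, matched text) tuples."""
    seq = seq.upper()
    hits = []
    for name, pattern in MOTIFS.items():
        for match in pattern.finditer(seq):
            hits.append((match.start() + 1, name, match.group(1)))
    return sorted(hits)
```

    A made-up test sequence containing both motifs separated by TTGGT would yield one hit per motif, with positions counted from 1.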

  10. simple sequence repeat (SSR) markers in genetic analysis of

    African Journals Online (AJOL)

    Yomi

    2012-08-28

    1998). Cross-species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol. Biol. Evol. 15:1275-1287.

  11. Multiple-locus variable-number tandem-repeat analysis of pathogenic Yersinia enterocolitica in China.

    Directory of Open Access Journals (Sweden)

    Xin Wang

    Full Text Available The predominant bioserotypes of pathogenic Yersinia enterocolitica in China are 2/O:9 and 3/O:3; no pathogenic O:8 strains have been found to date. Multiple-Locus Variable-Number Tandem-Repeat Analysis (MLVA) based on seven loci was able to distinguish 104 genotypes among 218 pathogenic Y. enterocolitica isolates in China and from abroad, showing a high resolution. The major pathogenic serogroups in China, O:3 and O:9, were divided into two clusters based on MLVA genotyping. The different distribution of Y. enterocolitica MLVA genotypes may be due to the recent dissemination of specific clones of 2/O:9 and 3/O:3 strains in China. MLVA was a helpful tool for bacterial pathogen surveillance and investigation of pathogenic Y. enterocolitica outbreaks.

  12. Unique CCT repeats mediate transcription of the TWIST1 gene in mesenchymal cell lines

    International Nuclear Information System (INIS)

    Ohkuma, Mizue; Funato, Noriko; Higashihori, Norihisa; Murakami, Masanori; Ohyama, Kimie; Nakamura, Masataka

    2007-01-01

    TWIST1, a basic helix-loop-helix transcription factor, plays critical roles in embryo development, cancer metastasis and mesenchymal progenitor differentiation. Little is known about transcriptional regulation of TWIST1 expression. Here we identified DNA sequences responsible for TWIST1 expression in mesenchymal lineage cell lines. Reporter assays with TWIST1 promoter mutants defined the -102 to -74 sequences that are essential for TWIST1 expression in human and mouse mesenchymal cell lines. Tandem repeats of CCT, but not putative CREB and NF-κB sites in the sequences substantially supported activity of the TWIST1 promoter. Electrophoretic mobility shift assay demonstrated that the DNA sequences with the CCT repeats formed complexes with nuclear factors, containing, at least, Sp1 and Sp3. These results suggest critical implication of the CCT repeats in association with Sp1 and Sp3 factors in sustaining expression of the TWIST1 gene in mesenchymal cells

  13. Comparative effectiveness of inter-simple sequence repeat and ...

    African Journals Online (AJOL)

    A study to compare the effectiveness of inter-simple sequence repeats (ISSR) and randomly amplified polymorphic DNA (RAPD) profiling was carried out with a total of 65 DNA samples using 12 species of Indian Garcinia. ISSR and RAPD profiling were performed with 19 and 12 primers, respectively. ISSR markers ...

  14. SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.

    Science.gov (United States)

    Anwar, Tamanna; Khan, Asad U

    2006-02-20

    Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report their exact positions. A computer program, SSRscanner, was written to report the distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search for repeats of any length and produces output with their exact positions on the chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
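
    The kind of search described here, reporting each SSR with its exact position, can be sketched with a regular expression. This is not the SSRscanner code (which is written in PERL and not shown in the abstract); the function name, the default unit length of 1 to 3 nucleotides and the default minimum of 6 copies are assumptions chosen for illustration.

```python
import re


def find_ssrs(seq: str, max_unit: int = 3, min_repeats: int = 6):
    """Return (start, end, unit, copies) for each perfect SSR tract.

    start and end are 1-based, inclusive positions in `seq`.
    The shortest repeating unit is preferred (lazy quantifier), so a run
    of A's is reported as a mononucleotide repeat, not as 'AA' or 'AAA'.
    """
    seq = seq.upper()
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_repeats - 1)
    )
    results = []
    for match in pattern.finditer(seq):
        unit = match.group(1)
        copies = len(match.group(0)) // len(unit)
        results.append((match.start() + 1, match.end(), unit, copies))
    return results
```

    Counting the returned tuples per unit gives the frequency of each repeat type, and the start and end fields give the exact location that the abstract says many SSR finders omit.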

  15. MNS16A tandem repeat minisatellite of human telomerase gene: functional studies in colorectal, lung and prostate cancer.

    Science.gov (United States)

    Hofer, Philipp; Zöchmeister, Cornelia; Behm, Christian; Brezina, Stefanie; Baierl, Andreas; Doriguzzi, Angelina; Vanas, Vanita; Holzmann, Klaus; Sutterlüty-Fall, Hedwig; Gsur, Andrea

    2017-04-25

    MNS16A, a functional polymorphic tandem repeat minisatellite, is located in the promoter region of an antisense transcript of the human telomerase reverse transcriptase gene. MNS16A promoter activity depends on the variable number of tandem repeats (VNTR) presenting varying numbers of transcription factor binding sites for GATA binding protein 1. Although MNS16A has been investigated in multiple cancer epidemiology studies with incongruent findings, functional data of only two VNTRs (VNTR-243 and VNTR-302) were available thus far, linking the shorter VNTR to higher promoter activity. For the first time, we investigated promoter activity of all six VNTRs of MNS16A in cell lines of colorectal, lung and prostate cancer using Luciferase reporter assay. In all investigated cell lines shorter VNTRs showed higher promoter activity. While this anticipated indirect linear relationship was affirmed for colorectal cancer SW480 (P = 0.006), a piecewise linear regression model provided significantly better model fit in lung cancer A-427 (P = 6.9 × 10(-9)) and prostate cancer LNCaP (P = 0.039). In silico search for transcription factor binding sites in MNS16A core repeat element suggested a higher degree of complexity involving X-box binding protein 1, general transcription factor II-I, and glucocorticoid receptor alpha in addition to GATA binding protein 1. Further functional studies in additional cancers are requested to extend our knowledge of MNS16A functionality uncovering potential cancer type-specific differences. Risk alleles may vary in different malignancies and their determination in vitro could be relevant for interpretation of genotype data.

  16. Thermodynamic characterization of tandem mismatches found in naturally occurring RNA

    Science.gov (United States)

    Christiansen, Martha E.; Znosko, Brent M.

    2009-01-01

    Although all sequence symmetric tandem mismatches and some sequence asymmetric tandem mismatches have been thermodynamically characterized and a model has been proposed to predict the stability of previously unmeasured sequence asymmetric tandem mismatches [Christiansen,M.E. and Znosko,B.M. (2008) Biochemistry, 47, 4329–4336], experimental thermodynamic data for frequently occurring tandem mismatches is lacking. Since experimental data is preferred over a predictive model, the thermodynamic parameters for 25 frequently occurring tandem mismatches were determined. These new experimental values, on average, are 1.0 kcal/mol different from the values predicted for these mismatches using the previous model. The data for the sequence asymmetric tandem mismatches reported here were then combined with the data for 72 sequence asymmetric tandem mismatches that were published previously, and the parameters used to predict the thermodynamics of previously unmeasured sequence asymmetric tandem mismatches were updated. The average absolute difference between the measured values and the values predicted using these updated parameters is 0.5 kcal/mol. This updated model improves the prediction for tandem mismatches that were predicted rather poorly by the previous model. This new experimental data and updated predictive model allow for more accurate calculations of the free energy of RNA duplexes containing tandem mismatches, and, furthermore, should allow for improved prediction of secondary structure from sequence. PMID:19509311

  17. Simple sequence repeat (SSR)-based genetic variability among ...

    African Journals Online (AJOL)

    The objective of this study was to determine whether simple sequence repeat (SSR) markers could correctly identify peanut genotypes differing in specific leaf weight (SLW) and relative water content (RWC). Four peanut genotypes and two water regimes (FC and 1/3 available water; 1/3 AW) were arranged in factorial ...

  18. Optimization of Standard In-House 24-Locus Variable-Number Tandem-Repeat Typing for Mycobacterium tuberculosis and Its Direct Application to Clinical Material

    NARCIS (Netherlands)

    de Beer, Jessica L.; Akkerman, Onno W.; Schurch, Anita C.; Mulder, Arnout; van der Werf, Tjip S.; van der Zanden, Adri G. M.; van Ingen, Jakko; van Soolingen, Dick

    Variable-number tandem-repeat (VNTR) typing with a panel of 24 loci is the current gold standard in the molecular typing of Mycobacterium tuberculosis complex isolates. However, because of technical problems, a part of the loci often cannot be amplified by multiplex PCRs. Therefore, a considerable

  19. Spectrum of Phenylalanine Hydroxylase Gene Mutations in Hamadan and Lorestan Provinces of Iran and Their Associations with Variable Number of Tandem Repeat Alleles.

    Science.gov (United States)

    Alibakhshi, Reza; Moradi, Keivan; Biglari, Mostafa; Shafieenia, Samaneh

    2018-05-01

    Phenylketonuria (PKU) is one of the most common known inherited metabolic diseases. The present study aimed to investigate the status of molecular defects in the phenylalanine hydroxylase (PAH) gene in western Iranian PKU patients (predominantly from Kermanshah, Hamadan, and Lorestan provinces) during 2014-2016. Additionally, the results were compared with similar studies in Iran. Nucleotide sequence analysis of all 13 exons and their flanking intronic regions of the PAH gene was performed in 18 western Iranian PKU patients. Moreover, a variable number of tandem repeat (VNTR) located in the PAH gene was studied. The results revealed a mutational spectrum encompassing 11 distinct mutations distributed along the PAH gene sequence on 34 of the 36 mutant alleles (diagnostic efficiency of 94.4%). Also, four PAH VNTR alleles (with repeats of 3, 7, 8 and 9) were detected. The three most frequent mutations were IVS9+5G>A, IVS7-5T>C, and p.P281L with the frequency of 27.8%, 11%, and 11%, respectively. The results showed that there is not only a consanguineous relation, but also a difference in the characteristics of PAH mutations between Kermanshah and the other two parts of western Iran (Hamadan and Lorestan). Also, it seems that the spectrum of mutations in western Iran is relatively distinct from other parts of the country, suggesting that this region might be a special PAH gene distribution region. Moreover, our findings can be useful in the identification of genotype to phenotype relationship in patients, and may support future confirmatory diagnostic testing, prognosis, and prediction of disease severity in PKU patients.

  20. Alu repeats as markers for forensic DNA analyses

    Energy Technology Data Exchange (ETDEWEB)

    Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Kass, D.H. [Louisiana State Univ., New Orleans, LA (United States)] [and others

    1994-01-01

    The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 98.9% nucleotide identity with the HS subfamily consensus sequence, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5' and 3' unique flanking DNA sequences from each HS Alu that allow the locus to be assayed for the presence or absence of the Alu repeat. The dimorphic HS Alu sequences probably inserted in the human genome after the radiation of modern humans (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project. HS Alu family member insertions differ from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) in that polymorphisms due to Alu insertions arise as a result of a unique event which has occurred only one time in the human population and spread through the population from that point. Therefore, individuals that share HS Alu repeats inherited these elements from a common ancestor. Most VNTR and RFLP polymorphisms may arise multiple times in parallel within a population.

  1. Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

    Science.gov (United States)

    Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

    2006-12-01

    Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.

  2. Complete Chloroplast Genome Sequence of Tartary Buckwheat (Fagopyrum tataricum) and Comparative Analysis with Common Buckwheat (F. esculentum).

    Directory of Open Access Journals (Sweden)

    Kwang-Soo Cho

    Full Text Available We report the chloroplast (cp) genome sequence of tartary buckwheat (Fagopyrum tataricum) obtained by next-generation sequencing technology and compared this with the previously reported common buckwheat (F. esculentum ssp. ancestrale) cp genome. The cp genome of F. tataricum has a total sequence length of 159,272 bp, which is 327 bp shorter than the common buckwheat cp genome. The cp gene content, order, and orientation are similar to those of common buckwheat, but with some structural variation at tandem and palindromic repeat frequencies and junction areas. A total of seven InDels (around 100 bp) were found within the intergenic sequences and the ycf1 gene. Copy number variation of the 21-bp tandem repeat varied in F. tataricum (four repeats) and F. esculentum (one repeat), and the InDel of the ycf1 gene was 63 bp long. Nucleotide and amino acid have highly conserved coding sequence with about 98% homology and four genes--rpoC2, ycf3, accD, and clpP--have high synonymous (Ks) value. PCR based InDel markers were applied to diverse genetic resources of F. tataricum and F. esculentum, and the amplicon size was identical to that expected in silico. Therefore, these InDel markers are informative biomarkers to practically distinguish raw or processed buckwheat products derived from F. tataricum and F. esculentum.

  3. Evaluation of 13 short tandem repeated loci for use in personal identification applications

    Energy Technology Data Exchange (ETDEWEB)

    Hammond, H.A.; Caskey, C.T. (Baylor College of Medicine, Houston, TX (United States)); Jin, L.; Zhong, Y.; Chakraborty, R. (Univ. of Texas Graduate School of Biomedical Sciences, Houston, TX (United States))

    1994-07-01

    Personal identification by using DNA typing methodologies has been an issue in the popular and scientific press for several years. The authors present a PCR-based DNA-typing method using 13 unlinked short tandem repeat (STR) loci. Validation of the loci and methodology has been performed to meet standards set by the forensic community and the accrediting organization for parentage testing. Extensive statistical analysis has addressed the issues surrounding the presentation of "match" statistics. The authors have found STR loci to provide a rapid, sensitive, and reliable method of DNA typing for parentage testing, forensic identification, and medical diagnostics. Valid statistical analysis is generally simpler than similar analysis of RFLP-VNTR results and provides powerful statistical evidence of the low frequency of random multilocus genotype matching. 54 refs., 4 figs., 6 tabs.

  4. Revisiting the TALE repeat.

    Science.gov (United States)

    Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

    2014-04-01

    Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.

  5. Limitations of variable number of tandem repeat typing identified through whole genome sequencing of Mycobacterium avium subsp. paratuberculosis on a national and herd level.

    Science.gov (United States)

    Ahlstrom, Christina; Barkema, Herman W; Stevenson, Karen; Zadoks, Ruth N; Biek, Roman; Kao, Rowland; Trewby, Hannah; Haupstein, Deb; Kelton, David F; Fecteau, Gilles; Labrecque, Olivia; Keefe, Greg P; McKenna, Shawn L B; De Buck, Jeroen

    2015-03-08

    Mycobacterium avium subsp. paratuberculosis (MAP), the causative bacterium of Johne's disease in dairy cattle, is widespread in the Canadian dairy industry and has significant economic and animal welfare implications. An understanding of the population dynamics of MAP can be used to identify introduction events, improve control efforts and target transmission pathways, although this requires an adequate understanding of MAP diversity and distribution between herds and across the country. Whole genome sequencing (WGS) offers a detailed assessment of the SNP-level diversity and genetic relationship of isolates, whereas several molecular typing techniques used to investigate the molecular epidemiology of MAP, such as variable number of tandem repeat (VNTR) typing, target relatively unstable repetitive elements in the genome that may be too unpredictable to draw accurate conclusions. The objective of this study was to evaluate the diversity of bovine MAP isolates in Canadian dairy herds using WGS and then determine if VNTR typing can distinguish truly related and unrelated isolates. Phylogenetic analysis based on 3,039 SNPs identified through WGS of 124 MAP isolates identified eight genetically distinct subtypes in dairy herds from seven Canadian provinces, with the dominant type including over 80% of MAP isolates. VNTR typing of 527 MAP isolates identified 12 types, including "bison type" isolates, from seven different herds. At a national level, MAP isolates differed from each other by 1-2 to 239-240 SNPs, regardless of whether they belonged to the same or different VNTR types. A herd-level analysis of MAP isolates demonstrated that VNTR typing may both over-estimate and under-estimate the relatedness of MAP isolates found within a single herd. The presence of multiple MAP subtypes in Canada suggests multiple introductions into the country including what has now become one dominant type, an important finding for Johne's disease control. VNTR typing often failed to

  6. The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

    Science.gov (United States)

    Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

    2014-04-01

    Analyzing genome structure in different species provides insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as an oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.

  7. DeNovoGUI: an open source graphical user interface for de novo sequencing of tandem mass spectra.

    Science.gov (United States)

    Muth, Thilo; Weilnböck, Lisa; Rapp, Erdmann; Huber, Christian G; Martens, Lennart; Vaudel, Marc; Barsnes, Harald

    2014-02-07

    De novo sequencing is a popular technique in proteomics for identifying peptides from tandem mass spectra without having to rely on a protein sequence database. Despite the strong potential of de novo sequencing algorithms, their adoption threshold remains quite high. We here present a user-friendly and lightweight graphical user interface called DeNovoGUI for running parallelized versions of the freely available de novo sequencing software PepNovo+, greatly simplifying the use of de novo sequencing in proteomics. Our platform-independent software is freely available under the permissible Apache2 open source license. Source code, binaries, and additional documentation are available at http://denovogui.googlecode.com .

  8. MNS16A tandem repeats minisatellite of human telomerase gene: a risk factor for colorectal cancer.

    Science.gov (United States)

    Hofer, Philipp; Baierl, Andreas; Feik, Elisabeth; Führlinger, Gerhard; Leeb, Gernot; Mach, Karl; Holzmann, Klaus; Micksche, Michael; Gsur, Andrea

    2011-06-01

    Telomerase reactivation and expression of human telomerase gene [human telomerase reverse transcriptase (hTERT)] are hallmarks of unlimited proliferation potential of cancer cells. A polymorphic tandem repeats minisatellite of the hTERT gene, termed MNS16A, was reported to influence hTERT expression. To assess the role of MNS16A as a potential biomarker for colorectal cancer (CRC), we investigated for the first time the association of MNS16A genotypes with risk of colorectal polyps and CRC. In the ongoing colorectal cancer study of Austria (CORSA), 3842 Caucasian participants were recruited within a large screening project in the province Burgenland including 90 CRC cases, 308 high-risk polyps, 1022 low-risk polyps and 1822 polyp free controls verified by colonoscopy. MNS16A genotypes were determined by polymerase chain reaction from genomic DNA. Associations of MNS16A genotypes with CRC risk were estimated by logistic regression analysis computing odds ratios (ORs) and 95% confidence intervals (CIs). We identified five different variable number of tandem repeats (VNTRs) of MNS16A including VNTR-364, a newly discovered rare variant. The VNTR-274 allele was associated with a 2.7-fold significantly increased risk of CRC compared with the VNTR-302 wild-type (OR = 2.69; 95% CI = 1.11-6.50; P = 0.028). In our CORSA study, the medium length VNTR-274 was identified as a risk factor for CRC. Although this population-based study reports the largest cohort size concerning MNS16A thus far, further large-scale studies in diverse populations are warranted to confirm the hTERT MNS16A genotype as a potential biomarker for assessment of CRC risk.
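
    The reported odds ratio, confidence interval and P value can be cross-checked against each other. The sketch below assumes the interval is a symmetric Wald interval on the log-odds scale (the usual output of logistic regression, but not stated in the abstract); the function name is hypothetical. It recovers the standard error of log(OR) from the 95% CI and derives the two-sided normal P value.

```python
import math

def wald_p_from_or(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Return (se, z, p) assuming a symmetric Wald CI on the log-odds scale."""
    # Width of the CI on the log scale equals 2 * z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided tail probability of the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p

# Values taken from the abstract: OR = 2.69, 95% CI = 1.11-6.50.
se, z, p = wald_p_from_or(2.69, 1.11, 6.50)
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, p = {p:.3f}")
```

    Under that assumption the derived P value rounds to the reported 0.028, so the three published figures are mutually consistent.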

  9. A Further Analysis of the Relationship between Yellow Ripe-Fruit Color and the Capsanthin-Capsorubin Synthase Gene in Pepper (Capsicum sp.) Indicated a New Mutant Variant in C. annuum and a Tandem Repeat Structure in Promoter Region

    Science.gov (United States)

    Gui, Xiao-Ling; Chang, Xiao-Bei; Gong, Zhen-Hui

    2013-01-01

    Mature pepper (Capsicum sp.) fruits come in a variety of colors, including red, orange, yellow, brown, and white. To better understand the genetic and regulatory relationships between the yellow fruit phenotype and the capsanthin-capsorubin synthase gene (Ccs), we examined 156 Capsicum varieties, most of which were collected from Northwest Chinese landraces. A new ccs variant was identified in the yellow fruit cultivar CK7. Cluster analysis revealed that CK7, which belongs to the C. annuum species, has low genetic similarity to other yellow C. annuum varieties. In the coding sequence of this ccs allele, we detected a premature stop codon derived from a C to G change, as well as a downstream frame-shift caused by a 1-bp nucleotide deletion. In addition, the expression of the gene was detected in mature CK7 fruit. Furthermore, the promoter sequences of Ccs from some pepper varieties were examined, and we detected a 176-bp tandem repeat sequence in the promoter region. In all C. annuum varieties examined in this study, the repeat number was three, compared with four in two C. chinense accessions. The sequence similarity ranged from 84.8% to 97.7% among the four types of repeats, and some putative cis-elements were also found in every repeat. This suggests that the transcriptional regulation of Ccs expression is complex. Based on the analysis of the novel C. annuum mutation reported here, along with the studies of three mutation types in yellow C. annuum and C. chinense accessions, we suggest that the mechanism leading to the production of yellow color fruit may be not as complex as that leading to orange fruit production. PMID:23637942

  10. Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

    KAUST Repository

    Cahill, Matt J.

    2010-07-12

    Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
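
    The abstract does not spell out its repeat model, so the following is only an illustrative sketch, not the authors' technique. It assumes a deliberately crude rule: a repeat cannot be resolved when an identical stretch at least as long as the read occurs more than once. The function name and the toy sequence are hypothetical.

```python
from collections import Counter

def repeated_kmers(genome: str, read_len: int) -> set[str]:
    """Return every substring of length read_len that occurs more than once."""
    counts = Counter(
        genome[i:i + read_len] for i in range(len(genome) - read_len + 1)
    )
    return {kmer for kmer, n in counts.items() if n > 1}

# Toy sequence containing one 7-nt repeat (GATTACA) with differing flanks.
toy = "ACGTTGCA" + "GATTACA" + "CCGGAATT" + "GATTACA" + "TTGACCAG"

for read_len in (5, 7, 8):
    print(read_len, sorted(repeated_kmers(toy, read_len)))
```

    In this toy case reads of 7 nt still see the repeat as ambiguous, whereas reads of 8 nt reach into the unique flanking sequence, which mirrors the abstract's point that the required read length depends on the repeat content of each genome.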

  11. Read length and repeat resolution: exploring prokaryote genomes using next-generation sequencing technologies.

    Directory of Open Access Journals (Sweden)

    Matt J Cahill

    BACKGROUND: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. METHODOLOGY/PRINCIPAL FINDINGS: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. CONCLUSIONS: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length.

  12. Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

    KAUST Repository

    Cahill, Matt J.; Köser, Claudio U.; Ross, Nicholas E.; Archer, John A.C.

    2010-01-01

    Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.

  13. Spectrum of Phenylalanine Hydroxylase Gene Mutations in Hamadan and Lorestan Provinces of Iran and Their Associations with Variable Number of Tandem Repeat Alleles

    Directory of Open Access Journals (Sweden)

    Reza Alibakhshi

    2018-05-01

    Phenylketonuria (PKU) is one of the most common known inherited metabolic diseases. The present study aimed to investigate the status of molecular defects in the phenylalanine hydroxylase (PAH) gene in western Iranian PKU patients (predominantly from Kermanshah, Hamadan, and Lorestan provinces) during 2014-2016. Additionally, the results were compared with similar studies in Iran. Nucleotide sequence analysis of all 13 exons and their flanking intronic regions of the PAH gene was performed in 18 western Iranian PKU patients. Moreover, a variable number of tandem repeat (VNTR) located in the PAH gene was studied. The results revealed a mutational spectrum encompassing 11 distinct mutations distributed along the PAH gene sequence on 34 of the 36 mutant alleles (diagnostic efficiency of 94.4%). Also, four PAH VNTR alleles (with repeats of 3, 7, 8 and 9) were detected. The three most frequent mutations were IVS9+5G>A, IVS7-5T>C, and p.P281L with the frequency of 27.8%, 11%, and 11%, respectively. The results showed that there is not only a consanguineous relation, but also a difference in PAH characters of mutations between Kermanshah and the other two parts of western Iran (Hamadan and Lorestan). Also, it seems that the spectrum of mutations in western Iran is relatively distinct from other parts of the country, suggesting that this region might be a special PAH gene distribution region. Moreover, our findings can be useful in the identification of genotype to phenotype relationships in patients, and can support future confirmatory diagnostic testing, prognosis, and prediction of PKU severity.

  14. Glycoprotein I of herpes simplex virus type 1 contains a unique polymorphic tandem-repeated mucin region

    DEFF Research Database (Denmark)

    Norberg, Peter; Olofsson, Sigvard; Tarp, Mads Agervig

    2007-01-01

    Glycoprotein I (gI) of herpes simplex virus type 1 (HSV-1) contains a tandem repeat (TR) region including the amino acids serine and threonine, residues that can be utilized for O-glycosylation. The length of this TR region was determined for 82 clinical HSV-1 isolates and the results revealed......-glycosylation not only for the two most commonly expressed N-acetyl-d-galactosamine (GalNAc)-T1 and -T2 transferases, but also for the GalNAc-T3, -T4 and -T11 transferases. Immunoblotting of virus-infected cells showed that gI was exclusively O-glycosylated with GalNAc monosaccharides (Tn antigen). A polymorphic mucin...

  15. Genetic diversity of Neisseria meningitidis serogroup C ST-4821 in China based on multiple-locus variable number tandem repeat analysis.

    Directory of Open Access Journals (Sweden)

    Xiaoying Shan

    Neisseria meningitidis sequence type (ST)-4821 was first reported in China in 2003, and a new hyper-virulent lineage has been designated as the ST-4821 complex. A large number of N. meningitidis ST-4821 strains have been identified in China since 2003; however, the microevolution characteristics of this complex are unclear. Different combinations of variable number of tandem repeats (VNTR) loci were used in multiple-locus VNTR analysis (MLVA) to analyze 118 N. meningitidis serogroup C ST-4821 strains isolated from seventeen provinces between 2003 and 2012. Additionally, MLVA with five VNTR loci was performed due to its high discriminatory power. One hundred and eighteen isolates were found to comprise 112 subtypes based on MLVA, and 16 outbreak-associated strains were clustered into one group. These data indicate a high level of diversity for N. meningitidis ST-4821 due to microevolution in the last decade. In addition, the results revealed high similarity between isolates from the same geographic origins, which is helpful when monitoring the spread of N. meningitidis serogroup C ST-4821 and will provide valuable information for the control and prevention of bacterial meningitis in China.

  16. [Usefulness of the variable numbers of tandem repeats (VNTR) analysis for complex infections of Mycobacterium avium and Mycobacterium intracellulare].

    Science.gov (United States)

    Tsunematsu, Noriko; Goto, Mieko; Saiki, Yumiko; Baba, Michiko; Udagawa, Tadashi; Kazumi, Yuko

    2008-09-01

    Bacilli isolated from a patient with suspected mixed infection by Mycobacterium avium and Mycobacterium intracellulare were analyzed. The genotypes of M. avium in the sedimented fractions of treated sputum and in some colonies isolated from Ogawa medium were compared by Variable Numbers of Tandem Repeats (VNTR) analysis. The patient was a 57-year-old woman. Mycobacterial species isolated from colonies cultured in 2004 and 2006 and from the treated sputum in 2006 were determined by DNA sequencing of the 16S rRNA gene, and the genotypes of the mycobacteria were analyzed by VNTR. [Results] (1) The colony isolated from Ogawa medium in 2004 was monoclonal M. avium. (2) By VNTR analyses of specimens in 2006, multiple acid-fast bacteria were found in the sputum sediment and in isolated bacteria from Ogawa medium. (3) By analysis of the 16S rRNA gene sequence, M. avium and M. intracellulare were found in the colonies isolated from the sputum sediment and the Ogawa medium in 2006. (4) The same VNTR patterns were obtained for M. avium in 2004 and 2006 when a single colony was analyzed. (5) M. avium was not detected in the showerhead or the culvert of the bathroom in the patient's house. The VNTR analyses suggested that the mixed infection with M. avium and M. intracellulare had arisen during treatment in this case. Therefore, in cases of suspected complex infection, VNTR analysis would be a useful genotyping method for M. avium complex infection.

  17. Repeat Sequence Proteins as Matrices for Nanocomposites

    Energy Technology Data Exchange (ETDEWEB)

    Drummy, L.; Koerner, H; Phillips, D; McAuliffe, J; Kumar, M; Farmer, B; Vaia, R; Naik, R

    2009-01-01

    Recombinant protein-inorganic nanocomposites comprised of exfoliated Na+ montmorillonite (MMT) in a recombinant protein matrix based on silk-like and elastin-like amino acid motifs (silk elastin-like protein (SELP)) were formed via a solution blending process. Charged residues along the protein backbone are shown to dominate long-range interactions, whereas the SELP repeat sequence leads to local protein/MMT compatibility. Up to a 50% increase in room temperature modulus and a comparable decrease in high temperature coefficient of thermal expansion occur for cast films containing 2-10 wt.% MMT.

  18. The soybean-Phytophthora resistance locus Rps1-k encompasses coiled coil-nucleotide binding-leucine rich repeat-like genes and repetitive sequences

    Directory of Open Access Journals (Sweden)

    Bhattacharyya Madan K

    2008-03-01

    Background: A series of Rps (resistance to Phytophthora sojae) genes have been protecting soybean from the root and stem rot disease caused by the Oomycete pathogen, Phytophthora sojae. Five Rps genes were mapped to the Rps1 locus located near the 28 cM map position on molecular linkage group N of the composite genetic soybean map. Among these five genes, Rps1-k was introgressed from the cultivar, Kingwa. Rps1-k has been providing stable and broad-spectrum Phytophthora resistance in the major soybean-producing regions of the United States. Rps1-k has been mapped and isolated. More than one functional Rps1-k gene was identified from the Rps1-k locus. The clustering feature at the Rps1-k locus might have facilitated the expansion of Rps1-k gene numbers and the generation of new recognition specificities. The Rps1-k region was sequenced to understand the possible evolutionary steps that shaped the generation of Phytophthora resistance genes in soybean. Results: Here the analyses of sequences of three overlapping BAC clones containing the 184,111 bp Rps1-k region are reported. A shotgun sequencing strategy was applied in sequencing the BAC contig. Sequence analysis predicted a few full-length genes including two Rps1-k genes, Rps1-k-1 and Rps1-k-2. Previously reported Rps1-k-3 from this genomic region [1] was evolved through intramolecular recombination between Rps1-k-1 and Rps1-k-2 in Escherichia coli. The majority of the predicted genes are truncated and therefore most likely they are nonfunctional. A member of a highly abundant retroelement, SIRE1, was identified from the Rps1-k region. The Rps1-k region is primarily composed of repetitive sequences. Sixteen simple repeat and 63 tandem repeat sequences were identified from the locus. Conclusion: These data indicate that the Rps1 locus is located in a gene-poor region. The abundance of repetitive sequences in the Rps1-k region suggested that the location of this locus is in or near a

  19. Transferability of short tandem repeat markers for two wild Canid species inhabiting the Brazilian Cerrado.

    Science.gov (United States)

    Rodrigues, F M; Telles, M P C; Resende, L V; Soares, T N; Diniz-Filho, J A F; Jácomo, A T A; Silveira, L

    2006-12-13

    The maned wolf (Chrysocyon brachyurus) and the crab-eating fox (Cerdocyon thous) are two wild-canid species found in the Brazilian Cerrado. We tested cross-amplification and transferability of 29 short tandem repeat primers originally developed for cattle and domestic dogs and cats on 38 individuals of each of these two species, collected in the Emas National Park, which is the largest national park in the Cerrado region. Six of these primers were successfully transferred (CSSM-038, PEZ-05, PEZ-12, LOCO-13, LOCO-15, and PEZ-20); five of which were found to be polymorphic. Genetic parameter values (number of alleles per locus, observed and expected heterozygosities, and fixation indices) were within the expected range reported for canid populations worldwide.
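
    The genetic parameters named in the abstract can be computed per locus as sketched below. This is a minimal illustration using the textbook definitions (expected heterozygosity as one minus the sum of squared allele frequencies, without small-sample correction, and a fixation index of 1 - Ho/He); the abstract does not say which estimators or software were used, and the genotypes shown are made-up allele sizes.

```python
from collections import Counter

def locus_stats(genotypes):
    """genotypes: list of (allele_a, allele_b) tuples for one diploid locus.

    Returns (number_of_alleles, observed_het, expected_het, fixation_index).
    """
    n = len(genotypes)
    allele_counts = Counter(allele for pair in genotypes for allele in pair)
    freqs = [count / (2 * n) for count in allele_counts.values()]
    h_obs = sum(1 for a, b in genotypes if a != b) / n
    h_exp = 1.0 - sum(p * p for p in freqs)
    # A monomorphic locus has h_exp == 0, so the index is undefined there.
    f_index = 1.0 - h_obs / h_exp if h_exp > 0 else float("nan")
    return len(freqs), h_obs, h_exp, f_index

# Hypothetical genotypes (allele sizes in bp) for four individuals.
sample = [(120, 120), (120, 124), (124, 124), (120, 124)]
print(locus_stats(sample))
```

    A fixation index near zero, as in this toy sample, indicates genotype proportions close to Hardy-Weinberg expectations; positive values indicate a heterozygote deficit.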

  20. Variable-number-of-tandem-repeats analysis of genetic diversity in Pasteuria ramosa.

    Science.gov (United States)

    Mouton, L; Ebert, D

    2008-05-01

    Variable-number-of-tandem-repeats (VNTR) markers are increasingly being used in population genetic studies of bacteria. They were recently developed for Pasteuria ramosa, an endobacterium that infects Daphnia species. In the present study, we genotyped P. ramosa in 18 infected hosts from the United Kingdom, Belgium, and two lakes in the United States using seven VNTR markers. Two Daphnia species were collected: D. magna and D. dentifera. Six loci showed length polymorphism, with as many as five alleles identified for a single locus. Similarity coefficient calculations showed that the extent of genetic variation between pairs of isolates within populations differed according to the population, but it was always less than the genetic distances among populations. Analysis of the genetic distances performed using principal component analysis revealed strong clustering by location of origin, but not by host Daphnia species. Our study demonstrated that the VNTR markers available for P. ramosa are informative in revealing genetic differences within and among populations and may therefore become an important tool for providing detailed analysis of population genetics and epidemiology.

  1. Linking Y-chromosomal short tandem repeat loci to human male impulsive aggression.

    Science.gov (United States)

    Yang, Chun; Ba, Huajie; Cao, Yin; Dong, Guoying; Zhang, Shuyou; Gao, Zhiqin; Zhao, Hanqing; Zhou, Xianju

    2017-11-01

    Men are more susceptible to impulsive behavior than women. Epidemiological studies revealed that the impulsive aggressive behavior is affected by genetic factors, and the male-specific Y chromosome plays an important role in this behavior. In this study, we investigated the association between the impulsive aggressive behavior and Y-chromosomal short tandem repeats (Y-STRs) loci. The collected biologic samples from 271 offenders with impulsive aggressive behavior and 492 healthy individuals without impulsive aggressive behavior were amplified by PowerPlex® Y23 PCR System and the resultant products were separated by electrophoresis and further genotyped. Then, comparisons in allele and haplotype frequencies of the selected 22 Y-STRs were made in the two groups. Our results showed that there were significant differences in allele frequencies at DYS448 and DYS456 between offenders and controls ( p  impulsive aggression. However, the DYS448-DYS456-22-15 is less related to impulsive aggression. Our results suggest a link between Y-chromosomal allele types and male impulsive aggression.

  2. Evolutionary history of the PER3 variable number of tandem repeats (VNTR): idiosyncratic aspect of primate molecular circadian clock.

    Science.gov (United States)

    Sabino, Flávia Cal; Ribeiro, Amanda Oliveira; Tufik, Sérgio; Torres, Laila Brito; Oliveira, José Américo; Mello, Luiz Eugênio Araújo Moraes; Cavalcante, Jeferson Souza; Pedrazzoli, Mario

    2014-01-01

    The PER3 gene is one of the clock genes, which function in the core mammalian molecular circadian system. A variable number of tandem repeats (VNTR) locus in the 18th exon of this gene has been strongly associated to circadian rhythm phenotypes and sleep organization in humans, but it has not been identified in other mammals except primates. To better understand the evolution and the placement of the PER3 VNTR in a phylogenetical context, the present study enlarges the investigation about the presence and the structure of this variable region in a large sample of primate species and other mammals. The analysis of the results has revealed that the PER3 VNTR occurs exclusively in simiiforme primates and that the number of copies of the primitive unit ranges from 2 to 11 across different primate species. Two transposable elements surrounding the 18th exon of PER3 were found in primates with published genome sequences, including the tarsiiforme Tarsius syrichta, which lacks the VNTR. These results suggest that this VNTR may have evolved in a common ancestor of the simiiforme branch and that the evolutionary copy number differentiation of this VNTR may be associated with primate simiiformes sleep and circadian phenotype patterns.

  3. Evolutionary history of the PER3 variable number of tandem repeats (VNTR): idiosyncratic aspect of primate molecular circadian clock.

    Directory of Open Access Journals (Sweden)

    Flávia Cal Sabino

    The PER3 gene is one of the clock genes, which function in the core mammalian molecular circadian system. A variable number of tandem repeats (VNTR) locus in the 18th exon of this gene has been strongly associated to circadian rhythm phenotypes and sleep organization in humans, but it has not been identified in other mammals except primates. To better understand the evolution and the placement of the PER3 VNTR in a phylogenetical context, the present study enlarges the investigation about the presence and the structure of this variable region in a large sample of primate species and other mammals. The analysis of the results has revealed that the PER3 VNTR occurs exclusively in simiiforme primates and that the number of copies of the primitive unit ranges from 2 to 11 across different primate species. Two transposable elements surrounding the 18th exon of PER3 were found in primates with published genome sequences, including the tarsiiforme Tarsius syrichta, which lacks the VNTR. These results suggest that this VNTR may have evolved in a common ancestor of the simiiforme branch and that the evolutionary copy number differentiation of this VNTR may be associated with primate simiiformes sleep and circadian phenotype patterns.

  4. Topological characteristics of helical repeat proteins

    NARCIS (Netherlands)

    Groves, M R; Barford, D

    The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem

  5. Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

    Science.gov (United States)

    Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

    2014-10-01

    Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized with bicyclic quaternary ammonium ionization tags, 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salt derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulting fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. The results obtained indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.

  6. Roles of genes and Alu repeats in nonlinear correlations of HUMHBB DNA sequence

    International Nuclear Information System (INIS)

    Xiao Yi; Huang Yanzhao

    2004-01-01

    DNA sequences of different species and different portions of the DNA of the same species may have completely different correlation properties, but the origin of these correlations is still not very clear and is currently being investigated, especially in different particular cases. We report here a study of the DNA sequence of human beta globin region (HUMHBB) which has strong linear and nonlinear correlations. We studied the roles of two of the typical elements of DNA sequence, genes and Alu repeats, in the nonlinear correlations of HUMHBB. We find that there exist strong nonlinear correlations between the exons or introns in different genes and between the Alu repeats. They may be one of the major sources of the nonlinear correlations in HUMHBB.

  7. Variable number of tandem repeat markers in the genome sequence of Mycosphaerella fijiensis, the causal agent of black leaf streak disease of banana (Musa spp).

    Science.gov (United States)

    Garcia, S A L; Van der Lee, T A J; Ferreira, C F; Te Lintel Hekkert, B; Zapater, M-F; Goodwin, S B; Guzmán, M; Kema, G H J; Souza, M T

    2010-11-09

    We searched the genome of Mycosphaerella fijiensis for molecular markers that would allow population genetics analysis of this plant pathogen. M. fijiensis, the causal agent of banana leaf streak disease, also known as black Sigatoka, is the most devastating pathogen attacking bananas (Musa spp). Recently, the entire genome sequence of M. fijiensis became available. We screened this database for VNTR markers. Forty-two primer pairs were selected for validation, based on repeat type and length and the number of repeat units. Five VNTR markers showing multiple alleles were validated with a reference set of isolates from different parts of the world and a population from a banana plantation in Costa Rica. Polymorphism information content values varied from 0.6414 to 0.7544 for the reference set and from 0.0400 to 0.7373 for the population set. Eighty percent of the polymorphism information content values were above 0.60, indicating that the markers are highly informative. These markers allowed robust scoring of agarose gels and proved to be useful for variability and population genetics studies. In conclusion, the strategy we developed to identify and validate VNTR markers is an efficient means to incorporate markers that can be used for fungicide resistance management and to develop breeding strategies to control banana black leaf streak disease. This is the first report of VNTR-minisatellites from the M. fijiensis genome sequence.
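
    Polymorphism information content (PIC) is commonly computed from allele frequencies with the formula of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2. The abstract does not state which formula was applied, so the sketch below assumes that definition; the function name and the example frequencies are illustrative only.

```python
from itertools import combinations

def polymorphism_information_content(freqs):
    """PIC for one marker, given allele frequencies that sum to 1."""
    homozygosity = sum(p * p for p in freqs)
    # Correction term over every unordered pair of distinct alleles.
    cross = sum(2 * (p * q) ** 2 for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross

# Four equally frequent alleles (hypothetical marker).
print(polymorphism_information_content([0.25, 0.25, 0.25, 0.25]))
# One allele dominating the population lowers the value sharply.
print(polymorphism_information_content([0.97, 0.02, 0.01]))
```

    Markers with many evenly distributed alleles score high, while a marker dominated by a single allele approaches zero, which is why the same marker can be informative in a worldwide reference set and nearly uninformative within a single plantation.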

  8. Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

    Science.gov (United States)

    Takahashi, Hajime; Ohshima, Chihiro; Nakagawa, Miku; Thanatsang, Krittaporn; Phraephaisarn, Chirapiphat; Chaturongkasumrit, Yuphakhun; Keeratipibul, Suwimon; Kuda, Takashi; Kimura, Bon

    2014-01-01

    Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similarly to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination sources. However, because PFGE is an expensive, complicated, time-consuming protocol, and poses difficulty in data sharing, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis that the number of tandem repeats present in the genome varies among strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed an MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.
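
    The conversion from a CE fragment size to a repeat count is usually simple arithmetic: subtract the non-repeat flanking sequence from the amplicon length and divide by the repeat unit length. The sketch below illustrates that general idea only; the flank and unit sizes, fragment sizes and function names are hypothetical and are not taken from the study.

```python
def repeat_count(amplicon_bp, flank_bp, unit_bp):
    """Estimate the number of repeat units from a measured PCR product size."""
    copies = (amplicon_bp - flank_bp) / unit_bp
    # CE sizing carries a small error, so round to the nearest whole unit.
    return round(copies)

def mlva_profile(amplicons_bp, loci):
    """Combine per-locus repeat counts into one MLVA profile tuple."""
    return tuple(
        repeat_count(size, flank, unit)
        for size, (flank, unit) in zip(amplicons_bp, loci)
    )

# Hypothetical (flank_bp, unit_bp) definitions for three VNTR loci.
loci = [(100, 9), (150, 12), (80, 6)]
# Hypothetical CE fragment sizes for one isolate.
print(mlva_profile([154.3, 197.6, 110.2], loci))
```

    Isolates sharing the same tuple would be assigned the same MLVA type, which is what makes the result easy to compare between laboratories.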

  9. Development of new multilocus variable number of tandem repeat analysis (MLVA) for Listeria innocua and its application in a food processing plant.

    Directory of Open Access Journals (Sweden)

    Hajime Takahashi

    Listeria innocua is an important hygiene indicator bacterium in food industries because it behaves similarly to Listeria monocytogenes, which is pathogenic to humans. PFGE is often used to characterize bacterial strains and to track contamination sources. However, because PFGE is an expensive, complicated, and time-consuming protocol that makes data sharing difficult, development of a new typing method is necessary. MLVA is a technique that identifies bacterial strains on the basis that the number of tandem repeats present in the genome varies among strains. MLVA has gained attention due to its high reproducibility and ease of data sharing. In this study, we developed an MLVA protocol to assess L. innocua and evaluated it by tracking the contamination source of L. innocua in an actual food manufacturing factory by typing the bacterial strains isolated from the factory. Three VNTR regions of the L. innocua genome were chosen for use in the MLVA. The number of repeat units in each VNTR region was calculated based on the results of PCR product analysis using capillary electrophoresis (CE). The calculated number of repetitions was compared with the results of the gene sequence analysis to demonstrate the accuracy of the CE repeat number analysis. The developed technique was evaluated using 60 L. innocua strains isolated from a food factory. These 60 strains were classified into 11 patterns using MLVA. Many of the strains were classified into ST-6, revealing that this MLVA strain type can contaminate each manufacturing process in the factory. The MLVA protocol developed in this study for L. innocua allowed rapid and easy analysis through the use of CE. This technique was found to be very useful in hygiene control in factories because it allowed us to track contamination sources and provided information regarding whether the bacteria were present in the factories.

  10. Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

    Science.gov (United States)

    Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

    2016-01-01

    A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention has been paid to their chromosomal distribution and cytogenetic diversity. In this paper, we report the distribution of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSRs with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes of B. rapa ssp. chinensis.

  11. Combination of Single Nucleotide Polymorphism and Variable-Number Tandem Repeats for Genotyping a Homogenous Population of Mycobacterium tuberculosis Beijing Strains in China

    OpenAIRE

    Luo, Tao; Yang, Chongguang; Gagneux, Sebastien; Gicquel, Brigitte; Mei, Jian; Gao, Qian

    2012-01-01

    The standard 15- and 24-locus variable-number tandem repeat (VNTR) genotyping methods have demonstrated adequate discriminatory power and a small homoplasy effect for tracing tuberculosis (TB) transmission and predicting Mycobacterium tuberculosis lineages in European and North American countries. However, its validity for the definition of transmission in homogenous M. tuberculosis populations in settings with high TB burdens has been questioned. Here, we genotyped a population-based collect...

  12. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-06-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Germline mutation rates at tandem repeat loci in DNA-repair deficient mice

    International Nuclear Information System (INIS)

    Barber, Ruth C.; Miccoli, Laurent; Buul, Paul P.W. van; Burr, Karen L.-A.; Duyn-Goedhart, Annemarie van; Angulo, Jaime F.; Dubrova, Yuri E.

    2004-01-01

    Mutation rates at two expanded simple tandem repeat (ESTR) loci were studied in the germline of non-exposed and irradiated severe combined immunodeficient (scid) and poly(ADP-ribose) polymerase (PARP-1-/-) deficient male mice. Non-exposed scid and PARP-1-/- male mice showed considerably elevated ESTR mutation rates, far higher than those in wild-type isogenic mice and other inbred strains. The irradiated scid and PARP-1-/- male mice did not show any detectable increases in their mutation rate, whereas significant ESTR mutation induction was observed in the irradiated wild-type isogenic males. ESTR mutation spectra in the scid and PARP-1-/- strains did not differ from those in the isogenic wild-type strains. Considering these data and the results of previous studies, we propose that a delay in repair of DNA damage in scid and PARP-1-/- mice could result in replication fork pausing which, in turn, may affect ESTR mutation rate in the non-irradiated males. The lack of mutation induction in irradiated scid and PARP-1-/- mice can be explained by the high cell killing effects of irradiation on the germline of deficient mice.

  14. Characterization of α-isopropylmalate synthases containing different copy numbers of tandem repeats in Mycobacterium tuberculosis

    Directory of Open Access Journals (Sweden)

    Palittapongarnpim Prasit

    2009-06-01

    Background: Alpha-isopropylmalate synthase (α-IPMS) is the key enzyme that catalyzes the first committed step in the leucine biosynthetic pathway. The gene encoding α-IPMS in Mycobacterium tuberculosis, leuA, is polymorphic due to the insertion of 57-bp repeat units referred to as Variable Number of Tandem Repeats (VNTR). The role of the VNTR found within the M. tuberculosis genome is unclear. To investigate the role of the VNTR in leuA, we compared two α-IPMS proteins with different numbers of amino acid repeats, one with two copies and the other with 14 copies. We have cloned leuA with 14 copies of the repeat units into the pET15b expression vector with a His6-tag at the N-terminus, as was previously done for the leuA gene with two copies of the repeat units. Results: The recombinant His6-α-IPMS proteins with two and 14 copies (α-IPMS-2CR and α-IPMS-14CR, respectively) of the repeat units were purified by immobilized metal ion affinity chromatography and gel filtration. Both enzymes were found to be dimers by gel filtration. Both enzymes work well at pH values of 7–8.5 and temperatures of 37–42°C. However, α-IPMS-14CR tolerates pH values and temperatures outside of this range better than α-IPMS-2CR does. α-IPMS-14CR has higher affinity than α-IPMS-2CR for the two substrates, α-ketoisovalerate and acetyl CoA. Furthermore, α-IPMS-2CR was feedback inhibited by the end product l-leucine, whereas α-IPMS-14CR was not. Conclusion: The differences in the kinetic properties and the l-leucine feedback inhibition between the two M. tuberculosis α-IPMS proteins containing low and high numbers of VNTR indicate that a large VNTR insertion affects protein structure and function. Demonstration of l-leucine binding to α-IPMS-14CR would confirm whether or not α-IPMS-14CR responds to end-product feedback inhibition.

  15. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    Science.gov (United States)

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucleotide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
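
    Motif labels such as AAG/CTT pool a repeat unit with its reverse complement, and tallies by motif class (di-, tri-, hexanucleotide, and so on) depend on unit length. A minimal sketch of that bookkeeping follows; the function names are hypothetical, and treating all rotations of a unit as the same motif is a common convention assumed here, not something the abstract states.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

CLASS_NAMES = {
    1: "mononucleotide", 2: "dinucleotide", 3: "trinucleotide",
    4: "tetranucleotide", 5: "pentanucleotide", 6: "hexanucleotide",
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def rotations(motif: str) -> list:
    """All cyclic shifts of a repeat unit, e.g. AAG -> AAG, AGA, GAA."""
    return [motif[i:] + motif[:i] for i in range(len(motif))]


def canonical_motif(motif: str) -> str:
    """Alphabetically smallest form among the rotations of the unit
    and the rotations of its reverse complement."""
    motif = motif.upper()
    return min(rotations(motif) + rotations(reverse_complement(motif)))


def motif_class(motif: str) -> str:
    """Name the SSR class from the unit length."""
    return CLASS_NAMES.get(len(motif), "other")


for unit in ("CTT", "GAA", "TC"):
    print(unit, canonical_motif(unit), motif_class(unit))
```

    With this convention, CTT, TTC, and GAA all collapse to AAG, so counts reported per motif are not split across equivalent spellings.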

  16. [Evaluation of variable number of tandem repeats (VNTR) isolates of Mycobacterium bovis in Algeria].

    Science.gov (United States)

    Sahraoui, Naima; Muller, Borna; Djamel, Yala; Fadéla, Boulahbal; Rachid, Ouzrout; Jakob, Zinsstag; Djamel, Guetarni

    2010-01-01

    The discriminatory power of variable number of tandem repeats (VNTR) typing, based on 7 loci (MIRU 26 and 27 and the 5 ETRs A, B, C, D and E), was assayed on Mycobacterium bovis strains isolated from tuberculosis samples collected in two slaughterhouses in Algeria. The MIRU-VNTR technique was evaluated on 88 strains of M. bovis and one strain of M. caprae and showed 41 different profiles. Results showed that the VNTR were highly discriminatory, with an allelic diversity of 0.930; four loci (ETR A, B, C and MIRU 27) were highly discriminatory (h>0.25) and three loci (ETR D, ETR E and MIRU 26) moderately discriminatory (0.11 [...] VNTR loci were highly discriminatory [...] be adequate for the first proper differentiation of strains of M. bovis in Algeria. The VNTR technique has proved a valuable tool for further development and application of epidemiological research on tuberculosis transmission in Algeria.
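
    The per-locus allelic diversity h used above to rank loci can be computed from the allele calls at one locus. The sketch below assumes the commonly used form h = 1 - Σ p_i² with an optional n/(n-1) small-sample correction; the abstract does not say which variant was applied, the allele calls are made up, and the label for loci below 0.11 is an assumption.

```python
from collections import Counter


def allelic_diversity(alleles, unbiased: bool = True) -> float:
    """Allelic diversity h = 1 - sum(p_i^2) for one VNTR locus.

    alleles:  repeat copy numbers observed at the locus, one per isolate
    unbiased: apply the n/(n-1) small-sample correction (assumed option)
    """
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two isolates")
    freqs = [count / n for count in Counter(alleles).values()]
    h = 1.0 - sum(p * p for p in freqs)
    return h * n / (n - 1) if unbiased else h


def discriminatory_class(h: float) -> str:
    """Bin a locus using the 0.25 and 0.11 cut-offs quoted in the abstract.

    The name of the lowest bin is an assumption.
    """
    if h > 0.25:
        return "highly discriminatory"
    if h > 0.11:
        return "moderately discriminatory"
    return "poorly discriminatory"


# Hypothetical copy numbers for ten isolates at a single locus.
calls = [3, 3, 3, 4, 4, 5, 5, 5, 5, 2]
h = allelic_diversity(calls, unbiased=False)
print(round(h, 3), discriminatory_class(h))
```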

  17. Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

    Science.gov (United States)

    Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

    2016-05-23

    Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
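
    The reported gene flow and differentiation values can be cross-checked with the widely used estimator Nm = 0.5 (1 - GST) / GST. Whether the authors used exactly this estimator is an assumption; the sketch only shows that the two published figures are mutually consistent under it.

```python
def gene_flow_from_gst(gst: float) -> float:
    """Estimate gene flow Nm from GST using Nm = 0.5 * (1 - GST) / GST.

    The choice of estimator is an assumption, not stated in the abstract.
    """
    if not 0.0 < gst <= 1.0:
        raise ValueError("GST must lie in (0, 1]")
    return 0.5 * (1.0 - gst) / gst


reported_gst = 0.2500   # from the abstract
reported_nm = 1.4996    # from the abstract

nm = gene_flow_from_gst(reported_gst)
print(nm, abs(nm - reported_nm))
```

    The small gap between the computed and reported Nm is what one would expect if GST was rounded to four decimals before publication.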

  18. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Directory of Open Access Journals (Sweden)

    Surendra K Prajapati

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  19. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Science.gov (United States)

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  20. Longitudinal survey of Staphylococcus aureus in cystic fibrosis patients using a multiple-locus variable-number of tandem-repeats analysis method

    OpenAIRE

    Vergnaud Gilles; Moissenet Didier; Corvol Harriet; Fauroux Brigitte; Corbineau Gaëlle; Hormigos Katia; Vu-Thien Hoang; Pourcel Christine

    2010-01-01

    Background: Staphylococcus aureus infection in patients with cystic fibrosis (CF) is frequent and may be due to colonization by a few pathogenic lineages. Systematic genotyping of all isolates, methicillin-susceptible S. aureus (MSSA) as well as methicillin-resistant S. aureus (MRSA), is necessary to identify such lineages and follow their evolution in patients. Multiple-locus variable-number tandem repeat analysis (MLVA/VNTR) was used to survey S. aureus clinical isolates in a French ...

  1. Inverted repeats in the promoter as an autoregulatory sequence for TcrX in Mycobacterium tuberculosis

    International Nuclear Information System (INIS)

    Bhattacharya, Monolekha; Das, Amit Kumar

    2011-01-01

    Highlights: ► The regulatory sequences recognized by TcrX have been identified. ► The regulatory region comprises inverted repeats separated by a 30 bp region. ► The mode of binding of TcrX with the regulatory sequence is unique. ► The in silico TcrX–DNA docked model binds one of the inverted repeats. ► Both phosphorylated and unphosphorylated TcrX bind the regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has not been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are composed of inverted repeats separated by ∼30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.

  2. Short tandem repeat profiling: part of an overall strategy for reducing the frequency of cell misidentification.

    Science.gov (United States)

    Nims, Raymond W; Sykes, Greg; Cottrill, Karin; Ikonomi, Pranvera; Elmore, Eugene

    2010-12-01

    The role of cell authentication in biomedical science has received considerable attention, especially within the past decade. This quality control attribute is now beginning to be given the emphasis it deserves by granting agencies and by scientific journals. Short tandem repeat (STR) profiling, one of a few DNA profiling technologies now available, is being proposed for routine identification (authentication) of human cell lines, stem cells, and tissues. The advantage of this technique over methods such as isoenzyme analysis, karyotyping, human leukocyte antigen typing, etc., is that STR profiling can establish identity to the individual level, provided that the appropriate number and types of loci are evaluated. To best employ this technology, a standardized protocol and a data-driven, quality-controlled, and publicly searchable database will be necessary. This public STR database (currently under development) will enable investigators to rapidly authenticate human-based cultures to the individual from whom the cells were sourced. Use of similar approaches for non-human animal cells will require developing other suitable loci sets. While implementing STR analysis on a more routine basis should significantly reduce the frequency of cell misidentification, additional technologies may be needed as part of an overall authentication paradigm. For instance, isoenzyme analysis, PCR-based DNA amplification, and sequence-based barcoding methods enable rapid confirmation of a cell line's species of origin while screening against cross-contaminations, especially when the cells present are not recognized by the species-specific STR method. Karyotyping may also be needed as a supporting tool during establishment of an STR database. Finally, good cell culture practices must always remain a major component of any effort to reduce the frequency of cell misidentification.

  3. Clustering of Beijing genotype Mycobacterium tuberculosis isolates from the Mekong delta in Vietnam on the basis of variable number of tandem repeat versus restriction fragment length polymorphism typing.

    NARCIS (Netherlands)

    Huyen, M.N.; Kremer, K.; Lan, N.T.; Buu, T.N.; Cobelens, F.G.; Tiemersma, E.W.; Haas, P. de; Soolingen, D. van

    2013-01-01

    BACKGROUND: In comparison to restriction fragment length polymorphism (RFLP) typing, variable number of tandem repeat (VNTR) typing is easier to perform, faster and yields results in a simple, numerical format. Therefore, this technique has gained recognition as the new international gold standard

  4. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence

    NARCIS (Netherlands)

    Semenova, E.V.; Jore, M.M.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

    2011-01-01

    Prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR)/Cas (CRISPR-associated sequences) systems provide adaptive immunity against viruses when a spacer sequence of small CRISPR RNA (crRNA) matches a protospacer sequence in the viral genome. Viruses that escape CRISPR/Cas

  5. Intermittency as a universal characteristic of the complete chromosome DNA sequences of eukaryotes: From protozoa to human genomes

    Science.gov (United States)

    Rybalko, S.; Larionov, S.; Poptsova, M.; Loskutov, A.

    2011-10-01

    Large-scale dynamical properties of complete chromosome DNA sequences of eukaryotes are considered. Using the proposed deterministic models with intermittency and symbolic dynamics we describe a wide spectrum of large-scale patterns inherent in these sequences, such as segmental duplications, tandem repeats, and other complex sequence structures. It is shown that the recently discovered gene number balance on the strands is not of a random nature, and certain subsystems of a complete chromosome DNA sequence exhibit the properties of deterministic chaos.

  6. Clustering of Beijing genotype Mycobacterium tuberculosis isolates from the Mekong delta in Vietnam on the basis of variable number of tandem repeat versus restriction fragment length polymorphism typing

    NARCIS (Netherlands)

    Huyen, Mai N. T.; Kremer, Kristin; Lan, Nguyen T. N.; Buu, Tran N.; Cobelens, Frank G. J.; Tiemersma, Edine W.; de Haas, Petra; van Soolingen, Dick

    2013-01-01

    In comparison to restriction fragment length polymorphism (RFLP) typing, variable number of tandem repeat (VNTR) typing is easier to perform, faster and yields results in a simple, numerical format. Therefore, this technique has gained recognition as the new international gold standard in typing of

  7. Simple sequence repeat (SSR) markers are effective for identifying ...

    African Journals Online (AJOL)

    DNA was extracted from newly formed leaves and amplified using 21 simple sequence repeat (SSR) markers (NH001c, NH002b, NH005b, NH007b, NH008b, NH009b, NH011b, NH013b, NH012a, NH014a, NH015a, NH017a, KA4b, KA5, KA14, KA16, KB16, KU10, BGA35, BGT23b and HGA8b). The data was analyzed by ...

  8. Evaluation of advanced multiplex short tandem repeat systems in pairwise kinship analysis.

    Science.gov (United States)

    Tamura, Tomonori; Osawa, Motoki; Ochiai, Eriko; Suzuki, Takanori; Nakamura, Takashi

    2015-09-01

    The AmpFLSTR Identifiler Kit, comprising 15 autosomal short tandem repeat (STR) loci, is commonly employed in forensic practice for calculating match probabilities and parentage testing. This conventional system provides insufficient power for kinship analysis such as sibship testing because it examines too few loci. This study evaluated the power of the PowerPlex Fusion System, GlobalFiler Kit, and PowerPlex 21 System, which comprise more than 20 autosomal STR loci, to estimate pairwise blood relatedness (i.e., parent-child, full siblings, second-degree relatives, and first cousins). The genotypes of all 24 STR loci in 10,000 putative pedigrees were constructed by simulation. The likelihood ratio for each locus was calculated from joint probabilities for relatives and non-relatives. The combined likelihood ratio was calculated according to the product rule. The addition of STR loci improved separation between relatives and non-relatives. However, these systems extended less effectively to inference for first cousins. In conclusion, these advanced systems will be useful in forensic personal identification, especially in the evaluation of full siblings and second-degree relatives. Moreover, the additional loci may give rise to two major issues: more frequent mutational events and several pairs of linked loci on the same chromosome. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
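
    The product rule mentioned in this abstract multiplies the per-locus likelihood ratios into one combined value, which is only valid if the loci are independent. A minimal sketch follows; the per-locus values are hypothetical and do not come from the study, and the function name is illustrative.

```python
import math


def combined_likelihood_ratio(locus_lrs) -> float:
    """Combine per-locus likelihood ratios with the product rule.

    Assumes the loci are independent; linked loci would violate this.
    """
    lrs = list(locus_lrs)
    if any(lr < 0 for lr in lrs):
        raise ValueError("likelihood ratios must be non-negative")
    if any(lr == 0 for lr in lrs):
        return 0.0  # a zero at any locus zeroes the whole product
    # Sum logarithms rather than multiplying, to stay stable with many loci.
    return math.exp(sum(math.log(lr) for lr in lrs))


# Hypothetical per-locus LRs for one pair of individuals.
core_loci = [2.0, 0.5, 4.0, 1.5]
extra_loci = [3.0, 0.8]

base = combined_likelihood_ratio(core_loci)
extended = combined_likelihood_ratio(core_loci + extra_loci)
print(f"{base:.2f} -> {extended:.2f}")
```

    Adding loci changes the combined value multiplicatively, which is why larger kits can separate relatives from non-relatives more clearly, and also why linkage between loci matters.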

  9. Interleukin 6 variable number of tandem repeats (VNTR) gene polymorphism in centenarians.

    Science.gov (United States)

    Capurso, C; Solfrizzi, V; D'Introno, A; Colacicco, A M; Capurso, S A; Semeraro, C; Capurso, A; Panza, F

    2007-11-01

    Recent population-based studies identified the magnitude of interleukin 6 (IL6) serum levels as a marker for functional disability, and a predictor of disability and mortality among the elderly. We investigated whether there was evidence in Southern Italy of an association between the IL6 gene variable number of tandem repeats (VNTR) polymorphism and extreme longevity, and tested for the possible interaction of apolipoprotein E (APOE) alleles with the IL6 VNTR alleles. Four alleles coding for variants of four different lengths have been identified: allele A [760 base pairs (bp)], allele B (680 bp), allele C (640 bp), and allele D (610 bp). IL6 VNTR and APOE allele and genotype frequencies were studied in a total of 61 centenarians and 94 middle-aged subjects from Southern Italy. The IL6 VNTR allele B was overrepresented in the younger control group compared with centenarians (odds ratio: 0.56, 95% confidence interval: 0.35-0.88, Bonferroni p-value [...] VNTR alleles and APOE alleles on the odds ratios to reach extreme longevity were evaluated for the smallest number of subjects in centenarians and younger controls. Our findings suggested that the presence of the IL6 VNTR allele B could be detrimental for reaching extreme longevity.

  10. Variable-Number Tandem-Repeat Analysis of Respiratory and Household Water Biofilm Isolates of “Mycobacterium avium subsp. hominissuis” with Establishment of a PCR Database

    Science.gov (United States)

    Iakhiaeva, Elena; Howard, Susan T.; Brown Elliott, Barbara A.; McNulty, Steven; Newman, Kristopher L.; Falkinham, Joseph O.; Williams, Myra; Kwait, Rebecca; Lande, Leah; Vasireddy, Ravikiran; Turenne, Christine

    2016-01-01

    “Mycobacterium avium subsp. hominissuis” is an important cause of pulmonary disease. It is acquired from environmental sources, but there is no methodology for large population studies. We evaluated the potential of variable-number tandem-repeat (VNTR) analysis. Clinical and household biofilm M. avium isolates underwent molecular identification. Testing for IS901 was done to separate M. avium subsp. avium from M. avium subsp. hominissuis. VNTR types were defined using VNTR loci, and subtyping was performed using 3′ hsp65 and internal transcribed spacer (ITS) sequencing. Forty-nine VNTR types and eight subtypes of M. avium subsp. hominissuis (IS901 negative) were identified among 416 isolates of M. avium from 121 patients and 80 biofilm sites. Of those types, 67% were found only among patient isolates, 11% only among household water isolates, and 23% among both. Of 13 VNTR types that included ≥4 patients, the majority (61.5%) represented geographic clustering (same city). Most VNTR types with multiple patients belonged to the same 3′ hsp65 sequence code (sequevar). A total of 44 isolates belonging to four M. avium subsp. hominissuis VNTR types (8%), including three with the rare Mav-F ITS sequence and 0/8 subspecies, produced amplicons with IS901 PCR primers. By sequencing, all 44 amplicons were not IS901 but ISMav6, which was recently observed in Japan but had not been previously described among U.S. isolates. VNTR analysis of M. avium subsp. hominissuis isolates is easier and faster than pulsed-field gel electrophoresis. Seven VNTR loci separated 417 isolates into 49 types. No isolates of M. avium subsp. avium were identified. The distributions of the VNTR copy numbers, the allelic diversity, and the low prevalence of ISMav6 differed from the findings for respiratory isolates reported from Japan. PMID:26739155

  11. In Silico Retrieving of Opium Poppy (Papaver Somniferum L.) Microsatellites

    Directory of Open Access Journals (Sweden)

    Masárová Veronika

    2015-12-01

    Repetitive tandem sequences were retrieved within nucleotide sequences of opium poppy (Papaver somniferum L.) genomic DNA available in the GenBank® database. Altogether 538 different microsatellites with the desired length characteristics of tandem repeats have been identified within 450 sequences of opium poppy DNA available in the database. The most frequent were mononucleotide repeats (246); nevertheless, 44 dinucleotide, 148 trinucleotide, 62 tetranucleotide, 28 pentanucleotide and 5 hexanucleotide tandem repeats have also been found. The most abundant were trinucleotide motifs (27.50%), and the most abundant motifs within each group of tandem repeats were TA/AT, TTC/GAA, GGTT/AACC and TTTTA/TAAAA. Five hexanucleotide repeats contained four different motifs.
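
    In silico microsatellite retrieval of the kind described above is often done by scanning each sequence for a short unit repeated back to back. The sketch below uses a regular expression with a backreference; the minimum repeat counts per unit length are assumed thresholds, since the abstract does not give the length criteria that were used, and the test sequence is made up.

```python
import re

# Minimum number of repeats per unit length. These thresholds are
# assumptions for illustration; the abstract does not state them.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_compound_unit(motif: str) -> bool:
    """True if the unit is itself a repeat of a shorter unit (AA, ATAT)."""
    size = len(motif)
    return any(
        size % k == 0 and motif == motif[:k] * (size // k)
        for k in range(1, size)
    )


def find_ssrs(seq: str, min_repeats: dict = MIN_REPEATS) -> list:
    """Return (start, motif, copies) for each perfect tandem repeat found."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # ([ACGT]{n}) captures one unit; \1{m,} requires m further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_compound_unit(motif):
                continue  # already reported under the shorter unit
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Made-up test sequence containing (TA)7, (TTC)5 and (A)12.
example = "GGC" + "TA" * 7 + "CCG" + "TTC" * 5 + "G" + "A" * 12 + "C"
ssrs = find_ssrs(example)
for start, motif, copies in ssrs:
    print(start, motif, copies)
```

    Positions are 0-based. This finds perfect repeats only; imperfect or compound microsatellites would need a more tolerant search.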

  12. Repeated-Sprint Sequences During Female Soccer Matches Using Fixed and Individual Speed Thresholds.

    Science.gov (United States)

    Nakamura, Fábio Y; Pereira, Lucas A; Loturco, Irineu; Rosseti, Marcelo; Moura, Felipe A; Bradley, Paul S

    2017-07-01

    Nakamura, FY, Pereira, LA, Loturco, I, Rosseti, M, Moura, FA, and Bradley, PS. Repeated-sprint sequences during female soccer matches using fixed and individual speed thresholds. J Strength Cond Res 31(7): 1802-1810, 2017. The main objective of this study was to characterize the occurrence of single sprint and repeated-sprint sequences (RSS) during elite female soccer matches, using fixed (20 km·h⁻¹) and individually based speed thresholds (>90% of the mean speed from a 20-m sprint test). Eleven elite female soccer players from the same team participated in the study. All players performed a 20-m linear sprint test, and were assessed in up to 10 official matches using Global Positioning System technology. Magnitude-based inferences were used to test for meaningful differences. Results revealed that irrespective of adopting fixed or individual speed thresholds, female players produced only a few RSS during matches (2.3 ± 2.4 sequences using the fixed threshold and 3.3 ± 3.0 sequences using the individually based threshold), with most sequences consisting of just 2 sprints. Additionally, central defenders performed fewer sprints (10.2 ± 4.1) than other positions (fullbacks: 28.1 ± 5.5; midfielders: 21.9 ± 10.5; forwards: 31.9 ± 11.1; with the differences being likely to almost certainly associated with effect sizes ranging from 1.65 to 2.72), and sprinting ability declined in the second half. The data do not support the notion that RSS occur frequently during soccer matches in female players, irrespective of using fixed or individual speed thresholds to define sprint occurrence. However, repeated-sprint ability development cannot be ruled out from soccer training programs because of its association with match-related performance.

  13. Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR) Marker Resources for Diversity Analysis of Mango (Mangifera indica L.)

    Directory of Open Access Journals (Sweden)

    Natalie L. Dillon

    2014-01-01

    In this study, a collection of 24,840 expressed sequence tags (ESTs) generated from five mango (Mangifera indica L.) cDNA libraries was mined for EST-based simple sequence repeat (SSR) markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences, with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11-SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection, and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.

  14. Genomic organization and developmental fate of adjacent repeated sequences in a foldback DNA clone of Tetrahymena thermophila

    International Nuclear Information System (INIS)

    Tschunko, A.H.; Loechel, R.H.; McLaren, N.C.; Allen, S.L.

    1987-01-01

    DNA sequence elimination and rearrangement occurs during the development of somatic cell lineages of eukaryotes and was first discovered over a century ago. However, the significance and mechanism of chromatin elimination are not understood. DNA elimination also occurs during the development of the somatic macronucleus from the germinal micronucleus in unicellular ciliated protozoa such as Tetrahymena thermophila. In this study foldback DNA from the micronucleus was used as a probe to isolate ten clones. All of those tested (4/4) contained sequences that were repetitive in the micronucleus and rearranged in the macronucleus. Inverted repeated sequences were present in one clone. This clone, pTtFBl, was subjected to a detailed analysis of its developmental fate. Subregions were subcloned and used as probes against Southern blots of micronuclear and macronuclear DNA. DNA was labeled with [³³P]-labeled dATP. The authors found that all subregions defined repeated sequence families in the micronuclear genome. A minimum of four different families was defined, two of which are retained in the macronucleus and two of which are completely eliminated. The inverted repeat family is retained with little rearrangement. Two of the families, defined by subregions that do not contain parts of the inverted repeat, are totally eliminated during macronuclear development and contain open reading frames. The significance of retained inverted repeats to the process of elimination is discussed.

  15. Analysis of short tandem repeat (STR) polymorphisms by the PowerPlex 16 system and capillary electrophoresis: application to forensic practice.

    OpenAIRE

    Okamoto, Osamu; Yamamoto, Yuji; Inagaki, Sachiyo; Yoshitome, Kei; Ishikawa, Takaki; Imabayashi, Kiyomi; Miyaishi, Satoru; Ishizu, Hideo

    2003-01-01

    Allele and genotype frequencies for 15 short tandem repeat (STR) polymorphisms--D3S1358, TH01, D21S11, D18S51, Penta E, D5S818, D13S317, D7S820, D16S539, CSF1PO, Penta D, vWA, D8S1179, TPOX and FGA--in a Japanese population were estimated. No deviations of the observed allele frequency from Hardy-Weinberg equilibrium expectations were found for any of the systems studied. Between 2 new pentanucleotide STR loci, Penta E and Penta D, for which there is only limited data regarding the allelic di...

  16. Molecular characterization of Leptospira sp by multilocus variable number tandem repeat analysis (MLVA) from clinical samples: a case report

    Directory of Open Access Journals (Sweden)

    Hélène Pailhoriès

    2015-08-01

    Leptospirosis is a zoonotic infection for which diagnosis is difficult. It has appeared as a global emerging infectious disease over recent years. Genotype determination often requires a Leptospira strain obtained by culture, which is a long and fastidious technique. A method based on multilocus variable number tandem repeat analysis (MLVA) to determine the genotype of Leptospira interrogans, performed directly on blood or urine samples, is proposed. This method was applied to a fatal case of leptospirosis for which the geographical origin of infection was unknown. This technique will allow a genotype to be obtained for L. interrogans, even when cultures remain negative.

  17. Complete chloroplast genome sequence of a major economic species, Ziziphus jujuba (Rhamnaceae).

    Science.gov (United States)

    Ma, Qiuyue; Li, Shuxian; Bi, Changwei; Hao, Zhaodong; Sun, Congrui; Ye, Ning

    2017-02-01

    Ziziphus jujuba is an important woody plant with high economic and medicinal value. Here, we analyzed and characterized the complete chloroplast (cp) genome of Z. jujuba, the first member of the Rhamnaceae family for which the chloroplast genome sequence has been reported. We also built a web browser for navigating the cp genome of Z. jujuba ( http://bio.njfu.edu.cn/gb2/gbrowse/Ziziphus_jujuba_cp/ ). Sequence analysis showed that this cp genome is 161,466 bp long and has a typical quadripartite structure of large (LSC, 89,120 bp) and small (SSC, 19,348 bp) single-copy regions separated by a pair of inverted repeats (IRs, 26,499 bp). The sequence contained 112 unique genes, including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. The genome structure, gene order, GC content, and codon usage are similar to other typical angiosperm cp genomes. A total of 38 tandem repeats, two forward repeats, and three palindromic repeats were detected in the Z. jujuba cp genome. Simple sequence repeat (SSR) analysis revealed that most SSRs were AT-rich. The homopolymer regions in the cp genome of Z. jujuba were verified and manually corrected by Sanger sequencing. One-third of mononucleotide repeats were found to be erroneously sequenced by the 454 pyrosequencing, which resulted in sequences of 1-4 bases shorter than that by the Sanger sequencing. Analyzing the cp genome of Z. jujuba revealed that the IR contraction and expansion events resulted in ycf1 and rps19 pseudogenes. A phylogenetic analysis based on 64 protein-coding genes showed that Z. jujuba was closely related to members of the Elaeagnaceae family, which will be helpful for phylogenetic studies of other Rosales species. The complete cp genome sequence of Z. jujuba will facilitate population, phylogenetic, and cp genetic engineering studies of this economic plant.
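
    The quadripartite layout reported in this record can be checked with simple arithmetic: one large single-copy region, one small single-copy region and two copies of the inverted repeat should add up to the stated genome length. A minimal sketch follows; the function name is illustrative, and the region lengths are the ones quoted in the abstract.

```python
# Region lengths in bp, as reported in the abstract above.
LSC = 89_120   # large single-copy region
SSC = 19_348   # small single-copy region
IR = 26_499    # one inverted repeat; the genome carries two copies

def quadripartite_length(lsc, ssc, ir):
    """Total length of a circular genome laid out as LSC + IR + SSC + IR."""
    return lsc + ssc + 2 * ir

total = quadripartite_length(LSC, SSC, IR)
print(total)  # 161466, matching the reported genome size

# Share of the genome occupied by the two inverted repeats, in percent.
print(round(100 * 2 * IR / total, 1))
```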

  18. The discovery, function and development of the variable number tandem repeats in different Mycobacterium species.

    Science.gov (United States)

    Sun, Zhaogang; Li, Weimin; Xu, Shaofa; Huang, Hairong

    2016-09-01

    The method of genotyping by variable number tandem repeats (VNTRs) facilitates epidemiological studies of different Mycobacterium species worldwide. However, aspects of the VNTR method, such as its discovery, function and classification, are still not fully understood, and the inconsistent nomenclature and terminology of VNTRs are especially confusing. In this review, we first describe in detail the VNTRs in Mycobacterium tuberculosis (M. tuberculosis), as this pathogen has caused more deaths than any other microbial pathogen and has been the subject of extensive VNTR studies. We then outline recent progress in VNTR-related epidemiological research on several other Mycobacterium species, such as M. abscessus, M. africanum, M. avium, M. bovis, M. canettii, M. caprae, M. intracellulare, M. leprae, M. marinum, M. microti, M. pinnipedii and M. ulcerans, from different countries and regions. This article focuses mainly on practical notes on VNTR typing to help scientists better understand and perform this method.

  19. Multi-locus variable-number tandem repeat profiling of Salmonella enterica serovar Typhi isolates from blood cultures and gallbladder specimens from Makassar, South-Sulawesi, Indonesia.

    Directory of Open Access Journals (Sweden)

    Mochammad Hatta

    Multi-locus variable-number tandem repeat analysis differentiated 297 Salmonella enterica serovar Typhi blood culture isolates from Makassar into 76 genotypes, and a single unique S. Typhi genotype was isolated from the cholecystectomy specimens of four patients with cholelithiasis. The high diversity in S. Typhi genotypes circulating in Makassar indicates that the number of carriers could be very large, which may complicate disease prevention and control.

  20. Lymphatic filarial species differentiation using evolutionarily modified tandem repeats: generation of new genetic markers.

    Science.gov (United States)

    Sakthidevi, Moorthy; Murugan, Vadivel; Hoti, Sugeerappa Laxmanappa; Kaliraj, Perumal

    2010-05-01

    Polymerase chain reaction based methods are promising tools for the monitoring and evaluation of the Global Program for the Elimination of Lymphatic Filariasis. The currently available PCR methods do not differentiate the DNA of Wuchereria bancrofti or Brugia malayi by a single PCR and hence are cumbersome. Therefore, we designed a single step PCR strategy for differentiating Bancroftian infection from Brugian infection based on a newly identified gene from the W. bancrofti genome, abundant larval transcript-2 (alt-2), which is abundantly expressed. The difference in PCR product sizes generated from the presence or absence of evolutionarily altered tandem repeats in alt-2 intron-3 differentiated W. bancrofti from B. malayi. The analysis was performed on the genomic DNA of microfilariae from a number of patient blood samples or microfilariae positive slides from different Indian geographical regions. The assay gave consistent results, differentiating the two filarial parasite species accurately. This alt-2 intron-3 based PCR assay can be a potential tool for the diagnosis and differentiation of co-infections by lymphatic filarial parasites.

  1. NIST mixed stain study 3: signal intensity balance in commercial short tandem repeat multiplexes.

    Science.gov (United States)

    Duewer, David L; Kline, Margaret C; Redman, Janette W; Butler, John M

    2004-12-01

    Short-tandem repeat (STR) allelic intensities were collected from more than 60 forensic laboratories for a suite of seven samples as part of the National Institute of Standards and Technology-coordinated 2001 Mixed Stain Study 3 (MSS3). These interlaboratory challenge data illuminate the relative importance of intrinsic and user-determined factors affecting the locus-to-locus balance of signal intensities for currently used STR multiplexes. To varying degrees, seven of the eight commercially produced multiplexes used by MSS3 participants displayed very similar patterns of intensity differences among the different loci probed by the multiplexes for all samples, in the hands of multiple analysts, with a variety of supplies and instruments. These systematic differences reflect intrinsic properties of the individual multiplexes, not user-controllable measurement practices. To the extent that quality systems specify minimum and maximum absolute intensities for data acceptability and data interpretation schema require among-locus balance, these intrinsic intensity differences may decrease the utility of multiplex results and surely increase the cost of analysis.

  2. Alu repeats as markers for human population genetics

    Energy Technology Data Exchange (ETDEWEB)

    Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States)]; Bazan, H. [Louisiana State Univ., New Orleans, LA (United States). Medical Center] [and others]

    1993-09-01

    The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 97.9% nucleotide identity with each other and an average of 98.9% nucleotide identity with the HS subfamily consensus sequence. HS Alu family members are thought to be derived from a single source "master" gene, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5′ and 3′ unique flanking DNA sequences from each HS Alu that allows the locus to be assayed for the presence or absence of an Alu repeat. Individual HS Alu sequences were found to be either monomorphic or dimorphic for the presence or absence of each repeat. The monomorphic HS Alu family members inserted in the human genome after the human/great ape divergence (which is thought to have occurred 4-6 million years ago), but before the radiation of modern man. The dimorphic HS Alu sequences inserted in the human genome after the radiation of modern man (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project as well. HS Alu family member insertion dimorphism differs from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) because individuals share HS Alu family member insertions based upon identity by descent from a common ancestor as a result of a single event which occurred one time within the human population. The VNTR and RFLP polymorphisms may arise multiple times within a population and are identical by state only.

  3. The sequence and de novo assembly of the giant panda genome

    Science.gov (United States)

    Li, Ruiqiang; Fan, Wei; Tian, Geng; Zhu, Hongmei; He, Lin; Cai, Jing; Huang, Quanfei; Cai, Qingle; Li, Bo; Bai, Yinqi; Zhang, Zhihe; Zhang, Yaping; Wang, Wen; Li, Jun; Wei, Fuwen; Li, Heng; Jian, Min; Li, Jianwen; Zhang, Zhaolei; Nielsen, Rasmus; Li, Dawei; Gu, Wanjun; Yang, Zhentao; Xuan, Zhaoling; Ryder, Oliver A.; Leung, Frederick Chi-Ching; Zhou, Yan; Cao, Jianjun; Sun, Xiao; Fu, Yonggui; Fang, Xiaodong; Guo, Xiaosen; Wang, Bo; Hou, Rong; Shen, Fujun; Mu, Bo; Ni, Peixiang; Lin, Runmao; Qian, Wubin; Wang, Guodong; Yu, Chang; Nie, Wenhui; Wang, Jinhuan; Wu, Zhigang; Liang, Huiqing; Min, Jiumeng; Wu, Qi; Cheng, Shifeng; Ruan, Jue; Wang, Mingwei; Shi, Zhongbin; Wen, Ming; Liu, Binghang; Ren, Xiaoli; Zheng, Huisong; Dong, Dong; Cook, Kathleen; Shan, Gao; Zhang, Hao; Kosiol, Carolin; Xie, Xueying; Lu, Zuhong; Zheng, Hancheng; Li, Yingrui; Steiner, Cynthia C.; Lam, Tommy Tsan-Yuk; Lin, Siyuan; Zhang, Qinghui; Li, Guoqing; Tian, Jing; Gong, Timing; Liu, Hongde; Zhang, Dejin; Fang, Lin; Ye, Chen; Zhang, Juanbin; Hu, Wenbo; Xu, Anlong; Ren, Yuanyuan; Zhang, Guojie; Bruford, Michael W.; Li, Qibin; Ma, Lijia; Guo, Yiran; An, Na; Hu, Yujie; Zheng, Yang; Shi, Yongyong; Li, Zhiqiang; Liu, Qing; Chen, Yanling; Zhao, Jing; Qu, Ning; Zhao, Shancen; Tian, Feng; Wang, Xiaoling; Wang, Haiyin; Xu, Lizhi; Liu, Xiao; Vinar, Tomas; Wang, Yajun; Lam, Tak-Wah; Yiu, Siu-Ming; Liu, Shiping; Zhang, Hemin; Li, Desheng; Huang, Yan; Wang, Xia; Yang, Guohua; Jiang, Zhi; Wang, Junyi; Qin, Nan; Li, Li; Li, Jingxiang; Bolund, Lars; Kristiansen, Karsten; Wong, Gane Ka-Shu; Olson, Maynard; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian; Wang, Jun

    2013-01-01

    Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes. PMID:20010809

  4. RePS: a sequence assembler that masks exact repeats identified from the shotgun data

    DEFF Research Database (Denmark)

    Wang, Jun; Wong, Gane Ka-Shu; Ni, Peixiang

    2002-01-01

    We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software is used to compute meaningful error probabilities for each base. Clone-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4x to 6x, despite having up to 42.2% in exact repeats. Publication date: 2002-May...
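
    The idea of identifying exact k-mer repeats directly from shotgun reads and masking them before assembly can be illustrated with a small k-mer counting sketch. This is not the RePS implementation: the function names, the max_count threshold and the choice of "N" as the mask character are assumptions made for illustration, and reverse-complement strands are ignored for brevity.

```python
from collections import Counter

def count_kmers(reads, k=20):
    """Count every exact k-mer across all reads (forward strand only)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def mask_repeats(read, counts, k=20, max_count=3, mask_char="N"):
    """Mask every base covered by a k-mer seen more than max_count times.

    max_count is a hypothetical cutoff; in practice it would be chosen
    relative to the expected sequencing coverage.
    """
    masked = list(read)
    for i in range(len(read) - k + 1):
        if counts[read[i:i + k]] > max_count:
            masked[i:i + k] = mask_char * k
    return "".join(masked)

# Toy reads with k=4 so the effect is visible; only ACGT occurs twice.
reads = ["CAACGTGA", "GGACGTTC"]
counts = count_kmers(reads, k=4)
print([mask_repeats(r, counts, k=4, max_count=1) for r in reads])
```

    Masking on the reads themselves, rather than against a curated repeat library, means the approach needs no prior knowledge of the repeat content of the genome being assembled.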

  5. In silico analysis of Simple Sequence Repeats from chloroplast genomes of Solanaceae species

    Directory of Open Access Journals (Sweden)

    Evandro Vagner Tambarussi

    2009-01-01

    The availability of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N. tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs in cpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs. The mononucleotide repeats were the most frequent in the studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats. Multiple alignments of all cpSSR sequences from Solanaceae species made the identification of nucleotide variability possible, and the phylogeny was estimated by maximum parsimony. Our study showed that the plastome database can be exploited for phylogenetic analysis and biotechnological approaches.

  6. Structural analysis of a repetitive protein sequence motif in strepsirrhine primate amelogenin.

    Directory of Open Access Journals (Sweden)

    Rodrigo S Lacruz

    2011-03-01

    Strepsirrhines are members of a primate suborder that has a distinctive set of features associated with the development of the dentition. Amelogenin (AMEL), the better known of the enamel matrix proteins, forms 90% of the secreted organic matrix during amelogenesis. Although AMEL has been sequenced in numerous mammalian lineages, the only reported strepsirrhine AMEL sequences are those of the ring-tailed lemur and galago, which contain a set of additional proline-rich tandem repeats absent in all other primate species analyzed to date, but present in some non-primate mammals. Here, we first determined that these repeats are present in AMEL from three additional lemur species and thus are likely to be widespread throughout this group. To evaluate the functional relevance of these repeats in strepsirrhines, we engineered a mutated murine amelogenin sequence containing a similar proline-rich sequence to that of Lemur catta. In the monomeric form, the MQP insertions had no influence on the secondary structure or refolding properties, whereas in the assembled form, the insertions increased the hydrodynamic radii. We speculate that increased AMEL nanosphere size may influence enamel formation in strepsirrhine primates.

  7. Structural basis for sequence-specific recognition of DNA by TAL effectors

    KAUST Repository

    Deng, Dong

    2012-01-05

    TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat comprises 33 to 35 conserved amino acids and targets a specific base pair by using two hypervariable residues [known as repeat variable diresidues (RVDs)] at positions 12 and 13. Here, we report the crystal structures of an 11.5-repeat TAL effector in both DNA-free and DNA-bound states. Each TAL repeat comprises two helices connected by a short RVD-containing loop. The 11.5 repeats form a right-handed, superhelical structure that tracks along the sense strand of DNA duplex, with RVDs contacting the major groove. The 12th residue stabilizes the RVD loop, whereas the 13th residue makes a base-specific contact. Understanding DNA recognition by TAL effectors may facilitate rational design of DNA-binding proteins with biotechnological applications.
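
    The statement that each repeat reads one base through the residues at positions 12 and 13 can be illustrated by pulling the RVD out of each repeat and looking it up in a table. The sketch below rests on assumptions: the RVD-to-base table lists commonly cited preferences that are not given in this abstract (NN is also reported to accept A), and the 34-residue scaffold is an illustrative repeat sequence, not one taken from the crystallized protein.

```python
# Commonly cited RVD-to-base preferences; an assumption here, since the
# abstract above does not list them.
RVD_CODE = {"NI": "A", "HD": "C", "NG": "T", "NN": "G"}

def rvd_of(repeat):
    """Return the two residues at positions 12 and 13 (1-based) of a repeat."""
    return repeat[11:13]

def predicted_target(repeats):
    """Read one base per repeat; unknown RVDs are reported as N."""
    return "".join(RVD_CODE.get(rvd_of(r), "N") for r in repeats)

# Illustrative 34-residue scaffold with a placeholder for the RVD.
SCAFFOLD = "LTPEQVVAIAS{}GGKQALETVQRLLPVLCQAHG"
repeats = [SCAFFOLD.format(rvd) for rvd in ("HD", "NI", "NG", "NN")]
print(predicted_target(repeats))  # CATG
```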

  8. Noninvasive prenatal paternity testing (NIPAT) through maternal plasma DNA sequencing

    DEFF Research Database (Denmark)

    Jiang, Haojun; Xie, Yifan; Li, Xuchao

    2016-01-01

    Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) have already been used to perform noninvasive prenatal paternity testing from maternal plasma DNA. The frequently used technologies were PCR followed by capillary electrophoresis and SNP typing array, respectively. Here, we developed a noninvasive prenatal paternity testing (NIPAT) method based on SNP typing with maternal plasma DNA sequencing. We evaluated the influencing factors (minor allele frequency (MAF), the number of total SNPs, fetal fraction and effective sequencing depth) and designed three different selective SNP panels...... paternity test using STR multiplex system. Our study here proved that the maternal plasma DNA sequencing-based technology is feasible and accurate in determining paternity, which may provide an alternative in forensic application in the future.

  9. DNA dynamics is likely to be a factor in the genomic nucleotide repeats expansions related to diseases.

    Directory of Open Access Journals (Sweden)

    Boian S Alexandrov

    Trinucleotide repeat sequences (TRS) represent a common type of genomic DNA motif whose expansion is associated with a large number of human diseases. The driving molecular mechanisms of the TRS ongoing dynamic expansion across generations and within tissues and its influence on genomic DNA functions are not well understood. Here we report results for a novel and notable collective breathing behavior of genomic DNA of tandem TRS, leading to propensity for large local DNA transient openings at physiological temperature. Our Langevin molecular dynamics (LMD) and Markov Chain Monte Carlo (MCMC) simulations demonstrate that the patterns of openings of various TRSs depend specifically on their length. The collective propensity for DNA strand separation of repeated sequences serves as a precursor for outsized intermediate bubble states independently of the G/C-content. We report that repeats have the potential to interfere with the binding of transcription factors to their consensus sequence by altered DNA breathing dynamics in proximity of the binding sites. These observations might influence ongoing attempts to use LMD and MCMC simulations for TRS-related modeling of genomic DNA functionality in elucidating the common denominators of the dynamic TRS expansion mutation with potential therapeutic applications.

  10. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Science.gov (United States)

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697

  11. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

    Science.gov (United States)

    Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-08-22

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified.

  12. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

    Science.gov (United States)

    Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-01-01

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555

  13. [Association of aggressive behaviors of schizophrenia with short tandem repeats loci].

    Science.gov (United States)

    Yang, Chun; Ba, Huajie; Tan, Xingqi; Zhao, Hanqing; Zhang, Shuyou; Yu, Haiying

    2017-12-10

    To assess the association of short tandem repeat (STR) loci with aggressive behaviors of schizophrenia. Blood samples from 123 schizophrenic patients with aggressive behaviors and 489 schizophrenic patients without aggressive behaviors were collected. DNA from all samples was amplified with a PowerPlex 21 system and separated by electrophoresis to determine the genotypes and allelic frequencies of 20 STR loci including D3S1358, D1S1656, D6S1043, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, and FGA. All of the 20 STR loci were in Hardy-Weinberg equilibrium in both groups. A significant difference was found in allelic and genotypic frequencies of locus Penta D between the two groups (alleles: P=0.042; genotypes: P=0.014) but not for the remaining 19 loci (P>0.05). Univariate analysis also showed a significant difference for allele 10 and genotype 10-12 of Penta D between the two groups (P=0.0027, P=0.0001), with the OR being 1.81 (95%CI: 1.22-2.67) and 4.33 (95%CI: 1.95-9.59), respectively. Penta D may be associated with aggressive behaviors of schizophrenia. Allele 10 and genotype 10-12 of Penta D may confer a risk for the disease.
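
    Odds ratios with 95% confidence intervals such as those quoted in this record are often computed on the log scale (Woolf's method), in which case the point estimate sits at the geometric midpoint of the interval. The sketch below is an assumption about the method, not a statement of what the authors did; the function names are illustrative, and because the underlying 2x2 counts are not reported here, only the published intervals are checked.

```python
import math

def or_with_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf (log-scale) CI for a 2x2 table [[a, b], [c, d]]."""
    odds_ratio = (a * d) / (b * c)
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lo = math.exp(math.log(odds_ratio) - z * se)
    hi = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lo, hi

def geometric_midpoint(lo, hi):
    """Centre of an interval that is symmetric on the log scale."""
    return math.sqrt(lo * hi)

# Intervals as reported in the abstract; the midpoints land close to the
# quoted odds ratios of 1.81 and 4.33.
print(geometric_midpoint(1.22, 2.67))
print(geometric_midpoint(1.95, 9.59))
```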

  14. Ruthenium Hydride/Brønsted Acid-Catalyzed Tandem Isomerization/N-Acyliminium Cyclization Sequence for the Synthesis of Tetrahydro-β-carbolines

    DEFF Research Database (Denmark)

    Hansen, Casper Lykke; Clausen, Janie Regitse Waël; Ohm, Ragnhild Gaard

    2013-01-01

    This paper describes an efficient tandem sequence for the synthesis of 1,2,3,4-tetrahydro-β-carbolines (THBCs) relying on a ruthenium hydride/Brønsted acid-catalyzed isomerization of allylic amides to N-acyliminium ion intermediates which are trapped by a tethered indole nucleophile. The methodol...... the Suzuki cross-coupling reaction to the isomerization/N-acyliminium cyclization sequence. Finally, diastereo- and enantioselective versions of the title reaction have been examined using substrate control (with dr >15:1) and asymmetric catalysis (ee up to 57%), respectively...

  15. simple sequence repeats (EST-SSR)

    African Journals Online (AJOL)

    Yomi

    2012-01-19

    Jan 19, 2012 ... 212 primer pairs selected, based on repeat patterns of n≥8 for di-, tri-, tetra- and penta-nucleotide repeat ... Cluster analysis revealed a high genetic similarity among the sugarcane (Saccharum spp.) breeding lines which could reduce the genetic gain in ..... The multiple allele characteristic of SSR com-.

  16. Establishment of a tandem ionization chamber system in standard mammography beams

    International Nuclear Information System (INIS)

    Silva, Jonas O. da; Caldas, L.V.E.

    2011-01-01

    A double-faced tandem ionization chamber system was developed at the Calibration Laboratory of IPEN. It has different collecting electrode materials: aluminium and graphite. The response repeatability and reproducibility and the energy dependence of this tandem ionization chamber were evaluated. The chamber response stability is within the ±3% limit recommended in international standards. The energy dependence test of the ionization chamber system, using the tandem curve obtained, agreed with literature results. (author)

  17. Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

    Science.gov (United States)

    Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...

  18. "Cryptic" repeating triplets of purines and pyrimidines (cRRY(i)) are frequent and polymorphic: Analysis of coding cRRY(i) in the proopiomelanocortin (POMC) and TATA-binding protein (TBP) genes

    Energy Technology Data Exchange (ETDEWEB)

    Gostout, B.; Qiang Liu; Sommer, S.S. (Mayo Clinic/Foundation, Rochester, MN (United States))

    1993-06-01

    Triplets of the form of purine, purine, pyrimidine (RRY(i)) are enhanced in frequency in the genomes of primates, rodents, and bacteria. Some RRY(i) are "cryptic" repeats (cRRY(i)) in which no one tandem run of a trinucleotide predominates. A search of human GenBank sequence revealed that the sequences of cRRY(i) are highly nonrandom. Three randomly chosen human cRRY(i) were sequenced in search of polymorphic alleles. Multiple polymorphic alleles were found in cRRY(i) in the coding regions of the genes for proopiomelanocortin (POMC) and TATA-binding protein (TBP). The highly polymorphic TBP cRRY(i) was characterized in detail. Direct sequencing of 157 unrelated human alleles demonstrated the presence of 20 different alleles which resulted in 29-40 consecutive glutamines in the amino-terminal region of TBP. These alleles are differently distributed among the races. PCR was used to screen 1,846 additional alleles in order to characterize more fully the range of variation in the population. Three additional alleles were discovered, but there was no example of a substantial sequence amplification as is seen in the repeat sequences associated with X-linked spinal and bulbar muscular atrophy, myotonic dystrophy, or the fragile-X syndrome. The structure of the TBP cRRY(i) is conserved in the five monkey species examined. In the chimpanzee, examination of four individuals revealed that the cRRY(i) was highly polymorphic, but the pattern of polymorphism differed from that in humans. The TBP cRRY(i) displays both similarities with and differences from the previously described RRY(i) in the coding sequence of the androgen receptor. The data suggest how simple tandem repeats could evolve from cryptic repeats. 18 refs., 3 figs., 6 tabs.

  19. Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus.

    Science.gov (United States)

    Biswas, Manosh Kumar; Chai, Lijun; Mayer, Christoph; Xu, Qiang; Guo, Wenwu; Deng, Xiuxin

    2012-05-01

    The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, genotype-discriminating capacity and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers was developed from the 46,339 Citrus clementina BESs, of which 20.67% contained SSRs longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs, (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSRs are located in non-coding regions, but 1.3% of BES-SSRs were found to be associated with transposable elements (TEs). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus-relative species. Among these, 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification and for genomic study of Citrus, Poncirus and Fortunella sp. The utility of the developed SSR markers was demonstrated by identifying a set of 118 markers each for construction of linkage maps of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationships among 40 Citrus and related species were assessed with the aid of 25 randomly selected SSR primer pairs, and the results revealed that citrus genomic SSRs are superior to genic SSRs for genetic diversity and germplasm characterization of Citrus spp.

  20. Entropic fluctuations in DNA sequences

    Science.gov (United States)

    Thanos, Dimitrios; Li, Wentian; Provata, Astero

    2018-03-01

    The Local Shannon Entropy (LSE) in blocks is used as a complexity measure to study the information fluctuations along DNA sequences. The LSE of a DNA block maps the local base arrangement information to a single numerical value. It is shown that, despite this reduction of information, LSE allows the extraction of meaningful information related to the detection of repetitive sequences in whole chromosomes and is useful in finding evolutionary differences between organisms. More specifically, large regions of tandem repeats, such as centromeres, can be detected based on their low LSE fluctuations along the chromosome. Furthermore, an empirical investigation of the appropriate block sizes is provided, and the relationship of LSE properties with the structure of the underlying repetitive units is revealed by using both computational and mathematical methods. Sequence similarity between the genomic DNA of closely related species also leads to similar LSE values at the orthologous regions. As an application, the LSE covariance function is used to measure the evolutionary distance between several primate genomes.
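    The block-wise entropy idea in this abstract can be sketched in a few lines. The following is only an illustration under stated assumptions, not the authors' implementation: the function names, the use of non-overlapping blocks, and the choice of overlapping k-letter words (parameter `k`) as the symbols whose frequencies are measured are all hypothetical choices made here.

```python
import math
from collections import Counter


def block_entropy(block: str, k: int = 1) -> float:
    """Shannon entropy (in bits) of the overlapping k-letter words in one block."""
    words = [block[j:j + k] for j in range(len(block) - k + 1)]
    total = len(words)
    counts = Counter(words)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def local_shannon_entropy(seq: str, block_size: int, k: int = 1) -> list:
    """Entropy of each consecutive, non-overlapping block of `block_size` bases.

    Assumption: a trailing partial block is ignored.
    """
    if block_size < k:
        raise ValueError("block_size must be at least k")
    seq = seq.upper()
    return [
        block_entropy(seq[i:i + block_size], k)
        for i in range(0, len(seq) - block_size + 1, block_size)
    ]


if __name__ == "__main__":
    # A mixed block followed by a homopolymer run: the second value is zero.
    print(local_shannon_entropy("ACGT" * 4 + "A" * 16, block_size=16))
```

    In this sketch a repetitive block yields few distinct words and therefore a low entropy value, which is the property the abstract relies on to flag tandem-repeat regions.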

  1. ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants.

    Science.gov (United States)

    Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

    2014-01-01

    Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitous and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, a relational database developed using SQL Server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, statistical analyses were performed for each species to calculate the relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
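    To make the definition of a perfect SSR (a 1-6 nucleotide motif repeated in tandem) concrete, here is a minimal regular-expression sketch. It is not the mining pipeline used for ChloroSSRdb; the function name `find_ssrs`, the single `min_repeats` threshold applied to every motif length, and the restriction to the letters A, C, G and T are assumptions made for illustration. Imperfect and compound SSRs are not handled.

```python
import re


def find_ssrs(seq: str, min_repeats: int = 4, max_motif: int = 6):
    """Return (start, motif, repeat_count) for each perfect SSR found.

    The lazy quantifier makes the regex try the shortest motif first,
    so 'ATATATAT' is reported as (AT)4 rather than (ATAT)2.
    """
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        motif = m.group(1)
        hits.append((m.start(), motif, len(m.group(0)) // len(motif)))
    return hits


if __name__ == "__main__":
    for start, motif, n in find_ssrs("GGCATATATATCCAAAAAAGT"):
        print(f"pos {start}: ({motif}){n}")
```

    Real SSR miners usually apply a different minimum repeat number to each motif length; a single threshold is used here only to keep the sketch short.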

  2. Selection pressure on human STR loci and its relevance in repeat expansion disease

    KAUST Repository

    Shimada, Makoto K.; Sanbonmatsu, Ryoko; Yamaguchi-Kabata, Yumi; Yamasaki, Chisato; Suzuki, Yoshiyuki; Chakraborty, Ranajit; Gojobori, Takashi; Imanishi, Tadashi

    2016-01-01

    Short Tandem Repeats (STRs) comprise repeats of one to several base pairs. Because of the high mutability due to strand slippage during DNA synthesis, rapid evolutionary change in the number of repeating units directly shapes the range of repeat

  3. Allele frequencies of ten short tandem repeats loci in the central ...

    Indian Academy of Sciences (India)

    2009-04-03

    Apr 3, 2009 ... © Indian Academy of Sciences. RESEARCH NOTE. Allele frequencies of ten short tandem ... Statistical parameters of forensic importance, the power of discrimination (PD), observed and expected ... rameters indicated the usefulness of the loci in forensic personal identification and paternity testing among ...

  4. Multi-locus variable number tandem repeat analysis of 7th pandemic Vibrio cholerae

    Directory of Open Access Journals (Sweden)

    Lam Connie

    2012-05-01

    Background: Seven pandemics of cholera have been recorded since 1817, with the current and ongoing pandemic affecting almost every continent. Cholera remains endemic in developing countries and is still a significant public health issue. In this study we use multilocus variable number of tandem repeats (VNTRs) analysis (MLVA) to discriminate between isolates of the 7th pandemic clone of Vibrio cholerae. Results: MLVA of six VNTRs selected from previously published data distinguished 66 V. cholerae isolates collected between 1961–1999 into 60 unique MLVA profiles. Only 4 MLVA profiles consisted of more than 2 isolates. The discriminatory power was 0.995. Phylogenetic analysis showed that, except for the closely related profiles, the relationships derived from MLVA profiles were in conflict with those inferred from Single Nucleotide Polymorphism (SNP) typing. The six SNP groups share consensus VNTR patterns and two SNP groups contained isolates which differed by only one VNTR locus. Conclusions: MLVA is highly discriminatory in differentiating 7th pandemic V. cholerae isolates and MLVA data was most useful in resolving the genetic relationships among isolates within groups previously defined by SNPs. Thus MLVA is best used in conjunction with SNP typing in order to best determine the evolutionary relationships among the 7th pandemic V. cholerae isolates and for longer term epidemiological typing.
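    The abstract reports a discriminatory power without naming the formula. A common choice for typing schemes such as MLVA is the Hunter-Gaston index (Simpson's index of diversity), D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates sharing profile j and N is the total number of isolates. The sketch below assumes that index; the function name and the example profiles are hypothetical and are not taken from the study.

```python
from collections import Counter


def discriminatory_index(profiles) -> float:
    """Hunter-Gaston discriminatory index for a list of typing profiles.

    Each profile must be hashable, e.g. a tuple of repeat counts per VNTR locus.
    """
    n = len(profiles)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(profiles)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


if __name__ == "__main__":
    # Hypothetical MLVA profiles: repeat counts at six VNTR loci per isolate.
    isolates = [
        (9, 3, 6, 4, 17, 21),
        (9, 3, 6, 4, 17, 21),  # shares a profile with the first isolate
        (8, 3, 6, 4, 20, 15),
        (10, 3, 6, 4, 18, 19),
    ]
    print(round(discriminatory_index(isolates), 3))
```

    A value close to 1 means that two isolates drawn at random are very unlikely to share a profile.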

  5. Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

    Science.gov (United States)

    Raynard, Steven J; Baker, Mark D

    2004-01-01

    In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARs enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.

  6. C-terminal low-complexity sequence repeats of Mycobacterium smegmatis Ku modulate DNA binding.

    Science.gov (United States)

    Kushwaha, Ambuj K; Grove, Anne

    2013-01-24

    Ku protein is an integral component of the NHEJ (non-homologous end-joining) pathway of DSB (double-strand break) repair. Both eukaryotic and prokaryotic Ku homologues have been characterized and shown to bind DNA ends. A unique feature of Mycobacterium smegmatis Ku is its basic C-terminal tail that contains several lysine-rich low-complexity PAKKA repeats that are absent from homologues encoded by obligate parasitic mycobacteria. Such PAKKA repeats are also characteristic of mycobacterial Hlp (histone-like protein) for which they have been shown to confer the ability to appose DNA ends. Unexpectedly, removal of the lysine-rich extension enhances DNA-binding affinity, but an interaction between DNA and the PAKKA repeats is indicated by the observation that only full-length Ku forms multiple complexes with a short stem-loop-containing DNA previously designed to accommodate only one Ku dimer. The C-terminal extension promotes DNA end-joining by T4 DNA ligase, suggesting that the PAKKA repeats also contribute to efficient end-joining. We suggest that low-complexity lysine-rich sequences have evolved repeatedly to modulate the function of unrelated DNA-binding proteins.

  7. The complete mitochondrial genome sequence of the Tibetan red fox (Vulpes vulpes montana).

    Science.gov (United States)

    Zhang, Jin; Zhang, Honghai; Zhao, Chao; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2015-01-01

    In this study, the complete mitochondrial genome of the Tibetan red fox (Vulpes vulpes montana) was sequenced for the first time using blood samples obtained from a wild female red fox captured in Lhasa, Tibet, China. The Qinghai-Tibet Plateau is the highest plateau in the world, with an average elevation above 3500 m. Sequence analysis showed that the genome contains a 12S rRNA gene, a 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region (CR). Variable tandem repeats in the CR are the main reason for the length variability of the mitochondrial genome among canids.

  8. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    Science.gov (United States)

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  9. Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation

    International Nuclear Information System (INIS)

    O'Hara, P.J.; Grant, F.J.; Haldeman, B.A.; Gray, C.L.; Insley, M.Y.; Hagen, F.S.; Murray, M.J.

    1987-01-01

    Activated factor VII (factor VIIa) is a vitamin K-dependent plasma serine protease that participates in a cascade of reactions leading to the coagulation of blood. Two overlapping genomic clones containing sequences encoding human factor VII were isolated and characterized. The complete sequence of the gene was determined and found to span about 12.8 kilobases. The mRNA for factor VII as demonstrated by cDNA cloning is polyadenylylated at multiple sites but contains only one AAUAAA poly(A) signal sequence. The mRNA can undergo alternative splicing, forming one transcript containing eight segments as exons and another with an additional exon that encodes a larger prepro leader sequence. The latter transcript has no known counterpart in the other vitamin K-dependent proteins. The positions of the introns with respect to the amino acid sequence encoded by the eight essential exons of factor VII are the same as those present in factor IX, factor X, protein C, and the first three exons of prothrombin. These exons code for domains generally conserved among members of this gene family. The comparable introns in these genes, however, are dissimilar with respect to size and sequence, with the exception of intron C in factor VII and protein C. The gene for factor VII also contains five regions made up of tandem repeats of oligonucleotide monomer elements. More than a quarter of the intron sequences and more than a third of the 3' untranslated portion of the mRNA transcript consist of these minisatellite tandem repeats

  10. In Silico Mining of Microsatellites in Coding Sequences of the Date Palm (Arecaceae) Genome, Characterization, and Transferability

    Directory of Open Access Journals (Sweden)

    Frédérique Aberlenc-Bertossi

    2014-01-01

    Premise of the study: To complement existing sets of primarily dinucleotide microsatellite loci from noncoding sequences of date palm, we developed primers for tri- and hexanucleotide microsatellite loci identified within genes. Due to their conserved genomic locations, the primers should be useful in other palm taxa, and their utility was tested in seven other Phoenix species and in Chamaerops, Livistona, and Hyphaene. Methods and Results: Tandem repeat motifs of 3–6 bp were searched using a simple sequence repeat (SSR)–pipeline package in coding portions of the date palm draft genome sequence. Fifteen loci produced highly consistent amplification, intraspecific polymorphisms, and stepwise mutation patterns. Conclusions: These microsatellite loci showed sufficient levels of variability and transferability to make them useful for population genetic, selection signature, and interspecific gene flow studies in Phoenix and other Coryphoideae genera.

  11. Molecular Methods for Typing of Streptococcus agalactiae with Special Emphasis on the Development and Validation of a Multi-Locus Variable Number of Tandem Repeats Assay (MLVA)

    OpenAIRE

    Radtke, Andreas

    2012-01-01

    Molecular methods for typing of Streptococcus agalactiae with special emphasis on the development and validation of a multi-locus variable number of tandem repeats assay (MLVA). Summary: Streptococcus agalactiae, or group B streptococci (GBS), causes life-threatening infections in newborns, pregnant women and adults with chronic diseases. It also causes mastitis in cattle. Typing of GBS provides insight into the epidemiology of the bacterium and its phylogenetic relationships. Different parts of the bacteri...

  12. Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability

    Science.gov (United States)

    Xiao, Jing; Zhao, Jin; Liu, Mengjun; Liu, Ping; Dai, Li; Zhao, Zhihui

    2015-01-01

    Chinese jujube (Ziziphus jujuba), an economically important species in the Rhamnaceae family, is a popular fruit tree in Asia. Here, we surveyed and characterized simple sequence repeats (SSRs) in the jujube genome. A total of 436,676 SSR loci were identified, with an average distance of 0.93 kb between the loci. A large proportion of the SSRs included mononucleotide, dinucleotide and trinucleotide repeat motifs, which accounted for 64.87%, 24.40%, and 8.74% of all repeats, respectively. Among the mononucleotide repeats, A/T was the most common, whereas AT/TA was the most common dinucleotide repeat. A total of 30,565 primer pairs were successfully designed and screened using a series of criteria. Moreover, 725 of 1,000 randomly selected primer pairs were effective among 6 cultivars, and 511 of these primer pairs were polymorphic. Sequencing the amplicons of two SSRs across three jujube cultivars revealed variations in the repeats. Transferability tests of the jujube SSR primers showed that 35/64 SSRs could be transferred across family boundaries. Using jujube SSR primers, clustering analysis results from 15 species were highly consistent with the Angiosperm Phylogeny Group (APGIII) System. The genome-wide characterization of SSRs in Chinese jujube is very valuable for whole-genome characterization and marker-assisted selection in jujube breeding. In addition, the transferability of jujube SSR primers could provide a solid foundation for their further utilization. PMID:26000739

  13. Effects of loading sequences and size of repeated stress block of loads on fatigue life calculated using fatigue functions

    International Nuclear Information System (INIS)

    Schott, G.

    1989-01-01

    It is well known that the collective form, stress intensity and loading sequence of individual stresses, as well as the size of repeated stress blocks, can significantly influence fatigue life. The basic variant of the consecutive Woehler curve concept permits these effects to be included in fatigue life computation. The paper demonstrates that fatigue life computations using fatigue functions reflect the loading sequence effect under multilevel loading precisely and provide reliable fatigue life data. The effects of the size of the repeated stress block and of the loading sequence on fatigue life, as observed in block program tests, can be reproduced using the new computation method. (orig.) [de

  14. A Tandem Repeat in Decay Accelerating Factor 1 Is Associated with Severity of Murine Mercury-Induced Autoimmunity

    Directory of Open Access Journals (Sweden)

    David M. Cauvi

    2014-01-01

    Decay accelerating factor (DAF), a complement-regulatory protein, protects cells from bystander complement-mediated lysis and negatively regulates T cells. Reduced expression of DAF occurs in several systemic autoimmune diseases including systemic lupus erythematosus, and DAF deficiency exacerbates disease in several autoimmune models, including murine mercury-induced autoimmunity (mHgIA). Daf1, located within Hmr1, a chromosome 1 locus associated in DBA/2 mice with resistance to mHgIA, could be a candidate. Here we show that reduced Daf1 transcription in lupus-prone mice was not associated with a reduction in the Daf1 transcription factor SP1. Studies of NZB mice congenic for the mHgIA-resistant DBA/2 Hmr1 locus suggested that Daf1 expression was controlled by the host genome and not the Hmr1 locus. A unique pentanucleotide repeat variant in the second intron of Daf1 in DBA/2 mice was identified and shown in F2 intercrosses to be associated with less severe disease; however, analysis of Hmr1 congenics indicated that this most likely reflected the presence of autoimmunity-predisposing genetic variants within the Hmr1 locus or that Daf1 expression is mediated by the tandem repeat in epistasis with other genetic variants present in autoimmune-prone mice. These studies argue that the effect of DAF on autoimmunity is complex and may require multiple genetic elements.

  15. Identification of apple cultivars on the basis of simple sequence repeat markers.

    Science.gov (United States)

    Liu, G S; Zhang, Y G; Tao, R; Fang, J G; Dai, H Y

    2014-09-12

    DNA markers are useful tools that play an important role in plant cultivar identification. They are usually based on polymerase chain reaction (PCR) and include simple sequence repeats (SSRs), inter-simple sequence repeats, and random amplified polymorphic DNA. However, DNA markers have not been used effectively in the complete identification of plant cultivars because of the lack of known DNA fingerprints. Recently, a novel approach called the cultivar identification diagram (CID) strategy was developed to facilitate the use of DNA markers for separating plant individuals. The CID was designed so that a polymorphic marker generated from each PCR directly allowed for cultivar sample separation at each step. Therefore, it could be used to identify cultivars and varieties easily with fewer primers. In this study, 60 apple cultivars, including a few main cultivars in fields and varieties from descendants (Fuji x Telamon), were examined. Of the 20 pairs of SSR primers screened, 8 pairs gave reproducible, polymorphic DNA amplification patterns. The banding patterns obtained from these 8 primers were used to construct a CID map. Each cultivar or variety in this study was distinguished from the others completely, indicating that this method can be used for efficient cultivar identification. The result contributes to studies on germplasm resources and the seedling industry in fruit trees.

  16. Mitochondrial genome of the Komodo dragon: efficient sequencing method with reptile-oriented primers and novel gene rearrangements.

    Science.gov (United States)

    Kumazawa, Yoshinori; Endo, Hideki

    2004-04-30

    The mitochondrial genome of the Komodo dragon (Varanus komodoensis) was nearly completely sequenced, except for two highly repetitive noncoding regions. An efficient sequencing method for squamate mitochondrial genomes was established by combining the long polymerase chain reaction (PCR) technology and a set of reptile-oriented primers designed for nested PCR amplifications. It was found that the mitochondrial genome had novel gene arrangements in which genes from NADH dehydrogenase subunit 6 to proline tRNA were extensively shuffled with duplicate control regions. These control regions had 99% sequence similarity over 700 bp. Although snake mitochondrial genomes are also known to possess duplicate control regions with nearly identical sequences, the location of the second control region suggested independent occurrence of the duplication on lineages leading to snakes and the Komodo dragon. Another feature of the mitochondrial genome of the Komodo dragon was the considerable number of tandem repeats, including sequences with a strong secondary structure, as a possible site for the slipped-strand mispairing in replication. These observations are consistent with hypotheses that tandem duplications via the slipped-strand mispairing may induce mitochondrial gene rearrangements and may serve to maintain similar copies of the control region.

  17. Interleukin-1 Receptor Antagonist and Interleukin-4 Genes Variable Number Tandem Repeats Are Associated with Adiposity in Malaysian Subjects

    Directory of Open Access Journals (Sweden)

    Yung-Yean Kok

    2017-01-01

    Interleukin-1 receptor antagonist (IL1RA) intron 2 86 bp repeat and interleukin-4 (IL4) intron 3 70 bp repeat are variable number tandem repeats (VNTRs) that have been associated with various diseases, but their role in obesity is elusive. The objective of this study was to investigate the association of IL1RA and IL4 VNTRs with obesity and adiposity in 315 Malaysian subjects (128 M/187 F; 23 Malays/251 ethnic Chinese/41 ethnic Indians). The allelic distributions of IL1RA and IL4 were significantly different among ethnicities, and the alleles were associated with total body fat (TBF) classes. Individuals with IL1RA I/II genotype or allele II had greater risk of having higher overall adiposity, relative to those having the I/I genotype or I allele, respectively, even after controlling for ethnicity [odds ratio (OR) of I/II genotype = 12.21 (CI = 2.54, 58.79; p=0.002); II allele = 5.78 (CI = 1.73, 19.29; p=0.004)]. However, IL4 VNTR B2 allele was only significantly associated with overall adiposity status before adjusting for ethnicity [OR = 1.53 (CI = 1.04, 2.23; p=0.03)]. Individuals with IL1RA II allele had significantly higher TBF than those with I allele (31.79±2.52 versus 23.51±0.40; p=0.005). Taken together, IL1RA intron 2 VNTR seems to be a genetic marker for overall adiposity status in Malaysian subjects.

  18. Sub-typing of extended-spectrum-β-lactamase-producing isolates from a nosocomial outbreak: application of a 10-loci generic Escherichia coli multi-locus variable number tandem repeat analysis.

    Directory of Open Access Journals (Sweden)

    Nahid Karami

    Extended-spectrum β-lactamase producing Escherichia coli (ESBL-E. coli) were isolated from infants hospitalized in a neonatal, post-surgery ward during a four-month-long nosocomial outbreak and six-month follow-up period. A multi-locus variable number tandem repeat analysis (MLVA), using 10 loci (GECM-10), for 'generic' (i.e., non-STEC) E. coli was applied for sub-species-level (i.e., sub-typing) delineation and characterization of the bacterial isolates. Ten distinct GECM-10 types were detected among 50 isolates, correlating with the types defined by pulsed-field gel electrophoresis (PFGE), which is recognized to be the 'gold-standard' method for clinical epidemiological analyses. Multi-locus sequence typing (MLST), multiplex PCR genotyping of bla CTX-M, bla TEM, bla OXA and bla SHV genes and antibiotic resistance profiling, as well as a PCR assay specific for detecting isolates of the pandemic O25b-ST131 strain, further characterized the outbreak isolates. Two clusters of isolates with distinct GECM-10 types (G06-04 and G07-02), corresponding to two major PFGE types and the MLST-based sequence types (STs) 131 and 1444, respectively, were confirmed to be responsible for the outbreak. The application of GECM-10 sub-typing provided reliable, rapid and cost-effective epidemiological characterizations of the ESBL-producing isolates from a nosocomial outbreak that correlated with and may be used to replace the laborious PFGE protocol for analyzing generic E. coli.

  19. Distribution and evolution of repeated sequences in genomes of Triatominae (Hemiptera-Reduviidae) inferred from genomic in situ hybridization.

    Directory of Open Access Journals (Sweden)

    Sebastian Pita

    The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by genomic in situ hybridization (GISH) using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is constituted by highly repeated sequences, which are conserved among 10 species of the Triatomini tribe, suggesting that this is an ancestral character for the group. However, the Y chromosome in the Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about the distribution of autosomal repeated sequences but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.

  20. Simple sequence repeats in Neurospora crassa: distribution, polymorphism and evolutionary inference

    Directory of Open Access Journals (Sweden)

    Park Jongsun

    2008-01-01

    Background: Simple sequence repeats (SSRs) have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. Results: We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt) SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST), which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC) and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. Conclusion: Taking our computational, statistical and experimental data together, we conclude that (1) the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, (2) the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, (3) there are different levels of evolutionary forces in variation of amino acid repeats, and (4) SSRs are stable molecular markers for genetic studies in N. crassa.
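
    The abstract above reports a genome-wide count of SSRs but not the search procedure. As a minimal sketch of how perfect SSRs can be located in a sequence, the following uses a back-referencing regular expression. The function name `find_ssrs` and all thresholds (unit size 1 to 6 nt, at least 3 copies, at least 12 nt total) are illustrative assumptions, not the criteria used by the authors.

```python
import re


def find_ssrs(seq, min_unit=1, max_unit=6, min_repeats=3, min_length=12):
    """Return (start, motif, copies) for perfect tandem repeats in seq.

    All thresholds are illustrative assumptions. The lazy quantifier makes
    the regex try the shortest repeat unit first at each position.
    """
    seq = seq.upper()
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        tract = match.group(0)
        if len(tract) >= min_length:
            hits.append((match.start(), motif, len(tract) // len(motif)))
    return hits


if __name__ == "__main__":
    # A made-up sequence containing one (CAG)4 tract.
    print(find_ssrs("GGTTCAGCAGCAGCAGTTGACC"))
```

    Matches are non-overlapping and scanned left to right, so imperfect or compound repeats would need a different approach.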

  1. Sequence variations in C9orf72 downstream of the hexanucleotide repeat region and its effect on repeat-primed PCR interpretation

    DEFF Research Database (Denmark)

    Nordin, Angelica; Akimoto, Chizuru; Wuolikainen, Anna

    2017-01-01

    A large GGGGCC-repeat expansion mutation (HREM) in C9orf72 is the most common known cause of ALS and FTD in European populations. Sequence variations immediately downstream of the HREM region have previously been observed and have been suggested to be one reason for difficulties in interpreting R...

  2. Journal of Genetics | Indian Academy of Sciences

    Indian Academy of Sciences (India)

    pp 49-54 Research Article. Exact Tandem Repeats Analyzer (E-TRA): A new program for DNA sequence mining · Mehmet Karaca Mehmet Bilgen A. Naci Onus Ayse Gul Ince Safinaz Y. Elmasulu · More Details Abstract Fulltext PDF. Exact Tandem Repeats Analyzer 1.0 (E-TRA) combines sequence motif searches with ...

  3. Length and repeat-sequence variation in 58 STRs and 94 SNPs in two Spanish populations.

    Science.gov (United States)

    Casals, Ferran; Anglada, Roger; Bonet, Núria; Rasal, Raquel; van der Gaag, Kristiaan J; Hoogenboom, Jerry; Solé-Morata, Neus; Comas, David; Calafell, Francesc

    2017-09-01

    We have genotyped the 58 STRs (27 autosomal, 24 Y-STRs and 7 X-STRs) and 94 autosomal SNPs in Illumina ForenSeq™ Primer Mix A in 88 Spanish Roma (Gypsy) samples and 143 Catalans. Since this platform is based on massively parallel sequencing, we have used simple R scripts to uncover the sequence variation in the repeat region. Thus, we have found, across 58 STRs, 541 length-based alleles, which, after considering repeat-sequence variation, became 804 different alleles. All loci in both populations were in Hardy-Weinberg equilibrium. F(ST) between both populations was 0.0178 for autosomal SNPs, 0.0146 for autosomal STRs, 0.0101 for X-STRs and 0.1866 for Y-STRs. Combined a priori statistics were quite large; for instance, pooling all the autosomal loci, the a priori probabilities of discriminating a suspect become 1-(2.3×10^-70) and 1-(5.9×10^-73), for Roma and Catalans respectively, and the chances of excluding a false father in a trio are 1-(2.6×10^-20) and 1-(2.0×10^-21). Copyright © 2017 Elsevier B.V. All rights reserved.
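
    Combined discrimination probabilities in this abstract are written as one minus a very small number. A hedged sketch of why that notation is practical: if per-locus match probabilities are multiplied under an assumption of independent loci (the usual product rule, assumed here rather than taken from the paper), the complement cannot be told apart from 1.0 in double precision, so it is easier to work in log10 space. The function names and the per-locus value of 0.05 are hypothetical.

```python
import math


def combined_log10_match_probability(per_locus_match_probs):
    """Sum log10 of per-locus match probabilities (independence assumed)."""
    return sum(math.log10(p) for p in per_locus_match_probs)


def format_discrimination(log10_match):
    """Render 1 - 10**log10_match in the '1-(m×10^e)' style of the abstract."""
    exponent = math.floor(log10_match)
    mantissa = 10 ** (log10_match - exponent)
    return "1-(%.1f×10^%d)" % (mantissa, exponent)


if __name__ == "__main__":
    # Hypothetical data: 27 loci, each with a match probability of 0.05.
    log10_match = combined_log10_match_probability([0.05] * 27)
    print(format_discrimination(log10_match))
    # The naive floating-point complement collapses to exactly 1.0.
    print(1.0 - 10 ** log10_match == 1.0)
```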

  4. Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

    Science.gov (United States)

    Amirhaeri, S; Wohlrab, F; Wells, R D

    1995-02-17

    The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.

  5. Molecular typing of Salmonella enterica serovar typhi isolates from various countries in Asia by a multiplex PCR assay on variable-number tandem repeats.

    Science.gov (United States)

    Liu, Yichun; Lee, May-Ann; Ooi, Eng-Eong; Mavis, Yeo; Tan, Ai-Ling; Quek, Hung-Hiang

    2003-09-01

    A multiplex PCR method incorporating primers flanking three variable-number tandem repeat (VNTR) loci (arbitrarily labeled TR1, TR2, and TR3) in the CT18 strain of Salmonella enterica serovar Typhi has been developed for molecular typing of S. enterica serovar Typhi clinical isolates from several Asian countries, including Singapore, Indonesia, India, Bangladesh, Malaysia, and Nepal. We have demonstrated that the multiplex PCR could be performed on crude cell lysates and that the VNTR banding profiles produced could be easily analyzed by visual inspection after conventional agarose gel electrophoresis. The assay was highly discriminative in identifying 49 distinct VNTR profiles among 59 individual isolates. A high level of VNTR profile heterogeneity was observed in isolates from within the same country and among countries. These VNTR profiles remained stable after the strains were passaged extensively under routine laboratory culture conditions. In contrast to the S. enterica serovar Typhi isolates, an absence of TR3 amplicons and a lack of length polymorphisms in TR1 and TR2 amplicons were observed for other S. enterica serovars, such as Salmonella enterica serovar Typhimurium, Salmonella enterica serovar Enteritidis, and Salmonella enterica serovar Paratyphi A, B, and C. DNA sequencing of the amplified VNTR regions substantiated these results, suggesting the high stability of the multiplex PCR assay. The multiplex-PCR-based VNTR profiling developed in this study provides a simple, rapid, reproducible, and high-resolution molecular tool for the epidemiological analysis of S. enterica serovar Typhi strains.

  6. Substructure of a Tunisian Berber population as inferred from 15 autosomal short tandem repeat loci.

    Science.gov (United States)

    Khodjet-El-Khil, Houssein; Fadhlaoui-Zid, Karima; Gusmão, Leonor; Alves, Cíntia; Benammar-Elgaaied, Amel; Amorim, Antonio

    2008-08-01

    Currently, language and cultural practices are the only criteria to distinguish between Berber autochthonous Tunisian populations. To evaluate these populations' possible genetic structure and differentiation, we have analyzed 15 autosomal short tandem repeat loci (CSF1PO, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D21S11, FGA, TH01, TPOX, VWA, D2S1338, and D19S433) in three southern Tunisian Berber groups: Sened, Matmata, and Chenini-Douiret. The exact test of population differentiation based on allele frequencies at the 15 loci shows significant P values at 7 loci between Chenini-Douiret and both Sened and Matmata, whereas just 5 loci show significant P values between Sened and Matmata. Comparative analyses between the three Berber groups based on genetic distances show that P values for F(ST) distances are significant between the three Berber groups. Population analysis performed using Structure shows a clear differentiation between these Berber groups, with strong genetic isolation of Chenini-Douiret. These results confirm at the autosomal level the high degree of heterogeneity of Tunisian Berber populations that had been previously reported for uniparental markers.

  7. Core genome conservation of Staphylococcus haemolyticus limits sequence based population structure analysis.

    Science.gov (United States)

    Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson

    2012-06-01

    The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20-year period from eight countries. MLST resolved 17 sequence types (Simpson's index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
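
    The SID values quoted above summarize how well a typing scheme separates isolates. As a sketch, the form of Simpson's index of diversity commonly used for typing schemes (the Hunter-Gaston formulation, assumed here because the abstract does not state which variant was used) can be computed from a list of type assignments. The function name and the example labels are hypothetical.

```python
from collections import Counter


def simpsons_index_of_diversity(type_labels):
    """Hunter-Gaston form: 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)).

    type_labels holds one type assignment per isolate.
    """
    counts = Counter(type_labels)
    total = sum(counts.values())
    if total < 2:
        raise ValueError("at least two isolates are required")
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (total * (total - 1))


if __name__ == "__main__":
    # Hypothetical typing result for four isolates.
    print(round(simpsons_index_of_diversity(["ST1", "ST1", "ST2", "ST3"]), 4))
```

    The index is the probability that two isolates drawn at random without replacement fall into different types, so it is 0 when all isolates share one type and 1 when every isolate is unique.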

  8. Rapid functional and sequence differentiation of a tandemly repeated species-specific multigene family in Drosophila

    DEFF Research Database (Denmark)

    Clifton, Bryan D.; Sanz, Pablo Librado; Yeh, Shu-Dan

    2017-01-01

    Gene clusters of recently duplicated genes are hotbeds for evolutionary change. However, our understanding of how mutational mechanisms and evolutionary forces shape the structural and functional evolution of these clusters is hindered by the high sequence identity among the copies, which typical...

  9. Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

    Science.gov (United States)

    Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

    2013-04-01

    Efficient and robust molecular markers are essential for molecular breeding in plants. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior for genetic linkage mapping and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. A total of 774 SSRs were identified in 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) were the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, an assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. High transferability of these EST-SSR markers to related species was also found.
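
    PIC values like those reported above are derived from allele frequencies at each marker. A minimal sketch, assuming the widely used Botstein-style definition PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j² (the abstract does not say which formula was applied, and scoring alleles in an autotetraploid may call for adjustments not shown here). The function name and the example counts are hypothetical.

```python
def polymorphism_information_content(allele_counts):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2.

    allele_counts may be raw counts or frequencies; they are normalized here.
    """
    total = float(sum(allele_counts))
    p = [c / total for c in allele_counts]
    homozygosity = sum(x * x for x in p)
    cross_term = sum(
        2 * p[i] ** 2 * p[j] ** 2
        for i in range(len(p))
        for j in range(i + 1, len(p))
    )
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Hypothetical marker with two equally frequent alleles.
    print(polymorphism_information_content([10, 10]))
```

    With two equally frequent alleles the result is 0.375, and a monomorphic marker gives 0, which is why markers with many balanced alleles reach values near the upper end of the reported range.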

  10. Extrachromosomal circles of satellite repeats and 5S ribosomal DNA in human cells

    Directory of Open Access Journals (Sweden)

    Cohen Sarit

    2010-03-01

    Background: Extrachromosomal circular DNA (eccDNA) is ubiquitous in eukaryotic organisms and was detected in every organism tested, including in humans. Two-dimensional gel electrophoresis facilitates the detection of eccDNA in preparations of genomic DNA. Using this technique we have previously demonstrated that most eccDNA consists of exact multiples of chromosomal tandemly repeated DNA, including both coding genes and satellite DNA. Results: Here we report the occurrence of eccDNA in every tested human cell line. It has heterogeneous mass ranging from less than 2 kb to over 20 kb. We describe eccDNA homologous to human alpha satellite and the SstI mega satellite. Moreover, we show, for the first time, circular multimers of the human 5S ribosomal DNA (rDNA), similar to previous findings in Drosophila and plants. We further demonstrate structures that correspond to intermediates of rolling circle replication, which emerge from the circular multimers of 5S rDNA and SstI satellite. Conclusions: These findings, and previous reports, support the general notion that every chromosomal tandem repeat is prone to generate eccDNA in eukaryotic organisms including humans. They suggest the possible involvement of eccDNA in the length variability observed in arrays of tandem repeats. The implications of eccDNA on genome biology may include mechanisms of centromere evolution, concerted evolution and homogenization of tandem repeats and genomic plasticity.

  11. Transcription arrest by a G quadruplex forming-trinucleotide repeat sequence from the human c-myb gene.

    Science.gov (United States)

    Broxson, Christopher; Beckett, Joshua; Tornaletti, Silvia

    2011-05-17

    Non canonical DNA structures correspond to genomic regions particularly susceptible to genetic instability. The transcription process facilitates formation of these structures and plays a major role in generating the instability associated with these genomic sites. However, little is known about how non canonical structures are processed when encountered by an elongating RNA polymerase. Here we have studied the behavior of T7 RNA polymerase (T7RNAP) when encountering a G quadruplex forming-(GGA)(4) repeat located in the human c-myb proto-oncogene. To make direct correlations between formation of the structure and effects on transcription, we have taken advantage of the ability of the T7 polymerase to transcribe single-stranded substrates and of G4 DNA to form in single-stranded G-rich sequences in the presence of potassium ions. Under physiological KCl concentrations, we found that T7 RNAP transcription was arrested at two sites that mapped to the c-myb (GGA)(4) repeat sequence. The extent of arrest did not change with time, indicating that the c-myb repeat represented an absolute block and not a transient pause to T7 RNAP. Consistent with G4 DNA formation, arrest was not observed in the absence of KCl or in the presence of LiCl. Furthermore, mutations in the c-myb (GGA)(4) repeat, expected to prevent transition to G4, also eliminated the transcription block. We show T7 RNAP arrest at the c-myb repeat in double-stranded DNA under conditions mimicking the cellular concentration of biomolecules and potassium ions, suggesting that the G4 structure formed in the c-myb repeat may represent a transcription roadblock in vivo. Our results support a mechanism of transcription-coupled DNA repair initiated by arrest of transcription at G4 structures.

  12. Comparison of the capillary and agarose electrophoresis based multiple locus VNTR (variable number of tandem repeats) analysis (MLVA) on Mycobacterium bovis isolates.

    Science.gov (United States)

    Jenkins, A O; Venter, E H; Hutamo, K; Godfroid, J

    2010-09-28

    Electrophoretic techniques that can be used for genotyping of bacterial pathogens range from manual, low-cost agarose gels to high-throughput capillary electrophoresis sequencing machines. These two methods, agarose electrophoresis (AE) and capillary electrophoresis (CE), are currently employed in the electrophoresis of PCR products used in multiple locus VNTR (variable number of tandem repeats) analysis (MLVA). Some authors have suggested that clusters generated by AE are less reliable than those generated by CE and that the latter is a more sensitive technique than the former when typing Mycobacterium tuberculosis complex (MTC) isolates. Because such a claim could have significant consequences for investigators in this field, a comparison was made on 19 Belgian Mycobacterium bovis strains which had previously been genotyped using CE VNTR analysis. The VNTR profiles of the CE VNTR analysis were compared with those obtained by AE VNTR analysis at 14 VNTR loci. Our results indicated that there were no differences in copy numbers at all loci tested when the copy numbers obtained by the AE VNTR analysis were compared with those obtained by CE VNTR analysis. The use of AE VNTR analysis in mycobacterial genotyping does not alter the sensitivity of the MLVA technique compared with the CE VNTR analysis. The AE VNTR can therefore be regarded as a viable alternative in moderately equipped laboratories that cannot afford the expensive equipment required for CE VNTR analysis and data obtained by AE VNTR analysis can be shared between laboratories which use the CE VNTR method. (c) 2010 Elsevier B.V. All rights reserved.

  13. Flanking Variation Influences Rates of Stutter in Simple Repeats

    Directory of Open Access Journals (Sweden)

    August E. Woerner

    2017-11-01

    It has been posited that the longest uninterrupted stretch (LUS) of tandem repeats, as defined by the number of exactly matching repeating motif units, is a better predictor of rates of stutter than the parental allele length (PAL). While there are cases where this hypothesis is likely correct, such as the 9.3 allele in the TH01 locus, there can be situations where it may not apply as well. For example, the PAL may capture flanking indel variations while remaining insensitive to polymorphisms in the repeat, and these haplotypic changes may impact the stutter rate. To address this, rates of stutter were contrasted against the LUS as well as the PAL on different flanking haplotypic backgrounds. This study shows that rates of stutter can vary substantially depending on the flanking haplotype, and while there are cases where the LUS is a better predictor of stutter than the PAL, examples to the contrary are apparent in commonly assayed forensic markers. Further, flanking variation that is 7 bp from the repeat region can impact rates of stutter. These findings suggest that non-proximal effects, such as DNA secondary structure, may be impacting the rates of stutter in common forensic short tandem repeat markers.
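
    The LUS defined above (the largest number of exactly matching, consecutive motif units) can be computed directly from an allele sequence. The following is a sketch under stated assumptions: the function name is hypothetical, the motif is supplied by the caller, and the example allele is an illustrative interrupted repeat rather than sequence data from the study.

```python
import re


def longest_uninterrupted_stretch(allele_seq, motif):
    """Return the largest count of consecutive exact copies of motif.

    A lookahead is used so that runs starting at every position are
    considered, including runs that overlap an earlier, shorter run.
    """
    if not motif:
        raise ValueError("motif must be non-empty")
    pattern = r"(?=((?:%s)+))" % re.escape(motif.upper())
    runs = re.findall(pattern, allele_seq.upper())
    return max((len(run) // len(motif) for run in runs), default=0)


if __name__ == "__main__":
    # Illustrative allele: six AATG units, a 3-nt interruption, three more units.
    allele = "AATG" * 6 + "ATG" + "AATG" * 3
    print(longest_uninterrupted_stretch(allele, "AATG"))
```

    In this example the allele spans nine full motif units plus a partial one, yet the LUS is 6, which illustrates how the two predictors compared in the abstract can diverge.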

  14. Identifying uniformly mutated segments within repeats.

    Science.gov (United States)

    Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

    2004-12-01

    Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of an n-coin model for given S, subject to a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l^4 log l), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).

  15. Analyse des génomes à la recherche de répétitions en tandem polymorphes : outils d'épidémiologie bactérienne et locus hypermutables humains

    OpenAIRE

    Denoeud , France

    2003-01-01

    Thesis supported by the DGA; Tandem repeats are consecutive occurrences of a DNA unit. Such structures are found in all organisms, prokaryotes as well as eukaryotes. Although their biological function is not fully understood, they have diverse practical applications. In bacteria, polymorphic tandem repeats (with varying copy numbers) are powerful tools for strain identification in bacterial epidemiology. In humans, some tandem repeats mutate at a very high rate: hypermutable minisatellites are...

  16. Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

    Science.gov (United States)

    Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

    2014-04-08

    The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.

  17. Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

    Science.gov (United States)

    Oggioni, M R; Claverys, J P

    1999-10-01

    A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.

  18. Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

    International Nuclear Information System (INIS)

    Kudo, Shinichi; Fukuda, Minoru

    1989-01-01

    Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ~1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggests that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication.

  19. Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

    Science.gov (United States)

    Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

    2012-08-01

    Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.

  20. Prenylcoumarins in One or Two Steps by a Microwave-Promoted Tandem Claisen Rearrangement/Wittig Olefination/Cyclization Sequence.

    Science.gov (United States)

    Schultze, Christiane; Schmidt, Bernd

    2018-05-04

    The one-pot synthesis of 8-prenylcoumarins from 1,1-dimethylallylated salicylaldehydes and the stabilized ylide [(ethoxycarbonyl)methylene]triphenylphosphorane under microwave conditions was found to have a limited scope. The sequence suffers from a difficult and sometimes low-yielding synthesis of the precursors and from a competing deprenylation upon microwave irradiation. This side reaction occurs in particular with electron rich arenes with two or more alkoxy groups at adjacent positions, a prominent substitution pattern in naturally occurring 8-prenylcoumarins. Both limitations of this one-step sequence were overcome by a two-step synthesis consisting of a microwave-promoted tandem allyl ether Claisen rearrangement/Wittig olefination and a subsequent olefin cross metathesis with 2-methyl-2-butene. The cross metathesis step proceeds with a high selectivity and yields exclusively the desired prenyl, rather than the alternative crotyl substituent. Several naturally occurring 8-prenylcoumarins that were previously inaccessible have been synthesized in good overall yields along this route.

  1. Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

    Science.gov (United States)

    Lewis, Cecil M

    2010-02-01

    This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.

  2. Development of a Multiple Loci Variable Number of Tandem Repeats Analysis (MLVA) to Unravel the Intra-Pathovar Structure of Pseudomonas syringae pv. actinidiae Populations Worldwide

    Science.gov (United States)

    Ciarroni, Serena; Gallipoli, Lorenzo; Taratufolo, Maria C.; Butler, Margi I.; Poulter, Russell T. M.; Pourcel, Christine; Vergnaud, Gilles; Balestra, Giorgio M.; Mazzaglia, Angelo

    2015-01-01

    The bacterial canker of kiwifruit caused by Pseudomonas syringae pv. actinidiae is an emblematic example of a catastrophic disease of fruit crops. In 2008 a new, extremely virulent form of the pathogen emerged and rapidly devastated many Actinidia spp. orchards all over the world. In order to understand differences in populations within this pathovar and to elucidate their diffusion and movements on a world scale, it is necessary to be able to quickly and on a routine basis compare new isolates with previous records. In this report a worldwide collection of 142 strains was analyzed by MLVA, chosen as investigative technique for its efficacy, reproducibility, simplicity and low cost. A panel of 13 Variable Number of Tandem Repeats (VNTR) loci was identified and used to describe the pathogen population. The MLVA clustering is highly congruent with the population structure as previously established by other molecular approaches including whole genome sequencing and correlates with geographic origin, time of isolation and virulence. For convenience, we divided the VNTR loci into two panels. Panel 1 assay, using six loci, recognizes 23 different haplotypes, clustered into ten complexes with highest congruence with previous classifications. Panel 2, with seven VNTR loci, provides discriminatory power. Using the total set of 13 VNTR loci, 58 haplotypes can be distinguished. The recent hypervirulent type shows very limited diversity and includes, besides the strains from Europe, New Zealand and Chile, a few strains from Shaanxi, China. A broad genetic variability is observed in China, but different types are also retrievable in Japan and Korea. The low virulent strains cluster together and are very different from the other MLVA genotypes. Data were used to generate a public database in MLVAbank. MLVA represents a very promising first-line assay for large-scale routine genotyping, prior to whole genome sequencing of only the most relevant samples. PMID:26262683

  3. Transcription of tandemly repetitive DNA: functional roles.

    Science.gov (United States)

    Biscotti, Maria Assunta; Canapa, Adriana; Forconi, Mariko; Olmo, Ettore; Barucca, Marco

    2015-09-01

    A considerable fraction of the eukaryotic genome is made up of satellite DNA constituted of tandemly repeated sequences. These elements are mainly located at centromeres, pericentromeres, and telomeres and are major components of constitutive heterochromatin. Although originally satellite DNA was thought silent and inert, an increasing number of studies are providing evidence on its transcriptional activity supporting, on the contrary, an unexpected dynamicity. This review summarizes the multiple structural roles of satellite noncoding RNAs at chromosome level. Indeed, satellite noncoding RNAs play a role in the establishment of a heterochromatic state at centromere and telomere. These highly condensed structures are indispensable to preserve chromosome integrity and genome stability, preventing recombination events, and ensuring the correct chromosome pairing and segregation. Moreover, these RNA molecules seem to be involved also in maintaining centromere identity and in elongation, capping, and replication of telomere. Finally, the abnormal variation of centromeric and pericentromeric DNA transcription across major eukaryotic lineages in stress condition and disease has evidenced the critical role that these transcripts may play and the potentially dire consequences for the organism.

  4. Multicolor-based discrimination of 21 short tandem repeats and amelogenin using four fluorescent universal primers.

    Science.gov (United States)

    Asari, Masaru; Okuda, Katsuhiro; Hoshina, Chisato; Omura, Tomohiro; Tasaki, Yoshikazu; Shiono, Hiroshi; Matsubara, Kazuo; Shimizu, Keiko

    2016-02-01

    The aim of this study was to develop a cost-effective genotyping method using high-quality DNA for human identification. A total of 21 short tandem repeats (STRs) and amelogenin were selected, and fluorescent fragments at 22 loci were simultaneously amplified in a single-tube reaction using locus-specific primers with 24-base universal tails and four fluorescent universal primers. Several nucleotide substitutions in universal tails and fluorescent universal primers enabled the detection of specific fluorescent fragments from the 22 loci. Multiplex polymerase chain reaction (PCR) produced intense FAM-, VIC-, NED-, and PET-labeled fragments ranging from 90 to 400 bp, and these fragments were discriminated using standard capillary electrophoretic analysis. The selected 22 loci were also analyzed using two commercial kits (the AmpFLSTR Identifiler Kit and the PowerPlex ESX 17 System), and results for two loci (D19S433 and D16S539) were discordant between these kits due to mutations at the primer binding sites. All genotypes from the 100 samples were determined using 2.5 ng of DNA by our method, and the expected alleles were completely recovered. Multiplex 22-locus genotyping using four fluorescent universal primers effectively reduces the costs to less than 20% of genotyping using commercial kits, and our method would be useful to detect silent alleles from commercial kit analysis. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

    Science.gov (United States)

    M.N. Islam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

    2009-01-01

    The 18S-28S rDNA and 5S rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 18S-28S rDNA sites and one 5S rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...

  6. Evaluation of a highly discriminating multiplex multi-locus variable-number of tandem-repeats (MLVA) analysis for Vibrio cholerae.

    Science.gov (United States)

    Olsen, Jaran S; Aarskaug, Tone; Skogan, Gunnar; Fykse, Else Marie; Ellingsen, Anette Bauer; Blatny, Janet M

    2009-09-01

    Vibrio cholerae is the etiological agent of cholera and may be used in bioterror actions due to the ease of its dissemination and the public fear of contracting cholera. A simple and highly discriminating method for connecting clinical and environmental isolates of V. cholerae is needed in microbial forensics. Twelve different loci containing variable numbers of tandem-repeats (VNTRs) were evaluated, of which six loci were polymorphic. Two multiplex reactions containing PCR primers targeting these six VNTRs resulted in successful DNA amplification of 142 various environmental and clinical V. cholerae isolates. The genetic distribution inside the V. cholerae strain collection was used to evaluate the discriminating power (Simpson's Diversity Index = 0.99) of this new MLVA analysis, showing that the assay has the potential to differentiate between various strains, but also to identify those isolates which are collected from a common V. cholerae outbreak. This work has established a rapid and highly discriminating MLVA assay useful for track back analyses and/or forensic studies of V. cholerae infections.

  7. Genotyping and Molecular Identification of Date Palm Cultivars Using Inter-Simple Sequence Repeat (ISSR) Markers.

    Science.gov (United States)

    Ayesh, Basim M

    2017-01-01

    Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal highly polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. The protocol describes how 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, with 28 polymorphic loci.
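
    The pairwise band comparison described in this abstract can be sketched as follows. This is a minimal illustration, not the chapter's protocol: the Jaccard coefficient is an assumed choice of similarity measure, and the cultivar names and band scores are hypothetical.

```python
from itertools import combinations


def jaccard_similarity(a, b):
    """Bands present in both profiles divided by bands present in either."""
    shared = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return shared / either if either else 1.0


# Hypothetical presence (1) / absence (0) scores at six gel band positions.
profiles = {
    "cultivar_A": [1, 1, 0, 1, 0, 1],
    "cultivar_B": [1, 0, 0, 1, 1, 1],
    "cultivar_C": [0, 1, 1, 0, 0, 1],
}

# Similarity for every pair of cultivars, and the matching distance (1 - similarity).
similarity = {
    (p, q): jaccard_similarity(profiles[p], profiles[q])
    for p, q in combinations(profiles, 2)
}
distance = {pair: 1.0 - s for pair, s in similarity.items()}
```

    A distance matrix of this kind is the usual input to a clustering step such as UPGMA or neighbor joining, which the sketch leaves out.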

  8. Analysis of the 9p21.3 sequence associated with coronary artery disease reveals a tendency for duplication in a CAD patient

    Science.gov (United States)

    Kouprina, Natalay; Noskov, Vladimir N.; Waterfall, Joshua J.; Walker, Robert L.; Meltzer, Paul S.; Topol, Eric J.; Larionov, Vladimir

    2018-01-01

    Tandem segmental duplications (SDs) greater than 10 kb are widespread in complex genomes. They provide material for gene divergence and evolutionary adaptation, while formation of specific de novo SDs is a hallmark of cancer and some human diseases. Most SDs map to distinct genomic regions termed ‘duplication blocks’. SD organization within these blocks is often poorly characterized as they are mosaics of ancestral duplicons juxtaposed with younger duplicons arising from more recent duplication events. Structural and functional analysis of SDs is further hampered as long repetitive DNA structures are underrepresented in existing BAC and YAC libraries. We applied Transformation-Associated Recombination (TAR) cloning, a versatile technique for large DNA manipulation, to selectively isolate the coronary artery disease (CAD) interval sequence within the 9p21.3 chromosome locus from a patient with coronary artery disease and normal individuals. Four tandem head-to-tail duplicons, each ∼50 kb long, were recovered in the patient but not in normal individuals. Sequence analysis revealed that the repeats varied by 10-15 SNPs between each other and by 82 SNPs from the human genome sequence (version hg19). SNP polymorphism within the junctions between repeats allowed two junction types to be distinguished, Type 1 and Type 2, which were found at a 2:1 ratio. The junction sequences contained an Alu element, a sequence previously shown to play a role in duplication. Knowledge of structural variation in the CAD interval from more patients could help link this locus to cardiovascular disease susceptibility, and may be relevant to other cases of regional amplification, including cancer. PMID:29632643

  9. The complete chloroplast genome sequence of Taxus chinensis var. mairei (Taxaceae): loss of an inverted repeat region and comparative analysis with related species.

    Science.gov (United States)

    Zhang, Yanzhen; Ma, Ji; Yang, Bingxian; Li, Ruyi; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Zhang, Lin

    2014-05-01

    Taxus chinensis var. mairei (Taxaceae) is a domestic variety of yew species in local China. This plant is one of the sources for paclitaxel, which has been a promising antineoplastic chemotherapy drug during the last decade. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of T. chinensis var. mairei. The T. chinensis var. mairei cp genome is 129,513 bp in length, with 113 single copy genes and two duplicated genes (trnI-CAU, trnQ-UUG). Among the 113 single copy genes, 9 are intron-containing. Compared to other land plant cp genomes, the T. chinensis var. mairei cp genome has lost one of the large inverted repeats (IRs) found in angiosperms, fern, liverwort, and gymnosperms such as Cycas revoluta and Ginkgo biloba L. Compared to related species, the gene order of T. chinensis var. mairei has a large inversion of ~110 kb including 91 genes (from rps18 to accD) with gene contents unarranged. Repeat analysis identified 48 direct and 2 inverted repeats 30 bp long or longer with a sequence identity greater than 90%. Repeated short segments were found in genes rps18, rps19 and clpP. Analysis also revealed 22 simple sequence repeat (SSR) loci, almost all of which are composed of A or T. Copyright © 2014 Elsevier B.V. All rights reserved.

  10. Short tandem repeat (STR) DNA markers are hypervariable and informative in Cannabis sativa: implications for forensic investigations.

    Science.gov (United States)

    Gilmore, Simon; Peakall, Rod; Robertson, James

    2003-01-09

    Short tandem repeat (STR) markers are the DNA marker of choice in forensic analysis of human DNA. Here we extend the application of STR markers to Cannabis sativa and demonstrate their potential for forensic investigations. Ninety-three individual cannabis plants, representing drug and fibre accessions of widespread origin, were profiled with five STR markers. A total of 79 alleles were detected across the five loci. All but four individuals from a single drug-type accession had a unique multilocus genotype. An analysis of molecular variance (AMOVA) revealed significant genetic variation among accessions, with an average of 25% genetic differentiation. By contrast, only 6% genetic difference was detected between drug and fibre crop accessions and it was not possible to unequivocally assign plants as either drug or fibre type. However, our results suggest that drug strains may typically possess lower genetic diversity than fibre strains, which may ultimately provide a means of genetic delineation. Our findings demonstrate the promise of cannabis STR markers to provide information on: (1) agronomic type, (2) the geographical origin of drug seizures, and (3) evidence of conspiracy in production of clonally propagated drug crops.

  11. Selection pressure on human STR loci and its relevance in repeat expansion disease

    KAUST Repository

    Shimada, Makoto K.

    2016-06-11

    Short Tandem Repeats (STRs) comprise repeats of one to several base pairs. Because of the high mutability due to strand slippage during DNA synthesis, rapid evolutionary change in the number of repeating units directly shapes the range of repeat-number variation according to selection pressure. However, the remaining questions include: Why are STRs causing repeat expansion diseases maintained in the human population; and why are these limited to neurodegenerative diseases? By evaluating the genome-wide selection pressure on STRs using the database we constructed, we identified two different patterns of relationship in repeat-number polymorphisms between DNA and amino-acid sequences, although both patterns are evolutionary consequences of avoiding the formation of harmful long STRs. First, a mixture of degenerate codons is represented in poly-proline (poly-P) repeats. Second, long poly-glutamine (poly-Q) repeats are favored at the protein level; however, at the DNA level, STRs encoding long poly-Qs are frequently divided by synonymous SNPs. Furthermore, significant enrichments of apoptosis and neurodevelopment were biological processes found specifically in genes encoding poly-Qs with repeat polymorphism. This suggests the existence of a specific molecular function for polymorphic and/or long poly-Q stretches. Given that the poly-Qs causing expansion diseases were longer than other poly-Qs, even in healthy subjects, our results indicate that the evolutionary benefits of long and/or polymorphic poly-Q stretches outweigh the risks of long CAG repeats predisposing to pathological hyper-expansions. Molecular pathways in neurodevelopment requiring long and polymorphic poly-Q stretches may provide a clue to understanding why poly-Q expansion diseases are limited to neurodegenerative diseases. © 2016, Springer-Verlag Berlin Heidelberg.

  12. Comparison of serum creatine kinase estimation with short tandem repeats based linkage analysis in carriers and affected children of Duchenne muscular dystrophy

    International Nuclear Information System (INIS)

    Hashim, R.; Ahmad, S.; Sattar, A.; Khan, F.A.

    2011-01-01

    Background: Duchenne Muscular Dystrophy (DMD) is an X-linked recessive, lethal genetic disorder characterised by progressive weakness of skeletal muscles which is untreatable and transmitted to males by carrier females. Advances in laboratory techniques now focus on direct mutational analysis as the most reliable method, and on indirect Short Tandem Repeats (STR) based linkage analysis as a feasible, inexpensive, and efficient method for carrier detection and prenatal diagnosis. The objective of this study was to compare the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and diagnostic efficiency of Serum Creatine Kinase (SCK) with Short Tandem Repeats (STR) based linkage analysis in carriers and affected children of Duchenne Muscular Dystrophy. Methods: The study was carried out from Dec 2006 to Dec 2007 in families having index clinical cases of DMD who were referred from different hospitals for evaluation/workup of DMD. SCK was done as a preliminary investigation in all index cases. The PCR assay with STR based linkage analysis with Intron 44, 45, 49 and 50 of the DMD gene was performed in all families. Six families were informative with Intron 44 of the DMD gene and one family was non-informative with all four intronic markers of DMD. SCK analyses were done in all the family members and compared with PCR analysis in informative families. SCK was not performed on chorionic villus samples (CVS) done for prenatal diagnosis of DMD, and CVS and non-informative family members were excluded from the study. Results: In carriers of DMD, the sensitivity and negative predictive value of SCK were 33.3%, and the specificity and positive predictive value were 100%, with a diagnostic efficiency of 50%. In affected cases of DMD, the sensitivity and negative predictive value of SCK were 100%, the specificity and positive predictive value were 91% and 88.8% respectively, and the diagnostic efficiency was 94.1%. Conclusion: The SCK is an excellent screening test for
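
    For readers unfamiliar with the measures reported in this record, the sketch below shows how sensitivity, specificity, PPV, NPV and diagnostic efficiency follow from a 2x2 table. The function name and the counts are hypothetical; the counts are not the study's data and were chosen only so that the ratios match the carrier percentages quoted in the abstract.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Standard screening-test measures from a 2x2 confusion table."""
    total = tp + fp + tn + fn
    return {
        "sensitivity": tp / (tp + fn),      # affected correctly flagged
        "specificity": tn / (tn + fp),      # unaffected correctly cleared
        "ppv": tp / (tp + fp),              # positive predictive value
        "npv": tn / (tn + fn),              # negative predictive value
        "efficiency": (tp + tn) / total,    # overall share of correct calls
    }


# Hypothetical counts (not from the study): 2 true positives, 0 false
# positives, 2 true negatives, 4 false negatives.
carriers = diagnostic_metrics(tp=2, fp=0, tn=2, fn=4)
```

    With these illustrative counts, sensitivity and NPV both come out at one third while specificity and PPV are 1.0 and efficiency is 0.5, the same pattern the abstract reports for carriers.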

  13. Large Diversity of Porcine Yersinia enterocolitica 4/O:3 in Eight European Countries Assessed by Multiple-Locus Variable-Number Tandem-Repeat Analysis.

    Science.gov (United States)

    Alakurtti, Sini; Keto-Timonen, Riikka; Virtanen, Sonja; Martínez, Pilar Ortiz; Laukkanen-Ninios, Riikka; Korkeala, Hannu

    2016-06-01

    A total of 253 multiple-locus variable-number tandem-repeat analysis (MLVA) types among 634 isolates were discovered while studying the genetic diversity of porcine Yersinia enterocolitica 4/O:3 isolates from eight different European countries. Six variable-number tandem-repeat (VNTR) loci V2A, V4, V5, V6, V7, and V9 were used to study the isolates from 82 farms in Belgium (n = 93, 7 farms), England (n = 41, 8 farms), Estonia (n = 106, 12 farms), Finland (n = 70, 13 farms), Italy (n = 111, 20 farms), Latvia (n = 66, 3 farms), Russia (n = 60, 10 farms), and Spain (n = 87, 9 farms). Cluster analysis revealed mainly country-specific clusters, and only one MLVA type consisting of two isolates was found from two countries: Russia and Italy. Also, farm-specific clusters were discovered, but same MLVA types could also be found from different farms. Analysis of multiple isolates originating either from the same tonsils (n = 4) or from the same farm, but 6 months apart, revealed both identical and different MLVA types. MLVA showed a very good discriminatory ability with a Simpson's discriminatory index (DI) of 0.989. DIs for VNTR loci V2A, V4, V5, V6, V7, and V9 were 0.916, 0.791, 0.901, 0.877, 0.912, and 0.785, respectively, when studying all isolates together, but variation was evident between isolates originating from different countries. Locus V4 in the Spanish isolates and locus V9 in the Latvian isolates did not differentiate (DI 0.000), and locus V9 in the English isolates showed very low discriminatory power (DI 0.049). The porcine Y. enterocolitica 4/O:3 isolates were diverse, but the variation in DI demonstrates that the well discriminating loci V2A, V5, V6, and V7 should be included in MLVA protocol when maximal discriminatory power is needed.
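
    The discriminatory index quoted in this record is commonly computed with the Hunter-Gaston form of Simpson's index, DI = 1 - sum(n_j(n_j - 1)) / (N(N - 1)), where n_j is the number of isolates of type j and N the total number of isolates. A minimal sketch, assuming that form is the one used; the type labels below are hypothetical.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston discriminatory index for a list of per-isolate type labels."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels).values()
    # Probability that two isolates drawn without replacement share a type.
    same_type = sum(c * (c - 1) for c in counts) / (n * (n - 1))
    return 1.0 - same_type


# Hypothetical typing result: eight isolates falling into five types.
isolates = ["T1", "T1", "T1", "T2", "T2", "T3", "T4", "T5"]
di = discriminatory_index(isolates)
```

    A locus at which every isolate carries the same allele gives DI = 0, which is how a value such as "DI 0.000" arises for a non-differentiating locus.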

  14. Superfamily of ankyrin repeat proteins in tomato.

    Science.gov (United States)

    Yuan, Xiaowei; Zhang, Shizhong; Qing, Xiaohe; Sun, Meihong; Liu, Shiyang; Su, Hongyan; Shu, Huairui; Li, Xinzheng

    2013-07-10

    The ankyrin repeat (ANK) protein family plays a crucial role in plant growth and development and in response to biotic and abiotic stresses. However, no detailed information concerning this family is available for tomato (Solanum lycopersicum) due to the limited information on whole genome sequences. In this study, we identified a total of 130 ANK genes in the tomato genome (SlANK), and these genes were distributed across all 12 chromosomes at various densities. Chromosomal localization of the SlANK genes indicated that 25 of them were involved in tandem duplications. Based on their domain composition, all of the SlANK proteins were grouped into 13 subgroups. A combined phylogenetic tree was constructed with the aligned SlANK protein sequences. This tree revealed that the SlANK proteins comprise five major groups. An analysis of the expression profiles of SlANK genes in tomato in different tissues and in response to stresses showed that the SlANK proteins play roles in plant growth, development and stress responses. To our knowledge, this is the first report of a genome-wide analysis of the tomato ANK gene family. This study provides valuable information regarding the classification and putative functions of SlANK genes in tomato. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.

  15. Authentication of Fish Products by Large-Scale Comparison of Tandem Mass Spectra

    DEFF Research Database (Denmark)

    Wulff, Tune; Nielsen, Michael Engelbrecht; Deelder, André M.

    2013-01-01

    Authentication of food is a major concern worldwide to ensure that food products are correctly labeled in terms of which animals are actually processed for consumption. Normally authentication is based on species recognition by comparison of selected sequences of DNA or protein. We here present...... a new robust, proteome-wide tandem mass spectrometry method for species recognition and food product authentication. The method does not use or require any genome sequences or selection of tandem mass spectra but uses all acquired data. The experimental steps were performed in a simple, standardized...

  16. Population data of 17 short tandem repeat loci in 2923 individuals from the Han population of Nantong in East China.

    Science.gov (United States)

    Yang, Min; Li, Liming; Han, Haijun; Jin, Li; Jia, Dongtao; Li, Shilin

    2016-09-01

    Nantong is located in mid-eastern China, and the Han population in Nantong may be greatly affected by population admixture between northern and southern Han Chinese populations. In this study, we analyzed 17 autosomal short tandem repeat (STR) loci on 2923 unrelated individuals collected from the Han population of Nantong. No significant deviation from Hardy-Weinberg equilibrium was observed at all STR loci, and the expected heterozygosity ranged from 0.6184 to 0.9187. The combined match probability (CMP) was 3.87 × 10(-21), and the combined power of discrimination (CPD) was 99.999999999999999999613 %. No significant difference of allele frequencies was observed between Nantong and other Han populations at all STR loci, as well as Dai, Mongolian, and Tibetan. Significant differences were only observed between Nantong Han and Uyghur at TH01, as well as Nantong Han and Dong at CSF1PO and FGA. Nantong Han showed significant differences between She, Bouyei, and Miao at multiple STR loci.
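
    The two combined statistics in this record are linked by simple arithmetic: the combined match probability (CMP) is the product of the per-locus match probabilities, and the combined power of discrimination (CPD) is 1 - CMP. The sketch below checks the reported figures against each other using Python's decimal module, because ordinary floating point cannot hold that many significant digits. The function name and the per-locus values in the second example are hypothetical.

```python
from decimal import Decimal, getcontext

getcontext().prec = 50  # enough digits to keep 1 - 3.87e-21 exact


def combine_loci(match_probabilities):
    """Return (CMP, CPD) from per-locus match probabilities given as Decimals."""
    cmp_value = Decimal(1)
    for p in match_probabilities:
        cmp_value *= p
    return cmp_value, 1 - cmp_value


# Consistency check on the figures quoted in the abstract.
reported_cmp = Decimal("3.87E-21")
cpd_percent = (1 - reported_cmp) * 100

# Hypothetical two-locus example, not data from the study.
example_cmp, example_cpd = combine_loci([Decimal("0.1"), Decimal("0.05")])
```

    Under this relation the reported CMP of 3.87 × 10(-21) corresponds to the reported CPD of 99.999999999999999999613 %.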

  17. Inter-simple sequence repeat (ISSR) loci mapping in the genome of perennial ryegrass

    DEFF Research Database (Denmark)

    Pivorienė, O; Pašakinskienė, I; Brazauskas, G

    2008-01-01

    The aim of this study was to identify and characterize new ISSR markers and their loci in the genome of perennial ryegrass. A subsample of the VrnA F2 mapping family of perennial ryegrass comprising 92 individuals was used to develop a linkage map including inter-simple sequence repeat markers...... demonstrated a 70% similarity to the Hordeum vulgare germin gene GerA. Inter-SSR mapping will provide useful information for gene targeting, quantitative trait loci mapping and marker-assisted selection in perennial ryegrass....

  18. Complete Sequence and Analysis of Coconut Palm (Cocos nucifera) Mitochondrial Genome.

    Science.gov (United States)

    Aljohi, Hasan Awad; Liu, Wanfei; Lin, Qiang; Zhao, Yuhui; Zeng, Jingyao; Alamer, Ali; Alanazi, Ibrahim O; Alawad, Abdullah O; Al-Sadi, Abdullah M; Hu, Songnian; Yu, Jun

    2016-01-01

    Coconut (Cocos nucifera L.), a member of the palm family (Arecaceae), is one of the most economically important crops in the tropics, serving as an important source of food, drink, fuel, medicine, and construction material. Here we report an assembly of the coconut (C. nucifera, Oman local Tall cultivar) mitochondrial (mt) genome based on next-generation sequencing data. This genome, 678,653 bp in length and 45.5% in GC content, encodes 72 proteins, 9 pseudogenes, 23 tRNAs, and 3 ribosomal RNAs. Within the assembly, we find that the chloroplast (cp) derived regions account for 5.07% of the total assembly length, including 13 proteins, 2 pseudogenes, and 11 tRNAs. The mt genome has a relatively large fraction of repeat content (17.26%), including both forward (tandem) and inverted (palindromic) repeats. Sequence variation analysis shows that the Ti/Tv ratio of the mt genome is lower as compared to that of the nuclear genome and neutral expectation. By combining public RNA-Seq data for coconut, we identify 734 RNA editing sites supported by at least two datasets. In summary, our data provide the second complete mt genome sequence in the family Arecaceae, essential for further investigations on mitochondrial biology of seed plants.

  19. Effects of GABA[subscript A] Modulators on the Repeated Acquisition of Response Sequences in Squirrel Monkeys

    Science.gov (United States)

    Campbell, Una C.; Winsauer, Peter J.; Stevenson, Michael W.; Moerschbaecher, Joseph M.

    2004-01-01

    The present study investigated the effects of positive and negative GABA[subscript A] modulators under three different baselines of repeated acquisition in squirrel monkeys in which the monkeys acquired a three-response sequence on three keys under a second-order fixed-ratio (FR) schedule of food reinforcement. In two of these baselines, the…

  20. Short tandem repeat (STR) based genetic diversity and relationship of indigenous Niger cattle

    Directory of Open Access Journals (Sweden)

    M. Grema

    2017-11-01

    The diversity of cattle in Niger is predominantly represented by three indigenous breeds: Zebu Arabe, Zebu Bororo and Kuri. This study aimed at characterizing the genetic diversity and relationship of Niger cattle breeds using short tandem repeat (STR) marker variations. A total of 105 cattle from all three breeds were genotyped at 27 STR loci. High levels of allelic and gene diversity were observed with an overall mean of 8.7 and 0.724 respectively. The mean inbreeding estimate within breeds was found to be moderate with 0.024, 0.043 and 0.044 in Zebu Arabe, Zebu Bororo and Kuri cattle respectively. The global F statistics showed low genetic differentiation among Niger cattle with about 2.6 % of total variation being attributed to between-breed differences. Neighbor-joining tree derived from pairwise allele sharing distance revealed Zebu Arabe and Kuri clustering together while Zebu Bororo appeared to be relatively distinct from the other two breeds. High levels of admixture were evident from the distribution of pairwise inter-individual allele sharing distances that showed individuals across populations being more related than individuals within populations. Individuals were assigned to their respective source populations based on STR genotypes, and the percent correct assignment of Zebu Bororo (87.5 to 93.8 %) was consistently higher than Zebu Arabe (59.3 to 70.4 %) and Kuri (80.0 to 83.3 %) cattle. The qualitative and quantitative tests for mutation drift equilibrium revealed absence of genetic bottleneck events in Niger cattle in the recent past. High genetic diversity and poor genetic structure among indigenous cattle breeds of Niger might be due to historic zebu–taurine admixture and ongoing breeding practices in the region. The results of the present study are expected to help in formulating effective strategies for conservation and genetic improvement of indigenous Niger cattle breeds.

  1. Differential Regulation of Strand-Specific Transcripts from Arabidopsis Centromeric Satellite Repeats.

    Directory of Open Access Journals (Sweden)

    2005-12-01

    Centromeres interact with the spindle apparatus to enable chromosome disjunction and typically contain thousands of tandemly arranged satellite repeats interspersed with retrotransposons. While their role has been obscure, centromeric repeats are epigenetically modified and centromere specification has a strong epigenetic component. In the yeast Schizosaccharomyces pombe, long heterochromatic repeats are transcribed and contribute to centromere function via RNA interference (RNAi). In the higher plant Arabidopsis thaliana, as in mammalian cells, centromeric satellite repeats are short (180 base pairs), are found in thousands of tandem copies, and are methylated. We have found transcripts from both strands of canonical, bulk Arabidopsis repeats. At least one subfamily of 180-base pair repeats is transcribed from only one strand and regulated by RNAi and histone modification. A second subfamily of repeats is also silenced, but silencing is lost on both strands in mutants in the CpG DNA methyltransferase MET1, the histone deacetylase HDA6/SIL1, or the chromatin remodeling ATPase DDM1. This regulation is due to transcription from Athila2 retrotransposons, which integrate in both orientations relative to the repeats, and differs between strains of Arabidopsis. Silencing lost in met1 or hda6 is reestablished in backcrosses to wild-type, but silencing lost in RNAi mutants and ddm1 is not. Twenty-four-nucleotide small interfering RNAs from centromeric repeats are retained in met1 and hda6, but not in ddm1, and may have a role in this epigenetic inheritance. Histone H3 lysine-9 dimethylation is associated with both classes of repeats. We propose roles for transcribed repeats in the epigenetic inheritance and evolution of centromeres.

  2. First Worldwide Proficiency Study on Variable-Number Tandem-Repeat Typing of Mycobacterium tuberculosis Complex Strains

    Science.gov (United States)

    de Beer, Jessica L.; Kremer, Kristin; Ködmön, Csaba; Supply, Philip

    2012-01-01

    Although variable-number tandem-repeat (VNTR) typing has gained recognition as the new standard for the DNA fingerprinting of Mycobacterium tuberculosis complex (MTBC) isolates, external quality control programs have not yet been developed. Therefore, we organized the first multicenter proficiency study on 24-locus VNTR typing. Sets of 30 DNAs of MTBC strains, including 10 duplicate DNA samples, were distributed among 37 participating laboratories in 30 different countries worldwide. Twenty-four laboratories used an in-house-adapted method with fragment sizing by gel electrophoresis or an automated DNA analyzer, nine laboratories used a commercially available kit, and four laboratories used other methods. The intra- and interlaboratory reproducibilities of VNTR typing varied from 0% to 100%, with averages of 72% and 60%, respectively. Twenty of the 37 laboratories failed to amplify particular VNTR loci; if these missing results were ignored, the number of laboratories with 100% interlaboratory reproducibility increased from 1 to 5. The average interlaboratory reproducibility of VNTR typing using a commercial kit was better (88%) than that of in-house-adapted methods using a DNA analyzer (70%) or gel electrophoresis (50%). Eleven laboratories using in-house-adapted manual typing or automated typing scored inter- and intralaboratory reproducibilities of 80% or higher, which suggests that these approaches can be used in a reliable way. In conclusion, this first multicenter study has documented the worldwide quality of VNTR typing of MTBC strains and highlights the importance of international quality control to improve genotyping in the future. PMID:22170917

  3. Spoligotyping and variable number tandem repeat analysis of Mycobacterium bovis isolates from cattle in Brazil

    Directory of Open Access Journals (Sweden)

    Patrícia Martins Parreiras

    2012-02-01

    Full Text Available We performed spoligotyping and 12-locus mycobacterial interspersed repetitive unit-variable number tandem repeat (MIRU-VNTR) typing to characterise Mycobacterium bovis isolates collected from tissue samples of bovines with lesions suggestive of tuberculosis during slaughter inspection procedures in abattoirs in Brazil. High-quality genotypes were obtained with both procedures for 61 isolates that were obtained from 185 bovine tissue samples, and all of these isolates were identified as M. bovis by conventional identification procedures. On the basis of the spoligotyping, 53 isolates were grouped into nine clusters and the remaining eight isolates were unique types, resulting in 17 spoligotypes. The majority of the Brazilian M. bovis isolates displayed spoligotype patterns that have been previously observed in strains isolated from cattle in other countries. MIRU-VNTR typing produced 16 distinct genotypes, with 53 isolates forming eight of the groups, and individual isolates with unique VNTR profiles forming the remaining eight groups. The allelic diversity of each VNTR locus was calculated and only two of the 12 MIRU-VNTR loci presented scores with either a moderate (0.4, MIRU16) or high (0.6, MIRU26) discriminatory index (h). Both typing methods produced similar discriminatory indexes (spoligotyping h = 0.85; MIRU-VNTR h = 0.86) and the combination of the two methods increased the h value to 0.94, resulting in 29 distinct patterns. These results confirm that spoligotyping and VNTR analysis are valuable tools for studying the molecular epidemiology of M. bovis infections in Brazil.
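    The abstract reports discriminatory indexes for each typing method and for the two methods combined, but it does not state which formula was used. As a minimal sketch, assuming the widely used Hunter-Gaston index, the calculation could look like the following. The function name and the toy genotype labels are hypothetical and are not data from the study.

```python
from collections import Counter


def hunter_gaston_index(types):
    """Hunter-Gaston discriminatory index for a list of genotype labels.

    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is the number
    of isolates sharing genotype j and N is the total number of isolates.
    Assumption: the abstract does not say which index formula was used.
    """
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical toy data: four isolates typed by two methods.
spoligotypes = ["S1", "S1", "S1", "S2"]
vntr_types = ["V1", "V1", "V2", "V2"]

# Combining methods: two isolates match only if both labels match.
combined = list(zip(spoligotypes, vntr_types))

print(hunter_gaston_index(spoligotypes))
print(hunter_gaston_index(vntr_types))
print(hunter_gaston_index(combined))
```

    In this toy example the combined profile separates isolates that either method alone groups together, which is the same qualitative effect the abstract describes when the index rises for the combined methods.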

  4. Repeatability and Reproducibility in Proteomic Identifications by Liquid Chromatography—Tandem Mass Spectrometry

    Science.gov (United States)

    Tabb, David L.; Vega-Montoto, Lorenzo; Rudnick, Paul A.; Variyath, Asokan Mulayath; Ham, Amy-Joan L.; Bunk, David M.; Kilpatrick, Lisa E.; Billheimer, Dean D.; Blackman, Ronald K.; Cardasis, Helene L.; Carr, Steven A.; Clauser, Karl R.; Jaffe, Jacob D.; Kowalski, Kevin A.; Neubert, Thomas A.; Regnier, Fred E.; Schilling, Birgit; Tegeler, Tony J.; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Fisher, Susan J.; Gibson, Bradford W.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Steven E.; Tempst, Paul; Paulovich, Amanda G.; Liebler, Daniel C.; Spiegelman, Cliff

    2009-01-01

    The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35–60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind

  5. Development and Characterization of Simple Sequence Repeat (SSR) Markers Based on RNA-Sequencing of Medicago sativa and In silico Mapping onto the M. truncatula Genome

    Science.gov (United States)

    Wang, Zan; Yu, Guohui; Shi, Binbin; Wang, Xuemin; Qiang, Haiping; Gao, Hongwen

    2014-01-01

    Sufficient codominant genetic markers are needed for various genetic investigations in alfalfa since the species is an outcrossing autotetraploid. With the newly developed next generation sequencing technology, a large amount of transcribed sequences of alfalfa have been generated and are available for identifying SSR markers by data mining. A total of 54,278 alfalfa non-redundant unigenes were assembled through the Illumina HiSeq™ 2000 sequencing technology. Based on 3,903 unigene sequences, 4,493 SSRs were identified. Tri-nucleotide repeats (56.71%) were the most abundant motif class while AG/CT (21.7%), AGG/CCT (19.8%), AAC/GTT (10.3%), ATC/ATG (8.8%), and ACC/GGT (6.3%) were the subsequent top five nucleotide repeat motifs. Eight hundred and thirty-seven EST-SSR primer pairs were successfully designed. Of these, 527 (63%) primer pairs yielded clear and scored PCR products and 372 (70.6%) exhibited polymorphisms. High transferability was observed for ssp falcata at 99.2% (523) and 71.7% (378) in M. truncatula. In addition, 313 of 527 SSR marker sequences were in silico mapped onto the eight M. truncatula chromosomes. Thirty-six polymorphic SSR primer pairs were used in the genetic relatedness analysis of 30 Chinese alfalfa cultivated accessions generating a total of 199 scored alleles. The mean observed heterozygosity and polymorphic information content were 0.767 and 0.635, respectively. The codominant markers not only enriched the current resources of molecular markers in alfalfa, but also would facilitate targeted investigations in marker-trait association, QTL mapping, and genetic diversity analysis in alfalfa. PMID:24642969
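    The abstract describes mining SSRs from assembled unigenes but does not name the tool or the thresholds. A minimal sketch of the general idea, assuming perfect (uninterrupted) repeats and hypothetical minimum repeat counts of 6 for dinucleotide and 5 for trinucleotide motifs, might be:

```python
import re


def is_primitive(motif):
    """Return True if the motif is not itself a repeat of a shorter unit."""
    n = len(motif)
    return not any(
        n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n)
    )


def find_ssrs(seq, min_repeats=None):
    """Find perfect SSRs; returns sorted (start, motif, repeat_count) tuples.

    min_repeats maps motif length to the minimum number of tandem copies.
    The defaults are illustrative assumptions, not values from the study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5}
    seq = seq.upper()
    hits = []
    for k, min_n in sorted(min_repeats.items()):
        # One k-mer followed by at least (min_n - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip e.g. "AA" repeats, which are really mononucleotide runs.
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


# Hypothetical test sequence with one (AG)7 and one (AAC)5 repeat.
example = "TTT" + "AG" * 7 + "TCT" + "AAC" * 5 + "G"
for start, motif, count in find_ssrs(example):
    print(start, motif, count)
```

    The regular expression reports the leftmost phase of a repeat, so the same locus may be reported as AG or GA depending on the flanking bases; dedicated SSR tools usually normalize motifs to a canonical form, which this sketch does not attempt.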

  6. Complete chloroplast genome and 45S nrDNA sequences of the medicinal plant species Glycyrrhiza glabra and Glycyrrhiza uralensis.

    Science.gov (United States)

    Kang, Sang-Ho; Lee, Jeong-Hoon; Lee, Hyun Oh; Ahn, Byoung Ohg; Won, So Youn; Sohn, Seong-Han; Kim, Jung Sun

    2017-10-06

    Glycyrrhiza uralensis and G. glabra, members of the Fabaceae, are medicinally important species that are native to Asia and Europe. Extracts from these plants are widely used as natural sweeteners because of their much greater sweetness than sucrose. In this study, the three complete chloroplast genomes and five 45S nuclear ribosomal (nr)DNA sequences of these two licorice species and an interspecific hybrid are presented. The chloroplast genomes of G. glabra, G. uralensis and G. glabra × G. uralensis were 127,895 bp, 127,716 bp and 127,939 bp, respectively. The three chloroplast genomes harbored 110 annotated genes, including 76 protein-coding genes, 30 tRNA genes and 4 rRNA genes. The 45S nrDNA sequences were either 5,947 or 5,948 bp in length. Glycyrrhiza glabra and G. glabra × G. uralensis showed two types of nrDNA, while G. uralensis contained a single type. The complete 45S nrDNA sequence unit contains 18S rRNA, ITS1, 5.8S rRNA, ITS2 and 26S rRNA. We identified simple sequence repeat and tandem repeat sequences. We also developed four reliable markers for analysis of Glycyrrhiza diversity authentication.

  7. Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

    Directory of Open Access Journals (Sweden)

    Varala Kranthi

    2007-05-01

    Full Text Available Abstract. Background: Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However, these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results: We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis). Conclusion: This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.

  8. Association between Interleukin-1 Receptor Antagonist (IL1RN) Variable Number of Tandem Repeats (VNTR) Polymorphism and Pulmonary Tuberculosis.

    Science.gov (United States)

    Hashemi, Mohammad; Naderi, Mohammad; Ebrahimi, Mahboubeh; Amininia, Shadi; Bahari, Gholamreza; Taheri, Mohsen; Eskandari-Nasab, Ebrahim; Ghavami, Saeid

    2015-02-01

    Macrophages and T-lymphocytes are involved in the immune response to Mycobacterium tuberculosis. Macrophages produce interleukin (IL)-1 as an inflammatory mediator. IL-1 receptor antagonist (IL1-Ra) is a natural antagonist of IL-1 receptors. In this study we aimed to examine the possible association between the variable number of tandem repeats (VNTR) of the IL-1 receptor antagonist (IL1RN) gene and pulmonary tuberculosis (PTB) in a sample of the Iranian population. In this case-control study, we examined the VNTR of the IL1RN gene in 265 PTB patients and 250 healthy subjects by PCR. Neither the overall chi-square comparison of PTB and control subjects nor the logistic regression analysis indicated any association between the VNTR IL1RN polymorphism and PTB. Our data suggest that the VNTR IL1RN polymorphism may not be associated with the risk of PTB in a sample of the Iranian population. Larger studies with different ethnicities are needed to determine the impact of the IL1RN VNTR polymorphism on the risk of developing TB.

  9. A Mitochondrial Genome of Rhyparochromidae (Hemiptera: Heteroptera) and a Comparative Analysis of Related Mitochondrial Genomes.

    Science.gov (United States)

    Li, Teng; Yang, Jie; Li, Yinwan; Cui, Ying; Xie, Qiang; Bu, Wenjun; Hillis, David M

    2016-10-19

    The Rhyparochromidae, the largest family of Lygaeoidea, encompasses more than 1,850 described species, but no mitochondrial genome has been sequenced to date. Here we describe the first mitochondrial genome for Rhyparochromidae: a complete mitochondrial genome of Panaorus albomaculatus (Scott, 1874). This mitochondrial genome is comprised of 16,345 bp, and contains the expected 37 genes and control region. The majority of the control region is made up of a large tandem-repeat region, which has a novel pattern not previously observed in other insects. The tandem-repeats region of P. albomaculatus consists of 53 tandem duplications (including one partial repeat), which is the largest number of tandem repeats among all the known insect mitochondrial genomes. Slipped-strand mispairing during replication is likely to have generated this novel pattern of tandem repeats. Comparative analysis of tRNA gene families in sequenced Pentatomomorpha and Lygaeoidea species shows that the pattern of nucleotide conservation is markedly higher on the J-strand. Phylogenetic reconstruction based on mitochondrial genomes suggests that Rhyparochromidae is not the sister group to all the remaining Lygaeoidea, and supports the monophyly of Lygaeoidea.

  10. Strategies in protein sequencing and characterization: Multi-enzyme digestion coupled with alternate CID/ETD tandem mass spectrometry

    Energy Technology Data Exchange (ETDEWEB)

    Nardiello, Donatella; Palermo, Carmen, E-mail: carmen.palermo@unifg.it; Natale, Anna; Quinto, Maurizio; Centonze, Diego

    2015-01-07

    Highlights: • Multi-enzyme digestion for protein sequencing and characterization by CID/ETD. • Simultaneous use of trypsin/chymotrypsin for the maximization of sequence. • Identification of PTMs, sequence variants and species-specific residues. • Increase of accuracy in sequence assignments by orthogonal fragmentation techniques. - Abstract: A strategy based on a simultaneous multi-enzyme digestion coupled with electron transfer dissociation (ETD) and collision-induced dissociation (CID) was developed for protein sequencing and characterization, as a valid alternative platform in ion-trap based proteomics. The effect of different proteolytic procedures using chymotrypsin, trypsin, a combination of both, and Lys-C was carefully evaluated in terms of number of identified peptides, protein coverage, and score distribution. A systematic comparison between CID and ETD is shown for the analysis of peptides originating from the in-solution digestion of standard caseins. The best results were achieved with a trypsin/chymotrypsin mix combined with CID and ETD operating in alternating mode. A post-database search validation of the MS/MS dataset was performed; the matched peptides were then cross-checked by evaluating ion scores, rank, number of experimental product ions, and their relative abundances in the MS/MS spectrum. By integrated CID/ETD experiments, high-quality spectra were obtained, thus allowing a confirmation of spectral information and an increase of accuracy in peptide sequence assignments. Overlapping peptides, produced throughout the proteins, reduce the ambiguity in mapping modifications between natural variants and animal species, and allow the characterization of post-translational modifications. The advantages of using the enzymatic mix trypsin/chymotrypsin were confirmed by the nanoLC and CID/ETD tandem mass spectrometry of goat milk proteins, previously separated by two-dimensional gel electrophoresis.

  11. Effect of ATRX and G-Quadruplex Formation by the VNTR Sequence on α-Globin Gene Expression.

    Science.gov (United States)

    Li, Yue; Syed, Junetha; Suzuki, Yuki; Asamitsu, Sefan; Shioda, Norifumi; Wada, Takahito; Sugiyama, Hiroshi

    2016-05-17

    ATR-X (α-thalassemia/mental retardation X-linked) syndrome is caused by mutations in chromatin remodeler ATRX. ATRX can bind the variable number of tandem repeats (VNTR) sequence in the promoter region of the α-globin gene cluster. The VNTR sequence, which contains the potential G-quadruplex-forming sequence CGC(GGGGCGGGG)n, is involved in the downregulation of α-globin expression. We investigated G-quadruplex and i-motif formation in single-stranded DNA and long double-stranded DNA. The promoter region without the VNTR sequence showed approximately twofold higher luciferase activity than the promoter region harboring the VNTR sequence. G-quadruplex stabilizers hemin and TMPyP4 reduced the luciferase activity, whereas expression of ATRX led to a recovery in reporter activity. Our results demonstrate that stable G-quadruplex formation by the VNTR sequence downregulates the expression of α-globin genes and that ATRX might bind to and resolve the G-quadruplex. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
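    The repeat pattern CGC(GGGGCGGGG)n given in the abstract can be located in a sequence with a simple pattern match. The sketch below is only an illustration of counting n for each occurrence; the function name and the test sequence are hypothetical, and only exact matches on the given strand are considered.

```python
import re

# "CGC" followed by one or more exact copies of the 9-nt unit GGGGCGGGG.
VNTR_PATTERN = re.compile(r"CGC((?:GGGGCGGGG)+)")
UNIT_LENGTH = len("GGGGCGGGG")


def count_vntr_units(seq):
    """Return (start, n) for each CGC(GGGGCGGGG)n match in seq."""
    return [
        (m.start(), len(m.group(1)) // UNIT_LENGTH)
        for m in VNTR_PATTERN.finditer(seq.upper())
    ]


# Hypothetical sequence carrying three copies of the unit.
example = "AT" + "CGC" + "GGGGCGGGG" * 3 + "TA"
print(count_vntr_units(example))
```

    Real VNTR alleles may contain imperfect units, so an exact pattern like this one would undercount them; an alignment-based approach would be needed in that case.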

  12. Genotyping analysis of Helicobacter pylori using multiple-locus variable-number tandem-repeats analysis in five regions of China and Japan

    Directory of Open Access Journals (Sweden)

    Zhang Jinyong

    2011-09-01

    Full Text Available Abstract. Background: Helicobacter pylori (H. pylori) is the major causative agent of chronic active gastritis. The H. pylori population shows high genomic variability among isolates, and polymorphism in genomic repeat units has played an important part in its evolution. Long-term colonization of the stomach leads to different clinical outcomes, which may relate to the high degree of genetic variation of H. pylori. A variety of molecular typing tools have been developed to assess genetic relatedness among H. pylori isolates. However, there is still no standard genotyping system for this bacterium. Multiple-locus variable-number tandem-repeat analysis (MLVA) is useful for phylogenetic analysis and is widely used in bacterial genotyping, but it has rarely been applied to H. pylori. This article is the first application of the MLVA method to investigate H. pylori from different districts and ethnic groups of China. Results: MLVA of 12 VNTR loci with high discriminatory power, selected from 30 candidates, was performed on a collection of 202 H. pylori strains originating from five regions of China and Japan. A phylogenetic tree was constructed from the MLVA profiles. The 12 VNTR loci were highly polymorphic, and the results demonstrated very close relationships between genotypes and ethnic groups. Conclusions: This study used MLVA methodology to provide a new perspective on the ethnic groups and distribution characteristics of H. pylori.

  13. [Evaluation of different sets of variable number of tandem repeats loci for genotyping Mycobacterium tuberculosis isolates in China].

    Science.gov (United States)

    Liu, Mei; Luo, Tao; Yang, Chongguang; Liu, Qingyun; Gao, Qian

    2015-10-01

    To identify a variable number of tandem repeats (VNTR) typing method that is suitable for molecular epidemiological study of tuberculosis in China. We systematically evaluated the commonly used VNTR typing methods, including 4 methods (MIRU-12, VNTR-15/VNTR-24 and VNTR "24+4") proposed by foreign colleagues and 2 methods (VNTR-L15 and VNTR "9+3") developed by domestic researchers, using a population-based collection of 891 clinical isolates from 5 provinces across the country. The order (from high to low) of discriminatory power for the 6 VNTR typing methods was VNTR "24+4", VNTR "9+3", VNTR-24, VNTR-15, VNTR-L15 and MIRU-12. The discriminatory power of VNTR "9+3" was comparable with that of VNTR "24+4" and higher than that of VNTR-15/24. The concordance for defining clustered and unique genotypes between VNTR "9+3" and VNTR "24+4" was 96.59%. Our results suggest that VNTR "9+3" is a suitable method for molecular typing of M. tuberculosis in China, considering its high discriminatory power, high consistency with VNTR "24+4" and relatively small number of VNTR loci.

  14. Features of Variable Number of Tandem Repeats in Yersinia pestis and the Development of a Hierarchical Genotyping Scheme.

    Directory of Open Access Journals (Sweden)

    Yanjun Li

    Full Text Available Variable number of tandem repeats (VNTRs) that are widely distributed in the genome of Yersinia pestis proved to be useful markers for the genotyping and source-tracing of this notorious pathogen. In this study, we probed into the features of VNTRs in the Y. pestis genome and developed a simple hierarchical genotyping system based on optimized VNTR loci. Capillary electrophoresis was used in this study for multi-locus VNTR analysis (MLVA) in 956 Y. pestis strains. The general features and genetic diversities of 88 VNTR loci in Y. pestis were analyzed with BioNumerics, and a "14+12" loci-based hierarchical genotyping system, which is compatible with single nucleotide polymorphism-based phylogenetic analysis, was established. Appropriate selection of target loci reduces the impact of homoplasies caused by the rapid mutation rates of VNTR loci. The optimized "14+12" loci are highly discriminative in genotyping and source-tracing Y. pestis for molecular epidemiological or microbial forensic investigations with less time and lower cost. An MLVA genotyping dataset of representative strains will improve future research on the source-tracing and microevolution of Y. pestis.

  15. JAERI tandem-accelerator and tandem-booster

    Energy Technology Data Exchange (ETDEWEB)

    Yoshida, Tadashi [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment

    1998-03-01

    In 1982, the tandem accelerator of the Japan Atomic Energy Research Institute (JAERI) was installed with the aim of opening new directions in atomic energy research. In fiscal year 1993, superconducting boosters that can increase the ion energy by up to a factor of 4 were added, and research in the region below 1000 MeV became possible. These are electrostatic-type accelerators, which are easy to use, especially in basic research, and are useful for future research. The tandem accelerator has been operated with first-class performance as an accelerator for various kinds of heavy-ion beams. It has a distinctive shape among electrostatic-type accelerators and is excellent in ease of control and stability. The main specifications of the tandem accelerator are shown. As for the ion sources of the tandem accelerator, three cesium sputter-type ion sources are installed on two high-voltage stands. The ions that can be accelerated are mainly negative ions. As an improvement, electron cyclotron resonance (ECR) ion sources are expected to be adopted. For the tandem boosters, quarter-wavelength resonant cavities (hollow cylinders) were adopted. The configuration of the tandem boosters is explained, and the ways of using the tandem accelerator system and its future aims are reported. (K.I.)

  16. Genomic Characterization for Parasitic Weeds of the Genus Striga by Sample Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Matt C. Estep

    2012-03-01

    Full Text Available Generation of ∼2200 Sanger sequence reads or ∼10,000 454 reads for seven Striga Lour. DNA samples (five species) allowed identification of the highly repetitive DNA content in these genomes. The 14 most abundant repeats in these species were identified and partially assembled. Annotation indicated that they represent nine long terminal repeat (LTR) retrotransposon families, three tandem satellite repeats, one long interspersed element (LINE) retroelement, and one DNA transposon. All of these repeats are most closely related to repetitive elements in other closely related plants and are not products of horizontal transfer from their host species. These repeats were differentially abundant in each species, with the LTR retrotransposons and satellite repeats most responsible for variation in genome size. Each species had some repetitive elements that were more abundant and some less abundant than the other species examined, indicating that no single element or any unilateral growth or decrease trend in genome behavior was responsible for variation in genome size and composition. Genome sizes were determined by flow sorting, and the values of 615 Mb [ (L.) Kuntze], 1330 Mb [ (Willd.) Vatke], 1425 Mb [ (Delile) Benth.] and 2460 Mb ( Benth.) suggest a ploidy series, a prediction supported by repetitive DNA sequence analysis. Phylogenetic analysis using six chloroplast loci indicated the ancestral relationships of the five most agriculturally important species, with the unexpected result that the one parasite of dicotyledonous plants was found to be more closely related to some of the grass parasites than many of the grass parasites are to each other.

  17. Potential Role of the Last Half Repeat in TAL Effectors Revealed by a Molecular Simulation Study

    Directory of Open Access Journals (Sweden)

    Hua Wan

    2016-01-01

    Full Text Available TAL effectors (TALEs) contain a modular DNA-binding domain that is composed of tandem repeats. In all naturally occurring TALEs, the tandem repeats invariably end with a truncated half repeat. To investigate the potential role of the last half repeat in TALEs, we performed comparative molecular dynamics simulations for the crystal structure of DNA-bound TALE AvrBs3 lacking the last half repeat and its modeled structure having the last half repeat. The structural stability analysis indicates that the modeled system is more stable than the nonmodeled system. Principal component analysis shows that AvrBs3 increases its structural compactness in the presence of the last half repeat. The comparison of DNA groove parameters of the two systems implies that the last half repeat also changes the DNA major groove binding efficiency. Subsequent hydrogen-bond calculations reveal that, by stabilizing the phosphate binding with DNA at the C-terminus, the last half repeat helps the protein-DNA interface adopt a compact conformation. It further mediates more contacts between TAL repeats and DNA nucleotide bases. Finally, we suggest that the last half repeat is required for highly efficient recognition of DNA by TALEs.

  18. Genetic characterization of UCS region of Pneumocystis jirovecii and construction of allelic profiles of Indian isolates based on sequence typing at three regions.

    Science.gov (United States)

    Gupta, Rashmi; Mirdha, Bijay Ranjan; Guleria, Randeep; Kumar, Lalit; Luthra, Kalpana; Agarwal, Sanjay Kumar; Sreenivas, Vishnubhatla

    2013-01-01

    Pneumocystis jirovecii is an opportunistic pathogen that causes severe pneumonia in immunocompromised patients. To study the genetic diversity of P. jirovecii in India the upstream conserved sequence (UCS) region of Pneumocystis genome was amplified, sequenced and genotyped from a set of respiratory specimens obtained from 50 patients with a positive result for nested mitochondrial large subunit ribosomal RNA (mtLSU rRNA) PCR during the years 2005-2008. Of these 50 cases, 45 showed a positive PCR for UCS region. Variations in the tandem repeats in UCS region were characterized by sequencing all the positive cases. Of the 45 cases, one case showed five repeats, 11 cases showed four repeats, 29 cases showed three repeats and four cases showed two repeats. By running amplified DNA from all these cases on a high-resolution gel, mixed infection was observed in 12 cases (26.7%, 12/45). Forty three of 45 cases included in this study had previously been typed at mtLSU rRNA and internal transcribed spacer (ITS) region by our group. In the present study, the genotypes at those two regions were combined with UCS repeat patterns to construct allelic profiles of 43 cases. A total of 36 allelic profiles were observed in 43 isolates indicating high genetic variability. A statistically significant association was observed between mtLSU rRNA genotype 1, ITS type Ea and UCS repeat pattern 4. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. Use of multiple-locus variable-number of tandem repeats analysis (MLVA) to investigate genetic diversity of Salmonella enterica subsp. enterica serovar Typhimurium isolates from human, food, and veterinary sources

    DEFF Research Database (Denmark)

    Mateva, Gergana; Pedersen, Karl; Sørensen, Gitte

    2017-01-01

    -locus variable-number of tandem repeats analysis (MLVA) and compared results with antimicrobial resistance (AMR) determinations for 100 S. Typhimurium strains isolated in Bulgaria during 2008-2012 (50 veterinary/food and 50 human isolates). Results showed that isolates were divided into 80 and 34 groups using......). No clustering of isolates related to susceptibility/resistance to antimicrobials, source of isolation, or year of isolation was observed. Some MLVA types were found in both human and veterinary/food isolates, indicating a possible route of transmission. A majority (83%) of the isolates were found...

  20. Comparison of the degree of homology of DNA and quantity of repeated sequences in an intact plant and cell structure

    International Nuclear Information System (INIS)

    Solov'yan, V.T.; Kunaleh, V.A.; Shumnyl, V.K.; Vershinin, A.V.

    1986-01-01

    This paper attempts to assess the quantity of repeated sequences and degree of homology of DNA in the intact plant and two lines of callus tissue of Rauwolfia serpentina Benth maintained for 20 years, which differ among themselves in the level of biosynthesis of the pharmacologically valuable alkaloid ajmaline. The tritium-labeled repeats of plants and calli were used in direct and reverse hybridization on nitrocellulose filters. Hybridization of 3H-labeled repeats with phage 17 DNA was used as control. The radioactivity of filters after washing was measured in a liquid scintillation counter.

  1. Survey and analysis of simple sequence repeats in the Laccaria bicolor genome, with development of microsatellite markers

    Energy Technology Data Exchange (ETDEWEB)

    Labbe, Jessy L [ORNL; Murat, Claude [INRA, Nancy, France; Morin, Emmanuelle [INRA, Nancy, France; Le Tacon, F [UMR, France; Martin, Francis [INRA, Nancy, France

    2011-01-01

    It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.

  2. Association of number of tandem repeats in two important adhesins in Mycoplasma hyopneumoniae

    Directory of Open Access Journals (Sweden)

    L. F. dos Santos

    2015-10-01

    Full Text Available Abstract: Genetic diversity of Mycoplasma hyopneumoniae has been reported in multiple-locus variable-number tandem-repeat analysis (MLVA). The aim of this study was to describe the spatial distribution and genetic heterogeneity of M. hyopneumoniae types in Brazil, and to investigate the correlation between repeat regions 1 (RR1) and 3 (RR3) of two important adhesins (P97 and P146). Thirty-nine MLVA types were identified based on the number of tandem repeats in P97 RR1 and P146 RR3. A significant negative correlation (Spearman's rho = -0.26; P = 0.022) between P97 RR1 and P146 RR3 was observed, which suggests a possible compensatory mechanism that would allow the bacterium to maintain its adhesion capacity. The results contribute to understanding the epidemiology of M. hyopneumoniae in the world's fourth-largest pig-producing country.
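    The abstract reports a Spearman's rho between repeat numbers at two loci. As a minimal sketch of how such a rank correlation is computed (Pearson correlation on average ranks, with ties sharing a mean rank), the following could be used. The repeat counts below are hypothetical and are not data from the study.

```python
def average_ranks(values):
    """Assign 1-based ranks; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length of at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(x)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    if sx == 0 or sy == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / (sx * sy)


# Hypothetical repeat counts per isolate at two loci.
rr1_repeats = [8, 9, 10, 11, 12]
rr3_repeats = [30, 27, 25, 22, 20]
print(spearman_rho(rr1_repeats, rr3_repeats))
```

    A negative rho means that isolates with more repeats at one locus tend to have fewer at the other; the significance test behind the reported P value is not covered by this sketch.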

  3. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Directory of Open Access Journals (Sweden)

    Amanda Malvessi Cattani

    Full Text Available Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  4. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Science.gov (United States)

    Cattani, Amanda Malvessi; Siqueira, Franciele Maboni; Guedes, Rafael Lucas Muniz; Schrank, Irene Silveira

    2016-01-01

    Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  5. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    Directory of Open Access Journals (Sweden)

    Gao Zhihong

    2010-07-01

    Full Text Available Abstract Background Expressed sequence tags (ESTs) have been a cost-effective tool in molecular biology and represent an abundant, valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 high-quality expressed sequence tag (EST) sequences. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons, which have been deposited in NCBI (accession IDs: GW868575 - GW873047); among these, 1,294 unique ESTs had known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test transferability to peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. Marker polymorphism was high in the plum species (65%) and low in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further studies such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.

  6. Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

    Science.gov (United States)

    Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

    2015-10-01

    Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.

  7. Linkage of congenital isolated adrenocorticotropic hormone deficiency to the corticotropin releasing hormone locus using simple sequence repeat polymorphisms

    Energy Technology Data Exchange (ETDEWEB)

    Kyllo, J.H.; Collins, M.M.; Vetter, K.L. [Univ. of Iowa College of Medicine, Iowa City, IA (United States)] [and others]

    1996-03-29

    Genetic screening techniques using simple sequence repeat polymorphisms were applied to investigate the molecular nature of congenital isolated adrenocorticotropic hormone (ACTH) deficiency. We hypothesize that this rare cause of hypocortisolism shared by a brother and sister with two unaffected sibs and unaffected parents is inherited as an autosomal recessive single gene mutation. Genes involved in the hypothalamic-pituitary axis controlling cortisol sufficiency were investigated for a causal role in this disorder. Southern blotting showed no detectable mutations of the gene encoding pro-opiomelanocortin (POMC), the ACTH precursor. Other candidate genes subsequently considered were those encoding neuroendocrine convertase-1, and neuroendocrine convertase-2 (NEC-1, NEC-2), and corticotropin releasing hormone (CRH). Tests for linkage were performed using polymorphic di- and tetranucleotide simple sequence repeat markers flanking the reported map locations for POMC, NEC-1, NEC-2, and CRH. The chromosomal haplotypes determined by the markers flanking the loci for POMC, NEC-1, and NEC-2 were not compatible with linkage. However, 22 individual markers defining the chromosomal haplotypes flanking CRH were compatible with linkage of the disorder to the immediate area of this gene of chromosome 8. Based on these data, we hypothesize that the ACTH deficiency in this family is due to an abnormality of CRH gene structure or expression. These results illustrate the useful application of high density genetic maps constructed with simple sequence repeat markers for inclusion/exclusion studies of candidate genes in even very small nuclear families segregating for unusual phenotypes. 25 refs., 5 figs., 2 tabs.

  8. Development of new VNTR markers for pike and assessment of variability at di- and tetranucleotide repeat microsatellite loci

    DEFF Research Database (Denmark)

    Hansen, Michael Møller; Taggart, J.B.; Meldrup, Dorte

    1999-01-01

    Levels of variation at six VNTR (variable number of tandem repeats) loci, one minisatellite and five microsatellite loci, isolated from tri- and tetranucleotide enriched DNA libraries for northern pike were generally low in two Danish populations (1-4 alleles; expected heterozygosity 0-0.57), tho...

  9. Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple

    Science.gov (United States)

    2012-01-01

    Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding

  10. Creation and structure determination of an artificial protein with three complete sequence repeats

    Energy Technology Data Exchange (ETDEWEB)

    Adachi, Motoyasu, E-mail: adachi.motoyasu@jaea.go.jp; Shimizu, Rumi; Kuroki, Ryota [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Blaber, Michael [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Florida State University, Tallahassee, FL 32306-4300 (United States)

    2013-11-01

    An artificial protein with three complete sequence repeats was created and the structure was determined by X-ray crystallography. The structure showed threefold symmetry even though the chain has an amino and a carboxy terminus. The artificial protein with threefold symmetry may be useful as a scaffold to capture small materials with C3 symmetry. Symfoil-4P is a de novo protein exhibiting the threefold symmetrical β-trefoil fold designed based on the human acidic fibroblast growth factor. The first three asparagine–glycine sequences of Symfoil-4P are replaced with glutamine–glycine (Symfoil-QG) or serine–glycine (Symfoil-SG) sequences to protect against deamidation, and His-Symfoil-II was prepared by introducing a protease digestion site into Symfoil-QG so that Symfoil-II has three complete repeats after removal of the N-terminal histidine tag. The Symfoil-QG and SG and His-Symfoil-II proteins were expressed in Escherichia coli as soluble protein, and purified by nickel affinity chromatography. Symfoil-II was further purified by anion-exchange chromatography after removing the HisTag by proteolysis. Both Symfoil-QG and Symfoil-II were crystallized in 0.1 M Tris-HCl buffer (pH 7.0) containing 1.8 M ammonium sulfate as precipitant at 293 K; several crystal forms were observed for Symfoil-QG and II. The Symfoil-QG and II crystals diffracted to maximum resolutions of 1.5 and 1.1 Å, respectively. The Symfoil-II without histidine tag diffracted better than Symfoil-QG with N-terminal histidine tag. Although the crystal packing of Symfoil-II is slightly different from Symfoil-QG and other crystals of Symfoil derivatives having the N-terminal histidine tag, the refined crystal structure of Symfoil-II showed pseudo-threefold symmetry as expected from other Symfoils. Since the removal of the unstructured N-terminal histidine tag did not affect the threefold structure of Symfoil, the improvement of diffraction quality of Symfoil-II may be caused by molecular characteristics of

  11. Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues.

    Science.gov (United States)

    Barallon, Rita; Bauer, Steven R; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G; Elmore, Eugene; Furtado, Manohar; Kline, Margaret C; Kohara, Arihiro; Los, Georgyi V; MacLeod, Roderick A F; Masters, John R W; Nardone, Mark; Nardone, Roland M; Nims, Raymond W; Price, Paul J; Reid, Yvonne A; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F; Storts, Douglas R; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

    2010-10-01

    Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues.

  12. Development of a Hierarchical Variable-Number Tandem Repeat Typing Scheme for Mycobacterium tuberculosis in China

    Science.gov (United States)

    Luo, Tao; Yang, Chongguang; Pang, Yu; Zhao, Yanlin; Mei, Jian; Gao, Qian

    2014-01-01

    Molecular typing based on variable-number tandem repeats (VNTR) analysis is a promising tool for identifying transmission of Mycobacterium tuberculosis. However, the currently proposed 15- and 24-locus VNTR sets (VNTR-15/24) only have limited resolution and contain too many loci for large-scale typing in high burden countries. To develop an optimal typing scheme in China, we evaluated the resolution and robustness of 25 VNTR loci, using population-based collections of 1362 clinical isolates from six provinces across the country. The resolution of most loci showed considerable variations among regions. By calculating the average resolution of all possible combinations of 20 robust loci, we identified an optimal locus set with a minimum of 9 loci (VNTR-9) that could achieve comparable resolution of the standard VNTR-15. The VNTR-9 had consistently high resolutions in all six regions, and it was highly concordant with VNTR-15 for defining both clustered and unique genotypes. Furthermore, VNTR-9 was phylogenetically informative for classifying lineages/sublineages of M. tuberculosis. Three hypervariable loci (HV-3), VNTR 3232, VNTR 3820 and VNTR 4120, were proved important for further differentiating unrelated clustered strains based on VNTR-9. We propose the optimized VNTR-9 as first-line method and the HV-3 as second-line method for molecular typing of M. tuberculosis in China and surrounding countries. The development of hierarchical VNTR typing methods that can achieve high resolution with a small number of loci could be suitable for molecular epidemiology study in other high burden countries. PMID:24586989

  13. Recommendation of short tandem repeat profiling for authenticating human cell lines, stem cells, and tissues

    Science.gov (United States)

    Barallon, Rita; Bauer, Steven R.; Butler, John; Capes-Davis, Amanda; Dirks, Wilhelm G.; Furtado, Manohar; Kline, Margaret C.; Kohara, Arihiro; Los, Georgyi V.; MacLeod, Roderick A. F.; Masters, John R. W.; Nardone, Mark; Nardone, Roland M.; Nims, Raymond W.; Price, Paul J.; Reid, Yvonne A.; Shewale, Jaiprakash; Sykes, Gregory; Steuer, Anton F.; Storts, Douglas R.; Thomson, Jim; Taraporewala, Zenobia; Alston-Roberts, Christine; Kerrigan, Liz

    2010-01-01

    Cell misidentification and cross-contamination have plagued biomedical research for as long as cells have been employed as research tools. Examples of misidentified cell lines continue to surface to this day. Efforts to eradicate the problem by raising awareness of the issue and by asking scientists voluntarily to take appropriate actions have not been successful. Unambiguous cell authentication is an essential step in the scientific process and should be an inherent consideration during peer review of papers submitted for publication or during review of grants submitted for funding. In order to facilitate proper identity testing, accurate, reliable, inexpensive, and standardized methods for authentication of cells and cell lines must be made available. To this end, an international team of scientists is, at this time, preparing a consensus standard on the authentication of human cells using short tandem repeat (STR) profiling. This standard, which will be submitted for review and approval as an American National Standard by the American National Standards Institute, will provide investigators guidance on the use of STR profiling for authenticating human cell lines. Such guidance will include methodological detail on the preparation of the DNA sample, the appropriate numbers and types of loci to be evaluated, and the interpretation and quality control of the results. Associated with the standard itself will be the establishment and maintenance of a public STR profile database under the auspices of the National Center for Biotechnology Information. The consensus standard is anticipated to be adopted by granting agencies and scientific journals as appropriate methodology for authenticating human cell lines, stem cells, and tissues. PMID:20614197

  14. TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads.

    Science.gov (United States)

    Novák, Petr; Ávila Robledillo, Laura; Koblížková, Andrea; Vrbová, Iva; Neumann, Pavel; Macas, Jirí

    2017-07-07

    Satellite DNA is one of the major classes of repetitive DNA, characterized by tandemly arranged repeat copies that form contiguous arrays up to megabases in length. This type of genomic organization makes satellite DNA difficult to assemble, which hampers characterization of satellite sequences by computational analysis of genomic contigs. Here, we present tandem repeat analyzer (TAREAN), a novel computational pipeline that circumvents this problem by detecting satellite repeats directly from unassembled short reads. The pipeline first employs graph-based sequence clustering to identify groups of reads that represent repetitive elements. Putative satellite repeats are subsequently detected by the presence of circular structures in their cluster graphs. Consensus sequences of repeat monomers are then reconstructed from the most frequent k-mers obtained by decomposing read sequences from corresponding clusters. The pipeline performance was successfully validated by analyzing low-pass genome sequencing data from five plant species where satellite DNA was previously experimentally characterized. Moreover, novel satellite repeats were predicted for the genome of Vicia faba and three of these repeats were verified by detecting their sequences on metaphase chromosomes using fluorescence in situ hybridization. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. UV-induced tandem double mutations in the trpA gene of E. coli

    International Nuclear Information System (INIS)

    Piechocki, R.; Langhammer, R.

    1980-01-01

    The ultraviolet light induction of tandem double mutations in a reverse mutation system was shown using trpA mutants which are characterized by the codon sequences GAA and AAG in codon position 211. Among 597 independent Trp+ revertants of the trpA (AAG211) strain, 3 full revertants were detected that arose from UV-induced tandem double base exchanges. In the codon unit 211, full revertants due to single base exchanges are at least 20 times as frequent as full revertants due to tandem double base exchanges. (author)

  16. Expansion and contraction of the DUP240 multigene family in Saccharomyces cerevisiae populations.

    OpenAIRE

    Leh-Louis, Véronique; Wirth, Bénédicte; Potier, Serge; Souciet, Jean-Luc; Despons, Laurence

    2004-01-01

    The influence of duplicated sequences on chromosomal stability is poorly understood. To characterize chromosomal rearrangements involving duplicated sequences, we compared the organization of tandem repeats of the DUP240 gene family in 15 Saccharomyces cerevisiae strains of various origins. The DUP240 gene family consists of 10 members of unknown function in the reference strain S288C. Five DUP240 paralogs on chromosome I and two on chromosome VII are arranged as tandem repeats that are highl...

  17. Mutation rates at 42 Y chromosomal short tandem repeats in Chinese Han population in Eastern China.

    Science.gov (United States)

    Wu, Weiwei; Ren, Wenyan; Hao, Honglei; Nan, Hailun; He, Xin; Liu, Qiuling; Lu, Dejian

    2018-01-31

    Mutation analysis of 42 Y chromosomal short tandem repeats (Y-STRs) loci was performed using a sample of 1160 father-son pairs from the Chinese Han population in Eastern China. The results showed that the average mutation rate across the 42 Y-STR loci was 0.0041 (95% CI 0.0036-0.0047) per locus per generation. The locus-specific mutation rates varied from 0.000 to 0.0190. No mutation was found at DYS388, DYS437, DYS448, DYS531, and GATA_H4. DYS627, DYS570, DYS576, and DYS449 could be classified as rapidly mutating Y-STRs, with mutation rates higher than 1.0 × 10⁻². DYS458, DYS630, and DYS518 were moderately mutating Y-STRs, with mutation rates ranging from 8 × 10⁻³ to 1 × 10⁻². Although the characteristics of the Y-STR mutations were consistent with those in previous studies, mutation rate differences between our data and previous published data were found at some rapidly mutating Y-STRs. The single-copy loci located on the short arm of the Y chromosome (Yp) showed relatively higher mutation rates more frequently than the multi-copy loci. These results will not only extend the data for Y-STR mutations but also be important for kinship analysis, paternal lineage identification, and family relationship reconstruction in forensic Y-STR analysis.
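A per-locus, per-generation mutation rate of this kind is the number of observed mutations divided by the number of father-son transmissions examined. The sketch below shows that arithmetic with a Wilson score interval; the abstract does not say which interval method the authors used, so the Wilson choice is an assumption, as are the function names. The abstract also does not report the raw mutation count, so any count passed in is hypothetical. The bins in `classify_locus` follow the thresholds quoted in the abstract.

```python
import math


def mutation_rate_with_ci(n_mutations, n_meioses, z=1.96):
    """Point estimate and Wilson score interval for a per-locus, per-generation rate."""
    p = n_mutations / n_meioses
    denom = 1 + z**2 / n_meioses
    centre = (p + z**2 / (2 * n_meioses)) / denom
    half = z * math.sqrt(p * (1 - p) / n_meioses
                         + z**2 / (4 * n_meioses**2)) / denom
    return p, centre - half, centre + half


def classify_locus(rate):
    """Bin a locus by the thresholds quoted in the abstract."""
    if rate > 1.0e-2:
        return "rapidly mutating"
    if rate >= 8e-3:
        return "moderately mutating"
    return "other"


# Hypothetical input: 200 mutations over 1160 pairs x 42 loci, assuming every
# pair was typed at every locus (the abstract does not state either detail).
rate, low, high = mutation_rate_with_ci(200, 1160 * 42)
```

For a single locus the denominator would be the 1160 father-son pairs rather than the pooled 1160 × 42 transmissions.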

  18. Genotypic characterization by multi locus variable number of tandem repeats analysis international Bordetella pertussis vaccine strains

    Directory of Open Access Journals (Sweden)

    M. Fatah Moghadam

    2017-10-01

    Full Text Available Background: In the 1930s the first whole-cell pertussis vaccines became available to the public, heralding a dramatic success in overcoming the global burden of the disease. To date only a handful of B. pertussis strains have been used by international and local pertussis vaccine manufacturers. Inevitable, well-documented genetic changes in the world population of this pathogen have prompted serious questions about the suitability of traditional vaccine strains to protect humans against currently circulating wild isolates of Bordetella pertussis. Objective: To analyze the genetic diversity within the most frequently used vaccine strains of B. pertussis in the world. Methods: A recently developed multi-locus variable-number tandem repeat analysis (MLVA) genotyping system, along with a bioinformatic analysis, was applied to 11 strains/substrains of B137, B203 (10536), C393, Cs, E476, Tohama I, J445 (134), B202 and J446 (509), plus 2 sub-strains of 134 and 509 that are used at the Razi Institute for preparation of pertussis vaccine. Six individual loci were used in this study: VNTR1, VNTR3a, VNTR3b, VNTR4, VNTR5 and VNTR6. Findings: Six distinct genotypes were recognized among the examined strains by comparing our data with the Dutch MLVA databank. These were all new and had not been reported before in the database. Conclusion: This observation reiterates the necessity of detecting predominant native strains for inclusion in vaccine preparations suitable for different countries.

  19. Triplet repeat sequences in human DNA can be detected by hybridization to a synthetic (5'-CGG-3')17 oligodeoxyribonucleotide

    DEFF Research Database (Denmark)

    Behn-Krappa, A; Mollenhauer, J; Doerfler, W

    1993-01-01

    The seemingly autonomous amplification of naturally occurring triplet repeat sequences in the human genome has been implicated in the causation of human genetic disease, such as the fragile X (Martin-Bell) syndrome, myotonic dystrophy (Curshmann-Steinert), spinal and bulbar muscular atrophy...

  20. Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats.

    Science.gov (United States)

    Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

    2013-08-01

    Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  1. An ultra-high discrimination Y chromosome short tandem repeat multiplex DNA typing system.

    Directory of Open Access Journals (Sweden)

    Erin K Hanson

    Full Text Available In forensic casework, Y chromosome short tandem repeat markers (Y-STRs) are often used to identify a male donor DNA profile in the presence of excess quantities of female DNA, such as is found in many sexual assault investigations. Commercially available Y-STR multiplexes incorporating 12-17 loci are currently used in forensic casework (Promega's PowerPlex Y and Applied Biosystems' AmpFlSTR Yfiler). Despite the robustness of these commercial multiplex Y-STR systems and the ability to discriminate two male individuals in most cases, the coincidence match probabilities between unrelated males are modest compared with the standard set of autosomal STR markers. Hence there is still a need to develop new multiplex systems to supplement these for those cases where additional discriminatory power is desired or where there is a coincidental Y-STR match between potential male participants. Over 400 Y-STR loci have been identified on the Y chromosome. While these have the potential to increase the discrimination potential afforded by the commercially available kits, many have not been well characterized. In the present work, 91 loci were tested for their relative ability to increase the discrimination potential of the commonly used 'core' Y-STR loci. The result of this extensive evaluation was the development of an ultra high discrimination (UHD) multiplex DNA typing system that allows for the robust co-amplification of 14 non-core Y-STR loci. Population studies with a mixed African American and American Caucasian sample set (n = 572) indicated that the overall discriminatory potential of the UHD multiplex was superior to all commercial kits tested. The combined use of the UHD multiplex and the Applied Biosystems' AmpFlSTR Yfiler kit resulted in 100% discrimination of all individuals within the sample set, which presages its potential to maximally augment currently available forensic casework markers. It could also find applications in human evolutionary

  2. Multiple-locus variable-number tandem repeat analysis for molecular typing of Aspergillus fumigatus

    Directory of Open Access Journals (Sweden)

    Chermette René

    2010-12-01

    Full Text Available. Background: Multiple-locus variable-number tandem repeat (VNTR) analysis (MLVA) is a prominent subtyping method to resolve closely related microbial isolates to provide information for establishing genetic patterns among isolates and to investigate disease outbreaks. The usefulness of MLVA was recently demonstrated for the avian major pathogen Chlamydophila psittaci. In the present study, we developed a similar method for another pathogen of birds: the filamentous fungus Aspergillus fumigatus. Results: We selected 10 VNTR markers located on 4 different chromosomes (1, 5, 6 and 8) of A. fumigatus. These markers were tested with 57 unrelated isolates from different hosts or their environment (53 isolates from avian species in France, China or Morocco, 3 isolates from humans collected at CHU Henri Mondor hospital in France and the reference strain CBS 144.89). The Simpson index for individual markers ranged from 0.5771 to 0.8530. A combined loci index calculated with all the markers yielded an index of 0.9994. In a second step, the panel of 10 markers was used in different epidemiological situations and tested on 277 isolates, including 62 isolates from birds in Guangxi province in China, 95 isolates collected in two duck farms in France and 120 environmental isolates from a turkey hatchery in France. A database was created with the results of the present study (http://minisatellites.u-psud.fr/MLVAnet/). Three major clusters of isolates were defined by using the graphing algorithm termed Minimum Spanning Tree (MST). The first cluster comprised most of the avian isolates collected in the two duck farms in France, the second cluster comprised most of the avian isolates collected in poultry farms in China and the third one comprised most of the isolates collected in the turkey hatchery in France. Conclusions: MLVA displayed excellent discriminatory power. The method showed a good reproducibility. MST analysis revealed an interesting clustering with a
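
    The per-marker and combined diversity indices quoted in this record can be computed directly from typing results. The sketch below assumes the "Simpson index" is the Hunter-Gaston form often used for typing methods, D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)); the abstract does not state the exact formula, and the function name and example profiles are hypothetical.

```python
from collections import Counter


def simpson_diversity(types):
    """Hunter-Gaston form of Simpson's index of diversity (assumed formula).

    `types` holds one type label per isolate, e.g. the allele at a single
    VNTR marker, or a tuple of alleles for a combined multi-locus profile.
    """
    counts = Counter(types)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("at least two isolates are required")
    # Probability that two isolates drawn without replacement share a type.
    same_type = sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))
    return 1.0 - same_type


# Hypothetical data: repeat counts at one marker for six isolates.
single_marker = [3, 3, 4, 5, 5, 5]
# Hypothetical combined profiles: one tuple of repeat counts per isolate.
combined_profiles = [(3, 7), (3, 8), (4, 7), (5, 7), (5, 9), (5, 7)]

print(simpson_diversity(single_marker))
print(simpson_diversity(combined_profiles))
```

    Treating a combined profile as a single hashable tuple means the same function covers both the per-marker index and the combined-loci index.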

  3. Short tandem repeats in CdLS-causing genes: distribution and ...

    Indian Academy of Sciences (India)

    and SMC3, as all STRs for these genes fall in noncoding region only. ... This indicates that more repeated STRs are at the risk of replication ... patients versus controls. ... ing from a balance between slippage events and point mutations. Proc.

  4. Genome-Wide Analysis of Simple Sequence Repeats in Bitter Gourd (Momordica charantia)

    Directory of Open Access Journals (Sweden)

    Junjie Cui

    2017-06-01

    Full Text Available Bitter gourd (Momordica charantia) is widely cultivated as a vegetable and medicinal herb in many Asian and African countries. After the sequencing of the cucumber (Cucumis sativus), watermelon (Citrullus lanatus), and melon (Cucumis melo) genomes, bitter gourd became the fourth cucurbit species whose whole genome was sequenced. However, a comprehensive analysis of simple sequence repeats (SSRs) in bitter gourd, including a comparison with the three aforementioned cucurbit species, has not yet been published. Here, we identified a total of 188,091 and 167,160 SSR motifs in the genomes of the bitter gourd lines ‘Dali-11’ and ‘OHB3-1,’ respectively. Subsequently, the SSR content, motif lengths, and classified motif types were characterized for the bitter gourd genomes and compared among all the cucurbit genomes. Lastly, a large set of 138,727 unique in silico SSR primer pairs were designed for bitter gourd. Among these, 71 primers were selected, all of which successfully amplified SSRs from the two bitter gourd lines ‘Dali-11’ and ‘K44’. To further examine the utilization of unique SSR primers, 21 SSR markers were used to genotype a collection of 211 bitter gourd lines from all over the world. A model-based clustering method and phylogenetic analysis indicated a clear separation among the geographic groups. The genomic SSR markers developed in this study have considerable potential value in advancing bitter gourd research.
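
    A genome-wide SSR survey of this kind rests on scanning sequence for short motifs repeated in tandem. The sketch below is an illustration only, not the pipeline used in the study: it finds perfect SSRs with motifs of 1 to 6 bp using a regular expression, and the minimum copy numbers in `MIN_REPEATS`, like the function names, are hypothetical choices.

```python
import re

# Hypothetical minimum copy numbers per motif length (bp); not from the study.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def is_reducible(motif):
    """True if the motif is itself a repeat of a shorter unit (ATAT = AT x 2)."""
    k = len(motif)
    return any(k % d == 0 and motif == motif[:d] * (k // d) for d in range(1, k))


def find_ssrs(sequence, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs."""
    seq = sequence.upper()
    hits = []
    for k, m in min_repeats.items():
        # One k-bp motif followed by at least m - 1 further exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, m - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_reducible(motif):
                continue  # already reported under the shorter motif length
            hits.append((match.start(), motif, len(match.group(0)) // k))
    return sorted(hits)


# Toy sequence: an (AT)7 dinucleotide repeat flanked by unique bases.
print(find_ssrs("GGG" + "AT" * 7 + "CCC"))
```

    Skipping reducible motifs keeps a run such as (AT)12 from also being counted as a tetranucleotide (ATAT)6 repeat.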

  5. A multiple-locus variable-number tandem repeat analysis (MLVA) of Listeria monocytogenes isolated from Norwegian salmon-processing factories and from listeriosis patients.

    Science.gov (United States)

    Lunestad, B T; Truong, T T T; Lindstedt, B-A

    2013-10-01

    The objective of this study was to characterize Listeria monocytogenes isolated from farmed Atlantic salmon (Salmo salar) and the processing environment in three different Norwegian factories, and compare these to clinical isolates by multiple-locus variable-number tandem repeat analysis (MLVA). The 65 L. monocytogenes isolates obtained gave 15 distinct MLVA profiles. There was great heterogeneity in the distribution of MLVA profiles in factories and within each factory. Nine of the 15 MLVA profiles found in the fish-associated isolates were found to match human profiles. The MLVA profile 07-07-09-10-06 was the most common strain in Norwegian listeriosis patients. L. monocytogenes with this profile has previously been associated with at least two known listeriosis outbreaks in Norway, neither determined to be due to fish consumption. However, since this profile was also found in fish and in the processing environment, fish should be considered as a possible food vehicle during sporadic cases and outbreaks of listeriosis.

  6. Length of Variable Numbers of Tandem Repeats in the Carboxyl Ester Lipase (CEL) Gene May Confer Susceptibility to Alcoholic Liver Cirrhosis but Not Alcoholic Chronic Pancreatitis.

    Science.gov (United States)

    Fjeld, Karianne; Beer, Sebastian; Johnstone, Marianne; Zimmer, Constantin; Mössner, Joachim; Ruffert, Claudia; Krehan, Mario; Zapf, Christian; Njølstad, Pål Rasmus; Johansson, Stefan; Bugert, Peter; Miyajima, Fabio; Liloglou, Triantafillos; Brown, Laura J; Winn, Simon A; Davies, Kelly; Latawiec, Diane; Gunson, Bridget K; Criddle, David N; Pirmohamed, Munir; Grützmann, Robert; Michl, Patrick; Greenhalf, William; Molven, Anders; Sutton, Robert; Rosendahl, Jonas

    2016-01-01

    Carboxyl-ester lipase (CEL) contributes to fatty acid ethyl ester metabolism, which is implicated in alcoholic pancreatitis. The CEL gene harbours a variable number of tandem repeats (VNTR) region in exon 11. Variation in this VNTR has been linked to monogenic pancreatic disease, while conflicting results were reported for chronic pancreatitis (CP). Here, we aimed to investigate a potential association of CEL VNTR lengths with alcoholic CP. Overall, 395 alcoholic CP patients, 218 patients with alcoholic liver cirrhosis (ALC) serving as controls with a comparable amount of alcohol consumed, and 327 healthy controls from Germany and the United Kingdom (UK) were analysed by determination of fragment lengths by capillary electrophoresis. Allele frequencies and genotypes of different VNTR categories were compared between the groups. Twelve repeats were overrepresented in UK ACP patients (P = 0.04) compared to controls, whereas twelve repeats were enriched in German ALC compared to alcoholic CP patients (P = 0.03). Frequencies of CEL VNTR lengths of 14 and 15 repeats differed between German ALC patients and healthy controls (P = 0.03 and 0.008, respectively). However, in the genotype and pooled analysis of VNTR lengths no statistically significant association was detected. Additionally, the 16-16 genotype as well as 16 repeats were more frequent in UK ALC than in alcoholic CP patients (P = 0.034 and 0.02, respectively). In all other calculations, including pooled German and UK data, allele frequencies and genotype distributions did not differ significantly between patients and controls or between alcoholic CP and ALC. We did not obtain evidence that CEL VNTR lengths are associated with alcoholic CP. However, our results suggest that CEL VNTR lengths might associate with ALC, a finding that needs to be clarified in larger cohorts.

  7. A novel tandem reporter quantifies RNA polymerase II termination in mammalian cells.

    Directory of Open Access Journals (Sweden)

    Ayan Banerjee

    2009-07-01

    Full Text Available Making the correct choice between transcription elongation and transcription termination is essential to the function of RNA polymerase II, and fundamental to gene expression. This choice can be influenced by factors modifying the transcription complex, factors modifying chromatin, or signals mediated by the template or transcript. To aid in the study of transcription elongation and termination we have developed a transcription elongation reporter system that consists of tandem luciferase reporters flanking a test sequence of interest. The ratio of expression from the reporters provides a measure of the relative rates of successful elongation through the intervening sequence. Size-matched fragments containing the polyadenylation signal of the human beta-actin gene (ACTB) and the human beta-globin gene (HBB) were evaluated for transcription termination using this new ratiometric tandem reporter assay. Constructs bearing just 200 base pairs on either side of the consensus poly(A) addition site terminated 98% and 86% of transcription for ACTB and HBB sequences, respectively. The nearly 10-fold difference in read-through transcription between the two short poly(A) regions was eclipsed when additional downstream poly(A) sequence was included for each gene. Both poly(A) regions proved very effective at termination when 1100 base pairs were included, stopping 99.6% of transcription. To determine if part of the increased termination was simply due to the increased template length, we inserted several kilobases of heterologous coding sequence downstream of each poly(A) region test fragment. Unexpectedly, the additional length reduced the effectiveness of termination of HBB sequences 2-fold and of ACTB sequences 3- to 5-fold. The tandem construct provides a sensitive measure of transcription termination in human cells. Decreased Xrn2 or Senataxin levels produced only a modest release from termination. Our data support overlap in allosteric and torpedo mechanisms

  8. Population genetic study of 10 short tandem repeat loci from 600 domestic dogs in Korea.

    Science.gov (United States)

    Moon, Seo Hyun; Jang, Yoon-Jeong; Han, Myun Soo; Cho, Myung-Haing

    2016-09-30

    Dogs have long shared close relationships with many humans. Due to the large number of dogs in human populations, they are often involved in crimes. Occasionally, canine biological evidence such as saliva, bloodstains and hairs can be found at crime scenes. Accordingly, canine DNA can be used as forensic evidence. The use of short tandem repeat (STR) loci from biological evidence is valuable for forensic investigations. In Korea, canine STR profiling-related crimes are being successfully analyzed, leading to diverse crimes such as animal cruelty, dog-attacks, murder, robbery, and missing and abandoned dogs being solved. However, the probability of random DNA profile matches cannot be analyzed because of a lack of canine STR data. Therefore, in this study, 10 STR loci were analyzed in 600 dogs in Korea (344 dogs belonging to 30 different purebreds and 256 crossbred dogs) to estimate canine forensic genetic parameters. Among purebred dogs, a separate statistical analysis was conducted for five major subgroups, 97 Maltese, 47 Poodles, 31 Shih Tzus, 32 Yorkshire Terriers, and 25 Pomeranians. Allele frequencies, expected (Hexp) and observed heterozygosity (Hobs), fixation index (F), probability of identity (P(ID)), probability of sibling identity (P(ID)sib) and probability of exclusion (PE) were then calculated. The Hexp values ranged from 0.901 (PEZ12) to 0.634 (FHC2079), while the P(ID)sib values were between 0.481 (FHC2079) and 0.304 (PEZ12) and the P(ID)sib was about 3.35 × 10⁻⁵ for the combination of all 10 loci. The results presented herein will strengthen the value of canine DNA to solving dog-related crimes.
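
    The parameters listed in this record follow from per-locus allele frequencies. The sketch below uses the textbook formulas (Hexp = 1 - sum(p_i^2); P(ID) as the sum of squared genotype frequencies under Hardy-Weinberg proportions; P(ID)sib = 0.25 + 0.5*sum(p_i^2) + 0.5*(sum(p_i^2))^2 - 0.25*sum(p_i^4)) and multiplies per-locus values to combine loci, which assumes the loci are independent. The abstract does not say which estimators or software the authors used, and the allele frequencies and function names here are hypothetical.

```python
from math import prod


def expected_heterozygosity(freqs):
    """Hexp = 1 - sum(p_i^2), without small-sample correction."""
    return 1.0 - sum(p * p for p in freqs)


def probability_of_identity(freqs):
    """P(ID): chance that two unrelated individuals share a genotype."""
    homozygotes = sum(p ** 4 for p in freqs)
    heterozygotes = sum(
        (2 * freqs[i] * freqs[j]) ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return homozygotes + heterozygotes


def probability_of_identity_sibs(freqs):
    """P(ID)sib: chance that two full siblings share a genotype."""
    s2 = sum(p * p for p in freqs)
    s4 = sum(p ** 4 for p in freqs)
    return 0.25 + 0.5 * s2 + 0.5 * s2 ** 2 - 0.25 * s4


def combine_loci(per_locus_values):
    """Multiply per-locus probabilities, assuming independent loci."""
    return prod(per_locus_values)


# Hypothetical allele frequencies at two loci (each list sums to 1).
loci = [[0.5, 0.3, 0.2], [0.4, 0.4, 0.1, 0.1]]
for freqs in loci:
    print(expected_heterozygosity(freqs), probability_of_identity(freqs))
print(combine_loci(probability_of_identity_sibs(f) for f in loci))
```

    Because P(ID)sib never falls below 0.25 at a single locus, it shrinks far more slowly than P(ID) as loci are added, which is why it serves as a conservative bound when relatives may be present.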

  9. Transcription factor IID in the Archaea: sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

    Science.gov (United States)

    Marsh, T. L.; Reich, C. I.; Whitelock, R. B.; Olsen, G. J.; Woese, C. R. (Principal Investigator)

    1994-01-01

    The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer), it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

  10. Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes

    DEFF Research Database (Denmark)

    Peng, Xu; Brügger, Kim; Shen, Biao

    2003-01-01

    terminally modified and corresponds to SSO454, an open reading frame of previously unassigned function. It binds specifically to DNA fragments carrying double and single repeat sequences, binding on one side of the repeat structure, and producing an opening of the opposite side of the DNA structure. It also...... recognizes both main families of repeat sequences in S. solfataricus. The recombinant protein, expressed in Escherichia coli, showed the same binding properties to the SRSR repeat as the native one. The SSO454 protein exhibits a tripartite internal repeat structure which yields a good sequence match...... with a helix-turn-helix DNA-binding motif. Although this putative motif is shared by other archaeal proteins, orthologs of SSO454 were only detected in species within the Sulfolobus genus and in the closely related Acidianus genus. We infer that the genus-specific protein induces an opening of the structure...

  11. Application of Tandem Two-Dimensional Mass Spectrometry for Top-Down Deep Sequencing of Calmodulin.

    Science.gov (United States)

    Floris, Federico; Chiron, Lionel; Lynch, Alice M; Barrow, Mark P; Delsuc, Marc-André; O'Connor, Peter B

    2018-06-04

    Two-dimensional mass spectrometry (2DMS) involves simultaneous acquisition of the fragmentation patterns of all the analytes in a mixture by correlating their precursor and fragment ions by modulating precursor ions systematically through a fragmentation zone. Tandem two-dimensional mass spectrometry (MS/2DMS) unites the ultra-high accuracy of Fourier transform ion cyclotron resonance (FT-ICR) MS/MS and the simultaneous data-independent fragmentation of 2DMS to achieve extensive inter-residue fragmentation of entire proteins. 2DMS was recently developed for top-down proteomics (TDP), and applied to the analysis of calmodulin (CaM), reporting a cleavage coverage of about 23% using infrared multiphoton dissociation (IRMPD) as the fragmentation technique. The goal of this work is to expand the utility of top-down protein analysis using MS/2DMS in order to extend the cleavage coverage in top-down proteomics further into the interior regions of the protein. In this case, using MS/2DMS, the cleavage coverage of CaM increased from ~23% to ~42%. Graphical Abstract: Two-dimensional mass spectrometry, when applied to primary fragment ions from the source, allows deep-sequencing of the protein calmodulin.

  12. Assembly of Repeat Content Using Next Generation Sequencing Data

    Energy Technology Data Exchange (ETDEWEB)

    LaButti, Kurt; Kuo, Alan; Grigoriev, Igor; Copeland, Alex

    2014-03-17

    Repetitive organisms pose a challenge for short read assembly, and typically only unique regions and repeat regions shorter than the read length can be accurately assembled. Recently, we have been investigating the use of Pacific Biosciences reads for de novo fungal assembly. We will present an assessment of the quality and degree of repeat reconstruction possible in a fungal genome using long read technology. We will also compare differences in assembly of repeat content using short read and long read technology.

  13. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures

    Directory of Open Access Journals (Sweden)

    Hayes Michael L

    2012-05-01

    Full Text Available. Background: Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. Results: All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3’ UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. Conclusion: PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative

  14. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

    Science.gov (United States)

    Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

    2012-05-14

    Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target

  15. Catalytic stereoselective synthesis of highly substituted indanones via tandem Nazarov cyclization and electrophilic fluorination trapping.

    Science.gov (United States)

    Nie, Jing; Zhu, Hong-Wei; Cui, Han-Feng; Hua, Ming-Qing; Ma, Jun-An

    2007-08-02

    A new catalytic stereoselective tandem transformation via Nazarov cyclization/electrophilic fluorination has been accomplished. This sequence is efficiently catalyzed by a Cu(II) complex to afford fluorine-containing 1-indanone derivatives with two new stereocenters with high diastereoselectivity (trans/cis up to 49/1). Three examples of catalytic enantioselective tandem transformation are presented.

  16. Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species

    Directory of Open Access Journals (Sweden)

    Odvody Gary N

    2008-11-01

    Full Text Available. Background: A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Results: Among the 55 primer pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellite primer combinations, with an average of 4.8 polymorphic markers per SSR across 34

  17. Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species.

    Science.gov (United States)

    Perumal, Ramasamy; Nimmakayala, Padmavathi; Erattaimuthu, Saradha R; No, Eun-Gyu; Reddy, Umesh K; Prom, Louis K; Odvody, Gary N; Luster, Douglas G; Magill, Clint W

    2008-11-29

    A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Among the 55 primer pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellite primer combinations, with an average of 4.8 polymorphic markers per SSR across 34 Peronosclerospora, Peronospora and Sclerospora

  18. Direct repeat sequences are essential for function of the cis-acting locus of transfer (clt) of Streptomyces phaeochromogenes plasmid pJV1.

    Science.gov (United States)

    Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis

    2003-11-01

    The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.

  19. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng

    Directory of Open Access Journals (Sweden)

    Jinhui eChen

    2015-06-01

    Full Text Available Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around ten species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be greatly helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.

  20. The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng.

    Science.gov (United States)

    Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen

    2015-01-01

    Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be greatly helpful for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.

  1. PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile.

    Science.gov (United States)

    Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A

    2011-09-01

    Because of their sensitivity and high level of discrimination, short tandem repeat (STR) marker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes of up to 15-17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, will not allow this picture to change in the near future, notwithstanding technological developments. In this study, we present a free online calculator named PopAffiliator ( http://cracs.fc.up.pt/popaffiliator ) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provides a considerable amount of information for population assignment, in addition to being excellent for individual identification.

  2. [Discriminatory power of variable number on tandem repeats loci for genotyping Mycobacterium tuberculosis strains in China].

    Science.gov (United States)

    Chen, H X; Cai, C; Liu, J Y; Zhang, Z G; Yuan, M; Jia, J N; Sun, Z G; Huang, H R; Gao, J M; Li, W M

    2017-06-10

    Objective: Using the standard genotyping method, variable number of tandem repeats (VNTR) analysis, we constructed a VNTR database covering all provinces and proposed a set of optimized VNTR loci combinations for each province, in order to improve tuberculosis prevention and control programs in China. Methods: A 15-locus VNTR scheme was used to analyze 4,116 Mycobacterium tuberculosis strains isolated during the national survey of drug-resistant tuberculosis in 2007. The Hunter-Gaston Index (HGI) was used to assess the discriminatory power of each VNTR locus. Combinations of 12, 10, 8 and 5 VNTR loci were constructed for each province, based on the epidemic characteristics of M. tuberculosis lineages in China and selected for high discriminatory power and genetic stability. Results: From the complete 15-locus VNTR patterns of 3,966 strains (96.36%, 3,966/4,116), we found seven loci with high HGI (including QUB11b and MIRU26) as well as loci with low stability (including QUB26, MIRU16, Mtub21 and QUB11b) in several areas. Across the 31 provinces, the optimized combination was a 10-VNTR set in Inner Mongolia, Chongqing and Heilongjiang, and an 8-VNTR set shared by the other provinces. Conclusions: The VNTR database should be used for tracing sources of infection and clusters of M. tuberculosis nationally, and the optimized VNTR combinations should be used for monitoring local epidemics and local M. tuberculosis population genetics.
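
    The Hunter-Gaston Index mentioned above is commonly defined as HGI = 1 - [1 / (N(N - 1))] * sum_j n_j(n_j - 1), where N is the number of strains and n_j the number of strains of type j. The following is a minimal Python sketch of that formula, not the authors' code; the function name and the six example copy numbers are hypothetical and are not taken from the study.

```python
from collections import Counter


def hunter_gaston_index(genotypes):
    """Return the Hunter-Gaston discriminatory index for a list of type labels.

    The index is the probability that two strains drawn without
    replacement from the sample belong to different types.
    """
    n = len(genotypes)
    if n < 2:
        raise ValueError("need at least two strains")
    counts = Counter(genotypes)
    # Number of ordered pairs of strains that share the same type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical repeat-copy numbers at one VNTR locus for six strains.
example = [2, 2, 3, 4, 4, 4]
print(round(hunter_gaston_index(example), 3))  # 0.733
```

    A locus at which every strain carries the same copy number scores 0, and a locus at which every strain differs scores 1, which is why higher values indicate greater discriminatory power.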

  3. Genome-wide identification and validation of simple sequence repeats (SSRs) from Asparagus officinalis.

    Science.gov (United States)

    Li, Shufen; Zhang, Guojun; Li, Xu; Wang, Lianjun; Yuan, Jinhong; Deng, Chuanliang; Gao, Wujun

    2016-06-01

    Garden asparagus (Asparagus officinalis), an important vegetable cultivated worldwide, can also serve as a model dioecious plant species in the study of sex determination and sex chromosome evolution. However, limited DNA marker resources have been developed and used for this species. To expand these resources, we examined the DNA sequences for simple sequence repeats (SSRs) in 163,406 scaffolds representing approximately 400 Mbp of the A. officinalis genome. A total of 87,576 SSRs were identified in 59,565 scaffolds. The most abundant SSR repeats were trinucleotide and tetranucleotide, accounting for 29.2 and 29.1% of the total SSRs, respectively, followed by di-, penta-, hexa-, hepta-, and octanucleotides. The AG motif was most common among dinucleotides and was also the most frequent motif in the entire A. officinalis genome, representing 14.7% of all SSRs. A total of 41,917 SSR primer pairs were designed to amplify SSRs. Twenty-two genomic SSR markers were tested in 39 asparagus accessions belonging to ten cultivars and one accession of Asparagus setaceus for determination of genetic diversity. The intra-species polymorphism information content (PIC) values of the 22 genomic SSR markers were intermediate, with an average of 0.41. The genetic diversity between the ten A. officinalis cultivars was low, and the UPGMA dendrogram was largely unrelated to cultivars. It is here suggested that the sex of individuals is an important factor influencing the clustering results. The information reported here provides new information about the organization of the microsatellites in A. officinalis genome and lays a foundation for further genetic studies and breeding applications of A. officinalis and related species. Copyright © 2016 Elsevier Ltd. All rights reserved.
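
    As an illustration of what a genome-wide scan for perfect SSRs involves, here is a minimal Python sketch based on a regular expression with a back-reference. It is not the tool used in the study: the function name, the motif length range (2 to 8 nt, chosen to mirror the di- to octanucleotide classes above) and the minimum of four copies are all assumptions made for this example.

```python
import re


def find_ssrs(seq, min_unit=2, max_unit=8, min_repeats=4):
    """Return (start, motif, copies) for perfect tandem repeats in seq.

    The thresholds are illustrative defaults, not those of the study.
    """
    # Lazy quantifier: prefer the shortest motif that satisfies the repeat.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        # Skip motifs that are themselves repeats of a shorter unit
        # (for example "AA"), so homopolymer runs are not reported.
        if (motif + motif).find(motif, 1) != len(motif):
            continue
        copies = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, copies))
    return hits


# Hypothetical scaffold fragment with an (AG)6 and a (TCA)5 repeat.
fragment = "GGCT" + "AG" * 6 + "C" + "TCA" * 5 + "GG"
print(find_ssrs(fragment))  # [(4, 'AG', 6), (17, 'TCA', 5)]
```

    Tallying the motif lengths of the returned hits would give per-class counts comparable in kind to the tri- and tetranucleotide proportions reported in the abstract.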

  4. Identification of a highly sulfated fucoidan from sea cucumber Pearsonothuria graeffei with well-repeated tetrasaccharides units.

    Science.gov (United States)

    Hu, Yaqin; Li, Shan; Li, Junhui; Ye, Xingqian; Ding, Tian; Liu, Donghong; Chen, Jianchu; Ge, Zhiwei; Chen, Shiguo

    2015-12-10

    Sea cucumber fucoidan is a major bioactive component of sea cucumber. The structures of fucoidans have significant influences on their biological activities. The present study clarified the fine structure of a fucoidan from Pearsonothuria graeffei. Fucoidan was obtained after papain digestion and purified by ion chromatography. The carbohydrate sequence of fucoidan was first determined by negative-ion electrospray tandem mass spectrometry (ES-MS) with collision-induced dissociation of the oligosaccharide fragments, which were obtained by mild acid hydrolysis, and completed by NMR for assignment of the anomeric conformation. It was unambiguously identified as a tetrasaccharide repeating unit with a backbone of [→3Fuc(2S,4S)α1→3Fucα1→3Fuc(4S)α1→3Fucα1→]n. The glycosidic bonds between the non-sulfated and 2,4-O-disulfated fucose residues were selectively cleaved, and highly ordered oligosaccharide fragments with a tetrasaccharide repeating unit were obtained. The highly 4-O- and 2,4-di-O-sulfated polysaccharide deserves further development for pharmaceutical use. Copyright © 2015 Elsevier Ltd. All rights reserved.

  5. FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise.

    Science.gov (United States)

    Hoogenboom, Jerry; van der Gaag, Kristiaan J; de Leeuw, Rick H; Sijen, Titia; de Knijff, Peter; Laros, Jeroen F J

    2017-03-01

    Massively parallel sequencing (MPS) is on the verge of broad-scale application in forensic research and casework. The improved capability to analyse evidentiary traces representing unbalanced mixtures is often mentioned as one of the major advantages of this technique. However, most of the available software packages that analyse forensic short tandem repeat (STR) sequencing data are not well suited for high-throughput analysis of such mixed traces. The largest challenge is the presence of stutter artefacts in STR amplifications, which are not readily discerned from minor contributions. FDSTools is an open-source software solution developed for this purpose. The level of stutter formation is influenced by various aspects of the sequence, such as the length of the longest uninterrupted stretch occurring in an STR. When MPS is used, STRs are evaluated as sequence variants that each have particular stutter characteristics which can be precisely determined. FDSTools uses a database of reference samples to determine stutter and other systemic PCR or sequencing artefacts for each individual allele. In addition, stutter models are created for each repeating element in order to predict stutter artefacts for alleles that are not included in the reference set. This information is subsequently used to recognise and compensate for the noise in a sequence profile. The result is a better representation of the true composition of a sample. Using Promega Powerseq™ Auto System data from 450 reference samples and 31 two-person mixtures, we show that the FDSTools correction module decreases stutter ratios above 20% to below 3%. Consequently, much lower levels of contributions in the mixed traces are detected. FDSTools contains modules to visualise the data in an interactive format allowing users to filter data with their own preferred thresholds. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  6. Instability of expanded simple tandem repeats is induced in cell culture by a variety of agents: N-Nitroso-N-ethylurea, benzo(a)pyrene, etoposide and okadaic acid

    Energy Technology Data Exchange (ETDEWEB)

    Polyzos, Aris; Parfett, Craig; Healy, Caroline; Douglas, George R.; Yauk, Carole L. [Environmental Health Centre, Environmental and Occupational Toxicology Division, Health Canada, Tunney's Pasture, P.L. 0803A, Ottawa, Ont., K1A 0L2 (Canada)]. E-mail: Carole_Yauk@hc-sc.gc.ca

    2006-06-25

    Expanded simple tandem repeat (ESTR) sequences have proven useful biomarkers to detect genotoxicity in vivo. Their high sensitivity has been used to assess environmentally relevant doses of mutagens such as ionizing radiation, DNA alkylating agents and airborne particulate pollution, for germline mutations in mouse assays. The mutagenic response involves size alteration of these ESTR loci induced by agents causing a variety of cellular damage. The mechanistic aspects of this induced instability remain unclear and have not been studied in detail. Mechanistic knowledge is important to help understand the relevance of increased ESTR mutation frequencies. In this study, we applied a murine cell culture system to examine induced response to four agents exhibiting different modes of toxic action including: N-nitroso-N-ethylurea (ENU), benzo(a)pyrene (BaP), okadaic acid and etoposide at slightly sub-toxic levels. We used single-molecule-polymerase chain reaction (SM-PCR) to assess the relative mutant frequency after 4-week chemical treatments at the Ms6-hm ESTR sequence of cultured C3H/10T1/2 cells (a mouse embryonic cell line). Increased mutation was observed with 0.64 mM ENU (1.95-fold increase, P < 0.0001), 1 μM benzo(a)pyrene (1.87-fold increase, P = 0.0006) and 3 nM etoposide (1.89-fold increase, P = 0.0003). The putative ESTR mutagen okadaic acid (1.27-fold increase, P = 0.2289), administered at 0.5 nM, did not affect the C3H/10T1/2 Ms6-hm locus. Therefore, agents inducing small and bulky adducts, and indirectly causing strand breaks through inhibition of topoisomerase, caused similar induction of instability at an ESTR locus at matched toxicities. As size spectra for induced mutations were identical, the data indicate that although these chemicals exhibit distinct modes of action, a similar indirect process is influencing ESTR instability. In contrast, a potent tumour promoter that is a kinase inhibitor does not contribute to induced ESTR instability in

  7. Instability of expanded simple tandem repeats is induced in cell culture by a variety of agents: N-Nitroso-N-ethylurea, benzo(a)pyrene, etoposide and okadaic acid

    International Nuclear Information System (INIS)

    Polyzos, Aris; Parfett, Craig; Healy, Caroline; Douglas, George R.; Yauk, Carole L.

    2006-01-01

    Expanded simple tandem repeat (ESTR) sequences have proven useful biomarkers to detect genotoxicity in vivo. Their high sensitivity has been used to assess environmentally relevant doses of mutagens such as ionizing radiation, DNA alkylating agents and airborne particulate pollution, for germline mutations in mouse assays. The mutagenic response involves size alteration of these ESTR loci induced by agents causing a variety of cellular damage. The mechanistic aspects of this induced instability remain unclear and have not been studied in detail. Mechanistic knowledge is important to help understand the relevance of increased ESTR mutation frequencies. In this study, we applied a murine cell culture system to examine induced response to four agents exhibiting different modes of toxic action including: N-nitroso-N-ethylurea (ENU), benzo(a)pyrene (BaP), okadaic acid and etoposide at slightly sub-toxic levels. We used single-molecule-polymerase chain reaction (SM-PCR) to assess the relative mutant frequency after 4-week chemical treatments at the Ms6-hm ESTR sequence of cultured C3H/10T1/2 cells (a mouse embryonic cell line). Increased mutation was observed with 0.64 mM ENU (1.95-fold increase, P < 0.0001), 1 μM benzo(a)pyrene (1.87-fold increase, P = 0.0006) and 3 nM etoposide (1.89-fold increase, P = 0.0003). The putative ESTR mutagen okadaic acid (1.27-fold increase, P = 0.2289), administered at 0.5 nM, did not affect the C3H/10T1/2 Ms6-hm locus. Therefore, agents inducing small and bulky adducts, and indirectly causing strand breaks through inhibition of topoisomerase, caused similar induction of instability at an ESTR locus at matched toxicities. As size spectra for induced mutations were identical, the data indicate that although these chemicals exhibit distinct modes of action, a similar indirect process is influencing ESTR instability. In contrast, a potent tumour promoter that is a kinase inhibitor does not contribute to induced ESTR instability in cell

  8. The Epidemiological Significance and Temporal Stability of Mycobacterial Interspersed Repetitive Units-Variable Number of Tandem Repeats-Based Method Applied to Mycobacterium tuberculosis in China

    Directory of Open Access Journals (Sweden)

    Yang Li

    2018-04-01

    This study aimed to validate the epidemiological significance and temporal stability of Mycobacterial Interspersed Repetitive Units-Variable Number of Tandem Repeats (MIRU-VNTR) typing in a genetically and geographically diverse set of clinical isolates from patients diagnosed with pulmonary tuberculosis in China. Between 2010 and 2013, a total of 982 Mycobacterium tuberculosis isolates were collected from four population-based investigations in China. Apart from the currently applied 24-locus MIRU-VNTR, six additional hypervariable loci were analyzed in order to validate the MIRU-VNTR combinations in terms of their epidemiological links, clustering time span, and paired geographic distance. In vitro temporal stability was analyzed both for individual MIRU-VNTR loci and for several combinations of loci. In the present study, four MIRU-VNTR combinations, including the hypervariable loci 3820, 3232, 2163a, and 4120, were evaluated. All of these combinations obtained a Hunter-Gaston discriminatory index (HGDI) value over 0.9900 with a reduced clustering proportion (from 32.0% to 25.6%). By comparing epidemiological links, clustering time span, and paired geographic distance, we found that the performances of the four MIRU-VNTR combinations were comparable to that of insertion sequence 6110 restriction fragment length polymorphism (IS6110-RFLP), and significantly better than that of 24-locus MIRU-VNTR genotyping alone. The proportion of temporally stable loci ranged from 90.5% to 92.5% within the combined MIRU-VNTR genotyping, which is higher than that of IS6110-RFLP (85.4%). By adding four hypervariable loci to the standard 24-locus MIRU-VNTR genotyping, we obtained high discriminatory power, stability and epidemiological significance. This algorithm could therefore be used to improve tuberculosis transmission surveillance and outbreak investigation in China.

  9. Association of ECRG2 TCA short tandem repeat polymorphism with the risk of oesophageal cancer in a North Indian population.

    Science.gov (United States)

    Jain, Meenu; Kumar, Shaleen; Ghoshal, Uday C; Mittal, Balraj

    2008-06-01

    Oesophageal cancer-related gene (ECRG2) is a tumour suppressor gene and it has been suggested that a triplet TCA short tandem repeat (STR) in the noncoding region of exon 4 plays a role in genetic susceptibility to oesophageal cancer. In the present study, ECRG2 STR polymorphism was studied in 134 patients with oesophageal cancer and 194 controls, using PCR and polyacrylamide gel electrophoresis. The results showed a higher frequency of the ECRG2 TCA (3)/TCA (4) genotype in cancer patients than in controls (odds ratio 2.6, 95% CI 1.0-6.4, p = 0.03). The association of the ECRG2 TCA (3)/TCA (4) genotype with clinical characteristics showed an increased risk for squamous cell histology (2.8, 95% CI 1.1-7.1, p = 0.03), while no association with tumor location or lymph node involvement was observed. Interaction of tobacco, alcohol and occupational exposure with the ECRG2 genotypes did not show modulation of risk. In conclusion, the ECRG2 TCA (3)/TCA (4) genotype is associated with the risk of oesophageal carcinoma in a North Indian population.

  10. Specific multilocus variable-number tandem-repeat analysis genotypes of Mycoplasma pneumoniae are associated with diseases severity and macrolide susceptibility.

    Directory of Open Access Journals (Sweden)

    Jiuxin Qu

    The clinical relevance of multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) in patients with community-acquired pneumonia (CAP) caused by Mycoplasma pneumoniae (M. pneumoniae) is unknown. A multi-center, prospective study was conducted from November 2010 to April 2012. Nine hundred and fifty-four CAP patients were consecutively enrolled. M. pneumoniae clinical isolates were obtained from throat swabs. MLVA typing was applied to all isolates. The pneumonia severity index (PSI) and clinical features were compared among patients infected with different MLVA types of M. pneumoniae. One hundred and thirty-six patients were culture-positive for M. pneumoniae. The clinical isolates were clustered into 18 MLVA types. One hundred and fourteen (88.3%) isolates were resistant to macrolide, covering major MLVA types. The macrolide non-resistant rate of M. pneumoniae isolates with an Mpn13-14-15-16 profile of 3-5-6-2 was significantly higher than that of other types (p ≤ 0.001). Patients infected with types U (5-4-5-7-2) and J (3-4-5-7-2) had significantly higher PSI scores (p<0.001) and longer total duration of cough (p = 0.011). Therefore it seems that there is a correlation between certain MLVA types and clinical severity of disease and the presence of macrolide resistance.

  11. Towards Development of Clustering Applications for Large-Scale Comparative Genotyping and Kinship Analysis Using Y-Short Tandem Repeats.

    Science.gov (United States)

    Seman, Ali; Sapawi, Azizian Mohd; Salleh, Mohd Zaki

    2015-06-01

    Y-chromosome short tandem repeats (Y-STRs) are genetic markers with practical applications in human identification. However, where mass identification is required (e.g., in the aftermath of disasters with significant fatalities), the efficiency of the process could be improved with new statistical approaches. Clustering applications are relatively new tools for large-scale comparative genotyping, and the k-Approximate Modal Haplotype (k-AMH), an efficient algorithm for clustering large-scale Y-STR data, represents a promising method for developing these tools. In this study we improved the k-AMH and produced three new algorithms: the Nk-AMH I (including a new initial cluster center selection), the Nk-AMH II (including a new dominant weighting value), and the Nk-AMH III (combining I and II). The Nk-AMH III was the superior algorithm, with mean clustering accuracy that increased in four out of six datasets and remained at 100% in the other two. Additionally, the Nk-AMH III achieved a 2% higher overall mean clustering accuracy score than the k-AMH, as well as optimal accuracy for all datasets (0.84-1.00). With inclusion of the two new methods, the Nk-AMH III produced an optimal solution for clustering Y-STR data; thus, the algorithm has potential for further development towards fully automatic clustering of any large-scale genotypic data.

  12. A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

    Science.gov (United States)

    Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

    1994-09-01

    A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.

  13. The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection.

    Directory of Open Access Journals (Sweden)

    Leila do Nascimento Vieira

    BACKGROUND: Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. METHODOLOGY/PRINCIPAL FINDINGS: The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, a loss previously reported for the plastid genomes of another Podocarpaceae (Nageia nagi) and an Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. CONCLUSION: The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in Podocarpaceae cp genomes. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of

  14. A novel monoclonal antibody to a defined peptide epitope in MUC16

    DEFF Research Database (Denmark)

    Marcos-Silva, Lara; Ricardo, Sara; Chen, Kowa

    2015-01-01

    with the tandem-repeat region, their epitopes appear to be conformational dependent and not definable by a short peptide. Aberrant glycoforms of MUC16 may constitute promising targets for diagnostic and immunotherapeutic intervention, and it is important to develop well-defined immunogens for induction of potent...... immunodominant linear peptide epitopes within the tandem repeat. We developed one monoclonal antibody, 5E11, reactive with a minimum epitope with the sequence FNTTER. This sequence contains potential N- and O-glycosylation sites and, interestingly, glycosylation blocked binding of 5E11. In immunochemistry...

  15. TGC repeat expansion in the TCF4 gene increases the risk of Fuchs' endothelial corneal dystrophy in Australian cases.

    Directory of Open Access Journals (Sweden)

    Abraham Kuot

    Fuchs' endothelial corneal dystrophy (FECD) is a progressive, vision-impairing disease. Common single nucleotide polymorphisms (SNPs) and a trinucleotide repeat polymorphism, thymine-guanine-cytosine (TGC), in the TCF4 gene have been associated with the risk of FECD in some populations. We previously reported association of SNPs in TCF4 with FECD risk in the Australian population. The aim of this study was to determine whether the TGC repeat polymorphism in TCF4 is associated with FECD in the Australian population. In 189 unrelated Australian cases with advanced late-onset FECD and 183 matched controls, the TGC repeat polymorphism located in intron 3 of TCF4 was genotyped using a short tandem repeat (STR) assay. The repeat length was verified by direct sequencing in selected homozygous carriers. We found significant association between the expanded TGC repeat (≥ 40 repeats) in TCF4 and advanced FECD (P = 2.58 × 10^-22; OR = 15.66; 95% CI: 7.79-31.49). Genotypic analysis showed that 51% of cases (97) compared to 5% of controls (9) were heterozygous or homozygous for the expanded repeat allele. Furthermore, the repeat expansion showed stronger association with the disease in the Australian cohort than the most significantly associated SNP in TCF4, rs613872. This and haplotype analysis of both polymorphisms suggest that considering the two polymorphisms together, rather than either one alone, would better predict susceptibility to FECD in the Australian population. This is the first study to report association of the TGC trinucleotide repeat expansion in TCF4 with advanced FECD in the Australian population.

  16. Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results.

    Science.gov (United States)

    Just, Rebecca S; Irwin, Jodi A

    2018-05-01

    Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the near term - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools
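
    To make the LUS idea concrete, the following Python sketch counts the longest run of consecutive copies of a repeat motif in an STR repeat region and pairs it with the length-based repeat count. It is only an illustration under stated assumptions: the function names, the tuple form of the designation and the two example alleles are hypothetical, and the sketch assumes the repeat region is a whole number of motif-length units.

```python
import re


def longest_uninterrupted_stretch(sequence, motif):
    """Return the largest number of consecutive copies of motif in sequence."""
    # A lookahead lets runs start at every position, so a run is not
    # missed because an earlier, misaligned match consumed part of it.
    pattern = re.compile(r"(?=((?:%s)+))" % re.escape(motif))
    runs = (len(m.group(1)) // len(motif) for m in pattern.finditer(sequence))
    return max(runs, default=0)


def designate_allele(repeat_region, motif):
    """Return (length-based repeat count, LUS length) for a repeat region.

    Assumes the region length is a multiple of the motif length.
    """
    units = len(repeat_region) // len(motif)
    return units, longest_uninterrupted_stretch(repeat_region, motif)


# Two hypothetical same-length alleles that differ only in sequence.
allele_a = "TCTA" * 3 + "TCTG" + "TCTA" * 8
allele_b = "TCTA" * 2 + "TCTG" + "TCTA" * 9
print(designate_allele(allele_a, "TCTA"))  # (12, 8)
print(designate_allele(allele_b, "TCTA"))  # (12, 9)
```

    Both example alleles would be called 12 by length-based typing, while the added LUS value separates them, which is the kind of same-length sequence variation the abstract describes.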

  17. BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes.

    Science.gov (United States)

    Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

    2016-07-01

    The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
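
    The N50 statistic quoted for the genome map (371 contigs, N50 of 1.3 Mb) is conventionally the length of the contig at which the cumulative length, summed from the longest contig downwards, first reaches half of the total. A minimal Python sketch of that convention follows; the function name and the toy lengths are hypothetical and unrelated to the 7DS data.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # First contig at which the cumulative length reaches half the total.
        if running >= half_total:
            return length
    raise ValueError("no contigs supplied")


# Toy contig lengths in arbitrary units.
print(n50([5, 4, 3, 2, 1]))  # 4
```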

  18. Non-invasive prenatal detection of trisomy 21 using tandem single nucleotide polymorphisms.

    Directory of Open Access Journals (Sweden)

    Sujana Ghanta

    BACKGROUND: Screening tests for Trisomy 21 (T21), also known as Down syndrome, are routinely performed for the majority of pregnant women. However, current tests rely either on evaluating non-specific markers, which lead to false negative and false positive results, or on invasive tests, which, while highly accurate, are expensive and carry a risk of fetal loss. We outline a novel, rapid, highly sensitive, and targeted approach to non-invasively detect fetal T21 using maternal plasma DNA. METHODS AND FINDINGS: Highly heterozygous tandem Single Nucleotide Polymorphism (SNP) sequences on chromosome 21 were analyzed using High-Fidelity PCR and Cycling Temperature Capillary Electrophoresis (CTCE). This approach was used to blindly analyze plasma DNA obtained from peripheral blood from 40 high risk pregnant women, in adherence to a Medical College of Wisconsin Institutional Review Board approved protocol. Tandem SNP sequences were informative when the mother was heterozygous and a third paternal haplotype was present, permitting a quantitative comparison between the maternally inherited haplotype and the paternally inherited haplotype to infer fetal chromosomal dosage by calculating a Haplotype Ratio (HR). 27 subjects were assessable; 13 subjects were not informative, either because of low DNA yield or because the tandem SNP sequences examined were not informative. All results were confirmed by a procedure (amniocentesis/CVS) or at postnatal follow-up. Twenty subjects were identified as carrying a disomy 21 fetus (with two copies of chromosome 21) and seven subjects were identified as carrying a T21 fetus. The sensitivity and the specificity of the assay were 100% when HR values lying between 3/5 and 5/3 were used as a threshold for normal subjects.
CONCLUSIONS: In summary, a targeted approach, based on calculation of Haplotype Ratios from tandem SNP sequences combined with a sensitive and quantitative DNA measurement technology can be used to accurately detect fetal
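
    The thresholding step described in the abstract can be sketched as follows. The abstract does not say how the two haplotype signals are measured or whether the bounds are inclusive, so the function names, the inclusive comparison, and the example signal values are all assumptions made for illustration.

```python
from fractions import Fraction

# Bounds taken from the abstract; treating them as inclusive is an assumption.
HR_LOWER = Fraction(3, 5)
HR_UPPER = Fraction(5, 3)


def haplotype_ratio(signal_a, signal_b):
    """Ratio of two haplotype signal measurements (hypothetical inputs)."""
    if signal_b <= 0:
        raise ValueError("denominator signal must be positive")
    return Fraction(signal_a) / Fraction(signal_b)


def classify(hr):
    """Label an HR value relative to the 3/5 to 5/3 window."""
    if HR_LOWER <= hr <= HR_UPPER:
        return "within disomy range"
    return "outside disomy range"


print(classify(haplotype_ratio(100, 100)))  # within disomy range
print(classify(haplotype_ratio(200, 100)))  # outside disomy range
```

    Exact fractions are used so that values sitting on a bound are classified deterministically.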

  19. Association of STin2 Variable Number of Tandem Repeat (VNTR) Polymorphism of Serotonin Transporter Gene with Lifelong Premature Ejaculation: A Case-Control Study in Han Chinese Subjects

    Science.gov (United States)

    Huang, Yuanyuan; Zhang, Xiansheng; Gao, Jingjing; Tang, Dongdong; Gao, Pan; Peng, Dangwei; Liang, Chaozhao

    2016-01-01

    Background: The STin2 VNTR polymorphism has a variable number of tandem repeats in intron 2 of the serotonin transporter gene. We aimed to explore the relationship between STin2 VNTR polymorphism and lifelong premature ejaculation (LPE). Material/Methods: We recruited a total of 115 outpatients who complained of ejaculating prematurely and who were diagnosed as LPE, and 101 controls without PE complaint. Allelic variations of STin2 VNTR were genotyped using PCR-based technology. We evaluated the associations between STin2 VNTR allelic and genotypic frequencies and LPE, as well as the intravaginal ejaculation latency time (IELT) of different STin2 VNTR genotypes among LPE patients. Results: The patients and controls did not differ significantly in terms of any characteristic except age. A significantly higher frequency of STin2.12/12 genotype was found among LPE patients versus controls (P=0.026). Frequency of patients carrying at least 1 copy of the 10-repeat allele was significantly lower compared to the control group (28.3% vs. 41.8%, OR=0.55; 95%CI=0.31–0.97, P=0.040). In the LPE group, the mean IELT showed significant difference in STin2.12/12 genotype when compared to those with STin2.12/10 and STin2.10/10 genotypes. The mean IELT in 10-repeat allele carriers was 50% longer compared to homozygous carriers of the STin2.12 allele. Conclusions: Our results indicate the presence of STin2.10 allele is a protective factor for LPE. Men carrying the higher expression genotype STin2.12/12 have shorter IELT than 10-repeat allele carriers. PMID:27713390
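
    The reported odds ratio can be checked from the two carrier frequencies alone. The sketch below computes a crude (unadjusted) odds ratio; the function name is illustrative, and whether the authors' figure was adjusted for covariates is not stated in the abstract.

```python
def odds_ratio(p_cases, p_controls):
    """Crude odds ratio from the proportion of carriers in each group."""
    odds_cases = p_cases / (1 - p_cases)
    odds_controls = p_controls / (1 - p_controls)
    return odds_cases / odds_controls


# Carrier frequencies of the 10-repeat allele quoted in the abstract.
print(round(odds_ratio(0.283, 0.418), 2))  # 0.55
```

    The crude value agrees with the reported OR to two decimal places.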

  20. t2prhd: a tool to study the patterns of repeat evolution

    Directory of Open Access Journals (Sweden)

    Pénzes Zsolt

    2008-01-01

    Background: The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models) have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. Results: We have developed a novel rapid visual approach to study repeat evolution that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd) available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III) domain repeats of three selected mammalian Tenascin sequences. Conclusion: Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families), as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts.

  1. New Multilocus Variable-Number Tandem-Repeat Analysis (MLVA) Scheme for Fine-Scale Monitoring and Microevolution-Related Study of Ralstonia pseudosolanacearum Phylotype I Populations

    Science.gov (United States)

    Guinard, Jérémy; Latreille, Anne; Guérin, Fabien; Poussier, Stéphane

    2016-01-01

    ABSTRACT Bacterial wilt caused by the Ralstonia solanacearum species complex (RSSC) is considered one of the most harmful plant diseases in the world. Special attention should be paid to R. pseudosolanacearum phylotype I due to its large host range, its worldwide distribution, and its high evolutionary potential. So far, the molecular epidemiology and population genetics of this bacterium are poorly understood. Until now, the genetic structure of the RSSC has been analyzed on the worldwide and regional scales. Emerging questions regarding evolutionary forces in RSSC adaptation to hosts now require genetic markers that are able to monitor RSSC field populations. In this study, we aimed to evaluate the multilocus variable-number tandem-repeat analysis (MLVA) approach for its ability to discriminate genetically close phylotype I strains and for population genetics studies. We developed a new MLVA scheme (MLVA-7) allowing us to genotype 580 R. pseudosolanacearum phylotype I strains extracted from susceptible and resistant hosts and from different habitats (stem, soil, and rhizosphere). Based on specificity, polymorphism, and the amplification success rate, we selected seven fast-evolving variable-number tandem-repeat (VNTR) markers. The newly developed MLVA-7 scheme showed higher discriminatory power than the previously published MLVA-13 scheme when applied to collections sampled from the same location on different dates and to collections from different locations on very small scales. Our study provides a valuable tool for fine-scale monitoring and microevolution-related study of R. pseudosolanacearum phylotype I populations. IMPORTANCE Understanding the evolutionary dynamics of adaptation of plant pathogens to new hosts or ecological niches has become a key point for the development of innovative disease management strategies, including durable resistance. 
Whereas the molecular mechanisms underlying virulence or pathogenicity changes have been studied thoroughly, the

  2. Multilocus Variable-Number Tandem-Repeat Analysis, Pulsed-Field Gel Electrophoresis, and Antimicrobial Susceptibility Patterns in Discrimination of Sporadic and Outbreak-Related Strains of Yersinia enterocolitica

    Directory of Open Access Journals (Sweden)

    Skurnik Mikael

    2011-02-01

    Background: We assessed the potential of multilocus variable-number tandem-repeat analysis (MLVA), pulsed-field gel electrophoresis (PFGE), and antimicrobial susceptibility testing for discriminating 104 sporadic and outbreak-related Yersinia enterocolitica (YE) bio/serotype 3-4/O:3 and 2/O:9 isolates. MLVA using six VNTR markers was performed in two separate multiplex PCRs, and the fluorescently labeled PCR products were accurately sized on an automated DNA sequencer. Results: MLVA discriminated 82 sporadic YE 3-4/O:3 and 2/O:9 strains into 77 types, whereas PFGE with the restriction enzyme NotI discriminated the strains into 23 different PFGE pulsotypes. The discriminatory index for a sporadic strain was 0.862 for PFGE and 0.999 for MLVA. MLVA confirmed that a foodborne outbreak in the city of Kotka, Finland in 2003 had been caused by a multiresistant YE 4/O:3 strain that was distinctly different from those of epidemiologically unrelated strains with an identical PFGE pulsotype. The multiresistance of Y. enterocolitica strains (19% of the sporadic strains) correlated significantly (p = 0.002) with travel abroad. All of the multiresistant Y. enterocolitica strains belonged to four PFGE pulsotypes that did not contain any susceptible strains. Resistance to nalidixic acid was related to changes in codons 83 or 87 that stemmed from mutations in the gyrA gene. The conjugation experiments demonstrated that resistance to CHL, STR, and SUL was carried by a conjugative plasmid. Conclusions: MLVA using six loci had better discriminatory power than PFGE with the NotI enzyme. MLVA was also a less labor-intensive method than PFGE and the results were easier to analyze. The conjugation experiments demonstrated that a resistance plasmid can easily be transferred between Y. enterocolitica strains. Antimicrobial multiresistance of Y. enterocolitica strains was significantly associated with travel abroad.
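
    The abstract does not name the formula behind its discriminatory index. A common choice in typing studies is the Hunter-Gaston index (Simpson's index of diversity), and the sketch below assumes that formulation; the function name and the strain labels are hypothetical and are not the study's data.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Hunter-Gaston index: probability that two strains drawn without
    replacement from the collection are assigned to different types."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two typed strains")
    counts = Counter(type_labels).values()
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1 - same_type_pairs / (n * (n - 1))


# Five hypothetical strains, two of which share type "A".
print(discriminatory_index(["A", "A", "B", "C", "D"]))  # 0.9
```

    A method that assigns every strain its own type scores 1.0, and one that lumps all strains into a single type scores 0.0.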

  3. High Sequence Variations in Mitochondrial DNA Control Region among Worldwide Populations of Flathead Mullet Mugil cephalus

    Directory of Open Access Journals (Sweden)

    Brian Wade Jamandre

    2014-01-01

    The sequence and structure of the complete mtDNA control region (CR) of M. cephalus from African, Pacific, and Atlantic populations are presented in this study to assess its usefulness in phylogeographic studies of this species. The mtDNA CR sequence variations among M. cephalus populations largely exceeded intraspecific polymorphisms that are generally observed in other vertebrates. The length of CR sequence varied among M. cephalus populations due to the presence of indels and variable number of tandem repeats at the 3′ hypervariable domain. The high evolutionary rate of the CR in this species probably originated from these mutations. However, no excessive homoplasic mutations were noticed. Finally, the star-shaped tree inferred from the CR polymorphism points to a rapid worldwide radiation in this species. The CR still appears as a good marker for phylogeographic investigations and additional worldwide samples are warranted to further investigate the genetic structure and evolution in M. cephalus.

  4. Sequence determinants of human microsatellite variability

    Directory of Open Access Journals (Sweden)

    Jakobsson Mattias

    2009-12-01

    Background: Microsatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database. Results: Calibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length), under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity. Conclusions: These results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.
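
    The calibration described above, converting a PCR fragment length into a repeat count by reference to a sequence of known length and repeat number, can be sketched as below. The function name, the decision to reject lengths that are not a whole number of units away from the reference, and the example values are assumptions for illustration.

```python
def infer_repeat_number(fragment_len, ref_fragment_len, ref_repeats, unit_len):
    """Infer repeat count, assuming any length difference from the
    reference fragment is a whole number of repeat units."""
    diff = fragment_len - ref_fragment_len
    if diff % unit_len != 0:
        raise ValueError("length difference is not a multiple of the repeat unit")
    return ref_repeats + diff // unit_len


# Hypothetical locus: the reference fragment is 200 bp and carries
# 12 copies of a 4-bp repeat unit.
observed_lengths = [192, 200, 208, 208]
repeats = [infer_repeat_number(n, 200, 12, 4) for n in observed_lengths]
print(repeats)                      # [10, 12, 14, 14]
print(sum(repeats) / len(repeats))  # 12.5
```

    Averaging the inferred counts gives the mean number of repeats rather than the mean fragment length, which is the distinction the abstract draws.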

  5. Sex-linked pheromone receptor genes of the European corn borer, Ostrinia nubilalis, are in tandem arrays.

    Directory of Open Access Journals (Sweden)

    Yuji Yasukochi

    BACKGROUND: Tuning of the olfactory system of male moths to conspecific female sex pheromones is crucial for correct species recognition; however, little is known about the genetic changes that drive speciation in this system. Moths of the genus Ostrinia are good models to elucidate this question, since significant differences in pheromone blends are observed within and among species. Odorant receptors (ORs) play a critical role in recognition of female sex pheromones; eight types of OR genes expressed in male antennae were previously reported in Ostrinia moths. METHODOLOGY/PRINCIPAL FINDINGS: We screened an O. nubilalis bacterial artificial chromosome (BAC) library by PCR, and constructed three contigs from isolated clones containing the reported OR genes. Fluorescence in situ hybridization (FISH) analysis using these clones as probes demonstrated that the largest contig, which contained eight OR genes, was located on the Z chromosome; two others harboring two and one OR genes were found on two autosomes. Sequence determination of BAC clones revealed the Z-linked OR genes were closely related and tandemly arrayed; moreover, four of them shared 181-bp direct repeats spanning exon 7 and intron 7. CONCLUSIONS/SIGNIFICANCE: This is the first report of tandemly arrayed sex pheromone receptor genes in Lepidoptera. The localization of an OR gene cluster on the Z chromosome agrees with previous findings for a Z-linked locus responsible for O. nubilalis male behavioral response to sex pheromone. The 181-bp direct repeats might enhance gene duplications by unequal crossovers. An autosomal locus responsible for male response to sex pheromone in Heliothis virescens and H. subflexa was recently reported to contain at least four OR genes. Taken together, these findings support the hypothesis that generation of additional copies of OR genes can increase the potential for male moths to acquire altered specificity for pheromone components, and accordingly

  6. A novel polymorphic repeat in the upstream regulatory region of the estrogen-induced gene EIG121 is not associated with the risk of developing breast or endometrial cancer.

    Science.gov (United States)

    Bolton, Katherine A; Holliday, Elizabeth G; Attia, John; Bowden, Nikola A; Avery-Kiejda, Kelly A; Scott, Rodney J

    2016-05-26

    The estrogen-induced gene 121 (EIG121) has been associated with breast and endometrial cancers, but its mechanism of action remains unknown. In a genome-wide search for tandem repeats, we found that EIG121 contains a short tandem repeat (STR) in its upstream regulatory region which has the potential to alter gene expression. The presence of this STR has not previously been analysed in relation to breast or endometrial cancer risk. In this study, the lengths of this STR were determined by PCR, fragment analysis and sequencing using DNA from 223 breast cancer patients, 204 endometrial cancer patients and 220 healthy controls to determine if they were associated with the risk of developing breast or endometrial cancer. We found this repeat to be highly variable with the number of copies of the AG motif ranging from 27 to 72 and having a bimodal distribution. No statistically significant association was identified between the length of this STR and the risk of developing breast or endometrial cancer or age at diagnosis. The STR in the upstream regulatory region of EIG121 is highly polymorphic, but is not associated with the risk of developing breast or endometrial cancer in the cohorts analysed here. While this polymorphic STR in the regulatory region of EIG121 appears to have no impact on the risk of developing breast or endometrial cancer, its association with disease recurrence or overall survival remains to be determined.

  7. [Rapid, simple genotyping method by the variable numbers of tandem repeats (VNTR) for Mycobacterium tuberculosis isolates in Japan--analytical procedure of JATA (12)-VNTR].

    Science.gov (United States)

    Maeda, Shinji; Murase, Yoshiro; Mitarai, Satoshi; Sugawara, Isamu; Kato, Seiya

    2008-10-01

    The discriminatory power of each locus in variable numbers of tandem repeats (VNTR) analyses was evaluated for development of the genotyping method of Mycobacterium tuberculosis (TB) in Japan. By using 325 TB strains collected from throughout Japan and 24 mass infection cases (74 isolates), IS6110 restriction fragment length polymorphism (RFLP), spoligotyping and VNTR (35 loci) were analyzed. We excluded 4 loci (VNTRs 2163a, 3232, 3820, and 4120) and selected the top 12 loci (VNTRs 0424, 0960, 1955, 2074, 2163b, 2372, 2996, 3155, 3192, 3336, 4052, and 4156). The cluster rate of IS6110 RFLP was higher than that of 12-locus [Japan Anti-Tuberculosis Association (JATA)] VNTR. In a comparison of the discriminatory power of 12-locus JATA VNTR with that of Supply (15)-VNTR, the JATA (12)-VNTR was superior, even though fewer loci were analyzed. Therefore, this JATA (12)-VNTR could be used for TB genotyping in areas where Beijing strains are prevalent.

  8. The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

    Science.gov (United States)

    Nallaseth, Ferez Soli

    The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY^Pos but not in 129/SvY^Pos derivative strains. High frequency, random, multi-locus deletion products of the feral Y^Pos-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^Pos)(male) and C57BL/6JY^Pos(male) but not in 129/SvY^Pos(male). Equal, 10^-1, 10^-2, and 0 copies (relative to males) of Y^Pos-specific deletion products respectively characterize C57BL/6JY^Pos (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^Pos-chromosomes in C57BL/6JY^Pos HC females are the preferentially deleted/rearranged Y^Pos-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1

  9. Development of Simple Sequence Repeats (SSR) markers in Setaria italica (Poaceae) and cross-amplification in related species.

    Science.gov (United States)

    Lin, Heng-Sheng; Chiang, Chih-Yun; Chang, Song-Bin; Kuoh, Chang-Sheng

    2011-01-01

    Foxtail millet is one of the world's oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR) markers of Setaria italica were developed. These polymorphic markers were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (N(a)), the average heterozygosities observed (H(o)) and expected (H(e)) are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers showed transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. They are suitable for germplasm management and protection in Poaceae.

  10. The 1.7 Å resolution structure of At2g44920, a pentapeptide-repeat protein in the thylakoid lumen of Arabidopsis thaliana

    International Nuclear Information System (INIS)

    Ni, Shuisong; McGookey, Michael E.; Tinch, Stuart L.; Jones, Alisha N.; Jayaraman, Seetharaman; Tong, Liang; Kennedy, Michael A.

    2011-01-01

    The crystal structure of At2g44920, a pentapeptide repeat protein (PRP) from Arabidopsis thaliana, has been determined at 1.7 Å resolution. The structure represents the first PRP protein whose subcellular localization has been experimentally confirmed to be the thylakoid lumen of a plant species. At2g44920 belongs to a diverse family (Pfam PF00805) of pentapeptide-repeat proteins (PRPs) that are present in all known organisms except yeast. PRPs contain at least eight tandem-repeating sequences of five amino acids with an approximate consensus sequence (STAV)(D/N)(L/F)(S/T/R)(X). Recent crystal structures show that PRPs adopt a highly regular four-sided right-handed β-helical structure consisting mainly of type II and type IV β-turns, sometimes referred to as a repeated five-residue (or Rfr) fold. Among sequenced genomes, PRP genes are most abundant in cyanobacteria, leading to speculation that PRPs play an important role in the unique lifestyle of photosynthetic cyanobacteria. Despite the recent structural characterization of several cyanobacterial PRPs, most of their functions remain unknown. Plants, whose chloroplasts are of cyanobacterial origin, have only four PRP genes in their genomes. At2g44920 is one of three PRPs located in the thylakoid lumen. Here, the crystal structure of a double methionine mutant of residues 81–224 of At2g44920, the naturally processed fragment of one of its full-length isoforms, is reported at 1.7 Å resolution. The structure of At2g44920 consists of the characteristic Rfr fold with five uninterrupted coils made up of 25 pentapeptide repeats and α-helical elements capping both termini. A disulfide bridge links the two α-helices with a conserved loop between the helical elements at its C-terminus. This structure represents the first structure of a PRP protein whose subcellular location has been experimentally confirmed to be the thylakoid lumen in a plant species
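
    The approximate consensus (STAV)(D/N)(L/F)(S/T/R)(X) maps directly onto a regular expression. The sketch below finds the longest run of back-to-back pentapeptides that match it exactly; real PRPs only approximate the consensus, so exact matching is a simplification, and the function name and test sequence are hypothetical.

```python
import re

# One pentapeptide unit: [STAV][DN][LF][STR] followed by any residue (X).
# The lookahead lets a run be detected starting at every position,
# including positions inside an earlier, shorter match.
TANDEM_RUN = re.compile(r"(?=((?:[STAV][DN][LF][STR].)+))")


def longest_tandem_pentapeptide_run(sequence):
    """Return the largest number of consecutive consensus-matching
    pentapeptides found anywhere in the sequence (0 if none)."""
    return max(
        (len(match.group(1)) // 5 for match in TANDEM_RUN.finditer(sequence)),
        default=0,
    )


# Made-up sequence: two flanking residues, three matching units, two more residues.
example = "MK" + "ADLSG" + "SNFTQ" + "TDLRA" + "PP"
print(longest_tandem_pentapeptide_run(example))  # 3
```

    A protein would meet the "at least eight tandem repeats" criterion in the abstract only if this count reached 8, though a practical search would use a profile-based method that tolerates mismatches.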

  11. Tandem ring-closing metathesis/isomerization reactions for the total synthesis of violacein

    DEFF Research Database (Denmark)

    Petersen, Mette Terp; Nielsen, Thomas Eiland

    2013-01-01

    A series of 5-substituted 2-pyrrolidinones was synthesized through a one-pot ruthenium alkylidene-catalyzed tandem RCM/isomerization/nucleophilic addition sequence. The intermediates resulting from RCM/isomerization showed reactivity toward electrophiles in aldol condensation reactions which...

  12. Interleukin 6-174 G/C promoter and variable number of tandem repeats (VNTR) gene polymorphisms in sporadic Alzheimer's disease.

    Science.gov (United States)

    Capurso, Cristiano; Solfrizzi, Vincenzo; Colacicco, Anna Maria; D'Introno, Alessia; Frisardi, Vincenza; Imbimbo, Bruno P; Lorusso, Maria; Vendemiale, Gianluigi; Denitto, Marta; Santamato, Andrea; Seripa, Davide; Pilotto, Alberto; Fiore, Pietro; Capurso, Antonio; Panza, Francesco

    2010-02-01

    Previous studies examining the association between the interleukin 6 (IL-6)-174 C/G polymorphism and Alzheimer's disease (AD) have yielded conflicting results. Furthermore, the C allele of the IL-6 variable number of tandem repeats (VNTR) polymorphism was associated with a delayed onset and a decreased risk of AD. A total sample of 149 AD patients, and 298 age- and sex-matched unrelated caregivers from Apulia, southern Italy, were genotyped for the apolipoprotein E (APOE) polymorphism, the VNTR polymorphism in the 3' flanking region, and the -174G/C single-nucleotide polymorphism (SNP) in the promoter region of IL-6 gene on chromosome 7. Furthermore, we performed a haplotype analysis on these two polymorphisms on IL-6 locus. IL-6 VNTR and -174G/C allele and genotype frequencies were similar between AD patients and controls, also after stratification for late-onset (> or =65 years) and early-onset (VNTR and -174G/C polymorphisms, not supporting a previous reported additive effect of both IL-6 polymorphisms on AD risk. Our findings did not support a role of IL-6-174 G/C and IL-6 VNTR polymorphisms in the risk of sporadic AD in southern Italy, suggesting that these polymorphisms of IL-6 gene were at most weak genetic determinants of AD. Copyright 2009 Elsevier Inc. All rights reserved.

  13. Development of a Tandem Repeat-Based Polymerase Chain Displacement Reaction Method for Highly Sensitive Detection of 'Candidatus Liberibacter asiaticus'.

    Science.gov (United States)

    Lou, Binghai; Song, Yaqin; RoyChowdhury, Moytri; Deng, Chongling; Niu, Ying; Fan, Qijun; Tang, Yan; Zhou, Changyong

    2018-02-01

    Huanglongbing (HLB) is one of the most destructive diseases in citrus production worldwide. Early detection of HLB pathogens can facilitate timely removal of infected citrus trees in the field. However, low titer and uneven distribution of HLB pathogens in host plants make reliable detection challenging. Therefore, the development of effective detection methods with high sensitivity is imperative. This study reports the development of a novel method, tandem repeat-based polymerase chain displacement reaction (TR-PCDR), for the detection of 'Candidatus Liberibacter asiaticus', a widely distributed HLB-associated bacterium. A uniquely designed primer set (TR2-PCDR-F/TR2-PCDR-1R) and a thermostable Taq DNA polymerase mutant with strand displacement activity were used for TR-PCDR amplification. Performed in a regular thermal cycler, TR-PCDR could produce more than two amplicons after each amplification cycle. Sensitivity of the developed TR-PCDR was 10 copies of target DNA fragment. The sensitivity level was shown to be 100× higher than that of conventional PCR and similar to that of real-time PCR. Data from the detection of 'Ca. L. asiaticus' in field samples using the above three methods also showed similar results. No false-positive TR-PCDR amplification was observed from healthy citrus samples and water controls. These results thereby illustrated that the developed TR-PCDR method can be applied to the reliable, highly sensitive, and cost-effective detection of 'Ca. L. asiaticus'.

  14. Detection, characterization and evolution of internal repeats in Chitinases of known 3-D structure.

    Directory of Open Access Journals (Sweden)

    Manigandan Sivaji

    Chitinase proteins have evolved and diversified in almost all organisms ranging from prokaryotes to eukaryotes. During evolution, internal repeats may appear in amino acid sequences of proteins which alter the structural and functional features. Here we deciphered the internal repeats from Chitinase and characterized the structural similarities between them. Out of 24 diverse Chitinase sequences selected, six sequences (2CJL, 2DSK, 2XVP, 2Z37, 3EBV and 3HBE) did not contain any internal repeats of amino acid sequences. Ten sequences contained repeats of length <50, and the remaining 8 sequences contained repeat length between 50 and 100 residues. Two Chitinase sequences, 1ITX and 3SIM, were found to be structurally similar when analyzed using secondary structure of Chitinase from secondary and 3-Dimensional structure database of Protein Data Bank. Internal repeats of 3N17 and 1O6I were also involved in the ligand-binding site of those Chitinase proteins, respectively. Our analyses enhance our understanding towards the identification of structural characteristics of internal repeats in Chitinase proteins.

  15. Comparative Study of IS6110 Restriction Fragment Length Polymorphism and Variable-Number Tandem-Repeat Typing of Mycobacterium tuberculosis Isolates in the Netherlands, Based on a 5-Year Nationwide Survey

    Science.gov (United States)

    de Beer, Jessica L.; van Ingen, Jakko; de Vries, Gerard; Erkens, Connie; Sebek, Maruschka; Mulder, Arnout; Sloot, Rosa; van den Brandt, Anne-Marie; Enaimi, Mimount; Kremer, Kristin; Supply, Philip

    2013-01-01

    In order to switch from IS6110 and polymorphic GC-rich repetitive sequence (PGRS) restriction fragment length polymorphism (RFLP) to 24-locus variable-number tandem-repeat (VNTR) typing of Mycobacterium tuberculosis complex isolates in the national tuberculosis control program in The Netherlands, a detailed evaluation on discriminatory power and agreement with findings in a cluster investigation was performed on 3,975 tuberculosis cases during the period of 2004 to 2008. The level of discrimination of the two typing methods did not differ substantially: RFLP typing yielded 2,733 distinct patterns compared to 2,607 in VNTR typing. The global concordance, defined as isolates labeled unique or identically distributed in clusters by both methods, amounted to 78.5% (n = 3,123). Of the remaining 855 cases, 12% (n = 479) of the cases were clustered only by VNTR, 7.7% (n = 305) only by RFLP typing, and 1.8% (n = 71) revealed different cluster compositions in the two approaches. A cluster investigation was performed for 87% (n = 1,462) of the cases clustered by RFLP. For the 740 cases with confirmed or presumed epidemiological links, 92% were concordant with VNTR typing. In contrast, only 64% of the 722 cases without an epidemiological link but clustered by RFLP typing were also clustered by VNTR typing. We conclude that VNTR typing has a discriminatory power equal to IS6110 RFLP typing but is in better agreement with findings in a cluster investigation performed on an RFLP-clustering-based cluster investigation. Both aspects make VNTR typing a suitable method for tuberculosis surveillance systems. PMID:23363841

  16. Sequence Ready Characterization of the Pericentromeric Region of 19p12

    Energy Technology Data Exchange (ETDEWEB)

    Evan E. Eichler

    2006-08-31

    Current mapping and sequencing strategies have been inadequate within the proximal portion of 19p12 due, in part, to the presence of a recently expanded ZNF (zinc-finger) gene family and the presence of large (25-50 kb) inverted beta-satellite repeat structures which bracket this tandemly duplicated gene family. The virtual absence of classically defined “unique” sequence within the region has hampered efforts to identify and characterize a suitable minimal tiling path of clones which can be used as templates required for finished sequencing of the region. The goal of this proposal is to develop and implement a novel sequence-anchor strategy to generate a contiguous BAC map of the most proximal portion of chromosome 19p12 for the purpose of complete sequence characterization. The target region will be an estimated 4.5 Mb of DNA extending from STS marker D19S450 (the beginning of the ZNF gene cluster) to the centromeric (alpha-satellite) junction of 19p11. The approach will entail 1) pre-selection of 19p12 BAC and cosmid clones (NIH approved library) utilizing both 19p12-unique and 19p12-specific repeat probes (Eichler et al., 1998); 2) the generation of a BAC/cosmid end-sequence map across the region with a density of one marker every 8 kb; 3) the development of a second generation of STS (sequence tagged sites) which will be used to identify and verify clonal overlap at the level of the sequence; 4) incorporation of these sequence-anchored overlapping clones into existing cosmid/BAC restriction maps developed at Livermore National Laboratory; and 5) validation of the organization of this region utilizing high-resolution FISH techniques (extended chromatin analysis) on monochromosomal 19 somatic cell hybrids and parental cell lines of source material. The data generated will be used in the selection of the most parsimonious tiling path of BAC clones to be sequenced as part of the JGI effort on chromosome 19 and should serve as a model for the sequence

  17. Expanded simple tandem repeat (ESTR) mutation induction in the male germline: Lessons learned from lab mice

    Energy Technology Data Exchange (ETDEWEB)

    Somers, Christopher M. [University of Regina, Department of Biology, 3737 Wascana Parkway, Regina, SK, S4S 0A2 (Canada)]. E-mail: chris.somers@uregina.ca

    2006-06-25

    Expanded simple tandem repeat (ESTR) DNA loci that are unstable in the germline have provided the most sensitive tool ever developed for investigating low-dose heritable mutation induction in laboratory mice. Ionizing radiation exposures have shown that ESTR mutations occur mainly in pre-meiotic spermatogonia and stem cells. The average spermatogonial doubling dose is 0.62-0.69 Gy for low LET, and 0.18-0.34 Gy for high LET radiation. Chemical alkylating agents also cause significant ESTR mutation induction in pre-meiotic spermatogonia and stem cells, but are much less effective per unit dose than radiation. ESTR mutation induction efficiency is maximal at low doses of radiation or chemical mutagens, and may decrease at higher dose ranges. DNA repair deficient mice (SCID and PARP-1) with elevated levels of single and double-strand DNA breaks have spontaneously elevated ESTR mutation frequencies, and surprisingly do not show additional ESTR mutation induction following irradiation. In contrast, ESTR mutation induction in p53 knock-outs is indistinguishable from that of wild-type mice. Studies of sentinel mice exposed in situ to ambient air pollution showed elevated ESTR mutation frequencies in males exposed to high levels of particulate matter. These studies highlight the application of the ESTR assay for assessing environmental hazards under real-world conditions. All ESTR studies to date have shown untargeted mutations that occur at much higher frequencies than predicted. The mechanism of this untargeted mutation induction is unknown, and must be elucidated before we can fully understand the biological significance of ESTR mutations, or use these markers for formal risk assessment. Future studies should focus on the mechanism of ESTR mutation induction, refining dose responses, and developing ESTR markers for other animal species.

  18. Expanded simple tandem repeat (ESTR) mutation induction in the male germline: Lessons learned from lab mice

    International Nuclear Information System (INIS)

    Somers, Christopher M.

    2006-01-01

    Expanded simple tandem repeat (ESTR) DNA loci that are unstable in the germline have provided the most sensitive tool ever developed for investigating low-dose heritable mutation induction in laboratory mice. Ionizing radiation exposures have shown that ESTR mutations occur mainly in pre-meiotic spermatogonia and stem cells. The average spermatogonial doubling dose is 0.62-0.69 Gy for low LET, and 0.18-0.34 Gy for high LET radiation. Chemical alkylating agents also cause significant ESTR mutation induction in pre-meiotic spermatogonia and stem cells, but are much less effective per unit dose than radiation. ESTR mutation induction efficiency is maximal at low doses of radiation or chemical mutagens, and may decrease at higher dose ranges. DNA repair deficient mice (SCID and PARP-1) with elevated levels of single and double-strand DNA breaks have spontaneously elevated ESTR mutation frequencies, and surprisingly do not show additional ESTR mutation induction following irradiation. In contrast, ESTR mutation induction in p53 knock-outs is indistinguishable from that of wild-type mice. Studies of sentinel mice exposed in situ to ambient air pollution showed elevated ESTR mutation frequencies in males exposed to high levels of particulate matter. These studies highlight the application of the ESTR assay for assessing environmental hazards under real-world conditions. All ESTR studies to date have shown untargeted mutations that occur at much higher frequencies than predicted. The mechanism of this untargeted mutation induction is unknown, and must be elucidated before we can fully understand the biological significance of ESTR mutations, or use these markers for formal risk assessment. Future studies should focus on the mechanism of ESTR mutation induction, refining dose responses, and developing ESTR markers for other animal species

  19. Multi-locus variable-number tandem repeat analysis of Chinese Brucella strains isolated from 1953 to 2013.

    Science.gov (United States)

    Tian, Guo-Zhong; Cui, Bu-Yun; Piao, Dong-Ri; Zhao, Hong-Yan; Li, Lan-Yu; Liu, Xi; Xiao, Pei; Zhao, Zhong-Zhi; Xu, Li-Qing; Jiang, Hai; Li, Zhen-Jun

    2017-05-02

    Brucellosis is a common human and livestock disease caused by Brucella strains, which are classified as category B priority pathogens by the US Centers for Disease Control and Prevention (CDC). Brucellosis has been identified as a priority disease in human and livestock populations, and its increasing incidence in China in recent years calls for urgent control measures, but the molecular background important for monitoring the epidemiology of Brucella strains at the national level is still lacking. A total of 600 Brucella isolates collected during 60 years (from 1953 to 2013) in China were genotyped by multiple locus variable-number tandem repeat analysis (MLVA), and the degree of variation of the MLVA11 loci was calculated as Hunter-Gaston Diversity Index (HGDI) values. The charts and map were processed with Excel 2013, and cluster analysis and epidemiological distribution were performed using BioNumerics (version 5.1). The 600 representative Brucella isolates fell into 104 genotypes with 58 singleton genotypes by the MLVA11 assay, including B. melitensis biovars 2 and 3 (five main genotypes), B. abortus biovars 1 and 3 (two main genotypes), B. suis biovars 1 and 3 (three main genotypes), and B. canis (two main genotypes). While most B. suis biovar 1 and biovar 3 isolates were found in northern and southern provinces, respectively, B. melitensis and B. abortus strains were dominant in China. Canine brucellosis was found only in animals, with no human cases reported. Eight brucellosis epidemic peaks emerged in China during the 60 years between 1953 and 2013: 1955-1959, 1962-1969, 1971-1975, 1977-1983, 1985-1989, 1992-1997, 2000-2008 and 2010-2013. Brucellosis has unique molecular epidemiological patterns with specific spatial and temporal distribution according to MLVA.
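
    The Hunter-Gaston index mentioned above is commonly defined as HGDI = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where N is the number of isolates and n_j is the number of isolates carrying the j-th allele or genotype. The following is a minimal sketch of that calculation; the function name and the example repeat counts are hypothetical and are not taken from the study.

```python
from collections import Counter


def hunter_gaston_index(types):
    """Hunter-Gaston diversity index for a list of allele or genotype labels.

    Returns 0.0 when every isolate has the same type and 1.0 when every
    isolate has a different type.
    """
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types).values()
    # Probability that two isolates drawn without replacement share a type.
    same_type = sum(c * (c - 1) for c in counts) / (n * (n - 1))
    return 1.0 - same_type


# Hypothetical repeat counts observed at one VNTR locus in six isolates.
example_locus = [4, 4, 5, 5, 5, 7]
print(round(hunter_gaston_index(example_locus), 3))
```

    Applied per locus, a value close to 1 suggests a locus that discriminates well between isolates, while a value close to 0 suggests a nearly monomorphic locus.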

  20. A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

    Science.gov (United States)

    Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

    2016-01-07

    The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall distribution of simple sequence repeats (SSRs) in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker ranging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.

  1. Deletion of Repeats in the Alpha C Protein Enhances the Pathogenicity of Group B Streptococci in Immune Mice

    OpenAIRE

    Gravekamp, C.; Rosner, Bernard; Madoff, L. C.

    1998-01-01

    The alpha C protein is a protective surface-associated antigen of group B streptococci (GBS). The prototype alpha C protein of GBS (strain A909) contains nine identical tandem repeats, each comprising 82 amino acids, flanked by N- and C-terminal domains. Clinical isolates of GBS show variable numbers of repeats with a normal distribution and a median of 9 to 10 repeats. Here, we show that escape mutants of GBS expressing one-repeat alpha C protein were 100-fold more pathogenic than GBS expres...

  2. Analysis of simple sequence repeats in rice bean (Vigna umbellata) using an SSR-enriched library

    Directory of Open Access Journals (Sweden)

    Lixia Wang

    2016-02-01

    Rice bean (Vigna umbellata Thunb.), a warm-season annual legume, is grown in Asia mainly for dried grain or fodder and plays an important role in human and animal nutrition because the grains are rich in protein and some essential fatty acids and minerals. With the aim of expediting the genetic improvement of rice bean, we initiated a project to develop genomic resources and tools for molecular breeding in this little-known but important crop. Here we report the construction of an SSR-enriched genomic library from DNA extracted from pooled young leaf tissues of 22 rice bean genotypes and the development of SSR markers. In 433,562 reads generated by a Roche 454 GS-FLX sequencer, we identified 261,458 SSRs, of which 48.8% were of compound form. Dinucleotide repeats were predominant with an absolute proportion of 81.6%, followed by trinucleotides (17.8%). Other types together accounted for 0.6%. The motif AC/GT accounted for 77.7% of the total, followed by AAG/CTT (14.3%), and all others accounted for 12.0%. Among the flanking sequences, 2928 matched putative genes or gene models in the protein database of Arabidopsis thaliana, corresponding with 608 non-redundant Gene Ontology terms. Of these sequences, 11.2% were involved in cellular components, 24.2% were involved in molecular functions, and 64.6% were associated with biological processes. Based on homolog analysis, 1595 flanking sequences were similar to mung bean and 500 to common bean genomic sequences. Comparative mapping was conducted using 350 sequences homologous to both mung bean and common bean sequences. Finally, a set of primer pairs were designed, and a validation test showed that 58 of 220 new primers can be used in rice bean and 53 can be transferred to mung bean. However, only 11 were polymorphic when tested on 32 rice bean varieties. We propose that this study lays the groundwork for developing novel SSR markers and will enhance the mapping of qualitative and quantitative traits and marker

  3. Single Strand Annealing Plays a Major Role in RecA-Independent Recombination between Repeated Sequences in the Radioresistant Deinococcus radiodurans Bacterium.

    Directory of Open Access Journals (Sweden)

    Solenne Ithurbide

    2015-10-01

    The bacterium Deinococcus radiodurans is one of the most radioresistant organisms known. It is able to reconstruct a functional genome from hundreds of radiation-induced chromosomal fragments. Our work aims to highlight the genes involved in recombination between 438 bp direct repeats separated by intervening sequences of various lengths ranging from 1,479 bp to 10,500 bp to restore a functional tetA gene in the presence or absence of radiation-induced DNA double strand breaks. The frequency of spontaneous deletion events between the chromosomal direct repeats was the same in recA+ and in ΔrecA, ΔrecF, and ΔrecO bacteria, whereas recombination between chromosomal and plasmid DNA was shown to be strictly dependent on the RecA and RecF proteins. The presence of mutations in one of the repeated sequences reduced, in a MutS-dependent manner, the frequency of the deletion events. The distance between the repeats did not influence the frequencies of deletion events in recA+ as well as in ΔrecA bacteria. The absence of the UvrD protein stimulated the recombination between the direct repeats, whereas the absence of the DdrB protein, previously shown to be involved in DNA double strand break repair through a single strand annealing (SSA) pathway, strongly reduced the frequency of RecA- (and RecO-) independent deletion events. The absence of the DdrB protein also increased the lethal sectoring of cells devoid of RecA or RecO protein. γ-irradiation of recA+ cells increased the frequencies of the deletion events about 10-fold, but to a lesser extent in cells devoid of the DdrB protein. Altogether, our results suggest a major role of single strand annealing in DNA repeat deletion events in bacteria devoid of the RecA protein, and also in recA+ bacteria exposed to ionizing radiation.

  4. PERF: an exhaustive algorithm for ultra-fast and efficient identification of microsatellites from large DNA sequences.

    Science.gov (United States)

    Avvaru, Akshay Kumar; Sowpati, Divya Tej; Mishra, Rakesh Kumar

    2018-03-15

    Microsatellites or Simple Sequence Repeats (SSRs) are short tandem repeats of DNA motifs present in all genomes. They have long been used for a variety of purposes in the areas of population genetics, genotyping, marker-assisted selection and forensics. Numerous studies have highlighted their functional roles in genome organization and gene regulation. Though several tools are currently available to identify SSRs from genomic sequences, they have significant limitations. We present a novel algorithm called PERF for extremely fast and comprehensive identification of microsatellites from DNA sequences of any size. PERF is several fold faster than existing algorithms and uses up to 5-fold less memory. It provides a clean and flexible command-line interface to change the default settings, and produces output in an easily-parseable tab-separated format. In addition, PERF generates an interactive and stand-alone HTML report with charts and tables for easy downstream analysis. PERF is implemented in the Python programming language. It is freely available on PyPI under the package name perf_ssr, and can be installed directly using pip or easy_install. The documentation of PERF is available at https://github.com/rkmlab/perf. The source code of PERF is deposited in GitHub at https://github.com/rkmlab/perf under an MIT license. Supplementary data are available at Bioinformatics online.
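
    To make the idea of exhaustive microsatellite identification concrete, the sketch below scans a sequence for perfect tandem repeats with motifs of 1 to 6 bases. It is a naive illustration only and is not PERF's algorithm or interface; the function name, parameters and default thresholds are assumptions chosen for the example.

```python
def find_ssrs(seq, max_motif=6, min_length=12):
    """Naively scan for perfect tandem repeats.

    Returns a list of (start, end, motif, repeat_count) tuples with
    0-based, end-exclusive coordinates. A trailing partial copy of the
    motif is included in the span but not in repeat_count.
    """
    seq = seq.upper()
    n = len(seq)
    hits = []
    i = 0
    while i < n:
        best = None  # (end, motif_length) of the longest tract starting at i
        for k in range(1, max_motif + 1):
            if i + k > n:
                break
            j = i + k
            # Extend while the sequence stays periodic with period k.
            while j < n and seq[j] == seq[j - k]:
                j += 1
            length = j - i
            # Strict '>' keeps the shortest motif when several periods tie.
            if length >= min_length and length >= 2 * k:
                if best is None or length > best[0] - i:
                    best = (j, k)
        if best is not None and set(seq[i:i + best[1]]) <= set("ACGT"):
            end, k = best
            hits.append((i, end, seq[i:i + k], (end - i) // k))
            i = end  # continue after the reported tract
        else:
            i += 1
    return hits


# Hypothetical test sequence: an (AC)8 tract and an (AAG)5 tract.
example = "GGT" + "AC" * 8 + "TTC" + "AAG" * 5 + "C"
for hit in find_ssrs(example):
    print(hit)
```

    This scan rechecks every start position and is far slower than a purpose-built tool on genome-scale input; it is meant only to show what a reported SSR record (position, motif, repeat count) represents.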

  5. DB2: a probabilistic approach for accurate detection of tandem duplication breakpoints using paired-end reads.

    Science.gov (United States)

    Yavaş, Gökhan; Koyutürk, Mehmet; Gould, Meetha P; McMahon, Sarah; LaFramboise, Thomas

    2014-03-05

    With the advent of paired-end high throughput sequencing, it is now possible to identify various types of structural variation on a genome-wide scale. Although many methods have been proposed for structural variation detection, most do not provide precise boundaries for identified variants. In this paper, we propose a new method, Distribution Based detection of Duplication Boundaries (DB2), for accurate detection of tandem duplication breakpoints, an important class of structural variation, with high precision and recall. Our computational experiments on simulated data show that DB2 outperforms state-of-the-art methods in terms of finding breakpoints of tandem duplications, with a higher positive predictive value (precision) in calling the duplications' presence. In particular, DB2's prediction of tandem duplications is correct 99% of the time even for very noisy data, while narrowing down the space of possible breakpoints within a margin of 15 to 20 bps on the average. Most of the existing methods provide boundaries in ranges that extend to hundreds of bases with lower precision values. Our method is also highly robust to varying properties of the sequencing library and to the sizes of the tandem duplications, as shown by its stable precision, recall and mean boundary mismatch performance. We demonstrate our method's efficacy using both simulated paired-end reads, and those generated from a melanoma sample and two ovarian cancer samples. Newly discovered tandem duplications are validated using PCR and Sanger sequencing. Our method, DB2, uses discordantly aligned reads, taking into account the distribution of fragment length to predict tandem duplications along with their breakpoints on a donor genome. The proposed method fine tunes the breakpoint calls by applying a novel probabilistic framework that incorporates the empirical fragment length distribution to score each feasible breakpoint. DB2 is implemented in Java programming language and is freely available

  6. [Polymorphism analysis of 20 autosomal short-tandem repeat loci in southern Chinese Han population].

    Science.gov (United States)

    Chen, Ling; Lu, Hui-Jie; Du, Wei-An; Qiu, Ping-Ming; Liu, Chao

    2016-02-20

    To evaluate the value of the PowerPlex® 21 System (Promega) and study the genetic polymorphism of its 20 short-tandem repeat (STR) loci in the southern Chinese Han population. We conducted genotyping experiments using the PowerPlex® 21 System on 20 autosomal STR loci (D3S1358, D1S1656, D6S1043, D13S317, Penta E, D16S539, D18S51, D2S1338, CSF1PO, Penta D, TH01, vWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433 and FGA) in 2367 unrelated Chinese Han individuals living in South China. The allele frequencies and parameters commonly used in forensic science were statistically analyzed in these individuals and compared with the reported data of other populations. The PowerPlex® 21 System had a power of discrimination (PD) ranging from 0.7839 to 0.9852 and a power of exclusion (PE) ranging from 0.2974 to 0.8099 for the 20 loci. No significant deviation from Hardy-Weinberg expectations was found for any of the loci except D5S818. This southern Chinese Han population had significant differences in the allele frequencies from 8 ethnic groups reported in China, and showed significant differences at 8 to 20 STR loci from 5 foreign populations. The allele frequency at the locus D1S1656 in this southern Chinese Han population differed significantly from those in the 5 foreign populations and from 3 reported Han populations in Beijing, Zhejiang Province and Fujian Province of China. The neighbor-joining phylogenetic tree showed clustering of all the Asian populations in one branch, while the northern Italian and Argentina populations clustered in a separate branch. This southern Chinese Han population had the nearest affinity with the Yi ethnic population in Yunnan Province of China. The 20 STR loci are highly polymorphic in this southern Chinese Han population, suggesting the value of this set of STR loci in forensic personal identification, paternity testing and anthropological study.

  7. Influence of IL-1RN intron 2 variable number of tandem repeats (VNTR) polymorphism on bipolar disorder.

    Science.gov (United States)

    Rafiei, A; Hosseini, S H; Taheri, M; Hosseni-khah, Z; Hajilooi, M; Mazaheri, Z

    2013-01-01

    Several lines of evidence point to the role of neurobiological mechanisms and genetic background in bipolar disorder (BD). The interleukin-1 receptor antagonist (IL-1Ra) is the principal regulator of IL-1α and IL-1β bioactivities. This study aimed to investigate the potential role of the variable number of tandem repeats (VNTR) polymorphisms of the IL-1Ra gene (IL1RN) in conferring susceptibility to BD. In total, 217 patients meeting DSM-IV-TR criteria for BD and 212 controls were recruited for the study. Genotyping of IL1RN was determined by polymerase chain reaction amplification of VNTR of 86 base pairs in intron 2 of IL1RN. The genotype distribution of IL1RN polymorphism was significantly different between BD patients and controls. The IL1RN*1/2 genotype was more prevalent in BD patients than in controls (44.2 vs. 30.2%, p = 0.003). Multiple logistic regression analysis demonstrated that IL1RN*1/2 heterozygotes had a significantly higher risk for BD (OR 1.83 and 95% CI 1.22-2.74, p = 0.003). Further stratification of the BD patients into IL1RN*2 allele carrier and noncarrier subgroups revealed a strong association between IL1RN*2 carriage and prolongation of the disease (p = 0.02). These findings suggest a positive association between VNTR polymorphism in IL1RN and BD. Additional studies, particularly with a prospective approach, are necessary to clarify the precise role of the VNTR polymorphism on the disease in different ethnic populations. Copyright © 2013 S. Karger AG, Basel.
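
    As a rough plausibility check on the reported odds ratio, the sketch below computes a crude (unadjusted) odds ratio with a Woolf-type 95% confidence interval from a 2x2 table. The genotype counts are not given in the abstract; they are back-calculated here from the reported percentages (44.2% of 217 patients, 30.2% of 212 controls) and should be treated as an assumption. The study's own estimate came from multiple logistic regression, so exact agreement is not expected.

```python
import math


def odds_ratio_ci(case_pos, case_neg, ctrl_pos, ctrl_neg, z=1.96):
    """Crude odds ratio with a Woolf (log-scale) confidence interval."""
    odds_ratio = (case_pos * ctrl_neg) / (case_neg * ctrl_pos)
    se_log = math.sqrt(1 / case_pos + 1 / case_neg + 1 / ctrl_pos + 1 / ctrl_neg)
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper


# Counts reconstructed from the reported percentages (an assumption).
patients, controls = 217, 212
patients_het = round(0.442 * patients)   # IL1RN*1/2 carriers among patients
controls_het = round(0.302 * controls)   # IL1RN*1/2 carriers among controls

or_, lo, hi = odds_ratio_ci(
    patients_het, patients - patients_het,
    controls_het, controls - controls_het,
)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

    Under these reconstructed counts the crude odds ratio comes out at about 1.83, close to the value reported in the abstract.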

  8. Automated genotyping of dinucleotide repeat markers

    Energy Technology Data Exchange (ETDEWEB)

    Perlin, M.W.; Hoffman, E.P. [Carnegie Mellon Univ., Pittsburgh, PA (United States); Univ. of Pittsburgh, PA (United States)]

    1994-09-01

    The dinucleotide repeats (i.e., microsatellites) such as CA-repeats are a highly polymorphic, highly abundant class of PCR-amplifiable markers that have greatly streamlined genetic mapping experimentation. It is expected that over 30,000 such markers (including tri- and tetranucleotide repeats) will be characterized for routine use in the next few years. Since only size determination, and not sequencing, is required to determine alleles, in principle, dinucleotide repeat genotyping is easily performed on electrophoretic gels, and can be automated using DNA sequencers. Unfortunately, PCR stuttering with these markers generates not one band for each allele, but a pattern of bands. Since closely spaced alleles must be disambiguated by human scoring, this poses a key obstacle to full automation. We have developed methods that overcome this obstacle. Our model is that the observed data is generated by arithmetic superposition (i.e., convolution) of multiple allele patterns. By quantitatively measuring the size of each component band, and exploiting the unique stutter pattern associated with each marker, closely spaced alleles can be deconvolved; this unambiguously reconstructs the “true” allele bands, with stutter artifact removed. We used this approach in a system for automated diagnosis of (X-linked) Duchenne muscular dystrophy; four multiplexed CA-repeats within the dystrophin gene were assayed on a DNA sequencer. Our method accurately detected small variations in gel migration that shifted the allele size estimate. In 167 nonmutated alleles, 89% (149/167) showed no size variation, 9% (15/167) showed 1 bp variation, and 2% (3/167) showed 2 bp variation. We are currently developing a library of dinucleotide repeat patterns; together with our deconvolution methods, this library will enable fully automated genotyping of dinucleotide repeats from sizing data.
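
    The convolution model described above can be illustrated with a small sketch. The abstract does not specify the authors' algorithm, so the code below is only one simple way to invert such a model: it assumes a known, marker-specific stutter pattern that leaks signal only into shorter bands, noise-free intensities, and bands indexed in repeat-unit steps. All names and numbers are hypothetical.

```python
def add_stutter(alleles, stutter):
    """Forward model: spread each allele's signal over shorter stutter bands.

    alleles[s] is the true signal at size index s; stutter[j] is the fraction
    of that signal observed j repeat units shorter (stutter[0] = main band).
    """
    observed = [0.0] * len(alleles)
    for s, signal in enumerate(alleles):
        for j, weight in enumerate(stutter):
            if s - j >= 0:
                observed[s - j] += signal * weight
    return observed


def remove_stutter(observed, stutter):
    """Invert the forward model by back-substitution from the longest band."""
    n = len(observed)
    alleles = [0.0] * n
    for s in range(n - 1, -1, -1):
        # Signal at band s that leaked down from already-resolved longer alleles.
        leaked = sum(
            alleles[s + j] * stutter[j]
            for j in range(1, len(stutter))
            if s + j < n
        )
        alleles[s] = (observed[s] - leaked) / stutter[0]
    return alleles


# Hypothetical example: two alleles one repeat unit apart.
true_alleles = [0.0, 0.0, 0.0, 1.0, 1.0, 0.0]
stutter_pattern = [1.0, 0.4, 0.1]
bands = add_stutter(true_alleles, stutter_pattern)
recovered = remove_stutter(bands, stutter_pattern)
print([round(x, 6) for x in recovered])
```

    With real gel data the intensities are noisy, so a least-squares or constrained fit would likely be preferable to exact back-substitution.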

  9. Peptide de novo sequencing of mixture tandem mass spectra

    DEFF Research Database (Denmark)

    Gorshkov, Vladimir; Hotta, Stéphanie Yuki Kolbeck; Braga, Thiago Verano

    2016-01-01

    they decrease the identification performance using database search engines. De novo sequencing approaches are expected to be even more sensitive to the reduction in mass spectrum quality resulting from peptide precursor co-isolation and thus prone to false identifications. The deconvolution approach matched...... complementary b-, y-ions to each precursor peptide mass, which allowed the creation of virtual spectra containing sequence specific fragment ions of each co-isolated peptide. Deconvolution processing resulted in equally efficient identification rates but increased the absolute number of correctly sequenced...... peptides. The improvement was in the range of 20–35% additional peptide identifications for a HeLa lysate sample. Some correct sequences were identified only using unprocessed spectra; however, the number of these was lower than those where improvement was obtained by mass spectral deconvolution. Tight...

  10. Development of Simple Sequence Repeats (SSR) Markers in Setaria italica (Poaceae) and Cross-Amplification in Related Species

    Directory of Open Access Journals (Sweden)

    Chih-Yun Chiang

    2011-11-01

    Foxtail millet is one of the world’s oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeat (SSR) markers of Setaria italica were developed. These markers, which showed polymorphism, were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (Na) and the average observed (Ho) and expected (He) heterozygosities are 3.73, 0.714 and 0.587, respectively. In addition, 24 SSR markers showed transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. They are suitable for germplasm management and protection in Poaceae.

  11. [Analytical procedure of variable number of tandem repeats (VNTR) analysis and effective use of analysis results for tuberculosis control].

    Science.gov (United States)

    Hachisu, Yushi; Hashimoto, Ruiko; Kishida, Kazunori; Yokoyama, Eiji

    2013-12-01

    Variable number of tandem repeats (VNTR) analysis is one of the methods for molecular epidemiological studies of Mycobacterium tuberculosis. VNTR analysis is a method based on PCR, provides rapid highly reproducible results and higher strain discrimination power than the restriction fragment length polymorphism (RFLP) analysis widely used in molecular epidemiological studies of Mycobacterium tuberculosis. Genetic lineage compositions of Mycobacterium tuberculosis clinical isolates differ among the regions from where they are isolated, and allelic diversity at each locus also differs among the genetic lineages of Mycobacterium tuberculosis. Therefore, the combination of VNTR loci that can provide high discrimination capacity for analysis is not common in every region. The Japan Anti-Tuberculosis Association (JATA) 12 (15) reported a standard combination of VNTR loci for analysis in Japan, and the combination with hypervariable (HV) loci added to JATA12 (15), which has very high discrimination capacity, was also reported. From these reports, it is thought that data sharing between institutions and construction of a nationwide database will progress from now on. Using database construction of VNTR profiles, VNTR analysis has become an effective tool to trace the route of tuberculosis infection, and also helps in decision-making in the treatment course. However, in order to utilize the results of VNTR analysis effectively, it is important that each related organization cooperates closely, and analysis should be appropriately applied in the system in which accurate control and private information protection are ensured.

  12. The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

    Science.gov (United States)

    Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

    2013-10-10

    Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unrearranged, which is consistent with the hypothesis that large IRs stabilize the cp genome and reduce gene loss-and-gain probabilities during the evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei; 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequent repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci, almost all of which are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs revealed 9 putative RNA edits, 5 of which resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa; it yielded a tree topology identical to that of previous plastid-based trees and provided strong support for the sister relationship between Ranunculaceae and Berberidaceae.

  13. Pericentric satellite DNA sequences in Pipistrellus pipistrellus (Vespertilionidae; Chiroptera).

    Science.gov (United States)

    Barragán, M J L; Martínez, S; Marchal, J A; Fernández, R; Bullejos, M; Díaz de la Guardia, R; Sánchez, A

    2003-09-01

    This paper reports the molecular and cytogenetic characterization of a HindIII family of satellite DNA in the bat species Pipistrellus pipistrellus. This satellite is organized in tandem repeats of 418 bp monomer units, and represents approximately 3% of the whole genome. The consensus sequence from five cloned monomer units has an A-T content of 62.20%. We have found differences in the ladder pattern of bands between two populations of the same species. These differences are probably because of the absence of the target sites for the HindIII enzyme in most monomer units of one population, but not in the other. Fluorescent in situ hybridization (FISH) localized the satellite DNA in the pericentromeric regions of all autosomes and the X chromosome, but it was absent from the Y chromosome. Digestion of genomic DNAs with HpaII and its isoschizomer MspI demonstrated that these repetitive DNA sequences are not methylated. Other bat species were tested for the presence of this repetitive DNA. It was absent in five Vespertilionidae and one Rhinolophidae species, indicating that it could be a species/genus specific, repetitive DNA family.

  14. Tandem Mannich/Diels–Alder reactions for the synthesis of indole compound libraries

    DEFF Research Database (Denmark)

    Wu, Peng; Petersen, Michael Åxman; Petersen, Rico

    2016-01-01

    A tandem Mannich/Diels–Alder sequence for the synthesis of small-molecule libraries with an indolyl-octahydro-3a,6-epoxy-isoindole core structure is demonstrated in this study. Representative diversification examples based on this scaffold were performed, and a library is being produced within...

  15. Lack of support for a role of the insulin gene variable number of tandem repeats minisatellite (INS-VNTR) locus in fetal growth or type 2 diabetes-related intermediate traits in United Kingdom populations.

    Science.gov (United States)

    Mitchell, Simon M S; Hattersley, Andrew T; Knight, Beatrice; Turner, Tina; Metcalf, Bradley S; Voss, Linda D; Davies, David; McCarthy, Anne; Wilkin, Terence J; Smith, George Davey; Ben-Shlomo, Yoav; Frayling, Timothy M

    2004-01-01

    The insulin gene variable number of tandem repeats minisatellite (INS-VNTR) class III allele is associated with altered fetal growth, type 2 diabetes risk (especially when paternally inherited), and insulin and IGF2 gene expression. Further studies are needed to establish the role of the INS-VNTR in fetal growth and assess whether its effects depend on the parent of origin. We analyzed the INS-VNTR-linked -23 Hph1 polymorphism in 2283 subjects, comprising 1184 children and 1099 parents. There were no differences (P VNTR was nominally associated (P VNTR in fetal growth and nominal association with type 2 diabetes-related intermediate traits.

  16. Large tandem accelerators

    International Nuclear Information System (INIS)

    Jones, C.M.

    1976-01-01

    The increasing importance of energetic heavy ion beams in the study of atomic physics, nuclear physics, and materials science has partially or wholly motivated the construction of a new generation of tandem accelerators designed to operate at maximum terminal potentials in the range 14 to 30 MV. In addition, a number of older tandem accelerators are now being significantly upgraded to improve their heavy ion performance. Both of these developments have reemphasized the importance of negative heavy ion sources. The new large tandem accelerators are described, and the requirements placed on negative heavy ion source technology by these and other tandem accelerators used for the acceleration of heavy ions are discussed. First, a brief description is given of the large tandem accelerators which have been completed recently, are under construction, or are funded for construction; second, the motivation for construction of these accelerators is discussed; and last, criteria for negative ion sources for use with these accelerators are presented.

  17. Nonlinear analysis of sequence repeats of multi-domain proteins

    Energy Technology Data Exchange (ETDEWEB)

    Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

    2007-11-15

    Many multi-domain proteins have repetitive three-dimensional structures but nearly random amino acid sequences. In the present paper, by using a modified recurrence plot that we proposed previously, we show that these amino acid sequences do in fact contain hidden repetitions. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.
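
    As a rough illustration of the recurrence-plot idea mentioned above, the following is a minimal sketch of a plain (unmodified) recurrence matrix for a sequence. It is not the authors' modified method; the function names, the exact-match criterion and the `window` parameter are assumptions made for this example only.

```python
def recurrence_matrix(seq, window=1):
    """Return R where R[i][j] is 1 when the windows starting at i and j are identical.

    Hypothetical helper: exact matching of fixed-length windows is an
    assumption, not the modified recurrence plot used in the paper.
    """
    n = len(seq) - window + 1
    words = [seq[i:i + window] for i in range(n)]
    return [[1 if words[i] == words[j] else 0 for j in range(n)] for i in range(n)]


def diagonal_score(matrix, offset):
    """Fraction of recurrent points on the diagonal that lies `offset` cells off the main one."""
    n = len(matrix)
    cells = [matrix[i][i + offset] for i in range(n - offset)]
    return sum(cells) / len(cells) if cells else 0.0


if __name__ == "__main__":
    toy = "ACDEACDEACDE"  # toy sequence with an exact period of 4
    R = recurrence_matrix(toy, window=2)
    for offset in (1, 2, 3, 4):
        print(offset, diagonal_score(R, offset))
```

    A long diagonal line at a given offset suggests a repeat with that period; real protein repeats are usually degenerate, so an exact-match criterion like this one would miss most of them.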

  18. Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

    Science.gov (United States)

    Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

    2016-09-01

    Genomic information about Muscovy duck parvovirus is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strains FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based on the protein-coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion in the ITR may contribute to the genome evolution of MDPV under immunization pressure.

  19. Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

    Science.gov (United States)

    Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

    2012-12-01

    Simple sequence repeat (SSR) markers were developed from the Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. A genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using the unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75% of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the other taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from the Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.

  20. Molecular characterization of African swine fever virus in apparently ...

    African Journals Online (AJOL)

    SAM

    2014-06-18

    Jun 18, 2014 ... Bands of correct size were excised and purified by ... acid sequences were manually aligned with gaps being inserted to .... Amino acid sequence alignment of the tetrameric tandem repeats identified within the central variable ...

  1. Tandem fusion of hepatitis B core antigen allows assembly of virus-like particles in bacteria and plants with enhanced capacity to accommodate foreign proteins.

    Directory of Open Access Journals (Sweden)

    Hadrien Peyret

    The core protein of the hepatitis B virus, HBcAg, assembles into highly immunogenic virus-like particles (HBc VLPs) when expressed in a variety of heterologous systems. Specifically, the major insertion region (MIR) on the HBcAg protein allows the insertion of foreign sequences, which are then exposed on the tips of surface spike structures on the outside of the assembled particle. Here, we present a novel strategy which aids the display of whole proteins on the surface of HBc particles. This strategy, named tandem core, is based on the production of the HBcAg dimer as a single polypeptide chain by tandem fusion of two HBcAg open reading frames. This allows the insertion of large heterologous sequences in only one of the two MIRs in each spike, without compromising VLP formation. We present the use of tandem core technology in both plant and bacterial expression systems. The results show that tandem core particles can be produced with unmodified MIRs, or with one MIR in each tandem dimer modified to contain the entire sequence of GFP or of a camelid nanobody. Both inserted proteins are correctly folded and the nanobody fused to the surface of the tandem core particle (which we name tandibody) retains the ability to bind to its cognate antigen. This technology paves the way for the display of natively folded proteins on the surface of HBc particles either through direct fusion or through non-covalent attachment via a nanobody.

  2. Phylogeny of the Serrasalmidae (Characiformes) based on mitochondrial DNA sequences

    Directory of Open Access Journals (Sweden)

    Guillermo Ortí

    2008-01-01

    Previous studies based on DNA sequences of mitochondrial (mt) rRNA genes showed three main groups within the subfamily Serrasalminae: (1) a "pacu" clade of herbivores (Colossoma, Mylossoma, Piaractus); (2) the "Myleus" clade (Myleus, Mylesinus, Tometes, Ossubtus); and (3) the "piranha" clade (Serrasalmus, Pygocentrus, Pygopristis, Pristobrycon, Catoprion, Metynnis). The genus Acnodon was placed as the sister taxon of clade (2+3). However, poor resolution within each clade was obtained due to low levels of variation among rRNA gene sequences. Complete sequences of the hypervariable mtDNA control region for a total of 45 taxa, and additional sequences of 12S and 16S rRNA from a total of 74 taxa representing all genera in the family are now presented to address intragroup relationships. Control region sequences of several serrasalmid species exhibit tandem repeats of short motifs (12 to 33 bp) in the 3' end of this region, accounting for substantial length variation. Bayesian inference and maximum parsimony analyses of these sequences identify the same groupings as before and provide further evidence to support the following observations: (a) Serrasalmus gouldingi and species of Pristobrycon (non-striolatus) form a monophyletic group that is the sister group to other species of Serrasalmus and Pygocentrus; (b) Catoprion, Pygopristis, and Pristobrycon striolatus form a well supported clade, sister to the group described above; (c) some taxa assigned to the genus Myloplus (M. asterias, M. tiete, M. ternetzi, and M. rubripinnis) form a well supported group whereas other Myloplus species remain with uncertain affinities; (d) Mylesinus, Tometes and Myleus setiger form a monophyletic group.

  3. Local repeat sequence organization of an intergenic spacer

    Indian Academy of Sciences (India)

    The amplification yielded the same uniquely "sequence-scrambled" product, whether the template used for PCR was total cellular DNA, chloroplast DNA or a plasmid clone DNA corresponding to that region. The PCR product, a "unique" new sequence, had lost the repetitive organization of the template genome where it ...

  4. Reduction of starch granule size by expression of an engineered tandem starch-binding domain in potato plants

    NARCIS (Netherlands)

    Ji, Q.; Oomen, R.J.F.J.; Vincken, J.P.; Bolam, D.N.; Gilbert, H.J.; Suurs, L.C.J.M.; Visser, R.G.F.

    2004-01-01

    Granule size is an important parameter when using starch in industrial applications. An artificial tandem repeat of a family 20 starch-binding domain (SBD2) was engineered from two copies of the SBD derived from Bacillus circulans cyclodextrin glycosyltransferase via the Pro-Thr-rich linker peptide

  5. Fingerprinting for discriminating tea germplasm using inter-simple sequence repeat (ISSR) markers

    International Nuclear Information System (INIS)

    Liu, B.Y.; Li, Y.Y.; Wang, P.S.; Wang, L.Y.; Wang, P.S.

    2012-01-01

    For the discrimination of tea germplasm at the inter-specific level, 134 tea varieties preserved in the China National Germplasm Tea Repositories (CNGTR) were analyzed using inter-simple sequence repeat (ISSR) markers. Eighteen primers were chosen from 60 screened for ISSR amplification, generating 99.4% polymorphic bands. The mean Nei's gene diversity (H) and the overall mean Shannon's Information index (I) were 0.396 and 0.578, respectively, indicating a wide gene pool. Using the presence, or sometimes the absence, of unique ISSR markers, it was possible to discriminate 32 of the genotypes tested. No single primer could discriminate all 134 genotypes. However, UBC811 provided rich band patterns and could discriminate 35 genotypes. The combination of two and three primers could discriminate 99 and 121 genotypes, respectively. Furthermore, the combination of band patterns or the DNA fingerprinting based on specific ISSR markers generated by UBC811, UBC835, ISSR2 and ISSR3 could discriminate all 134 genotypes tested. ISSR markers also provide a powerful tool to discriminate tea germplasm at the inter-specific level. (author)
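
    The two diversity statistics quoted above can be sketched per locus as follows. This is a minimal sketch assuming a two-allele locus whose allele frequency p is already known; for dominant markers such as ISSR, p has to be estimated from band frequencies first, which is not shown. The function names are hypothetical.

```python
import math


def nei_gene_diversity(p):
    """Nei's gene diversity for a two-allele locus: H = 1 - p^2 - q^2, with q = 1 - p."""
    q = 1.0 - p
    return 1.0 - p * p - q * q


def shannon_index(p):
    """Shannon's information index for a two-allele locus: I = -(p ln p + q ln q)."""
    total = 0.0
    for freq in (p, 1.0 - p):
        if freq > 0.0:  # treat 0 * ln(0) as 0
            total -= freq * math.log(freq)
    return total


def mean_over_loci(freqs, statistic):
    """Average a per-locus statistic over a list of allele frequencies."""
    return sum(statistic(p) for p in freqs) / len(freqs)


if __name__ == "__main__":
    freqs = [0.5, 0.3, 0.8]  # made-up allele frequencies, for illustration only
    print(mean_over_loci(freqs, nei_gene_diversity))
    print(mean_over_loci(freqs, shannon_index))
```

    Under these assumptions H cannot exceed 0.5 and I cannot exceed ln 2 (about 0.693) at any single locus, which is one way to read mean values such as 0.396 and 0.578 as relatively high.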

  6. Unique TTC repeat base pair loss mutation in cases of pure neural leprosy: A survival strategy of Mycobacterium leprae?

    Directory of Open Access Journals (Sweden)

    Abhishek De

    2015-01-01

    Background: Genomic reduction helps obligate intracellular microbes to survive difficult host niches. Adaptation of Mycobacterium leprae in cases of pure neural leprosy (PNL) in the intracellular niche of peripheral nerves can be associated with some gene loss. Recently, a stable but variable number of tandem repeats (TTC) have been reported in strains of M. leprae. The folP and rpoB genes are the two common mutation sites that affect the susceptibility of the bacteria to drugs. Aim: We attempted to find if genomic reduction of M. leprae in the context of these TTC repeats or mutations in folP1 and rpoB can be the reason for the restriction of M. leprae to the nerves in PNL. Materials and Methods: DNA extracts taken from fine needle aspiration of affected nerves of 24 PNL cases were studied for tandem repeats with the 21TTC primer in multiplex PCR. Mutations were also studied by PCR amplification of the SRDR (sulphone resistance determining region) of folP1 and multiple primer PCR amplification refractory mutation system (MARS) of rpoB. Results: Of the 24 PNL cases, only 1 patient showed a mutation in the rpoB gene and none in the folP1 gene. Studying the mutation in the TTC region of the M. leprae gene, we found that all the cases have a loss of a few bases in the sequence. Conclusion: We conclude that there is a consistent loss of bases in the TTC region in all cases of pure neural Hansen's disease, and we postulate that it may be an adaptive response of the bacteria to survive the host niche, resulting in its restriction to peripheral nerves.

  7. Variable Number of Tandem Repeat Markers in the Genome Sequence of Mycosphaerella Fijiensis, the Causal Agent of Black Leaf Streak Disease of Banana (Musa spp.)

    Science.gov (United States)

    Mycosphaerella fijiensis, the causal agent of banana leaf streak disease (commonly known as black Sigatoka), is the most devastating pathogen attacking bananas (Musa spp). Recently the whole genome sequence of M. fijiensis became available. This sequence was screened for the presence of Variable Num...

  8. Repeat-aware modeling and correction of short read errors.

    Science.gov (United States)

    Yang, Xiao; Aluru, Srinivas; Dorman, Karin S

    2011-02-15

    High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id = redeem". 
We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors.
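
    The baseline that the abstract describes (count k-mers in reads, then validate those whose frequency exceeds a threshold) can be sketched as below. This is only the simple counting scheme, not the repeat-aware model the authors propose; the function names and the toy reads are assumptions made for illustration.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every length-k substring observed across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def trusted_kmers(counts, threshold):
    """Return the k-mers whose observed frequency exceeds the threshold."""
    return {kmer for kmer, c in counts.items() if c > threshold}


if __name__ == "__main__":
    reads = ["ACGTA", "ACGTA", "ACGTT"]  # toy reads; the last base of the third read is a simulated error
    counts = count_kmers(reads, k=4)
    print(sorted(trusted_kmers(counts, threshold=1)))
```

    In a repeat-rich genome an erroneous k-mer that differs by a few bases from a high-copy k-mer can itself pass such a threshold, which is the failure mode the abstract sets out to address.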

  9. Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers.

    Science.gov (United States)

    Kumari, P; Basal, N; Singh, A K; Rai, V P; Srivastava, C P; Singh, P K

    2013-03-13

    The genetic diversity among 28 pea (Pisum sativum L.) genotypes was analyzed using 32 simple sequence repeat markers. A total of 44 polymorphic bands, with an average of 2.1 bands per primer, were obtained. The polymorphism information content ranged from 0.309 to 0.657 with an average of 0.493. The variation in genetic diversity among these cultivars ranged from 0.11 to 0.73. Cluster analysis based on Jaccard's similarity coefficient using the unweighted pair-group method with arithmetic mean (UPGMA) revealed 2 distinct clusters, I and II, comprising 6 and 22 genotypes, respectively. Cluster II was further differentiated into 2 subclusters, IIA and IIB, with 12 and 10 genotypes, respectively. Principal component (PC) analysis revealed results similar to those of UPGMA. The first, second, and third PCs contributed 21.6, 16.1, and 14.0% of the variation, respectively; cumulative variation of the first 3 PCs was 51.7%.
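
    Jaccard's similarity coefficient, on which the cluster analysis above is based, can be sketched for two band profiles scored as presence (1) or absence (0). The profiles below are made up for illustration and the function name is hypothetical; the UPGMA clustering step itself is not shown.

```python
def jaccard_similarity(profile_a, profile_b):
    """Jaccard coefficient: bands shared by both profiles / bands present in either."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must score the same set of bands")
    both = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    return both / either if either else 0.0


if __name__ == "__main__":
    genotype_1 = [1, 1, 0, 1, 0]  # made-up band scores
    genotype_2 = [1, 0, 0, 1, 1]
    print(jaccard_similarity(genotype_1, genotype_2))
```

    Shared absences are ignored by this coefficient, so two genotypes that both lack a band are not counted as more similar because of it.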

  10. Initial study of stability and repeatability of measuring R2' and oxygen extraction fraction values in the healthy brain with gradient-echo sampling of spin-echo sequence

    International Nuclear Information System (INIS)

    Hui Lihong; Zhang Xiaodong; He Chao; Xie Sheng; Xiao Jiangxi; Zhang jue; Wang Xiaoying; Jiang Xuexiang

    2010-01-01

    Objective: To evaluate the stability and repeatability of the gradient-echo sampling of spin-echo (GESSE) sequence in measuring the R2' value in volunteers, by comparison with the traditional GRE sequence (T2* map and T2 map). Methods: Eight healthy volunteers were enrolled in this study and written informed consent was obtained from all subjects. MR scanning including the GESSE, T2 map and T2* map sequences was performed in these subjects at rest. The same protocol was repeated one day later. Raw data from the GESSE sequence were transferred to a PC for postprocessing with software built in house, yielding the R2' map and OEF map. To obtain quantitative R2' and OEF values in the brain parenchyma, six ROIs were equally placed in the anterior, middle and posterior parts of the bilateral hemispheres. Both the mean and standard deviation of R2' and OEF were recorded. All images from the T2* map and T2 map were transferred to the workstation for postprocessing. The ROIs were placed in the same areas as those for the GESSE sequence. R2' is defined as R2' = R2* - R2, with R2* = 1/T2*. The R2' values of the GESSE sequence were compared with those of the GRE sequence. Results: The mean R2' values of GESSE at the first and second scans and those of GRE were (4.21±0.92), (4.45±0.94) Hz and (7.37±1.47), (6.42±2.33) Hz respectively. The mean OEF values of GESSE at the first and second scans were 0.327±0.036 and 0.336±0.035 respectively. The R2' and OEF values obtained from GESSE were not significantly different between the first and second scans (t=-0.83, -1.48, P>0.05). The R2' value of the first GRE imaging differed significantly from that of the second GRE imaging (t=1.80, P ... R2' value of the GESSE sequence was less than that of the GRE sequence, and there was a statistically significant difference between them (t=1.71, P<0.05). Conclusion: The GESSE sequence has good stability and repeatability with promising clinical practicability
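
    The definition R2' = R2* - R2 given in the abstract can be written out as a small calculation. This is a minimal sketch that additionally assumes R2 = 1/T2 (the abstract states only R2* = 1/T2*) and takes relaxation times in seconds so that rates come out in Hz; the function name and the example values are hypothetical, not measurements from the study.

```python
def r2_prime_hz(t2_star_s, t2_s):
    """Return R2' = R2* - R2 in Hz, with R2* = 1/T2* and (assumed) R2 = 1/T2."""
    if t2_star_s <= 0 or t2_s <= 0:
        raise ValueError("relaxation times must be positive")
    r2_star = 1.0 / t2_star_s
    r2 = 1.0 / t2_s
    return r2_star - r2


if __name__ == "__main__":
    # Illustrative values only: T2* = 50 ms, T2 = 80 ms.
    print(r2_prime_hz(0.050, 0.080))
```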

  11. Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

    Science.gov (United States)

    Cech, Jennifer N; Peichel, Catherine L

    2015-12-01

    Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chromatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.

  12. Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

    Directory of Open Access Journals (Sweden)

    Huaiyong Luo

    The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers was established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.

  13. Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

    Science.gov (United States)

    Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

    2015-01-01

    The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers was established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
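
    Locating di- and tri-nucleotide SSR loci in a sequence, as described above, can be sketched with a regular expression. The abstract does not name the tool or the thresholds used, so the function name and the minimum repeat counts below (6 for di-, 5 for tri-nucleotide units) are assumptions chosen only for illustration.

```python
import re


def find_ssrs(seq, min_repeats=None):
    """Return (start, unit, copies) for perfect di- and tri-nucleotide repeats.

    `min_repeats` maps unit length to the minimum copy number; the defaults
    are illustrative assumptions, not the thresholds used in the study.
    """
    if min_repeats is None:
        min_repeats = {2: 6, 3: 5}
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # One unit followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            unit = match.group(2)
            if len(set(unit)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((match.start(), unit, len(match.group(1)) // unit_len))
    return sorted(hits)


if __name__ == "__main__":
    toy = "GGC" + "AT" * 7 + "CCG" + "GAA" * 5 + "TT"  # made-up sequence
    for start, unit, copies in find_ssrs(toy):
        print(start, unit, copies)
```

    Dividing the assembled genome length by the number of loci found gives a marker density in the same form as the "one SSR per 22.95 kb" figure quoted above.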

  14. DEVELOPMENT OF A MULTIPLE-LOCUS VARIABLE NUMBER OF TANDEM REPEAT ANALYSIS (MLVA) FOR HELICOBACTER PYLORI AND ITS APPLICATION TO HELICOBACTER PYLORI ISOLATES FROM ROSTOV REGION, RUSSIA

    Directory of Open Access Journals (Sweden)

    Sorokin VM

    2012-09-01

    Stomach infection with Helicobacter pylori (H. pylori) is the second most common infectious disease of humans. The severe pathological consequences of this infection include gastric and duodenal ulcer disease, the development of gastric mucosal atrophy, gastric carcinoma, and, more rarely, malignant lymphoma. H. pylori infections cause very high morbidity and mortality and are of particular concern in developing countries, where H. pylori prevalences as high as 90% have been reported. The population of H. pylori shows high genomic variability among isolates, and polymorphism of genomic repeat units has played an important part in its evolution. A variety of molecular typing tools have been developed to assess genetic relatedness in H. pylori isolates. However, there is still no standard genotyping system for this bacterium. The MLVA (multiple-locus variable number of tandem repeat analysis) method is useful for performing phylogenetic analysis and is widely used in bacterial genotyping; however, it has seen little application in H. pylori analysis. This article is the first application of the MLVA method to investigate H. pylori isolates in Russia. MLVA of 4 VNTR loci with high discrimination power, selected from 10 candidates, was performed on a collection of 22 strains of H. pylori originating from the Rostov region of Russia. This method provides a starting point on which improvements to the method and comparisons to other techniques can be made.

  15. Molecular characterization and chromosomal distribution of a species-specific transcribed centromeric satellite repeat from the olive fruit fly, Bactrocera oleae.

    Directory of Open Access Journals (Sweden)

    Konstantina T Tsoumani

    Satellite repetitive sequences that accumulate in the heterochromatin constitute a large fraction of a genome and, owing to their properties, are suggested to be implicated in centromere function. Current knowledge of the heterochromatic regions of the genome of Bactrocera oleae, the major pest of the olive tree, is practically nonexistent. In our effort to explore the repetitive DNA portion of the B. oleae genome, a novel satellite sequence designated BoR300 was isolated and cloned. The present study describes the genomic organization, abundance and chromosomal distribution of BoR300, which is organized in tandem, forming arrays of 298 bp-long monomers. Sequence analysis showed an AT content of 60.4%, a CENP-B-like motif and a high curvature value based on predictive models. Comparative analysis among randomly selected monomers demonstrated a high degree of sequence homogeneity (88%-97%) of BoR300 repeats, which are present at approximately 3,000 copies per haploid genome, accounting for about 0.28% of the total genomic DNA, based on two independent qPCR approaches. In addition, expression of the repeat was also confirmed through RT-PCR, by which BoR300 transcripts were detected in both sexes. Fluorescence in situ hybridization (FISH) of BoR300 on mitotic metaphases and polytene chromosomes revealed signals at the centromeres of two out of the six chromosomes, which indicated a chromosome-specific centromeric localization. Moreover, BoR300 is not conserved in the closely related Bactrocera species tested and is also absent in other dipterans; it is instead restricted to the B. oleae genome. This feature of species-specificity attributed to the BoR300 satellite makes it a good candidate as an identification probe of the insect among its relatives at early developmental stages.
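
    The abundance figures quoted above (about 3,000 copies of a 298 bp monomer, about 0.28% of the genomic DNA) can be related by simple arithmetic. This is a minimal sketch with hypothetical function names; the implied genome size it prints is a back-calculation from the rounded figures in the abstract, not a value reported by the authors.

```python
def array_length_bp(copies, monomer_bp):
    """Total length of satellite DNA: copy number times monomer length."""
    return copies * monomer_bp


def implied_genome_size_bp(copies, monomer_bp, genome_fraction):
    """Haploid genome size implied by the satellite's share of genomic DNA."""
    return array_length_bp(copies, monomer_bp) / genome_fraction


if __name__ == "__main__":
    total = array_length_bp(3000, 298)
    print(total)  # bp of BoR300 per haploid genome
    print(round(implied_genome_size_bp(3000, 298, 0.0028) / 1e6, 1))  # Mb
```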

  16. DNA radio-induced tandem lesions: formation, introduction in oligonucleotides and repair

    International Nuclear Information System (INIS)

    Bourdat, Anne-Gaelle

    2000-01-01

    -oxodGuo-dβF were found to be generated. Interestingly, 8-oxodGuo-dβF was produced in a much higher yield than the reversed sequence lesion. In addition, indirect evidence is provided for the formation of other tandem lesions involving 8-oxodGuo. (author) [fr

  17. [Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

    Science.gov (United States)

    Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

    2015-04-01

    This study aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group as those of the downstream ones. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences fell into the same group as their corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain a "stem" and a "ring". Some spacers were found to be homologous to partial sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.

  18. The use of mycobacterial interspersed repetitive unit typing and whole genome sequencing to inform tuberculosis prevention and control activities.

    Science.gov (United States)

    Gilbert, Gwendolyn L; Sintchenko, Vitali

    2013-07-01

    Molecular strain typing of Mycobacterium tuberculosis has been possible for only about 20 years; it has significantly improved our understanding of the evolution and epidemiology of Mycobacterium tuberculosis and tuberculosis disease. Mycobacterial interspersed repetitive unit typing, based on 24 variable number tandem repeat unit loci, is highly discriminatory, relatively easy to perform and interpret and is currently the most widely used molecular typing system for tuberculosis surveillance. Nevertheless, clusters identified by mycobacterial interspersed repetitive unit typing sometimes cannot be confirmed or adequately defined by contact tracing and additional methods are needed. Recently, whole genome sequencing has been used to identify single nucleotide polymorphisms and other mutations, between genotypically indistinguishable isolates from the same cluster, to more accurately trace transmission pathways. Rapidly increasing speed and quality and reduced costs will soon make large scale whole genome sequencing feasible, combined with the use of sophisticated bioinformatics tools, for epidemiological surveillance of tuberculosis.
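    As a rough illustration of how typing profiles define clusters, the sketch below groups isolates whose repeat counts are identical at all 24 loci. The isolate names and repeat counts are invented for the example, and real surveillance pipelines may apply additional rules that are not modelled here.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

N_LOCI = 24  # number of VNTR loci in the typing scheme described above


def cluster_by_profile(profiles: Dict[str, Tuple[int, ...]]) -> List[List[str]]:
    """Group isolates that share an identical repeat-count profile.

    Only groups with two or more isolates are returned as clusters.
    """
    groups = defaultdict(list)
    for isolate, profile in profiles.items():
        if len(profile) != N_LOCI:
            raise ValueError(f"{isolate}: expected {N_LOCI} loci")
        groups[profile].append(isolate)
    return [sorted(members) for members in groups.values() if len(members) > 1]


# Hypothetical isolates: iso3 differs from the others at the last locus.
base = (2,) * N_LOCI
profiles = {
    "iso1": base,
    "iso2": base,
    "iso3": base[:-1] + (3,),
}
print(cluster_by_profile(profiles))  # [['iso1', 'iso2']]
```

    Isolates in the same cluster are genotypically indistinguishable by this method, which is the situation where the abstract suggests whole genome sequencing adds resolution.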

  19. Complete Chloroplast Genome Sequences and Comparative Analysis of Chenopodium quinoa and C. album.

    Science.gov (United States)

    Hong, Su-Young; Cheon, Kyeong-Sik; Yoo, Ki-Oug; Lee, Hyun-Oh; Cho, Kwang-Soo; Suh, Jong-Taek; Kim, Su-Jeong; Nam, Jeong-Hwan; Sohn, Hwang-Bae; Kim, Yul-Ho

    2017-01-01

    The Chenopodium genus comprises ~150 species, including Chenopodium quinoa and Chenopodium album, two important crops with high nutritional value. To elucidate the phylogenetic relationship between the two species, the complete chloroplast (cp) genomes of these species were obtained by next generation sequencing. We performed comparative analysis of the sequences and, using InDel markers, inferred phylogeny and genetic diversity of the Chenopodium genus. The cp genome is 152,099 bp (C. quinoa) and 152,167 bp (C. album) long. In total, 119 genes (78 protein-coding, 37 tRNA, and 4 rRNA) were identified. We found 14 (C. quinoa) and 15 (C. album) tandem repeats (TRs); 14 TRs were present in both species and C. album and C. quinoa each had one species-specific TR. The trnI-GAU intron sequences contained one (C. quinoa) or two (C. album) copies of TRs (66 bp); the InDel marker was designed based on the copy number variation in TRs. Using the InDel markers, we detected this variation in the TR copy number in four species, Chenopodium hybridum, Chenopodium pumilio, Chenopodium ficifolium, and Chenopodium koraiense, but not in Chenopodium glaucum. A comparison of coding and non-coding regions between C. quinoa and C. album revealed divergent sites. Nucleotide diversity >0.025 was found in 17 regions: 14 were located in the large single copy region (LSC), one in the inverted repeats, and two in the small single copy region (SSC). A phylogenetic analysis based on 59 protein-coding genes from 25 taxa resolved Chenopodioideae monophyletic and sister to Betoideae. The complete plastid genome sequences and molecular markers based on divergence hotspot regions in the two Chenopodium taxa will help to resolve the phylogenetic relationships of Chenopodium.
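    The InDel marker described above rests on a simple idea: one versus two back-to-back copies of a repeat unit changes the amplicon length by one unit. A minimal sketch of counting tandem copies follows. The 6-bp unit and flanking bases are hypothetical stand-ins for the 66-bp repeat, whose sequence is not given in the abstract.

```python
def max_tandem_copies(seq: str, unit: str) -> int:
    """Return the largest number of back-to-back copies of `unit` in `seq`."""
    if not unit:
        raise ValueError("unit must be non-empty")
    best = 0
    start = seq.find(unit)
    while start != -1:
        copies = 0
        pos = start
        # Walk forward one unit at a time while the repeat continues.
        while seq.startswith(unit, pos):
            copies += 1
            pos += len(unit)
        best = max(best, copies)
        start = seq.find(unit, start + 1)
    return best


UNIT = "ACGTTG"                      # hypothetical stand-in for the 66-bp TR
one_copy = "GG" + UNIT + "CC"        # allele with a single copy
two_copy = "GG" + UNIT * 2 + "CC"    # allele with two tandem copies

print(max_tandem_copies(one_copy, UNIT))   # 1
print(max_tandem_copies(two_copy, UNIT))   # 2
print(len(two_copy) - len(one_copy))       # 6, i.e. one repeat unit
```

    With the real 66-bp unit, the same one-copy difference would shift the amplicon by 66 bp.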

  20. A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli

    Directory of Open Access Journals (Sweden)

    Li Mingji

    2012-02-01

    Background: For metabolic engineering, many rate-limiting steps may exist in the pathways that accumulate the target metabolites. Increasing the copy number of the desired genes in these pathways is a general method to solve the problem, for example, by employing a multi-copy plasmid-based expression system. However, this method may bring genetic instability, structural instability and metabolic burden to the host, while integrating the desired gene into the chromosome may cause inadequate transcription or expression. In this study, we developed a strategy for obtaining gene overexpression by engineering promoter clusters consisting of multiple core-tac-promoters (MCPtacs) in tandem. Results: Through a uniquely designed in vitro assembling process, a series of promoter clusters were constructed. The transcription strength of these promoter clusters showed a stepwise enhancement with the increase in the number of tandem repeats until it reached the critical value of five. Application of the MCPtacs promoter clusters in polyhydroxybutyrate (PHB) production proved that it was efficient. Integration of the phaCAB genes with the 5CPtacs promoter cluster resulted in an engineered E. coli that can accumulate 23.7% PHB of the cell dry weight in batch cultivation. Conclusions: The transcription strength of the MCPtacs promoter cluster can be greatly improved by increasing the number of tandem repeats of the core-tac-promoter. By integrating the desired gene together with the MCPtacs promoter cluster into the chromosome of E. coli, we can achieve high and stable overexpression with only a small size. This strategy has application potential in many fields and can be extended to other bacteria.

  1. Utilization of a cloned alphoid repeating sequence of human DNA in the study of polymorphism of chromosomal heterochromatin regions

    International Nuclear Information System (INIS)

    Kruminya, A.R.; Kroshkina, V.G.; Yurov, Yu.B.; Aleksandrov, I.A.; Mitkevich, S.P.; Gindilis, V.M.

    1988-01-01

    The chromosomal distribution of the cloned PHS05 fragment of human alphoid DNA was studied by in situ hybridization in 38 individuals. It was shown that this DNA fraction is primarily localized in the pericentric regions of practically all chromosomes of the set. Significant interchromosomal differences and a weakly expressed interindividual polymorphism were discovered in the copying ability of this class of repeating DNA sequences; associations were not found between the results of hybridization and the pattern of Q-polymorphism

  2. Repetitive sequences and epigenetic modification: inseparable partners play important roles in the evolution of plant sex chromosomes.

    Science.gov (United States)

    Li, Shu-Fen; Zhang, Guo-Jun; Yuan, Jin-Hong; Deng, Chuan-Liang; Gao, Wu-Jun

    2016-05-01

    The present review discusses the roles that repetitive sequences play in plant sex chromosome evolution, and highlights epigenetic modification as a potential mechanism by which repetitive sequences are involved in sex chromosome evolution. Sex determination in plants is mostly based on sex chromosomes. Classic theory proposes that sex chromosomes evolve from a specific pair of autosomes with the emergence of a sex-determining gene(s). Subsequently, the newly formed sex chromosomes stop recombining in a small region around the sex-determining locus, and over time, the non-recombining region expands to almost all parts of the sex chromosomes. Accumulation of repetitive sequences, mostly transposable elements and tandem repeats, is a conspicuous feature of the non-recombining region of the Y chromosome, even in primitive ones. Repetitive sequences may play multiple roles in sex chromosome evolution, such as triggering heterochromatization and causing recombination suppression, leading to structural and morphological differentiation of sex chromosomes, and promoting Y chromosome degeneration and X chromosome dosage compensation. In this article, we review the current status of this field, and based on preliminary evidence, we posit that repetitive sequences are involved in sex chromosome evolution probably via epigenetic modification, such as DNA and histone methylation, with small interfering RNAs as the mediator.

  3. Myelodysplastic syndromes and acute myeloid leukemia in cats infected with feline leukemia virus clone33 containing a unique long terminal repeat.

    Science.gov (United States)

    Hisasue, Masaharu; Nagashima, Naho; Nishigaki, Kazuo; Fukuzawa, Isao; Ura, Shigeyoshi; Katae, Hiromi; Tsuchiya, Ryo; Yamada, Takatsugu; Hasegawa, Atsuhiko; Tsujimoto, Hajime

    2009-03-01

    Feline leukemia virus (FeLV) clone33 was obtained from a domestic cat with acute myeloid leukemia (AML). The long terminal repeat (LTR) of this virus, like the LTRs present in FeLV from other cats with AML, differs from the LTRs of other known FeLV in that it has 3 tandem direct 47-bp repeats in the upstream region of the enhancer (URE). Here, we injected cats with FeLV clone33 and found that 41% developed myelodysplastic syndromes (MDS) characterized by peripheral blood cytopenias and dysplastic changes in the bone marrow. Some of the cats with MDS eventually developed AML. The bone marrow of the majority of cats with FeLV clone33-induced MDS produced fewer erythroid and myeloid colonies upon being cultured with erythropoietin or granulocyte-macrophage colony-stimulating factor (GM-CSF) than bone marrow from normal control cats. Furthermore, the bone marrow of some of the cats expressed high levels of the apoptosis-related genes TNF-alpha and survivin. Analysis of the proviral sequences obtained from 13 cats with naturally occurring MDS revealed that they also bear the characteristic URE repeats seen in the LTR of FeLV clone33 and other proviruses from cats with AML. Deletions and mutations within the enhancer elements are frequently observed in naturally occurring MDS as well as AML. These results suggest that FeLV variants that bear URE repeats in their LTR strongly associate with the induction of both MDS and AML in cats.

  4. In silico reversal of repeat-induced point mutation (RIP) identifies the origins of repeat families and uncovers obscured duplicated genes

    Directory of Open Access Journals (Sweden)

    Hane James K

    2010-11-01

    Background: Repeat-induced point mutation (RIP) is a fungal genome defence mechanism guarding against transposon invasion. RIP mutates the sequence of repeated DNA and over time renders the affected regions unrecognisable by similarity search tools such as BLAST. Results: DeRIP is a new software tool developed to predict the original sequence of a RIP-mutated region prior to the occurrence of RIP. In this study, we apply deRIP to the genome of the wheat pathogen Stagonospora nodorum SN15 and predict the origin of several previously uncharacterised classes of repetitive DNA. Conclusions: Five new classes of transposon repeats and four classes of endogenous gene repeats were identified after deRIP. The deRIP process is a new tool for fungal genomics that facilitates the identification and understanding of the role and origin of fungal repetitive DNA. DeRIP is open-source and is available as part of the RIPCAL suite at http://www.sourceforge.net/projects/ripcal.
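    To give a feel for what "reversing" RIP can mean, here is a deliberately simplified sketch. It is not the deRIP or RIPCAL implementation. It assumes only that RIP introduces C-to-T transitions (seen as G-to-A on the complementary strand), and that the repeat copies are already aligned without gaps. At alignment columns where the copies show only C and T, it predicts C, at columns showing only G and A it predicts G, and elsewhere it falls back to the majority base.

```python
from collections import Counter
from typing import Sequence


def derip_consensus(aligned: Sequence[str]) -> str:
    """Predict a pre-RIP sequence from gap-free aligned repeat copies.

    Simplified illustration: C/T columns revert to C, G/A columns to G,
    all other columns take the most common base.
    """
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must have equal length")

    predicted = []
    for column in zip(*aligned):
        bases = [b.upper() for b in column]
        observed = set(bases)
        if observed == {"C", "T"}:
            predicted.append("C")   # assume T arose from C by RIP
        elif observed == {"G", "A"}:
            predicted.append("G")   # same change seen on the other strand
        else:
            predicted.append(Counter(bases).most_common(1)[0][0])
    return "".join(predicted)


# Toy alignment of three repeat copies (invented for illustration).
copies = ["ACGTCA", "ATGTTA", "ACATCA"]
print(derip_consensus(copies))  # ACGTCA
```

    A realistic tool would also need to handle alignment gaps and genuine C/T polymorphism, which this sketch ignores.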

  5. THE USE OF INTER SIMPLE SEQUENCE REPEATS (ISSR) IN DISTINGUISHING NEIGHBORING DOUGLAS-FIR TREES AS A MEANS TO IDENTIFYING TREE ROOTS WITH ABOVE-GROUND BIOMASS

    Science.gov (United States)

    We are attempting to identify specific root fragments from soil cores with individual trees. We successfully used Inter Simple Sequence Repeats (ISSR) to distinguish neighboring old-growth Douglas-fir trees from one another, while maintaining identity among each tree's parts. W...

  6. Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

    Science.gov (United States)

    Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

    2017-04-01

    5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  7. Rate-determining Step of Flap Endonuclease 1 (FEN1) Reflects a Kinetic Bias against Long Flaps and Trinucleotide Repeat Sequences.

    Science.gov (United States)

    Tarantino, Mary E; Bilotti, Katharina; Huang, Ji; Delaney, Sarah

    2015-08-21

    Flap endonuclease 1 (FEN1) is a structure-specific nuclease responsible for removing 5'-flaps formed during Okazaki fragment maturation and long patch base excision repair. In this work, we use rapid quench flow techniques to examine the rates of 5'-flap removal on DNA substrates of varying length and sequence. Of particular interest are flaps containing trinucleotide repeats (TNR), which have been proposed to affect FEN1 activity and cause genetic instability. We report that FEN1 processes substrates containing flaps of 30 nucleotides or fewer at comparable single-turnover rates. However, for flaps longer than 30 nucleotides, FEN1 kinetically discriminates substrates based on flap length and flap sequence. In particular, FEN1 removes flaps containing TNR sequences at a rate slower than mixed sequence flaps of the same length. Furthermore, multiple-turnover kinetic analysis reveals that the rate-determining step of FEN1 switches as a function of flap length from product release to chemistry (or a step prior to chemistry). These results provide a kinetic perspective on the role of FEN1 in DNA replication and repair and contribute to our understanding of FEN1 in mediating genetic instability of TNR sequences. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  8. Loss of heterozygosity at thymidylate synthase locus in Barrett's metaplasia, dysplasia, and carcinoma sequences

    Directory of Open Access Journals (Sweden)

    Vallbohmer Daniel

    2009-05-01

    Background: Thymidylate synthase (TS) is known to have a unique 28 bp tandemly repeated sequence in the promoter region, and the majority of subjects have a heterozygous double repeat/triple repeat genotype in their non-cancerous tissue. Loss of heterozygosity (LOH) at the TS locus is known to occur in cancer patients, but there is no evidence that it is present in precancerous tissue. The aim of this study was to analyze the frequency and timing of LOH at the TS locus in Barrett-associated adenocarcinoma (BA) and its precursor lesions, such as intestinal metaplasia (IM) and dysplasia. Methods: One hundred twenty-three samples (including 37 with gastroesophageal reflux disease (GERD), 29 with IM, 13 with dysplasia, and 44 with BA) were obtained from 100 patients. Biopsies were obtained from the lower esophageal mucosa/IM/dysplasia/BA, when available. Normal squamous tissue from the upper esophagus was taken as a control. All tissues were analyzed for the TS genotype and TS mRNA expression using the real-time reverse-transcription polymerase chain reaction (RT-PCR) method after laser-capture microdissection. Results: Among the patients with an informative heterozygous genotype in their control samples, no sample with LOH at the TS locus was observed in the lower esophageal mucosa in GERD patients (0/22 samples). However, 6 out of 21 samples (28.6%) had LOH in IM, 2 of 7 (28.6%) in dysplasia, and 10 of 25 (40.0%) in BA. No significant difference in TS mRNA expression levels was observed between TS genotypes. Conclusion: Our results demonstrate that LOH is a relatively frequent and early event in the IM-BA sequence.
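    The frequencies reported above follow directly from the counts given in the abstract. The short check below recomputes them. The dictionary layout and function name are choices made for this example, not part of the study.

```python
# (samples with LOH, informative samples) per tissue type, from the abstract.
loh_counts = {
    "GERD": (0, 22),
    "IM": (6, 21),
    "dysplasia": (2, 7),
    "BA": (10, 25),
}

# Sample totals per group, from the abstract's Methods.
samples_per_group = {"GERD": 37, "IM": 29, "dysplasia": 13, "BA": 44}


def loh_percent(events: int, informative: int) -> float:
    """LOH frequency as a percentage, rounded to one decimal place."""
    return round(100 * events / informative, 1)


for tissue, (events, informative) in loh_counts.items():
    print(tissue, loh_percent(events, informative))

print(sum(samples_per_group.values()))  # 123
```

    The group sizes sum to the 123 samples stated in the abstract, and the IM and dysplasia rates coincide at 28.6% because 6/21 and 2/7 are the same fraction.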

  9. A study of reflex tandem accelerator

    Energy Technology Data Exchange (ETDEWEB)

    Nakajima, Takao; Morinobu, Shunpei; Gono, Yasuyuki; Sagara, Kenji; Sugimitsu, Tsuyoshi; Mitarai, Shiro; Nakamura, Hiroyuki; Ikeda, Nobuo; Morikawa, Tsuneyasu [Kyushu Univ., Fukuoka (Japan). Faculty of Science

    1996-12-01

    An investigation of "developing research themes and the experimental apparatus to realize them" based on the tandem accelerator facility was carried out. Recognizing that preparation, improvement, or novel technical development capable of greatly increasing the capability of the tandem accelerator facility is essential to forming a highly distinctive COE, numerous ideas were proposed, investigated and surveyed. This paper presents the results of considering "beam reacceleration using the tandem accelerator", as follows: (1) Short-lived unstable nuclei formed by nuclear reactions of the tandem-accelerated primary beam are negatively ionized and reaccelerated using the same tandem accelerator. (2) By attaching several electrons to the tandem-accelerated primary beam, its charge number is reduced and it is reaccelerated by the tandem. (G.K.)

  10. Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations

    Directory of Open Access Journals (Sweden)

    Paolo Annicchiarico

    2016-07-01

    Cultivar registration agencies typically require morphophysiological trait-based distinctness of candidate cultivars. This requirement is difficult to achieve for cultivars of major perennial forages because of their genetic structure and the ever-increasing number of registered materials, leading to possible rejection of agronomically valuable cultivars. This study aimed to explore the value of molecular markers applied to replicated bulked plants (three bulks of 100 independent plants each per cultivar) to assess alfalfa (Medicago sativa L. subsp. sativa) cultivar distinctness. We compared genotyping-by-sequencing information based on 2902 polymorphic single-nucleotide polymorphism (SNP) markers (>30 reads per DNA sample) with morphophysiological information based on 11 traits and with simple-sequence repeat (SSR) marker information from 41 polymorphic markers for their ability to distinguish 11 alfalfa landraces representative of the germplasm from northern Italy. Three molecular criteria, one based on cultivar differences for individual SSR bands and two based on overall SNP marker variation assessed either by statistically significant cultivar differences on principal component axes or discriminant analysis, distinctly outperformed the morphophysiological criterion. Combining the morphophysiological criterion with either molecular marker method increased discrimination among cultivars, since morphophysiological diversity was unrelated to SSR marker-based diversity (r = 0.04) and poorly related to SNP marker-based diversity (r = 0.23, P < 0.15). The criterion based on statistically significant SNP allele frequency differences was less discriminating than morphophysiological variation. Marker-based distinctness, which can be assessed at low cost and without interactions with testing conditions, could validly substitute for (or complement) morphophysiological distinctness in alfalfa cultivar registration schemes. It also has interest in sui generis registration systems aimed at

  11. Ariadne: a database search engine for identification and chemical analysis of RNA using tandem mass spectrometry data.

    Science.gov (United States)

    Nakayama, Hiroshi; Akiyama, Misaki; Taoka, Masato; Yamauchi, Yoshio; Nobe, Yuko; Ishikawa, Hideaki; Takahashi, Nobuhiro; Isobe, Toshiaki

    2009-04-01

    We present here a method to correlate tandem mass spectra of sample RNA nucleolytic fragments with an RNA nucleotide sequence in a DNA/RNA sequence database, thereby allowing tandem mass spectrometry (MS/MS)-based identification of RNA in biological samples. Ariadne, a unique web-based database search engine, identifies RNA by two probability-based evaluation steps of MS/MS data. In the first step, the software evaluates the matches between the masses of product ions generated by MS/MS of an RNase digest of sample RNA and those calculated from a candidate nucleotide sequence in a DNA/RNA sequence database, which then predicts the nucleotide sequences of these RNase fragments. In the second step, the candidate sequences are mapped for all RNA entries in the database, and each entry is scored for a function of occurrences of the candidate sequences to identify a particular RNA. Ariadne can also predict post-transcriptional modifications of RNA, such as methylation of nucleotide bases and/or ribose, by estimating mass shifts from the theoretical mass values. The method was validated with MS/MS data of RNase T1 digests of in vitro transcripts. It was applied successfully to identify an unknown RNA component in a tRNA mixture and to analyze post-transcriptional modification in yeast tRNA(Phe-1).
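    The two-step idea described above (digest, then map fragments back to database entries) can be sketched in a few lines. This is not Ariadne's algorithm: its scoring is probability-based and works from product-ion masses, whereas the sketch below only performs an in silico RNase T1 digest (cleavage on the 3' side of each G) and counts how many fragments occur in each entry. The database entries and names are invented for illustration.

```python
from typing import Dict, List


def rnase_t1_digest(rna: str) -> List[str]:
    """Split an RNA sequence after every G, as RNase T1 would."""
    fragments: List[str] = []
    current: List[str] = []
    for base in rna.upper():
        current.append(base)
        if base == "G":
            fragments.append("".join(current))
            current = []
    if current:  # trailing fragment that does not end in G
        fragments.append("".join(current))
    return fragments


def count_fragment_hits(fragments: List[str], database: Dict[str, str]) -> Dict[str, int]:
    """For each database entry, count how many fragments occur in it."""
    return {
        name: sum(1 for frag in fragments if frag in seq)
        for name, seq in database.items()
    }


sample = "AUGCCGUAAGCU"
fragments = rnase_t1_digest(sample)
print(fragments)  # ['AUG', 'CCG', 'UAAG', 'CU']

# Hypothetical two-entry database.
database = {"rna_A": "GGAUGCCGUAAGCUAA", "rna_B": "CCCUAAGUUU"}
print(count_fragment_hits(fragments, database))  # {'rna_A': 4, 'rna_B': 2}
```

    Short fragments such as "CU" match many entries by chance, which is one reason a real search engine weights matches probabilistically rather than counting them.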

  12. Agarose gel electrophoresis and polyacrylamide gel electrophoresis for visualization of simple sequence repeats.

    Science.gov (United States)

    Anderson, James; Wright, Drew; Meksem, Khalid

    2013-01-01

    In the modern age of genetic research there is a constant search for ways to improve the efficiency of plant selection. The most recent technology that can result in a highly efficient means of selection and still be done at a low cost is through plant selection directed by simple sequence repeats (SSRs or microsatellites). The molecular markers are used to select for certain desirable plant traits without relying on ambiguous phenotypic data. The best way to detect these is the use of gel electrophoresis. Gel electrophoresis is a common technique in laboratory settings which is used to separate deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) by size. Loading DNA and RNA onto gels allows for visualization of the size of fragments through the separation of DNA and RNA fragments. This is achieved through the use of the charge in the particles. As the fragments separate, they form into distinct bands at set sizes. We describe the ability to visualize SSRs on slab gels of agarose and polyacrylamide gel electrophoresis.

  13. A tandem sequence motif acts as a distance-dependent enhancer in a set of genes involved in translation by binding the proteins NonO and SFPQ

    Directory of Open Access Journals (Sweden)

    Roepcke Stefan

    2011-12-01

    Background: Bioinformatic analyses of expression control sequences in promoters of co-expressed or functionally related genes enable the discovery of common regulatory sequence motifs that might be involved in co-ordinated gene expression. By studying promoter sequences of the human ribosomal protein genes we recently identified a novel highly specific Localized Tandem Sequence Motif (LTSM). In this work we sought to identify additional genes and LTSM-binding proteins to elucidate potential regulatory mechanisms. Results: Genome-wide analyses identified a considerable number of additional LTSM-positive genes, the products of which are involved in translation, among them translation initiation and elongation factors and 5S rRNA. Electromobility shift assays then showed specific signals demonstrating the binding of protein complexes to LTSM in ribosomal protein gene promoters. Pull-down assays with LTSM-containing oligonucleotides and subsequent mass spectrometric analysis identified the related multifunctional nucleotide binding proteins NonO and SFPQ in the binding complex. Functional characterization then revealed that LTSM enhances the transcriptional activity of the promoters depending on the distance from the transcription start site. Conclusions: Our data demonstrate the power of bioinformatic analyses for the identification of biologically relevant sequence motifs. LTSM and the LTSM-binding proteins NonO and SFPQ found here were discovered through a synergistic combination of bioinformatic and biochemical methods and are regulators of the expression of a set of genes of the translational apparatus in a distance-dependent manner.

  14. Evaluation of Mammalian Interspersed Repeats to investigate the goat genome

    Directory of Open Access Journals (Sweden)

    P. Mariani

    2010-01-01

    Among the repeated sequences present in most eukaryotic genomes, SINEs (Short Interspersed Nuclear Elements) are widely used to investigate evolution in the mammalian order (Buchanan et al., 1999). One family of these repetitive sequences, the MIR (Mammalian Interspersed Repeats; Jurka et al., 1995), is ubiquitous in all mammals. MIR elements are tRNA-derived SINEs and are identifiable by a conserved core region of about 70 nucleotides.

  15. A β-solenoid model of the Pmel17 repeat domain: insights to the formation of functional amyloid fibrils

    Science.gov (United States)

    Louros, Nikolaos N.; Baltoumas, Fotis A.; Hamodrakas, Stavros J.; Iconomidou, Vassiliki A.

    2016-02-01

    Pmel17 is a multidomain protein involved in biosynthesis of melanin. This process is facilitated by the formation of Pmel17 amyloid fibrils that serve as a scaffold, important for pigment deposition in melanosomes. A specific luminal domain of human Pmel17, containing 10 tandem imperfect repeats, designated as repeat domain (RPT), forms amyloid fibrils in a pH-controlled mechanism in vitro and has been proposed to be essential for the formation of the fibrillar matrix. Currently, no three-dimensional structure has been resolved for the RPT domain of Pmel17. Here, we examine the structure of the RPT domain by performing sequence threading. The resulting model was subjected to energy minimization and validated through extensive molecular dynamics simulations. Structural analysis indicated that the RPT model exhibits several distinct properties of β-solenoid structures, which have been proposed to be polymerizing components of amyloid fibrils. The derived model is stabilized by an extensive network of hydrogen bonds generated by stacking of highly conserved polar residues of the RPT domain. Furthermore, the key role of invariant glutamate residues is proposed, supporting a pH-dependent mechanism for RPT domain assembly. Conclusively, our work attempts to provide structural insights into the RPT domain structure and to elucidate its contribution to Pmel17 amyloid fibril formation.

  16. Regulation of HFE expression by poly(ADP-ribose) polymerase-1 (PARP1) through an inverted repeat DNA sequence in the distal promoter.

    Science.gov (United States)

    Pelham, Christopher; Jimenez, Tamara; Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M Rafiq

    2013-12-01

    Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases of HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although its importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of an iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700 bp (-1675/+35) of the HFE promoter that is capable of forming a cruciform structure that binds PARP1 and strongly represses the HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatment increased HFE expression by reducing the nuclear PARP1 pool via its apoptosis-induced cleavage, leading to upregulation of mRNA for the iron regulatory hormone hepcidin. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism, as an increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. © 2013.

  17. Rhoptry-associated protein (rap-1) genes in the sheep pathogen Babesia sp. Xinjiang: Multiple transcribed copies differing by 3' end repeated sequences.

    Science.gov (United States)

    Niu, Qingli; Marchand, Jordan; Yang, Congshan; Bonsergent, Claire; Guan, Guiquan; Yin, Hong; Malandrin, Laurence

    2015-07-30

    Sheep babesiosis occurs mainly in tropical and subtropical areas. The sheep parasite Babesia sp. Xinjiang is widespread in China, and our goal is to characterize rap-1 (rhoptry-associated protein 1) gene diversity and expression as a first step of a long term goal aiming at developing a recombinant subunit vaccine. Seven different rap-1a genes were amplified in Babesia sp. Xinjiang, using degenerate primers designed from conserved motifs. Rap-1b and rap-1c gene types could not be identified. In all seven rap-1a genes, the 5' regions exhibited identical sequences over 936 nt, and the 3' regions differed at 28 positions over 147 nt, defining two types of genes designated α and β. The remaining 3' part varied from 72 to 360 nt in length, depending on the gene. This region consists of a succession of two to ten 36 nt repeats, which explains the size differences. Even if the nucleotide sequences varied, 6 repeats encoded the same stretch of amino acids. Transcription of at least four α and two β genes was demonstrated by standard RT-PCR. Copyright © 2015 Elsevier B.V. All rights reserved.
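    The size range quoted above is consistent with the repeat structure: two to ten copies of a 36 nt unit give 72 to 360 nt, and each unit spans 12 codons. The check below makes that arithmetic explicit. The function names are chosen for this example only.

```python
REPEAT_NT = 36          # length of one repeat unit, from the abstract
CODON_NT = 3            # nucleotides per codon


def repeat_region_length(copies: int) -> int:
    """Length in nt of a region made of `copies` tandem repeat units."""
    return copies * REPEAT_NT


def copies_from_length(length_nt: int) -> int:
    """Number of whole repeat units in a region of `length_nt` nt."""
    copies, remainder = divmod(length_nt, REPEAT_NT)
    if remainder:
        raise ValueError("length is not a whole number of repeat units")
    return copies


print(repeat_region_length(2))    # 72
print(repeat_region_length(10))   # 360
print(copies_from_length(360))    # 10
print(REPEAT_NT // CODON_NT)      # 12 amino acids per repeat unit
```

    Because 36 is a multiple of 3, adding or removing whole repeat units keeps the downstream reading frame intact.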

  18. R-loops: targets for nuclease cleavage and repeat instability.

    Science.gov (United States)

    Freudenreich, Catherine H

    2018-01-11

    R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.

  19. Germ-line CAG repeat instability causes extreme CAG repeat expansion with infantile-onset spinocerebellar ataxia type 2

    DEFF Research Database (Denmark)

    Vinther-Jensen, Tua; Ek, Jakob; Duno, Morten

    2013-01-01

    The spinocerebellar ataxias (SCA) are a genetically and clinically heterogeneous group of diseases, characterized by dominant inheritance, progressive cerebellar ataxia and diverse extracerebellar symptoms. A subgroup of the ataxias is caused by unstable CAG-repeat expansions in their respective ...... of paternal germ-line repeat sequence instability of the expanded SCA2 locus. European Journal of Human Genetics advance online publication, 10 October 2012; doi:10.1038/ejhg.2012.231.

  20. A family of DNA repeats in Aspergillus nidulans has assimilated degenerated retrotransposons

    DEFF Research Database (Denmark)

    Nielsen, M.L.; Hermansen, T.D.; Aleksenko, Alexei Y.

    2001-01-01

    In the course of a chromosomal walk towards the centromere of chromosome IV of Aspergillus nidulans, several cross-hybridizing genomic cosmid clones were isolated. Restriction mapping of two such clones revealed that their restriction patterns were similar in a region of at least 15 kb, indicating the presence of a large repeat. The nature of the repeat was further investigated by sequencing and Southern analysis. The study revealed a family of long dispersed repeats with a high degree of sequence similarity. The number and location of the repeats vary between wild isolates. Two copies of the repeat ...) phenomenon, first described in Neurospora crassa, may have operated in A. nidulans. The data indicate that this family of repeats has assimilated mobile elements that subsequently degenerated but then underwent further duplications as a part of the host repeats.

  1. Study of simple sequence repeat (SSR) polymorphism for biotic ...

    African Journals Online (AJOL)

    2013-10-02

    Oct 2, 2013 ... G. Siva Kumar1, K. Aruna Kumari1*, Ch. V. Durga Rani1, R. M. Sundaram2, S. Vanisree3, Md. ..... review by Jena and Mackill (2008) provided the list of .... repeat protein and is a member of a resistance gene cluster on rice.

  2. Mononucleotide repeats are asymmetrically distributed in fungal genes

    NARCIS (Netherlands)

    Passel, van M.W.J.; Graaff, de L.H.

    2008-01-01

    ABSTRACT: BACKGROUND: Systematic analyses of sequence features have resulted in a better characterisation of the organisation of the genome. A previous study in prokaryotes on the distribution of sequence repeats, which are notoriously variable and can disrupt the reading frame in genes, showed that

  3. An annotated genetic map of loblolly pine based on microsatellite and cDNA markers

    Science.gov (United States)

    Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective o...

  4. Tyms double (2R) and triple repeat (3R) confers risk for human oral squamous cell carcinoma.

    Science.gov (United States)

    Bezerra, Alexandre Medeiros; Sant'Ana, Thalita Araújo; Gomes, Adriana Vieira; de Lacerda Vidal, Aurora Karla; Muniz, Maria Tereza Cartaxo

    2014-12-01

    Oral cancer is responsible for approximately 3 % of cancer cases in Brazil. Epidemiological studies have associated low folate intake with an increased risk of epithelial cancers, including oral cancer. Folic acid has a key role in DNA synthesis, repair and methylation, and this is the basis of explanations for a putative role for folic acid in cancer prevention. The role of folic acid in carcinogenesis may be modulated by the C677T polymorphism in MTHFR and the 2R/3R tandem repeats in the promoter site of the TYMS gene, which are related to decreased enzymatic activity and to the quantity and availability of the enzyme, respectively. These events cause a decrease in DNA synthesis, repair and methylation, which can lead to a disruption in the expression of tumor suppressor genes such as TP53. The objective of this study was to investigate the distribution of the C677T polymorphism and the 2R/3R tandem repeats associated with the development of oral squamous cell carcinoma (OSCC). 53 paraffin-embedded samples from patients who underwent surgery but are no longer at the institution and 43 samples collected by oral exfoliation with a cytobrush were selected. 132 healthy subjects were selected by specialists at the dental clinics of the Faculdade de Odontologia de Pernambuco-FOP. The MTHFR genotyping was performed by PCR-RFLP, and the TYMS genotyping was performed by conventional PCR. Fisher's exact test was applied at a significance level of 5 %. Odds ratios (ORs) and 95 % confidence intervals (CIs) were used to measure the strength of association between genotype frequency and OSCC development. The results were statistically significant for the tandem repeats of the TYMS gene (p = 0.015). The TYMS 2R3R genotype was significantly associated with the development of OSCC (OR = 3.582; 95 % CI 1.240-10.348; p = 0.0262), as was the 3R3R genotype (OR = 3.553; 95 % CI 1.293-9.760; p = 0.0345). When analyzed together, the TYMS 2R3R + 3R3R genotypes also showed association (OR = 3.518; 95 % CI 11.188-10.348; p
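
    The odds ratios and 95 % confidence intervals reported above can be illustrated with a minimal sketch. The abstract does not say how the intervals were computed, so the Woolf (log-based) interval used here is an assumption, and the function name and the counts in the example are hypothetical rather than taken from the study.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval for a 2x2 table.

    a: cases carrying the genotype      b: cases not carrying it
    c: controls carrying the genotype   d: controls not carrying it
    z: normal quantile (1.96 gives an approximate 95 % interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Woolf interval")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts chosen only to exercise the function.
or_value, ci_low, ci_high = odds_ratio_ci(20, 10, 15, 30)
print(f"OR = {or_value:.3f}; 95 % CI {ci_low:.3f}-{ci_high:.3f}")
```

    Because the interval is built on the log scale, the point estimate always lies between the two bounds and is their geometric mean.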

  5. Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

    Science.gov (United States)

    Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

    2015-01-01

    Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 %, and 93.7 % of the sequenced clones contained microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic, with polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished, indicating the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioica, Momordica cochinchinensis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.
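
    As an illustration of the polymorphic information content (PIC) values mentioned above, the following sketch computes PIC from allele frequencies at one locus. The abstract does not state which formula was used; the widely cited Botstein et al. (1980) definition is assumed here, and the function name and example frequencies are hypothetical.

```python
from itertools import combinations


def polymorphic_information_content(frequencies):
    """PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    if abs(sum(frequencies) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in frequencies)
    # Correction term for uninformative matings between identical heterozygotes.
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(frequencies, 2))
    return 1.0 - homozygosity - correction


# Hypothetical locus with two equally frequent alleles.
print(polymorphic_information_content([0.5, 0.5]))
```

    A monomorphic locus gives a PIC of 0, and the value rises as more alleles segregate at comparable frequencies.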

  6. "Polymeromics": Mass spectrometry based strategies in polymer science toward complete sequencing approaches: a review.

    Science.gov (United States)

    Altuntaş, Esra; Schubert, Ulrich S

    2014-01-15

    Mass spectrometry (MS) is the most versatile and comprehensive method in "OMICS" sciences (i.e. in proteomics, genomics, metabolomics and lipidomics). The applications of MS and tandem MS (MS/MS or MS(n)) provide sequence information of the full complement of biological samples in order to understand the importance of the sequences on their precise and specific functions. Nowadays, the control of polymer sequences and their accurate characterization is one of the significant challenges of current polymer science. Therefore, a similar approach can be very beneficial for characterizing and understanding the complex structures of synthetic macromolecules. MS-based strategies allow a relatively precise examination of polymeric structures (e.g. their molar mass distributions, monomer units, side chain substituents, end-group functionalities, and copolymer compositions). Moreover, tandem MS offers accurate structural information from intricate macromolecular structures; however, it produces a vast amount of data to interpret. In "OMICS" sciences, the software used to interpret the obtained data has developed satisfactorily (e.g. in proteomics), because it is not possible to manually handle the amount of data acquired via (tandem) MS studies on biological samples. It can be expected that special software tools will improve the interpretation of (tandem) MS output from investigations of synthetic polymers as well. Eventually, the MS/MS field will also open up for polymer scientists who are not MS specialists. In this review, we dissect the overall framework of the MS and MS/MS analysis of synthetic polymers into its key components. We discuss the fundamentals of polymer analyses as well as recent advances in the areas of tandem mass spectrometry, software developments, and the overall future perspectives on the way to polymer sequencing, one of the last Holy Grails in polymer science. Copyright © 2013 Elsevier B.V. All rights reserved.

  7. Sequence protein identification by randomized sequence database and transcriptome mass spectrometry (SPIDER-TMS): from manual to automatic application of a 'de novo sequencing' approach.

    Science.gov (United States)

    Pascale, Raffaella; Grossi, Gerarda; Cruciani, Gabriele; Mecca, Giansalvatore; Santoro, Donatello; Sarli Calace, Renzo; Falabella, Patrizia; Bianco, Giuliana

    Sequence protein identification by a randomized sequence database and transcriptome mass spectrometry software package has been developed at the University of Basilicata in Potenza (Italy) and designed to facilitate the determination of the amino acid sequence of a peptide as well as an unequivocal identification of proteins in a high-throughput manner with enormous advantages of time, economical resource and expertise. The software package is a valid tool for the automation of a de novo sequencing approach, overcoming the main limits and a versatile platform useful in the proteomic field for an unequivocal identification of proteins, starting from tandem mass spectrometry data. The strength of this software is that it is a user-friendly and non-statistical approach, so protein identification can be considered unambiguous.

  8. The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

    Science.gov (United States)

    Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

    2017-01-01

    Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections have been created all over the world, but only a few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the 1960s, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 previously uncharacterized olive trees was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of which are present exclusively in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of the best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were split into the three main populations (East, Center and West Mediterranean), confirming that the collection is well representative of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection using both sets of markers, has demonstrated their splitting into four clusters, with a well-balanced membership obtained by EST with respect to standard SSRs. The new OLEST (Olea expressed sequence tags) SSR markers proved as effective as the best standard markers. The information obtained from this study represents a highly valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections

  9. Evolutionary dynamics and sites of illegitimate recombination revealed in the interspersion and sequence junctions of two nonhomologous satellite DNAs in cactophilic Drosophila species.

    Science.gov (United States)

    Kuhn, G C S; Teo, C H; Schwarzacher, T; Heslop-Harrison, J S

    2009-05-01

    Satellite DNA (satDNA) is a major component of genomes but relatively little is known about the fine-scale organization of unrelated satDNAs residing at the same chromosome location, and the sequence structure and dynamics of satDNA junctions. We studied the organization and sequence junctions of two nonhomologous satDNAs, pBuM and DBC-150, in three species from the neotropical Drosophila buzzatii cluster (repleta group). In situ hybridization to microchromosomes, interphase nuclei and extended DNA fibers showed frequent interspersion of the two satellites in D. gouveai, D. antonietae and, to a lesser extent, D. seriema. We isolated by PCR six pBuM x DBC-150 junctions: four are exclusive to D. gouveai and two are exclusive to D. antonietae. The six junction breakpoints occur at different positions within monomers, suggesting independent origin. Four junctions showed abrupt transitions between the two satellites, whereas two junctions showed a distinct 10 bp tandem duplication before the junction. Unlike pBuM, DBC-150 junction repeats are more variable than randomly cloned monomers and showed diagnostic features in common with a 3-monomer higher-order repeat seen in the sister species D. serido. The high levels of interspersion between pBuM and DBC-150 repeats suggest extensive rearrangements between the two satellites, possibly favored by specific features of the microchromosomes. Our interpretation is that the junctions evolved by multiple events of illegitimate recombination between nonhomologous satDNA repeats, with subsequent rounds of unequal crossing-over expanding the copy number of some of the junctions.

  10. C-terminal sequences of hsp70 and hsp90 as non-specific anchors for tetratricopeptide repeat (TPR) proteins.

    Science.gov (United States)

    Ramsey, Andrew J; Russell, Lance C; Chinkers, Michael

    2009-10-12

    Steroid-hormone-receptor maturation is a multi-step process that involves several TPR (tetratricopeptide repeat) proteins that bind to the maturation complex via the C-termini of hsp70 (heat-shock protein 70) and hsp90 (heat-shock protein 90). We produced a random T7 peptide library to investigate the roles played by the C-termini of the two heat-shock proteins in the TPR-hsp interactions. Surprisingly, phages with the MEEVD sequence, found at the C-terminus of hsp90, were not recovered from our biopanning experiments. However, two groups of phages were isolated that bound relatively tightly to HsPP5 (Homo sapiens protein phosphatase 5) TPR. Multiple copies of phages with a C-terminal sequence of LFG were isolated. These phages bound specifically to the TPR domain of HsPP5, although mutation studies produced no evidence that they bound to the domain's hsp90-binding groove. However, the most abundant family obtained in the initial screen had an aspartate residue at the C-terminus. Two members of this family with a C-terminal sequence of VD appeared to bind with approximately the same affinity as the hsp90 C-12 control. A second generation pseudo-random phage library produced a large number of phages with an LD C-terminus. These sequences acted as hsp70 analogues and had relatively low affinities for hsp90-specific TPR domains. Unfortunately, we failed to identify residues near hsp90's C-terminus that impart binding specificity to individual hsp90-TPR interactions. The results suggest that the C-terminal sequences of hsp70 and hsp90 act primarily as non-specific anchors for TPR proteins.

  11. Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Yuan Tong

    2010-01-01

    Background: Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 leucine-rich repeat receptor-like kinases (LRR-RLKs), representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated. Results: As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs) of 194 LRR-RLKs were cloned into the Gateway® donor vector pDONR/Zeo® and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into Gateway® destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC) at Ohio State University for full accessibility by the Arabidopsis research community. Conclusions: Most of the Arabidopsis LRR-RLK genes have been isolated, and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.

  12. Combined deficiency of MSH2 and Sμ region abolishes class switch recombination.

    Science.gov (United States)

    Leduc, Claire; Haddad, Dania; Laviolette-Malirat, Nathalie; Nguyen Huu, Ngoc-Sa; Khamlichi, Ahmed Amine

    2010-10-01

    Class switch recombination (CSR) is mediated by G-rich tandem repeated sequences termed switch regions. Transcription of switch regions generates single-stranded R loops that provide substrates for activation-induced cytidine deaminase. Mice deficient in MSH2 have a mild defect in CSR and analysis of their switch junctions has led to a model in which MSH2 is more critical for switch recombination events outside than within the tandem repeats. It is also known that deletion of the whole Sμ region severely impairs but does not abrogate CSR despite the lack of detectable R loops. Here, we demonstrate that deficiency of both MSH2 and the Sμ region completely abolishes CSR and that the abrogation occurs at the genomic level. This finding further supports the crucial role of MSH2 outside the tandem repeats. It also indicates that during CSR, MSH2 has access to activation-induced cytidine deaminase targets in R-loop-deficient Iμ-Cμ sequences rarely used in CSR, suggesting an MSH2-dependent DNA processing activity at the Iμ exon that may decrease with transcription elongation across the Sμ region.

  13. Karyological characterization and identification of four repetitive element groups (the 18S – 28S rRNA gene, telomeric sequences, microsatellite repeat motifs, Rex retroelements) of the Asian swamp eel (Monopterus albus)

    Science.gov (United States)

    Suntronpong, Aorarat; Thapana, Watcharaporn; Twilprawat, Panupon; Prakhongcheep, Ornjira; Somyong, Suthasinee; Muangmai, Narongrit; Peyachoknagul, Surin; Srikulnath, Kornsorn

    2017-01-01

    Among teleost fishes, the Asian swamp eel (Monopterus albus Zuiew, 1793) possesses the lowest chromosome number, 2n = 24. To characterize the chromosome constitution and investigate the genome organization of repetitive sequences in M. albus, karyotyping and chromosome mapping were performed with the 18S – 28S rRNA gene, telomeric repeats, microsatellite repeat motifs, and Rex retroelements. The 18S – 28S rRNA genes were localized to the pericentromeric region of chromosome 4 at the same position as large propidium iodide and C-positive bands, suggesting that the molecular structure of the pericentromeric regions of chromosome 4 has evolved in a concerted manner with amplification of the 18S – 28S rRNA genes. (TTAGGG)n sequences were found at the telomeric ends of all chromosomes. Eight of 19 microsatellite repeat motifs were dispersedly mapped on different chromosomes, suggesting the independent amplification of microsatellite repeat motifs in M. albus. Monopterus albus Rex1 (MALRex1) was observed at interstitial sites of all chromosomes and in the pericentromeric regions of most chromosomes, whereas MALRex3 was scattered and localized to all chromosomes and MALRex6 to several chromosomes. This suggests that these retroelements were independently amplified or lost in M. albus. Among MALRexs (MALRex1, MALRex3, and MALRex6), MALRex6 showed comparatively higher interspecific sequence divergence from other teleost species. This suggests that the divergence of Rex6 sequences of M. albus might have occurred a relatively long time ago. PMID:29093797

  14. Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals

    Directory of Open Access Journals (Sweden)

    Reneker Jeff

    2005-05-01

    Background: Searching for small tandem/disperse repetitive DNA sequences streamlines many biomedical research processes. For instance, whole genomic array analysis in yeast has revealed 22 PHO-regulated genes. The promoter regions of all but one of them contain at least one of the two core Pho4p binding sites, CACGTG and CACGTT. In humans, microsatellites play a role in a number of rare neurodegenerative diseases such as spinocerebellar ataxia type 1 (SCA1). SCA1 is a hereditary neurodegenerative disease caused by an expanded CAG repeat in the coding sequence of the gene. In bacterial pathogens, microsatellites are proposed to regulate expression of some virulence factors. For example, bacteria commonly generate intra-strain diversity through phase variation, which is strongly associated with virulence determinants. A recent analysis of the complete sequences of the Helicobacter pylori strains 26695 and J99 has identified 46 putative phase-variable genes among the two genomes through their association with homopolymeric tracts and dinucleotide repeats. Life scientists are increasingly interested in studying the function of small sequences of DNA. However, current search algorithms often generate thousands of matches, most of which are irrelevant to the researcher. Results: We present our hash function as well as our search algorithm to locate small sequences of DNA within multiple genomes. Our system applies information retrieval algorithms to discover knowledge of cross-species conservation of repeat sequences. We discuss our incorporation of the Gene Ontology (GO) database into these algorithms. We conduct an exhaustive time analysis of our system for various repetitive sequence lengths. For instance, a search for eight bases of sequence within 3.224 GBases on 49 different chromosomes takes 1.147 seconds on average. To illustrate the relevance of the search results, we conduct a search with and without added annotation terms for the
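
    The general idea of hashing short DNA words for fast lookup can be sketched as follows. This is not the authors' hash function, which the abstract does not describe; it is a generic 2-bit-per-base rolling encoding, and the function names and the toy sequence are hypothetical. The two motifs searched for are the Pho4p core binding sites named in the abstract.

```python
from collections import defaultdict

BASE_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}  # 2 bits per base


def build_kmer_index(sequence, k):
    """Map the integer encoding of every k-mer to its 0-based start positions."""
    mask = (1 << (2 * k)) - 1  # keeps only the last k bases
    index = defaultdict(list)
    value = 0
    run = 0  # consecutive unambiguous bases seen so far
    for pos, base in enumerate(sequence.upper()):
        code = BASE_CODE.get(base)
        if code is None:  # ambiguous base such as N: restart the window
            value, run = 0, 0
            continue
        value = ((value << 2) | code) & mask
        run += 1
        if run >= k:
            index[value].append(pos - k + 1)
    return index


def lookup(index, k, motif):
    """Return start positions of motif; its length must equal the index k."""
    if len(motif) != k:
        raise ValueError("motif length must match the k used to build the index")
    value = 0
    for base in motif.upper():
        value = (value << 2) | BASE_CODE[base]
    return index.get(value, [])


# Toy sequence (hypothetical) containing both Pho4p core sites.
toy = "TTCACGTGAACACGTTCACGTG"
toy_index = build_kmer_index(toy, 6)
print(lookup(toy_index, 6, "CACGTG"))
print(lookup(toy_index, 6, "CACGTT"))
```

    Building the index costs one pass over the sequence, after which each fixed-length query is a single dictionary lookup; this sketch makes no claim about the timing figures quoted in the abstract.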

  15. Determination of 4-Methylimidazole in Ammonia Caramel Using Gas Chromatography–Tandem Mass Spectrometry (GC-MS/MS)

    Directory of Open Access Journals (Sweden)

    Martyna N. Wieczorek

    2018-01-01

    One of the Maillard reaction products formed in the production of ammonia caramel is 4(5)-methylimidazole (4-MeI), which is classified as a carcinogen. A method for 4-MeI determination based on ion-pair extraction and derivatisation with isobutyl chloroformate, followed by gas chromatography-tandem mass spectrometry analysis, is proposed. Tandem mass spectrometry was applied to reduce the influence of the matrix and increase the selectivity and sensitivity of the method. A triple quadrupole GC-MS system was used for this study. The collision energies were optimized for MRM mode. The limits of detection (LOD) and quantification (LOQ) of the elaborated method were 17 and 37.8 μg kg−1, respectively; repeatability was <15% RSD for the analyzed caramel samples, and the recovery for 4-MeI was 101%. Comparison of MS/MS with SIM detection on the same instrument showed that tandem mass spectrometry achieved almost 30 times lower LODs than SIM. The described method can be routinely used for monitoring 4-MeI as a quality and safety marker in the production of ammonia caramel.

  16. Tissue identity testing of cancer by short tandem repeat polymorphism: pitfalls of interpretation in the presence of microsatellite instability.

    Science.gov (United States)

    Much, Melissa; Buza, Natalia; Hui, Pei

    2014-03-01

    Tissue identity testing by short tandem repeat (STR) polymorphism offers discriminating power in resolving tissue mix-up or contamination. However, one caveat is the presence of microsatellite unstable tumors, in which genetic alterations may drastically change the STR wild-type polymorphism, leading to unexpected allelic discordance. We examined how tissue identity testing results can be altered by the presence of microsatellite instability (MSI). Eleven cases of MSI-unstable (9 intestinal and 2 endometrial adenocarcinomas) and 10 cases of MSI-stable tumors (all colorectal adenocarcinomas) were included. All had been previously tested by polymerase chain reaction testing at 5 National Cancer Institute (NCI) recommended MSI loci and/or immunohistochemistry for DNA mismatch repair proteins (MLH1, MSH2, MSH6, and PMS2). Tissue identity testing targeting 15 STR loci was performed using AmpFlSTR Identifiler Amplification. Ten of 11 MSI-unstable tumors demonstrated novel alleles at 5 to 12 STR loci per case and frequently with 3 or more allelic peaks. However, all affected loci showed identifiable germline allele(s) in MSI-high tumors. A wild-type allelic profile was seen in 7 of 10 MSI-stable tumors. In the remaining 3 cases, isolated novel alleles were present at a unique single locus in addition to germline alleles. Loss of heterozygosity was observed frequently in both MSI-stable (6/11 cases) and MSI-unstable tumors (8/10 cases). In conclusion, MSI may significantly alter the wild-type allelic polymorphism, leading to potential interpretation errors of STR genotyping. Careful examination of the STR allelic pattern, high index of suspicion, and follow-up MSI testing are crucial to avoid erroneous conclusions and subsequent clinical and legal consequences. Copyright © 2014 Elsevier Inc. All rights reserved.
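
    The allelic patterns described above (germline alleles retained alongside novel alleles, or loss of a germline allele) can be made concrete with a small comparison routine. This is only an illustrative sketch: the function name, the status labels, and the allele calls in the example are hypothetical, and real interpretation depends on peak heights, stutter and laboratory thresholds that are not modelled here.

```python
def compare_str_profiles(reference, sample):
    """Classify each locus by comparing sample alleles with reference (germline) alleles.

    Both arguments map a locus name to an iterable of allele calls.
    """
    report = {}
    for locus, ref_alleles in reference.items():
        ref = set(ref_alleles)
        obs = set(sample.get(locus, ()))
        novel = obs - ref   # alleles seen only in the sample
        lost = ref - obs    # germline alleles missing from the sample
        if not obs:
            status = "no call"
        elif not novel and not lost:
            status = "match"
        elif novel and ref & obs:
            status = "novel allele(s), germline allele retained"
        elif novel:
            status = "discordant"
        else:
            status = "possible loss of heterozygosity"
        report[locus] = status
    return report


# Hypothetical allele calls at four commonly used STR loci.
germline = {"TH01": (6, 9.3), "TPOX": (8, 11), "D8S1179": (13, 14), "FGA": (21, 24)}
tumor = {"TH01": (6, 9.3), "TPOX": (8,), "D8S1179": (12, 13, 14), "FGA": (19, 20)}
for locus, status in compare_str_profiles(germline, tumor).items():
    print(locus, status)
```

    Separating "novel allele(s), germline allele retained" from "discordant" mirrors the abstract's point that extra peaks in an MSI-unstable tumor need not indicate a different individual, provided the germline alleles remain identifiable.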

  17. Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

    Science.gov (United States)

    Tetreault, Hannah M.; Ungerer, Mark C.

    2016-01-01

    The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667

  18. Loss of heterozygosity at thymidylate synthase locus in Barrett's metaplasia, dysplasia, and carcinoma sequences

    International Nuclear Information System (INIS)

    Kuramochi, Hidekazu; Uchida, Kazumi; Peters, Jeffery H; Shimizu, Daisuke; Vallbohmer, Daniel; Schneider, Sylke; Danenberg, Kathleen D; Danenberg, Peter V

    2009-01-01

    Thymidylate synthase (TS) is known to have a unique 28 bp tandemly repeated sequence in the promoter region, and the majority of subjects have a heterozygous double repeat/triple repeat genotype in their non-cancerous tissue. Loss of heterozygosity (LOH) at the TS locus is known to occur in cancer patients, but there is no evidence that it is present in precancerous tissue. The aim of this study was to analyze the frequency and timing of LOH at the TS locus in Barrett-associated adenocarcinoma (BA) and its precursory lesions, such as intestinal metaplasia (IM) and dysplasia. One hundred twenty-three samples (including 37 with gastroesophageal reflux disease (GERD), 29 with IM, 13 with dysplasia, and 44 with BA) were obtained from 100 patients. Biopsies were obtained from the lower esophageal mucosa/IM/dysplasia/BA, when available. Normal squamous tissue from the upper esophagus was taken as a control. All tissues were analyzed for the TS genotype and TS mRNA expression using the real-time reverse-transcription polymerase chain reaction (RT-PCR) method after laser-capture microdissection. Among the patients with an informative heterozygous genotype in their control samples, no sample with LOH at the TS locus was observed in the lower esophageal mucosa in GERD patients (0/22 samples). However, 6 out of 21 samples (28.6%) had LOH in IM, 2 of 7 (28.6%) in dysplasia, and 10 of 25 (40.0%) in BA. No significant difference in TS mRNA expression levels was observed between TS genotypes. Our results demonstrate that LOH is a relatively frequent and early event in the IM-BA sequence.

  19. Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

    Science.gov (United States)

    Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

    2012-01-01

    Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604

  20. Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae.

    Science.gov (United States)

    Redwan, R M; Saidin, A; Kumar, S V

    2015-08-12

    Pineapple (Ananas comosus var. comosus) is known as the king of fruits for its crown and is the third most important tropical fruit after banana and citrus. The plant, which is indigenous to South America, is the most important species in the Bromeliaceae family and is largely traded for fresh fruit consumption. Here, we report the complete chloroplast sequence of the MD-2 pineapple, which was sequenced using PacBio sequencing technology. In this study, the high error rate of the long PacBio reads of A. comosus total genomic DNA was reduced by using highly accurate but short Illumina reads for error correction via the latest error-correction module from Novocraft. The error-corrected long PacBio reads were assembled using a single tool to produce a contig representing the pineapple chloroplast genome. The genome, 159,636 bp in length, has the conserved quadripartite structure of chloroplast genomes, containing a large single copy region (LSC) of 87,482 bp, a small single copy region (SSC) of 18,622 bp and two inverted repeat regions (IRA and IRB) of 26,766 bp each. Overall, the genome contained 117 unique coding regions and 30 were repeated in the IR region, with gene content, structure and arrangement similar to those of its sister taxon, Typha latifolia. A total of 35 repeat structures were detected in both the coding and non-coding regions, the majority being tandem repeats. In addition, 205 SSRs were detected in the genome, with six protein-coding genes containing more than two SSRs. Comparison of chloroplast genomes from the subclass Commelinidae revealed a conserved protein-coding gene, albeit located in a highly divergent region. Analysis of selection pressure on protein-coding genes using the Ka/Ks ratio showed significant positive selection exerted on the rps7 gene of the pineapple chloroplast with P less than 0.05. Phylogenetic analysis confirmed the recent taxonomical relation among the member of
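
    The region sizes reported in this abstract can be checked against the stated genome length: in a quadripartite plastome the total is the LSC plus the SSC plus two copies of the inverted repeat. A minimal Python sketch, with names chosen here for illustration:

```python
# Region sizes in bp, as reported in the abstract.
LSC = 87_482   # large single copy region
SSC = 18_622   # small single copy region
IR = 26_766    # each of the two inverted repeats (IRA and IRB)

def quadripartite_length(lsc, ssc, ir):
    """Total plastome length: LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * ir

total = quadripartite_length(LSC, SSC, IR)
print(total)  # 159636, matching the reported 159,636 bp
```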

  1. TANDEM

    Data.gov (United States)

    Federal Laboratory Consortium — The Tandem Van de Graaff facility provides researchers with beams of more than 40 different types of ions - atoms that have been stripped of their electrons. One of...

  2. [Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

    Science.gov (United States)

    Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

    2009-11-01

    Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to analyze, genome-wide, the CRISPRs in extremely halophilic archaea for which whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were highly conserved, and that two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and that the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.

  3. Human minisatellite alleles detectable only after PCR amplification.

    Science.gov (United States)

    Armour, J A; Crosier, M; Jeffreys, A J

    1992-01-01

    We present evidence that a proportion of alleles at two human minisatellite loci is undetected by standard Southern blot hybridization. In each case the missing allele(s) can be identified after PCR amplification and correspond to tandem arrays too short to detect by hybridization. At one locus, there is only one undetected allele (population frequency 0.3), which contains just three repeat units. At the second locus, there are at least five undetected alleles (total population frequency 0.9) containing 60-120 repeats; they are not detected because these tandem repeats give very poor signals when used as a probe in standard Southern blot hybridization, and also cross-hybridize with other sequences in the genome. Under these circumstances only signals from the longest tandemly repeated alleles are detectable above the nonspecific background. The structures of these loci have been compared in human and primate DNA, and at one locus the short human allele containing three repeat units is shown to be an intermediate state in the expansion of a monomeric precursor allele in primates to high copy number in the longer human arrays. We discuss the implications of such loci for studies of human populations, minisatellite isolation by cloning, and the evolution of highly variable tandem arrays.

  4. Staphylococcus aureus from 152 cases of bovine, ovine and caprine mastitis investigated by Multiple-locus variable number of tandem repeat analysis (MLVA).

    Science.gov (United States)

    Bergonier, Dominique; Sobral, Daniel; Feßler, Andrea T; Jacquet, Eric; Gilbert, Florence B; Schwarz, Stefan; Treilles, Michaël; Bouloc, Philippe; Pourcel, Christine; Vergnaud, Gilles

    2014-10-02

    Staphylococcus aureus is one of the main etiological agents of mastitis in ruminants. In the present retrospective study, we evaluated the potential interest of a previously described automated multiple loci Variable Number of Tandem Repeats (VNTR) Assay (MLVA) comprising 16 loci as a first line tool to investigate the population structure of S. aureus from mastitis. We determined the genetic diversity of S. aureus strains from cases of clinical and subclinical mastitis in dairy cattle (n = 118, of which 16 were methicillin-resistant), sheep (n = 18) and goats (n = 16). The 152 strains could be subdivided into 115 MLVA genotypes (including 14 genotypes for the ovine strains and 15 genotypes for the caprine strains). This corresponds to a discriminatory index (D) value of 0.9936. Comparison with published MLVA data obtained using the same protocol applied to strains from diverse human and animal origins revealed a low number (8.5%) of human-related MLVA genotypes among the present collection. Eighteen percent of the S. aureus mastitis collection belonged to clonal complexes apparently not associated with other pathological conditions. Some of them displayed a relatively low level of diversity in agreement with a restricted ecological niche. These findings provide arguments suggesting that specific S. aureus lineages particularly adapted to ruminant mammary glands have emerged and that MLVA is a convenient tool to provide a broad overview of the population, owing to the availability via internet of databases compiling published MLVA genotypes.
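
    The abstract does not say how the discriminatory index D was computed. Assuming it is the Hunter-Gaston index commonly used in typing studies, D = 1 - [1 / (N(N - 1))] * sum over genotypes of n_j(n_j - 1), where N is the number of strains and n_j the number of strains with genotype j, it could be sketched as below. The toy genotype list is invented for illustration; the per-genotype counts behind the reported 0.9936 are not given in the abstract.

```python
from collections import Counter

def hunter_gaston_index(genotypes):
    """Hunter-Gaston discriminatory index for a list of genotype labels.

    Assumption: this is the index meant by "D" in the abstract.
    """
    counts = Counter(genotypes).values()
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two strains")
    same_type_pairs = sum(c * (c - 1) for c in counts)
    return 1.0 - same_type_pairs / (n * (n - 1))

# Hypothetical toy data: five strains, two of which share genotype "A".
toy = ["A", "A", "B", "C", "D"]
print(round(hunter_gaston_index(toy), 4))
```

    A value of 1 means every strain has its own genotype, and 0 means all strains share one.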

  5. Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

    Energy Technology Data Exchange (ETDEWEB)

    Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others

    1994-09-01

    Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3′ UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated in DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3′ UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative nucleotide sequence analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3′ UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting the view that this region has important regulatory functions.
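
    The abstract reports a PIC value without giving the formula or the allele frequencies. Assuming the usual definition for a codominant marker, PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, a minimal Python sketch looks like this. The example frequencies are hypothetical and are not the primate data:

```python
def pic(freqs):
    """Polymorphism information content from allele frequencies.

    Assumes the standard codominant-marker definition; the abstract
    itself does not state which formula was used.
    """
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1.0 - homozygosity - cross_term

# Hypothetical example: two equally frequent alleles.
print(pic([0.5, 0.5]))  # 0.375
```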

  6. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

    Directory of Open Access Journals (Sweden)

    Vergnaud Gilles

    2007-05-01

    Background: In Archaea and Bacteria, the repeated elements called CRISPRs, for "clustered regularly interspaced short palindromic repeats", are believed to participate in the defence against viruses. Short sequences called spacers are stored in between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered about the way CRISPRs are created and evolve. As new genome sequences become available, it appears necessary to develop automated scanning tools to make CRISPR-related information available and to facilitate additional investigations. Description: We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search of similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archaea out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion: It is hoped that the availability of a public database that is regularly updated and can be queried on the web will help in further dissecting and understanding CRISPR structure and the evolution of flanking sequences. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the

  7. De Novo Centromere Formation and Centromeric Sequence Expansion in Wheat and its Wide Hybrids.

    Directory of Open Access Journals (Sweden)

    Xiang Guo

    2016-04-01

    Centromeres typically contain tandem repeat sequences, but centromere function does not necessarily depend on these sequences. We identified functional centromeres with significant quantitative changes in the centromeric retrotransposons of wheat (CRW) contents in wheat aneuploids (Triticum aestivum) and the offspring of wheat wide hybrids. The CRW signals were strongly reduced or essentially lost in some wheat ditelosomic lines and in the addition lines from the wide hybrids. The total loss of the CRW sequences but the presence of CENH3 in these lines suggests that the centromeres were formed de novo. In wheat and its wide hybrids, which carry large complex genomes or no sequenced genome, we performed CENH3-ChIP-dot-blot methods alone or in combination with CENH3-ChIP-seq and identified the ectopic genomic sequences present at the new centromeres. In addition, the transcription of the identified DNA sequences was remarkably increased at the new centromere, suggesting that the transcription of the corresponding sequences may be associated with de novo centromere formation. Stable alien chromosomes with two and three regions containing CRW sequences induced by centromere breakage were observed in the wheat-Th. elongatum hybrid derivatives, but only one was a functional centromere. In wheat-rye (Secale cereale) hybrids, the rye centromere-specific sequences spread along the chromosome arms and may have caused centromere expansion. Frequent and significant quantitative alterations in the centromere sequence via chromosomal rearrangement have been systematically described in wheat wide hybridizations, which may affect the retention or loss of the alien chromosomes in the hybrids. Thus, the centromere behavior in wide crosses likely has an important impact on the generation of biodiversity, which ultimately has implications for speciation.

  8. Potential measurements in tandem mirrors

    International Nuclear Information System (INIS)

    Glowienka, J.C.

    1985-11-01

    The US mirror program has begun conducting experiments with a thermal barrier tandem mirror configuration. This configuration requires a specific axial potential profile and implies measurements of potential for documentation and optimization of the configuration. This report briefly outlines the motivation for the thermal barrier tandem mirror and then outlines the techniques used to document the potential profile in conventional and thermal barrier tandem mirrors. Examples of typical data sets from the world's major tandem mirror experiments, TMX and TMX-U at Lawrence Livermore National Laboratory (LLNL) and Gamma 10 at Tsukuba University in Japan, and the current interpretation of the data are discussed together with plans for the future improvement of measurements of plasma potential.

  9. Microsatellites in varied arenas of research

    Directory of Open Access Journals (Sweden)

    K S Remya

    2010-01-01

    Microsatellites, known as simple-sequence repeats (SSRs) or short-tandem repeats (STRs), represent specific sequences of DNA consisting of tandemly repeated units of one to six nucleotides. The repetitive nature of microsatellites makes them particularly prone to grow or shrink in length, and these changes can have both good and bad consequences for the organisms that possess them. They are responsible for various neurological diseases, and hence the same cause is now utilized for the early detection of various diseases, such as schizophrenia and bipolar disorder, congenital generalized hypertrichosis, asthma, and bronchial hyperresponsiveness. These agents are widely used for forensic identification and relatedness testing, and are predominant genetic markers in this area of application. The application of microsatellites is an extending web and covers varied fields of science, such as conservation biology, plant genetics, and population studies. At present, research is progressing around the globe to extend the use of these genetic repeaters to unmask the hidden genetic secrets behind the creation of the world.
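
    The definition above (tandemly repeated units of one to six nucleotides) maps onto a simple pattern search. The following Python sketch finds perfect repeats only and is not a tool described in the article; the function name, the minimum of three copies and the test sequence are all assumptions made for illustration:

```python
import re

def find_ssrs(seq, min_unit=1, max_unit=6, min_repeats=3):
    """Return (start, unit, copies) for perfect tandem repeats in seq.

    The lazy quantifier makes the regex report the shortest repeating
    unit, so "ATATATAT" is reported as (AT)4 rather than (ATAT)2.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        copies = len(m.group(0)) // len(unit)
        hits.append((m.start(), unit, copies))
    return hits

# Hypothetical sequence with four copies of AGC flanked by TT.
print(find_ssrs("TTAGCAGCAGCAGCTT"))  # [(2, 'AGC', 4)]
```

    Real microsatellite scanners also tolerate imperfect repeats and usually apply different copy-number thresholds per unit length, which this sketch does not attempt.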

  10. The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

    Directory of Open Access Journals (Sweden)

    Turmel Monique

    2007-07-01

    Background: In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results: The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free-standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion: Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate

  11. Interpretation of Tandem Mass Spectrometry (MSMS) Spectra for Peptide Analysis

    DEFF Research Database (Denmark)

    Hjernø, Karin; Højrup, Peter

    2015-01-01

    The aim of this chapter is to give a short introduction to peptide analysis by mass spectrometry (MS) and interpretation of fragment mass spectra. Through examples and guidelines we demonstrate how to understand and validate search results and how to perform de novo sequencing based on the often very complex fragmentation pattern obtained by tandem mass spectrometry (also referred to as MSMS). The focus is on simple rules for interpretation of MSMS spectra of tryptic as well as non-tryptic peptides.

  12. Characterization and expression of the maize β-carbonic anhydrase gene repeat regions.

    Science.gov (United States)

    Tems, Ursula; Burnell, James N

    2010-12-01

    In maize, carbonic anhydrase (CA; EC 4.2.1.1) catalyzes the first reaction of the C4 photosynthetic pathway; it catalyzes the hydration of CO2 to bicarbonate and provides an inorganic carbon source for the primary carboxylation reaction catalyzed by phosphoenolpyruvate (PEP) carboxylase. The β-CA isozymes from maize, as well as other agronomically important NADP-malic enzyme (NADP-ME) type C4 crops, have remained relatively uncharacterized but differ significantly from the β-CAs of other C4 monocot species primarily due to transcript length and the presence of repeat sequences. This research confirmed earlier findings of repeat sequences in maize CA transcripts, and demonstrated that the gene encoding these transcripts is also composed of repeat sequences. One of the maize CA genes was sequenced and found to encode two domains, with distinct groups of exons corresponding to the repeat regions of the transcript. We have also shown that expression of a single repeat region of the CA transcript produced active enzyme that associated as a dimer and was composed primarily of α-helices, consistent with that observed for other plant CAs. As the presence of repeat regions in the CA gene is unique to NADP-ME type C4 monocot species, the implications of these findings in the context of the evolution of the location and function of this C4 pathway enzyme are strongly suggestive of CA gene duplication resulting in an evolutionary advantage and a higher photosynthetic efficiency.

  13. Sequence analysis of two alleles reveals that intra- and intergenic recombination played a role in the evolution of the radish fertility restorer (Rfo)

    Directory of Open Access Journals (Sweden)

    Budar Françoise

    2010-02-01

    Background: Land plant genomes contain multiple members of a eukaryote-specific gene family encoding proteins with pentatricopeptide repeat (PPR) motifs. Some PPR proteins were shown to participate in post-transcriptional events involved in organellar gene expression, and this type of function is now thought to be their main biological role. Among PPR genes, restorers of fertility (Rf) of cytoplasmic male sterility systems constitute a peculiar subgroup that is thought to evolve in response to the presence of mitochondrial sterility-inducing genes. Rf genes encoding PPR proteins are associated with very close relatives on complex loci. Results: We sequenced a non-restoring allele (L7rfo) of the Rfo radish locus whose restoring allele (D81Rfo) was previously described, and compared the two alleles and their PPR genes. We identified a ca. 13 kb long fragment, likely originating from another part of the radish genome, inserted into the L7rfo sequence. The L7rfo allele carries two genes (PPR-1 and PPR-2) closely related to the three previously described PPR genes of the restorer D81Rfo allele (PPR-A, PPR-B, and PPR-C). Our results indicate that alleles of the Rfo locus have experienced complex evolutionary events, including recombination and insertion of extra-locus sequences, since they diverged. Our analyses strongly suggest that present coding sequences of Rfo PPR genes result from intragenic recombination. We found that the 10 C-terminal PPR repeats in the proteins encoded by Rfo PPR genes result from the tandem duplication of a 5 PPR repeat block. Conclusions: The Rfo locus appears to experience more complex evolution than its flanking sequences. The Rfo locus and PPR genes therein are likely to evolve as a result of intergenic and intragenic recombination. It is therefore not possible to determine which genes on the two alleles are direct orthologs. Our observations recall some previously reported data on pathogen resistance complex loci.

  14. Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

    International Nuclear Information System (INIS)

    Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

    1987-01-01

    Apolipoprotein(a) [apo(a)] is a glycoprotein with M_r ∼ 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain.

  15. Loss and recovery of Arabidopsis-type telomere repeat sequences 5'-(TTTAGGG)(n)-3' in the evolution of a major radiation of flowering plants.

    OpenAIRE

    Adams, S. P.; Hartman, T. P.; Lim, K. Y.; Chase, M. W.; Bennett, M. D.; Leitch, I. J.; Leitch, A. R.

    2001-01-01

    Fluorescent in situ hybridization and Southern blotting were used for showing the predominant absence of the Arabidopsis-type telomere repeat sequence (TRS) 5'-(TTTAGGG)(n)-3' (the 'typical' telomere) in a monocot clade which comprises up to 6300 species within Asparagales. Initially, two apparently disparate genera that lacked the typical telomere were identified. Here, we used the new angiosperm phylogenetic classification for predicting in which other related families such telomeres might ...

  16. Comparative analysis of ADS gene promoter in seven Artemisia ...

    Indian Academy of Sciences (India)

    ... were more in the high artemisinin producer species, A. annua, than the other species. We have reported that the light-responsive elements, W-box, CAAT-box, 5′-UTR py-rich stretch, TATA-box sequence and tandem repeat sequences have been identified as important factors in the increased expression of ADS gene.

  17. ASAP: Amplification, sequencing & annotation of plastomes

    Directory of Open Access Journals (Sweden)

    Folta Kevin M

    2005-12-01

    Background: Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA) is being used currently to amplify the chloroplast genome from purified chloroplast DNA, and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results: 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and

  18. Attention deficit/hyperactivity disorder children with a 7-repeat allele of the dopamine receptor D4 gene have extreme behavior but normal performance on critical neuropsychological tests of attention

    NARCIS (Netherlands)

    Swanson, J.; Oosterlaan, J.; Murias, M.; Schuck, S.; Flodman, P.; Spence, M.A.; Wasdell, M.; Ding, Y.; Chi, H-C.; Smith, M.; Mann, M.; Carlson, C.; Kennedy, J.L.; Sergeant, J.A.; Leung, P.; Zhang, Y-P.; Sadeh, A.; Chan, C.; Whalen, C.K.; Babb, K.; Moyzis, R.; Posner, M.I.

    2000-01-01

    An association of the dopamine receptor D4 (DRD4) gene located on chromosome 11p15.5 and attention deficit/hyperactivity disorder (ADHD) has been demonstrated and replicated by multiple investigators. A specific allele [the 7-repeat of a 48-bp variable number of tandem repeats (VNTR) in exon 3] has

  19. Solution Structure of the Tandem Acyl Carrier Protein Domains from a Polyunsaturated Fatty Acid Synthase Reveals Beads-on-a-String Configuration

    KAUST Repository

    Trujillo, Uldaeliz

    2013-02-28

    The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of multiple ACP

  20. Solution structure of the tandem acyl carrier protein domains from a polyunsaturated fatty acid synthase reveals beads-on-a-string configuration.

    Directory of Open Access Journals (Sweden)

    Uldaeliz Trujillo

    The polyunsaturated fatty acid (PUFA) synthases from deep-sea bacteria invariably contain multiple acyl carrier protein (ACP) domains in tandem. This conserved tandem arrangement has been implicated in both amplification of fatty acid production (additive effect) and in structural stabilization of the multidomain protein (synergistic effect). While the more accepted model is one in which domains act independently, recent reports suggest that ACP domains may form higher oligomers. Elucidating the three-dimensional structure of tandem arrangements may therefore give important insights into the functional relevance of these structures, and hence guide bioengineering strategies. In an effort to elucidate the three-dimensional structure of tandem repeats from deep-sea anaerobic bacteria, we have expressed and purified a fragment consisting of five tandem ACP domains from the PUFA synthase from Photobacterium profundum. Analysis of the tandem ACP fragment by analytical gel filtration chromatography showed a retention time suggestive of a multimeric protein. However, small angle X-ray scattering (SAXS) revealed that the multi-ACP fragment is an elongated monomer which does not form a globular unit. Stokes radii calculated from atomic monomeric SAXS models were comparable to those measured by analytical gel filtration chromatography, showing that in the gel filtration experiment, the molecular weight was overestimated due to the elongated protein shape. Thermal denaturation monitored by circular dichroism showed that unfolding of the tandem construct was not cooperative, and that the tandem arrangement did not stabilize the protein. Taken together, these data are consistent with an elongated beads-on-a-string arrangement of the tandem ACP domains in PUFA synthases, and speak against synergistic biocatalytic effects promoted by quaternary structuring. Thus, it is possible to envision bioengineering strategies which simply involve the artificial linking of