repeat ltr sequences: Topics by WorldWideScience.org

Sample records for repeat ltr sequences

The effects of multiple UV exposures on HIV-LTR (long terminal repeat) expression

International Nuclear Information System (INIS)

Schreck, S.; Milton, J.; Panozzo, J.; Libertin, C.R.; Woloschak, G.E.; Loyola Univ., Maywood, IL

1995-01-01

Previous studies have shown that cellular stress agents such as UV radiation induce transcription from the long terminal repeat (LTR) of the human immunodeficiency virus (HIV). Using HeLa cells stably transfected with the HIV-LTR sequence, which transcriptionally drives the chloramphenicol acetyl transferase (CAT) reporter gene, we examined the effects of multiple exposures to UVC (254 nm) on HIV-LTR-CAT expression. Low doses (≤ 5 J m -2 ) had no effect on CAT expression, but up to 29-fold induction was observed with 10 J m -2 when cells were harvested 48 h after completion of the exposure. Little difference was noted in induction levels when cells were exposed to one 25 J m -2 dose, viable cells were harvested at 24 h, 48 h or 72 h, and cell lysates were assayed for CAT expression. Two sequential 12.5 J m -2 exposures, given 24 h apart, resulted in an additive effect on CAT expression; these two exposures produced CAT activity equivalent to that induced following a single 25 J m -2 dose. Our data suggest that HIV-LTR requires a specific threshold UV dose in order to elicit induction; a maximal induction dose is also evident; exposures higher than this maximal dose contribute no more to HIV-LTR induction in viable cells. (author)
Cellular specificity of HIV-1 replication can be controlled by LTR sequences

International Nuclear Information System (INIS)

Reed-Inderbitzin, Edward; Maury, Wendy

2003-01-01

Two well-established determinants of retroviral tropism are envelope sequences that regulate entry and LTR sequences that can regulate viral expression in a cell-specific manner. Studies with human immunodeficiency virus-1 (HIV-1) have demonstrated that tropism of this virus maps primarily to variable envelope sequences. Studies have demonstrated that T cell and macrophage-specific transcription factor binding motifs exist in the upstream region of the LTR U3; however, the ability of the core enhancer/promoter proximal elements (two NF-κB and three Sp1 sites) to function well in macrophages and T cells have led many to conclude that HIV LTR sequences are not primary determinants of HIV tropism. To determine if cellular specificity could be imparted to HIV by the core enhancer elements, the enhancer/promoter proximal region of the HIV LTR was substituted with motifs that control gene expression in a myeloid-specific manner. The enhancer region from equine infectious anemia virus (EIAV) when substituted for the HIV enhancer/promoter proximal region was found to drive expression in a macrophage-specific manner and was responsive to HIV Tat. The addition of a 5' methylation-dependent binding site (MDBP) and a promoter proximal Sp1 motif increased expression without altering cellular specificity. Spacing between the promoter proximal region and the TATA box was also found to influence LTR activity. Infectivity studies using chimeric LTRs within the context of a dual-tropic infectious molecular clone established that these LTRs directed HIV replication and production of infectious virions in macrophages but not primary T cells or T cell lines. This investigation demonstrates that cellular specificity can be imparted onto HIV-1 replication at the level of viral transcription and not entry
Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

Science.gov (United States)

Tetreault, Hannah M.; Ungerer, Mark C.

2016-01-01

The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667
LTR retrotransposon landscape in Medicago truncatula: more rapid removal than in rice

Directory of Open Access Journals (Sweden)

Liu Jin-Song

2008-08-01

Full Text Available Abstract Background Long terminal repeat retrotransposons (LTR elements are ubiquitous Eukaryotic TEs that transpose through RNA intermediates. Accounting for significant proportion of many plant genomes, LTR elements have been well established as one of the major forces underlying the evolution of plant genome size, structure and function. The accessibility of more than 40% of genomic sequences of the model legume Medicago truncatula (Mt has made the comprehensive study of its LTR elements possible. Results We use a newly developed tool LTR_FINDER to identify LTR retrotransposons in the Mt genome and detect 526 full-length elements as well as a great number of copies related to them. These elements constitute about 9.6% of currently available genomic sequences. They are classified into 85 families of which 64 are reported for the first time. The majority of the LTR retrotransposons belong to either Copia or Gypsy superfamily and the others are categorized as TRIMs or LARDs by their length. We find that the copy-number of Copia-like families is 3 times more than that of Gypsy-like ones but the latter contribute more to the genome. The analysis of PBS and protein-coding domain structure of the LTR families reveals that they tend to use only 4–5 types of tRNAs and many families have quite conservative ORFs besides known TE domains. For several important families, we describe in detail their abundance, conservation, insertion time and structure. We investigate the amplification-deletion pattern of the elements and find that the detectable full-length elements are relatively young and most of them were inserted within the last 0.52 MY. We also estimate that more than ten million bp of the Mt genomic sequences have been removed by the deletion of LTR elements and the removal of the full-length structures in Mt has been more rapid than in rice. Conclusion This report is the first comprehensive description and analysis of LTR retrotransposons in the
Identification of a non-LTR retrotransposon from the gypsy moth

Science.gov (United States)

K.J. Garner; J.M. Slavicek

1999-01-01

A family of highly repetitive elements, named LDT1, has been identified in the gypsy moth, Lymantria dispar. The complete element is 5.4 kb in length and lacks long-terminal repeats, The element contains two open reading frames with a significant amino acid sequence similarity to several non-LTR retrotransposons. The first open reading frame contains...
Regulation of FeLV-945 by c-Myb binding and CBP recruitment to the LTR

Directory of Open Access Journals (Sweden)

Finstad Samantha L

2004-09-01

Full Text Available Abstract Background Feline leukemia virus (FeLV induces degenerative, proliferative and malignant hematologic disorders in its natural host, the domestic cat. FeLV-945 is a viral variant identified as predominant in a cohort of naturally infected animals. FeLV-945 contains a unique sequence motif in the long terminal repeat (LTR comprised of a single copy of transcriptional enhancer followed by a 21-bp sequence triplicated in tandem. The LTR is precisely conserved among independent cases of multicentric lymphoma, myeloproliferative disease and anemia in animals from the cohort. The 21-bp triplication was previously shown to act as a transcriptional enhancer preferentially in hematopoietic cells and to confer a replicative advantage. The objective of the present study was to examine the molecular mechanism by which the 21-bp triplication exerts its influence and the selective advantage responsible for its precise conservation. Results Potential binding sites for the transcription factor, c-Myb, were identified across the repeat junctions of the 21-bp triplication. Such sites would not occur in the absence of the repeat; thus, a requirement for c-Myb binding to the repeat junctions of the triplication would exert a selective pressure to conserve its sequence precisely. Electrophoretic mobility shift assays demonstrated specific binding of c-Myb to the 21-bp triplication. Reporter gene assays showed that the triplication-containing LTR is responsive to c-Myb, and that responsiveness requires the presence of both c-Myb binding sites. Results further indicated that c-Myb in complex with the 21-bp triplication recruits the transcriptional co-activator, CBP, a regulator of normal hematopoiesis. FeLV-945 replication was shown to be positively regulated by CBP in a manner dependent on the presence of the 21-bp triplication. Conclusion Binding sites for c-Myb across the repeat junctions of the 21-bp triplication may account for its precise conservation in
Regulatory elements involved in tax-mediated transactivation of the HTLV-I LTR.

Science.gov (United States)

Seeler, J S; Muchardt, C; Podar, M; Gaynor, R B

1993-10-01

HTLV-I is the etiologic agent of adult T-cell leukemia. In this study, we investigated the regulatory elements and cellular transcription factors which function in modulating HTLV-I gene expression in response to the viral transactivator protein, tax. Transfection experiments into Jurkat cells of a variety of site-directed mutants in the HTLV-1 LTR indicated that each of the three motifs A, B, and C within the 21-bp repeats, the binding sites for the Ets family of proteins, and the TATA box all influenced the degree of tax-mediated activation. Tax is also able to activate gene expression of other viral and cellular promoters. Tax activation of the IL-2 receptor and the HIV-1 LTR is mediated through NF-kappa B motifs. Interestingly, sequences in the 21-bp repeat B and C motifs contain significant homology with NF-kappa B regulatory elements. We demonstrated that an NF-kappa B binding protein, PRDII-BF1, but not the rel protein, bound to the B and C motifs in the 21-bp repeat. PRDII-BF1 was also able to stimulate activation of HTLV-I gene expression by tax. The role of the Ets proteins on modulating tax activation was also studied. Ets 1 but not Ets 2 was capable of increasing the degree of tax activation of the HTLV-I LTR. These results suggest that tax activates gene expression by either direct or indirect interaction with several cellular transcription factors that bind to the HTLV-I LTR.
Genome-wide analysis of LTR-retrotransposons in oil palm.

Science.gov (United States)

Beulé, Thierry; Agbessi, Mawussé Dt; Dussert, Stephane; Jaligot, Estelle; Guyot, Romain

2015-10-15

The oil palm (Elaeis guineensis Jacq.) is a major cultivated crop and the world's largest source of edible vegetable oil. The genus Elaeis comprises two species E. guineensis, the commercial African oil palm and E. oleifera, which is used in oil palm genetic breeding. The recent publication of both the African oil palm genome assembly and the first draft sequence of its Latin American relative now allows us to tackle the challenge of understanding the genome composition, structure and evolution of these palm genomes through the annotation of their repeated sequences. In this study, we identified, annotated and compared Transposable Elements (TE) from the African and Latin American oil palms. In a first step, Transposable Element databases were built through de novo detection in both genome sequences then the TE content of both genomes was estimated. Then putative full-length retrotransposons with Long Terminal Repeats (LTRs) were further identified in the E. guineensis genome for characterization of their structural diversity, copy number and chromosomal distribution. Finally, their relative expression in several tissues was determined through in silico analysis of publicly available transcriptome data. Our results reveal a congruence in the transpositional history of LTR retrotransposons between E. oleifera and E. guineensis, especially the Sto-4 family. Also, we have identified and described 583 full-length LTR-retrotransposons in the Elaeis guineensis genome. Our work shows that these elements are most likely no longer mobile and that no recent insertion event has occurred. Moreover, the analysis of chromosomal distribution suggests a preferential insertion of Copia elements in gene-rich regions, whereas Gypsy elements appear to be evenly distributed throughout the genome. Considering the high proportion of LTR retrotransposon in the oil palm genome, our work will contribute to a greater understanding of their impact on genome organization and evolution
Ex vivo response to histone deacetylase (HDAC inhibitors of the HIV long terminal repeat (LTR derived from HIV-infected patients on antiretroviral therapy.

Directory of Open Access Journals (Sweden)

Hao K Lu

Full Text Available Histone deacetylase inhibitors (HDACi can induce human immunodeficiency virus (HIV transcription from the HIV long terminal repeat (LTR. However, ex vivo and in vivo responses to HDACi are variable and the activity of HDACi in cells other than T-cells have not been well characterised. Here, we developed a novel assay to determine the activity of HDACi on patient-derived HIV LTRs in different cell types. HIV LTRs from integrated virus were amplified using triple-nested Alu-PCR from total memory CD4+ T-cells (CD45RO+ isolated from HIV-infected patients prior to and following suppressive antiretroviral therapy. NL4-3 or patient-derived HIV LTRs were cloned into the chromatin forming episomal vector pCEP4, and the effect of HDACi investigated in the astrocyte and epithelial cell lines SVG and HeLa, respectively. There were no significant differences in the sequence of the HIV LTRs isolated from CD4+ T-cells prior to and after 18 months of combination antiretroviral therapy (cART. We found that in both cell lines, the HDACi panobinostat, trichostatin A, vorinostat and entinostat activated patient-derived HIV LTRs to similar levels seen with NL4-3 and all patient derived isolates had similar sensitivity to maximum HDACi stimulation. We observed a marked difference in the maximum fold induction of luciferase by HDACi in HeLa and SVG, suggesting that the effect of HDACi may be influenced by the cellular environment. Finally, we observed significant synergy in activation of the LTR with vorinostat and the viral protein Tat. Together, our results suggest that the LTR sequence of integrated virus is not a major determinant of a functional response to an HDACi.
Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia.

Science.gov (United States)

Kojima, Kenji K; Jurka, Jerzy

2015-01-01

Most non-long terminal repeat (non-LTR) retrotransposons encoding a restriction-like endonuclease show target-specific integration into repetitive sequences such as ribosomal RNA genes and microsatellites. However, only a few target-specific lineages of non-LTR retrotransposons are distributed widely and no lineage is found across the eukaryotic kingdoms. Here we report the most widely distributed lineage of target sequence-specific non-LTR retrotransposons, designated Utopia. Utopia is found in three supergroups of eukaryotes: Amoebozoa, SAR, and Opisthokonta. Utopia is inserted into a specific site of U2 small nuclear RNA genes with different strength of specificity for each family. Utopia families from oomycetes and wasps show strong target specificity while only a small number of Utopia copies from reptiles are flanked with U2 snRNA genes. Oomycete Utopia families contain an "archaeal" RNase H domain upstream of reverse transcriptase (RT), which likely originated from a plant RNase H gene. Analysis of Utopia from oomycetes indicates that multiple lineages of Utopia have been maintained inside of U2 genes with few copy numbers. Phylogenetic analysis of RT suggests the monophyly of Utopia, and it likely dates back to the early evolution of eukaryotes.
LTR retrotransposons in fungi.

Directory of Open Access Journals (Sweden)

Anna Muszewska

Full Text Available Transposable elements with long terminal direct repeats (LTR TEs are one of the best studied groups of mobile elements. They are ubiquitous elements present in almost all eukaryotic genomes. Their number and state of conservation can be a highlight of genome dynamics. We searched all published fungal genomes for LTR-containing retrotransposons, including both complete, functional elements and remnant copies. We identified a total of over 66,000 elements, all of which belong to the Ty1/Copia or Ty3/Gypsy superfamilies. Most of the detected Gypsy elements represent Chromoviridae, i.e. they carry a chromodomain in the pol ORF. We analyzed our data from a genome-ecology perspective, looking at the abundance of various types of LTR TEs in individual genomes and at the highest-copy element from each genome. The TE content is very variable among the analyzed genomes. Some genomes are very scarce in LTR TEs (8000 elements. The data shows that transposon expansions in fungi usually involve an increase both in the copy number of individual elements and in the number of element types. The majority of the highest-copy TEs from all genomes are Ty3/Gypsy transposons. Phylogenetic analysis of these elements suggests that TE expansions have appeared independently of each other, in distant genomes and at different taxonomical levels. We also analyzed the evolutionary relationships between protein domains encoded by the transposon pol ORF and we found that the protease is the fastest evolving domain whereas reverse transcriptase and RNase H evolve much slower and in correlation with each other.
LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons.

Science.gov (United States)

Steinbiss, Sascha; Kastens, Sascha; Kurtz, Stefan

2012-11-07

Long terminal repeat (LTR) retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-)families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets), making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining the output of software for predicting LTR
[Non-LTR retrotransposons: LINEs and SINEs in plant genome].

Science.gov (United States)

Cheng, Xu-Dong; Ling, Hong-Qing

2006-06-01

Retrotransposons are one of the drivers of genome evolution. They include LTR (long terminal repeat) retrotransposons, which widespread in Eukaryotagenomes, show structural similarity to retroviruses. Non-LTR retrotransposons were first discovered in animal genomes and then identified as ubiquitous components of nuclear genomes in many species across the plant kingdom. They constitute a large fraction of the repetitive DNA. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements) and SINEs (short interspersed nuclear elements). Transposition of non-LTR retrotransposons is rarely observed in plants indicating that most of them are inactive and/or under regulation of the host genome. Transposition is poorly understood, but experimental evidence from other genetic systems shows that LINEs are able to transpose autonomously while non-autonomous SINEs depend on the reverse transcription machinery of other retrotransposons. Phylogenic analysis shows LINEs are probably the most ancient class of retrotransposons in plant genomes, while the origin of SINEs is unknown. This review sums up the above data and wants to show readers a clear picture of non-LTR retrotransposons.
Discovery and analysis of an active long terminal repeat-retrotransposable element in Aspergillus oryzae.

Science.gov (United States)

Jie Jin, Feng; Hara, Seiichi; Sato, Atsushi; Koyama, Yasuji

2014-01-01

Wild-type Aspergillus oryzae RIB40 contains two copies of the AO090005001597 gene. We previously constructed A. oryzae RIB40 strain, RKuAF8B, with multiple chromosomal deletions, in which the AO090005001597 copy number was found to be increased significantly. Sequence analysis indicated that AO090005001597 is part of a putative 6,000-bp retrotransposable element, flanked by two long terminal repeats (LTRs) of 669 bp, with characteristics of retroviruses and retrotransposons, and thus designated AoLTR (A. oryzae LTR-retrotransposable element). AoLTR comprised putative reverse transcriptase, RNase H, and integrase domains. The deduced amino acid sequence alignment of AoLTR showed 94% overall identity with AFLAV, an A. flavus Tf1/sushi retrotransposon. Quantitative real-time RT-PCR showed that AoLTR gene expression was significantly increased in the RKuAF8B, in accordance with the increased copy number. Inverse PCR indicated that the full-length retrotransposable element was randomly integrated into multiple genomic locations. However, no obvious phenotypic changes were associated with the increased AoLTR gene copy number.
Insertion of a solo LTR retrotransposon associates with spur mutations in 'Red Delicious' apple (Malus × domestica).

Science.gov (United States)

Han, Mengxue; Sun, Qibao; Zhou, Junyong; Qiu, Huarong; Guo, Jing; Lu, Lijuan; Mu, Wenlei; Sun, Jun

2017-09-01

Insertion of a solo LTR, which possesses strong bidirectional, stem-specific promoter activities, is associated with the evolution of a dwarfing apple spur mutation. Spur mutations in apple scions revolutionized global apple production. Since long terminal repeat (LTR) retrotransposons are tightly related to natural mutations, inter-retrotransposon-amplified polymorphism technique and genome walking were used to find sequences in the apple genome based on these LTRs. In 'Red Delicious' spur mutants, a novel, 2190-bp insertion was identified as a spur-specific, solo LTR (sLTR) located at the 1038th nucleotide of another sLTR, which was 1536 bp in length. This insertion-within-an-insertion was localized within a preexisting Gypsy-50 retrotransposon at position 3,762,767 on chromosome 4. The analysis of transcriptional activity of the two sLTRs (the 2190- and 1536-bp inserts) indicated that the 2190-bp sLTR is a promoter, capable of bidirectional transcription. GUS expression in the 2190-bp-sense and 2190-bp-antisense transgenic lines was prominent in stems. In contrast, no promoter activity from either the sense or the antisense strand of the 1536-bp sLTR was detected. From ~150 kb of DNA on each side of the 2190 bp, sLTR insertion site, corresponding to 300 kb of the 'Golden Delicious' genome, 23 genes were predicted. Ten genes had predicted functions that could affect shoot development. This first report, of a sLTR insertion associated with the evolution of apple spur mutation, will facilitate apple breeding, cloning of spur-related genes, and discovery of mechanisms behind dwarf habit.
Low levels of LTR retrotransposon deletion by ectopic recombination in the gigantic genomes of salamanders.

Science.gov (United States)

Frahry, Matthew Blake; Sun, Cheng; Chong, Rebecca A; Mueller, Rachel Lockridge

2015-02-01

Across the tree of life, species vary dramatically in nuclear genome size. Mutations that add or remove sequences from genomes-insertions or deletions, or indels-are the ultimate source of this variation. Differences in the tempo and mode of insertion and deletion across taxa have been proposed to contribute to evolutionary diversity in genome size. Among vertebrates, most of the largest genomes are found within the salamanders, an amphibian clade with genome sizes ranging from ~14 to ~120 Gb. Salamander genomes have been shown to experience slower rates of DNA loss through small (i.e., genomes. However, no studies have addressed DNA loss from salamander genomes resulting from larger deletions. Here, we focus on one type of large deletion-ectopic-recombination-mediated removal of LTR retrotransposon sequences. In ectopic recombination, double-strand breaks are repaired using a "wrong" (i.e., ectopic, or non-allelic) template sequence-typically another locus of similar sequence. When breaks occur within the LTR portions of LTR retrotransposons, ectopic-recombination-mediated repair can produce deletions that remove the internal transposon sequence and the equivalent of one of the two LTR sequences. These deletions leave a signature in the genome-a solo LTR sequence. We compared levels of solo LTRs in the genomes of four salamander species with levels present in five vertebrates with smaller genomes. Our results demonstrate that salamanders have low levels of solo LTRs, suggesting that ectopic-recombination-mediated deletion of LTR retrotransposons occurs more slowly than in other vertebrates with smaller genomes.
LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons

Directory of Open Access Journals (Sweden)

Steinbiss Sascha

2012-11-01

Full Text Available Abstract Background Long terminal repeat (LTR retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets, making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. Results We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. Conclusions LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining
Convergent evolution of ribonuclease h in LTR retrotransposons and retroviruses.

Science.gov (United States)

Ustyantsev, Kirill; Novikova, Olga; Blinov, Alexander; Smyshlyaev, Georgy

2015-05-01

Ty3/Gypsy long terminals repeat (LTR) retrotransposons are structurally and phylogenetically close to retroviruses. Two notable structural differences between these groups of genetic elements are 1) the presence in retroviruses of an additional envelope gene, env, which mediates infection, and 2) a specific dual ribonuclease H (RNH) domain encoded by the retroviral pol gene. However, similar to retroviruses, many Ty3/Gypsy LTR retrotransposons harbor additional env-like genes, promoting concepts of the infective mode of these retrotransposons. Here, we provide a further line of evidence of similarity between retroviruses and some Ty3/Gypsy LTR retrotransposons. We identify that, together with their additional genes, plant Ty3/Gypsy LTR retrotransposons of the Tat group have a second RNH, as do retroviruses. Most importantly, we show that the resulting dual RNHs of Tat LTR retrotransposons and retroviruses emerged independently, providing strong evidence for their convergent evolution. The convergent resemblance of Tat LTR retrotransposons and retroviruses may indicate similar selection pressures acting on these diverse groups of elements and reveal potential evolutionary constraints on their structure. We speculate that dual RNH is required to accelerate retrotransposon evolution through increased rates of strand transfer events and subsequent recombination events. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Novel expressed sequence tag- simple sequence repeats (EST ...

African Journals Online (AJOL)

Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...
Structure of long terminal repeats of transcriptionally active and inactive copies of Drosophila mobile dispersed genetic elements mdg3

International Nuclear Information System (INIS)

Dzhumagaliev, E.B.; Mazo, A.N.; Baev, A.A. Jr.; Gorelova, T.V.; Arkhipova, I.R.; Shuppe, N.G.; Il'in, Yu.V.

1986-01-01

The authors have determined the nucleotide sequences of long terminal repeats (LTRS) and adjacent regions in the transcribed and nontranscribed variants of the mobile dispersed gene mdg3. In its main characteristics the mdg3 is similar to other mdg. Its integration into chromosomal DNA brings about duplication of the 4 bp of the host DNA, no specificity of the mdg integration at the nucleotide level being detected. The mdg3 is flanked by a 5 bp inverted repeat. The variations in the length of the LTR in different mdg copies is mainly due to duplication of certain sequences in the U3 and R regions. mdg3 copies with a LTR length of 267 bp are the most abundant and are completely conservative in their primary structure. They are transcribed in the cells of the 67J25D culture, but not transcribed in the K/sub c/ line, where another mdg3 variant with a LTR length of 293 bp is transcriptionally active. The SI mapping of transcription initiation and termination sites has shown that in both mdg3 variants they are localized in the same LTR regions, and that the LTR itself has a characteristic U3-R-U5 structure-like retroviral LTRs. The possible factors involved in the regulation of mdg transcription are discussed

Genome-wide analysis of LTR-retrotransposon diversity and its impact on the evolution of the genus Helianthus (L.).

Science.gov (United States)

Mascagni, Flavia; Giordani, Tommaso; Ceccarelli, Marilena; Cavallini, Andrea; Natali, Lucia

2017-08-18

Genome divergence by mobile elements activity and recombination is a continuous process that plays a key role in the evolution of species. Nevertheless, knowledge on retrotransposon-related variability among species belonging to the same genus is still limited. Considering the importance of the genus Helianthus, a model system for studying the ecological genetics of speciation and adaptation, we performed a comparative analysis of the repetitive genome fraction across ten species and one subspecies of sunflower, focusing on long terminal repeat retrotransposons at superfamily, lineage and sublineage levels. After determining the relative genome size of each species, genomic DNA was isolated and subjected to Illumina sequencing. Then, different assembling and clustering approaches allowed exploring the repetitive component of all genomes. On average, repetitive DNA in Helianthus species represented more than 75% of the genome, being composed mostly by long terminal repeat retrotransposons. Also, the prevalence of Gypsy over Copia superfamily was observed and, among lineages, Chromovirus was by far the most represented. Although nearly all the same sublineages are present in all species, we found considerable variability in the abundance of diverse retrotransposon lineages and sublineages, especially between annual and perennial species. This large variability should indicate that different events of amplification or loss related to these elements occurred following species separation and should have been involved in species differentiation. Our data allowed us inferring on the extent of interspecific repetitive DNA variation related to LTR-RE abundance, investigating the relationship between changes of LTR-RE abundance and the evolution of the genus, and determining the degree of coevolution of different LTR-RE lineages or sublineages between and within species. Moreover, the data suggested that LTR-RE abundance in a species was affected by the annual or perennial
Structure and possible function of a G-quadruplex in the long terminal repeat of the proviral HIV-1 genome.

Science.gov (United States)

De Nicola, Beatrice; Lech, Christopher J; Heddi, Brahim; Regmi, Sagar; Frasson, Ilaria; Perrone, Rosalba; Richter, Sara N; Phan, Anh Tuân

2016-07-27

The long terminal repeat (LTR) of the proviral human immunodeficiency virus (HIV)-1 genome is integral to virus transcription and host cell infection. The guanine-rich U3 region within the LTR promoter, previously shown to form G-quadruplex structures, represents an attractive target to inhibit HIV transcription and replication. In this work, we report the structure of a biologically relevant G-quadruplex within the LTR promoter region of HIV-1. The guanine-rich sequence designated LTR-IV forms a well-defined structure in physiological cationic solution. The nuclear magnetic resonance (NMR) structure of this sequence reveals a parallel-stranded G-quadruplex containing a single-nucleotide thymine bulge, which participates in a conserved stacking interaction with a neighboring single-nucleotide adenine loop. Transcription analysis in a HIV-1 replication competent cell indicates that the LTR-IV region may act as a modulator of G-quadruplex formation in the LTR promoter. Consequently, the LTR-IV G-quadruplex structure presented within this work could represent a valuable target for the design of HIV therapeutics. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Nuclear Matrix protein SMAR1 represses HIV-1 LTR mediated transcription through chromatin remodeling

International Nuclear Information System (INIS)

Sreenath, Kadreppa; Pavithra, Lakshminarasimhan; Singh, Sandeep; Sinha, Surajit; Dash, Prasanta K.; Siddappa, Nagadenahalli B.; Ranga, Udaykumar; Mitra, Debashis; Chattopadhyay, Samit

2010-01-01

Nuclear Matrix and MARs have been implicated in the transcriptional regulation of host as well as viral genes but their precise role in HIV-1 transcription remains unclear. Here, we show that > 98% of HIV sequences contain consensus MAR element in their promoter. We show that SMAR1 binds to the LTR MAR and reinforces transcriptional silencing by tethering the LTR MAR to nuclear matrix. SMAR1 associated HDAC1-mSin3 corepressor complex is dislodged from the LTR upon cellular activation by PMA/TNFα leading to an increase in the acetylation and a reduction in the trimethylation of histones, associated with the recruitment of RNA Polymerase II on the LTR. Overexpression of SMAR1 lead to reduction in LTR mediated transcription, both in a Tat dependent and independent manner, resulting in a decreased virion production. These results demonstrate the role of SMAR1 in regulating viral transcription by alternative compartmentalization of LTR between the nuclear matrix and chromatin.
Repeated DNA sequences in fungi

Energy Technology Data Exchange (ETDEWEB)

Dutta, S K

1974-11-01

Several fungal species, representatives of all broad groups like basidiomycetes, ascomycetes and phycomycetes, were examined for the nature of repeated DNA sequences by DNA:DNA reassociation studies using hydroxyapatite chromatography. All of the fungal species tested contained 10 to 20 percent repeated DNA sequences. There are approximately 100 to 110 copies of repeated DNA sequences of approximately 4 x 10/sup 7/ daltons piece size of each. Repeated DNA sequence homoduplexes showed on average 5/sup 0/C difference of T/sub e/50 (temperature at which 50 percent duplexes dissociate) values from the corresponding homoduplexes of unfractionated whole DNA. It is suggested that a part of repetitive sequences in fungi constitutes mitochondrial DNA and a part of it constitutes nuclear DNA. (auth)
Human immunodeficiency virus long terminal repeat responds to T-cell activation signals

International Nuclear Information System (INIS)

Tong-Starksen, S.E.; Luciw, P.A.; Peterlin, B.M.

1987-01-01

Human immunodeficiency virus (HIV), the causative agent of AIDS, infects and kills lymphoid cells bearing the CD4 antigen. In an infected cell, a number of cellular as well as HIV-encoded gene products determine the levels of viral gene expression and HIV replication. Efficient HIV replication occurs in activated T cells. Utilizing transient expression assays, the authors show that gene expression directed by the HIV long terminal repeat (LTR) increases in response to T-cell activation signals. The effects of T-cell activation and of the HIV-encoded trans-activator (TAT) are multiplicative. Analysis of mutations and deletions in the HIV LTR reveals that the region responding to T-cell activation signals is located at positions -105 to -80. These sequences are composed of two direct repeats, which are homologous to the core transcriptional enhancer elements in the simian virus 40 genome. The studies reveal that these elements function as the HIV enhancer. By acting directly on the HIV LTR, T-cell activation may play an important role in HIV gene expression and in the activation of latent HIV
Human Immunodeficiency Virus-Type 1 LTR DNA contains an intrinsic gene producing antisense RNA and protein products

Directory of Open Access Journals (Sweden)

Hsiao Chiu-Bin

2006-11-01

Full Text Available Abstract Background While viruses have long been shown to capitalize on their limited genomic size by utilizing both strands of DNA or complementary DNA/RNA intermediates to code for viral proteins, it has been assumed that human retroviruses have all their major proteins translated only from the plus or sense strand of RNA, despite their requirement for a dsDNA proviral intermediate. Several studies, however, have suggested the presence of antisense transcription for both HIV-1 and HTLV-1. More recently an antisense transcript responsible for the HTLV-1 bZIP factor (HBZ protein has been described. In this study we investigated the possibility of an antisense gene contained within the human immunodeficiency virus type 1 (HIV-1 long terminal repeat (LTR. Results Inspection of published sequences revealed a potential transcription initiator element (INR situated downstream of, and in reverse orientation to, the usual HIV-1 promoter and transcription start site. This antisense initiator (HIVaINR suggested the possibility of an antisense gene responsible for RNA and protein production. We show that antisense transcripts are generated, in vitro and in vivo, originating from the TAR DNA of the HIV-1 LTR. To test the possibility that protein(s could be translated from this novel HIV-1 antisense RNA, recombinant HIV antisense gene-FLAG vectors were designed. Recombinant protein(s were produced and isolated utilizing carboxy-terminal FLAG epitope (DYKDDDDK sequences. In addition, affinity-purified antisera to an internal peptide derived from the HIV antisense protein (HAP sequences identified HAPs from HIV+ human peripheral blood lymphocytes. Conclusion HIV-1 contains an antisense gene in the U3-R regions of the LTR responsible for both an antisense RNA transcript and proteins. This antisense transcript has tremendous potential for intrinsic RNA regulation because of its overlap with the beginning of all HIV-1 sense RNA transcripts by 25 nucleotides. The
Not so bad after all: retroviruses and long terminal repeat retrotransposons as a source of new genes in vertebrates.

Science.gov (United States)

Naville, M; Warren, I A; Haftek-Terreau, Z; Chalopin, D; Brunet, F; Levin, P; Galiana, D; Volff, J-N

2016-04-01

Viruses and transposable elements, once considered as purely junk and selfish sequences, have repeatedly been used as a source of novel protein-coding genes during the evolution of most eukaryotic lineages, a phenomenon called 'molecular domestication'. This is exemplified perfectly in mammals and other vertebrates, where many genes derived from long terminal repeat (LTR) retroelements (retroviruses and LTR retrotransposons) have been identified through comparative genomics and functional analyses. In particular, genes derived from gag structural protein and envelope (env) genes, as well as from the integrase-coding and protease-coding sequences, have been identified in humans and other vertebrates. Retroelement-derived genes are involved in many important biological processes including placenta formation, cognitive functions in the brain and immunity against retroelements, as well as in cell proliferation, apoptosis and cancer. These observations support an important role of retroelement-derived genes in the evolution and diversification of the vertebrate lineage. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Characterization of EIAV LTR variability and compartmentalization in various reservoir tissues of long-term inapparent carrier ponies

International Nuclear Information System (INIS)

Reis, Jenner K.P.; Craigo, Jodi K.; Cook, Sheila J.; Issel, Charles J.; Montelaro, Ronald C.

2003-01-01

Dynamic genomic variation resulting in changes in envelope antigenicity has been established as a fundamental mechanism of persistence by equine infectious anemia virus (EIAV), as observed with other lentiviruses, including HIV-1. In addition to the reported changes in envelope sequences, however, certain studies indicate the viral LTR as a second variable EIAV gene, with the enhancer region being designated as hypervariable. These observations have lead to the suggestion that LTR variation may alter viral replication properties to optimize to the microenvironment of particular tissue reservoirs. To test this hypothesis directly, we examined the population of LTR quasispecies contained in various tissues of two inapparent carrier ponies experimentally infected with a reference EIAV biological clone for 18 months. The results of these studies demonstrated that the EIAV LTR is in fact highly conserved with respect to the infecting LTR species after 1.5 years of persistent infection and regardless of the tissue reservoir. Thus, these comprehensive analyses demonstrate for the first time that the EIAV LTR is highly conserved during long-term persistent infection and that the observed variations in viral LTR are associated more with in vitro adaptation to replication in cultured cells rather than in vivo replication in natural target cells
Optimization of sequence alignment for simple sequence repeat regions

Directory of Open Access Journals (Sweden)

Ogbonnaya Francis C

2011-07-01

Full Text Available Abstract Background Microsatellites, or simple sequence repeats (SSRs, are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs. SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic
Evolutionary characterization of Ty3/gypsy-like LTR retrotransposons in the parasitic cestode Echinococcus granulosus.

Science.gov (United States)

Bae, Young-An

2016-11-01

Cyclophyllidean cestodes including Echinococcus granulosus have a smaller genome and show characteristics such as loss of the gut, a segmented body plan, and accelerated growth rate in hosts compared with other tissue-invading helminths. In an effort to address the molecular mechanism relevant to genome shrinkage, the evolutionary status of long-terminal-repeat (LTR) retrotransposons, which are known as the most potent genomic modulators, was investigated in the E. granulosus draft genome. A majority of the E. granulosus LTR retrotransposons were classified into a novel characteristic clade, named Saci-2, of the Ty3/gypsy family, while the remaining elements belonged to the CsRn1 clade of identical family. Their nucleotide sequences were heavily corrupted by frequent base substitutions and segmental losses. The ceased mobile activity of the major retrotransposons and the following intrinsic DNA loss in their inactive progenies might have contributed to decrease in genome size. Apart from the degenerate copies, a gag gene originating from a CsRn1-like element exhibited substantial evidences suggesting its domestication including a preserved coding profile and transcriptional activity, the presence of syntenic orthologues in cestodes, and selective pressure acting on the gene. To my knowledge, the endogenized gag gene is reported for the first time in invertebrates, though its biological function remains elusive.
Large-scale transcriptome data reveals transcriptional activity of fission yeast LTR retrotransposons

DEFF Research Database (Denmark)

Mourier, Tobias; Willerslev, Eske

2010-01-01

of transcriptional activity are observed from both strands of solitary LTR sequences. Transcriptome data collected during meiosis suggests that transcription of solitary LTRs is correlated with the transcription of nearby protein-coding genes. CONCLUSIONS: Presumably, the host organism negatively regulates...
Analysis of plant LTR-retrotransposons at the fine-scale family level reveals individual molecular patterns

Directory of Open Access Journals (Sweden)

Domingues Douglas S

2012-04-01

Full Text Available Abstract Background Sugarcane is an important crop worldwide for sugar production and increasingly, as a renewable energy source. Modern cultivars have polyploid, large complex genomes, with highly unequal contributions from ancestral genomes. Long Terminal Repeat retrotransposons (LTR-RTs are the single largest components of most plant genomes and can substantially impact the genome in many ways. It is therefore crucial to understand their contribution to the genome and transcriptome, however a detailed study of LTR-RTs in sugarcane has not been previously carried out. Results Sixty complete LTR-RT elements were classified into 35 families within four Copia and three Gypsy lineages. Structurally, within lineages elements were similar, between lineages there were large size differences. FISH analysis resulted in the expected pattern of Gypsy/heterochromatin, Copia/euchromatin, but in two lineages there was localized clustering on some chromosomes. Analysis of related ESTs and RT-PCR showed transcriptional variation between tissues and families. Four distinct patterns were observed in sRNA mapping, the most unusual of which was that of Ale1, with very large numbers of 24nt sRNAs in the coding region. The results presented support the conclusion that distinct small RNA-regulated pathways in sugarcane target the lineages of LTR-RT elements. Conclusions Individual LTR-RT sugarcane families have distinct structures, and transcriptional and regulatory signatures. Our results indicate that in sugarcane individual LTR-RT families have distinct behaviors and can potentially impact the genome in diverse ways. For instance, these transposable elements may affect nearby genes by generating a diverse set of small RNA's that trigger gene silencing mechanisms. There is also some evidence that ancestral genomes contribute significantly different element numbers from particular LTR-RT lineages to the modern sugarcane cultivar genome.
Effects of As2O3 on DNA methylation, genomic instability, and LTR retrotransposon polymorphism in Zea mays.

Science.gov (United States)

Erturk, Filiz Aygun; Aydin, Murat; Sigmaz, Burcu; Taspinar, M Sinan; Arslan, Esra; Agar, Guleray; Yagci, Semra

2015-12-01

Arsenic is a well-known toxic substance on the living organisms. However, limited efforts have been made to study its DNA methylation, genomic instability, and long terminal repeat (LTR) retrotransposon polymorphism causing properties in different crops. In the present study, effects of As2O3 (arsenic trioxide) on LTR retrotransposon polymorphism and DNA methylation as well as DNA damage in Zea mays seedlings were investigated. The results showed that all of arsenic doses caused a decreasing genomic template stability (GTS) and an increasing Random Amplified Polymorphic DNAs (RAPDs) profile changes (DNA damage). In addition, increasing DNA methylation and LTR retrotransposon polymorphism characterized a model to explain the epigenetically changes in the gene expression were also found. The results of this experiment have clearly shown that arsenic has epigenetic effect as well as its genotoxic effect. Especially, the increasing of polymorphism of some LTR retrotransposon under arsenic stress may be a part of the defense system against the stress.
Association of endogenous retroviruses and long terminal repeats with human disorders

Directory of Open Access Journals (Sweden)

Iyoko eKatoh

2013-09-01

Full Text Available Since the human genome sequences became available in 2001, our knowledge about the human transposable elements which comprise ~40% of the total nucleotides has been expanding. Non- LTR (long terminal repeat retrotransposons are actively transposing in the present-day human genome, and have been found to cause ~100 identified clinical cases of varied disorders. In contrast, almost all of the human endogenous retroviruses (HERVs originating from ancient infectious retroviruses lost their infectivity and transposing activity at various times before the human-chimpanzee speciation (~6 million years ago, and no known HERV is presently infectious. Insertion of HERVs and mammalian apparent LTR retrotransposons (MaLRs into the chromosomal DNA influenced a number of host genes in various modes during human evolution. Apart from the aspect of genome evolution, HERVs and solitary LTRs being suppressed in normal biological processes can potentially act as extra transcriptional apparatuses of cellular genes by re-activation in individuals. There has been a reasonable prediction that aberrant LTR activation could trigger malignant disorders and autoimmune responses if epigenetic changes including DNA hypomethylation occur in somatic cells. Evidence supporting this hypothesis has begun to emerge only recently: a MaLR family LTR activation in the pathogenesis of Hodgkin’s lymphoma and a HERV-E antigen expression in an anti-renal cell carcinoma immune response. This mini review addresses the impacts of the remnant-form LTR retrotransposons on human pathogenesis.
LTR-retrotransposons-based molecular markers in cultivated ...

African Journals Online (AJOL)

GRACE

2006-07-03

Jul 3, 2006 ... LTR-retrotransposons represent a standard component of the Gossypium Genome (Zaki and Abdel Ghany,. 2003). The analysis of the molecular existence and distribution of ancient and active LTR-retrotransposons, therefore, provides a comprehensive evaluation of the evolutionary history of Gossypium.
Evolutionary genomics revealed interkingdom distribution of Tcn1-like chromodomain-containing Gypsy LTR retrotransposons among fungi and plants

Directory of Open Access Journals (Sweden)

Blinov Alexander

2010-04-01

Full Text Available Abstract Background Chromodomain-containing Gypsy LTR retrotransposons or chromoviruses are widely distributed among eukaryotes and have been found in plants, fungi and vertebrates. The previous comprehensive survey of chromoviruses from mosses (Bryophyta suggested that genomes of non-seed plants contain the clade which is closely related to the retrotransposons from fungi. The origin, distribution and evolutionary history of this clade remained unclear mainly due to the absence of information concerning the diversity and distribution of LTR retrotransposons in other groups of non-seed plants as well as in fungal genomes. Results In present study we preformed in silico analysis of chromodomain-containing LTR retrotransposons in 25 diverse fungi and a number of plant species including spikemoss Selaginella moellendorffii (Lycopodiophyta coupled with an experimental survey of chromodomain-containing Gypsy LTR retrotransposons from diverse non-seed vascular plants (lycophytes, ferns, and horsetails. Our mining of Gypsy LTR retrotransposons in genomic sequences allowed identification of numerous families which have not been described previously in fungi. Two new well-supported clades, Galahad and Mordred, as well as several other previously unknown lineages of chromodomain-containing Gypsy LTR retrotransposons were described based on the results of PCR-mediated survey of LTR retrotransposon fragments from ferns, horsetails and lycophytes. It appeared that one of the clades, namely Tcn1 clade, was present in basidiomycetes and non-seed plants including mosses (Bryophyta and lycophytes (genus Selaginella. Conclusions The interkingdom distribution is not typical for chromodomain-containing LTR retrotransposons clades which are usually very specific for a particular taxonomic group. Tcn1-like LTR retrotransposons from fungi and non-seed plants demonstrated high similarity to each other which can be explained by strong selective constraints and the
LTR retrotransposon dynamics in the evolution of the olive (Olea europaea) genome

Czech Academy of Sciences Publication Activity Database

Barghini, E.; Natali, L.; Giordani, T.; Cossu, R.M.; Scalabrin, S.; Cattonaro, F.; Šimková, Hana; Vrána, Jan; Doležel, Jaroslav; Morgante, M.; Cavallini, A.

2015-01-01

Roč. 22, č. 1 (2015), s. 91-100 ISSN 1340-2838 R&D Projects: GA ČR GBP501/12/G090; GA MŠk(CZ) LO1204 Institutional support: RVO:61389030 Keywords : LTR retrotransposons * next-generation sequencing * olive Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.267, year: 2015
Myelodysplastic syndromes and acute myeloid leukemia in cats infected with feline leukemia virus clone33 containing a unique long terminal repeat.

Science.gov (United States)

Hisasue, Masaharu; Nagashima, Naho; Nishigaki, Kazuo; Fukuzawa, Isao; Ura, Shigeyoshi; Katae, Hiromi; Tsuchiya, Ryo; Yamada, Takatsugu; Hasegawa, Atsuhiko; Tsujimoto, Hajime

2009-03-01

Feline leukemia virus (FeLV) clone33 was obtained from a domestic cat with acute myeloid leukemia (AML). The long terminal repeat (LTR) of this virus, like the LTRs present in FeLV from other cats with AML, differs from the LTRs of other known FeLV in that it has 3 tandem direct 47-bp repeats in the upstream region of the enhancer (URE). Here, we injected cats with FeLV clone33 and found 41% developed myelodysplastic syndromes (MDS) characterized by peripheral blood cytopenias and dysplastic changes in the bone marrow. Some of the cats with MDS eventually developed AML. The bone marrow of the majority of cats with FeLV clone33 induced MDS produced fewer erythroid and myeloid colonies upon being cultured with erythropoietin or granulocyte-macrophage colony-stimulating factor (GM-SCF) than bone marrow from normal control cats. Furthermore, the bone marrow of some of the cats expressed high-levels of the apoptosis-related genes TNF-alpha and survivin. Analysis of the proviral sequences obtained from 13 cats with naturally occurring MDS reveal they also bear the characteristic URE repeats seen in the LTR of FeLV clone33 and other proviruses from cats with AML. Deletions and mutations within the enhancer elements are frequently observed in naturally occurring MDS as well as AML. These results suggest that FeLV variants that bear URE repeats in their LTR strongly associate with the induction of both MDS and AML in cats.
Inability of Kaplan radiation leukemia virus to replicate on mouse fibroblasts is conferred by its long terminal repeat

International Nuclear Information System (INIS)

Rassart, E.; Paquette, Y.; Jolicoeur, P.

1988-01-01

The molecularly cloned infectious Kaplan radiation leukemia virus has previously been shown to be unable to replicate on mouse fibroblasts. To map the viral sequences responsible for this, we constructed chimeric viral DNA genomes in vitro with parental cloned infectious viral DNAs from the nonfibrotropic (F-) BL/VL3 V-13 radiation leukemia virus and the fibrotropic (F+) endogenous BALB/c or Moloney murine leukemia viruses (MuLV). Infectious chimeric MuLVs, recovered after transfection of Ti-6 lymphocytes with these recombinant DNAs, were tested for capacity to replicate on mouse fibroblasts in vitro. We found that chimeric MuLVs harboring the long terminal repeat (LTR) of a fibrotropic MuLV replicated well on mouse fibroblasts. Conversely, chimeric MuLVs harboring the LTR of a nonfibrotropic MuLV were restricted on mouse fibroblasts. These results indicate that the LTR of BL/VL3 radiation leukemia virus harbors the primary determinant responsible for its inability to replicate on mouse fibroblasts in vitro. Our results also show that the primary determinant allowing F+ MuLVs (endogenous BALB/c and Moloney MuLVs) to replicate on mouse fibroblasts in vitro resides within the LTR
FoxA1 binding to the MMTV LTR modulates chromatin structure and transcription

International Nuclear Information System (INIS)

Holmqvist, Per-Henrik; Belikov, Sergey; Zaret, Kenneth S.; Wrange, Oerjan

2005-01-01

Novel binding sites for the forkhead transcription factor family member Forkhead box A (FoxA), previously referred to as Hepatocyte Nuclear Factor 3 (HNF3), were found within the mouse mammary tumor virus long terminal repeat (MMTV LTR). The effect of FoxA1 on MMTV LTR chromatin structure, and expression was evaluated in Xenopus laevis oocytes. Mutagenesis of either of the two main FoxA binding sites showed that the distal site, -232/-221, conferred FoxA1-dependent partial inhibition of glucocorticoid receptor (GR) driven MMTV transcription. The proximal FoxA binding segment consisted of two individual FoxA sites at -57/-46 and -45/-34, respectively, that mediated an increased basal MMTV transcription. FoxA1 binding altered the chromatin structure of both the inactive- and the hormone-activated MMTV LTR. Hydroxyl radical foot printing revealed FoxA1-mediated changes in the nucleosome arrangement. Micrococcal nuclease digestion showed the hormone-dependent sub-nucleosome complex, containing ∼120 bp of DNA, to be expanded by FoxA1 binding to the proximal segment into a larger complex containing ∼200 bp. The potential function of the FoxA1-mediated expression of the MMTV provirus for maintenance of expression in different tissues is discussed

A parametric LTR solution for discrete-time systems

DEFF Research Database (Denmark)

Niemann, Hans Henrik; Jannerup, Ole Erik

1989-01-01

A parametric LTR (loop transfer recovery) solution for discrete-time compensators incorporating filtering observers which achieve exact recovery is presented for both minimum- and non-minimum-phase systems. First the recovery error, which defines the difference between the target loop transfer...... and the full loop transfer function, is manipulated into a general form involving the target loop transfer matrix and the fundamental recovery matrix. A parametric LTR solution based on the recovery matrix is developed. It is shown that the LQR/LTR (linear quadratic Gaussian/loop transfer recovery) solution...
simple sequence repeat (SSR)

African Journals Online (AJOL)

In the present study, 78 mapped simple sequence repeat (SSR) markers representing 11 linkage groups of adzuki bean were evaluated for transferability to mungbean and related Vigna spp. 41 markers amplified characteristic bands in at least one Vigna species. The transferability percentage across the genotypes ranged ...
Genomic Characterization for Parasitic Weeds of the Genus Striga by Sample Sequence Analysis

Directory of Open Access Journals (Sweden)

Matt C. Estep

2012-03-01

Full Text Available Generation of ∼2200 Sanger sequence reads or ∼10,000 454 reads for seven Lour. DNA samples (five species allowed identification of the highly repetitive DNA content in these genomes. The 14 most abundant repeats in these species were identified and partially assembled. Annotation indicated that they represent nine long terminal repeat (LTR retrotransposon families, three tandem satellite repeats, one long interspersed element (LINE retroelement, and one DNA transposon. All of these repeats are most closely related to repetitive elements in other closely related plants and are not products of horizontal transfer from their host species. These repeats were differentially abundant in each species, with the LTR retrotransposons and satellite repeats most responsible for variation in genome size. Each species had some repetitive elements that were more abundant and some less abundant than the other species examined, indicating that no single element or any unilateral growth or decrease trend in genome behavior was responsible for variation in genome size and composition. Genome sizes were determined by flow sorting, and the values of 615 Mb [ (L. Kuntze], 1330 Mb [ (Willd. Vatke], 1425 Mb [ (Delile Benth.] and 2460 Mb ( Benth. suggest a ploidy series, a prediction supported by repetitive DNA sequence analysis. Phylogenetic analysis using six chloroplast loci indicated the ancestral relationships of the five most agriculturally important species, with the unexpected result that the one parasite of dicotyledonous plants ( was found to be more closely related to some of the grass parasites than many of the grass parasites are to each other.
An application of LTR design in fault detection

DEFF Research Database (Denmark)

Niemann, Hans Henrik

1998-01-01

The fault detection and isolation (FDI) problem is considered in this paper. The FDI problem is formulated as a filter design problem, where the faults in the system is estimated and the disturbance acting on the system is rejected. It turns out that the filter design problem can be considered...... as a standard Loop Transfer Recovery (LTR) design problem. As a consequence of the connection between LTR and FDI design, it is shown in an example how the LQG/LTR design method for full order and a proportional-integral observer can be applied with advantages in connection with FDI....
A novel function for spumaretrovirus integrase: an early requirement for integrase-mediated cleavage of 2 LTR circles

Directory of Open Access Journals (Sweden)

Mouscadet Jean-François

2005-05-01

Full Text Available Abstract Retroviral integration is central to viral persistence and pathogenesis, cancer as well as host genome evolution. However, it is unclear why integration appears essential for retrovirus production, especially given the abundance and transcriptional potential of non-integrated viral genomes. The involvement of retroviral endonuclease, also called integrase (IN, in replication steps apart from integration has been proposed, but is usually considered to be accessory. We observe here that integration of a retrovirus from the spumavirus family depends mainly on the quantity of viral DNA produced. Moreover, we found that IN directly participates to linear DNA production from 2-LTR circles by specifically cleaving the conserved palindromic sequence found at LTR-LTR junctions. These results challenge the prevailing view that integrase essential function is to catalyze retroviral DNA integration. Integrase activity upstream of this step, by controlling linear DNA production, is sufficient to explain the absolute requirement for this enzyme. The novel role of IN over 2-LTR circle junctions accounts for the pleiotropic effects observed in cells infected with IN mutants. It may explain why 1 2-LTR circles accumulate in vivo in mutants carrying a defective IN while their linear and integrated DNA pools decrease; 2 why both LTRs are processed in a concerted manner. It also resolves the original puzzle concerning the integration of spumaretroviruses. More generally, it suggests to reassess 2-LTR circles as functional intermediates in the retrovirus cycle and to reconsider the idea that formation of the integrated provirus is an essential step of retrovirus production.
Deletion of the LTR enhancer/promoter has no impact on the integration profile of MLV vectors in human hematopoietic progenitors.

Directory of Open Access Journals (Sweden)

Arianna Moiani

Full Text Available Moloney murine leukemia virus (MLV-derived gamma-retroviral vectors integrate preferentially near transcriptional regulatory regions in the human genome, and are associated with a significant risk of insertional gene deregulation. Self-inactivating (SIN vectors carry a deletion of the U3 enhancer and promoter in the long terminal repeat (LTR, and show reduced genotoxicity in pre-clinical assays. We report a high-definition analysis of the integration preferences of a SIN MLV vector compared to a wild-type-LTR MLV vector in the genome of CD34(+ human hematopoietic stem/progenitor cells (HSPCs. We sequenced 13,011 unique SIN-MLV integration sites and compared them to 32,574 previously generated MLV sites in human HSPCs. The SIN-MLV vector recapitulates the integration pattern observed for MLV, with the characteristic clustering of integrations around enhancer and promoter regions associated to H3K4me3 and H3K4me1 histone modifications, specialized chromatin configurations (presence of the H2A.Z histone variant and binding of RNA Pol II. SIN-MLV and MLV integration clusters and hot spots overlap in most cases and are generated at a comparable frequency, indicating that the reduced genotoxicity of SIN-MLV vectors in hematopoietic cells is not due to a modified integration profile.
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

Science.gov (United States)

Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

1997-12-01

Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
A Theory of LTR Junk-food Consumption

OpenAIRE

Levy, Amnon

2003-01-01

LTR junk-food consumption balances the marginal satisfaction with the marginal deterioration of health. An LTR person discounts the instantaneous marginal satisfaction from junk-food consumption by its implications for his survival probability. His change rate of health evaluation is increased (decreased) by junk-food consumption when health is better (worse) than a critical level. The moderating direct effects of age and relative price on junk-food consumption may be amplified, or dimmed, by...
Survey of transposable elements in sugarcane expressed sequence tags (ESTs

Directory of Open Access Journals (Sweden)

Rossi Magdalena

2001-01-01

Full Text Available The sugarcane expressed sequence tag (SUCEST project has produced a large number of cDNA sequences from several plant tissues submitted or not to different conditions of stress. In this paper we report the result of a search for transposable elements (TEs revealing a surprising amount of expressed TEs homologues. Of the 260,781 sequences grouped in 81,223 fragment assembly program (Phrap clusters, a total of 276 clones showed homology to previously reported TEs using a stringent cut-off value of e-50 or better. Homologous clones to Copia/Ty1 and Gypsy/Ty3 groups of long terminal repeat (LTR retrotransposons were found but no non-LTR retroelements were identified. All major transposon families were represented in sugarcane including Activator (Ac, Mutator (MuDR, Suppressor-mutator (En/Spm and Mariner. In order to compare the TE diversity in grasses genomes, we carried out a search for TEs described in sugarcane related species O.sativa, Z. mays and S. bicolor. We also present preliminary results showing the potential use of TEs insertion pattern polymorphism as molecular markers for cultivar identification.
Genetic alterations of the long terminal repeat of an ecotropic porcine endogenous retrovirus during passage in human cells

International Nuclear Information System (INIS)

Denner, Joachim; Specke, Volker; Thiesen, Ulla; Karlas, Alexander; Kurth, Reinhard

2003-01-01

Human-tropic porcine endogenous retroviruses (PERV) such as PERV-A and PERV-B can infect human cells and are therefore a potential risk to recipients of xenotransplants. A similar risk is posed by recombinant viruses containing the receptor-binding site of PERV-A and large parts of the genome of the ecotropic PERV-C including its long terminal repeat (LTR). We describe here the unique organization of the PERV-C LTR and its changes during serial passage of recombinant virus in human cells. An increase in virus titer correlated with an increase in LTR length, caused by multiplication of 37-bp repeats containing nuclear factor Y binding sites. Luciferase dual reporter assays revealed a correlation between the number of repeats and the extent of expression. No alterations have been observed in the receptor-binding site, indicating that the increased titer is due to the changes in the LTR. These data indicate that recombinant PERVs generated during infection of human cells can adapt and subsequently replicate with greater efficiency
A specific insertion of a solo-LTR characterizes the Y-chromosome of Bryonia dioica (Cucurbitaceae).

Science.gov (United States)

Oyama, Ryan K; Silber, Martina V; Renner, Susanne S

2010-06-14

Relatively few species of flowering plants are dioecious and even fewer are known to have sex chromosomes. Current theory posits that homomorphic sex chromosomes, such as found in Bryonia dioica (Cucurbitaceae), offer insight into the early stages in the evolution of sex chromosomes from autosomes. Little is known about these early steps, but an accumulation of transposable element sequences has been observed on the Y-chromosomes of some species with heteromorphic sex chromosomes. Recombination, by which transposable elements are removed, is suppressed on at least part of the emerging Y-chromosome, and this may explain the correlation between the emergence of sex chromosomes and transposable element enrichment. We sequenced 2321 bp of the Y-chromosome in Bryonia dioica that flank a male-linked marker, BdY1, reported previously. Within this region, which should be suppressed for recombination, we observed a solo-LTR nested in a Copia-like transposable element. We also found other, presumably paralogous, solo-LTRs in a consensus sequence of the underlying Copia-like transposable element. Given that solo-LTRs arise via recombination events, it is noteworthy that we find one in a genomic region where recombination should be suppressed. Although the solo-LTR could have arisen before recombination was suppressed, creating the male-linked marker BdY1, our previous study on B. dioica suggested that BdY1 may not lie in the recombination-suppressed region of the Y-chromosome in all populations. Presence of a solo-LTR near BdY1 therefore fits with the observed correlation between retrotransposon accumulation and the suppression of recombination early in the evolution of sex chromosomes. These findings further suggest that the homomorphic sex chromosomes of B. dioica, the first organism for which genetic XY sex-determination was inferred, are evolutionarily young and offer reference information for comparative studies of other plant sex chromosomes.
Determinants of Genomic RNA Encapsidation in the Saccharomyces cerevisiae Long Terminal Repeat Retrotransposons Ty1 and Ty3

Directory of Open Access Journals (Sweden)

Katarzyna Pachulska-Wieczorek

2016-07-01

Full Text Available Long-terminal repeat (LTR retrotransposons are transposable genetic elements that replicate intracellularly, and can be considered progenitors of retroviruses. Ty1 and Ty3 are the most extensively characterized LTR retrotransposons whose RNA genomes provide the template for both protein translation and genomic RNA that is packaged into virus-like particles (VLPs and reverse transcribed. Genomic RNAs are not divided into separate pools of translated and packaged RNAs, therefore their trafficking and packaging into VLPs requires an equilibrium between competing events. In this review, we focus on Ty1 and Ty3 genomic RNA trafficking and packaging as essential steps of retrotransposon propagation. We summarize the existing knowledge on genomic RNA sequences and structures essential to these processes, the role of Gag proteins in repression of genomic RNA translation, delivery to VLP assembly sites, and encapsidation.
An Analysis Of Pole/zero Cancellation In LTR-based Feedback Design

DEFF Research Database (Denmark)

Niemann, Hans Henrik; Jannerup, Ole Erik

1990-01-01

The pole/zero cancellation in LTR-based feedback design will be analyzed for both full-order as well as minimal-order observers. The asymptotic behaviour of the sensitivity function from the LTR-procedure are given in explicit expressions in the case when a zero is not cancelled by an equivalent...... pole. It will be shown that the non-minimum phase case is included as a special case. The results are not based on any specific LTR-method....
Simple sequence repeat marker development and genetic mapping ...

Indian Academy of Sciences (India)

polymorphic SSR (simple sequence repeats) markers from libraries enriched for GA, CAA and AAT repeats, as well as 6 ... ers for quinoa was the development of a genetic linkage map ...... Weber J. L. 1990 Informativeness of human (dC-dA)n.
Rapid turnover of 2-LTR HIV-1 DNA during early stage of highly active antiretroviral therapy.

Directory of Open Access Journals (Sweden)

Weijun Zhu

Full Text Available BACKGROUND: Despite prolonged treatment with highly active antiretroviral therapy (HAART, the infectious HIV-1 continues to replicate and resides latently in the resting memory CD4+ T lymphocytes, which blocks the eradication of HIV-1. The viral persistence of HIV-1 is mainly caused by its proviral DNA being either linear nonintegrated, circular nonintegrated, or integrated. Previous reports have largely focused on the dynamics of HIV-1 DNA from the samples collected with relatively long time intervals during the process of disease and HAART treatment, which may have missed the intricate changes during the intervals in early treatment. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we investigated the dynamics of HIV-1 DNA in patients during the early phase of HARRT treatment. Using optimized real time PCR, we observed significant changes in 2-LTR during the first 12-week of treatment, while total and integrated HIV-1 DNA remained stable. The doubling time and half-life of 2-LTR were not correlated with the baseline and the rate of changes in plasma viral load and various CD4+ T-cell populations. Longitudinal analyses on 2-LTR sequences and plasma lipopolysaccharide (LPS levels did not reveal any significant changes in the same treatment period. CONCLUSIONS/SIGNIFICANCE: Our study revealed the rapid changes in 2-LTR concentration in a relatively large number of patients during the early HAART treatment. The rapid changes indicate the rapid infusion and clearance of cells bearing 2-LTR in the peripheral blood. Those changes are not expected to be caused by the blocking of viral integration, as our study did not include the integrase inhibitor raltegravir. Our study helps better understand the dynamics of HIV-DNA and its potential role as a biomarker for the diseases and for the treatment efficacy of HAART.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data.

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E; Greenwood, Alex D

2015-11-24

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E.; Greenwood, Alex D.

2015-01-01

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals. PMID:26610552
The dual action of poly(ADP-ribose polymerase -1 (PARP-1 inhibition in HIV-1 infection: HIV-1 LTR inhibition and diminution in Rho GTPase activity

Directory of Open Access Journals (Sweden)

Slava eRom

2015-08-01

Full Text Available The transcription of HIV-1 (HIV is regulated by complex mechanisms involving various cellular factors and virus-encoded transactivators. Poly(ADP-ribose polymerase 1 (PARP-1 inhibition has emerged recently as a potent anti-inflammatory tool, since PARP-1 is involved in the regulation of some genes through its interaction with various transcription factors. We propose a novel approach to diminish HIV replication via PARP-1 inhibition using human primary monocyte-derived macrophages (MDM as an in vitro model system. PARP-1 inhibitors were able to reduce HIV replication in MDM by 60-80% after 7 days infection. Long Terminal Repeat (LTR acts as a switch in virus replication and can be triggered by several agents such as: Tat, tumor necrosis factor α (TNFα, and phorbol 12-myristate 13-acetate (PMA. Overexpression of Tat in MDM transfected with an LTR reporter plasmid led to a 4.2-fold increase in LTR activation; PARP inhibition resulted in 70% reduction of LTR activity. LTR activity, which increased 3-fold after PMA or TNFα treatment, was reduced by PARP inhibition (by 85-95%. MDM treated with PARP inhibitors showed 90% reduction in NFκB activity (known to mediate PMA- and TNFα-induced HIV LTR activation. Cytoskeleton rearrangements are important in effective HIV-1 infection. PARP inactivation reduced actin cytoskeleton rearrangements by affecting Rho GTPase machinery. These findings suggest that HIV replication in MDM could be suppressed by PARP inhibition via NFκB suppression, diminution of LTR activation and its effects on the cytoskeleton. PARP appears to be essential for HIV replication and its inhibition may provide a potent approach to treatment of HIV infection.
Analysis of transposons and repeat composition of the sunflower (Helianthus annuus L.) genome.

Science.gov (United States)

Cavallini, Andrea; Natali, Lucia; Zuccolo, Andrea; Giordani, Tommaso; Jurman, Irena; Ferrillo, Veronica; Vitacolonna, Nicola; Sarri, Vania; Cattonaro, Federica; Ceccarelli, Marilena; Cionini, Pier Giorgio; Morgante, Michele

2010-02-01

A sample-sequencing strategy combined with slot-blot hybridization and FISH was used to study the composition of the repetitive component of the sunflower genome. One thousand six hundred thirty-eight sequences for a total of 954,517 bp were analyzed. The fraction of sequences that can be classified as repetitive using computational and hybridization approaches amounts to 62% in total. Almost two thirds remain as yet uncharacterized in nature. Of those characterized, most belong to the gypsy superfamily of LTR-retrotransposons. Unlike in other species, where single families can account for large fractions of the genome, it appears that no transposon family has been amplified to very high levels in sunflower. All other known classes of transposable elements were also found. One family of unknown nature (contig 61) was the most repeated in the sunflower genome. The evolution of the repetitive component in the Helianthus genus and in other Asteraceae was studied by comparative analysis of the hybridization of total genomic DNAs from these species to the sunflower small-insert library and compared to gene-based phylogeny. Very little similarity is observed between Helianthus species and two related Asteraceae species outside of the genus. Most repetitive elements are similar in annual and perennial Helianthus species indicating that sequence amplification largely predates such divergence. Gypsy-like elements are more represented in the annuals than in the perennials, while copia-like elements are similarly represented, attesting a different amplification history of the two superfamilies of LTR-retrotransposons in the Helianthus genus.
Intracellular high mobility group B1 protein (HMGB1) represses HIV-1 LTR-directed transcription in a promoter- and cell-specific manner

International Nuclear Information System (INIS)

Naghavi, Mojgan H.; Nowak, Piotr; Andersson, Jan; Soennerborg, Anders; Yang Huan; Tracey, Kevin J.; Vahlne, Anders

2003-01-01

We investigated whether the high mobility group B 1 (HMGB1), an abundant nuclear protein in all mammalian cells, affects HIV-1 transcription. Intracellular expression of human HMGB1 repressed HIV-1 gene expression in epithelial cells. This inhibitory effect of HMGB1 was caused by repression of long terminal repeat (LTR)-mediated transcription. Other viral promoters/enhancers, including simian virus 40 or cytomegalovirus, were not inhibited by HMGB1. In addition, HMGB1 inhibition of HIV-1 subtype C expression was dependent on the number of NFκB sites in the LTR region. The inhibitory effect of HMGB1 on viral gene expression observed in HeLa cells was confirmed by an upregulation of viral replication in the presence of antisense HMGB1 in monocytic cells. In contrast to what was found in HeLa cells and monocytic cells, endogenous HMGB1 expression did not affect HIV-1 replication in unstimulated Jurkat cells. Thus, intracellular HMGB1 affects HIV-1 LTR-directed transcription in a promoter- and cell-specific manner

Identification, variation and transcription of pneumococcal repeat sequences

Science.gov (United States)

2011-01-01

Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
An Evolutionarily Young Polar Bear (Ursus maritimus Endogenous Retrovirus Identified from Next Generation Sequence Data

Directory of Open Access Journals (Sweden)

Kyriakos Tsangaras

2015-11-01

Full Text Available Transcriptome analysis of polar bear (Ursus maritimus tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV. Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos and black bear (Ursus americanus but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
Alterations in HIV-1 LTR promoter activity during AIDS progression

International Nuclear Information System (INIS)

Hiebenthal-Millow, Kirsten; Greenough, Thomas C.; Bretttler, Doreen B.; Schindler, Michael; Wildum, Steffen; Sullivan, John L.; Kirchhoff, Frank

2003-01-01

HIV-1 variants evolving in AIDS patients frequently show increased replicative capacity compared to those present during early asymptomatic infection. It is known that late stage HIV-1 variants often show an expanded coreceptor tropism and altered Nef function. In the present study we investigated whether enhanced HIV-1 LTR promoter activity might also evolve during disease progression. Our results demonstrate increased LTR promoter activity after AIDS progression in 3 of 12 HIV-1-infected individuals studied. Further analysis revealed that multiple alterations in the U3 core-enhancer and in the transactivation-response (TAR) region seem to be responsible for the enhanced functional activity. Our findings show that in a subset of HIV-1-infected individuals enhanced LTR transcription contributes to the increased replicative potential of late stage virus isolates and might accelerate disease progression
[Using IRAP markers for analysis of genetic variability in populations of resource and rare species of plants].

Science.gov (United States)

Boronnikova, S V; Kalendar', R N

2010-01-01

Species-specific LTR retrotransposons were first cloned in five rare relic species of drug plants located in the Perm' region. Sequences of LTR retrotransposons were used for PCR analysis based on amplification of repeated sequences from LTR or other sites of retrotransposons (IRAP). Genetic diversity was studied in six populations of rare relic species of plants Adonis vernalis L. by means of the IRAP method; 125 polymorphic IRAP-markers were analyzed. Parameters for DNA polymorphism and genetic diversity of A. vernalis populations were determined.
Always look on both sides: phylogenetic information conveyed by simple sequence repeat allele sequences.

Directory of Open Access Journals (Sweden)

Stéphanie Barthe

Full Text Available Simple sequence repeat (SSR markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily, mutations in the target sequences follow the stepwise mutation model (SMM. Generally speaking, PCR amplicon sizes are used as direct indicators of the number of SSR repeats composing an allele with the data analysis either ignoring the extent of allele size differences or assuming that there is a direct correlation between differences in amplicon size and evolutionary distance. However, without precisely knowing the kind and distribution of polymorphism within an allele (SSR and the associated flanking region (FR sequences, it is hard to say what kind of evolutionary message is conveyed by such a synthetic descriptor of polymorphism as DNA amplicon size. In this study, we sequenced several SSR alleles in multiple populations of three divergent tree genera and disentangled the types of polymorphisms contained in each portion of the DNA amplicon containing an SSR. The patterns of diversity provided by amplicon size variation, SSR variation itself, insertions/deletions (indels, and single nucleotide polymorphisms (SNPs observed in the FRs were compared. Amplicon size variation largely reflected SSR repeat number. The amount of variation was as large in FRs as in the SSR itself. The former contributed significantly to the phylogenetic information and sometimes was the main source of differentiation among individuals and populations contained by FR and SSR regions of SSR markers. The presence of mutations occurring at different rates within a marker's sequence offers the opportunity to analyse evolutionary events occurring on various timescales, but at the same time calls for caution in the interpretation of SSR marker data when the distribution of within
Multineuronal Spike Sequences Repeat with Millisecond Precision

Directory of Open Access Journals (Sweden)

Koki eMatsumoto

2013-06-01

Full Text Available Cortical microcircuits are nonrandomly wired by neurons. As a natural consequence, spikes emitted by microcircuits are also nonrandomly patterned in time and space. One of the prominent spike organizations is a repetition of fixed patterns of spike series across multiple neurons. However, several questions remain unsolved, including how precisely spike sequences repeat, how the sequences are spatially organized, how many neurons participate in sequences, and how different sequences are functionally linked. To address these questions, we monitored spontaneous spikes of hippocampal CA3 neurons ex vivo using a high-speed functional multineuron calcium imaging technique that allowed us to monitor spikes with millisecond resolution and to record the location of spiking and nonspiking neurons. Multineuronal spike sequences were overrepresented in spontaneous activity compared to the statistical chance level. Approximately 75% of neurons participated in at least one sequence during our observation period. The participants were sparsely dispersed and did not show specific spatial organization. The number of sequences relative to the chance level decreased when larger time frames were used to detect sequences. Thus, sequences were precise at the millisecond level. Sequences often shared common spikes with other sequences; parts of sequences were subsequently relayed by following sequences, generating complex chains of multiple sequences.
Accurate episomal HIV 2-LTR circles quantification using optimized DNA isolation and droplet digital PCR.

Science.gov (United States)

Malatinkova, Eva; Kiselinova, Maja; Bonczkowski, Pawel; Trypsteen, Wim; Messiaen, Peter; Vermeire, Jolien; Verhasselt, Bruno; Vervisch, Karen; Vandekerckhove, Linos; De Spiegelaere, Ward

2014-01-01

In HIV-infected patients on combination antiretroviral therapy (cART), the detection of episomal HIV 2-LTR circles is a potential marker for ongoing viral replication. Quantification of 2-LTR circles is based on quantitative PCR or more recently on digital PCR assessment, but is hampered due to its low abundance. Sample pre-PCR processing is a critical step for 2-LTR circles quantification, which has not yet been sufficiently evaluated in patient derived samples. We compared two sample processing procedures to more accurately quantify 2-LTR circles using droplet digital PCR (ddPCR). Episomal HIV 2-LTR circles were either isolated by genomic DNA isolation or by a modified plasmid DNA isolation, to separate the small episomal circular DNA from chromosomal DNA. This was performed in a dilution series of HIV-infected cells and HIV-1 infected patient derived samples (n=59). Samples for the plasmid DNA isolation method were spiked with an internal control plasmid. Genomic DNA isolation enables robust 2-LTR circles quantification. However, in the lower ranges of detection, PCR inhibition caused by high genomic DNA load substantially limits the amount of sample input and this impacts sensitivity and accuracy. Moreover, total genomic DNA isolation resulted in a lower recovery of 2-LTR templates per isolate, further reducing its sensitivity. The modified plasmid DNA isolation with a spiked reference for normalization was more accurate in these low ranges compared to genomic DNA isolation. A linear correlation of both methods was observed in the dilution series (R2=0.974) and in the patient derived samples with 2-LTR numbers above 10 copies per million peripheral blood mononuclear cells (PBMCs), (R2=0.671). Furthermore, Bland-Altman analysis revealed an average agreement between the methods within the 27 samples in which 2-LTR circles were detectable with both methods (bias: 0.3875±1.2657 log10). 2-LTR circles quantification in HIV-infected patients proved to be more
Identification of genes in anonymous DNA sequences. Annual performance report, February 1, 1991--January 31, 1992

Energy Technology Data Exchange (ETDEWEB)

Fields, C.A.

1996-06-01

The objective of this project is the development of practical software to automate the identification of genes in anonymous DNA sequences from the human, and other higher eukaryotic genomes. A software system for automated sequence analysis, gm (gene modeler) has been designed, implemented, tested, and distributed to several dozen laboratories worldwide. A significantly faster, more robust, and more flexible version of this software, gm 2.0 has now been completed, and is being tested by operational use to analyze human cosmid sequence data. A range of efforts to further understand the features of eukaryoyic gene sequences are also underway. This progress report also contains papers coming out of the project including the following: gm: a Tool for Exploratory Analysis of DNA Sequence Data; The Human THE-LTR(O) and MstII Interspersed Repeats are subfamilies of a single widely distruted highly variable repeat family; Information contents and dinucleotide compostions of plant intron sequences vary with evolutionary origin; Splicing signals in Drosophila: intron size, information content, and consensus sequences; Integration of automated sequence analysis into mapping and sequencing projects; Software for the C. elegans genome project.
Development of simple sequence repeat (SSR) markers that are ...

African Journals Online (AJOL)

Simple sequence repeats (SSRs) markers were developed through data mining of 3,803 expressed sequence tags (ESTs) previously published. A total of 144 di- to penta-type SSRs were identified and they were screened for polymorphism between two turnip cultivars, 'Tsuda' and 'Yurugi Akamaru'. Out of 90 EST-SSRs for ...
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Directory of Open Access Journals (Sweden)

Charlotte Rehm

Full Text Available In prokaryotes simple sequence repeats (SSRs with unit sizes of 1-5 nucleotides (nt are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4 structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc, Xanthomonas axonopodis pv. citri str. 306 (Xac, and Nostoc sp. strain PCC7120 (Ana. In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Science.gov (United States)

Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

Science.gov (United States)

Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

2012-10-01

One application of next-generation sequencing (NGS) is the targeted resequencing of interested genes which has not been used in viral integration site analysis of gene therapy applications. Here, we combined targeted sequence capture array and next generation sequencing to address the whole genome profiling of viral integration sites. Human 293T and K562 cells were transduced with a HIV-1 derived vector. A custom made DNA probe sets targeted pLVTHM vector used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using GS FLX platform. Seven thousand four hundred and eighty four human genome sequences flanking the long terminal repeats (LTR) of pLVTHM fragment sequences matched with an identity of at least 98% and minimum 50 bp criteria in both cells. In total, 203 unique integration sites were identified. The integrations in both cell lines were totally distant from the CpG islands and from the transcription start sites and preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two different cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

Science.gov (United States)

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Simple sequence repeat marker loci discovery using SSR primer.

Science.gov (United States)

Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

2004-06-12

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Linking Maternal and Somatic 5S rRNA types with Different Sequence-Specific Non-LTR Retrotransposons

NARCIS (Netherlands)

Locati, M.D.; Pagano, J.F.B.; Ensink, W.A.; van Olst, M.; van Leeuwen, S.; Nehrdich, U.; Zhu, K.; Spaink, H.P.; Girard, G.; Rauwerda, H.; Jonker, M.J.; Dekker, R.J.; Breit, T.M.

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo and adult tissue,
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular
SeqEntropy: genome-wide assessment of repeats for short read sequencing.

Directory of Open Access Journals (Sweden)

Hsueh-Ting Chu

Full Text Available BACKGROUND: Recent studies on genome assembly from short-read sequencing data reported the limitation of this technology to reconstruct the entire genome even at very high depth coverage. We investigated the limitation from the perspective of information theory to evaluate the effect of repeats on short-read genome assembly using idealized (error-free reads at different lengths. METHODOLOGY/PRINCIPAL FINDINGS: We define a metric H(k to be the entropy of sequencing reads at a read length k and use the relative loss of entropy ΔH(k to measure the impact of repeats for the reconstruction of whole-genome from sequences of length k. In our experiments, we found that entropy loss correlates well with de-novo assembly coverage of a genome, and a score of ΔH(k>1% indicates a severe loss in genome reconstruction fidelity. The minimal read lengths to achieve ΔH(k<1% are different for various organisms and are independent of the genome size. For example, in order to meet the threshold of ΔH(k<1%, a read length of 60 bp is needed for the sequencing of human genome (3.2 10(9 bp and 320 bp for the sequencing of fruit fly (1.8×10(8 bp. We also calculated the ΔH(k scores for 2725 prokaryotic chromosomes and plasmids at several read lengths. Our results indicate that the levels of repeats in different genomes are diverse and the entropy of sequencing reads provides a measurement for the repeat structures. CONCLUSIONS/SIGNIFICANCE: The proposed entropy-based measurement, which can be calculated in seconds to minutes in most cases, provides a rapid quantitative evaluation on the limitation of idealized short-read genome sequencing. Moreover, the calculation can be parallelized to scale up to large euakryotic genomes. This approach may be useful to tune the sequencing parameters to achieve better genome assemblies when a closely related genome is already available.
simple sequence repeat (SSR) markers in genetic analysis of

African Journals Online (AJOL)

Yomi

2012-08-28

1998). Cross- species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol. Biol. Evol. 15:1275-1287.
Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

Science.gov (United States)

Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

2017-04-01

5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

Comparative effectiveness of inter-simple sequence repeat and ...

African Journals Online (AJOL)

A study to compare the effectiveness of inter-simple sequence repeats (ISSR) and randomly amplified polymorphic DNA (RAPD) profiling was carried out with a total of 65 DNA samples using 12 species of Indian Garcinia. ISSR and RAPD profiling were performed with 19 and 12 primers, respectively. ISSR markers ...
Diversity, distribution and dynamics of full-length Copia and Gypsy LTR retroelements in Solanum lycopersicum.

Science.gov (United States)

Paz, Rosalía Cristina; Kozaczek, Melisa Eliana; Rosli, Hernán Guillermo; Andino, Natalia Pilar; Sanchez-Puerta, Maria Virginia

2017-10-01

Transposable elements are the most abundant components of plant genomes and can dramatically induce genetic changes and impact genome evolution. In the recently sequenced genome of tomato (Solanum lycopersicum), the estimated fraction of elements corresponding to retrotransposons is nearly 62%. Given that tomato is one of the most important vegetable crop cultivated and consumed worldwide, understanding retrotransposon dynamics can provide insight into its evolution and domestication processes. In this study, we performed a genome-wide in silico search of full-length LTR retroelements in the tomato nuclear genome and annotated 736 full-length Gypsy and Copia retroelements. The dispersion level across the 12 chromosomes, the diversity and tissue-specific expression of those elements were estimated. Phylogenetic analysis based on the retrotranscriptase region revealed the presence of 12 major lineages of LTR retroelements in the tomato genome. We identified 97 families, of which 77 and 20 belong to the superfamilies Copia and Gypsy, respectively. Each retroelement family was characterized according to their element size, relative frequencies and insertion time. These analyses represent a valuable resource for comparative genomics within the Solanaceae, transposon-tagging and for the design of cultivar-specific molecular markers in tomato.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.

Science.gov (United States)

Anwar, Tamanna; Khan, Asad U

2006-02-20

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
Simple sequence repeat (SSR)-based genetic variability among ...

African Journals Online (AJOL)

The objective of this study was to compare if simple sequence repeat (SSR) markers could correctly identify peanut genotypes with difference in specific leaf weight (SLW) and relative water content (RWC). Four peanut genotypes and two water regimes (FC and 1/3 available water; 1/3 AW) were arranged in factorial ...
Potentials and limitations of histone repeat sequences for phylogenetic reconstruction of Sophophora.

Science.gov (United States)

Baldo, A M; Les, D H; Strausbaugh, L D

1999-11-01

Simplified DNA sequence acquisition has provided many new data sets that are useful for phylogenetic reconstruction, including single- and multiple-copy nuclear and organellar genes. Although transcribed regions receive much attention, nontranscribed regions have recently been added to the repertoire of sequences suitable for phylogenetic studies, especially for closely related taxa. We evaluated the efficacy of a small portion of the histone repeat for phylogenetic reconstruction among Drosophila species. Histone repeats in invertebrates offer distinct advantages similar to those of widely used ribosomal repeats. First, the units are tandemly repeated and undergo concerted evolution. Second, histone repeats include both highly conserved coding and variable intergenic regions. This composition facilitates application of "universal" primers spanning potentially informative sites. We examined a small region of the histone repeat, including the intergenic spacer segments of coding regions from the divergently transcribed H2A and H2B histone genes. The spacer (about 230 bp) exists as a mosaic with highly conserved functional motifs interspersed with rapidly diverging regions; the former aid in alignment of the spacer. There are no ambiguities in alignment of coding regions. Coding and noncoding regions were analyzed together and separately for phylogenetic information. Parsimony, distance, and maximum-likelihood methods successfully retrieve the corroborated phylogeny for the taxa examined. This study demonstrates the resolving power of a small histone region which may now be added to the growing collection of phylogenetically useful DNA sequences.
PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes.

Science.gov (United States)

Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A

2011-01-01

PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variations have been well-studied in some bacteria there seems a lot of other species of prokaryotes yet to be investigated for SSR mediated adaptive and other evolutionary advantages. As a part of our on-going studies on SSR polymorphism in prokaryotes we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes and extracted a number of SSRs showing length variations and created a relational database called PSSRdb. This database gives useful information such as location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
Full Length Research Paper LTR-retrotransposons-based molecular ...

African Journals Online (AJOL)

LTR-retrotransposons possess unique properties that make them appropriate for investigating relationships between closely related species and populations. The aim of the current study was to employ Ty1-copia group retrotransposons as molecular markers in cultivated Egyptian cottons, G. barbadense L. Restriction site ...
Effects of integration and replication on transcription of the HIV-1 long terminal repeat

NARCIS (Netherlands)

Jeang, K. T.; Berkhout, B.; Dropulic, B.

1993-01-01

The activity of a promoter is influenced by chromosomal and cell cycle/replication context. We analyzed the influences of integration and replication on transcription of the human immunodeficiency virus (HIV)-1 long terminal repeat (LTR). We found that one requirement for Tat trans-activated
Eliminating HIV-1 Packaging Sequences from Lentiviral Vector Proviruses Enhances Safety and Expedites Gene Transfer for Gene Therapy.

Science.gov (United States)

Vink, Conrad A; Counsell, John R; Perocheau, Dany P; Karda, Rajvinder; Buckley, Suzanne M K; Brugman, Martijn H; Galla, Melanie; Schambach, Axel; McKay, Tristan R; Waddington, Simon N; Howe, Steven J

2017-08-02

Lentiviral vector genomic RNA requires sequences that partially overlap wild-type HIV-1 gag and env genes for packaging into vector particles. These HIV-1 packaging sequences constitute 19.6% of the wild-type HIV-1 genome and contain functional cis elements that potentially compromise clinical safety. Here, we describe the development of a novel lentiviral vector (LTR1) with a unique genomic structure designed to prevent transfer of HIV-1 packaging sequences to patient cells, thus reducing the total HIV-1 content to just 4.8% of the wild-type genome. This has been achieved by reconfiguring the vector to mediate reverse-transcription with a single strand transfer, instead of the usual two, and in which HIV-1 packaging sequences are not copied. We show that LTR1 vectors offer improved safety in their resistance to remobilization in HIV-1 particles and reduced frequency of splicing into human genes. Following intravenous luciferase vector administration to neonatal mice, LTR1 sustained a higher level of liver transgene expression than an equivalent dose of a standard lentivirus. LTR1 vectors produce reverse-transcription products earlier and start to express transgenes significantly quicker than standard lentiviruses after transduction. Finally, we show that LTR1 is an effective lentiviral gene therapy vector as demonstrated by correction of a mouse hemophilia B model. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
APE1 incision activity at abasic sites in tandem repeat sequences.

Science.gov (United States)

Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

2014-05-29

Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

Energy Technology Data Exchange (ETDEWEB)

Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

1995-01-20

The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination ({theta} = 0.02, Z{sub max} = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

Science.gov (United States)

Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

2015-05-01

Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
Two cis-acting elements responsible for posttranscriptional trans-regulation of gene expression of human T-cell leukemia virus type I

International Nuclear Information System (INIS)

Seiki, Motoharu; Inoue, Junichiro; Hidaka, Makoto; Yoshida, Mitsuaki

1988-01-01

The pX sequence of human T-cell leukemia virus type I codes for two nuclear proteins, p40 tax and p27 rex and a cytoplasmic protein, p21 X-III . p40 tax activates transcription from the long terminal repeat (LTR), whereas p27 rex modulates posttranscriptional processing to accumulate gag and env mRNAs that retain intron sequences. In this paper, the authors identify two cis-acting sequence elements needed for regulation by p27 rex : a 5' splice signal and a specific sequence in the 3' LTR. These two sequence elements are sufficient for regulation by p27 rex ; expression of a cellular gene (metallothionein I) became sensitive to rex regulation when the LTR was inserted at the 3' end of this gene. The requirement for these two elements suggests and unusual regulatory mechanism of RNA processing in the nucleus
Long Terminal Repeat Circular DNA as Markers of Active Viral Replication of Human T Lymphotropic Virus-1 in Vivo

Directory of Open Access Journals (Sweden)

James M Fox

2016-03-01

Full Text Available Clonal expansion of human T-lymphotropic virus type-1 (HTLV-1 infected cells in vivo is well documented. Unlike human immunodeficiency virus type 1 (HIV-1, HTLV-1 plasma RNA is sparse. The contribution of the “mitotic” spread of HTLV-1 compared with infectious spread of the virus to HTLV-1 viral burden in established infection is uncertain. Since extrachromosomal long terminal repeat (LTR DNA circles are indicators of viral replication in HIV-1 carriers with undetectable plasma HIV RNA, we hypothesised that HTLV-1 LTR circles could indicate reverse transcriptase (RT usage and infectious activity. 1LTR and 2LTR DNA circles were measured in HTLV-1 cell lines and peripheral blood mononuclear cells (PBMC of asymptomatic carriers (ACs and patients with HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP or adult T cell leukaemia/lymphoma (ATLL. 1LTR DNA circles were detected in 14/20 patients at a mean of 1.38/100 PBMC but did not differentiate disease status nor correlate with HTLV-1 DNA copies. 2LTR DNA circles were detected in 30/31 patients and at higher concentrations in patients with HTLV-1-associated diseases, independent of HTLV-1 DNA load. In an incident case the 2LTR DNA circle concentration increased 2.1 fold at the onset of HAM/TSP compared to baseline. Detectable and fluctuating levels of HTLV-1 DNA circles in patients indicate viral RT usage and virus replication. Our results indicate HTLV-1 viral replication capacity is maintained in chronic infection and may be associated with disease onset.
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

Directory of Open Access Journals (Sweden)

Glass John I

2010-07-01

Full Text Available Abstract Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT. Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.

2010-07-12

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
Read length and repeat resolution: exploring prokaryote genomes using next-generation sequencing technologies.

Directory of Open Access Journals (Sweden)

Matt J Cahill

Full Text Available BACKGROUND: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. METHODOLOGY/PRINCIPAL FINDINGS: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. CONCLUSIONS: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length.
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.; Kö ser, Claudio U.; Ross, Nicholas E.; Archer, John A.C.

2010-01-01

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
Transcriptional and Bioinformatic Analysis Provide a Relationship between Host Response Changes to Marek’s Disease Viruses Infection and an Integrated Long Terminal Repeat

Directory of Open Access Journals (Sweden)

Ning eCui

2016-04-01

Full Text Available GX0101, Marek’s disease virus (MDV strain with a long terminal repeat (LTR insert of reticuloendotheliosis virus (REV, was isolated from CVI988/Rispens vaccinated birds showing tumors. We have constructed a LTR deleted strain GX0101∆LTR in our previous study. To compare the host responses to GX0101 and GX0101∆LTR, chicken embryo fibroblasts (CEF cells were infected with two MDV strains and a gene-chip containing chicken genome was employed to examine gene transcription changes in host cells in the present study. Of the 42 368 chicken transcripts on the chip, there were 2199 genes that differentially expressed in CEF infected with GX0101 compared to GX0101∆LTR significantly. Differentially expressed genes were distributed to 25 possible gene networks according to their intermolecular connections and were annotated to 56 pathways. The insertion of REV LTR showed the greatest influence on cancer formation and metastasis, followed with immune changes, atherosclerosis and nervous system disorders in MDV-infected CEF cells. Based on these bio functions, GX0101 infection was predicated with a greater growth and survival inhibition but lower oncogenicity in chickens than GX0101∆LTR, at least in the acute phase of infection. In summary, the insertion of REV LTR altered the expression of host genes in response to MDV infection, possibly resulting in novel phenotypic properties in chickens. Our study has provided the evidence of retroviral insertional changes of host responses to herpesvirus infection for the first time, which will promote to elucidation of the possible relationship between the LTR insertion and the observed phenotypes.
Tandemly repeated sequence in 5'end of mtDNA control region of ...

African Journals Online (AJOL)

Extensive length variability was observed in 5' end sequence of the mitochondrial DNA control region of the Japanese Spanish mackerel (Scomberomorus niphonius). This length variability was due to the presence of varying numbers of a 56-bp tandemly repeated sequence and a 46-bp insertion/deletion (indel).

Repeat Sequence Proteins as Matrices for Nanocomposites

Energy Technology Data Exchange (ETDEWEB)

Drummy, L.; Koerner, H; Phillips, D; McAuliffe, J; Kumar, M; Farmer, B; Vaia, R; Naik, R

2009-01-01

Recombinant protein-inorganic nanocomposites comprised of exfoliated Na+ montmorillonite (MMT) in a recombinant protein matrix based on silk-like and elastin-like amino acid motifs (silk elastin-like protein (SELP)) were formed via a solution blending process. Charged residues along the protein backbone are shown to dominate long-range interactions, whereas the SELP repeat sequence leads to local protein/MMT compatibility. Up to a 50% increase in room temperature modulus and a comparable decrease in high temperature coefficient of thermal expansion occur for cast films containing 2-10 wt.% MMT.
Marcadores virológicos no convencionales en pacientes infectados con el virus de la inmunodeficiencia humana: ADN HIV-T, ADN HIV- 2LTR y ARN de HIV Non conventional virological markers in HIV-infected patients: T-HIV DNA, 2LTR-HIV DNA and HIV RNA

Directory of Open Access Journals (Sweden)

Rosana Gariglio

2004-10-01

study, we analyzed the presence of total HIV DNA (T-HIV DNA, non-integrated DNA with 2LTR (2LTR-HIV DNA and HIV RNA in a group of 55 HIV-positive subjects from Rosario City, with different clinical stages, with and without HAART. All markers were evaluated by PCR assays optimized in our laboratory that included colorimetric detection in microplate. HIV RNA clinical sensitivity was compared with a reference test, bDNA, resulting in 74% and 64% respectively, with an 85% of agreement. Thus, our HIV RNA assay could be used to monitor patients under HAART and at risk of infection. The 2LTR-HIV DNA was 54% positive although it was absent in patients with high VL. This marker was considered a labile product therefore its presence was associated with recent infection. However, current evidences question its stability. Thus, its clinical significance should be reconsidered. The absence of 2LTR-HIV DNA in patients with detectable VL may relate to the heterogeneity of the sequence used for its detection. T-HIV DNA was present in 100% of the samples and could be a relevant remission marker when therapies that effectively eradicate the infection became available.
Nucleotide sequence analysis of HTLV-I isolated from cerebrospinal fluid of a patient with TSP/HAM: comparison to other HTLV-I isolates.

Science.gov (United States)

Mukhopadhyaya, R; Sadaie, M R

1993-02-01

Human T-cell leukemia virus type I (HTLV-I) has been associated with adult T-cell leukemia/lymphoma and the chronic neurologic disorder tropical spastic paraparesis/HTLV-I-associated myelopathy (TSP/HAM). To study the genetic structure of the virus associated with TSP/HAM, we have obtained and sequenced a partial genomic clone from an HTLV-I-positive cell line established from cerebrospinal fluid (CSF) of a Jamaican patient with TSP/HAM. This clone consisted of a 4.3-kb viral sequence containing the 5' long terminal repeat (LTR), gag, and N-terminal portion of the pol gene, with an overall 1.3% sequence variation resulting from mostly nucleotide substitutions, as compared to the prototype HTLV-I ATK-1. The gag and pol regions showed only 1.4% and 1.2% nucleotide variations, respectively. However, the U3 region of the LTR showed the highest sequence variation (3.6%), where several changes appear to be common among certain TSP/HAM isolates. Several of these changes reside within the 21-bp boundaries and the Tax-responsive element. It would be important to determine if the observed changes are sufficient to cause neurologic disorders similar to the murine leukemia virus system or simply reflect the divergent pool of HTLV-I from different geographic locations. At this time, we cannot rule out the possibility that the observed changes have either direct or indirect significance for the HTLV-I pathogenesis in TSP/HAM.
Roles of genes and Alu repeats in nonlinear correlations of HUMHBB DNA sequence

International Nuclear Information System (INIS)

Xiao Yi; Huang Yanzhao

2004-01-01

DNA sequences of different species and different portion of the DNA of the same species may have completely different correlation properties, but the origin of these correlations is still not very clear and is currently being investigated, especially in different particular cases. We report here a study of the DNA sequence of human beta globin region (HUMHBB) which has strong linear and nonlinear correlations. We studied the roles of two of the typical elements of DNA sequence, genes and Alu repeats, in the nonlinear correlations of HUMHBB. We find that there exist strong nonlinear correlations between the exons or introns in different genes and between the Alu repeats. They may be one of the major sources of the nonlinear correlations in HUMBHB
Induction of transcription from the long terminal repeat of Moloney murine sarcoma provirus by UV-irradiation, x-irradiation, and phorbol ester

International Nuclear Information System (INIS)

Lin, C.S.; Goldthwait, D.A.; Samols, D.

1990-01-01

The long terminal repeat (LTR) of Moloney murine sarcoma virus (Mo-MuSV) was used as a model system to study the stress response of mammalian cells to physical carcinogens. The chloramphenicol acetyltransferase (CAT) gene was inserted between two Mo-MuSV LTRs, and the LTR-CAT-LTR construct was used for virus production and was integrated into the genome of NIH 3T3 cells in the proviral form. This construct was used to assure that the integrated CAT gene was driven by the promoter of the LTR. Expression of the CAT gene was stimulated 4-fold by UV irradiation, and the peak of activity was observed at 18 hr. In contrast, stimulation of the CAT expression after x-irradiation was 2-fold and occurred at 6 hr. Phorbol myristate acetate also stimulated CAT activity 4-fold with a peak at 6 hr. Down-regulation of protein kinase C blocked totally the response to x-irradiation but only partially the response to UV. The protein kinase inhibitor H7 blocked the response to treatment by UV, x-ray, and phorbol ester
Influence of cAMP on reporter bioassays for dioxin and dioxin-like compounds

International Nuclear Information System (INIS)

Kasai, Ayumi; Yao, Jian; Yamauchi, Kozue; Hiramatsu, Nobuhiko; Hayakawa, Kunihiro; Meng, Yiman; Maeda, Shuichiro; Kitamura, Masanori

2006-01-01

In reporter assays for detection of dioxins, the dioxin-responsive element (DRE) is generally used as a sensor sequence. In several systems, the CYP1A1 promoter containing DREs (DRE cyp ) is inserted into a part of the long terminal repeat of mouse mammary tumor virus (LTR MMTV ) to improve sensitivity of assays. We found that DRE cyp -LTR MMTV responds not only to dioxins and dioxin-like compounds but also to forskolin, a cAMP-elevating agent. This effect was dose-dependent and reproduced by other cAMP-elevating agents including 8-bromo-cAMP and 3-isobutyl-methylxanthine. The cAMP response element (CRE) and CRE-like sequences were absent in DRE cyp -LTR MMTV and not involved in this process. In contrast to the effect of dioxin, the activation of DRE cyp -LTR MMTV by cAMP was independent of the aryl hydrocarbon receptor (AhR), a ligand-dependent transcription factor for DRE. Furthermore, neither DRE cyp , LTR MMTV nor the consensus sequence of DRE alone was activated in response to cAMP. These data elucidated for the first time that the combination of DRE cyp with LTR MMTV causes a peculiar response to cAMP and suggested that use of AhR antagonists is essential to exclude false-positive responses of DRE cyp -LTR MMTV -based bioassays for detection and quantification of dioxins and dioxin-like compounds
Repetitive DNA in the pea (Pisum sativum L. genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

Directory of Open Access Journals (Sweden)

Navrátilová Alice

2007-11-01

Full Text Available Abstract Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum. Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data
GABBR1 has a HERV-W LTR in its regulatory region – a possible implication for schizophrenia

Directory of Open Access Journals (Sweden)

Hegyi Hedi

2013-02-01

Full Text Available Abstract Schizophrenia is a complex disease with uncertain aetiology. We suggest GABBR1, GABA receptor B1 implicated in schizophrenia based on a HERV-W LTR in the regulatory region of GABBR1. Our hypothesis is supported by: (i GABBR1 is in the 6p22 genomic region most often implicated in schizophrenia; (ii microarray studies found that only presynaptic pathway-related genes, including GABA receptors, have altered expression in schizophrenic patients and (iii it explains how HERV-W elements, expressed in schizophrenia, play a role in the disease: by altering the expression of GABBR1 via a long terminal repeat that is also a regulatory element to GABBR1. Reviewers This paper was reviewed by Sandor Pongor and Martijn Huynen.
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

Science.gov (United States)

Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

2016-01-01

A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

Science.gov (United States)

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

Science.gov (United States)

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Tandemly repeated sequence in 5'end of mtDNA control region of ...

African Journals Online (AJOL)

STORAGESEVER

2008-12-17

Dec 17, 2008 ... chain reaction (PCR). Japanese Spanish ... mainly covered general ecology and fishery biology. No study concerning the ... Conserved sequence blocks and the repeat units are indicated by boxes. performed using the exact ...
Inverted repeats in the promoter as an autoregulatory sequence for TcrX in Mycobacterium tuberculosis

International Nuclear Information System (INIS)

Bhattacharya, Monolekha; Das, Amit Kumar

2011-01-01

Highlights: ► The regulatory sequences recognized by TcrX have been identified. ► The regulatory region comprises of inverted repeats segregated by 30 bp region. ► The mode of binding of TcrX with regulatory sequence is unique. ► In silico TcrX–DNA docked model binds one of the inverted repeats. ► Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has not been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by ∼30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.
Identification of an osteoclast transcription factor that binds to the human T cell leukemia virus type I-long terminal repeat enhancer element.

Science.gov (United States)

Inoue, D; Santiago, P; Horne, W C; Baron, R

1997-10-03

Transgenic mice expressing human T cell leukemia virus type I (HTLV-I)-tax under the control of HTLV-I-long terminal repeat (LTR) promoter develop skeletal abnormalities with high bone turnover and myelofibrosis. In these animals, Tax is highly expressed in bone with a pattern of expression restricted to osteoclasts and spindle-shaped cells within the endosteal myelofibrosis. To test the hypothesis that lineage-specific transcription factors promote transgene expression from the HTLV-I-LTR in osteoclasts, we first examined tax expression in transgenic bone marrow cultures. Expression was dependent on 1alpha,25-dihydroxycholecalciferol and coincided with tartrate-resistant acid phosphatase (TRAP) expression, a marker of osteoclast differentiation. Furthermore, Tax was expressed in vitronectin receptor-positive mononuclear precursors as well as in mature osteoclast-like cells (OCLs). Consistent with our hypothesis, electrophoretic mobility shift assays revealed the presence of an OCL nuclear factor (NFOC-1) that binds to the LTR 21-base pair direct repeat, a region critical for the promoter activity. This binding is further enhanced by Tax. Since NFOC-1 is absent in macrophages and conserved in osteoclasts among species including human, such a factor may play a role in lineage determination and/or in expression of the differentiated osteoclast phenotype.
TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

Science.gov (United States)

Richard, François D; Kajava, Andrey V

2014-06-01

The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.
Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence

NARCIS (Netherlands)

Semenova, E.V.; Jore, M.M.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

2011-01-01

Prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR)/Cas (CRISPR-associated sequences) systems provide adaptive immunity against viruses when a spacer sequence of small CRISPR RNA (crRNA) matches a protospacer sequence in the viral genome. Viruses that escape CRISPR/Cas
MSDB: A Comprehensive Database of Simple Sequence Repeats.

Science.gov (United States)

Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar

2017-06-01

Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Simple sequence repeat (SSR) markers are effective for identifying ...

African Journals Online (AJOL)

DNA was extracted from newly formed leaves and amplified using 21 simple sequence repeat (SSR) markers (NH001c, NH002b, NH005b, NH007b, NH008b, NH009b, NH011b, NH013b, NH012a, NH014a, NH015a, NH017a, KA4b, KA5, KA14, KA16, KB16, KU10, BGA35, BGT23b and HGA8b). The data was analyzed by ...
Tat-dependent repression of human immunodeficiency virus type 1 long terminal repeat promoter activity by fusion of cellular transcription factors

International Nuclear Information System (INIS)

Zhao Cunyou; Chen Yali; Park, Jiyoung; Kim, Jae Bum; Tang Hong

2004-01-01

Transcription initiation from HIV-1 long terminal repeat (LTR) promoter requires the virally encoded transactivator, Tat, and several cellular co-factors to accomplish the Tat-dependent processive transcription elongation. Individual cellular transcription activators, LBP-1b and Oct-1, on the other hand, have been shown to inhibit LTR promoter activities probably via competitive binding against TFIID to the TATA-box in LTR promoter. To explore the genetic interference strategies against the viral replication, we took advantage of the existence of the bipartite DNA binding domains and the repression domains of LBP-1b and Oct-1 factors to generate a chimeric transcription repressor. Our results indicated that the fusion protein of LBP-1b and Oct-1 exhibited higher DNA binding affinity to the viral promoter than the individual factors, and little interference with the host cell gene expression due to its anticipated rare cognate DNA sites in the host cell genome. Moreover, the chimera exerted increased Tat-dependent repression of transcription initiation at the LTR promoter both in vitro and in vivo compared to LBP-1b, Oct-1 or combination of LBP-1b and Oct-1. These results might provide the lead in generating a therapeutic reagent useful to suppress HIV-1 replication
Repeated-Sprint Sequences During Female Soccer Matches Using Fixed and Individual Speed Thresholds.

Science.gov (United States)

Nakamura, Fábio Y; Pereira, Lucas A; Loturco, Irineu; Rosseti, Marcelo; Moura, Felipe A; Bradley, Paul S

2017-07-01

Nakamura, FY, Pereira, LA, Loturco, I, Rosseti, M, Moura, FA, and Bradley, PS. Repeated-sprint sequences during female soccer matches using fixed and individual speed thresholds. J Strength Cond Res 31(7): 1802-1810, 2017-The main objective of this study was to characterize the occurrence of single sprint and repeated-sprint sequences (RSS) during elite female soccer matches, using fixed (20 km·h) and individually based speed thresholds (>90% of the mean speed from a 20-m sprint test). Eleven elite female soccer players from the same team participated in the study. All players performed a 20-m linear sprint test, and were assessed in up to 10 official matches using Global Positioning System technology. Magnitude-based inferences were used to test for meaningful differences. Results revealed that irrespective of adopting fixed or individual speed thresholds, female players produced only a few RSS during matches (2.3 ± 2.4 sequences using the fixed threshold and 3.3 ± 3.0 sequences using the individually based threshold), with most sequences composing of just 2 sprints. Additionally, central defenders performed fewer sprints (10.2 ± 4.1) than other positions (fullbacks: 28.1 ± 5.5; midfielders: 21.9 ± 10.5; forwards: 31.9 ± 11.1; with the differences being likely to almost certainly associated with effect sizes ranging from 1.65 to 2.72), and sprinting ability declined in the second half. The data do not support the notion that RSS occurs frequently during soccer matches in female players, irrespective of using fixed or individual speed thresholds to define sprint occurrence. However, repeated-sprint ability development cannot be ruled out from soccer training programs because of its association with match-related performance.

Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

Directory of Open Access Journals (Sweden)

Natalie L. Dillon

2014-01-01

Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.
Genomic organization and developmental fate of adjacent repeated sequences in a foldback DNA clone of Tetrahymena thermophila

International Nuclear Information System (INIS)

Tschunko, A.H.; Loechel, R.H.; McLaren, N.C.; Allen, S.L.

1987-01-01

DNA sequence elimination and rearrangement occurs during the development of somatic cell lineages of eukaryotes and was first discovered over a century ago. However, the significance and mechanism of chromatin elimination are not understood. DNA elimination also occurs during the development of the somatic macronucleus from the germinal micronucleus in unicellular ciliated protozoa such as Tetrahymena thermophila. In this study foldback DNA from the micronucleus was used as a probe to isolate ten clones. All of those tested (4/4) contained sequences that were repetitive in the micronucleus and rearranged in the macronucleus. Inverted repeated sequences were present in one clone. This clone, pTtFBl, was subjected to a detailed analysis of its developmental fate. Subregions were subcloned and used as probes against Southern blots of micronuclear and macronuclear DNA. DNA was labeled with [ 33 P]-labeled dATP. The authors found that all subregions defined repeated sequence families in the micronuclear genome. A minimum of four different families was defined, two of which are retained in the macronucleus and two of which are completely eliminated. The inverted repeat family is retained with little rearrangement. Two of the families, defined by subregions that do not contain parts of the inverted repeat are totally eliminated during macronuclear development-and contain open reading frames. The significance of retained inverted repeats to the process of elimination is discussed
Characterization of Equine Infectious Anemia Virus Long Terminal Repeat Quasispecies In Vitro and In Vivo.

Science.gov (United States)

Wang, Xue-Feng; Liu, Qiang; Wang, Yu-Hong; Wang, Shuai; Chen, Jie; Lin, Yue-Zhi; Ma, Jian; Zhou, Jian-Hua; Wang, Xiaojun

2018-04-15

The equine infectious anemia virus (EIAV) attenuated vaccine was developed by long-term passaging of a field-isolated virulent strain in cross-species hosts, followed by successive cultivation in cells in vitro To explore the molecular mechanism underlying the evolution of the EIAV attenuated vaccine, a systematic study focusing on long-terminal-repeat (LTR) variation in numerous virus strains ranging from virulent EIAV to attenuated EIAV was performed over time both in vitro and in vivo Two hypervariable regions were identified within the U3 region in the enhancer region (EHR) and the negative regulatory element (NRE) and within the R region in the transcription start site (TSS) and the Tat-activating region (TAR). Among these sites, variation in the U3 region resulted in the formation of additional transcription factor binding sites; this variation of the in vitro -adapted strains was consistent with the loss of pathogenicity. Notably, the same LTR variation pattern was observed both in vitro and in vivo Generally, the LTR variation in both the attenuated virus and the virulent strain fluctuated over time in vivo Interestingly, the attenuated-virus-specific LTR variation was also detected in horses infected with the virulent strain, supporting the hypothesis that the evolution of an attenuated virus might have involved branching from EIAV quasispecies. This hypothesis was verified by phylogenetic analysis. The present systematic study examining the molecular evolution of attenuated EIAV from EIAV quasispecies may provide an informative model reflecting the evolution of similar lentiviruses. IMPORTANCE The attenuated EIAV vaccine was the first lentiviral vaccine used to successfully control for equine infectious anemia in China. This vaccine provides an important reference for studying the relationship between EIAV gene variation and changes in biological characteristics. Importantly, the vaccine provides a model for the investigation of lentiviral quasispecies
Fulltext PDF

Indian Academy of Sciences (India)

2012-01-24

Jan 24, 2012 ... Based on this, we constructed periodic consensus sequences of these three kinds of ... T is the period of P. To determine the value of the parameter par, we ..... Mager DL 2007 Repeated recruitment of LTR retrotransposons.
RePS: a sequence assembler that masks exact repeats identified from the shotgun data

DEFF Research Database (Denmark)

Wang, Jun; Wong, Gane Ka-Shu; Ni, Peixiang

2002-01-01

We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software is used to compute meaningful error probabilities for each base. Clone......-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4x to 6x, despite having up to 42.2% in exact repeats. Udgivelsesdato: 2002-May...
Unexpected Modulation of Recall B and T Cell Responses after Immunization with Rotavirus-like Particles in the Presence of LT-R192G

Directory of Open Access Journals (Sweden)

Christelle Basset

2010-08-01

Full Text Available LT-R192G, a mutant of the thermolabile enterotoxin of E. coli, is a potent adjuvant of immunization. Immune responses are generally analyzed at the end of protocols including at least 2 administrations, but rarely after a prime. To investigate this point, we compared B and T cell responses in mice after one and two intrarectal immunizations with 2/6 rotavirus-like particles (2/6-VLP and LT-R192G. After a boost, we found, an unexpected lower B cell expansion measured by flow cytometry, despite a secondary antibody response. We then analyzed CD4+CD25+Foxp3+ regulatory T cells (Tregs and CD4+CD25+Foxp3− helper T cells after in vitro (restimulation of mesenteric lymph node cells with the antigen (2/6-VLP, the adjuvant (LT-R192G or both. 2/6-VLP did not activate CD4+CD25+Foxp3− nor Foxp3+ T cells from non-immunized and 2/6-VLP immunized mice, whereas they did activate both subsets from mice immunized with 2/6-VLP in the presence of adjuvant. LT-R192G dramatically decreased CD4+CD25+Foxp3+ T cells from non-immunized and 2/6-VLP immunized mice but not from mice immunized with 2/6-VLP and adjuvant. Moreover, in this case, LT-R192G increased Foxp3 expression on CD4+CD25+Foxp3+ cells, suggesting specific Treg activation during the recall. Finally, when both 2/6-VLP and LT-R192G were used for restimulation, LT-R192G clearly suppressed both 2/6-VLP-specific CD4+CD25+Foxp3− and Foxp3+ T cells. All together, these results suggest that LT-R192G exerts different effects on CD4+CD25+Foxp3+ T cells, depending on a first or a second contact. The unexpected immunomodulation observed during the recall should be considered in designing vaccination protocols.
Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats

OpenAIRE

Gymrek, Melissa

2016-01-01

This was presented as a BitesizeBio Webinar entitled "Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats"Accompanying scripts can be accessed on github:https://github.com/mgymrek/mgymrek-bitesizebio-webinar
In silico analysis of Simple Sequence Repeats from chloroplast genomes of Solanaceae species

Directory of Open Access Journals (Sweden)

Evandro Vagner Tambarussi

2009-01-01

Full Text Available The availability of chloroplast genome (cpDNA sequences of Atropa belladonna, Nicotiana sylvestris, N.tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species,allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs incpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than geniccpSSRs. The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, pentaandhexanucleotide repeats. Multiple alignments of all cpSSRs sequences from Solanaceae species made the identification ofnucleotide variability possible and the phylogeny was estimated by maximum parsimony. Our study showed that the plastomedatabase can be exploited for phylogenetic analysis and biotechnological approaches.
simple sequence repeats (EST-SSR)

African Journals Online (AJOL)

Yomi

2012-01-19

Jan 19, 2012 ... 212 primer pairs selected, based on repeat patterns of n≥8 for di-, tri-, tetra- and penta-nucleotide repeat ... Cluster analysis revealed a high genetic similarity among the sugarcane (Saccharum spp.) breeding lines which could reduce the genetic gain in ..... The multiple allele characteristic of SSR com-.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

Science.gov (United States)

Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
A chromosome conformation capture ordered sequence of the barley genome

Czech Academy of Sciences Publication Activity Database

Mascher, M.; Gundlach, H.; Himmelbach, A.; Beier, S.; Twardziok, S. O.; Wicker, T.; Šimková, Hana; Staňková, Helena; Vrána, Jan; Chan, S.; Munoz-Amatrian, M.; Houben, A.; Doležel, Jaroslav; Ayling, S.; Lonardi, S.; Mayer, K.F.X.; Zhang, G.; Braumann, I.; Spannagl, M.; Li, C.; Waugh, R.; Stein, N.

2017-01-01

Roč. 544, č. 7651 (2017), s. 427-433 ISSN 0028-0836 R&D Projects: GA MŠk(CZ) LO1204 Institutional support: RVO:61389030 Keywords : bacterial artificial chromosomes * inverted-repeat elements * complex-plant genomes * hi-c * environmental adaptation * ltr retrotransposons * structural variation * maize genome * software * database Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Plant sciences, botany Impact factor: 40.137, year: 2016
Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals.

Science.gov (United States)

Kaneko-Ishino, Tomoko; Ishino, Fumitoshi

2015-01-01

Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is "mammalian-specific genomic functions", a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of "mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons", based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes.
Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus.

Science.gov (United States)

Biswas, Manosh Kumar; Chai, Lijun; Mayer, Christoph; Xu, Qiang; Guo, Wenwu; Deng, Xiuxin

2012-05-01

The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, discriminating capacity of genotypes and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers were developed from the 46,339 Citrus clementina BAC-end sequences (BES), of them 20.67% contained SSR longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSR are located in the non-coding region, but 1.3% of BES-SSRs were found to be associated with transposable element (TE). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus relative's species. Among these 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification, for genomic study of Citrus, Poncirus and Fortunella sp. Utility of the developed SSR marker was demonstrated by identifying a set of 118 markers each for construction of linkage map of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationship among 40 Citrus and its related species were conducted with the aid of 25 randomly selected SSR primer pairs and results revealed that citrus genomic SSRs are superior to genic SSR for genetic diversity and germplasm characterization of Citrus spp.
ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants.

Science.gov (United States)

Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

2014-01-01

Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
Reverse transcriptase sequences from mulberry LTR retrotransposons: characterization analysis

Directory of Open Access Journals (Sweden)

Ma Bi

2017-10-01

Full Text Available Copia and Gypsy play important roles in structural, functional and evolutionary dynamics of plant genomes. In this study, a total of 106 and 101, Copia and Gypsy reverse transcriptase (rt were amplified respectively in the Morus notabilis genome using degenerate primers. All sequences exhibited high levels of heterogeneity, were rich in AT and possessed higher sequence divergence of Copia rt in comparison to Gypsy rt. Two reasons are likely to account for this phenomenon: a these elements often experience deletions or fragmentation by illegitimate or unequal homologous recombination in the transposition process; b strong purifying selective pressure drives the evolution of these elements through “selective silencing” with random mutation and eventual deletion from the host genome. Interestingly, mulberry rt clustered with other rt from distantly related taxa according to the phylogenetic analysis. This phenomenon did not result from horizontal transposable element transfer. Results obtained from fluorescence in situ hybridization revealed that most of the hybridization signals were preferentially concentrated in pericentromeric and distal regions of chromosomes, and these elements may play important roles in the regions in which they are found. Results of this study support the continued pursuit of further functional studies of Copia and Gypsy in the mulberry genome.
Studies on emerging radiation leukemia virus variants in C57BL/Ka mice

International Nuclear Information System (INIS)

Rassart, E.; Shang, M.; Boie, Y.; Jolicoeur, P.

1986-01-01

To analyze the emergence of radiation leukemia virus (RadLV) variants in primary X-ray-induced C57BL/Ka thymoma and to identify the virus responsible for the very high leukemogenic potential of passaged Kaplan strain BL/VL3 preparation, we cloned several primary and passaged ecotropic RadLV infectious genomes. By restriction analysis, we found that BL/VL3 cells harbor three related but different ecotropic RadLVs. Their restriction map differs significantly from those of primary RadLVs. Hybridization analysis also indicated that BL/VL3 and primary RadLVs differ in their p15E and long terminal repeat (LTR) regions. The LTR sequence of primary weakly leukemogenic RadLV has only one change, a C-rich sequence, generating a 6-base-pair direct repeat just in front of the promotor. The LTR of the primary nonleukemogenic RadLV only showed few base changes, mainly clustered in R and U5. The LTR from a moderately leukemogenic passaged BL/VL3 RadLV had conserved the C-rich sequence and acquired a 43-base-pair direct repeat in U3 and several other point mutations, small insertions, and deletions scattered in U3, R, and U5. All cloned primary RadLVs were fibrotropic, and some were weakly leukemogenic. All cloned BL/VL3 RadLVs were thymotropic and nonfibrotropic. The block of their replication was found to be after the synthesis of unintegrated linear and supercoiled viral DNA. Most of the BL/VL3 RadLVs were moderately leukemogenic, and one (V-13) was highly leukemogenic, being as virulent as the Moloney strain. We propose a model for the emergence of the RadLV variants and show that the virus responsible for the high leukemogenic potential of BL/VL3 preparation is a nondefective, ecotropic, lymphotropic, nonfibrotropic, unique retrovirus which most likely arose from a parental primary RadLV similar to those studied here
Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

Science.gov (United States)

Raynard, Steven J; Baker, Mark D

2004-01-01

In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARS enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.
C-terminal low-complexity sequence repeats of Mycobacterium smegmatis Ku modulate DNA binding.

Science.gov (United States)

Kushwaha, Ambuj K; Grove, Anne

2013-01-24

Ku protein is an integral component of the NHEJ (non-homologous end-joining) pathway of DSB (double-strand break) repair. Both eukaryotic and prokaryotic Ku homologues have been characterized and shown to bind DNA ends. A unique feature of Mycobacterium smegmatis Ku is its basic C-terminal tail that contains several lysine-rich low-complexity PAKKA repeats that are absent from homologues encoded by obligate parasitic mycobacteria. Such PAKKA repeats are also characteristic of mycobacterial Hlp (histone-like protein) for which they have been shown to confer the ability to appose DNA ends. Unexpectedly, removal of the lysine-rich extension enhances DNA-binding affinity, but an interaction between DNA and the PAKKA repeats is indicated by the observation that only full-length Ku forms multiple complexes with a short stem-loop-containing DNA previously designed to accommodate only one Ku dimer. The C-terminal extension promotes DNA end-joining by T4 DNA ligase, suggesting that the PAKKA repeats also contribute to efficient end-joining. We suggest that low-complexity lysine-rich sequences have evolved repeatedly to modulate the function of unrelated DNA-binding proteins.
Analysis of transposable elements in the genome of Asparagus officinalis from high coverage sequence data.

Science.gov (United States)

Li, Shu-Fen; Gao, Wu-Jun; Zhao, Xin-Peng; Dong, Tian-Yu; Deng, Chuan-Liang; Lu, Long-Dou

2014-01-01

Asparagus officinalis is an economically and nutritionally important vegetable crop that is widely cultivated and is used as a model dioecious species to study plant sex determination and sex chromosome evolution. To improve our understanding of its genome composition, especially with respect to transposable elements (TEs), which make up the majority of the genome, we performed Illumina HiSeq2000 sequencing of both male and female asparagus genomes followed by bioinformatics analysis. We generated 17 Gb of sequence (12×coverage) and assembled them into 163,406 scaffolds with a total cumulated length of 400 Mbp, which represent about 30% of asparagus genome. Overall, TEs masked about 53% of the A. officinalis assembly. Majority of the identified TEs belonged to LTR retrotransposons, which constitute about 28% of genomic DNA, with Ty1/copia elements being more diverse and accumulated to higher copy numbers than Ty3/gypsy. Compared with LTR retrotransposons, non-LTR retrotransposons and DNA transposons were relatively rare. In addition, comparison of the abundance of the TE groups between male and female genomes showed that the overall TE composition was highly similar, with only slight differences in the abundance of several TE groups, which is consistent with the relatively recent origin of asparagus sex chromosomes. This study greatly improves our knowledge of the repetitive sequence construction of asparagus, which facilitates the identification of TEs responsible for the early evolution of plant sex chromosomes and is helpful for further studies on this dioecious plant.
Cloning and heterologous expression of a hydrophobin gene Ltr.hyd from the tiger milk mushroom Lentinus tuber-regium in yeast-like cells of Tremella fuciformis

Directory of Open Access Journals (Sweden)

Dongmei Liu

2018-03-01

Full Text Available Background: Hydrophobins are small proteins secreted by filamentous fungi, which show a highly surface activity. Because of the signally self-assembling abilities and surface activities, hydrophobins were considered as candidates in many aspects, for example, stabilizing foams and emulsions in food products. Lentinus tuber-regium, known as tiger milk mushroom, is both an edible and medicinal sclerotium-producing mushroom. Up to now, the hydrophobins of L. tuber-regium have not been identified. Results: In this paper, a Class I hydrophobin gene, Ltr.hyd, was cloned from L. tuber-regium and expressed in the yeast-like cells of Tremella fuciformis mediated by Agrobacterium tumefaciens. The expression vector pGEH-GH was under the control of T. fuciformis glyceraldehyde-3-phosphate dehydrogenase gene (gpd promoter. The integration of Ltr.hyd into the genome of T. fuciformis was confirmed by PCR, Southern blot, fluorescence observation and quantitative real-time PCR (qRT-PCR. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE demonstrated that recombinant hydrophobin rLtr.HYD with an expected molecular mass of 13 kDa was extracted. The yield of rLtr.HYD was 0.66 mg/g dry weight. The emulsifying activity of rLtr.HYD was better than the typical food emulsifiers sodium caseinate and Tween 20. Conclusions: We evaluated the emulsifying property of hydrophobin Ltr.HYD, which can be potentially used as a food emulsifier. Keywords: Agrobacterium tumefaciens, Emulsifier, Expression vector, Filamentous fungi, Gel electrophoresis, Glyceraldehyde-3-phosphate dehydrogenase, Heterogenous expression, Hydrophobin, Quantitative real-time PCR, Southern blot, Surface activity

The sunflower (Helianthus annuus L.) genome reflects a recent history of biased accumulation of transposable elements.

Science.gov (United States)

Staton, S Evan; Bakken, Bradley H; Blackman, Benjamin K; Chapman, Mark A; Kane, Nolan C; Tang, Shunxue; Ungerer, Mark C; Knapp, Steven J; Rieseberg, Loren H; Burke, John M

2012-10-01

Aside from polyploidy, transposable elements are the major drivers of genome size increases in plants. Thus, understanding the diversity and evolutionary dynamics of transposable elements in sunflower (Helianthus annuus L.), especially given its large genome size (∼3.5 Gb) and the well-documented cases of amplification of certain transposons within the genus, is of considerable importance for understanding the evolutionary history of this emerging model species. By analyzing approximately 25% of the sunflower genome from random sequence reads and assembled bacterial artificial chromosome (BAC) clones, we show that it is composed of over 81% transposable elements, 77% of which are long terminal repeat (LTR) retrotransposons. Moreover, the LTR retrotransposon fraction in BAC clones harboring genes is disproportionately composed of chromodomain-containing Gypsy LTR retrotransposons ('chromoviruses'), and the majority of the intact chromoviruses contain tandem chromodomain duplications. We show that there is a bias in the efficacy of homologous recombination in removing LTR retrotransposon DNA, thereby providing insight into the mechanisms associated with transposable element (TE) composition in the sunflower genome. We also show that the vast majority of observed LTR retrotransposon insertions have likely occurred since the origin of this species, providing further evidence that biased LTR retrotransposon activity has played a major role in shaping the chromatin and DNA landscape of the sunflower genome. Although our findings on LTR retrotransposon age and structure could be influenced by the selection of the BAC clones analyzed, a global analysis of random sequence reads indicates that the evolutionary patterns described herein apply to the sunflower genome as a whole. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability

Science.gov (United States)

Xiao, Jing; Zhao, Jin; Liu, Mengjun; Liu, Ping; Dai, Li; Zhao, Zhihui

2015-01-01

Chinese jujube (Ziziphus jujuba), an economically important species in the Rhamnaceae family, is a popular fruit tree in Asia. Here, we surveyed and characterized simple sequence repeats (SSRs) in the jujube genome. A total of 436,676 SSR loci were identified, with an average distance of 0.93 Kb between the loci. A large proportion of the SSRs included mononucleotide, dinucleotide and trinucleotide repeat motifs, which accounted for 64.87%, 24.40%, and 8.74% of all repeats, respectively. Among the mononucleotide repeats, A/T was the most common, whereas AT/TA was the most common dinucleotide repeat. A total of 30,565 primer pairs were successfully designed and screened using a series of criteria. Moreover, 725 of 1,000 randomly selected primer pairs were effective among 6 cultivars, and 511 of these primer pairs were polymorphic. Sequencing the amplicons of two SSRs across three jujube cultivars revealed variations in the repeats. The transferability of jujube SSR primers proved that 35/64 SSRs could be transferred across family boundary. Using jujube SSR primers, clustering analysis results from 15 species were highly consistent with the Angiosperm Phylogeny Group (APGIII) System. The genome-wide characterization of SSRs in Chinese jujube is very valuable for whole-genome characterization and marker-assisted selection in jujube breeding. In addition, the transferability of jujube SSR primers could provide a solid foundation for their further utilization. PMID:26000739
Effects of loading sequences and size of repeated stress block of loads on fatigue life calculated using fatigue functions

International Nuclear Information System (INIS)

Schott, G.

1989-01-01

It is well-known that collective form, stress intensity and loading sequence of individual stresses as well as size of repeated stress blocks can influence fatigue life, significantly. The basic variant of the consecutive Woehler curve concept will permit these effects to be involved into fatigue life computation. The paper presented will demonstrate that fatigue life computations using fatigue functions reflect the loading sequence effect with multilevel loading precisely and provide reliable fatigue life data. Effects of size of repeated stress block and loading sequence on fatigue life as observed with block program tests can be reproduced using the new computation method. (orig.) [de
Identification of apple cultivars on the basis of simple sequence repeat markers.

Science.gov (United States)

Liu, G S; Zhang, Y G; Tao, R; Fang, J G; Dai, H Y

2014-09-12

DNA markers are useful tools that play an important role in plant cultivar identification. They are usually based on polymerase chain reaction (PCR) and include simple sequence repeats (SSRs), inter-simple sequence repeats, and random amplified polymorphic DNA. However, DNA markers were not used effectively in the complete identification of plant cultivars because of the lack of known DNA fingerprints. Recently, a novel approach called the cultivar identification diagram (CID) strategy was developed to facilitate the use of DNA markers for separate plant individuals. The CID was designed whereby a polymorphic maker was generated from each PCR that directly allowed for cultivar sample separation at each step. Therefore, it could be used to identify cultivars and varieties easily with fewer primers. In this study, 60 apple cultivars, including a few main cultivars in fields and varieties from descendants (Fuji x Telamon) were examined. Of the 20 pairs of SSR primers screened, 8 pairs gave reproducible, polymorphic DNA amplification patterns. The banding patterns obtained from these 8 primers were used to construct a CID map. Each cultivar or variety in this study was distinguished from the others completely, indicating that this method can be used for efficient cultivar identification. The result contributed to studies on germplasm resources and the seedling industry in fruit trees.
Distribution and evolution of repeated sequences in genomes of Triatominae (Hemiptera-Reduviidae inferred from genomic in situ hybridization.

Directory of Open Access Journals (Sweden)

Sebastian Pita

Full Text Available The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by Genomic in situ Hybridization using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is constituted by highly repeated sequences, which is conserved among 10 species of Triatomini tribe suggesting be an ancestral character for this group. However, the Y chromosome in Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about autosomal repeated sequences distribution but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.
Simple sequence repeats in Neurospora crassa: distribution, polymorphism and evolutionary inference

Directory of Open Access Journals (Sweden)

Park Jongsun

2008-01-01

Full Text Available Abstract Background Simple sequence repeats (SSRs have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. Results We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST, which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. Conclusion Taking our computational, statistical and experimental data together, we conclude that 1 the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, 2 the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, 3 there are different levels of evolutionary forces in variation of amino acid repeats, and 4 SSRs are stable molecular markers for genetic studies in N. crassa.
Sequence variations in C9orf72 downstream of the hexanucleotide repeat region and its effect on repeat-primed PCR interpretation

DEFF Research Database (Denmark)

Nordin, Angelica; Akimoto, Chizuru; Wuolikainen, Anna

2017-01-01

A large GGGGCC-repeat expansion mutation (HREM) in C9orf72 is the most common known cause of ALS and FTD in European populations. Sequence variations immediately downstream of the HREM region have previously been observed and have been suggested to be one reason for difficulties in interpreting R...
Length and repeat-sequence variation in 58 STRs and 94 SNPs in two Spanish populations.

Science.gov (United States)

Casals, Ferran; Anglada, Roger; Bonet, Núria; Rasal, Raquel; van der Gaag, Kristiaan J; Hoogenboom, Jerry; Solé-Morata, Neus; Comas, David; Calafell, Francesc

2017-09-01

We have genotyped the 58 STRs (27 autosomal, 24 Y-STRs and 7 X-STRs) and 94 autosomal SNPs in Illumina ForenSeq™ Primer Mix A in 88 Spanish Roma (Gypsy) samples and 143 Catalans. Since this platform is based in massive parallel sequencing, we have used simple R scripts to uncover the sequence variation in the repeat region. Thus, we have found, across 58 STRs, 541 length-based alleles, which, after considering repeat-sequence variation, became 804 different alleles. All loci in both populations were in Hardy-Weinberg equilibrium. F ST between both populations was 0.0178 for autosomal SNPs, 0.0146 for autosomal STRs, 0.0101 for X-STRs and 0.1866 for Y-STRs. Combined a priori statistics showed quite large; for instance, pooling all the autosomal loci, the a priori probabilities of discriminating a suspect become 1-(2.3×10 -70 ) and 1-(5.9×10 -73 ), for Roma and Catalans respectively, and the chances of excluding a false father in a trio are 1-(2.6×10 -20 ) and 1-(2.0×10 -21 ). Copyright © 2017 Elsevier B.V. All rights reserved.
Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

Science.gov (United States)

Amirhaeri, S; Wohlrab, F; Wells, R D

1995-02-17

The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.
Molecular epidemiology of endemic human T-lymphotropic virus type 1 in a rural community in Guinea-Bissau.

Directory of Open Access Journals (Sweden)

Carla van Tienen

Full Text Available Human T-Lymphotropic Virus Type 1 (HTLV-1 infection causes lethal adult T-cell leukemia (ATL and severely debilitating HTLV-associated myelopathy/tropical spastic paraparesis (HAM/TSP in up to 5% of infected adults. HTLV-1 is endemic in parts of Africa and the highest prevalence in West Africa (5% has been reported in Caio, a rural area in the North-West of Guinea-Bissau. It is not known which HTLV-1 variants are present in this community. Sequence data can provide insights in the molecular epidemiology and help to understand the origin and spread of HTLV-1.To gain insight into the molecular diversity of HTLV-1 in West Africa.HTLV-1 infected individuals were identified in community surveys between 1990-2007. The complete Long Terminal Repeat (LTR and p24 coding region of HTLV-1 was sequenced from infected subjects. Socio-demographic data were obtained from community census and from interviews performed by fieldworkers. Phylogenetic analyses were performed to characterize the relationship between the Caio HTLV-1 and HTLV-1 from other parts of the world.LTR and p24 sequences were obtained from 72 individuals (36 LTR, 24 p24 only and 12 both. Consistent with the low evolutionary change of HTLV-1, many of the sequences from unrelated individuals showed 100% nucleotide identity. Most (45 of 46 of the LTR sequences clustered with the Cosmopolitan HTLV-1 subtype 1a, subgroup D (1aD. LTR and p24 sequences from two subjects were divergent and formed a significant cluster with HTLV-1 subtype 1g, and with the most divergent African Simian T-cell Lymphotropic Virus, Tan90.The Cosmopolitan HTLV-1 1aD predominates in this rural West African community. However, HTLV-1 subtype 1g is also present. This subtype has not been described before in West Africa and may be more widespread than previously thought. These data are in line with the hypothesis that multiple monkey-to-man zoonotic events are contributing to HTLV-1 diversity.
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L. genome

Directory of Open Access Journals (Sweden)

González Leonardo Galindo

2012-11-01

Full Text Available Abstract Background Flax (Linum usitatissimum L. is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Results Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage, followed by Long Interspersed Nuclear Element (LINE retrotransposons (2.10% and Mutator DNA transposons (1.99%. Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. Conclusions The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome.

Science.gov (United States)

González, Leonardo Galindo; Deyholos, Michael K

2012-11-21

Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of
Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

Science.gov (United States)

Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

2013-04-01

Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST-SSR markers was also found for relative species.
Transcription arrest by a G quadruplex forming-trinucleotide repeat sequence from the human c-myb gene.

Science.gov (United States)

Broxson, Christopher; Beckett, Joshua; Tornaletti, Silvia

2011-05-17

Non canonical DNA structures correspond to genomic regions particularly susceptible to genetic instability. The transcription process facilitates formation of these structures and plays a major role in generating the instability associated with these genomic sites. However, little is known about how non canonical structures are processed when encountered by an elongating RNA polymerase. Here we have studied the behavior of T7 RNA polymerase (T7RNAP) when encountering a G quadruplex forming-(GGA)(4) repeat located in the human c-myb proto-oncogene. To make direct correlations between formation of the structure and effects on transcription, we have taken advantage of the ability of the T7 polymerase to transcribe single-stranded substrates and of G4 DNA to form in single-stranded G-rich sequences in the presence of potassium ions. Under physiological KCl concentrations, we found that T7 RNAP transcription was arrested at two sites that mapped to the c-myb (GGA)(4) repeat sequence. The extent of arrest did not change with time, indicating that the c-myb repeat represented an absolute block and not a transient pause to T7 RNAP. Consistent with G4 DNA formation, arrest was not observed in the absence of KCl or in the presence of LiCl. Furthermore, mutations in the c-myb (GGA)(4) repeat, expected to prevent transition to G4, also eliminated the transcription block. We show T7 RNAP arrest at the c-myb repeat in double-stranded DNA under conditions mimicking the cellular concentration of biomolecules and potassium ions, suggesting that the G4 structure formed in the c-myb repeat may represent a transcription roadblock in vivo. Our results support a mechanism of transcription-coupled DNA repair initiated by arrest of transcription at G4 structures.
Use of short tandem repeat sequences to study Mycobacterium leprae in leprosy patients in Malawi and India.

Directory of Open Access Journals (Sweden)

Saroj K Young

2008-04-01

Full Text Available Inadequate understanding of the transmission of Mycobacterium leprae makes it difficult to predict the impact of leprosy control interventions. Genotypic tests that allow tracking of individual bacterial strains would strengthen epidemiological studies and contribute to our understanding of the disease.Genotyping assays based on variation in the copy number of short tandem repeat sequences were applied to biopsies collected in population-based epidemiological studies of leprosy in northern Malawi, and from members of multi-case households in Hyderabad, India. In the Malawi series, considerable genotypic variability was observed between patients, and also within patients, when isolates were collected at different times or from different tissues. Less within-patient variability was observed when isolates were collected from similar tissues at the same time. Less genotypic variability was noted amongst the closely related Indian patients than in the Malawi series.Lineages of M. leprae undergo changes in their pattern of short tandem repeat sequences over time. Genetic divergence is particularly likely between bacilli inhabiting different (e.g., skin and nerve tissues. Such variability makes short tandem repeat sequences unsuitable as a general tool for population-based strain typing of M. leprae, or for distinguishing relapse from reinfection. Careful use of these markers may provide insights into the development of disease within individuals and for tracking of short transmission chains.
Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

Science.gov (United States)

Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

2014-04-08

The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.
Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

Science.gov (United States)

Oggioni, M R; Claverys, J P

1999-10-01

A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

International Nuclear Information System (INIS)

Kudo, Shinichi; Fukuda, Minoru

1989-01-01

Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ∼ 1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggest that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication
Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

Science.gov (United States)

Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

2012-08-01

Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

Science.gov (United States)

M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

2009-01-01

The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...

Genotyping and Molecular Identification of Date Palm Cultivars Using Inter-Simple Sequence Repeat (ISSR) Markers.

Science.gov (United States)

Ayesh, Basim M

2017-01-01

Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal high polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. The protocol describes 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, with 28 polymorphic loci.
The complete chloroplast genome sequence of Taxus chinensis var. mairei (Taxaceae): loss of an inverted repeat region and comparative analysis with related species.

Science.gov (United States)

Zhang, Yanzhen; Ma, Ji; Yang, Bingxian; Li, Ruyi; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Zhang, Lin

2014-05-01

Taxus chinensis var. mairei (Taxaceae) is a domestic variety of yew species in local China. This plant is one of the sources for paclitaxel, which is a promising antineoplastic chemotherapy drugs during the last decade. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of T. chinensis var. mairei. The T. chinensis var. mairei cp genome is 129,513 bp in length, with 113 single copy genes and two duplicated genes (trnI-CAU, trnQ-UUG). Among the 113 single copy genes, 9 are intron-containing. Compared to other land plant cp genomes, the T. chinensis var. mairei cp genome has lost one of the large inverted repeats (IRs) found in angiosperms, fern, liverwort, and gymnosperm such as Cycas revoluta and Ginkgo biloba L. Compared to related species, the gene order of T. chinensis var. mairei has a large inversion of ~110kb including 91 genes (from rps18 to accD) with gene contents unarranged. Repeat analysis identified 48 direct and 2 inverted repeats 30 bp long or longer with a sequence identity greater than 90%. Repeated short segments were found in genes rps18, rps19 and clpP. Analysis also revealed 22 simple sequence repeat (SSR) loci and almost all are composed of A or T. Copyright © 2014 Elsevier B.V. All rights reserved.
Rotavirus 2/6 Viruslike Particles Administered Intranasally with Cholera Toxin, Escherichia coli Heat-Labile Toxin (LT), and LT-R192G Induce Protection from Rotavirus Challenge

Science.gov (United States)

O’Neal, Christine M.; Clements, John D.; Estes, Mary K.; Conner, Margaret E.

1998-01-01

We have shown that rotavirus 2/6 viruslike particles composed of proteins VP2 and VP6 (2/6-VLPs) administered to mice intranasally with cholera toxin (CT) induced protection from rotavirus challenge, as measured by virus shedding. Since it is unclear if CT will be approved for human use, we evaluated the adjuvanticity of Escherichia coli heat-labile toxin (LT) and LT-R192G. Mice were inoculated intranasally with 10 μg of 2/6-VLPs combined with CT, LT, or LT-R192G. All three adjuvants induced equivalent geometric mean titers of rotavirus-specific serum antibody and intestinal immunoglobulin G (IgG). Mice inoculated with 2/6-VLPs with LT produced significantly higher titers of intestinal IgA than mice given CT as the adjuvant. All mice inoculated with 2/6-VLPs mixed with LT and LT-R192G were totally protected (100%) from rotavirus challenge, while mice inoculated with 2/6-VLPs mixed with CT showed a mean 91% protection from challenge. The availability of a safe, effective mucosal adjuvant such as LT-R192G will increase the practicality of administering recombinant vaccines mucosally. PMID:9525668
No Evidence of XMRV or MuLV Sequences in Prostate Cancer, Diffuse Large B-Cell Lymphoma, or the UK Blood Donor Population

Directory of Open Access Journals (Sweden)

Mark James Robinson

2011-01-01

Full Text Available Xenotropic murine leukaemia virus-related virus (XMRV is a recently described retrovirus which has been claimed to infect humans and cause associated pathology. Initially identified in the US in patients with prostate cancer and subsequently in patients with chronic fatigue syndrome, doubt now exists that XMRV is a human pathogen. We studied the prevalence of genetic sequences of XMRV and related MuLV sequences in human prostate cancer, from B cell lymphoma patients and from UK blood donors. Nucleic acid was extracted from fresh prostate tissue biopsies, formalin-fixed paraffin-embedded (FFPE prostate tissue and FFPE B-cell lymphoma. The presence of XMRV-specific LTR or MuLV generic gag-like sequences was investigated by nested PCR. To control for mouse DNA contamination, a PCR that detected intracisternal A-type particle (IAP sequences was included. In addition, DNA and RNA were extracted from whole blood taken from UK blood donors and screened for XMRV sequences by real-time PCR. XMRV or MuLV-like sequences were not amplified from tissue samples. Occasionally MuLV gag and XMRV-LTR sequences were amplified from Indian prostate cancer samples, but were always detected in conjunction with contaminating murine genomic DNA. We found no evidence of XMRV or MuLV infection in the UK blood donors.
Enhancement of Intranasal Vaccination in Mice with Deglycosylated Chain A Ricin by LTR72, a Novel Mucosal Adjuvant

National Research Council Canada - National Science Library

Kende, Meir; Del Giudice, Giuseppe; Rivera, Noelia; Hewetson, John

2006-01-01

.... However, in the presence of 4, 2, or 1 microg of the mucosal adjuvant LTR72, a mutant of the heat-labile enterotoxin of Escherichia coli, the low antibody response and protection were substantially enhanced...
Enhancement of Intranasal Vaccination in Mice with Deglycosylated Chain A Ricin by LTR72, a Novel Mucosal Adjuvant

National Research Council Canada - National Science Library

Kende, Meir; Del Giudice, Giuseppe; Rivera, Noelia; Hewetson, John

2006-01-01

.... However, in the presence of 4, 2, or 1 micro-gram of the mucosal adjuvant LTR72, a mutant of the heat-labile enterotoxin of Escherichia coli, the low antibody response and protection were substantially enhanced...
Exceptional diversity, non-random distribution, and rapid evolution of retroelements in the B73 maize genome.

Directory of Open Access Journals (Sweden)

Regina S Baucom

2009-11-01

Full Text Available Recent comprehensive sequence analysis of the maize genome now permits detailed discovery and description of all transposable elements (TEs in this complex nuclear environment. Reiteratively optimized structural and homology criteria were used in the computer-assisted search for retroelements, TEs that transpose by reverse transcription of an RNA intermediate, with the final results verified by manual inspection. Retroelements were found to occupy the majority (>75% of the nuclear genome in maize inbred B73. Unprecedented genetic diversity was discovered in the long terminal repeat (LTR retrotransposon class of retroelements, with >400 families (>350 newly discovered contributing >31,000 intact elements. The two other classes of retroelements, SINEs (four families and LINEs (at least 30 families, were observed to contribute 1,991 and approximately 35,000 copies, respectively, or a combined approximately 1% of the B73 nuclear genome. With regard to fully intact elements, median copy numbers for all retroelement families in maize was 2 because >250 LTR retrotransposon families contained only one or two intact members that could be detected in the B73 draft sequence. The majority, perhaps all, of the investigated retroelement families exhibited non-random dispersal across the maize genome, with LINEs, SINEs, and many low-copy-number LTR retrotransposons exhibiting a bias for accumulation in gene-rich regions. In contrast, most (but not all medium- and high-copy-number LTR retrotransposons were found to preferentially accumulate in gene-poor regions like pericentromeric heterochromatin, while a few high-copy-number families exhibited the opposite bias. Regions of the genome with the highest LTR retrotransposon density contained the lowest LTR retrotransposon diversity. These results indicate that the maize genome provides a great number of different niches for the survival and procreation of a great variety of retroelements that have evolved to
Coactivator-associated arginine methyltransferase 1 enhances transcriptional activity of the human T-cell lymphotropic virus type 1 long terminal repeat through direct interaction with Tax.

Science.gov (United States)

Jeong, Soo-Jin; Lu, Hanxin; Cho, Won-Kyung; Park, Hyeon Ung; Pise-Masison, Cynthia; Brady, John N

2006-10-01

In this study, we demonstrate that the coactivator-associated arginine methyltransferase 1 (CARM1), which methylates histone H3 and other proteins such as p300/CBP, is positively involved in the regulation of Tax transactivation. First, transfection studies demonstrated that overexpression of CARM1 wild-type protein resulted in increased Tax transactivation of the human T-cell lymphotropic virus type 1 (HTLV-1) long terminal repeat (LTR). In contrast, transfection of a catalytically inactive CARM1 methyltransferase mutant did not enhance Tax transactivation. CARM1 facilitated Tax transactivation of the CREB-dependent cellular GEM promoter. A direct physical interaction between HTLV-1 Tax and CARM1 was demonstrated using in vitro glutathione S-transferase-Tax binding assays, in vivo coimmunoprecipitation, and confocal microscopy experiments. Finally, chromatin immunoprecipitation analysis of the activated HTLV-1 LTR promoter showed the association of CARM1 and methylated histone H3 with the template DNA. In vitro, Tax facilitates the binding of CARM1 to the transcription complex. Together, our data provide evidence that CARM1 enhances Tax transactivation of the HTLV-1 LTR through a direct interaction between CARM1 and Tax and this binding promotes methylation of histone H3 (R2, R17, and R26).
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Science.gov (United States)

Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

2015-01-01

The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Directory of Open Access Journals (Sweden)

Jiří Macas

Full Text Available The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57% of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%. Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Inter-simple sequence repeat (ISSR) loci mapping in the genome of perennial ryegrass

DEFF Research Database (Denmark)

Pivorienė, O; Pašakinskienė, I; Brazauskas, G

2008-01-01

The aim of this study was to identify and characterize new ISSR markers and their loci in the genome of perennial ryegrass. A subsample of the VrnA F2 mapping family of perennial ryegrass comprising 92 individuals was used to develop a linkage map including inter-simple sequence repeat markers...... demonstrated a 70% similarity to the Hordeum vulgare germin gene GerA. Inter-SSR mapping will provide useful information for gene targeting, quantitative trait loci mapping and marker-assisted selection in perennial ryegrass....
Effects of GABA[subscript A] Modulators on the Repeated Acquisition of Response Sequences in Squirrel Monkeys

Science.gov (United States)

Campbell, Una C.; Winsauer, Peter J.; Stevenson, Michael W.; Moerschbaecher, Joseph M.

2004-01-01

The present study investigated the effects of positive and negative GABA[subscript A] modulators under three different baselines of repeated acquisition in squirrel monkeys in which the monkeys acquired a three-response sequence on three keys under a second-order fixed-ratio (FR) schedule of food reinforcement. In two of these baselines, the…
Development and Characterization of Simple Sequence Repeat (SSR) Markers Based on RNA-Sequencing of Medicago sativa and In silico Mapping onto the M. truncatula Genome

Science.gov (United States)

Wang, Zan; Yu, Guohui; Shi, Binbin; Wang, Xuemin; Qiang, Haiping; Gao, Hongwen

2014-01-01

Sufficient codominant genetic markers are needed for various genetic investigations in alfalfa since the species is an outcrossing autotetraploid. With the newly developed next generation sequencing technology, a large amount of transcribed sequences of alfalfa have been generated and are available for identifying SSR markers by data mining. A total of 54,278 alfalfa non-redundant unigenes were assembled through the Illumina HiSeqTM 2000 sequencing technology. Based on 3,903 unigene sequences, 4,493 SSRs were identified. Tri-nucleotide repeats (56.71%) were the most abundant motif class while AG/CT (21.7%), AGG/CCT (19.8%), AAC/GTT (10.3%), ATC/ATG (8.8%), and ACC/GGT (6.3%) were the subsequent top five nucleotide repeat motifs. Eight hundred and thirty- seven EST-SSR primer pairs were successfully designed. Of these, 527 (63%) primer pairs yielded clear and scored PCR products and 372 (70.6%) exhibited polymorphisms. High transferability was observed for ssp falcata at 99.2% (523) and 71.7% (378) in M. truncatula. In addition, 313 of 527 SSR marker sequences were in silico mapped onto the eight M. truncatula chromosomes. Thirty-six polymorphic SSR primer pairs were used in the genetic relatedness analysis of 30 Chinese alfalfa cultivated accessions generating a total of 199 scored alleles. The mean observed heterozygosity and polymorphic information content were 0.767 and 0.635, respectively. The codominant markers not only enriched the current resources of molecular markers in alfalfa, but also would facilitate targeted investigations in marker-trait association, QTL mapping, and genetic diversity analysis in alfalfa. PMID:24642969
Mutations that abrogate transactivational activity of the feline leukemia virus long terminal repeat do not affect virus replication

International Nuclear Information System (INIS)

Abujamra, Ana L.; Faller, Douglas V.; Ghosh, Sajal K.

2003-01-01

The U3 region of the LTR of oncogenic Moloney murine leukemia virus (Mo-MuLV) and feline leukemia viruses (FeLV) have been previously reported to activate expression of specific cellular genes in trans, such as MHC class I, collagenase IV, and MCP-1, in an integration-independent manner. It has been suggested that transactivation of these specific cellular genes by leukemia virus U3-LTR may contribute to the multistage process of leukemogenesis. The U3-LTR region, necessary for gene transactivational activity, also contains multiple transcription factor-binding sites that are essential for normal virus replication. To dissect the promoter activity and the gene transactivational activity of the U3-LTR, we conducted mutational analysis of the U3-LTR region of FeLV-A molecular clone 61E. We identified minimal nucleotide substitution mutants on the U3 LTR that did not disturb transcription factor-binding sites but abrogated its ability to transactivate the collagenase gene promoter. To determine if these mutations actually have altered any uncharacterized important transcription factor-binding site, we introduced these U3-LTR mutations into the full-length infectious molecular clone 61E. We demonstrate that the mutant virus was replication competent but could not transactivate cellular gene expression. These results thus suggest that the gene transactivational activity is a distinct property of the LTR and possibly not related to its promoter activity. The cellular gene transactivational activity-deficient mutant FeLV generated in this study may also serve as a valuable reagent for testing the biological significance of LTR-mediated cellular gene activation in the tumorigenesis caused by leukemia viruses
Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

Directory of Open Access Journals (Sweden)

Varala Kranthi

2007-05-01

Full Text Available Abstract Background Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis. Conclusion This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.
Akv murine leukemia virus enhances bone tumorigenesis in hMT-c-fos-LTR transgenic mice

DEFF Research Database (Denmark)

Schmidt, Jörg; Krump-Konvalinkova, Vera; Luz, Arne

1995-01-01

hMt-c-fos-LTR transgenic mice (U. Rüther, D. Komitowski, F. R. Schubert, and E. F. Wagner. Oncogene 4, 861–865, 1989) developed bone sarcomas in 20% (3/15) of females at 448 ± 25 days and in 8% (1/12) of males at 523 days. After infection of newborns with Akv, an infectious retrovirus derived from...
The leucine-rich repeat structure.

Science.gov (United States)

Bella, J; Hindle, K L; McEwan, P A; Lovell, S C

2008-08-01

The leucine-rich repeat is a widespread structural motif of 20-30 amino acids with a characteristic repetitive sequence pattern rich in leucines. Leucine-rich repeat domains are built from tandems of two or more repeats and form curved solenoid structures that are particularly suitable for protein-protein interactions. Thousands of protein sequences containing leucine-rich repeats have been identified by automatic annotation methods. Three-dimensional structures of leucine-rich repeat domains determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. As the essential structural principles become well established, the leucine-rich repeat architecture is emerging as an attractive framework for structural prediction and protein engineering. This review presents an update of the current understanding of leucine-rich repeat structure at the primary, secondary, tertiary and quaternary levels and discusses specific examples from recently determined three-dimensional structures.
Comparison of the degree of homology of DNA and quantity of repeated sequences in an intact plant and cell structure

International Nuclear Information System (INIS)

Solov'yan, V.T.; Kunaleh, V.A.; Shumnyl, V.K.; Vershinin, A.V.

1986-01-01

This paper attempts to assess the quantity of repeated sequences and degree of homology of DNA in the intact plant and two lines of callus tissue of Rauwolfia serpentina Benth maintained for 20 years, which differ among themselves in the level of biosynthesis of the pharmacologically valuable alkaloid ajmaline. The tritium-labeled repeats of plants and calli were used in direct and reverse hybridization on nitrocellulose filters. Hybridization of H 3-labeled repeats with phage 17 DNA was used as control. The radioactivity of filters after washing was measured in a liquid scintillation counter
Survey and analysis of simple sequence repeats in the Laccaria bicolor genome, with development of microsatellite markers

Energy Technology Data Exchange (ETDEWEB)

Labbe, Jessy L [ORNL; Murat, Claude [INRA, Nancy, France; Morin, Emmanuelle [INRA, Nancy, France; Le Tacon, F [UMR, France; Martin, Francis [INRA, Nancy, France

2011-01-01

It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

Directory of Open Access Journals (Sweden)

Gao Zhihong

2010-07-01

Full Text Available Abstract Background Expressed Sequence Tag (EST has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047, among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65% and low in the peach (46%, and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.

Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

Science.gov (United States)

Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

2015-10-01

Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.
Linkage of congenital isolated adrenocorticotropic hormone deficiency to the corticotropin releasing hormone locus using simple sequence repeat polymorphisms

Energy Technology Data Exchange (ETDEWEB)

Kyllo, J.H.; Collins, M.M.; Vetter, K.L. [Univ. of Iowa College of Medicine, Iowa City, IA (United States)] [and others

1996-03-29

Genetic screening techniques using simple sequence repeat polymorphisms were applied to investigate the molecular nature of congenital isolated adrenocorticotropic hormone (ACTH) deficiency. We hypothesize that this rare cause of hypocortisolism shared by a brother and sister with two unaffected sibs and unaffected parents is inherited as an autosomal recessive single gene mutation. Genes involved in the hypothalamic-pituitary axis controlling cortisol sufficiency were investigated for a causal role in this disorder. Southern blotting showed no detectable mutations of the gene encoding pro-opiomelanocortin (POMC), the ACTH precursor. Other candidate genes subsequently considered were those encoding neuroendocrine convertase-1, and neuroendocrine convertase-2 (NEC-1, NEC-2), and corticotropin releasing hormone (CRH). Tests for linkage were performed using polymorphic di- and tetranucleotide simple sequence repeat markers flanking the reported map locations for POMC, NEC-1, NEC-2, and CRH. The chromosomal haplotypes determined by the markers flanking the loci for POMC, NEC-1, and NEC-2 were not compatible with linkage. However, 22 individual markers defining the chromosomal haplotypes flanking CRH were compatible with linkage of the disorder to the immediate area of this gene of chromosome 8. Based on these data, we hypothesize that the ACTH deficiency in this family is due to an abnormality of CRH gene structure or expression. These results illustrate the useful application of high density genetic maps constructed with simple sequence repeat markers for inclusion/exclusion studies of candidate genes in even very small nuclear families segregating for unusual phenotypes. 25 refs., 5 figs., 2 tabs.
Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple

Science.gov (United States)

2012-01-01

Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding
Creation and structure determination of an artificial protein with three complete sequence repeats

Energy Technology Data Exchange (ETDEWEB)

Adachi, Motoyasu, E-mail: adachi.motoyasu@jaea.go.jp; Shimizu, Rumi; Kuroki, Ryota [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Blaber, Michael [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Florida State University, Tallahassee, FL 32306-4300 (United States)

2013-11-01

An artificial protein with three complete sequence repeats was created and the structure was determined by X-ray crystallography. The structure showed threefold symmetry even though there is an amino- and carboxy-terminal. The artificial protein with threefold symmetry may be useful as a scaffold to capture small materials with C3 symmetry. Symfoil-4P is a de novo protein exhibiting the threefold symmetrical β-trefoil fold designed based on the human acidic fibroblast growth factor. First three asparagine–glycine sequences of Symfoil-4P are replaced with glutamine–glycine (Symfoil-QG) or serine–glycine (Symfoil-SG) sequences protecting from deamidation, and His-Symfoil-II was prepared by introducing a protease digestion site into Symfoil-QG so that Symfoil-II has three complete repeats after removal of the N-terminal histidine tag. The Symfoil-QG and SG and His-Symfoil-II proteins were expressed in Eschericha coli as soluble protein, and purified by nickel affinity chromatography. Symfoil-II was further purified by anion-exchange chromatography after removing the HisTag by proteolysis. Both Symfoil-QG and Symfoil-II were crystallized in 0.1 M Tris-HCl buffer (pH 7.0) containing 1.8 M ammonium sulfate as precipitant at 293 K; several crystal forms were observed for Symfoil-QG and II. The maximum diffraction of Symfoil-QG and II crystals were 1.5 and 1.1 Å resolution, respectively. The Symfoil-II without histidine tag diffracted better than Symfoil-QG with N-terminal histidine tag. Although the crystal packing of Symfoil-II is slightly different from Symfoil-QG and other crystals of Symfoil derivatives having the N-terminal histidine tag, the refined crystal structure of Symfoil-II showed pseudo-threefold symmetry as expected from other Symfoils. Since the removal of the unstructured N-terminal histidine tag did not affect the threefold structure of Symfoil, the improvement of diffraction quality of Symfoil-II may be caused by molecular characteristics of
Endogenous retrovirus EAV-HP linked to blue egg phenotype in Mapuche fowl.

Science.gov (United States)

Wragg, David; Mwacharo, Joram M; Alcalde, José A; Wang, Chen; Han, Jian-Lin; Gongora, Jaime; Gourichon, David; Tixier-Boichard, Michèle; Hanotte, Olivier

2013-01-01

Oocyan or blue/green eggshell colour is an autosomal dominant trait found in native chickens (Mapuche fowl) of Chile and in some of their descendants in European and North American modern breeds. We report here the identification of an endogenous avian retroviral (EAV-HP) insertion in oocyan Mapuche fowl and European breeds. Sequencing data reveals 100% retroviral identity between the Mapuche and European insertions. Quantitative real-time PCR analysis of European oocyan chicken indicates over-expression of the SLCO1B3 gene (P<0.05) in the shell gland and oviduct. Predicted transcription factor binding sites in the long terminal repeats (LTR) indicate AhR/Ar, a modulator of oestrogen, as a possible promoter/enhancer leading to reproductive tissue-specific over-expression of the SLCO1B3 gene. Analysis of all jungle fowl species Gallus sp. supports the retroviral insertion to be a post-domestication event, while identical LTR sequences within domestic chickens are in agreement with a recent de novo mutation.
Triplet repeat sequences in human DNA can be detected by hybridization to a synthetic (5'-CGG-3')17 oligodeoxyribonucleotide

DEFF Research Database (Denmark)

Behn-Krappa, A; Mollenhauer, J; Doerfler, W

1993-01-01

The seemingly autonomous amplification of naturally occurring triplet repeat sequences in the human genome has been implicated in the causation of human genetic disease, such as the fragile X (Martin-Bell) syndrome, myotonic dystrophy (Curshmann-Steinert), spinal and bulbar muscular atrophy...
The Pentapeptide Repeat Proteins

OpenAIRE

Vetting, Matthew W.; Hegde, Subray S.; Fajardo, J. Eduardo; Fiser, Andras; Roderick, Steven L.; Takiff, Howard E.; Blanchard, John S.

2006-01-01

The Pentapeptide Repeat Protein (PRP) family has over 500 members in the prokaryotic and eukaryotic kingdoms. These proteins are composed of, or contain domains composed of, tandemly repeated amino acid sequences with a consensus sequence of [S,T,A,V][D,N][L,F]-[S,T,R][G]. The biochemical function of the vast majority of PRP family members is unknown. The three-dimensional structure of the first member of the PRP family was determined for the fluoroquinolone resistance protein (MfpA) from Myc...
Rotavirus 2/6 Viruslike Particles Administered Intranasally with Cholera Toxin, Escherichia coli Heat-Labile Toxin (LT), and LT-R192G Induce Protection from Rotavirus Challenge

OpenAIRE

O’Neal, Christine M.; Clements, John D.; Estes, Mary K.; Conner, Margaret E.

1998-01-01

We have shown that rotavirus 2/6 viruslike particles composed of proteins VP2 and VP6 (2/6-VLPs) administered to mice intranasally with cholera toxin (CT) induced protection from rotavirus challenge, as measured by virus shedding. Since it is unclear if CT will be approved for human use, we evaluated the adjuvanticity of Escherichia coli heat-labile toxin (LT) and LT-R192G. Mice were inoculated intranasally with 10 μg of 2/6-VLPs combined with CT, LT, or LT-R192G. All three adjuvants induced ...
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

Directory of Open Access Journals (Sweden)

Yandell Mark

2010-07-01

Full Text Available Abstract Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24. The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity elsewhere in the genome, but only 23% have identical copies (99% identity. The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is
Histone acetyltransferase (HAT) activity of p300 modulates human T lymphotropic virus type 1 p30II-mediated repression of LTR transcriptional activity

International Nuclear Information System (INIS)

Michael, Bindhu; Nair, Amrithraj M.; Datta, Antara; Hiraragi, Hajime; Ratner, Lee; Lairmore, Michael D.

2006-01-01

Human T-lymphotropic virus type-1 (HTLV-1) is a deltaretrovirus that causes adult T cell leukemia/lymphoma, and is implicated in a variety of lymphocyte-mediated inflammatory disorders. HTLV-1 provirus has regulatory and accessory genes in four pX open reading frames. HTLV-1 pX ORF-II encodes two proteins, p13 II and p30 II , which are incompletely defined in virus replication or pathogenesis. We have demonstrated that pX ORF-II mutations block virus replication in vivo and that ORF-II encoded p30 II , a nuclear-localizing protein that binds with CREB-binding protein (CBP)/p300, represses CREB and Tax responsive element (TRE)-mediated transcription. Herein, we have identified p30 II motifs important for p300 binding and in regulating TRE-mediated transcription in the absence and presence of HTLV-1 provirus. Within amino acids 100-179 of p30 II , a region important for repression of LTR-mediated transcription, we identified a single lysine residue at amino acid 106 (K3) that significantly modulates the ability of p30 II to repress TRE-mediated transcription. Exogenous p300, in a dose-responsive manner, reverses p30 II -dependent repression of TRE-mediated transcription, in the absence or presence of the provirus, In contrast to wild type p300, p300 HAT mutants (defective in histone acetyltransferase activity) only partially rescued p30 II -mediated LTR repression. Deacetylation by histone deacetylase-1 (HDAC-1) enhanced p30 II -mediated LTR repression, while inhibition of deacetylation by trichostatin A decreases p30 II -mediated LTR repression. Collectively, our data indicate that HTLV-1 p30 II modulates viral gene expression in a cooperative manner with p300-mediated acetylation
Genome-Wide Analysis of Simple Sequence Repeats in Bitter Gourd (Momordica charantia

Directory of Open Access Journals (Sweden)

Junjie Cui

2017-06-01

Full Text Available Bitter gourd (Momordica charantia is widely cultivated as a vegetable and medicinal herb in many Asian and African countries. After the sequencing of the cucumber (Cucumis sativus, watermelon (Citrullus lanatus, and melon (Cucumis melo genomes, bitter gourd became the fourth cucurbit species whose whole genome was sequenced. However, a comprehensive analysis of simple sequence repeats (SSRs in bitter gourd, including a comparison with the three aforementioned cucurbit species has not yet been published. Here, we identified a total of 188,091 and 167,160 SSR motifs in the genomes of the bitter gourd lines ‘Dali-11’ and ‘OHB3-1,’ respectively. Subsequently, the SSR content, motif lengths, and classified motif types were characterized for the bitter gourd genomes and compared among all the cucurbit genomes. Lastly, a large set of 138,727 unique in silico SSR primer pairs were designed for bitter gourd. Among these, 71 primers were selected, all of which successfully amplified SSRs from the two bitter gourd lines ‘Dali-11’ and ‘K44’. To further examine the utilization of unique SSR primers, 21 SSR markers were used to genotype a collection of 211 bitter gourd lines from all over the world. A model-based clustering method and phylogenetic analysis indicated a clear separation among the geographic groups. The genomic SSR markers developed in this study have considerable potential value in advancing bitter gourd research.
Protein domains involved in both in vivo and in vitro interactions between human T-cell leukemia virus type I tax and CREB.

Science.gov (United States)

Yin, M J; Paulssen, E J; Seeler, J S; Gaynor, R B

1995-06-01

Gene expression from the human T-cell leukemia virus type I (HTLV-I) long terminal repeat (LTR) is mediated by three cis-acting regulatory elements known as 21-bp repeats and the transactivator protein Tax. The 21-bp repeats can be subdivided into three motifs known as A, B, and C, each of which is important for maximal gene expression in response to Tax. The B motif contains nucleotide sequences known as a cyclic AMP response element (CRE) or tax-response element which binds members of the ATF/CREB family of transcription factors. Though mutations of this element in the HTLV-I LTR eliminate tax activation, Tax will not activate most other promoters containing these CRE sites. In this study, we investigated the mechanism by which Tax activates gene expression in conjunction with members of the ATF/CREB family. We found that Tax enhanced the binding of one member of the ATF/CREB family, CREB 1, to each of the three HTLV-I LTR 21-bp repeats but not another member designated CRE-BP1 or CREB2. Tax enhanced the binding of CREB1 to nonpalindromic CRE binding sites such as those found in the HTLV-I LTR, but Tax did not enhance the binding of CREB1 to palindromic CRE binding sites such as found in the somatostatin promoter. This finding may help explain the failure of Tax to activate promoters containing consensus CRE sites. These studies were extended by use of the mammalian two-hybrid system. Tax was demonstrated to interact directly with CREB1 but not with other bZIP proteins, including CREB2 and Jun. Site-directed mutagenesis of both Tax and CREB1 demonstrated that the amino terminus of Tax and both the basic and the leucine zipper regions of CREB1 were required for direct interactions between these proteins both in vivo and in vitro. This interaction occurred in vivo and thus did not require the presence of the HTLV-I 21-bp repeats, as previously suggested. These results define the domains required for interaction between Tax and CREB that are likely critical for the
Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes

DEFF Research Database (Denmark)

Peng, Xu; Brügger, Kim; Shen, Biao

2003-01-01

terminally modified and corresponds to SSO454, an open reading frame of previously unassigned function. It binds specifically to DNA fragments carrying double and single repeat sequences, binding on one side of the repeat structure, and producing an opening of the opposite side of the DNA structure. It also...... recognizes both main families of repeat sequences in S. solfataricus. The recombinant protein, expressed in Escherichia coli, showed the same binding properties to the SRSR repeat as the native one. The SSO454 protein exhibits a tripartite internal repeat structure which yields a good sequence match...... with a helix-turn-helix DNA-binding motif. Although this putative motif is shared by other archaeal proteins, orthologs of SSO454 were only detected in species within the Sulfolobus genus and in the closely related Acidianus genus. We infer that the genus-specific protein induces an opening of the structure...
Systematic identification and characterization of regulatory elements derived from human endogenous retroviruses.

Directory of Open Access Journals (Sweden)

Jumpei Ito

2017-07-01

Full Text Available Human endogenous retroviruses (HERVs and other long terminal repeat (LTR-type retrotransposons (HERV/LTRs have regulatory elements that possibly influence the transcription of host genes. We systematically identified and characterized these regulatory elements based on publicly available datasets of ChIP-Seq of 97 transcription factors (TFs provided by ENCODE and Roadmap Epigenomics projects. We determined transcription factor-binding sites (TFBSs using the ChIP-Seq datasets and identified TFBSs observed on HERV/LTR sequences (HERV-TFBSs. Overall, 794,972 HERV-TFBSs were identified. Subsequently, we identified "HERV/LTR-shared regulatory element (HSRE," defined as a TF-binding motif in HERV-TFBSs, shared within a substantial fraction of a HERV/LTR type. HSREs could be an indication that the regulatory elements of HERV/LTRs are present before their insertions. We identified 2,201 HSREs, comprising specific associations of 354 HERV/LTRs and 84 TFs. Clustering analysis showed that HERV/LTRs can be grouped according to the TF binding patterns; HERV/LTR groups bounded to pluripotent TFs (e.g., SOX2, POU5F1, and NANOG, embryonic endoderm/mesendoderm TFs (e.g., GATA4/6, SOX17, and FOXA1/2, hematopoietic TFs (e.g., SPI1 (PU1, GATA1/2, and TAL1, and CTCF were identified. Regulatory elements of HERV/LTRs tended to locate nearby and/or interact three-dimensionally with the genes involved in immune responses, indicating that the regulatory elements play an important role in controlling the immune regulatory network. Further, we demonstrated subgroup-specific TF binding within LTR7, LTR5B, and LTR5_Hs, indicating that gains or losses of the regulatory elements occurred during genomic invasions of the HERV/LTRs. Finally, we constructed dbHERV-REs, an interactive database of HERV/LTR regulatory elements (http://herv-tfbs.com/. This study provides fundamental information in understanding the impact of HERV/LTRs on host transcription, and offers insights into
De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

Science.gov (United States)

Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

2013-01-01

Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176
Assembly of Repeat Content Using Next Generation Sequencing Data

Energy Technology Data Exchange (ETDEWEB)

labutti, Kurt; Kuo, Alan; Grigoriev, Igor; Copeland, Alex

2014-03-17

Repetitive organisms pose a challenge for short read assembly, and typically only unique regions and repeat regions shorter than the read length, can be accurately assembled. Recently, we have been investigating the use of Pacific Biosciences reads for de novo fungal assembly. We will present an assessment of the quality and degree of repeat reconstruction possible in a fungal genome using long read technology. We will also compare differences in assembly of repeat content using short read and long read technology.
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi and related species

Directory of Open Access Journals (Sweden)

Odvody Gary N

2008-11-01

Full Text Available Abstract Background A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites to detect differences at the DNA level. Results Among the 55 primers pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55% with dinucleotide repeats and 6 (11% with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40% and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR- DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis, sugar cane (P. sacchari, pearl millet (Sclerospora graminicola and rose (Peronospora sparsa indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellites primer combinations, with an average of 4.8 polymorphic markers per SSR across 34
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species.

Science.gov (United States)

Perumal, Ramasamy; Nimmakayala, Padmavathi; Erattaimuthu, Saradha R; No, Eun-Gyu; Reddy, Umesh K; Prom, Louis K; Odvody, Gary N; Luster, Douglas G; Magill, Clint W

2008-11-29

A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Among the 55 primers pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR- DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellites primer combinations, with an average of 4.8 polymorphic markers per SSR across 34 Peronosclerospora, Peronospora and Sclerospora
Direct repeat sequences are essential for function of the cis-acting locus of transfer (clt) of Streptomyces phaeochromogenes plasmid pJV1.

Science.gov (United States)

Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis

2003-11-01

The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.
Generating markers based on biotic stress of protein system in and tandem repeats sequence for Aquilaria sp

International Nuclear Information System (INIS)

Azhar Mohamad; Muhammad Hanif Azhari N; Siti Norhayati Ismail

2014-01-01

Aquilaria sp. belongs to the Thymelaeaceae family and is well distributed in Asia region. The species has multipurpose use from root to shoot and is an economically important crop, which generates wide interest in understanding genetic diversity of the species. Knowledge on DNA-based markers has become a prerequisite for more effective application of molecular marker techniques in breeding and mapping programs. In this work, both targeted genes and tandem repeat sequences were used for DNA fingerprinting in Aquilaria sp. A total of 100 ISSR (inter simple sequence repeat) primers and 50 combination pairs of specific primers derived from conserved region of a specific protein known as system in were optimized. 38 ISSR primers were found affirmative for polymorphism evaluation study and were generated from both specific and degenerate ISSR primers. And one utmost combination of system in primers showed significant results in distinguishing the Aquilaria sp. In conclusion, polymorphism derived from ISSR profiling and targeted stress genes of protein system in proved as a powerful approach for identification and molecular classification of Aquilaria sp. which will be useful for diversification in identifying any mutant lines derived from nature. (author)

Genome-wide identification and validation of simple sequence repeats (SSRs) from Asparagus officinalis.

Science.gov (United States)

Li, Shufen; Zhang, Guojun; Li, Xu; Wang, Lianjun; Yuan, Jinhong; Deng, Chuanliang; Gao, Wujun

2016-06-01

Garden asparagus (Asparagus officinalis), an important vegetable cultivated worldwide, can also serve as a model dioecious plant species in the study of sex determination and sex chromosome evolution. However, limited DNA marker resources have been developed and used for this species. To expand these resources, we examined the DNA sequences for simple sequence repeats (SSRs) in 163,406 scaffolds representing approximately 400 Mbp of the A. officinalis genome. A total of 87,576 SSRs were identified in 59,565 scaffolds. The most abundant SSR repeats were trinucleotide and tetranucleotide, accounting for 29.2 and 29.1% of the total SSRs, respectively, followed by di-, penta-, hexa-, hepta-, and octanucleotides. The AG motif was most common among dinucleotides and was also the most frequent motif in the entire A. officinalis genome, representing 14.7% of all SSRs. A total of 41,917 SSR primers pairs were designed to amplify SSRs. Twenty-two genomic SSR markers were tested in 39 asparagus accessions belonging to ten cultivars and one accession of Asparagus setaceus for determination of genetic diversity. The intra-species polymorphism information content (PIC) values of the 22 genomic SSR markers were intermediate, with an average of 0.41. The genetic diversity between the ten A. officinalis cultivars was low, and the UPGMA dendrogram was largely unrelated to cultivars. It is here suggested that the sex of individuals is an important factor influencing the clustering results. The information reported here provides new information about the organization of the microsatellites in A. officinalis genome and lays a foundation for further genetic studies and breeding applications of A. officinalis and related species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Identification and characterisation of Short Interspersed Nuclear Elements in the olive tree (Olea europaea L.) genome.

Science.gov (United States)

Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Giordani, Tommaso; Cavallini, Andrea

2017-02-01

Short Interspersed Nuclear Elements (SINEs) are nonautonomous retrotransposons in the genome of most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, SINE identification has been carried out only in a limited number of plant species. This lack of information is apparent especially in non-model plants whose genome has not been sequenced yet. The aim of this work was to produce a specific bioinformatics pipeline for analysing second generation sequence reads of a non-model species and identifying SINEs. We have identified, for the first time, 227 putative SINEs of the olive tree (Olea europaea), that constitute one of the few sets of such sequences in dicotyledonous species. The identified SINEs ranged from 140 to 362 bp in length and were characterised with regard to the occurrence of the tRNA domain in their sequence. The majority of identified elements resulted in single copy or very lowly repeated, often in association with genic sequences. Analysis of sequence similarity allowed us to identify two major groups of SINEs showing different abundances in the olive tree genome, the former with sequence similarity to SINEs of Scrophulariaceae and Solanaceae and the latter to SINEs of Salicaceae. A comparison of sequence conservation between olive SINEs and LTR retrotransposon families suggested that SINE expansion in the genome occurred especially in very ancient times, before LTR retrotransposon expansion, and presumably before the separation of the rosids (to which Oleaceae belong) from the Asterids. Besides providing data on olive SINEs, our results demonstrate the suitability of the pipeline employed for SINE identification. Applying this pipeline will favour further structural and functional analyses on these relatively unknown elements to be performed also in other plant species, even in the absence of a reference genome, and will allow establishing general evolutionary patterns for this kind of repeats in
A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

Science.gov (United States)

Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

1994-09-01

A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
Identification and Mapping of Simple Sequence Repeat Markers from Common Bean (Phaseolus vulgaris L. Bacterial Artificial Chromosome End Sequences for Genome Characterization and Genetic–Physical Map Integration

Directory of Open Access Journals (Sweden)

Juana M. Córdoba

2010-11-01

Full Text Available Microsatellite markers or simple sequence repeat (SSR loci are useful for diversity characterization and genetic–physical mapping. Different in silico microsatellite search methods have been developed for mining bacterial artificial chromosome (BAC end sequences for SSRs. The overall goal of this study was genome characterization based on SSRs in 89,017 BAC end sequences (BESs from the G19833 common bean ( L. library. Another objective was to identify new SSR taking into account three tandem motif identification programs (Automated Microsatellite Marker Development [AMMD], Tandem Repeats Finder [TRF], and SSRLocator [SSRL]. Among the microsatellite search engines, SSRL identified the highest number of SSRs; however, when primer design was attempted, the number dropped due to poor primer design regions. Automated Microsatellite Marker Development software identified many SSRs with valuable AT/TA or AG/TC motifs, while TRF found fewer SSRs and produced no primers. A subgroup of 323 AT-rich, di-, and trinucleotide SSRs were selected from the AMMD results and used in a parental survey with DOR364 and G19833, of which 75 could be mapped in the corresponding population; these represented 4052 BAC clones. Together with 92 previously mapped BES- and 114 non-BES-derived markers, a total of 280 SSRs were included in the polymerase chain reaction (PCR-based map, integrating a total of 8232 BAC clones in 162 contigs from the physical map.
Structure, organization, and sequence of alpha satellite DNA from human chromosome 17: evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome.

Science.gov (United States)

Waye, J S; Willard, H F

1986-09-01

The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.
t2prhd: a tool to study the patterns of repeat evolution

Directory of Open Access Journals (Sweden)

Pénzes Zsolt

2008-01-01

Full Text Available Abstract Background The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. Results We have developed a novel rapid visual approach to study repeat evolution, that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III domain repeats of three selected mammalian Tenascin sequences. Conclusion Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe, that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families, as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts.
PpRT1: the first complete gypsy-like retrotransposon isolated in Pinus pinaster.

Science.gov (United States)

Rocheta, Margarida; Cordeiro, Jorge; Oliveira, M; Miguel, Célia

2007-02-01

We have isolated and characterized a complete retrotransposon sequence, named PpRT1, from the genome of Pinus pinaster. PpRT1 is 5,966 bp long and is closely related to IFG7 gypsy retrotransposon from Pinus radiata. The long terminal repeats (LTRs) have 333 bp each and show a 5.4% sequence divergence between them. In addition to the characteristic polypurine tract (PPT) and the primer binding site (PBS), PpRT1 carries internal regions with homology to retroviral genes gag and pol. The pol region contains sequence motifs related to the enzymes protease, reverse transcriptase, RNAseH and integrase in the same typical order known for Ty3/gypsy-like retrotransposons. PpRT1 was extended from an EST database sequence indicating that its transcription is occurring in pine tissues. Southern blot analyses indicate however, that PpRT1 is present in a unique or a low number of copies in the P. pinaster genome. The differences in nucleotide sequence found between PpRT1 and IFG7 may explain the strikingly different copy number in the two pine species genome. Based on the homologies observed when comparing LTR region among different gypsy elements we propose that the highly conserved LTR regions may be useful to amplify other retrotransposon sequences of the same or close retrotransposon family.
Selection of reliable reference genes for gene expression studies in Trichoderma afroharzianum LTR-2 under oxalic acid stress.

Science.gov (United States)

Lyu, Yuping; Wu, Xiaoqing; Ren, He; Zhou, Fangyuan; Zhou, Hongzi; Zhang, Xinjian; Yang, Hetong

2017-10-01

An appropriate reference gene is required to get reliable results from gene expression analysis by quantitative real-time reverse transcription PCR (qRT-PCR). In order to identify stable and reliable reference genes in Trichoderma afroharzianum under oxalic acid (OA) stress, six commonly used housekeeping genes, i.e., elongation factor 1, ubiquitin, ubiquitin-conjugating enzyme, glyceraldehyde-3-phosphate dehydrogenase, α-tubulin, actin, from the effective biocontrol isolate T. afroharzianum strain LTR-2 were tested for their expression during growth in liquid culture amended with OA. Four in silico programs (comparative ΔCt, NormFinder, geNorm and BestKeeper) were used to evaluate the expression stabilities of six candidate reference genes. The elongation factor 1 gene EF-1 was identified as the most stably expressed reference gene, and was used as the normalizer to quantify the expression level of the oxalate decarboxylase coding gene OXDC in T. afroharzianum strain LTR-2 under OA stress. The result showed that the expression of OXDC was significantly up-regulated as expected. This study provides an effective method to quantify expression changes of target genes in T. afroharzianum under OA stress. Copyright © 2017 Elsevier B.V. All rights reserved.
Proliferation of Endogenous Retroviruses in the Early Stages of a Host Germ Line Invasion

Science.gov (United States)

Ishida, Yasuko; Zhao, Kai; Greenwood, Alex D.; Roca, Alfred L.

2015-01-01

Endogenous retroviruses (ERVs) comprise 8% of the human genome and are common in all vertebrate genomes. The only retrovirus known to be currently transitioning from exogenous to endogenous form is the koala retrovirus (KoRV), making koalas (Phascolarctos cinereus) ideal for examining the early stages of retroviral endogenization. To distinguish endogenous from exogenous KoRV proviruses, we isolated koala genomic regions flanking KoRV integration sites. In three wild southern Australian koalas, there were fewer KoRV loci than in three captive Queensland koalas, consistent with reports that southern Australian koalas carry fewer KoRVs. Of 39 distinct KoRV proviral loci examined in a sire–dam–progeny triad, all proved to be vertically transmitted and endogenous; none was exogenous. Of the 39 endogenous KoRVs (enKoRVs), only one was present in the genomes of both the sire and the dam, suggesting that, at this early stage in the retroviral invasion of a host germ line, very large numbers of ERVs have proliferated at very low frequencies in the koala population. Sequence divergence between the 5′- and 3′-long terminal repeats (LTRs) of a provirus can be used as a molecular clock. Within each of ten enKoRVs, the 5′-LTR sequence was identical to the 3′-LTR sequence, suggesting a maximum age for enKoRV invasion of the koala germ line of approximately 22,200–49,900 years ago, although a much younger age is possible. Across the ten proviruses, seven LTR haplotypes were detected, indicating that at least seven different retroviral sequences had entered the koala germ line. PMID:25261407
Sequence determinants of human microsatellite variability

Directory of Open Access Journals (Sweden)

Jakobsson Mattias

2009-12-01

Full Text Available Abstract Background Microsatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database. Results Calibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length, under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity. Conclusions These results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.
Different histories of two highly variable LTR retrotransposons in sunflower species.

Science.gov (United States)

Mascagni, Flavia; Cavallini, Andrea; Giordani, Tommaso; Natali, Lucia

2017-11-15

In the Helianthus genus, very large intra- and interspecific variability related to two specific retrotransposons of Helianthus annuus (Helicopia and SURE) exists. When comparing these two sequences to sunflower sequence databases recently produced by our lab, the Helicopia family was shown to belong to the Maximus/SIRE lineage of the Sirevirus genus of the Copia superfamily, whereas the SURE element (whose superfamily was not even previously identified) was classified as a Gypsy element of the Ogre/Tat lineage of the Metavirus genus. Bioinformatic analysis of the two retrotransposon families revealed their genomic abundance and relative proliferation timing. The genomic abundance of these families differed significantly among 12 Helianthus species. The ratio between the abundance of long terminal repeats and their reverse transcriptases suggested that the SURE family has relatively more solo long terminal repeats than does Helicopia. Pairwise comparisons of Illumina reads encoding the reverse transcriptase domain indicated that SURE amplification may have occurred more recently than that of Helicopia. Finally, the analysis of population structure based on the SURE and Helicopia polymorphisms of 32 Helianthus species evidenced two subpopulations, which roughly corresponded to species of the Helianthus and Divaricati/Ciliares sections. However, a number of species showed an admixed structure, confirming the importance of interspecific hybridisation in the evolution of this genus. In general, these two retrotransposon families differentially contributed to interspecific variability, emphasising the need to refer to specific families when studying genome evolution. Copyright © 2017 Elsevier B.V. All rights reserved.
The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

Science.gov (United States)

Nallaseth, Ferez Soli

The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1
Novel 3′-Processing Integrase Activity Assay by Real-Time PCR for Screening and Identification of HIV-1 Integrase Inhibitors

Directory of Open Access Journals (Sweden)

Supachai Sakkhachornphop

2015-01-01

Full Text Available The 3′-end processing (3′P of each viral long terminal repeat (LTR during human immunodeficiency virus type-1 (HIV-1 integration is a vital step in the HIV life cycle. Blocking the 3′P using 3′P inhibitor has recently become an attractive strategy for HIV-1 therapeutic intervention. Recently, we have developed a novel real-time PCR based assay for the detection of 3′P activity in vitro. The methodology usually involves biotinylated HIV-1 LTR, HIV-1 integrase (IN, and specific primers and probe. In this novel assay, we designed the HIV-1 LTR substrate based on a sequence with a homology to HIV-1 LTR labeled at its 3′ end with biotin on the sense strand. Two nucleotides at the 3′ end were subsequently removed by IN activity. Only two nucleotides labeled biotin were captured on an avidin-coated tube; therefore, inhibiting the binding of primers and probe results in late signals in the real-time PCR. This novel assay has successfully detected both the 3′P activity of HIV-1 IN and the anti-IN activity by Raltegravir and sodium azide agent. This real-time PCR assay has been shown to be effective and inexpensive for a high-throughput screening of novel IN inhibitors.
Development of Simple Sequence Repeats (SSR) markers in Setaria italica (Poaceae) and cross-amplification in related species.

Science.gov (United States)

Lin, Heng-Sheng; Chiang, Chih-Yun; Chang, Song-Bin; Kuoh, Chang-Sheng

2011-01-01

Foxtail millet is one of the world's oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR) markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (N(a)), the average heterozygosities observed (H(o)) and expected (H(e)) are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae.
Detection, characterization and evolution of internal repeats in Chitinases of known 3-D structure.

Directory of Open Access Journals (Sweden)

Manigandan Sivaji

Full Text Available Chitinase proteins have evolved and diversified almost in all organisms ranging from prokaryotes to eukaryotes. During evolution, internal repeats may appear in amino acid sequences of proteins which alter the structural and functional features. Here we deciphered the internal repeats from Chitinase and characterized the structural similarities between them. Out of 24 diverse Chitinase sequences selected, six sequences (2CJL, 2DSK, 2XVP, 2Z37, 3EBV and 3HBE did not contain any internal repeats of amino acid sequences. Ten sequences contained repeats of length <50, and the remaining 8 sequences contained repeat length between 50 and 100 residues. Two Chitinase sequences, 1ITX and 3SIM, were found to be structurally similar when analyzed using secondary structure of Chitinase from secondary and 3-Dimensional structure database of Protein Data Bank. Internal repeats of 3N17 and 1O6I were also involved in the ligand-binding site of those Chitinase proteins, respectively. Our analyses enhance our understanding towards the identification of structural characteristics of internal repeats in Chitinase proteins.
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

Science.gov (United States)

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

2016-01-07

The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
Analysis of simple sequence repeats in rice bean (Vigna umbellata using an SSR-enriched library

Directory of Open Access Journals (Sweden)

Lixia Wang

2016-02-01

Full Text Available Rice bean (Vigna umbellata Thunb., a warm-season annual legume, is grown in Asia mainly for dried grain or fodder and plays an important role in human and animal nutrition because the grains are rich in protein and some essential fatty acids and minerals. With the aim of expediting the genetic improvement of rice bean, we initiated a project to develop genomic resources and tools for molecular breeding in this little-known but important crop. Here we report the construction of an SSR-enriched genomic library from DNA extracted from pooled young leaf tissues of 22 rice bean genotypes and developing SSR markers. In 433,562 reads generated by a Roche 454 GS-FLX sequencer, we identified 261,458 SSRs, of which 48.8% were of compound form. Dinucleotide repeats were predominant with an absolute proportion of 81.6%, followed by trinucleotides (17.8%. Other types together accounted for 0.6%. The motif AC/GT accounted for 77.7% of the total, followed by AAG/CTT (14.3%, and all others accounted for 12.0%. Among the flanking sequences, 2928 matched putative genes or gene models in the protein database of Arabidopsis thaliana, corresponding with 608 non-redundant Gene Ontology terms. Of these sequences, 11.2% were involved in cellular components, 24.2% were involved molecular functions, and 64.6% were associated with biological processes. Based on homolog analysis, 1595 flanking sequences were similar to mung bean and 500 to common bean genomic sequences. Comparative mapping was conducted using 350 sequences homologous to both mung bean and common bean sequences. Finally, a set of primer pairs were designed, and a validation test showed that 58 of 220 new primers can be used in rice bean and 53 can be transferred to mung bean. However, only 11 were polymorphic when tested on 32 rice bean varieties. We propose that this study lays the groundwork for developing novel SSR markers and will enhance the mapping of qualitative and quantitative traits and marker
Single Strand Annealing Plays a Major Role in RecA-Independent Recombination between Repeated Sequences in the Radioresistant Deinococcus radiodurans Bacterium.

Directory of Open Access Journals (Sweden)

Solenne Ithurbide

2015-10-01

Full Text Available The bacterium Deinococcus radiodurans is one of the most radioresistant organisms known. It is able to reconstruct a functional genome from hundreds of radiation-induced chromosomal fragments. Our work aims to highlight the genes involved in recombination between 438 bp direct repeats separated by intervening sequences of various lengths ranging from 1,479 bp to 10,500 bp to restore a functional tetA gene in the presence or absence of radiation-induced DNA double strand breaks. The frequency of spontaneous deletion events between the chromosomal direct repeats were the same in recA+ and in ΔrecA, ΔrecF, and ΔrecO bacteria, whereas recombination between chromosomal and plasmid DNA was shown to be strictly dependent on the RecA and RecF proteins. The presence of mutations in one of the repeated sequence reduced, in a MutS-dependent manner, the frequency of the deletion events. The distance between the repeats did not influence the frequencies of deletion events in recA+ as well in ΔrecA bacteria. The absence of the UvrD protein stimulated the recombination between the direct repeats whereas the absence of the DdrB protein, previously shown to be involved in DNA double strand break repair through a single strand annealing (SSA pathway, strongly reduces the frequency of RecA- (and RecO- independent deletions events. The absence of the DdrB protein also increased the lethal sectoring of cells devoid of RecA or RecO protein. γ-irradiation of recA+ cells increased about 10-fold the frequencies of the deletion events, but at a lesser extend in cells devoid of the DdrB protein. Altogether, our results suggest a major role of single strand annealing in DNA repeat deletion events in bacteria devoid of the RecA protein, and also in recA+ bacteria exposed to ionizing radiation.
Automated genotyping of dinucleotide repeat markers

Energy Technology Data Exchange (ETDEWEB)

Perlin, M.W.; Hoffman, E.P. [Carnegie Mellon Univ., Pittsburgh, PA (United States)]|[Univ. of Pittsburgh, PA (United States)

1994-09-01

The dinucleotide repeats (i.e., microsatellites) such as CA-repeats are a highly polymorphic, highly abundant class of PCR-amplifiable markers that have greatly streamlined genetic mapping experimentation. It is expected that over 30,000 such markers (including tri- and tetranucleotide repeats) will be characterized for routine use in the next few years. Since only size determination, and not sequencing, is required to determine alleles, in principle, dinucleotide repeat genotyping is easily performed on electrophoretic gels, and can be automated using DNA sequencers. Unfortunately, PCR stuttering with these markers generates not one band for each allele, but a pattern of bands. Since closely spaced alleles must be disambiguated by human scoring, this poses a key obstacle to full automation. We have developed methods that overcome this obstacle. Our model is that the observed data is generated by arithmetic superposition (i.e., convolution) of multiple allele patterns. By quantitatively measuring the size of each component band, and exploiting the unique stutter pattern associated with each marker, closely spaced alleles can be deconvolved; this unambiguously reconstructs the {open_quotes}true{close_quotes} allele bands, with stutter artifact removed. We used this approach in a system for automated diagnosis of (X-linked) Duchenne muscular dystrophy; four multiplexed CA-repeats within the dystrophin gene were assayed on a DNA sequencer. Our method accurately detected small variations in gel migration that shifted the allele size estimate. In 167 nonmutated alleles, 89% (149/167) showed no size variation, 9% (15/167) showed 1 bp variation, and 2% (3/167) showed 2 bp variation. We are currently developing a library of dinucleotide repeat patterns; together with our deconvolution methods, this library will enable fully automated genotyping of dinucleotide repeats from sizing data.
A yeast model for target-primed (non-LTR retrotransposition

Directory of Open Access Journals (Sweden)

Busby Jason N

2007-08-01

Full Text Available Abstract Background Target-primed (non-LTR retrotransposons, such as the human L1 element, are mobile genetic elements found in many eukaryotic genomes. They are often present in large numbers and their retrotransposition can cause mutations and genomic rearrangements. Despite their importance, many aspects of their replication are not well understood. Results We have developed a yeast model system for studying target-primed retrotransposons. This system uses the Zorro3 element from Candida albicans. A cloned copy of Zorro3, tagged with a retrotransposition indicator gene, retrotransposes at a high frequency when introduced into an appropriate C. albicans host strain. Retrotransposed copies of the tagged element exhibit similar features to the native copies, indicating that the natural retrotransposition pathway is being used. Retrotransposition is dependent on the products of the tagged element's own genes and is highly temperature-regulated. The new assay permits the analysis of the effects of specific mutations introduced into the cloned element. Conclusion This Zorro3 retrotransposition assay system complements previously available target-primed retrotransposition assays. Due to the relative simplicity of the growth, manipulation and analysis of yeast cells, the system should advance our understanding of target-primed retrotransposition.

Ulysses transposable element of Drosophila shows high structural similarities to functional domains of retroviruses.

Science.gov (United States)

Evgen'ev, M B; Corces, V G; Lankenau, D H

1992-06-05

We have determined the DNA structure of the Ulysses transposable element of Drosophila virilis and found that this transposon is 10,653 bp and is flanked by two unusually large direct repeats 2136 bp long. Ulysses shows the characteristic organization of LTR-containing retrotransposons, with matrix and capsid protein domains encoded in the first open reading frame. In addition, Ulysses contains protease, reverse transcriptase, RNase H and integrase domains encoded in the second open reading frame. Ulysses lacks a third open reading frame present in some retrotransposons that could encode an env-like protein. A dendrogram analysis based on multiple alignments of the protease, reverse transcriptase, RNase H, integrase and tRNA primer binding site of all known Drosophila LTR-containing retrotransposon sequences establishes a phylogenetic relationship of Ulysses to other retrotransposons and suggests that Ulysses belongs to a new family of this type of elements.
In vivo activation of human immunodeficiency virus type 1 long terminal repeat by UV type A (UV-A) light plus psoralen and UV-B light in the skin of transgenic mice

OpenAIRE

Morrey, John D; Bourn, S M; Bunch, T D; Jackson, M K; Sidwell, R W; Barrows, L R; Daynes, R A; Rosen, C A

1991-01-01

UV irradiation has been shown to activate the human immunodeficiency virus type 1 (HIV-1) long terminal repeat (LTR) in cell culture; however, only limited studies have been described in vivo. UV light has been categorized as UV-A (400 to 315 nm), -B (315 to 280 nm), or -C (less than 280 nm); the longer wavelengths are less harmful but more penetrative. Highly penetrative UV-A radiation constitutes the vast majority of UV sunlight reaching the earth's surface but is normally harmless. UV-B ir...
Proliferation of endogenous retroviruses in the early stages of a host germ line invasion.

Science.gov (United States)

Ishida, Yasuko; Zhao, Kai; Greenwood, Alex D; Roca, Alfred L

2015-01-01

Endogenous retroviruses (ERVs) comprise 8% of the human genome and are common in all vertebrate genomes. The only retrovirus known to be currently transitioning from exogenous to endogenous form is the koala retrovirus (KoRV), making koalas (Phascolarctos cinereus) ideal for examining the early stages of retroviral endogenization. To distinguish endogenous from exogenous KoRV proviruses, we isolated koala genomic regions flanking KoRV integration sites. In three wild southern Australian koalas, there were fewer KoRV loci than in three captive Queensland koalas, consistent with reports that southern Australian koalas carry fewer KoRVs. Of 39 distinct KoRV proviral loci examined in a sire-dam-progeny triad, all proved to be vertically transmitted and endogenous; none was exogenous. Of the 39 endogenous KoRVs (enKoRVs), only one was present in the genomes of both the sire and the dam, suggesting that, at this early stage in the retroviral invasion of a host germ line, very large numbers of ERVs have proliferated at very low frequencies in the koala population. Sequence divergence between the 5'- and 3'-long terminal repeats (LTRs) of a provirus can be used as a molecular clock. Within each of ten enKoRVs, the 5'-LTR sequence was identical to the 3'-LTR sequence, suggesting a maximum age for enKoRV invasion of the koala germ line of approximately 22,200-49,900 years ago, although a much younger age is possible. Across the ten proviruses, seven LTR haplotypes were detected, indicating that at least seven different retroviral sequences had entered the koala germ line. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development of Simple Sequence Repeats (SSR Markers in Setaria italica (Poaceae and Cross-Amplification in Related Species

Directory of Open Access Journals (Sweden)

Chih-Yun Chiang

2011-11-01

Full Text Available Foxtail millet is one of the world’s oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21% and CAT (46.15%. The average number of alleles (Na, the average heterozygosities observed (Ho and expected (He are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae.
The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

Science.gov (United States)

Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

2013-10-10

Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae
Roles of repetitive sequences

Energy Technology Data Exchange (ETDEWEB)

Bell, G.I.

1991-12-31

The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.
Bond graph modeling and LQG/LTR controller design of magnetically levitation systems

International Nuclear Information System (INIS)

Kim, Jong Shik; Park, Jeon Soo

1991-01-01

A logical and systematic procedure to derive a mathematical model for magnetically levitation (MAGLEV) systems with a combined lift and guidance is developed by using bond graph modeling techniques. First, bond graph is contructed for the 1 st -dimensional MAGLEV system in which three subsystems (energy feeding, track and vehicle) are considered. And, the 2 nd -dimensional MAGLEV system in which lift and guidance dynamics are coupled is modeled by using the concept of multi-port field in bond graph languages. Finally, the LQG/LTR control system is designed for a multivariable MAGLEV system with stagger configuration type. In this paper, it has been shown that the bond graph is an excellent effective method for modeling multi-energy domain systems such as MAGLEV systems with uncertainties such as mass variations, track irregularities and wind gusts. (Author)
Bond graph modeling and LQG/LTR controller design of magnetically levitation systems

Energy Technology Data Exchange (ETDEWEB)

Kim, Jong Shik; Park, Jeon Soo [Busan National Univ. (Korea, Republic of)

1991-09-01

A logical and systematic procedure to derive a mathematical model for magnetically levitation (MAGLEV) systems with a combined lift and guidance is developed by using bond graph modeling techniques. First, bond graph is contructed for the 1{sup st}-dimensional MAGLEV system in which three subsystems (energy feeding, track and vehicle) are considered. And, the 2{sup nd}-dimensional MAGLEV system in which lift and guidance dynamics are coupled is modeled by using the concept of multi-port field in bond graph languages. Finally, the LQG/LTR control system is designed for a multivariable MAGLEV system with stagger configuration type. In this paper, it has been shown that the bond graph is an excellent effective method for modeling multi-energy domain systems such as MAGLEV systems with uncertainties such as mass variations, track irregularities and wind gusts. (Author).
Nonlinear analysis of sequence repeats of multi-domain proteins

Energy Technology Data Exchange (ETDEWEB)

Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

2007-11-15

Many multi-domain proteins have repetitive three-dimensional structures but nearly-random amino acid sequences. In the present paper, by using a modified recurrence plot proposed by us previously, we show that these amino acid sequences have hidden repetitions in fact. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.
Young, intact and nested retrotransposons are abundant in the onion and asparagus genomes.

Science.gov (United States)

Vitte, C; Estep, M C; Leebens-Mack, J; Bennetzen, J L

2013-09-01

Although monocotyledonous plants comprise one of the two major groups of angiosperms and include >65 000 species, comprehensive genome analysis has been focused mainly on the Poaceae (grass) family. Due to this bias, most of the conclusions that have been drawn for monocot genome evolution are based on grasses. It is not known whether these conclusions apply to many other monocots. To extend our understanding of genome evolution in the monocots, Asparagales genomic sequence data were acquired and the structural properties of asparagus and onion genomes were analysed. Specifically, several available onion and asparagus bacterial artificial chromosomes (BACs) with contig sizes >35 kb were annotated and analysed, with a particular focus on the characterization of long terminal repeat (LTR) retrotransposons. The results reveal that LTR retrotransposons are the major components of the onion and garden asparagus genomes. These elements are mostly intact (i.e. with two LTRs), have mainly inserted within the past 6 million years and are piled up into nested structures. Analysis of shotgun genomic sequence data and the observation of two copies for some transposable elements (TEs) in annotated BACs indicates that some families have become particularly abundant, as high as 4-5 % (asparagus) or 3-4 % (onion) of the genome for the most abundant families, as also seen in large grass genomes such as wheat and maize. Although previous annotations of contiguous genomic sequences have suggested that LTR retrotransposons were highly fragmented in these two Asparagales genomes, the results presented here show that this was largely due to the methodology used. In contrast, this current work indicates an ensemble of genomic features similar to those observed in the Poaceae.
Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

Science.gov (United States)

Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

2016-09-01

Genomic information about Muscovy duck parvovirus is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strain FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based the protein coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion of ITR may contribute to the genome evolution of MDPV under immunization pressure.
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

Science.gov (United States)

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
Local repeat sequence organization of an intergenic spacer

Indian Academy of Sciences (India)

The amplification yielded the same uniquely ``sequence-scrambled” product, whether the template used for PCR was total cellular DNA, chloroplast DNA or a plasmid clone DNA corresponding to that region. The PCR product, a ``unique” new sequence, had lost the repetitive organization of the template genome where it ...
The soybean-Phytophthora resistance locus Rps1-k encompasses coiled coil-nucleotide binding-leucine rich repeat-like genes and repetitive sequences

Directory of Open Access Journals (Sweden)

Bhattacharyya Madan K

2008-03-01

Full Text Available Abstract Background A series of Rps (resistance to Pytophthora sojae genes have been protecting soybean from the root and stem rot disease caused by the Oomycete pathogen, Phytophthora sojae. Five Rps genes were mapped to the Rps1 locus located near the 28 cM map position on molecular linkage group N of the composite genetic soybean map. Among these five genes, Rps1-k was introgressed from the cultivar, Kingwa. Rps1-k has been providing stable and broad-spectrum Phytophthora resistance in the major soybean-producing regions of the United States. Rps1-k has been mapped and isolated. More than one functional Rps1-k gene was identified from the Rps1-k locus. The clustering feature at the Rps1-k locus might have facilitated the expansion of Rps1-k gene numbers and the generation of new recognition specificities. The Rps1-k region was sequenced to understand the possible evolutionary steps that shaped the generation of Phytophthora resistance genes in soybean. Results Here the analyses of sequences of three overlapping BAC clones containing the 184,111 bp Rps1-k region are reported. A shotgun sequencing strategy was applied in sequencing the BAC contig. Sequence analysis predicted a few full-length genes including two Rps1-k genes, Rps1-k-1 and Rps1-k-2. Previously reported Rps1-k-3 from this genomic region 1 was evolved through intramolecular recombination between Rps1-k-1 and Rps1-k-2 in Escherichia coli. The majority of the predicted genes are truncated and therefore most likely they are nonfunctional. A member of a highly abundant retroelement, SIRE1, was identified from the Rps1-k region. The Rps1-k region is primarily composed of repetitive sequences. Sixteen simple repeat and 63 tandem repeat sequences were identified from the locus. Conclusion These data indicate that the Rps1 locus is located in a gene-poor region. The abundance of repetitive sequences in the Rps1-k region suggested that the location of this locus is in or near a
Fingerprinting for discriminating tea germplasm using inter-simple sequence repeat (ISSR) markers

International Nuclear Information System (INIS)

Liu, B.Y.; Li, Y.Y.; Wang, P.S.; Wang, L.Y.; Wang, P.S.

2012-01-01

For the discrimination of tea germplasm at the inter-specific level, 134 tea varieties preserved in the China National Germplasm Tea Repositories (CNGTR) were analyzed using inter simple sequence repeat (ISSR) markers. Eighteen primers were chosen from 60 screened for ISSR amplification, generating 99.4% polymorphic bands. The mean Nei's gene diversity (H) and the overall mean Shannon's Information index (I) were 0.396 and 0.578, respectively, indicating a wide gene pool. Using the presence, sometimes absence of unique ISSR markers, it was possible to discriminate 32 of the genotypes tested. No single primer could discriminate all the 134 genotypes. However, UBC811 provided rich band patterns and it can discriminate 35 genotypes. The combination of two and three primers could discriminate 99 and 121 genotypes, respectively. Furthermore, the combination of band patterns or the DNA fingerprinting based on specific ISSR markers generated by UBC811, UBC835, ISSR2 and ISSR3 could discriminate all 134 genotypes tested. ISSR markers also provide a powerful tool to discriminate tea germplasm at the inter-specific level. (author)
FIV establishes a latent infection in feline peripheral blood CD4+ T lymphocytes in vivo during the asymptomatic phase of infection

Directory of Open Access Journals (Sweden)

Murphy Brian

2012-02-01

Full Text Available Abstract Background Feline immunodeficiency virus (FIV is a lentivirus of cats that establishes a lifelong persistent infection with immunologic impairment. Results In an approximately 2 year-long experimental infection study, cats infected with a biological isolate of FIV clade C demonstrated undetectable plasma viral loads from 10 months post-infection onward. Viral DNA was detected in CD4+CD25+ and CD4+CD25- T cells isolated from infected cats whereas viral RNA was not detected at multiple time points during the early chronic phase of infection. Viral transcription could be reactivated in latently infected CD4+ T cells ex vivo as demonstrated by detectable FIV gag RNA and 2-long terminal repeat (LTR circle junctions. Viral LTR and gag sequences amplified from peripheral blood mononuclear cells during early and chronic stages of infection demonstrated minimal to no viral sequence variation. Conclusions Collectively, these findings are consistent with FIV latency in peripheral blood CD4+ T cells isolated from chronically infected cats. The ability to isolate latently FIV-infected CD4+ T lymphocytes from FIV-infected cats provides a platform for the study of in vivo mechanisms of lentiviral latency.
Repeat-aware modeling and correction of short read errors.

Science.gov (United States)

Yang, Xiao; Aluru, Srinivas; Dorman, Karin S

2011-02-15

High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id = redeem". We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors
Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers.

Science.gov (United States)

Kumari, P; Basal, N; Singh, A K; Rai, V P; Srivastava, C P; Singh, P K

2013-03-13

The genetic diversity among 28 pea (Pisum sativum L.) genotypes was analyzed using 32 simple sequence repeat markers. A total of 44 polymorphic bands, with an average of 2.1 bands per primer, were obtained. The polymorphism information content ranged from 0.657 to 0.309 with an average of 0.493. The variation in genetic diversity among these cultivars ranged from 0.11 to 0.73. Cluster analysis based on Jaccard's similarity coefficient using the unweighted pair-group method with arithmetic mean (UPGMA) revealed 2 distinct clusters, I and II, comprising 6 and 22 genotypes, respectively. Cluster II was further differentiated into 2 subclusters, IIA and IIB, with 12 and 10 genotypes, respectively. Principal component (PC) analysis revealed results similar to those of UPGMA. The first, second, and third PCs contributed 21.6, 16.1, and 14.0% of the variation, respectively; cumulative variation of the first 3 PCs was 51.7%.
Initial study of stability and repeatability of measuring R2' and oxygen extraction fraction values in the healthy brain with gradient-echo sampling of spin-echo sequence

International Nuclear Information System (INIS)

Hui Lihong; Zhang Xiaodong; He Chao; Xie Sheng; Xiao Jiangxi; Zhang jue; Wang Xiaoying; Jiang Xuexiang

2010-01-01

Objective: To evaluate the stability and repeatability of gradient-echo sampling of spin- echo (GESSE) sequence in measuring the R 2 ' value in volunteers, by comparison with traditional GRE sequence (T 2 * ]nap and T 2 map). Methods: Eight normal healthy volunteers were enrolled in this study and written informed consents were obtained from all subjects. MR scanning including sequences of GESSE, T 2 map and T 2 * map were performed in these subjects at resting status. The same protocol was repeated one day later. Raw data from GESSE sequence were transferred to PC to conduct postprocessing with the software built in house. R 2 ' map and OEF map were got consequently. To obtain quantitative R 2 ' and OEF values in the brain parenchyma, six ROIs were equally placed in the anterior, middle and posterior part of bilateral hemispheres. Both mean and standard deviation of R 2 ' and OEF were recorded. All images from T 2 * map and T 2 map were transferred to the Workstation for postprocessing. The ROIs were put at the same areas as those for GESSE sequence. R 2 ' is defined as R 2 ' = R 2 * - R 2 , R 2 * = 1/T 2 * . The R 2 ' value of GESSE sequence were compared with that of GRE sequence. Results: The mean R 2 ' values of GESSE at the first and second scan and those of the GRE were (4.21±0.92), (4.45±0.94) Hz and (7.37±1.47), (6.42±2.33) Hz respectively. The mean OEF values of GESSE at the first and second scan is 0.327±0.036 and 0.336± 0.035 respectively. The R 2 ' value and OEF value obtained from GESSE were not significantly different between the first and second scan (t=-0.83, -1.48, P>0.05). The R 2 ' value of first GRE imaging had significantly statistical difference from that of second GRE imaging (t=1.80, P 2 ' value of GESSE sequence was less than that of GRE sequence, and there was significantly statistical difference between them (t=1.71, P<0.05). Conclusion: The GESSE sequence has good stability and repeatability with promising clinical practicability
Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

Science.gov (United States)

Cech, Jennifer N; Peichel, Catherine L

2015-12-01

Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chomatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.

Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Directory of Open Access Journals (Sweden)

Huaiyong Luo

Full Text Available The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Science.gov (United States)

Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

2015-01-01

The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes.

Science.gov (United States)

Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H; Cavallini, Andrea; Natali, Lucia

2015-11-24

The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes--7 wild accessions and 8 cultivars--of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
[Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

Science.gov (United States)

Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

2015-04-01

This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
Comparative genomic analysis reveals multiple long terminal repeats, lineage-specific amplification, and frequent interelement recombination for Cassandra retrotransposon in pear (Pyrus bretschneideri Rehd.).

Science.gov (United States)

Yin, Hao; Du, Jianchang; Li, Leiting; Jin, Cong; Fan, Lian; Li, Meng; Wu, Jun; Zhang, Shaoling

2014-06-04

Cassandra transposable elements belong to a specific group of terminal-repeat retrotransposons in miniature (TRIM). Although Cassandra TRIM elements have been found in almost all vascular plants, detailed investigations on the nature, abundance, amplification timeframe, and evolution have not been performed in an individual genome. We therefore conducted a comprehensive analysis of Cassandra retrotransposons using the newly sequenced pear genome along with four other Rosaceae species, including apple, peach, mei, and woodland strawberry. Our data reveal several interesting findings for this particular retrotransposon family: 1) A large number of the intact copies contain three, four, or five long terminal repeats (LTRs) (∼20% in pear); 2) intact copies and solo LTRs with or without target site duplications are both common (∼80% vs. 20%) in each genome; 3) the elements exhibit an overall unbiased distribution among the chromosomes; 4) the elements are most successfully amplified in pear (5,032 copies); and 5) the evolutionary relationships of these elements vary among different lineages, species, and evolutionary time. These results indicate that Cassandra retrotransposons contain more complex structures (elements with multiple LTRs) than what we have known previously, and that frequent interelement unequal recombination followed by transposition may play a critical role in shaping and reshaping host genomes. Thus this study provides insights into the property, propensity, and molecular mechanisms governing the formation and amplification of Cassandra retrotransposons, and enhances our understanding of the structural variation, evolutionary history, and transposition process of LTR retrotransposons in plants. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Mutations in the Lactococcus lactis Ll.LtrB group II intron that retain mobility in vivo

Directory of Open Access Journals (Sweden)

D'Souza Lisa M

2002-12-01

Full Text Available Abstract Background Group II introns are mobile genetic elements that form conserved secondary and tertiary structures. In order to determine which of the conserved structural elements are required for mobility, a series of domain and sub-domain deletions were made in the Lactococcus lactis group II intron (Ll.LtrB and tested for mobility in a genetic assay. Point mutations in domains V and VI were also tested. Results The largest deletion that could be made without severely compromising mobility was 158 nucleotides in DIVb(1–2. This mutant had a mobility frequency comparable to the wild-type Ll.LtrB intron (ΔORF construct. Hence, all subsequent mutations were done in this mutant background. Deletion of DIIb reduced mobility to approximately 18% of wild-type, while another deletion in domain II (nts 404–459 was mobile to a minor extent. Only two deletions in DI and none in DIII were tolerated. Some mobility was also observed for a DIVa deletion mutant. Of the three point mutants at position G3 in DV, only G3A retained mobility. In DVI, deletion of the branch-point nucleotide abolished mobility, but the presence of any nucleotide at the branch-point position restored mobility to some extent. Conclusions The smallest intron capable of efficient retrohoming was 725 nucleotides, comprising the DIVb(1–2 and DII(iia,b deletions. The tertiary elements found to be nonessential for mobility were alpha, kappa and eta. In DV, only the G3A mutant was mobile. A branch-point residue is required for intron mobility.
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

Science.gov (United States)

Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

2017-01-01

TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
Utilization of a cloned alphoid repeating sequence of human DNA in the study of polymorphism of chromosomal heterochromatin regions

International Nuclear Information System (INIS)

Kruminya, A.R.; Kroshkina, V.G.; Yurov, Yu.B.; Aleksandrov, I.A.; Mitkevich, S.P.; Gindilis, V.M.

1988-01-01

The chromosomal distribution of the cloned PHS05 fragment of human alphoid DNA was studied by in situ hybridization in 38 individuals. It was shown that this DNA fraction is primarily localized in the pericentric regions of practically all chromosomes of the set. Significant interchromosomal differences and a weakly expressed interindividual polymorphism were discovered in the copying ability of this class of repeating DNA sequences; associations were not found between the results of hybridization and the pattern of Q-polymorphism
Endogenous retrovirus insertion in the KIT oncogene determines white and white spotting in domestic cats.

Science.gov (United States)

David, Victor A; Menotti-Raymond, Marilyn; Wallace, Andrea Coots; Roelke, Melody; Kehler, James; Leighty, Robert; Eizirik, Eduardo; Hannah, Steven S; Nelson, George; Schäffer, Alejandro A; Connelly, Catherine J; O'Brien, Stephen J; Ryugo, David K

2014-08-01

The Dominant White locus (W) in the domestic cat demonstrates pleiotropic effects exhibiting complete penetrance for absence of coat pigmentation and incomplete penetrance for deafness and iris hypopigmentation. We performed linkage analysis using a pedigree segregating White to identify KIT (Chr. B1) as the feline W locus. Segregation and sequence analysis of the KIT gene in two pedigrees (P1 and P2) revealed the remarkable retrotransposition and evolution of a feline endogenous retrovirus (FERV1) as responsible for two distinct phenotypes of the W locus, Dominant White, and white spotting. A full-length (7125 bp) FERV1 element is associated with white spotting, whereas a FERV1 long terminal repeat (LTR) is associated with all Dominant White individuals. For purposes of statistical analysis, the alternatives of wild-type sequence, FERV1 element, and LTR-only define a triallelic marker. Taking into account pedigree relationships, deafness is genetically linked and associated with this marker; estimated P values for association are in the range of 0.007 to 0.10. The retrotransposition interrupts a DNAase I hypersensitive site in KIT intron 1 that is highly conserved across mammals and was previously demonstrated to regulate temporal and tissue-specific expression of KIT in murine hematopoietic and melanocytic cells. A large-population genetic survey of cats (n = 270), representing 30 cat breeds, supports our findings and demonstrates statistical significance of the FERV1 LTR and full-length element with Dominant White/blue iris (P < 0.0001) and white spotting (P < 0.0001), respectively. Copyright © 2014 David et al.
iPBS: a universal method for DNA fingerprinting and retrotransposon isolation.

Science.gov (United States)

Kalendar, Ruslan; Antonius, Kristiina; Smýkal, Petr; Schulman, Alan H

2010-11-01

Molecular markers are essential in plant and animal breeding and biodiversity applications, in human forensics, and for map-based cloning of genes. The long terminal repeat (LTR) retrotransposons are well suited as molecular markers. As dispersed and ubiquitous transposable elements, their "copy and paste" life cycle of replicative transposition leads to new genome insertions without excision of the original element. Both the overall structure of retrotransposons and the domains responsible for the various phases of their replication are highly conserved in all eukaryotes. Nevertheless, up to a year has been required to develop a retrotransposon marker system in a new species, involving cloning and sequencing steps as well as the development of custom primers. Here, we describe a novel PCR-based method useful both as a marker system in its own right and for the rapid isolation of retrotransposon termini and full-length elements, making it ideal for "orphan crops" and other species with underdeveloped marker systems. The method, iPBS amplification, is based on the virtually universal presence of a tRNA complement as a reverse transcriptase primer binding site (PBS) in LTR retrotransposons. The method differs from earlier retrotransposon isolation methods because it is applicable not only to endogenous retroviruses and retroviruses, but also to both Gypsy and Copia LTR retrotransposons, as well as to non-autonomous LARD and TRIM elements, throughout the plant kingdom and to animals. Furthermore, the inter-PBS amplification technique as such has proved to be a powerful DNA fingerprinting technology without the need for prior sequence knowledge.
In situ detection of tandem DNA repeat length

Energy Technology Data Exchange (ETDEWEB)

Yaar, R.; Szafranski, P.; Cantor, C.R.; Smith, C.L. [Boston Univ., MA (United States)

1996-11-01

A simple method for scoring short tandem DNA repeats is presented. An oligonucleotide target, containing tandem repeats embedded in a unique sequence, was hybridized to a set of complementary probes, containing tandem repeats of known lengths. Single-stranded loop structures formed on duplexes containing a mismatched (different) number of tandem repeats. No loop structure formed on duplexes containing a matched (identical) number of tandem repeats. The matched and mismatched loop structures were enzymatically distinguished and differentially labeled by treatment with S1 nuclease and the Klenow fragment of DNA polymerase. 7 refs., 4 figs.
In silico reversal of repeat-induced point mutation (RIP identifies the origins of repeat families and uncovers obscured duplicated genes

Directory of Open Access Journals (Sweden)

Hane James K

2010-11-01

Full Text Available Abstract Background Repeat-induced point mutation (RIP is a fungal genome defence mechanism guarding against transposon invasion. RIP mutates the sequence of repeated DNA and over time renders the affected regions unrecognisable by similarity search tools such as BLAST. Results DeRIP is a new software tool developed to predict the original sequence of a RIP-mutated region prior to the occurrence of RIP. In this study, we apply deRIP to the genome of the wheat pathogen Stagonospora nodorum SN15 and predict the origin of several previously uncharacterised classes of repetitive DNA. Conclusions Five new classes of transposon repeats and four classes of endogenous gene repeats were identified after deRIP. The deRIP process is a new tool for fungal genomics that facilitates the identification and understanding of the role and origin of fungal repetitive DNA. DeRIP is open-source and is available as part of the RIPCAL suite at http://www.sourceforge.net/projects/ripcal.
THE USE OF INTER SIMPLE SEQUENCE REPEATS (ISSR) IN DISTINGUISHING NEIGHBORING DOUGLAS-FIR TREES AS A MEANS TO IDENTIFYING TREE ROOTS WITH ABOVE-GROUND BIOMASS

Science.gov (United States)

We are attempting to identify specific root fragments from soil cores with individual trees. We successfully used Inter Simple Sequence Repeats (ISSR) to distinguish neighboring old-growth Douglas-fir trees from one another, while maintaining identity among each tree's parts. W...
Rate-determining Step of Flap Endonuclease 1 (FEN1) Reflects a Kinetic Bias against Long Flaps and Trinucleotide Repeat Sequences.

Science.gov (United States)

Tarantino, Mary E; Bilotti, Katharina; Huang, Ji; Delaney, Sarah

2015-08-21

Flap endonuclease 1 (FEN1) is a structure-specific nuclease responsible for removing 5'-flaps formed during Okazaki fragment maturation and long patch base excision repair. In this work, we use rapid quench flow techniques to examine the rates of 5'-flap removal on DNA substrates of varying length and sequence. Of particular interest are flaps containing trinucleotide repeats (TNR), which have been proposed to affect FEN1 activity and cause genetic instability. We report that FEN1 processes substrates containing flaps of 30 nucleotides or fewer at comparable single-turnover rates. However, for flaps longer than 30 nucleotides, FEN1 kinetically discriminates substrates based on flap length and flap sequence. In particular, FEN1 removes flaps containing TNR sequences at a rate slower than mixed sequence flaps of the same length. Furthermore, multiple-turnover kinetic analysis reveals that the rate-determining step of FEN1 switches as a function of flap length from product release to chemistry (or a step prior to chemistry). These results provide a kinetic perspective on the role of FEN1 in DNA replication and repair and contribute to our understanding of FEN1 in mediating genetic instability of TNR sequences. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations

Directory of Open Access Journals (Sweden)

Paolo Annicchiarico

2016-07-01

Full Text Available Cultivar registration agencies typically require morphophysiological trait-based distinctness of candidate cultivars. This requirement is difficult to achieve for cultivars of major perennial forages because of their genetic structure and ever-increasing number of registered material, leading to possible rejection of agronomically valuable cultivars. This study aimed to explore the value of molecular markers applied to replicated bulked plants (three bulks of 100 independent plants each per cultivar to assess alfalfa ( L. subsp. cultivar distinctness. We compared genotyping-by-sequencing information based on 2902 polymorphic single-nucleotide polymorphism (SNP markers (>30 reads per DNA sample with morphophysiological information based on 11 traits and with simple-sequence repeat (SSR marker information from 41 polymorphic markers for their ability to distinguish 11 alfalfa landraces representative of the germplasm from northern Italy. Three molecular criteria, one based on cultivar differences for individual SSR bands and two based on overall SNP marker variation assessed either by statistically significant cultivar differences on principal component axes or discriminant analysis, distinctly outperformed the morphophysiological criterion. Combining the morphophysiological criterion with either molecular marker method increased discrimination among cultivars, since morphophysiological diversity was unrelated to SSR marker-based diversity ( = 0.04 and poorly related to SNP marker-based diversity ( = 0.23, < 0.15. The criterion based on statistically significant SNP allele frequency differences was less discriminating than morphophysiological variation. Marker-based distinctness, which can be assessed at low cost and without interactions with testing conditions, could validly substitute for (or complement morphophysiological distinctness in alfalfa cultivar registration schemes. It also has interest in sui generis registration systems aimed at
LQG/LTR [linear quadratic Gaussian with loop transfer recovery] robust control system design for a low-pressure feedwater heater train

International Nuclear Information System (INIS)

Murphy, G.V.; Bailey, J.M.

1990-01-01

This paper uses the linear quadratic Gaussian with loop transfer recovery (LQG/LTR) control system design method to obtain a level control system for a low-pressure feedwater heater train. The control system performance and stability robustness are evaluated for a given set of system design specifications. The tools for analysis are the return ratio, return difference, and inverse return difference singular-valve plots for a loop break at the plant output. 3 refs., 7 figs., 2 tabs
Agarose gel electrophoresis and polyacrylamide gel electrophoresis for visualization of simple sequence repeats.

Science.gov (United States)

Anderson, James; Wright, Drew; Meksem, Khalid

2013-01-01

In the modern age of genetic research there is a constant search for ways to improve the efficiency of plant selection. The most recent technology that can result in a highly efficient means of selection and still be done at a low cost is through plant selection directed by simple sequence repeats (SSRs or microsatellites). The molecular markers are used to select for certain desirable plant traits without relying on ambiguous phenotypic data. The best way to detect these is the use of gel electrophoresis. Gel electrophoresis is a common technique in laboratory settings which is used to separate deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) by size. Loading DNA and RNA onto gels allows for visualization of the size of fragments through the separation of DNA and RNA fragments. This is achieved through the use of the charge in the particles. As the fragments separate, they form into distinct bands at set sizes. We describe the ability to visualize SSRs on slab gels of agarose and polyacrylamide gel electrophoresis.
Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

Science.gov (United States)

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Human T-cell leukemia virus types I and II exhibit different DNase I protection patterns

International Nuclear Information System (INIS)

Altman, R.; Harrich, D.; Garcia, J.A.; Gaynor, R.B.

1988-01-01

Human T-cell leukemia virus types I (HTLV-I) and II (HTLV-II) are human retroviruses which normally infect T-lymphoid cells. HTLV-I infection is associated with adult T-cell leukemia-lymphoma, and HTLV-II is associated with an indolent form of hairy-cell leukemia. To identify potential transcriptional regulatory elements of these two related human retroviruses, the authors performed DNase I footprinting of both the HTLV-I and HTLV-II long terminal repeats (LTRs) by using extracts prepared from uninfected T cells, HTLV-I and HTLV-II transformed T cells, and HeLa cells. Five regions of the HTLV-I LTR and three regions of the HTLV-II LTR showed protection by DNase I footprinting. All three of the 21-base-pair repeats previously shown to be important in HTLV transcriptional regulation were protected in the HTLV-I LTR, whereas only one of these repeats was protected in the HTLV-II LTR. Several regions exhibited altered protection in extracts prepared from lymphoid cells as compared with HeLa cells, but there were minimal differences in the protection patterns between HTLV-infected and uninfected lymphoid extracts. A number of HTLV-I and HTLV-II LTR fragments which contained regions showing protection in DNase I footprinting were able to function as inducible enhancer elements in transient CAT gene expression assays in the presence of the HTLV-II tat protein. The alterations in the pattern of the cellular proteins which bind to the HTLV-I and HTLV-II LTRs may in part be responsible for differences in the transcriptional regulation of these two related viruses
Inclusion of Moloney murine leukemia virus elements upstream of the transgene cassette in an E1-deleted adenovirus leads to an unusual genomic integration in epithelial cells

International Nuclear Information System (INIS)

Zheng Changyu; O'Connell, Brian C.; Baum, Bruce J.

2003-01-01

Classically, the 5' and 3' long terminal repeats (LTRs) are considered necessary but not sufficient for retroviral integration. Recently, we reported that inclusion of these and additional elements from Moloney murine leukemia virus (MoMLV) facilitated transgene integration, without retroviral integrase, when placed in an adenoviral context (AdLTR-luc vector) (Nat. Biotech. 18 (2000), 176; Biochem. Biophys. Res. Commun. 300 (2003), 115). To help understand this nonhomologous DNA recombination event, we constructed another vector, AdELP-luc, with 2.7 kb of MoMLV elements identically placed into an E1-deleted adenovirus type 5 backbone upstream of a luciferase cDNA reporter gene. Unlike AdLTR-luc, no MoMLV elements were placed downstream of the expression cassette. AdELP-luc readily infected epithelial cells in vitro. Southern hybridizations with DNA from cloned cells showed that disruption of the MoMLV sequences occurred. One cell clone, grown in vitro without any special selection medium for 9 months, exhibited stable vector integration and luciferase activity. Importantly, both Southern hybridization and FISH analyses showed that in addition to the MoMLV elements and expression cassette, substantial adenoviral sequence downstream of the luciferase cDNA was genomically integrated. These results suggest that the 2.7 kb of MoMLV sequence included in AdELP-luc have cis-acting functions and mediates an unusual integration event

Draft whole genome sequence of groundnut stem rot fungus Athelia rolfsii revealing genetic architect of its pathogenicity and virulence.

Science.gov (United States)

Iquebal, M A; Tomar, Rukam S; Parakhia, M V; Singla, Deepak; Jaiswal, Sarika; Rathod, V M; Padhiyar, S M; Kumar, Neeraj; Rai, Anil; Kumar, Dinesh

2017-07-13

Groundnut (Arachis hypogaea L.) is an important oil seed crop having major biotic constraint in production due to stem rot disease caused by fungus, Athelia rolfsii causing 25-80% loss in productivity. As chemical and biological combating strategies of this fungus are not very effective, thus genome sequencing can reveal virulence and pathogenicity related genes for better understanding of the host-parasite interaction. We report draft assembly of Athelia rolfsii genome of ~73 Mb having 8919 contigs. Annotation analysis revealed 16830 genes which are involved in fungicide resistance, virulence and pathogenicity along with putative effector and lethal genes. Secretome analysis revealed CAZY genes representing 1085 enzymatic genes, glycoside hydrolases, carbohydrate esterases, carbohydrate-binding modules, auxillary activities, glycosyl transferases and polysaccharide lyases. Repeat analysis revealed 11171 SSRs, LTR, GYPSY and COPIA elements. Comparative analysis with other existing ascomycotina genome predicted conserved domain family of WD40, CYP450, Pkinase and ABC transporter revealing insight of evolution of pathogenicity and virulence. This study would help in understanding pathogenicity and virulence at molecular level and development of new combating strategies. Such approach is imperative in endeavour of genome based solution in stem rot disease management leading to better productivity of groundnut crop in tropical region of world.
Evaluation of Mammalian Interspersed Repeats to investigate the goat genome

Directory of Open Access Journals (Sweden)

P. Mariani

2010-01-01

Full Text Available Among the repeated sequences present in most eukaryotic genomes, SINEs (Short Interspersed Nuclear Elements are widely used to investigate evolution in the mammalian order (Buchanan et al., 1999. One family of these repetitive sequences, the MIR (Mammalian Interspersed Repeats; Jurka et al., 1995, is ubiquitous in all mammals.MIR elements are tRNA-derived SINEs and are identifiable by a conserved core region of about 70 nucleotides.
Regulation of HFE expression by poly(ADP-ribose) polymerase-1 (PARP1) through an inverted repeat DNA sequence in the distal promoter.

Science.gov (United States)

Pelham, Christopher; Jimenez, Tamara; Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M Rafiq

2013-12-01

Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700bp (-1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. © 2013.
Rhoptry-associated protein (rap-1) genes in the sheep pathogen Babesia sp. Xinjiang: Multiple transcribed copies differing by 3' end repeated sequences.

Science.gov (United States)

Niu, Qingli; Marchand, Jordan; Yang, Congshan; Bonsergent, Claire; Guan, Guiquan; Yin, Hong; Malandrin, Laurence

2015-07-30

Sheep babesiosis occurs mainly in tropical and subtropical areas. The sheep parasite Babesia sp. Xinjiang is widespread in China, and our goal is to characterize rap-1 (rhoptry-associated protein 1) gene diversity and expression as a first step of a long term goal aiming at developing a recombinant subunit vaccine. Seven different rap-1a genes were amplified in Babesia sp. Xinjiang, using degenerate primers designed from conserved motifs. Rap-1b and rap-1c gene types could not be identified. In all seven rap-1a genes, the 5' regions exhibited identical sequences over 936 nt, and the 3' regions differed at 28 positions over 147 nt, defining two types of genes designated α and β. The remaining 3' part varied from 72 to 360 nt in length, depending on the gene. This region consists of a succession of two to ten 36 nt repeats, which explains the size differences. Even if the nucleotide sequences varied, 6 repeats encoded the same stretch of amino acids. Transcription of at least four α and two β genes was demonstrated by standard RT-PCR. Copyright © 2015 Elsevier B.V. All rights reserved.
R-loops: targets for nuclease cleavage and repeat instability.

Science.gov (United States)

Freudenreich, Catherine H

2018-01-11

R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.
Germ-line CAG repeat instability causes extreme CAG repeat expansion with infantile-onset spinocerebellar ataxia type 2

DEFF Research Database (Denmark)

Vinther-Jensen, Tua; Ek, Jakob; Duno, Morten

2013-01-01

The spinocerebellar ataxias (SCA) are a genetically and clinically heterogeneous group of diseases, characterized by dominant inheritance, progressive cerebellar ataxia and diverse extracerebellar symptoms. A subgroup of the ataxias is caused by unstable CAG-repeat expansions in their respective ...... of paternal germ-line repeat sequence instability of the expanded SCA2 locus.European Journal of Human Genetics advance online publication, 10 October 2012; doi:10.1038/ejhg.2012.231....
A family of DNA repeats in Aspergillus nidulans has assimilated degenerated retrotransposons

DEFF Research Database (Denmark)

Nielsen, M.L.; Hermansen, T.D.; Aleksenko, Alexei Y.

2001-01-01

In the course of a chromosomal walk towards the centromere of chromosome IV of Aspergillus nidulans, several cross- hybridizing genomic cosmid clones were isolated. Restriction mapping of two such clones revealed that their restriction patterns were similar in a region of at least 15 kb, indicati......) phenomenon, first described in Neurospora crassa, may have operated in A. nidulans. The data indicate that this family of repeats has assimilated mobile elements that subsequently degenerated but then underwent further duplications as a part of the host repeats....... the presence of a large repeat. The nature of the repeat was further investigated by sequencing and Southern analysis. The study revealed a family of long dispersed repeats with a high degree of sequence similarity. The number and location of the repeats vary between wild isolates. Two copies of the repeat...
The effects of 5-fluorouracil and doxorubicin on expression of human immunodeficiency virus type 1 long terminal repeat

International Nuclear Information System (INIS)

Panozzo, J.; Akan, E.; Griffiths, T.D.

1996-01-01

Previous work by many groups has documented induction of the HIV-LTR following exposure of cells to ultraviolet light and other DNA damaging agents. Our experiments set out to determine the relative activation or repression of the HIV-LTR in response to two classes of chemotherapeutic agents: Doxorubicin is a DNA-damage inducing agent, and 5-fluorouracil has an antimetabolic mode of action. Using HeLa cells stably transfected with a construct in which HIV-LTR drives expression of the chloramphenicol acetyl transferase reporter gene, we demonstrated an up to 10-fold induction following doxorubicin treatment in 24 h post-treatment. This induction was repressed by treatment with salicylic acid, suggesting a role for prostaglandin/cyclo-oxygenase pathways and/or NFKB in the inductive response. Induction by 5-fluorouracil, in contrast, was more modest (two-fold at most) though it was consistently elevated over controls
Conservation of Repeats at the Mammalian KCNQ1OT1-CDKN1C Region Suggests a Role in Genomic Imprinting

Directory of Open Access Journals (Sweden)

Marcos De Donato

2017-06-01

Full Text Available KCNQ1OT1 is located in the region with the highest number of genes showing genomic imprinting, but the mechanisms controlling the genes under its influence have not been fully elucidated. Therefore, we conducted a comparative analysis of the KCNQ1/KCNQ1OT1-CDKN1C region to study its conservation across the best assembled eutherian mammalian genomes sequenced to date and analyzed potential elements that may be implicated in the control of genomic imprinting in this region. The genomic features in these regions from human, mouse, cattle, and dog show a higher number of genes and CpG islands (detected using cpgplot from EMBOSS, but lower number of repetitive elements (including short interspersed nuclear elements and long interspersed nuclear elements, compared with their whole chromosomes (detected by RepeatMasker. The KCNQ1OT1-CDKN1C region contains the highest number of conserved noncoding sequences (CNS among mammals, where we found 16 regions containing about 38 different highly conserved repetitive elements (using mVista, such as LINE1 elements: L1M4, L1MB7, HAL1, L1M4a, L1Med, and an LTR element: MLT1H. From these elements, we found 74 CNS showing high sequence identity (>70% between human, cattle, and mouse, from which we identified 13 motifs (using Multiple Em for Motif Elicitation/Motif Alignment and Search Tool with a significant probability of occurrence, 3 of which were the most frequent and were used to find transcription factor–binding sites. We detected several transcription factors (using JASPAR suite from the families SOX, FOX, and GATA. A phylogenetic analysis of these CNS from human, marmoset, mouse, rat, cattle, dog, horse, and elephant shows branches with high levels of support and very similar phylogenetic relationships among these groups, confirming previous reports. Our results suggest that functional DNA elements identified by comparative genomics in a region densely populated with imprinted mammalian genes may be
Conserved loci of leaf and stem rust fungi of wheat share synteny interrupted by lineage-specific influx of repeat elements

Directory of Open Access Journals (Sweden)

Fellers John P

2013-01-01

Full Text Available Abstract Background Wheat leaf rust (Puccinia triticina Eriks; Pt and stem rust fungi (P. graminis f.sp. tritici; Pgt are significant economic pathogens having similar host ranges and life cycles, but different alternate hosts. The Pt genome, currently estimated at 135 Mb, is significantly larger than Pgt, at 88 Mb, but the reason for the expansion is unknown. Three genomic loci of Pt conserved proteins were characterized to gain insight into gene content, genome complexity and expansion. Results A bacterial artificial chromosome (BAC library was made from P. triticina race 1, BBBD and probed with Pt homologs of genes encoding two predicted Pgt secreted effectors and a DNA marker mapping to a region of avirulence. Three BACs, 103 Kb, 112 Kb, and 166 Kb, were sequenced, assembled, and open reading frames were identified. Orthologous genes were identified in Pgt and local conservation of gene order (microsynteny was observed. Pairwise protein identities ranged from 26 to 99%. One Pt BAC, containing a RAD18 ortholog, shares syntenic regions with two Pgt scaffolds, which could represent both haplotypes of Pgt. Gene sequence is diverged between the species as well as within the two haplotypes. In all three BAC clones, gene order is locally conserved, however, gene shuffling has occurred relative to Pgt. These regions are further diverged by differing insertion loci of LTR-retrotransposon, Gypsy, Copia, Mutator, and Harbinger mobile elements. Uncharacterized Pt open reading frames were also found; these proteins are high in lysine and similar to multiple proteins in Pgt. Conclusions The three Pt loci are conserved in gene order, with a range of gene sequence divergence. Conservation of predicted haustoria expressed secreted protein genes between Pt and Pgt is extended to the more distant poplar rust, Melampsora larici-populina. The loci also reveal that genome expansion in Pt is in part due to higher occurrence of repeat-elements in this species.
Study of simple sequence repeat (SSR) polymorphism for biotic ...

African Journals Online (AJOL)

home

2013-10-02

Oct 2, 2013 ... G. Siva Kumar1, K. Aruna Kumari1*, Ch. V. Durga Rani1, R. M. Sundaram2, S. Vanisree3, Md. ..... review by Jena and Mackill (2008) provided the list of .... repeat protein and is a member of a resistance gene cluster on rice.
Mononucleotide repeats are asymmetrically distributed in fungal genes

NARCIS (Netherlands)

Passel, van M.W.J.; Graaff, de L.H.

2008-01-01

ABSTRACT: BACKGROUND: Systematic analyses of sequence features have resulted in a better characterisation of the organisation of the genome. A previous study in prokaryotes on the distribution of sequence repeats, which are notoriously variable and can disrupt the reading frame in genes, showed that
Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

Science.gov (United States)

Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

2015-01-01

Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 % and 93.7 % of the sequenced clones contained in microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic with the polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished indicative of the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioca, Momordica cochinchinesis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.
Alu repeats as markers for forensic DNA analyses

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Kass, D.H. [Louisiana State Univ., New Orleans, LA (United States)] [and others

1994-01-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 98.9% nucleotide identity with the HS subfamily consensus sequence, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5 inch and 3 inch unique flanking DNA sequences from each HS Alu that allow the locus to be assayed for the presence or absence of the Alu repeat. The dimorphic HS Alu sequences probably inserted in the human genome after the radiation of modem humans (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project. HS Alu family member insertions differ from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) in that polymorphisms due to Alu insertions arise as a result of a unique event which has occurred only one time in the human population and spread through the population from that point. Therefore, individuals that share HS Alu repeats inherited these elements from a common ancestor. Most VNTR and RFLP polymorphisms may arise multiple times in parallel within a population.
The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

Science.gov (United States)

Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

2017-01-01

Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections
C-terminal sequences of hsp70 and hsp90 as non-specific anchors for tetratricopeptide repeat (TPR) proteins.

Science.gov (United States)

Ramsey, Andrew J; Russell, Lance C; Chinkers, Michael

2009-10-12

Steroid-hormone-receptor maturation is a multi-step process that involves several TPR (tetratricopeptide repeat) proteins that bind to the maturation complex via the C-termini of hsp70 (heat-shock protein 70) and hsp90 (heat-shock protein 90). We produced a random T7 peptide library to investigate the roles played by the C-termini of the two heat-shock proteins in the TPR-hsp interactions. Surprisingly, phages with the MEEVD sequence, found at the C-terminus of hsp90, were not recovered from our biopanning experiments. However, two groups of phages were isolated that bound relatively tightly to HsPP5 (Homo sapiens protein phosphatase 5) TPR. Multiple copies of phages with a C-terminal sequence of LFG were isolated. These phages bound specifically to the TPR domain of HsPP5, although mutation studies produced no evidence that they bound to the domain's hsp90-binding groove. However, the most abundant family obtained in the initial screen had an aspartate residue at the C-terminus. Two members of this family with a C-terminal sequence of VD appeared to bind with approximately the same affinity as the hsp90 C-12 control. A second generation pseudo-random phage library produced a large number of phages with an LD C-terminus. These sequences acted as hsp70 analogues and had relatively low affinities for hsp90-specific TPR domains. Unfortunately, we failed to identify residues near hsp90's C-terminus that impart binding specificity to individual hsp90-TPR interactions. The results suggest that the C-terminal sequences of hsp70 and hsp90 act primarily as non-specific anchors for TPR proteins.
Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana

Directory of Open Access Journals (Sweden)

Yuan Tong

2010-01-01

Full Text Available Abstract Background Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 Leucine-rich repeat receptor-like kinases (LRR-RLKs, representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated. Results As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs of 194 LRR-RLKs were cloned into the GatewayR donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into GatewayR destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC at Ohio State University for full accessibility by the Arabidopsis research community. Conclusions Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.
Nature of unstable insertional mutations and reversions at the cut locus of Drosophila melanogaster: Molecular mechanism for transpositional memory

International Nuclear Information System (INIS)

Mizrokhi, L.Yu.; Georgieva, S.G.; Obolenkova, L.A.; Priimyagi, A.F.; Gerasimova, T.I.; Il'in, Yu.V.

1988-01-01

A segment of the cut locus containing an mdg4 insertion as a result of ct MR and ct MRp10 mutations was cloned. Clones were obtained for the phenotypically different ct MR2 and ct MRpN10 mutants and for stable and unstable revertants. All mutations studied are associated with mdg4 insertion at an identical nucleotide sequence of the cut locus, the same site at which mdg4 is inserted at the ct 6 allele. The ct MRpN line differs from ct MR2 in that the mobile element jockey (3 kbp) is inserted in mdg4. Jockey is represented by about 1,000 copies per genome; it is homogeneous and lacks long terminal repeats (LTRs). In stable ct + reversions, mdg4 is completely excised. In unstable ct + reversions, in which there is a high degree of reverse directed transposition of mdg4 to the cut locus, an LTR of mdg4 is preserved at the site of the mutation. It is a sequence along which new copies of mdg4 or jockey-containing mdg4 are inserted into the genome. The authors discuss a molecular mechanism for transpositional memory involving homologous recombination of the remnant LTR and circular extrachromosomal copies of mdg4
Transposable elements and circular DNAs

KAUST Repository

Mourier, Tobias

2016-09-26

Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.
Transposable elements and circular DNAs

KAUST Repository

Mourier, Tobias

2016-01-01

Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.

The DUB/USP17 deubiquitinating enzymes: A gene family within a tandemly repeated sequence, is also embedded within the copy number variable Beta-defensin cluster

Directory of Open Access Journals (Sweden)

Scott Christopher J

2010-04-01

Full Text Available Abstract Background The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A. Subsequently we have identified a number of human family members and shown that one of these (DUB-3 is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1. Results Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable beta-defensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating. Conclusions The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.
Karyological characterization and identification of four repetitive element groups (the 18S – 28S rRNA gene, telomeric sequences, microsatellite repeat motifs, Rex retroelements) of the Asian swamp eel (Monopterus albus)

Science.gov (United States)

Suntronpong, Aorarat; Thapana, Watcharaporn; Twilprawat, Panupon; Prakhongcheep, Ornjira; Somyong, Suthasinee; Muangmai, Narongrit; Surin Peyachoknagul; Srikulnath, Kornsorn

2017-01-01

Abstract Among teleost fishes, Asian swamp eel (Monopterus albus Zuiew, 1793) possesses the lowest chromosome number, 2n = 24. To characterize the chromosome constitution and investigate the genome organization of repetitive sequences in M. albus, karyotyping and chromosome mapping were performed with the 18S – 28S rRNA gene, telomeric repeats, microsatellite repeat motifs, and Rex retroelements. The 18S – 28S rRNA genes were observed to the pericentromeric region of chromosome 4 at the same position with large propidium iodide and C-positive bands, suggesting that the molecular structure of the pericentromeric regions of chromosome 4 has evolved in a concerted manner with amplification of the 18S – 28S rRNA genes. (TTAGGG)n sequences were found at the telomeric ends of all chromosomes. Eight of 19 microsatellite repeat motifs were dispersedly mapped on different chromosomes suggesting the independent amplification of microsatellite repeat motifs in M. albus. Monopterus albus Rex1 (MALRex1) was observed at interstitial sites of all chromosomes and in the pericentromeric regions of most chromosomes whereas MALRex3 was scattered and localized to all chromosomes and MALRex6 to several chromosomes. This suggests that these retroelements were independently amplified or lost in M. albus. Among MALRexs (MALRex1, MALRex3, and MALRex6), MALRex6 showed higher interspecific sequence divergences from other teleost species in comparison. This suggests that the divergence of Rex6 sequences of M. albus might have occurred a relatively long time ago. PMID:29093797
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

Science.gov (United States)

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Revisiting the TALE repeat.

Science.gov (United States)

Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

2014-04-01

Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

Science.gov (United States)

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Comparative Methylation of ERVWE1/Syncytin-1 and Other Human Endogenous Retrovirus LTRs in Placenta Tissues

Science.gov (United States)

Gimenez, Juliette; Montgiraud, Cécile; Oriol, Guy; Pichon, Jean-Philippe; Ruel, Karine; Tsatsaris, Vassilis; Gerbaud, Pascale; Frendo, Jean-Louis; Evain-Brion, Danièle; Mallet, François

2009-01-01

Human endogenous retroviruses (HERVs) are globally silent in somatic cells. However, some HERVs display high transcription in physiological conditions. In particular, ERVWE1, ERVFRDE1 and ERV3, three proviruses of distinct families, are highly transcribed in placenta and produce envelope proteins associated with placenta development. As silencing of repeated elements is thought to occur mainly by DNA methylation, we compared the methylation of ERVWE1 and related HERVs to appreciate whether HERV methylation relies upon the family, the integration site, the tissue, the long terminal repeat (LTR) function or the associated gene function. CpG methylation of HERV-W LTRs in placenta-associated tissues was heterogeneous but a joint epigenetic control was found for ERVWE1 5′LTR and its juxtaposed enhancer, a mammalian apparent LTR retrotransposon. Additionally, ERVWE1, ERVFRDE1 and ERV3 5′LTRs were all essentially hypomethylated in cytotrophoblasts during pregnancy, but showed distinct and stage-dependent methylation profiles. In non-cytotrophoblastic cells, they also exhibited different methylation profiles, compatible with their respective transcriptional activities. Comparative analyses of transcriptional activity and LTR methylation in cell lines further sustained a role for methylation in the control of functional LTRs. These results suggest that HERV methylation might not be family related but copy-specific, and related to the LTR function and the tissue. In particular, ERVWE1 and ERV3 could be developmentally epigenetically regulated HERVs. PMID:19561344
[Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

Science.gov (United States)

Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

2009-11-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
Inter- and intra-strain variability of tandem repeats in Mycoplasma pneumoniae based on next-generation sequencing data.

Science.gov (United States)

Zhang, Jing; Song, Xiaohong; Ma, Marella J; Xiao, Li; Kenri, Tsuyoshi; Sun, Hongmei; Ptacek, Travis; Li, Shaoli; Waites, Ken B; Atkinson, T Prescott; Shibayama, Keigo; Dybvig, Kevin; Feng, Yanmei

2017-02-01

To characterize inter- and intra-strain variability of variable-number tandem repeats (VNTRs) in Mycoplasma pneumoniae to determine the optimal multilocus VNTR analysis scheme for improved strain typing. Whole genome assemblies and next-generation sequencing data from diverse M. pneumoniae isolates were used to characterize VNTRs and their variability, and to compare the strain discriminability of new VNTR and existing markers. We identified 13 VNTRs including five reported previously. These VNTRs displayed different levels of inter- and intra-strain copy number variations. All new markers showed similar or higher discriminability compared with existing VNTR markers and the P1 typing system. Our study provides novel insights into VNTR variations and potential new multilocus VNTR analysis schemes for improved genotyping of M. pneumoniae.
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc Maxim, a Chinese Traditional Medicinal Plant

Directory of Open Access Journals (Sweden)

Gong Xiao

2013-06-01

Full Text Available Epimedium sagittatum (Sieb. et Zucc Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12. However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE repeats identified (65.37% of all TE repeats, particularly LTR (Long Terminal Repeat retrotransposons (52.27% of all TE repeats. Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant.
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

Science.gov (United States)

Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

2013-01-01

Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511
Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

Energy Technology Data Exchange (ETDEWEB)

Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others

1994-09-01

Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.
The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

Directory of Open Access Journals (Sweden)

Vergnaud Gilles

2007-05-01

Full Text Available Abstract Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the
The Sinbad retrotransposon from the genome of the human blood fluke, Schistosoma mansoni, and the distribution of related Pao-like elements

Directory of Open Access Journals (Sweden)

Morales Maria E

2005-02-01

Full Text Available Abstract Background Of the major families of long terminal repeat (LTR retrotransposons, the Pao/BEL family is probably the least well studied. It is becoming apparent that numerous LTR retrotransposons and other mobile genetic elements have colonized the genome of the human blood fluke, Schistosoma mansoni. Results A proviral form of Sinbad, a new LTR retrotransposon, was identified in the genome of S. mansoni. Phylogenetic analysis indicated that Sinbad belongs to one of five discreet subfamilies of Pao/BEL like elements. BLAST searches of whole genomes and EST databases indicated that members of this clade occurred in species of the Insecta, Nematoda, Echinodermata and Chordata, as well as Platyhelminthes, but were absent from all plants, fungi and lower eukaryotes examined. Among the deuterostomes examined, only aquatic species harbored these types of elements. All four species of nematode examined were positive for Sinbad sequences, although among insect and vertebrate genomes, some were positive and some negative. The full length, consensus Sinbad retrotransposon was 6,287 bp long and was flanked at its 5'- and 3'-ends by identical LTRs of 386 bp. Sinbad displayed a triple Cys-His RNA binding motif characteristic of Gag of Pao/BEL-like elements, followed by the enzymatic domains of protease, reverse transcriptase (RT, RNAseH, and integrase, in that order. A phylogenetic tree of deduced RT sequences from 26 elements revealed that Sinbad was most closely related to an unnamed element from the zebrafish Danio rerio and to Saci-1, also from S. mansoni. It was also closely related to Pao from Bombyx mori and to Ninja of Drosophila simulans. Sinbad was only distantly related to the other schistosome LTR retrotransposons Boudicca, Gulliver, Saci-2, Saci-3, and Fugitive, which are gypsy-like. Southern hybridization and bioinformatics analyses indicated that there were about 50 copies of Sinbad in the S. mansoni genome. The presence of ESTs
Comprehensive identification of genes driven by ERV9-LTRs reveals TNFRSF10B as a re-activatable mediator of testicular cancer cell death

Science.gov (United States)

Beyer, U; Krönung, S K; Leha, A; Walter, L; Dobbelstein, M

2016-01-01

The long terminal repeat (LTR) of human endogenous retrovirus type 9 (ERV9) acts as a germline-specific promoter that induces the expression of a proapoptotic isoform of the tumor suppressor homologue p63, GTAp63, in male germline cells. Testicular cancer cells silence this promoter, but inhibitors of histone deacetylases (HDACs) restore GTAp63 expression and give rise to apoptosis. We show here that numerous additional transcripts throughout the genome are driven by related ERV9-LTRs. 3' Rapid amplification of cDNA ends (3'RACE) was combined with next-generation sequencing to establish a large set of such mRNAs. HDAC inhibitors induce these ERV9-LTR-driven genes but not the LTRs from other ERVs. In particular, a transcript encoding the death receptor DR5 originates from an ERV9-LTR inserted upstream of the protein coding regions of the TNFRSF10B gene, and it shows an expression pattern similar to GTAp63. When treating testicular cancer cells with HDAC inhibitors as well as the death ligand TNF-related apoptosis-inducing ligand (TRAIL), rapid cell death was observed, which depended on TNFRSF10B expression. HDAC inhibitors also cooperate with cisplatin (cDDP) to promote apoptosis in testicular cancer cells. ERV9-LTRs not only drive a large set of human transcripts, but a subset of them acts in a proapoptotic manner. We propose that this avoids the survival of damaged germ cells. HDAC inhibition represents a strategy of restoring the expression of a class of ERV9-LTR-mediated genes in testicular cancer cells, thereby re-enabling tumor suppression. PMID:26024393
The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

Directory of Open Access Journals (Sweden)

Turmel Monique

2007-07-01

Full Text Available Abstract Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales. Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate
A super-family of transcriptional activators regulates bacteriophage packaging and lysis in Gram-positive bacteria

Science.gov (United States)

Quiles-Puchalt, Nuria; Tormo-Más, María Ángeles; Campoy, Susana; Toledo-Arana, Alejandro; Monedero, Vicente; Lasa, Íñigo; Novick, Richard P.; Christie, Gail E.; Penadés, José R.

2013-01-01

The propagation of bacteriophages and other mobile genetic elements requires exploitation of the phage mechanisms involved in virion assembly and DNA packaging. Here, we identified and characterized four different families of phage-encoded proteins that function as activators required for transcription of the late operons (morphogenetic and lysis genes) in a large group of phages infecting Gram-positive bacteria. These regulators constitute a super-family of proteins, here named late transcriptional regulators (Ltr), which share common structural, biochemical and functional characteristics and are unique to this group of phages. They are all small basic proteins, encoded by genes present at the end of the early gene cluster in their respective phage genomes and expressed under cI repressor control. To control expression of the late operon, the Ltr proteins bind to a DNA repeat region situated upstream of the terS gene, activating its transcription. This involves the C-terminal part of the Ltr proteins, which control specificity for the DNA repeat region. Finally, we show that the Ltr proteins are the only phage-encoded proteins required for the activation of the packaging and lysis modules. In summary, we provide evidence that phage packaging and lysis is a conserved mechanism in Siphoviridae infecting a wide variety of Gram-positive bacteria. PMID:23771138
Characterization and expression of the maize β-carbonic anhydrase gene repeat regions.

Science.gov (United States)

Tems, Ursula; Burnell, James N

2010-12-01

In maize, carbonic anhydrase (CA; EC 4.2.1.1) catalyzes the first reaction of the C(4) photosynthetic pathway; it catalyzes the hydration of CO(2) to bicarbonate and provides an inorganic carbon source for the primary carboxylation reaction catalyzed by phosphoenolpyruvate (PEP) carboxylase. The β-CA isozymes from maize, as well as other agronomically important NADP-malic enzyme (NADP-ME) type C(4) crops, have remained relatively uncharacterized but differ significantly from the β-CAs of other C(4) monocot species primarily due to transcript length and the presence of repeat sequences. This research confirmed earlier findings of repeat sequences in maize CA transcripts, and demonstrated that the gene encoding these transcripts is also composed of repeat sequences. One of the maize CA genes was sequenced and found to encode two domains, with distinct groups of exons corresponding to the repeat regions of the transcript. We have also shown that expression of a single repeat region of the CA transcript produced active enzyme that associated as a dimer and was composed primarily of α-helices, consistent with that observed for other plant CAs. As the presence of repeat regions in the CA gene is unique to NADP-ME type C(4) monocot species, the implications of these findings in the context of the evolution of the location and function of this C(4) pathway enzyme are strongly suggestive of CA gene duplication resulting in an evolutionary advantage and a higher photosynthetic efficiency. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Expansion of protein domain repeats.

Directory of Open Access Journals (Sweden)

Asa K Björklund

2006-08-01

Full Text Available Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein-protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.
Loss and recovery of Arabidopsis-type telomere repeat sequences 5'-(TTTAGGG)(n)-3' in the evolution of a major radiation of flowering plants.

OpenAIRE

Adams, S. P.; Hartman, T. P.; Lim, K. Y.; Chase, M. W.; Bennett, M. D.; Leitch, I. J.; Leitch, A. R.

2001-01-01

Fluorescent in situ hybridization and Southern blotting were used for showing the predominant absence of the Arabidopsis-type telomere repeat sequence (TRS) 5'-(TTTAGGG)(n)-3' (the 'typical' telomere) in a monocot clade which comprises up to 6300 species within Asparagales. Initially, two apparently disparate genera that lacked the typical telomere were identified. Here, we used the new angiosperm phylogenetic classification for predicting in which other related families such telomeres might ...
ASAP: Amplification, sequencing & annotation of plastomes

Directory of Open Access Journals (Sweden)

Folta Kevin M

2005-12-01

Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and

Exact Tandem Repeats Analyzer (E-TRA): A new program for DNA ...

Indian Academy of Sciences (India)

Unknown

Advanced user defined parameters/options let the researchers use different minimum motif repeats ... E-TRA, we used 5,465,605 human EST sequences derived from 18,814,550 ..... repeat rates of T-cells, embryo and testis were higher.
Molecular Characterization of Cultivated Bromeliad Accessions with Inter-Simple Sequence Repeat (ISSR Markers

Directory of Open Access Journals (Sweden)

Yongming Yu

2012-05-01

Full Text Available Bromeliads are of great economic importance in flower production; however little information is available with respect to genetic characterization of cultivated bromeliads thus far. In the present study, a selection of cultivated bromeliads was characterized via inter-simple sequence repeat (ISSR markers with an emphasis on genetic diversity and population structure. Twelve ISSR primers produced 342 bands, of which 287 (~84% were polymorphic, with polymorphic bands per primer ranging from 17 to 34. The Jaccard’s similarity ranged from 0.08 to 0.89 and averaged ~0.30 for the investigated bromeliads. The Bayesian-based approach, together with the un-weighted paired group method with arithmetic average (UPGMA-based clustering and the principal coordinate analysis (PCoA, distinctly grouped the bromeliads from Neoregelia, Guzmania, and Vriesea into three separately clusters, well corresponding with their botanical classifications; whereas the bromeliads of Aechmea other than the recently selected hybrids were not well assigned to a cluster. Additionally, ISSR marker was proven efficient for the identification of hybrids and bud sports of cultivated bromeliads. The findings achieved herein will further our knowledge about the genetic variability within cultivated bromeliads and therefore facilitate breeding for new varieties of cultivated bromeliads in future as well.
The proviral genome of radiation leukemia virus: Molecular cloning, nucleotide sequence of its long terminal repeat and integration in lymphoma cell DNA

International Nuclear Information System (INIS)

Janowski, M.; Merregaert, J.; Boniver, J.; Maisin, J.R.

1985-01-01

The proviral genome of a thymotropic and leukemogenic C57BL/Ka mouse retrovirus, RadLV/VL/sub 3/(T+L+), was cloned as a biologically active PstI insert in the bacterial plasmid pBR322. Its restriction map was compared to those, already known, of two nonthymotropic and nonleukemogenic viruses of the same mouse strain, the ecotropic BL/Ka(B) and the xenotropic constituent of the radiation leukemia virus complex (RadLV). Differences were observed in the pol gene and in the env gene. Moreover, the nucleotide sequence of the RadLV/VL/sub 3/(T+L+) long terminal repeat revealed the existence of two copies of a 42 bp long sequence, separated by 11 nucleotides and of which BL/Ka(B) possesses only one copy
Analysis of simple sequence repeats in the Gaeumannomyces graminis var. tritici genome and the development of microsatellite markers.

Science.gov (United States)

Li, Wei; Feng, Yanxia; Sun, Haiyan; Deng, Yuanyu; Yu, Hanshou; Chen, Huaigu

2014-11-01

Understanding the genetic structure of Gaeumannomyces graminis var. tritici is essential for the establishment of efficient disease control strategies. It is becoming clear that microsatellites, or simple sequence repeats (SSRs), play an important role in genome organization and phenotypic diversity, and are a large source of genetic markers for population genetics and meiotic maps. In this study, we examined the G. graminis var. tritici genome (1) to analyze its pattern of SSRs, (2) to compare it with other plant pathogenic filamentous fungi, such as Magnaporthe oryzae and M. poae, and (3) to identify new polymorphic SSR markers for genetic diversity. The G. graminis var. tritici genome was rich in SSRs; a total 13,650 SSRs have been identified with mononucleotides being the most common motifs. In coding regions, the densities of tri- and hexanucleotides were significantly higher than in noncoding regions. The di-, tri-, tetra, penta, and hexanucleotide repeats in the G. graminis var. tritici genome were more abundant than the same repeats in M. oryzae and M. poae. From 115 devised primers, 39 SSRs are polymorphic with G. graminis var. tritici isolates, and 8 primers were randomly selected to analyze 116 isolates from China. The number of alleles varied from 2 to 7 and the expected heterozygosity (He) from 0.499 to 0.837. In conclusion, SSRs developed in this study were highly polymorphic, and our analysis indicated that G. graminis var. tritici is a species with high genetic diversity. The results provide a pioneering report for several applications, such as the assessment of population structure and genetic diversity of G. graminis var. tritici.
Simple sequence repeats and compositional bias in the bipartite Ralstonia solanacearum GMI1000 genome

Directory of Open Access Journals (Sweden)

Vandamme Peter

2003-03-01

Full Text Available Abstract Background Ralstonia solanacearum is an important plant pathogen. The genome of R. solananearum GMI1000 is organised into two replicons (a 3.7-Mb chromosome and a 2.1-Mb megaplasmid and this bipartite genome structure is characteristic for most R. solanacearum strains. To determine whether the megaplasmid was acquired via recent horizontal gene transfer or is part of an ancestral single chromosome, we compared the abundance, distribution and compositon of simple sequence repeats (SSRs between both replicons and also compared the respective compositional biases. Results Our data show that both replicons are very similar in respect to distribution and composition of SSRs and presence of compositional biases. Minor variations in SSR and compositional biases observed may be attributable to minor differences in gene expression and regulation of gene expression or can be attributed to the small sample numbers observed. Conclusions The observed similarities indicate that both replicons have shared a similar evolutionary history and thus suggest that the megaplasmid was not recently acquired from other organisms by lateral gene transfer but is a part of an ancestral R. solanacearum chromosome.
ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

Science.gov (United States)

Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

2006-06-16

The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.
Highly sensitive detection of individual HEAT and ARM repeats with HHpred and COACH.

Science.gov (United States)

Kippert, Fred; Gerloff, Dietlind L

2009-09-24

HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high
Transcription of highly repetitive tandemly organized DNA in amphibians and birds: A historical overview and modern concepts.

Science.gov (United States)

Trofimova, Irina; Krasikova, Alla

2016-12-01

Tandemly organized highly repetitive DNA sequences are crucial structural and functional elements of eukaryotic genomes. Despite extensive evidence, satellite DNA remains an enigmatic part of the eukaryotic genome, with biological role and significance of tandem repeat transcripts remaining rather obscure. Data on tandem repeats transcription in amphibian and avian model organisms is fragmentary despite their genomes being thoroughly characterized. Review systematically covers historical and modern data on transcription of amphibian and avian satellite DNA in somatic cells and during meiosis when chromosomes acquire special lampbrush form. We highlight how transcription of tandemly repetitive DNA sequences is organized in interphase nucleus and on lampbrush chromosomes. We offer LTR-activation hypotheses of widespread satellite DNA transcription initiation during oogenesis. Recent explanations are provided for the significance of high-yield production of non-coding RNA derived from tandemly organized highly repetitive DNA. In many cases the data on the transcription of satellite DNA can be extrapolated from lampbrush chromosomes to interphase chromosomes. Lampbrush chromosomes with applied novel technical approaches such as superresolution imaging, chromosome microdissection followed by high-throughput sequencing, dynamic observation in life-like conditions provide amazing opportunities for investigation mechanisms of the satellite DNA transcription.
FRB 121102: A Starquake-induced Repeater?

Science.gov (United States)

Wang, Weiyang; Luo, Rui; Yue, Han; Chen, Xuelei; Lee, Kejia; Xu, Renxin

2018-01-01

Since its initial discovery, the fast radio burst (FRB) FRB 121102 has been found to be repeating with millisecond-duration pulses. Very recently, 14 new bursts were detected by the Green Bank Telescope during its continuous monitoring observations. In this paper, we show that the burst energy distribution has a power-law form which is very similar to the Gutenberg–Richter law of earthquakes. In addition, the distribution of burst waiting time can be described as a Poissonian or Gaussian distribution, which is consistent with earthquakes, while the aftershock sequence exhibits some local correlations. These findings suggest that the repeating FRB pulses may originate from the starquakes of a pulsar. Noting that the soft gamma-ray repeaters (SGRs) also exhibit such distributions, the FRB could be powered by some starquake mechanisms associated with the SGRs, including the crustal activity of a magnetar or solidification-induced stress of a newborn strangeon star. These conjectures could be tested with more repeating samples.
Targeted HIV-1 Latency Reversal Using CRISPR/Cas9-Derived Transcriptional Activator Systems.

Directory of Open Access Journals (Sweden)

Julia K Bialek

Full Text Available CRISPR/Cas9 technology is currently considered the most advanced tool for targeted genome engineering. Its sequence-dependent specificity has been explored for locus-directed transcriptional modulation. Such modulation, in particular transcriptional activation, has been proposed as key approach to overcome silencing of dormant HIV provirus in latently infected cellular reservoirs. Currently available agents for provirus activation, so-called latency reversing agents (LRAs, act indirectly through cellular pathways to induce viral transcription. However, their clinical performance remains suboptimal, possibly because reservoirs have diverse cellular identities and/or proviral DNA is intractable to the induced pathways. We have explored two CRISPR/Cas9-derived activator systems as targeted approaches to induce dormant HIV-1 proviral DNA. These systems recruit multiple transcriptional activation domains to the HIV 5' long terminal repeat (LTR, for which we have identified an optimal target region within the LTR U3 sequence. Using this target region, we demonstrate transcriptional activation of proviral genomes via the synergistic activation mediator complex in various in culture model systems for HIV latency. Observed levels of induction are comparable or indeed higher than treatment with established LRAs. Importantly, activation is complete, leading to production of infective viral particles. Our data demonstrate that CRISPR/Cas9-derived technologies can be applied to counteract HIV latency and may therefore represent promising novel approaches in the quest for HIV elimination.
Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii.

Science.gov (United States)

Khatri, Indu; Tomar, Rajul; Ganesan, K; Prasad, G S; Subramanian, Srikrishna

2017-03-23

The probiotic yeast, Saccharomyces boulardii (Sb) is known to be effective against many gastrointestinal disorders and antibiotic-associated diarrhea. To understand molecular basis of probiotic-properties ascribed to Sb we determined the complete genomes of two strains of Sb i.e. Biocodex and unique28 and the draft genomes for three other Sb strains that are marketed as probiotics in India. We compared these genomes with 145 strains of S. cerevisiae (Sc) to understand genome-level similarities and differences between these yeasts. A distinctive feature of Sb from other Sc is absence of Ty elements Ty1, Ty3, Ty4 and associated LTR. However, we could identify complete Ty2 and Ty5 elements in Sb. The genes for hexose transporters HXT11 and HXT9, and asparagine-utilization are absent in all Sb strains. We find differences in repeat periods and copy numbers of repeats in flocculin genes that are likely related to the differential adhesion of Sb as compared to Sc. Core-proteome based taxonomy places Sb strains along with wine strains of Sc. We find the introgression of five genes from Z. bailii into the chromosome IV of Sb and wine strains of Sc. Intriguingly, genes involved in conferring known probiotic properties to Sb are conserved in most Sc strains.
Alu repeats as markers for human population genetics

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Bazan, H. [Louisiana State Univ., New Orleans, LA (United States). Medical Center] [and others

1993-09-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 97.9% nucleotide identity with each other and an average of 98.9% nucleotide identity with the HS subfamily consensus sequence. HS Alu family members are thought to be derived from a single source ``master`` gene, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5 in. and 3 in. unique flanking DNA sequences from each HS Alu that allows the locus to be assayed for the presence or absence of an Alu repeat. Individual HS Alu sequences were found to be either monomorphic or dimorphic for the presence or absence of each repeat. The monomorphic HS Alu family members inserted in the human genome after the human/great ape divergence (which is thought to have occurred 4--6 million years ago), but before the radiation of modem man. The dimorphic HS Alu sequences inserted in the human genome after the radiation of modem man (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project as well. HS Alu family member insertion dimorphism differs from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) because individuals share HS Alu family member insertions based upon identity by descent from a common ancestor as a result of a single event which occurred one time within the human population. The VNTR and RFLP polymorphisms may arise multiple times within a population and are identical by state only.
Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme▿†

Science.gov (United States)

Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

2011-01-01

Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956
Derepression of the plant Chromovirus LORE1 induces germline transposition in regenerated plants.

Directory of Open Access Journals (Sweden)

Eigo Fukai

2010-03-01

Full Text Available Transposable elements represent a large proportion of the eukaryotic genomes. Long Terminal Repeat (LTR retrotransposons are very abundant and constitute the predominant family of transposable elements in plants. Recent studies have identified chromoviruses to be a widely distributed lineage of Gypsy elements. These elements contain chromodomains in their integrases, which suggests a preference for insertion into heterochromatin. In turn, this preference might have contributed to the patterning of heterochromatin observed in host genomes. Despite their potential importance for our understanding of plant genome dynamics and evolution, the regulatory mechanisms governing the behavior of chromoviruses and their activities remain largely uncharacterized. Here, we report a detailed analysis of the spatio-temporal activity of a plant chromovirus in the endogenous host. We examined LORE1a, a member of the endogenous chromovirus LORE1 family from the model legume Lotus japonicus. We found that this chromovirus is stochastically de-repressed in plant populations regenerated from de-differentiated cells and that LORE1a transposes in the male germline. Bisulfite sequencing of the 5' LTR and its surrounding region suggests that tissue culture induces a loss of epigenetic silencing of LORE1a. Since LTR promoter activity is pollen specific, as shown by the analysis of transgenic plants containing an LTR::GUS fusion, we conclude that male germline-specific LORE1a transposition in pollen grains is controlled transcriptionally by its own cis-elements. New insertion sites of LORE1a copies were frequently found in genic regions and show no strong insertional preferences. These distinctive novel features of LORE1 indicate that this chromovirus has considerable potential for generating genetic and epigenetic diversity in the host plant population. Our results also define conditions for the use of LORE1a as a genetic tool.
Colorimetric and dynamic light scattering detection of DNA sequences by using positively charged gold nanospheres: a comparative study with gold nanorods

Science.gov (United States)

Pylaev, T. E.; Khanadeev, V. A.; Khlebtsov, B. N.; Dykman, L. A.; Bogatyrev, V. A.; Khlebtsov, N. G.

2011-07-01

We introduce a new genosensing approach employing CTAB (cetyltrimethylammonium bromide)-coated positively charged colloidal gold nanoparticles (GNPs) to detect target DNA sequences by using absorption spectroscopy and dynamic light scattering. The approach is compared with a previously reported method employing unmodified CTAB-coated gold nanorods (GNRs). Both approaches are based on the observation that whereas the addition of probe and target ssDNA to CTAB-coated particles results in particle aggregation, no aggregation is observed after addition of probe and nontarget DNA sequences. Our goal was to compare the feasibility and sensitivity of both methods. A 21-mer ssDNA from the human immunodeficiency virus type 1 HIV-1 U5 long terminal repeat (LTR) sequence and a 23-mer ssDNA from the Bacillus anthracis cryptic protein and protective antigen precursor (pagA) genes were used as ssDNA models. In the case of GNRs, unexpectedly, the colorimetric test failed with perfect cigar-like particles but could be performed with dumbbell and dog-bone rods. By contrast, our approach with cationic CTAB-coated GNPs is easy to implement and possesses excellent feasibility with retention of comparable sensitivity—a 0.1 nM concentration of target cDNA can be detected with the naked eye and 10 pM by dynamic light scattering (DLS) measurements. The specificity of our method is illustrated by successful DLS detection of one-three base mismatches in cDNA sequences for both DNA models. These results suggest that the cationic GNPs and DLS can be used for genosensing under optimal DNA hybridization conditions without any chemical modifications of the particle surface with ssDNA molecules and signal amplification. Finally, we discuss a more than two-three-order difference in the reported estimations of the detection sensitivity of colorimetric methods (0.1 to 10-100 pM) to show that the existing aggregation models are inconsistent with the detection limits of about 0.1-1 pM DNA and that
Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.

Science.gov (United States)

Ostria-Hernández, Martha Lorena; Sánchez-Vallejo, Carlos Javier; Ibarra, J Antonio; Castro-Escarpulli, Graciela

2015-08-04

In recent years the emergence of multidrug resistant Klebsiella pneumoniae strains has been an increasingly common event. This opportunistic species is one of the five main bacterial pathogens that cause hospital infections worldwide and multidrug resistance has been associated with the presence of high molecular weight plasmids. Plasmids are generally acquired through horizontal transfer and therefore is possible that systems that prevent the entry of foreign genetic material are inactive or absent. One of these systems is CRISPR/Cas. However, little is known regarding the clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) system in K. pneumoniae. The adaptive immune system CRISPR/Cas has been shown to limit the entry of foreign genetic elements into bacterial organisms and in some bacteria it has been shown to be involved in regulation of virulence genes. Thus in this work we used bioinformatics tools to determine the presence or absence of CRISPR/Cas systems in available K. pneumoniae genomes. The complete CRISPR/Cas system was identified in two out of the eight complete K. pneumoniae genomes sequences and in four out of the 44 available draft genomes sequences. The cas genes in these strains comprises eight cas genes similar to those found in Escherichia coli, suggesting they belong to the type I-E group, although their arrangement is slightly different. As for the CRISPR sequences, the average lengths of the direct repeats and spacers were 29 and 33 bp, respectively. BLAST searches demonstrated that 38 of the 116 spacer sequences (33%) are significantly similar to either plasmid, phage or genome sequences, while the remaining 78 sequences (67%) showed no significant similarity to other sequences. The region where the CRISPR/Cas systems were located is the same in all the Klebsiella genomes containing it, it has a syntenic architecture, and is located among genes encoding for proteins likely involved in
Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles

Directory of Open Access Journals (Sweden)

Oussama Ahrazem

2012-05-01

Full Text Available Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups—Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco—while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.
Unique CCT repeats mediate transcription of the TWIST1 gene in mesenchymal cell lines

International Nuclear Information System (INIS)

Ohkuma, Mizue; Funato, Noriko; Higashihori, Norihisa; Murakami, Masanori; Ohyama, Kimie; Nakamura, Masataka

2007-01-01

TWIST1, a basic helix-loop-helix transcription factor, plays critical roles in embryo development, cancer metastasis and mesenchymal progenitor differentiation. Little is known about transcriptional regulation of TWIST1 expression. Here we identified DNA sequences responsible for TWIST1 expression in mesenchymal lineage cell lines. Reporter assays with TWIST1 promoter mutants defined the -102 to -74 sequences that are essential for TWIST1 expression in human and mouse mesenchymal cell lines. Tandem repeats of CCT, but not putative CREB and NF-κB sites in the sequences substantially supported activity of the TWIST1 promoter. Electrophoretic mobility shift assay demonstrated that the DNA sequences with the CCT repeats formed complexes with nuclear factors, containing, at least, Sp1 and Sp3. These results suggest critical implication of the CCT repeats in association with Sp1 and Sp3 factors in sustaining expression of the TWIST1 gene in mesenchymal cells
Comparing Whole-Genome Sequencing with Sanger Sequencing for spa Typing of Methicillin-Resistant Staphylococcus aureus

DEFF Research Database (Denmark)

Bartels, Mette Damkjaer; Petersen, Andreas; Worning, Peder

2014-01-01

spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and ...
Interstitial telomere-like repeats in the Arabidopsis thaliana genome.

Science.gov (United States)

Uchida, Wakana; Matsunaga, Sachihiro; Sugiyama, Ryuji; Kawano, Shigeyuki

2002-02-01

Eukaryotic chromosomal ends are protected by telomeres, which are thought to play an important role in ensuring the complete replication of chromosomes. On the other hand, non-functional telomere-like repeats in the interchromosomal regions (interstitial telomeric repeats; ITRs) have been reported in several eukaryotes. In this study, we identified eight ITRs in the Arabidopsis thaliana genome, each consisting of complete and degenerate 300- to 1200-bp sequences. The ITRs were grouped into three classes (class IA-B, class II, and class IIIA-E) based on the degeneracy of the telomeric repeats in ITRs. The telomeric repeats of the two ITRs in class I were conserved for the most part, whereas the single ITR in class II, and the five ITRs in class III were relatively degenerated. In addition, degenerate ITRs were surrounded by common sequences that shared 70-100% homology to each other; these are named ITR-adjacent sequences (IAS). Although the genomic regions around ITRs in class I lacked IAS, those around ITRs in class II contained IAS (IASa), and those around five ITRs in class III had nine types of IAS (IASb, c, d, e, f, g, h, i, and j). Ten IAS types in classes II and III showed no significant homology to each other. The chromosomal locations of ITRs and IAS were not category-related, but most of them were adjacent to, or part of, a centromere. These results show that the A. thaliana genome has undergone chromosomal rearrangements, such as end-fusions and segmental duplications.

Transcriptionally active LTR retrotransposons in Eucalyptus genus are differentially expressed and insertionally polymorphic.

Science.gov (United States)

Marcon, Helena Sanches; Domingues, Douglas Silva; Silva, Juliana Costa; Borges, Rafael Junqueira; Matioli, Fábio Filippi; Fontes, Marcos Roberto de Mattos; Marino, Celso Luis

2015-08-14

In Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed by retrotransposons and this data provides an important opportunity to understand TE dynamics in Eucalyptus genome and transcriptome. We characterized nine families of transcriptionally active LTR retrotransposons from Copia and Gypsy superfamilies in Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have their insertions occurring Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values. Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity have differentially affected genome and transcriptome among Eucalyptus species.
Acquiring a cognitive skill with a new repeating version of the Tower of London task.

Science.gov (United States)

Ouellet, Marie-Christine; Beauchamp, Miriam H; Owen, Adrian M; Doyon, Julien

2004-12-01

A computerized version of the Tower of London task was used to investigate cognitive skill learning. Thirty-six healthy volunteers were assigned to either a random condition (nonrecurring problems), or to a sequence condition in which, unbeknownst to the subjects, a repeating sequence of three problems was presented. Indices of execution, planning, and total time, as well as number of moves performed, were used to measure behavioural change. Subjects' performance improved in both conditions across blocks of practice. A distinct learning effect related to the repeating sequence was also observed. This suggests that a specific skill that reflects procedural learning of the strategies, rules, and procedures pertaining to repeating problems can develop over and above a more general skill at solving cognitive planning problems with practice.
Gene conversion homogenizes the CMT1A paralogous repeats

Directory of Open Access Journals (Sweden)

Hurles Matthew E

2001-12-01

Full Text Available Abstract Background Non-allelic homologous recombination between paralogous repeats is increasingly being recognized as a major mechanism causing both pathogenic microdeletions and duplications, and structural polymorphism in the human genome. It has recently been shown empirically that gene conversion can homogenize such repeats, resulting in longer stretches of absolute identity that may increase the rate of non-allelic homologous recombination. Results Here, a statistical test to detect gene conversion between pairs of non-coding sequences is presented. It is shown that the 24 kb Charcot-Marie-Tooth type 1A paralogous repeats (CMT1A-REPs exhibit the imprint of gene conversion processes whilst control orthologous sequences do not. In addition, Monte Carlo simulations of the evolutionary divergence of the CMT1A-REPs, incorporating two alternative models for gene conversion, generate repeats that are statistically indistinguishable from the observed repeats. Bounds are placed on the rate of these conversion processes, with central values of 1.3 × 10-4 and 5.1 × 10-5 per generation for the alternative models. Conclusions This evidence presented here suggests that gene conversion may have played an important role in the evolution of the CMT1A-REP paralogous repeats. The rates of these processes are such that it is probable that homogenized CMT1A-REPs are polymorphic within modern populations. Gene conversion processes are similarly likely to play an important role in the evolution of other segmental duplications and may influence the rate of non-allelic homologous recombination between them.
Genetic characterization and phylogeny of human T-cell lymphotropic virus type I from Chile.

Science.gov (United States)

Ramirez, E; Cartier, L; Villota, C; Fernandez, J

2002-03-20

Infection with Human T-Cell Lymphotropic Virus type I (HTLV-I) have been associated with the development of the HTLV-I associated myelopathy/tropical spastic paraparesis (HAM/TSP). Phylogenetic analyses of HTLV-I isolates have revealed that HTLV-I can be classified into three major groups: the Cosmopolitan, Central African and Melanesian. In the present study, we analyzed the tax, 5' ltr, gag, pol, and env sequences of proviruses of PBMC from ten HAM/TSP patients to investigate the phylogenetic characterization of HTLV-I in Chilean patients. HTLV-I provirus in PBMC from ten Chilean patients with HAM/TSP were amplified by PCR using primers of tax, 5' ltr, gag, pol, and env genes. Amplified products of the five genes were purified and nucleotide sequence was determined by the dideoxy termination procedure. DNA sequences were aligned with the CLUSTAL W program. The results of this study showed that the tax, 5' ltr, gag, pol, and env gene of the Chilean HTLV-I strains had a nucleotide homology ranged from 98.1 to 100%, 95 to 97%, 98.9 to 100%, 94 to 98%, and 94.2 to 98.5% respect to ATK-1 clone, respectively. According to molecular phylogeny with 5' ltr gene, the Chilean HTLV-I strains were grouped with each other suggesting one cluster included in Transcontinental subgroup.
Transposon fingerprinting using low coverage whole genome shotgun sequencing in cacao (Theobroma cacao L.) and related species.

Science.gov (United States)

Sveinsson, Saemundur; Gill, Navdeep; Kane, Nolan C; Cronk, Quentin

2013-07-24

Transposable elements (TEs) and other repetitive elements are a large and dynamically evolving part of eukaryotic genomes, especially in plants where they can account for a significant proportion of genome size. Their dynamic nature gives them the potential for use in identifying and characterizing crop germplasm. However, their repetitive nature makes them challenging to study using conventional methods of molecular biology. Next generation sequencing and new computational tools have greatly facilitated the investigation of TE variation within species and among closely related species. (i) We generated low-coverage Illumina whole genome shotgun sequencing reads for multiple individuals of cacao (Theobroma cacao) and related species. These reads were analysed using both an alignment/mapping approach and a de novo (graph based clustering) approach. (ii) A standard set of ultra-conserved orthologous sequences (UCOS) standardized TE data between samples and provided phylogenetic information on the relatedness of samples. (iii) The mapping approach proved highly effective within the reference species but underestimated TE abundance in interspecific comparisons relative to the de novo methods. (iv) Individual T. cacao accessions have unique patterns of TE abundance indicating that the TE composition of the genome is evolving actively within this species. (v) LTR/Gypsy elements are the most abundant, comprising c.10% of the genome. (vi) Within T. cacao the retroelement families show an order of magnitude greater sequence variability than the DNA transposon families. (vii) Theobroma grandiflorum has a similar TE composition to T. cacao, but the related genus Herrania is rather different, with LTRs making up a lower proportion of the genome, perhaps because of a massive presence (c. 20%) of distinctive low complexity satellite-like repeats in this genome. (i) Short read alignment/mapping to reference TE contigs provides a simple and effective method of investigating
Epigenetic regulation of transcription and possible functions of mammalian short interspersed elements, SINEs.

Science.gov (United States)

Ichiyanagi, Kenji

2013-01-01

Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, the SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms were originated as a host defense system against proliferation of parasitic elements, this review discusses a possibility that the same mechanisms are also used to regulate the SINE-derived functions.
Human-specific HERV-K insertion causes genomic variations in the human genome.

Directory of Open Access Journals (Sweden)

Wonseok Shin

Full Text Available Human endogenous retroviruses (HERV sequences account for about 8% of the human genome. Through comparative genomics and literature mining, we identified a total of 29 human-specific HERV-K insertions. We characterized them focusing on their structure and flanking sequence. The results showed that four of the human-specific HERV-K insertions deleted human genomic sequences via non-classical insertion mechanisms. Interestingly, two of the human-specific HERV-K insertion loci contained two HERV-K internals and three LTR elements, a pattern which could be explained by LTR-LTR ectopic recombination or template switching. In addition, we conducted a polymorphic test and observed that twelve out of the 29 elements are polymorphic in the human population. In conclusion, human-specific HERV-K elements have inserted into human genome since the divergence of human and chimpanzee, causing human genomic changes. Thus, we believe that human-specific HERV-K activity has contributed to the genomic divergence between humans and chimpanzees, as well as within the human population.
Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes.

Science.gov (United States)

Rius, Nuria; Guillén, Yolanda; Delprat, Alejandra; Kapusta, Aurélie; Feschotte, Cédric; Ruiz, Alfredo

2016-05-10

Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light into the understanding of genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genome. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7,5 % of D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy.
High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform

DEFF Research Database (Denmark)

Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen; Rockenbauer, Eszter

2011-01-01

repeat units. These methods do not allow for the full resolution of STR base composition that sequencing approaches could provide. Here we present an STR profiling method based on the use of the Roche Genome Sequencer (GS) FLX to simultaneously sequence multiple core STR loci. Using this method...
Cell type-specific termination of transcription by transposable element sequences.

Science.gov (United States)

Conley, Andrew B; Jordan, I King

2012-09-30

Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription
Complete chloroplast genome sequence of a major economic species, Ziziphus jujuba (Rhamnaceae).

Science.gov (United States)

Ma, Qiuyue; Li, Shuxian; Bi, Changwei; Hao, Zhaodong; Sun, Congrui; Ye, Ning

2017-02-01

Ziziphus jujuba is an important woody plant with high economic and medicinal value. Here, we analyzed and characterized the complete chloroplast (cp) genome of Z. jujuba, the first member of the Rhamnaceae family for which the chloroplast genome sequence has been reported. We also built a web browser for navigating the cp genome of Z. jujuba ( http://bio.njfu.edu.cn/gb2/gbrowse/Ziziphus_jujuba_cp/ ). Sequence analysis showed that this cp genome is 161,466 bp long and has a typical quadripartite structure of large (LSC, 89,120 bp) and small (SSC, 19,348 bp) single-copy regions separated by a pair of inverted repeats (IRs, 26,499 bp). The sequence contained 112 unique genes, including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. The genome structure, gene order, GC content, and codon usage are similar to other typical angiosperm cp genomes. A total of 38 tandem repeats, two forward repeats, and three palindromic repeats were detected in the Z. jujuba cp genome. Simple sequence repeat (SSR) analysis revealed that most SSRs were AT-rich. The homopolymer regions in the cp genome of Z. jujuba were verified and manually corrected by Sanger sequencing. One-third of mononucleotide repeats were found to be erroneously sequenced by the 454 pyrosequencing, which resulted in sequences of 1-4 bases shorter than that by the Sanger sequencing. Analyzing the cp genome of Z. jujuba revealed that the IR contraction and expansion events resulted in ycf1 and rps19 pseudogenes. A phylogenetic analysis based on 64 protein-coding genes showed that Z. jujuba was closely related to members of the Elaeagnaceae family, which will be helpful for phylogenetic studies of other Rosales species. The complete cp genome sequence of Z. jujuba will facilitate population, phylogenetic, and cp genetic engineering studies of this economic plant.
A Tat-conjugated Peptide Nucleic Acid Tat-PNA-DR Inhibits Hepatitis B Virus Replication In Vitro and In Vivo by Targeting LTR Direct Repeats of HBV RNA

Science.gov (United States)

Zeng, Zhengyang; Han, Shisong; Hong, Wei; Lang, Yange; Li, Fangfang; Liu, Yongxiang; Li, Zeyong; Wu, Yingliang; Li, Wenxin; Zhang, Xianzheng; Cao, Zhijian

2016-01-01

Hepatitis B virus (HBV) infection is a major cause of chronic active hepatitis, cirrhosis, and primary hepatocellular carcinoma, all of which are severe threats to human health. However, current clinical therapies for HBV are limited by potential side effects, toxicity, and drug-resistance. In this study, a cell-penetrating peptide-conjugated peptide nucleic acid (PNA), Tat-PNA-DR, was designed to target the direct repeat (DR) sequences of HBV. Tat-PNA-DR effectively inhibited HBV replication in HepG2.2.15 cells. Its anti-HBV effect relied on the binding of Tat-PNA-DR to the DR, whereby it suppressed the translation of hepatitis B e antigen (HBeAg), HBsAg, HBV core, hepatitis B virus x protein, and HBV reverse transcriptase (RT) and the reverse transcription of the HBV genome. Furthermore, Tat-PNA-DR administered by intravenous injection efficiently cleared HBeAg and HBsAg in an acute hepatitis B mouse model. Importantly, it induced an 80% decline in HBV DNA in mouse serum, which was similar to the effect of the widely used clinical drug Lamivudine (3TC). Additionally, a long-term hydrodynamics HBV mouse model also demonstrated Tat-PNA-DR's antiviral effect. Interestingly, Tat-PNA-DR displayed low cytotoxicity, low mouse acute toxicity, low immunogenicity, and high serum stability. These data indicate that Tat-PNA-DR is a unique PNA and a promising drug candidate against HBV. PMID:26978579
A Tat-conjugated Peptide Nucleic Acid Tat-PNA-DR Inhibits Hepatitis B Virus Replication In Vitro and In Vivo by Targeting LTR Direct Repeats of HBV RNA

Directory of Open Access Journals (Sweden)

Zhengyang Zeng

2016-01-01

Full Text Available Hepatitis B virus (HBV infection is a major cause of chronic active hepatitis, cirrhosis, and primary hepatocellular carcinoma, all of which are severe threats to human health. However, current clinical therapies for HBV are limited by potential side effects, toxicity, and drug-resistance. In this study, a cell-penetrating peptide-conjugated peptide nucleic acid (PNA, Tat-PNA-DR, was designed to target the direct repeat (DR sequences of HBV. Tat-PNA-DR effectively inhibited HBV replication in HepG2.2.15 cells. Its anti-HBV effect relied on the binding of Tat-PNA-DR to the DR, whereby it suppressed the translation of hepatitis B e antigen (HBeAg, HBsAg, HBV core, hepatitis B virus x protein, and HBV reverse transcriptase (RT and the reverse transcription of the HBV genome. Furthermore, Tat-PNA-DR administered by intravenous injection efficiently cleared HBeAg and HBsAg in an acute hepatitis B mouse model. Importantly, it induced an 80% decline in HBV DNA in mouse serum, which was similar to the effect of the widely used clinical drug Lamivudine (3TC. Additionally, a long-term hydrodynamics HBV mouse model also demonstrated Tat-PNA-DR's antiviral effect. Interestingly, Tat-PNA-DR displayed low cytotoxicity, low mouse acute toxicity, low immunogenicity, and high serum stability. These data indicate that Tat-PNA-DR is a unique PNA and a promising drug candidate against HBV.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.

Science.gov (United States)

Militello, Kevin T; Lazatin, Justine C

2017-05-01

Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Genome survey sequencing and genetic background characterization of Gracilariopsis lemaneiformis (Rhodophyta) based on next-generation sequencing.

Science.gov (United States)

Zhou, Wei; Hu, Yiyi; Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon.
Genome Survey Sequencing and Genetic Background Characterization of Gracilariopsis lemaneiformis (Rhodophyta) Based on Next-Generation Sequencing

Science.gov (United States)

Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon. PMID:23875008
Large scale analysis of small repeats via mining of the human genome

NARCIS (Netherlands)

van den Berg, I.; Bosnacki, D.; Hilbers, P.A.J.

2009-01-01

Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, both in coding and in non-coding regions. Their role is still mostly unknown, but at least 20 of those repetitive sequences have been related to neurodegenerative disorders. The mutational process that is
Genetic diversity among Puccinia melanocephala isolates from Brazil assessed using simple sequence repeat markers.

Science.gov (United States)

Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A

2014-09-26

Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.
Massively parallel sequencing of forensic STRs

DEFF Research Database (Denmark)

Parson, Walther; Ballard, David; Budowle, Bruce

2016-01-01

The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that...
Analysis of CR1 Repeats in the Zebra Finch Genome

Directory of Open Access Journals (Sweden)

George E. Liu

2013-06-01

Full Text Available Most bird species have smaller genomes and fewer repeats than mammals. Chicken Repeat 1 (CR1 repeat is one of the most abundant families of repeats, ranging from ~133,000 to ~187,000 copies accounting for ~50 to ~80% of the interspersed repeats in the zebra finch and chicken genomes, respectively. CR1 repeats are believed to have arisen from the retrotransposition of a small number of master elements, which gave rise to multiple CR1 subfamilies in the chicken. In this study, we performed a global assessment of the divergence distributions, phylogenies, and consensus sequences of CR1 repeats in the zebra finch genome. We identified and validated 34 CR1 subfamilies and further analyzed the correlation between these subfamilies. We also discovered 4 novel lineage-specific CR1 subfamilies in the zebra finch when compared to the chicken genome. We built various evolutionary trees of these subfamilies and concluded that CR1 repeats may play an important role in reshaping the structure of bird genomes.

Variable number of tandem repeat markers in the genome sequence of Mycosphaerella fijiensis, the causal agent of black leaf streak disease of banana (Musa spp).

Science.gov (United States)

Garcia, S A L; Van der Lee, T A J; Ferreira, C F; Te Lintel Hekkert, B; Zapater, M-F; Goodwin, S B; Guzmán, M; Kema, G H J; Souza, M T

2010-11-09

We searched the genome of Mycosphaerella fijiensis for molecular markers that would allow population genetics analysis of this plant pathogen. M. fijiensis, the causal agent of banana leaf streak disease, also known as black Sigatoka, is the most devastating pathogen attacking bananas (Musa spp). Recently, the entire genome sequence of M. fijiensis became available. We screened this database for VNTR markers. Forty-two primer pairs were selected for validation, based on repeat type and length and the number of repeat units. Five VNTR markers showing multiple alleles were validated with a reference set of isolates from different parts of the world and a population from a banana plantation in Costa Rica. Polymorphism information content values varied from 0.6414 to 0.7544 for the reference set and from 0.0400 and 0.7373 for the population set. Eighty percent of the polymorphism information content values were above 0.60, indicating that the markers are highly informative. These markers allowed robust scoring of agarose gels and proved to be useful for variability and population genetics studies. In conclusion, the strategy we developed to identify and validate VNTR markers is an efficient means to incorporate markers that can be used for fungicide resistance management and to develop breeding strategies to control banana black leaf streak disease. This is the first report of VNTR-minisatellites from the M. fijiensis genome sequence.
Genomic organization, sequence divergence, and recombination of feline immunodeficiency virus from lions in the wild

Science.gov (United States)

Pecon-Slattery, Jill; McCracken, Carrie L; Troyer, Jennifer L; VandeWoude, Sue; Roelke, Melody; Sondgeroth, Kerry; Winterbach, Christiaan; Winterbach, Hanlie; O'Brien, Stephen J

2008-01-01

Background Feline immunodeficiency virus (FIV) naturally infects multiple species of cat and is related to human immunodeficiency virus in humans. FIV infection causes AIDS-like disease and mortality in the domestic cat (Felis catus) and serves as a natural model for HIV infection in humans. In African lions (Panthera leo) and other exotic felid species, disease etiology introduced by FIV infection are less clear, but recent studies indicate that FIV causes moderate to severe CD4 depletion. Results In this study, comparative genomic methods are used to evaluate the full proviral genome of two geographically distinct FIV subtypes isolated from free-ranging lions. Genome organization of FIVPle subtype B (9891 bp) from lions in the Serengeti National Park in Tanzania and FIVPle subtype E (9899 bp) isolated from lions in the Okavango Delta in Botswana, both resemble FIV genome sequence from puma, Pallas cat and domestic cat across 5' LTR, gag, pol, vif, orfA, env, rev and 3'LTR regions. Comparative analyses of available full-length FIV consisting of subtypes A, B and C from FIVFca, Pallas cat FIVOma and two puma FIVPco subtypes A and B recapitulate the species-specific monophyly of FIV marked by high levels of genetic diversity both within and between species. Across all FIVPle gene regions except env, lion subtypes B and E are monophyletic, and marginally more similar to Pallas cat FIVOma than to other FIV. Sequence analyses indicate the SU and TM regions of env vary substantially between subtypes, with FIVPle subtype E more related to domestic cat FIVFca than to FIVPle subtype B and FIVOma likely reflecting recombination between strains in the wild. Conclusion This study demonstrates the necessity of whole-genome analysis to complement population/gene-based studies, which are of limited utility in uncovering complex events such as recombination that may lead to functional differences in virulence and pathogenicity. These full-length lion lentiviruses are integral to
Genomic organization, sequence divergence, and recombination of feline immunodeficiency virus from lions in the wild

Directory of Open Access Journals (Sweden)

Sondgeroth Kerry

2008-02-01

Full Text Available Abstract Background Feline immunodeficiency virus (FIV naturally infects multiple species of cat and is related to human immunodeficiency virus in humans. FIV infection causes AIDS-like disease and mortality in the domestic cat (Felis catus and serves as a natural model for HIV infection in humans. In African lions (Panthera leo and other exotic felid species, disease etiology introduced by FIV infection are less clear, but recent studies indicate that FIV causes moderate to severe CD4 depletion. Results In this study, comparative genomic methods are used to evaluate the full proviral genome of two geographically distinct FIV subtypes isolated from free-ranging lions. Genome organization of FIVPle subtype B (9891 bp from lions in the Serengeti National Park in Tanzania and FIVPle subtype E (9899 bp isolated from lions in the Okavango Delta in Botswana, both resemble FIV genome sequence from puma, Pallas cat and domestic cat across 5' LTR, gag, pol, vif, orfA, env, rev and 3'LTR regions. Comparative analyses of available full-length FIV consisting of subtypes A, B and C from FIVFca, Pallas cat FIVOma and two puma FIVPco subtypes A and B recapitulate the species-specific monophyly of FIV marked by high levels of genetic diversity both within and between species. Across all FIVPle gene regions except env, lion subtypes B and E are monophyletic, and marginally more similar to Pallas cat FIVOma than to other FIV. Sequence analyses indicate the SU and TM regions of env vary substantially between subtypes, with FIVPle subtype E more related to domestic cat FIVFca than to FIVPle subtype B and FIVOma likely reflecting recombination between strains in the wild. Conclusion This study demonstrates the necessity of whole-genome analysis to complement population/gene-based studies, which are of limited utility in uncovering complex events such as recombination that may lead to functional differences in virulence and pathogenicity. These full-length lion
Simple Sequence Repeat Analysis of Selected NSIC-registered Coffee Varieties in the Philippines

Directory of Open Access Journals (Sweden)

Daisy May C. Santos

2016-06-01

Full Text Available Coffee (Coffea sp. is an important commercial crop worldwide. Three species of coffee are used as beverage, namely Coffea arabica, C. canephora, and C. liberica. Coffea arabica L. is the most cultivated among the three coffee species due to its taste quality, rich aroma, and low caffeine content. Despite its inferior taste and aroma, C. canephora Pierre ex A. Froehner, which has the highest caffeine content, is the second most widely cultivated because of its resistance to coffee diseases. On the other hand, C. liberica W.Bull ex Hierncomes is characterized by its very strong taste and flavor. The Philippines used to be a leading exporter of coffee until coffee rust destroyed the farms in Batangas, home of the famous Kapeng Barako. The country has been attempting to revive the coffee industry by focusing on the production of specialty coffee with registered varieties on the National Seed Industry Council (NSIC. Correct identification and isolation of pure coffee beans are the main factors that determine coffee’s market value. Local farms usually misidentify and mix coffee beans of different varieties, leading to the depreciation of their value. This study used simple sequence repeat (SSR markers to evaluate and distinguish Philippine NSIC-registered coffee species and varieties. The neighbor-joining tree generated using PAUP showed high bootstrap support, separating C. arabica, C. canephora, and C. liberica from each other. Among the twenty primer pairs used, seven were able to distinguish C. arabica, nine for C. liberica, and one for C. canephora.
In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

Science.gov (United States)

Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

2015-10-01

Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as for example the Fragile-X syndrome. Until now the mechanisms are not clearly understood as to how these expansions develop during cell proliferation. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promotor region was similarly condensed like a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. Due to further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of heredity diseases.
Genetic Diversity Assessment and Identification of New Sour Cherry Genotypes Using Intersimple Sequence Repeat Markers

Directory of Open Access Journals (Sweden)

Roghayeh Najafzadeh

2014-01-01

Full Text Available Iran is one of the chief origins of subgenus Cerasus germplasm. In this study, the genetic variation of new Iranian sour cherries (which had such superior growth characteristics and fruit quality as to be considered for the introduction of new cultivars was investigated and identified using 23 intersimple sequence repeat (ISSR markers. Results indicated a high level of polymorphism of the genotypes based on these markers. According to these results, primers tested in this study specially ISSR-4, ISSR-6, ISSR-13, ISSR-14, ISSR-16, and ISSR-19 produced good and various levels of amplifications which can be effectively used in genetic studies of the sour cherry. The genetic similarity among genotypes showed a high diversity among the genotypes. Cluster analysis separated improved cultivars from promising Iranian genotypes, and the PCoA supported the cluster analysis results. Since the Iranian genotypes were superior to the improved cultivars and were separated from them in most groups, these genotypes can be considered as distinct genotypes for further evaluations in the framework of breeding programs and new cultivar identification in cherries. Results also confirmed that ISSR is a reliable DNA marker that can be used for exact genetic studies and in sour cherry breeding programs.
Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

Science.gov (United States)

Martin, Andrew C R

2014-01-01

The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.
Comparison of the carboxy-terminal DP-repeat region in the co-chaperones Hop and Hip.

Science.gov (United States)

Nelson, Gregory M; Huffman, Holly; Smith, David F

2003-01-01

Functional steroid receptor complexes are assembled and maintained by an ordered pathway of interactions involving multiple components of the cellular chaperone machinery. Two of these components, Hop and Hip, serve as co-chaperones to the major heat shock proteins (Hsps), Hsp70 and Hsp90, and participate in intermediate stages of receptor assembly. In an effort to better understand the functions of Hop and Hip in the assembly process, we focused on a region of similarity located near the C-terminus of each co-chaperone. Contained within this region is a repeated sequence motif we have termed the DP repeat. Earlier mutagenesis studies implicated the DP repeat of either Hop or Hip in Hsp70 binding and in normal assembly of the co-chaperones with progesterone receptor (PR) complexes. We report here that the DP repeat lies within a protease-resistant domain that extends to or is near the C-terminus of both co-chaperones. Point mutations in the DP repeats render the C-terminal regions hypersensitive to proteolysis. In addition, a Hop DP mutant displays altered proteolytic digestion patterns, which suggest that the DP-repeat region influences the folding of other Hop domains. Although the respective DP regions of Hop and Hip share sequence and structural similarities, they are not functionally interchangeable. Moreover, a double-point mutation within the second DP-repeat unit of Hop that converts this to the sequence found in Hip disrupts Hop function; however, the corresponding mutation in Hip does not alter its function. We conclude that the DP repeats are important structural elements within a C-terminal domain, which is important for Hop and Hip function.
ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

Science.gov (United States)

Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

2012-11-07

Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40
Capillary electrophoresis fragment analysis and clone sequencing in detection of dynamic mutations of spinocerebellar ataxia

Directory of Open Access Journals (Sweden)

Yuan-yuan CHEN

2018-04-01

Full Text Available Objective To estimate the accuracy and stability of capillary electrophoresis fragment analysis and clone sequencing in detecting dynamic mutations of spinocerebellar ataxia (SCA. Methods Capillary electrophoresis fragment analysis and clone sequencing were used in detecting trinucleotide repeated sequence of 14 SCA patients (3 cases of SCA2, 2 cases of SCA7, 7 cases of SCA8 and 2 cases of SCA17. Results Capillary electrophoresis fragment analysis of 3 SCA2 cases showed the expanded cytosine-adenine-guanine (CAG repeats were 31, 30 and 32, and the copy numbers of 3 clone sequencing for 3 colonies in each case were 37/40/40, 37/38/39 and 38/39/40 respectively. Capillary electrophoresis fragment analysis of 2 SCA7 cases showed the expanded CAG repeats were 57 and 34, and the copy numbers of repeats were 69, 74, 75 in 3 colonies of one case, and was 45 in the other case. For the 7 SCA8 cases with the expanded cytosine-thymine-adenine (CTA/cytosine-thymine-guanine (CTG repeats of 99, 111, 104, 92, 89, 104 and 75, the results of clone sequencing were 97, 116, 104, 90, 90, 102 and 76 respectively. For 2 SCA17 cases with the short/expanded CAG repeats of 37/50 and 36/45, the results of clone sequencing were 51/50/52 and 45/44 for 3 and 2 colonies. Conclusions Although the higher mobility of polymerase chain reaction (PCR products containing dynamic mutation in the capillary electrophoresis fragment analysis might cause the deviation for analysis of copy numbers, the deviation was predictable and the results were repeatable. The clone sequencing results showed obvious instability, especially for SCA2 and SCA7 genes, which might owing to their simple CAG repeats. Consequently, clone sequencing is not suited for detection of dynamic mutation, not to mention the quantitative criteria of dynamic mutation sequencing. DOI: 10.3969/j.issn.1672-6731.2018.03.008
Evaluation of genetic diversity amongst Descurainia sophia L. genotypes by inter-simple sequence repeat (ISSR) marker.

Science.gov (United States)

Saki, Sahar; Bagheri, Hedayat; Deljou, Ali; Zeinalabedini, Mehrshad

2016-01-01

Descurainia sophia is a valuable medicinal plant in family of Brassicaceae. To determine the range of diversity amongst D. sophia in Iran, 32 naturally distributed plants belonging to six natural populations of the Iranian plateau were investigated by inter-simple sequence repeat (ISSR) markers. The average percentage of polymorphism produced by 12 ISSR primers was 86 %. The PIC values for primers ranged from 0.22 to 0.40 and Rp values ranged between 6.5 and 19.9. The relative genetic diversity of the populations was not high (Gst =0.32). However, the value of gene flow revealed by the ISSR marker was high (Nm = 1.03). UPGMA clustering method based on Jaccard similarity coefficient grouped the genotypes into two major clusters. Graph results from Neighbor-Net Network generated after a 1000 bootstrap test using Jaccard coefficient, and STRUCTURE analysis confirmed the UPGMA clustering. The first three PCAs represented 57.31 % of the total variation. The high levels of genetic diversity were observed within populations, which is useful in breeding and conservation programs. ISSR is found to be an eligible marker to study genetic diversity of D. sophia.
A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats.

Science.gov (United States)

Beloglazova, Natalia; Brown, Greg; Zimmerman, Matthew D; Proudfoot, Michael; Makarova, Kira S; Kudritska, Marina; Kochinyan, Samvel; Wang, Shuren; Chruszcz, Maksymilian; Minor, Wladek; Koonin, Eugene V; Edwards, Aled M; Savchenko, Alexei; Yakunin, Alexander F

2008-07-18

Clustered regularly interspaced short palindromic repeats (CRISPRs) together with the associated CAS proteins protect microbial cells from invasion by foreign genetic elements using presently unknown molecular mechanisms. All CRISPR systems contain proteins of the CAS2 family, suggesting that these uncharacterized proteins play a central role in this process. Here we show that the CAS2 proteins represent a novel family of endoribonucleases. Six purified CAS2 proteins from diverse organisms cleaved single-stranded RNAs preferentially within U-rich regions. A representative CAS2 enzyme, SSO1404 from Sulfolobus solfataricus, cleaved the phosphodiester linkage on the 3'-side and generated 5'-phosphate- and 3'-hydroxyl-terminated oligonucleotides. The crystal structure of SSO1404 was solved at 1.6A resolution revealing the first ribonuclease with a ferredoxin-like fold. Mutagenesis of SSO1404 identified six residues (Tyr-9, Asp-10, Arg-17, Arg-19, Arg-31, and Phe-37) that are important for enzymatic activity and suggested that Asp-10 might be the principal catalytic residue. Thus, CAS2 proteins are sequence-specific endoribonucleases, and we propose that their role in the CRISPR-mediated anti-phage defense might involve degradation of phage or cellular mRNAs.
TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Science.gov (United States)

Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

2010-06-15

Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.
Sequence diversities of serine-aspartate repeat genes among Staphylococcus aureus isolates from different hosts presumably by horizontal gene transfer.

Directory of Open Access Journals (Sweden)

Huping Xue

Full Text Available BACKGROUND: Horizontal gene transfer (HGT is recognized as one of the major forces for bacterial genome evolution. Many clinically important bacteria may acquire virulence factors and antibiotic resistance through HGT. The comparative genomic analysis has become an important tool for identifying HGT in emerging pathogens. In this study, the Serine-Aspartate Repeat (Sdr family has been compared among different sources of Staphylococcus aureus (S. aureus to discover sequence diversities within their genomes. METHODOLOGY/PRINCIPAL FINDINGS: Four sdr genes were analyzed for 21 different S. aureus strains and 218 mastitis-associated S. aureus isolates from Canada. Comparative genomic analyses revealed that S. aureus strains from bovine mastitis (RF122 and mastitis isolates in this study, ovine mastitis (ED133, pig (ST398, chicken (ED98, and human methicillin-resistant S. aureus (MRSA (TCH130, MRSA252, Mu3, Mu50, N315, 04-02981, JH1 and JH9 were highly associated with one another, presumably due to HGT. In addition, several types of insertion and deletion were found in sdr genes of many isolates. A new insertion sequence was found in mastitis isolates, which was presumably responsible for the HGT of sdrC gene among different strains. Moreover, the sdr genes could be used to type S. aureus. Regional difference of sdr genes distribution was also indicated among the tested S. aureus isolates. Finally, certain associations were found between sdr genes and subclinical or clinical mastitis isolates. CONCLUSIONS: Certain sdr gene sequences were shared in S. aureus strains and isolates from different species presumably due to HGT. Our results also suggest that the distributional assay of virulence factors should detect the full sequences or full functional regions of these factors. The traditional assay using short conserved regions may not be accurate or credible. These findings have important implications with regard to animal husbandry practices that may
The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats.

Directory of Open Access Journals (Sweden)

Andrew J Alverson

2011-01-01

Full Text Available The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean, and show that despite its unexceptional size (401,262 nt, the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.
Transferability of simple sequence repeat (SSR) markers developed in guava (Psidium guajava L.) to four Myrtaceae species.

Science.gov (United States)

Rai, Manoj K; Phulwaria, Mahendra; Shekhawat, N S

2013-08-01

Present study demonstrated the cross-genera transferability of 23 simple sequence repeat (SSR) primer pairs developed for guava (Psidium guajava L.) to four new targets, two species of eucalypts (Eucalyptus citriodora, Eucalyptus camaldulensis), bottlebrush (Callistemon lanceolatus) and clove (Syzygium aromaticum), belonging to the family Myrtaceae and subfamily Myrtoideae. Off the 23 SSR loci assayed, 18 (78.2%) gave cross-amplification in E. citriodora, 14 (60.8%) in E. camaldulensis and 17-17 (73.9%) in C. lanceolatus and S. aromaticum. Eight primer pairs were found to be transferable to all four species. The number of alleles detected at each locus ranged from one to nine, with an average of 4.8, 2.6, 4.5 and 4.6 alleles in E. citriodora, E. camaldulensis, C. lanceolatus and S. aromaticum, respectively. The high levels of cross-genera transferability of guava SSRs may be applicable for the analysis of intra- and inter specific genetic diversity of target species, especially in E. citriodora, C. lanceolatus and S. aromaticum, for which till date no information about EST-derived as well as genomic SSR is available.
Plasmid P1 replication: negative control by repeated DNA sequences.

OpenAIRE

Chattoraj, D; Cordes, K; Abeles, A

1984-01-01

The incompatibility locus, incA, of the unit-copy plasmid P1 is contained within a fragment that is essentially a set of nine 19-base-pair repeats. One or more copies of the fragment destabilizes the plasmid when present in trans. Here we show that extra copies of incA interfere with plasmid DNA replication and that a deletion of most of incA increases plasmid copy number. Thus, incA is not essential for replication but is required for its control. When cloned in a high-copy-number vector, pi...
Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells

DEFF Research Database (Denmark)

Rodriguez, Jairo; Vives, Laura; Jordà, Mireia

2008-01-01

Methylation of the cytosine is the most frequent epigenetic modification of DNA in mammalian cells. In humans, most of the methylated cytosines are found in CpG-rich sequences within tandem and interspersed repeats that make up to 45% of the human genome, being Alu repeats the most common family....
The surface glycoprotein of a natural feline leukemia virus subgroup A variant, FeLV-945, as a determinant of disease outcome.

Science.gov (United States)

Bolin, Lisa L; Ahmad, Shamim; Levy, Laura S

2011-10-15

Feline leukemia virus (FeLV) is a natural retrovirus of domestic cats associated with degenerative, proliferative and malignant diseases. Studies of FeLV infection in a cohort of naturally infected cats were undertaken to examine FeLV variation, the selective pressures operative in FeLV infection that lead to predominance of natural variants, and the consequences for infection and disease progression. A unique variant, designated FeLV-945, was identified as the predominant isolate in the cohort and was associated with non-T-cell diseases including multicentric lymphoma. FeLV-945 was assigned to the FeLV-A subgroup based on sequence analysis and receptor utilization, but was shown to differ in sequence from a prototype member of FeLV-A, designated FeLV-A/61E, in the long terminal repeat (LTR) and the surface glycoprotein gene (SU). A unique sequence motif in the FeLV-945 LTR was shown to function as a transcriptional enhancer and to confer a replicative advantage. The FeLV-945 SU protein was observed to differ in sequence as compared to FeLV-A/61E within functional domains known to determine receptor selection and binding. Experimental infection of newborn cats was performed using wild type FeLV-A/61E or recombinant FeLV-A/61E in which the LTR (61E/945L) or LTR and SU (61E/945SL) were exchanged for that of FeLV-945. Infection with either FeLV-A/61E or 61E/945L resulted in T-cell lymphoma of the thymus, although 61E/945L caused disease significantly more rapidly. In contrast, infection with 61E/945SL resulted in the rapid induction of a multicentric lymphoma of B-cell origin, thus recapitulating the outcome of natural infection and implicating FeLV-945 SU as a determinant of disease outcome. Recombinant FeLV-B was detected infrequently and at low levels in multicentric lymphomas, and was thereby not implicated in disease induction. Preliminary studies of receptor interaction indicated that virus particles bearing FeLV-945 SU bind to the FeLV-A receptor more
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-06-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Characterization of sequence diversity in Plasmodium falciparum SERA5 from Indian isolates

Directory of Open Access Journals (Sweden)

Rahul C.N

2015-06-01

Full Text Available Objective: To characterize the sequence diversity of blood-stage Plasmodium falciparum serine repeat antigen-5 (PfSERA5 which is lacking in a malaria-endemic country like India. Methods: In this study, parasitic DNA was obtained from field isolates collected from various geographic regions. Subsequently, PfSERA5 gene sequence was PCR amplified and DNA sequenced. Results: We reported the existence of unique repeat polymorphisms and novel haplotypes for both the octamer repeat (OR and serine repeat (SR regions of the N-terminal fragment of PfSERA5 from Indian isolates. Several isolates from India were identical to low-frequency African haplotypes. Unique finding of our study was an Indian isolate showing deletion in a perfectly conserved 14 mer sequence within octamer repeat. Indian haplotypes reported in this study were found to be distributed into the three earlier classified allelic clusters of FCR3, K1 and Honduras showcasing broad diversity as compared to worldwide haplotypes. Conclusions: This study is the first report on genetic diversity of PfSERA5 antigen from India. Further evaluation of these haplotypes by serotyping would provide useful information for investigating variant-specific immunity and aid in malaria vaccine research.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers.

Science.gov (United States)

Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian ( Durio zibethinus ) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, H E = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10 -3 . Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

Science.gov (United States)

Šatović, Eva; Plohl, Miroslav

2017-10-01

Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
Genetic characterization of autochthonous grapevine cultivars from Eastern Turkey by simple sequence repeats (SSRs

Directory of Open Access Journals (Sweden)

Sadiye Peral Eyduran

2016-01-01

Full Text Available In this research, two well-recognized standard grape cultivars, Cabernet Sauvignon and Merlot, together with eight historical autochthonous grapevine cultivars from Eastern Anatolia in Turkey, were genetically characterized by using 12 pairs of simple sequence repeat (SSR primers in order to evaluate their genetic diversity and relatedness. All of the used SSR primers produced successful amplifications and revealed DNA polymorphisms, which were subsequently utilized to evaluate the genetic relatedness of the grapevine cultivars. Allele richness was implied by the identification of 69 alleles in 8 autochthonous cultivars with a mean value of 5.75 alleles per locus. The average expected heterozygosity and observed heterozygosity were found to be 0.749 and 0.739, respectively. Taking into account the generated alleles, the highest number was recorded in VVC2C3 and VVS2 loci (nine and eight alleles per locus, respectively, whereas the lowest number was recorded in VrZAG83 (three alleles per locus. Two main clusters were produced by using the unweighted pair-group method with arithmetic mean dendrogram constructed on the basis of the SSR data. Only Cabernet Sauvignon and Merlot cultivars were included in the first cluster. The second cluster involved the rest of the autochthonous cultivars. The results obtained during the study illustrated clearly that SSR markers have verified to be an effective tool for fingerprinting grapevine cultivars and carrying out grapevine biodiversity studies. The obtained data are also meaningful references for grapevine domestication.
Shifts in the evolutionary rate and intensity of purifying selection between two Brassica genomes revealed by analyses of orthologous transposons and relics of a whole genome triplication.

Science.gov (United States)

Zhao, Meixia; Du, Jianchang; Lin, Feng; Tong, Chaobo; Yu, Jingyin; Huang, Shunmou; Wang, Xiaowu; Liu, Shengyi; Ma, Jianxin

2013-10-01

Recent sequencing of the Brassica rapa and Brassica oleracea genomes revealed extremely contrasting genomic features such as the abundance and distribution of transposable elements between the two genomes. However, whether and how these structural differentiations may have influenced the evolutionary rates of the two genomes since their split from a common ancestor are unknown. Here, we investigated and compared the rates of nucleotide substitution between two long terminal repeats (LTRs) of individual orthologous LTR-retrotransposons, the rates of synonymous and non-synonymous substitution among triplicated genes retained in both genomes from a shared whole genome triplication event, and the rates of genetic recombination estimated/deduced by the comparison of physical and genetic distances along chromosomes and ratios of solo LTRs to intact elements. Overall, LTR sequences and genic sequences showed more rapid nucleotide substitution in B. rapa than in B. oleracea. Synonymous substitution of triplicated genes retained from a shared whole genome triplication was detected at higher rates in B. rapa than in B. oleracea. Interestingly, non-synonymous substitution was observed at lower rates in the former than in the latter, indicating shifted densities of purifying selection between the two genomes. In addition to evolutionary asymmetry, orthologous genes differentially regulated and/or disrupted by transposable elements between the two genomes were also characterized. Our analyses suggest that local genomic and epigenomic features, such as recombination rates and chromatin dynamics reshaped by independent proliferation of transposable elements and elimination between the two genomes, are perhaps partially the causes and partially the outcomes of the observed inter-specific asymmetric evolution. © 2013 Purdue University The Plant Journal © 2013 John Wiley & Sons Ltd.
On balanced minimal repeated measurements designs

Directory of Open Access Journals (Sweden)

Shakeel Ahmad Mir

2014-10-01

Full Text Available Repeated Measurements designs are concerned with scientific experiments in which each experimental unit is assigned more than once to a treatment either different or identical. This class of designs has the property that the unbiased estimators for elementary contrasts among direct and residual effects are obtainable. Afsarinejad (1983 provided a method of constructing balanced Minimal Repeated Measurements designs p < t , when t is an odd or prime power, one or more than one treatment may occur more than once in some sequences and designs so constructed no longer remain uniform in periods. In this paper an attempt has been made to provide a new method to overcome this drawback. Specifically, two cases have been considered RM[t,n=t(t-t/(p-1,p], λ2=1 for balanced minimal repeated measurements designs and RM[t,n=2t(t-t/(p-1,p], λ2=2 for balanced repeated measurements designs. In addition , a method has been provided for constructing extra-balanced minimal designs for special case RM[t,n=t2/(p-1,p], λ2=1.
Cell type-specific termination of transcription by transposable element sequences

Directory of Open Access Journals (Sweden)

Conley Andrew B

2012-09-01

Full Text Available Abstract Background Transposable elements (TEs encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Results Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3′ UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. Conclusions TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are
Persistence of attenuated HIV-1 rev alleles in an epidemiologically linked cohort of long-term survivors infected with nef-deleted virus

Directory of Open Access Journals (Sweden)

Wesselingh Steven L

2007-07-01

Full Text Available Abstract Background The Sydney blood bank cohort (SBBC of long-term survivors consists of multiple individuals infected with nef-deleted, attenuated strains of human immunodeficiency virus type 1 (HIV-1. Although the cohort members have experienced differing clinical courses and now comprise slow progressors (SP as well as long-term nonprogressors (LTNP, longitudinal analysis of nef/long-terminal repeat (LTR sequences demonstrated convergent nef/LTR sequence evolution in SBBC SP and LTNP. Thus, the in vivo pathogenicity of attenuated HIV-1 strains harboured by SBBC members is dictated by factors other than nef/LTR. Therefore, to determine whether defects in other viral genes contribute to attenuation of these HIV-1 strains, we characterized dominant HIV-1 rev alleles that persisted in 4 SBBC subjects; C18, C64, C98 and D36. Results The ability of Rev derived from D36 and C64 to bind the Rev responsive element (RRE in RNA binding assays was reduced by approximately 90% compared to Rev derived from HIV-1NL4-3, C18 or C98. D36 Rev also had a 50–60% reduction in ability to express Rev-dependent reporter constructs in mammalian cells. In contrast, C64 Rev had only marginally decreased Rev function despite attenuated RRE binding. In D36 and C64, attenuated RRE binding was associated with rare amino acid changes at 3 highly conserved residues; Gln to Pro at position 74 immediately N-terminal to the Rev activation domain, and Val to Leu and Ser to Pro at positions 104 and 106 at the Rev C-terminus, respectively. In D36, reduced Rev function was mapped to an unusual 13 amino acid extension at the Rev C-terminus. Conclusion These findings provide new genetic and mechanistic insights important for Rev function, and suggest that Rev function, not Rev/RRE binding may be rate limiting for HIV-1 replication. In addition, attenuated rev alleles may contribute to viral attenuation and long-term survival of HIV-1 infection in a subset of SBBC members.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

Science.gov (United States)

Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

2010-09-16

Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic
Using inter simple sequence repeat (ISSR) markers to study genetic ...

African Journals Online (AJOL)

enoh

2012-04-10

Apr 10, 2012 ... Genetic relationships among the cultivars was assessed by using six inter simple sequence ... polymorphism breeders of this species in order to find the ..... well as the high level of heterozygosity due to the cross- pollinating ...
Adaptation to diverse nitrogen-limited environments by deletion or extrachromosomal element formation of the GAP1 locus

DEFF Research Database (Denmark)

Gresham, D.; Usaite, Renata; Germann, S.M.

2010-01-01

and deletions at the GAP1 locus. GAP1 encodes the general amino acid permease, which transports amino acids across the plasma membrane. We identified a self-propagating extrachromosomal circular DNA molecule that results from intrachromosomal recombination between long terminal repeats (LTRs) flanking GAP1....... Extrachromosomal DNA circles (GAP1(circle)) contain GAP1, the replication origin ARS1116, and a single hybrid LTR derived from recombination between the two flanking LTRs. Formation of the GAP1(circle) is associated with deletion of chromosomal GAP1 (gap1 Delta) and production of a single hybrid LTR at the GAP1...
A LTR copia retrotransposon and Mutator transposons interrupt Pgip genes in cultivated and wild wheats.

Science.gov (United States)

Di Giovanni, Michela; Cenci, Alberto; Janni, Michela; D'Ovidio, Renato

2008-04-01

Polygalacturonase-inhibiting proteins (PGIPs) are leucine-rich repeat (LRR) proteins involved in plant defence. Wheat pgip genes have been isolated from the B (Tapgip1) and D (Tapgip2) genomes, and now we report the identification of pgip genes from the A genomes of wild and cultivated wheats. By Southern blots and sequence analysis of BAC clones we demonstrated that wheat contains a single copy pgip gene per genome and the one from the A genome, pgip3, is inactivated by the insertion of a long terminal repeat copia retrotranspon within the fourth LRR. We demonstrated also that this retrotransposon insertion is present in Triticum urartu and all the polyploidy wheats assayed, but is absent in T. monococcum (Tmpgip3), suggesting that this insertion took place after the divergence between T. monococcum and T. urartu, but before the formation of the polyploid wheats. We identified also two independent insertion events of new Class II transposable elements, Vacuna, belonging to the Mutator superfamily, that interrupted the Tdipgip1 gene of T. turgidum ssp. dicoccoides. The occurrence of these transposons within the coding region of Tdipgip1 facilitated the mapping of the Pgip locus in the pericentric region of the short arm of chromosome group 7. We speculate that the inactivation of pgip genes are tolerated because of redundancy of PGIP activities in the wheat genome.
The genome sequence of a widespread apex predator, the golden eagle (Aquila chrysaetos.

Directory of Open Access Journals (Sweden)

Jacqueline M Doyle

Full Text Available Biologists routinely use molecular markers to identify conservation units, to quantify genetic connectivity, to estimate population sizes, and to identify targets of selection. Many imperiled eagle populations require such efforts and would benefit from enhanced genomic resources. We sequenced, assembled, and annotated the first eagle genome using DNA from a male golden eagle (Aquila chrysaetos captured in western North America. We constructed genomic libraries that were sequenced using Illumina technology and assembled the high-quality data to a depth of ∼40x coverage. The genome assembly includes 2,552 scaffolds >10 Kb and 415 scaffolds >1.2 Mb. We annotated 16,571 genes that are involved in myriad biological processes, including such disparate traits as beak formation and color vision. We also identified repetitive regions spanning 92 Mb (∼6% of the assembly, including LINES, SINES, LTR-RTs and DNA transposons. The mitochondrial genome encompasses 17,332 bp and is ∼91% identical to the Mountain Hawk-Eagle (Nisaetus nipalensis. Finally, the data reveal that several anonymous microsatellites commonly used for population studies are embedded within protein-coding genes and thus may not have evolved in a neutral fashion. Because the genome sequence includes ∼800,000 novel polymorphisms, markers can now be chosen based on their proximity to functional genes involved in migration, carnivory, and other biological processes.
Characterization of Erwinia amylovora strains from different host plants using repetitive-sequences PCR analysis, and restriction fragment length polymorphism and short-sequence DNA repeats of plasmid pEA29.

Science.gov (United States)

Barionovi, D; Giorgi, S; Stoeger, A R; Ruppitsch, W; Scortichini, M

2006-05-01

The three main aims of the study were the assessment of the genetic relationship between a deviating Erwinia amylovora strain isolated from Amelanchier sp. (Maloideae) grown in Canada and other strains from Maloideae and Rosoideae, the investigation of the variability of the PstI fragment of the pEA29 plasmid using restriction fragment length polymorphism (RFLP) analysis and the determination of the number of short-sequence DNA repeats (SSR) by DNA sequence analysis in representative strains. Ninety-three strains obtained from 12 plant genera and different geographical locations were examined by repetitive-sequences PCR using Enterobacterial Repetitive Intergenic Consensus, BOX and Repetitive Extragenic Palindromic primer sets. Upon the unweighted pair group method with arithmetic mean analysis, a deviating strain from Amelanchier sp. was analysed using amplified ribosomal DNA restriction analysis (ARDRA) analysis and the sequencing of the 16S rDNA gene. This strain showed 99% similarity to other E. amylovora strains in the 16S gene and the same banding pattern with ARDRA. The RFLP analysis of pEA29 plasmid using MspI and Sau3A restriction enzymes showed a higher variability than that previously observed and no clear-cut grouping of the strains was possible. The number of SSR units reiterated two to 12 times. The strains obtained from pear orchards showing for the first time symptoms of fire blight had a low number of SSR units. The strains from Maloideae exhibit a wider genetic variability than previously thought. The RFLP analysis of a fragment of the pEA29 plasmid would not seem a reliable method for typing E. amylovora strains. A low number of SSR units was observed with first epidemics of fire blight. The current detection techniques are mainly based on the genetic similarities observed within the strains from the cultivated tree-fruit crops. For a more reliable detection of the fire blight pathogen also in wild and ornamentals Rosaceous plants the genetic
Local repeat sequence organization of an intergenic spacer in the ...

Indian Academy of Sciences (India)

Unknown

chloroplast genome of Chlamydomonas reinhardtii leads to DNA expansion and sequence ... The discovery of uniparentally inherited streptomycin resistant mutants ... resembles yeast, mitochondrial and phage recombination in that it is typically ...... Sager R and Lane D 1972 Molecular basis of maternal inheritance; Proc.
Chaotic generation of PN sequences : a VLSI implementation

NARCIS (Netherlands)

Dornbusch, A.; Pineda de Gyvez, J.

1999-01-01

Generation of repeatable pseudo-random sequences with chaotic analog electronics is not feasible using standard circuit topologies. Component variation caused by imperfect fabrication causes the same divergence of output sequences as does varying initial conditions. By quantizing the output of a
Aberrant splicing in transgenes containing introns, exons, and V5 epitopes: lessons from developing an FSHD mouse model expressing a D4Z4 repeat with flanking genomic sequences.

Directory of Open Access Journals (Sweden)

Eugénie Ansseau

Full Text Available The DUX4 gene, encoded within D4Z4 repeats on human chromosome 4q35, has recently emerged as a key factor in the pathogenic mechanisms underlying Facioscapulohumeral muscular dystrophy (FSHD. This recognition prompted development of animal models expressing the DUX4 open reading frame (ORF alone or embedded within D4Z4 repeats. In the first published model, we used adeno-associated viral vectors (AAV and strong viral control elements (CMV promoter, SV40 poly A to demonstrate that the DUX4 cDNA caused dose-dependent toxicity in mouse muscles. As a follow-up, we designed a second generation of DUX4-expressing AAV vectors to more faithfully genocopy the FSHD-permissive D4Z4 repeat region located at 4q35. This new vector (called AAV.D4Z4.V5.pLAM contained the D4Z4/DUX4 promoter region, a V5 epitope-tagged DUX4 ORF, and the natural 3' untranslated region (pLAM harboring two small introns, DUX4 exons 2 and 3, and the non-canonical poly A signal required for stabilizing DUX4 mRNA in FSHD. AAV.D4Z4.V5.pLAM failed to recapitulate the robust pathology of our first generation vectors following delivery to mouse muscle. We found that the DUX4.V5 junction sequence created an unexpected splice donor in the pre-mRNA that was preferentially utilized to remove the V5 coding sequence and DUX4 stop codon, yielding non-functional DUX4 protein with 55 additional residues on its carboxyl-terminus. Importantly, we further found that aberrant splicing could occur in any expression construct containing a functional splice acceptor and sequences resembling minimal splice donors. Our findings represent an interesting case study with respect to AAV.D4Z4.V5.pLAM, but more broadly serve as a note of caution for designing constructs containing V5 epitope tags and/or transgenes with downstream introns and exons.
Convergent adaptive evolution in marginal environments: unloading transposable elements as a common strategy among mangrove genomes.

Science.gov (United States)

Lyu, Haomin; He, Ziwen; Wu, Chung-I; Shi, Suhua

2018-01-01

Several clades of mangrove trees independently invade the interface between land and sea at the margin of woody plant distribution. As phenotypic convergence among mangroves is common, the possibility of convergent adaptation in their genomes is quite intriguing. To study this molecular convergence, we sequenced multiple mangrove genomes. In this study, we focused on the evolution of transposable elements (TEs) in relation to the genome size evolution. TEs, generally considered genomic parasites, are the most common components of woody plant genomes. Analyzing the long terminal repeat-retrotransposon (LTR-RT) type of TE, we estimated their death rates by counting solo-LTRs and truncated elements. We found that all lineages of mangroves massively and convergently reduce TE loads in comparison to their nonmangrove relatives; as a consequence, genome size reduction happens independently in all six mangrove lineages; TE load reduction in mangroves can be attributed to the paucity of young elements; the rarity of young LTR-RTs is a consequence of fewer births rather than access death. In conclusion, mangrove genomes employ a convergent strategy of TE load reduction by suppressing element origination in their independent adaptation to a new environment. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris

Directory of Open Access Journals (Sweden)

Scott eJackson

2014-07-01

Full Text Available Common bean (Phaseolus vulgaris is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3’LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. This transposon data provides a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.
A novel rat genomic simple repeat DNA with RNA-homology shows triplex (H-DNA)-like structure and tissue-specific RNA expression

International Nuclear Information System (INIS)

Dey, Indranil; Rath, Pramod C.

2005-01-01

Mammalian genome contains a wide variety of repetitive DNA sequences of relatively unknown function. We report a novel 227 bp simple repeat DNA (3.3 DNA) with a d {(GA) 7 A (AG) 7 } dinucleotide mirror repeat from the rat (Rattus norvegicus) genome. 3.3 DNA showed 75-85% homology with several eukaryotic mRNAs due to (GA/CU) n dinucleotide repeats by nBlast search and a dispersed distribution in the rat genome by Southern blot hybridization with [ 32 P]3.3 DNA. The d {(GA) 7 A (AG) 7 } mirror repeat formed a triplex (H-DNA)-like structure in vitro. Two large RNAs of 9.1 and 7.5 kb were detected by [ 32 P]3.3 DNA in rat brain by Northern blot hybridization indicating expression of such simple sequence repeats at RNA level in vivo. Further, several cDNAs were isolated from a rat cDNA library by [ 32 P]3.3 DNA probe. Three such cDNAs showed tissue-specific RNA expression in rat. pRT 4.1 cDNA showed strong expression of a 2.39 kb RNA in brain and spleen, pRT 5.5 cDNA showed strong expression of a 2.8 kb RNA in brain and a 3.9 kb RNA in lungs, and pRT 11.4 cDNA showed weak expression of a 2.4 kb RNA in lungs. Thus, genomic simple sequence repeats containing d (GA/CT) n dinucleotides are transcriptionally expressed and regulated in rat tissues. Such d (GA/CT) n dinucleotide repeats may form structural elements (e.g., triplex) which may be sites for functional regulation of genomic coding sequences as well as RNAs. This may be a general function of such transcriptionally active simple sequence repeats widely dispersed in mammalian genome

Organelle Simple Sequence Repeat Markers Help to Distinguish Carpelloid Stamen and Normal Cytoplasmic Male Sterile Sources in Broccoli

Science.gov (United States)

Shu, Jinshuai; Liu, Yumei; Li, Zhansheng; Zhang, Lili; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao

2015-01-01

We previously discovered carpelloid stamens when breeding cytoplasmic male sterile lines in broccoli (Brassica oleracea var. italica). In this study, hybrids and multiple backcrosses were produced from different cytoplasmic male sterile carpelloid stamen sources and maintainer lines. Carpelloid stamens caused dysplasia of the flower structure and led to hooked or coiled siliques with poor seed setting, which were inherited in a maternal fashion. Using four distinct carpelloid stamens and twelve distinct normal stamens from cytoplasmic male sterile sources and one maintainer, we used 21 mitochondrial simple sequence repeat (mtSSR) primers and 32 chloroplast SSR primers to identify a mitochondrial marker, mtSSR2, that can differentiate between the cytoplasm of carpelloid and normal stamens. Thereafter, mtSSR2 was used to identify another 34 broccoli accessions, with an accuracy rate of 100%. Analysis of the polymorphic sequences revealed that the mtSSR2 open reading frame of carpelloid stamen sterile sources had a deletion of 51 bases (encoding 18 amino acids) compared with normal stamen materials. The open reading frame is located in the coding region of orf125 and orf108 of the mitochondrial genomes in Brassica crops and had the highest similarity with Raphanus sativus and Brassica carinata. The current study has not only identified a useful molecular marker to detect the cytoplasm of carpelloid stamens during broccoli breeding, but it also provides evidence that the mitochondrial genome is maternally inherited and provides a basis for studying the effect of the cytoplasm on flower organ development in plants. PMID:26407159
Low doses of neutrons induce changes in gene expression

International Nuclear Information System (INIS)

Woloschak, G.E.; Chang-Liu, C.M.; Panozzo, J.; Libertin, C.R.

1993-01-01

Studies were designed to identify genes induced following low-dose neutron but not following γ-ray exposure in fibroblasts. Our past work had shown differences in the expression of β-protein kinase C and c-fos genes, both being induced following γ-ray but not neutron exposure. We have identified two genes that are induced following neutron, but not γ-ray, exposure: Rp-8 (a gene induced by apoptosis) and the long terminal repeat (LTR) of the human immunodeficiency (HIV). Rp-8 mRNA induction was demonstrated in Syrian hamster embryo fibroblasts and was found to be induced in cells exposed to neutrons administered at low (0.5 cGy/min) and at high dose rate (12 cGy/min). The induction of transcription from the LTR of HIV was demonstrated in HeLa cells bearing a transfected construct of the chloramphenicol acetyl transferase (CAT) gene driven by the HIV-LTR promoter. Measures of CAT activity and CAT transcripts following irradiation demonstrated an unresponsiveness to γ rays over a broad range of doses. Twofold induction of the HIV-LTR was detected following neutron exposure (48 cGy) administered at low (0.5 cGy/min) but not high (12 cGy/min) dose rates. Ultraviolet-mediated HIV-LTR induction was inhibited by low-dose-rate neutron exposure
Non-radioactive detection of trinucleotide repeat size variability.

Science.gov (United States)

Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve

2014-03-06

Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through sucessive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.
Effects of gamma rays, ultraviolet radiation, sunlight, microwaves and electromagnetic fields on gene expression mediated by human immunodeficiency virus promoter

International Nuclear Information System (INIS)

Libertin, C.R.; Woloschak, G.E.; Panozzo, J.; Groh, K.R.; Chang-Liu, Chin-Mei; Schreck, S.

1994-01-01

Previous work by our group and others has shown the modulation of human immunodeficiency virus (HIV) promoter or long terminal repeat (LTR) after exposure to neutrons and ultraviolet radiations. Using HeLa cells stably transfected with a construct containing the chloramphenicol acetyl transferase (CAT) gene, the transcription of which is mediated by the HIV-LTR, we designed experiments to examine the effects of exposure to different types of radiation (such as γ rays, ultraviolet and sunlight irradiations, electromagnetic fields and microwaves) in HIV-LTR-driven expression of CAT. These results demonstrated ultraviolet-light-induced transcription from the HIV promoter, as has been shown by others. Exposure to other DNA-damaging agents such as γ rays and sunlight (with limited exposures) had no significant effect on transcription mediated by HIV-LTR, suggesting that induction of HIV is not mediated by just any type of DNA damage but rather may require specific types of DNA damage. Microwaves did not cause cell killing when cells in culture were exposed in high volumes of medium, and the same cells showed no changes in expression. When microwave exposure was carried out in low volumes of medium (so that excessive heat was generated) induction of HIV-LTR transcription (as assayed by CAT activity) was evident. Electromagnetic field exposures had no effect on expression of HIV-LTR. These results demonstrate that not all types of radiation and not all DNA-damaging agents are capable of inducing HIV. We hypothesize that induction of HIV transcription may be mediated by several different signals exposure to radiation. 22 refs., 8 figs
Analysis of genetic relationships and identification of lily cultivars based on inter-simple sequence repeat markers.

Science.gov (United States)

Cui, G F; Wu, L F; Wang, X N; Jia, W J; Duan, Q; Ma, L L; Jiang, Y L; Wang, J H

2014-07-29

Inter-simple sequence repeat (ISSR) markers were used to discriminate 62 lily cultivars of 5 hybrid series. Eight ISSR primers generated 104 bands in total, which all showed 100% polymorphism, and an average of 13 bands were amplified by each primer. Two software packages, POPGENE 1.32 and NTSYSpc 2.1, were used to analyze the data matrix. Our results showed that the observed number of alleles (NA), effective number of alleles (NE), Nei's genetic diversity (H), and Shannon's information index (I) were 1.9630, 1.4179, 0.2606, and 0.4080, respectively. The highest genetic similarity (0.9601) was observed between the Oriental x Trumpet and Oriental lilies, which indicated that the two hybrids had a close genetic relationship. An unweighted pair-group method with arithmetic means dendrogram showed that the 62 lily cultivars clustered into two discrete groups. The first group included the Oriental and OT cultivars, while the Asiatic, LA, and Longiflorum lilies were placed in the second cluster. The distribution of individuals in the principal component analysis was consistent with the clustering of the dendrogram. Fingerprints of all lily cultivars built from 8 primers could be separated completely. This study confirmed the effect and efficiency of ISSR identification in lily cultivars.
Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

Science.gov (United States)

Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

1999-04-16

A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Correlation between fibroin amino acid sequence and physical silk properties.

Science.gov (United States)

Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

2003-09-12

The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs).

Science.gov (United States)

Liu, Fenyun; Kariyawasam, Subhashinie; Jayarao, Bhushan M; Barrangou, Rodolphe; Gerner-Smidt, Peter; Ribot, Efrain M; Knabel, Stephen J; Dudley, Edward G

2011-07-01

Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)-CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)-was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.
Instability of (CTGn•(CAGn trinucleotide repeats and DNA synthesis

Directory of Open Access Journals (Sweden)

Liu Guoqi

2012-02-01

Full Text Available Abstract Expansion of (CTGn•(CAGn trinucleotide repeat (TNR microsatellite sequences is the cause of more than a dozen human neurodegenerative diseases. (CTGn and (CAGn repeats form imperfectly base paired hairpins that tend to expand in vivo in a length-dependent manner. Yeast, mouse and human models confirm that (CTGn•(CAGn instability increases with repeat number, and implicate both DNA replication and DNA damage response mechanisms in (CTGn•(CAGn TNR expansion and contraction. Mutation and knockdown models that abrogate the expression of individual genes might also mask more subtle, cumulative effects of multiple additional pathways on (CTGn•(CAGn instability in whole animals. The identification of second site genetic modifiers may help to explain the variability of (CTGn•(CAGn TNR instability patterns between tissues and individuals, and offer opportunities for prognosis and treatment.
A Sequence-Specific Interaction between the Saccharomyces cerevisiae rRNA Gene Repeats and a Locus Encoding an RNA Polymerase I Subunit Affects Ribosomal DNA Stability

Science.gov (United States)

Cahyani, Inswasti; Cridge, Andrew G.; Engelke, David R.; Ganley, Austen R. D.

2014-01-01

The spatial organization of eukaryotic genomes is linked to their functions. However, how individual features of the global spatial structure contribute to nuclear function remains largely unknown. We previously identified a high-frequency interchromosomal interaction within the Saccharomyces cerevisiae genome that occurs between the intergenic spacer of the ribosomal DNA (rDNA) repeats and the intergenic sequence between the locus encoding the second largest RNA polymerase I subunit and a lysine tRNA gene [i.e., RPA135-tK(CUU)P]. Here, we used quantitative chromosome conformation capture in combination with replacement mapping to identify a 75-bp sequence within the RPA135-tK(CUU)P intergenic region that is involved in the interaction. We demonstrate that the RPA135-IGS1 interaction is dependent on the rDNA copy number and the Msn2 protein. Surprisingly, we found that the interaction does not govern RPA135 transcription. Instead, replacement of a 605-bp region within the RPA135-tK(CUU)P intergenic region results in a reduction in the RPA135-IGS1 interaction level and fluctuations in rDNA copy number. We conclude that the chromosomal interaction that occurs between the RPA135-tK(CUU)P and rDNA IGS1 loci stabilizes rDNA repeat number and contributes to the maintenance of nucleolar stability. Our results provide evidence that the DNA loci involved in chromosomal interactions are composite elements, sections of which function in stabilizing the interaction or mediating a functional outcome. PMID:25421713
Molecular identification and characterization of clustered regularly interspaced short palindromic repeats (CRISPRs) in a urease-positive thermophilic Campylobacter sp. (UPTC).

Science.gov (United States)

Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M

2012-02-01

Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Inter-simple sequence repeat (ISSR) markers in the evaluation of ...

African Journals Online (AJOL)

shawkat

2013-02-13

Feb 13, 2013 ... 666 Afr. J. Biotechnol. Table 1. Number and types of the ISSR bands as well as the total polymorphism percentages generated in six Capsicum hybrids. Primer code. Sequence. Monomorphic band. Polymorphic band. Total band. Polymorphism. (%). Unique. Shared. HB 1. (CAA)5. 4. 0. 1. 5. 20. HB 2. (CAG) ...
Dispersed repetitive sequences in eukaryotic genomes and their possible biological significance

International Nuclear Information System (INIS)

Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.

1983-01-01

In this paper is described the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized an ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B + RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transpositions or do they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table
Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes

NARCIS (Netherlands)

Al-Attar, S.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

2011-01-01

Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences
Amino acid sequence analysis of the annexin super-gene family of proteins.

Science.gov (United States)

Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

1991-06-15

The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of
The polymorphic integumentary mucin B.1 from Xenopus laevis contains the short consensus repeat.

Science.gov (United States)

Probst, J C; Hauser, F; Joba, W; Hoffmann, W

1992-03-25

The frog integumentary mucin B.1 (FIM-B.1), discovered by molecular cloning, contains a cysteine-rich C-terminal domain which is homologous with von Willebrand factor. With the help of the polymerase chain reaction, we now characterize a contiguous region 5' to the von Willebrand factor domain containing the short consensus repeat typical of many proteins from the complement system. Multiple transcripts have been cloned, which originate from a single animal and differ by a variable number of tandem repeats (rep-33 sequences). These different transcripts probably originate solely from two genes and are generated presumably by alternative splicing of an huge array of functional cassettes. This model is supported by analysis of genomic FIM-B.1 sequences from Xenopus laevis. Here, rep-33 sequences are arranged in an interrupted array of individual units. Additionally, results of Southern analysis revealed genetic polymorphism between different animals which is predicted to be within the tandem repeats. A first investigation of the predicted mucins with the help of a specific antibody against a synthetic peptide determined the molecular mass of FIM-B.1 to greater than 200 kDa. Here again, genetic polymorphism between different animals is detected.
Full-length cDNA sequences from Rhesus monkey placenta tissue: analysis and utility for comparative mapping

Directory of Open Access Journals (Sweden)

Lee Sang-Rae

2010-07-01

Full Text Available Abstract Background Rhesus monkeys (Macaca mulatta are widely-used as experimental animals in biomedical research and are closely related to other laboratory macaques, such as cynomolgus monkeys (Macaca fascicularis, and to humans, sharing a last common ancestor from about 25 million years ago. Although rhesus monkeys have been studied extensively under field and laboratory conditions, research has been limited by the lack of genetic resources. The present study generated placenta full-length cDNA libraries, characterized the resulting expressed sequence tags, and described their utility for comparative mapping with human RefSeq mRNA transcripts. Results From rhesus monkey placenta full-length cDNA libraries, 2000 full-length cDNA sequences were determined and 1835 rhesus placenta cDNA sequences longer than 100 bp were collected. These sequences were annotated based on homology to human genes. Homology search against human RefSeq mRNAs revealed that our collection included the sequences of 1462 putative rhesus monkey genes. Moreover, we identified 207 genes containing exon alterations in the coding region and the untranslated region of rhesus monkey transcripts, despite the highly conserved structure of the coding regions. Approximately 10% (187 of all full-length cDNA sequences did not represent any public human RefSeq mRNAs. Intriguingly, two rhesus monkey specific exons derived from the transposable elements of AluYRa2 (SINE family and MER11B (LTR family were also identified. Conclusion The 1835 rhesus monkey placenta full-length cDNA sequences described here could expand genomic resources and information of rhesus monkeys. This increased genomic information will greatly contribute to the development of evolutionary biology and biomedical research.
Estimation of genetic structure of a Mycosphaerella musicola population using inter-simple sequence repeat markers.

Science.gov (United States)

Peixouto, Y S; Dórea Bragança, C A; Andrade, W B; Ferreira, C F; Haddad, F; Oliveira, S A S; Darosci Brito, F S; Miller, R N G; Amorim, E P

2015-07-17

Among the diseases affecting banana (Musa sp), yellow Sigatoka, caused by the fungal pathogen Mycosphaerella musicola Leach, is considered one of the most important in Brazil, causing losses throughout the year. Understanding the genetic structure of pathogen populations will provide insight into the life history of pathogens, including the evolutionary processes occurring in agrosystems. Tools for estimating the possible emergence of pathogen variants with altered pathogenicity, virulence, or aggressiveness, as well as resistance to systemic fungicides, can also be developed from such data. The objective of this study was to analyze the genetic diversity and population genetics of M. musicola in the main banana-producing regions in Brazil. A total of 83 isolates collected from different banana cultivars in the Brazilian states of Bahia, Rio Grande do Norte, and Minas Gerais were evaluated using inter-simple sequence repeat markers. High variability was detected between the isolates, and 85.5% of the haplotypes were singletons in the populations. The highest source of genetic diversity (97.22%) was attributed to variations within populations. Bayesian cluster analysis revealed the presence of 2 probable ancestral groups, however, showed no relationship to population structure in terms of collection site, state of origin, or cultivar. Similarly, we detected noevidence of genetic recombination between individuals within different states, indicating that asexual cycles play a major role in M. musicola reproduction and that long-distance dispersal of the pathogen is the main factor contributing to the lack of population structure in the fungus.
Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae).

Science.gov (United States)

Kolano, Bozena; Bednara, Edyta; Weiss-Schneeweiss, Hanna

2013-10-01

High heterogeneity was observed among conserved domains of reverse transcriptase ( rt ) isolated from quinoa. Only one Ty1- copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51 %) and 39 % of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.
Inferring repeat-protein energetics from evolutionary information.

Directory of Open Access Journals (Sweden)

Rocío Espada

2017-06-01

Full Text Available Natural protein sequences contain a record of their history. A common constraint in a given protein family is the ability to fold to specific structures, and it has been shown possible to infer the main native ensemble by analyzing covariations in extant sequences. Still, many natural proteins that fold into the same structural topology show different stabilization energies, and these are often related to their physiological behavior. We propose a description for the energetic variation given by sequence modifications in repeat proteins, systems for which the overall problem is simplified by their inherent symmetry. We explicitly account for single amino acid and pair-wise interactions and treat higher order correlations with a single term. We show that the resulting evolutionary field can be interpreted with structural detail. We trace the variations in the energetic scores of natural proteins and relate them to their experimental characterization. The resulting energetic evolutionary field allows the prediction of the folding free energy change for several mutants, and can be used to generate synthetic sequences that are statistically indistinguishable from the natural counterparts.

Stress-induced rearrangement of Fusarium retrotransposon sequences.

Science.gov (United States)

Anaya, N; Roncero, M I

1996-11-27

Rearrangement of fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.
Transposable elements and G-quadruplexes

Czech Academy of Sciences Publication Activity Database

Kejnovský, Eduard; Tokan, Viktor; Lexa, M.

2015-01-01

Roč. 23, č. 3 (2015), s. 615-623 ISSN 0967-3849 R&D Projects: GA ČR(CZ) GA15-02891S Institutional support: RVO:68081707 Keywords : TRINUCLEOTIDE REPEAT DNA * LTR RETROTRANSPOSONS * BINDING PROTEIN Subject RIV: BO - Biophysics Impact factor: 2.590, year: 2015
Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

Energy Technology Data Exchange (ETDEWEB)

Jackson, P.J.; Walthers, E.A.; Richmond, K.L. [Los Alamos National Lab., NM (United States)] [and others

1997-04-01

PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats are generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.
Structural basis for sequence-specific recognition of DNA by TAL effectors

KAUST Repository

Deng, Dong; Yan, Chuangye; Pan, Xiaojing; Mahfouz, Magdy M.; Wang, Jiawei; Zhu, Jiankang; Shi, Yi Gong; Yan, Nieng

2012-01-01

TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat comprises 33 to 35 conserved amino acids and targets a specific base pair
Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis.

Science.gov (United States)

Zhu, Huayu; Song, Pengyao; Koo, Dal-Hoe; Guo, Luqin; Li, Yanman; Sun, Shouru; Weng, Yiqun; Yang, Luming

2016-08-05

Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been difficult and costly. The whole genome sequencing with next-generation sequencing (NGS) technologies provides large amounts of sequence data to develop numerous microsatellite markers at whole genome scale. SSR markers have great advantage in cross-species comparisons and allow investigation of karyotype and genome evolution through highly efficient computation approaches such as in silico PCR. Here we described genome wide development and characterization of SSR markers in the watermelon (Citrullus lanatus) genome, which were then use in comparative analysis with two other important crop species in the Cucurbitaceae family: cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). We further applied these markers in evaluating the genetic diversity and population structure in watermelon germplasm collections. A total of 39,523 microsatellite loci were identified from the watermelon draft genome with an overall density of 111 SSRs/Mbp, and 32,869 SSR primers were designed with suitable flanking sequences. The dinucleotide SSRs were the most common type representing 34.09 % of the total SSR loci and the AT-rich motifs were the most abundant in all nucleotide repeat types. In silico PCR analysis identified 832 and 925 SSR markers with each having a single amplicon in the cucumber and melon draft genome, respectively. Comparative analysis with these cross-species SSR markers revealed complicated mosaic patterns of syntenic blocks among the genomes of three species. In addition, genetic diversity analysis of 134 watermelon accessions with 32 highly informative SSR loci placed these lines into two groups with all accessions of C.lanatus var. citorides and three accessions of C. colocynthis clustered in one group and all accessions of C. lanatus var. lanatus and the remaining accessions of C. colocynthis
Organization and Evolution of Subtelomeric Satellite Repeats in the Potato Genome

Czech Academy of Sciences Publication Activity Database

Torres, A.T.; Gong, Z.; Iovene, M.; Hirsch, C.D.; Buell, C.R.; Bryan, G.J.; Novák, Petr; Macas, Jiří; Jiang, J.

2011-01-01

Roč. 1, July 2011 (2011), s. 85-92 ISSN 2160-1836 R&D Projects: GA MŠk(CZ) LH11058 Institutional research plan: CEZ:AV0Z50510513 Keywords : Satellite sequences * Potato genome * Repeats Subject RIV: EB - Genetics ; Molecular Biology
Gene mining a marama bean expressed sequence tags (ESTs ...

African Journals Online (AJOL)

The authors reported the identification of genes associated with embryonic development and microsatellite sequences. The future direction will entail characterization of these genes using gene over-expression and mutant assays. Key words: Namibia, simple sequence repeats (SSR), data mining, homology searches, ...
Dimerization of BTas is required for the transactivational activity of bovine foamy virus

International Nuclear Information System (INIS)

Tan Juan; Qiao Wentao; Xu Fengwen; Han Hongqi; Chen Qimin; Geng Yunqi

2008-01-01

The BTas protein of bovine foamy virus (BFV) is a 249-amino-acid nuclear regulatory protein which transactivates viral gene expression directed by the long terminal repeat promoter (LTR) and the internal promoter (IP). Here, we demonstrate the BTas protein forms a dimeric complex in mammalian cells by using mammalian two hybrid systems and cross-linking assay. Functional analyses with deletion mutants reveal that the region of 46-62aa is essential for dimer formation. Furthermore, our results show that deleting the dimerization region of BTas did not affect the localization of BTas, but that it did result in the loss of its transactivational activity on the LTR and IP. Furthermore, BTas (Δ46-62aa) retained binding ability to the LTR and IP similar to that of the wild-type BTas. These data suggest the dimerization region is necessary for the transactivational function of BTas and is crucial to the replication of BFV
Cocaine promotes both initiation and elongation phase of HIV-1 transcription by activating NF-κB and MSK1 and inducing selective epigenetic modifications at HIV-1 LTR

International Nuclear Information System (INIS)

Sahu, Geetaram; Farley, Kalamo; El-Hage, Nazira; Aiamkitsumrit, Benjamas; Fassnacht, Ryan; Kashanchi, Fatah; Ochem, Alex; Simon, Gary L.; Karn, Jonathan; Hauser, Kurt F.; Tyagi, Mudit

2015-01-01

Cocaine accelerates human immunodeficiency virus (HIV-1) replication by altering specific cell-signaling and epigenetic pathways. We have elucidated the underlying molecular mechanisms through which cocaine exerts its effect in myeloid cells, a major target of HIV-1 in central nervous system (CNS). We demonstrate that cocaine treatment promotes HIV-1 gene expression by activating both nuclear factor-kappa B (NF-ĸB) and mitogen- and stress-activated kinase 1 (MSK1). MSK1 subsequently catalyzes the phosphorylation of histone H3 at serine 10, and p65 subunit of NF-ĸB at 276th serine residue. These modifications enhance the interaction of NF-ĸB with P300 and promote the recruitment of the positive transcription elongation factor b (P-TEFb) to the HIV-1 LTR, supporting the development of an open/relaxed chromatin configuration, and facilitating the initiation and elongation phases of HIV-1 transcription. Results are also confirmed in primary monocyte derived macrophages (MDM). Overall, our study provides detailed insights into cocaine-driven HIV-1 transcription and replication. - Highlights: • Cocaine induces the initiation phase of HIV transcription by activating NF-ĸB. • Cocaine induced NF-ĸB phosphorylation promotes its interaction with P300. • Cocaine enhances the elongation phase of HIV transcription by stimulating MSK1. • Cocaine activated MSK1 catalyzes the phosphorylation of histone H3 at its Ser10. • Cocaine induced H3S10 phosphorylation facilitates the recruitment of P-TEFb at LTR
Cocaine promotes both initiation and elongation phase of HIV-1 transcription by activating NF-κB and MSK1 and inducing selective epigenetic modifications at HIV-1 LTR

Energy Technology Data Exchange (ETDEWEB)

Sahu, Geetaram; Farley, Kalamo [Division of Infectious Diseases, Department of Medicine, George Washington University, Washington, DC (United States); El-Hage, Nazira [Virginia Commonwealth University, Richmond, VA (United States); Aiamkitsumrit, Benjamas; Fassnacht, Ryan [Division of Infectious Diseases, Department of Medicine, George Washington University, Washington, DC (United States); Kashanchi, Fatah [George Mason University, Manassas, VA (United States); Ochem, Alex [ICGEB, Wernher and Beit Building, Anzio Road, Observatory, 7925 Cape Town (South Africa); Simon, Gary L. [Division of Infectious Diseases, Department of Medicine, George Washington University, Washington, DC (United States); Karn, Jonathan [Case Western Reserve University, Cleveland, OH (United States); Hauser, Kurt F. [Virginia Commonwealth University, Richmond, VA (United States); Tyagi, Mudit, E-mail: tmudit@email.gwu.edu [Division of Infectious Diseases, Department of Medicine, George Washington University, Washington, DC (United States); Department of Microbiology, Immunology and Tropical Medicine, George Washington University, Washington, DC 20037 (United States)

2015-09-15

Cocaine accelerates human immunodeficiency virus (HIV-1) replication by altering specific cell-signaling and epigenetic pathways. We have elucidated the underlying molecular mechanisms through which cocaine exerts its effect in myeloid cells, a major target of HIV-1 in central nervous system (CNS). We demonstrate that cocaine treatment promotes HIV-1 gene expression by activating both nuclear factor-kappa B (NF-ĸB) and mitogen- and stress-activated kinase 1 (MSK1). MSK1 subsequently catalyzes the phosphorylation of histone H3 at serine 10, and p65 subunit of NF-ĸB at 276th serine residue. These modifications enhance the interaction of NF-ĸB with P300 and promote the recruitment of the positive transcription elongation factor b (P-TEFb) to the HIV-1 LTR, supporting the development of an open/relaxed chromatin configuration, and facilitating the initiation and elongation phases of HIV-1 transcription. Results are also confirmed in primary monocyte derived macrophages (MDM). Overall, our study provides detailed insights into cocaine-driven HIV-1 transcription and replication. - Highlights: • Cocaine induces the initiation phase of HIV transcription by activating NF-ĸB. • Cocaine induced NF-ĸB phosphorylation promotes its interaction with P300. • Cocaine enhances the elongation phase of HIV transcription by stimulating MSK1. • Cocaine activated MSK1 catalyzes the phosphorylation of histone H3 at its Ser10. • Cocaine induced H3S10 phosphorylation facilitates the recruitment of P-TEFb at LTR.
Biased distribution of DNA uptake sequences towards genome maintenance genes

DEFF Research Database (Denmark)

Davidsen, T.; Rodland, E.A.; Lagesen, K.

2004-01-01

Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...... in these organisms. Pasteurella multocida also displayed high frequencies of a putative DUS identical to that previously identified in H. influenzae and with a skewed distribution towards genome maintenance genes, indicating that this bacterium might be transformation competent under certain conditions....
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.

Science.gov (United States)

Davis, C A; Wyatt, G R

1989-01-01

The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. Images PMID:2762148
Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase

Directory of Open Access Journals (Sweden)

Komivi Dossa

2017-08-01

Full Text Available The sequencing of the full nuclear genome of sesame (Sesamum indicum L. provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78% were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/, which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.
Complete plastid genome sequencing of Trochodendraceae reveals a significant expansion of the inverted repeat and suggests a Paleogene divergence between the two extant species.

Directory of Open Access Journals (Sweden)

Yan-xia Sun

Full Text Available The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots, but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8 that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern.
Selective histonedeacetylase inhibitor M344 intervenes in HIV-1 latency through increasing histone acetylation and activation of NF-kappaB.

Directory of Open Access Journals (Sweden)

Hao Ying

Full Text Available Histone deacetylase (HDAC inhibitors present an exciting new approach to activate HIV production from latently infected cells to potentially enhance elimination of these cells and achieve a cure. M344, a novel HDAC inhibitor, shows robust activity in a variety of cancer cells and relatively low toxicity compared to trichostatin A (TSA. However, little is known about the effects and action mechanism of M344 in inducing HIV expression in latently infected cells.Using the Jurkat T cell model of HIV latency, we demonstrate that M344 effectively reactivates HIV-1 gene expression in latently infected cells. Moreover, M344-mediated activation of the latent HIV LTR can be strongly inhibited by a NF-κB inhibitor aspirin. We further show that M344 acts by increasing the acetylation of histone H3 and histone H4 at the nucleosome 1 (nuc-1 site of the HIV-1 long terminal repeat (LTR and by inducing NF-κB p65 nuclear translocation and direct RelA DNA binding at the nuc-1 region of the HIV-1 LTR. We also found that M344 synergized with prostratin to activate the HIV-1 LTR promoter in latently infected cells.These results suggest the potential of M344 in anti-latency therapies and an important role for histone modifications and NF-κB transcription factors in regulating HIV-1 LTR gene expression.
Comparative molecular cytogenetics of major repetitive sequence families of three Dendrobium species (Orchidaceae) from Bangladesh

Science.gov (United States)

Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas

2009-01-01

Background and Aims Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. Large intercalary telomere-like arrays suggest chromosomal
Mechanical processes with repeated attenuated impacts

CERN Document Server

Nagaev, R F

1999-01-01

This book is devoted to considering in the general case - using typical concrete examples - the motion of machines and mechanisms of impact and vibro-impact action accompanied by a peculiar phenomenon called "impact collapse". This phenomenon is that after the initial collision, a sequence of repeated gradually quickening collisions of decreasing-to-zero intensity occurs, with the final establishment of protracted contact between the interacting bodies. The initiation conditions of the impact collapse are determined and calculation techniques for the quantitative characteristics of the corresp
Genetic Diversity of Arabica Coffee (Coffea arabica L. in Nicaragua as Estimated by Simple Sequence Repeat Markers

Directory of Open Access Journals (Sweden)

Mulatu Geleta

2012-01-01

Full Text Available Coffea arabica L. (arabica coffee, the only tetraploid species in the genus Coffea, represents the majority of the world’s coffee production and has a significant contribution to Nicaragua’s economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei’s gene diversity (HT and the within-population gene diversity (HS were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (FST=0.13; P<0.001. The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety.
A detailed linkage map of lettuce based on SSAP, AFLP and NBS markers

NARCIS (Netherlands)

Syed, H.; Sorensen, A.P.; Antonise, R.; van de Wiel, C.; van der Linden, C.G.; van 't Westende, W.; Hooftman, D.A.P.; den Nijs, J.C.M.; Flavell, A.J.

2006-01-01

Abstract Molecular markers based upon a novel lettuce LTR retrotransposon and the nucleotide binding site-leucine-rich repeat (NBS-LRR) family of disease resistance-associated genes have been combined with AFLP markers to generate a 458 locus genetic linkage map for lettuce. A total of 187
Long-read sequencing and de novo assembly of a Chinese genome

Science.gov (United States)

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...

Genes Altered by Intracisternal A Particles in Mouse Mammary Tumorigenesis

National Research Council Canada - National Science Library

Crowley, Michael

1997-01-01

...) in BALB/c mice which express high levels of intracistemal A-particles (IAP). Differential hybpridization and differential display strategies are being used to isolate transcripts which contained IAP LTR sequences...
Genetic variability in Brazilian populations of Biomphalaria straminea complex detected by simple sequence repeat anchored polymerase chain reaction amplification

Directory of Open Access Journals (Sweden)

Caldeira Roberta L

2001-01-01

Full Text Available Biomphalaria glabrata, B. tenagophila and B. straminea are intermediate hosts of Schistosoma mansoni, in Brazil. The latter is of epidemiological importance in the northwest of Brazil and, due to morphological similarities, has been grouped with B. intermedia and B. kuhniana in a complex named B. straminea. In the current work, we have standardized the simple sequence repeat anchored polymerase chain reaction (SSR-PCR technique, using the primers (CA8RY and K7, to study the genetic variability of these species. The similarity level was calculated using the Dice coefficient and genetic distance using the Nei and Li coefficient. The trees were obtained by the UPGMA and neighbor-joining methods. We have observed that the most related individuals belong to the same species and locality and that individuals from different localities, but of the same species, present clear heterogeneity. The trees generated using both methods showed similar topologies. The SSR-PCR technique was shown to be very efficient in intrapopulational and intraspecific studies of the B. straminea complex snails.
Sequencing of BAC pools by different next generation sequencing platforms and strategies

Directory of Open Access Journals (Sweden)

Scholz Uwe

2011-10-01

Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.
Electricity sequence control

International Nuclear Information System (INIS)

Shin, Heung Ryeol

2010-03-01

The contents of the book are introduction of control system, like classification and control signal, introduction of electricity power switch, such as push-button and detection switch sensor for induction type and capacitance type machinery for control, solenoid valve, expression of sequence and type of electricity circuit about using diagram, time chart, marking and term, logic circuit like Yes, No, and, or and equivalence logic, basic electricity circuit, electricity sequence control, added condition, special program control about choice and jump of program, motor control, extra circuit on repeat circuit, pause circuit in a conveyer, safety regulations and rule about classification of electricity disaster and protective device for insulation.
Heterogeneity of the Epstein-Barr Virus (EBV) Major Internal Repeat Reveals Evolutionary Mechanisms of EBV and a Functional Defect in the Prototype EBV Strain B95-8.

Science.gov (United States)

Ba Abdullah, Mohammed M; Palermo, Richard D; Palser, Anne L; Grayson, Nicholas E; Kellam, Paul; Correia, Samantha; Szymula, Agnieszka; White, Robert E

2017-12-01

Epstein-Barr virus (EBV) is a ubiquitous pathogen of humans that can cause several types of lymphoma and carcinoma. Like other herpesviruses, EBV has diversified through both coevolution with its host and genetic exchange between virus strains. Sequence analysis of the EBV genome is unusually challenging because of the large number and lengths of repeat regions within the virus. Here we describe the sequence assembly and analysis of the large internal repeat 1 of EBV (IR1; also known as the BamW repeats) for more than 70 strains. The diversity of the latency protein EBV nuclear antigen leader protein (EBNA-LP) resides predominantly within the exons downstream of IR1. The integrity of the putative BWRF1 open reading frame (ORF) is retained in over 80% of strains, and deletions truncating IR1 always spare BWRF1. Conserved regions include the IR1 latency promoter (Wp) and one zone upstream of and two within BWRF1. IR1 is heterogeneous in 70% of strains, and this heterogeneity arises from sequence exchange between strains as well as from spontaneous mutation, with interstrain recombination being more common in tumor-derived viruses. This genetic exchange often incorporates regions of Epstein-Barr virus (EBV) infects the majority of the world population but causes illness in only a small minority of people. Nevertheless, over 1% of cancers worldwide are attributable to EBV. Recent sequencing projects investigating virus diversity to see if different strains have different disease impacts have excluded regions of repeating sequence, as they are more technically challenging. Here we analyze the sequence of the largest repeat in EBV (IR1). We first characterized the variations in protein sequences encoded across IR1. In studying variations within the repeat of each strain, we identified a mutation in the main laboratory strain of EBV that impairs virus function, and we suggest that tumor-associated viruses may be more likely to contain DNA mixed from two strains. The
Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs

Science.gov (United States)

Schnall-Levin, Michael; Rissland, Olivia S.; Johnston, Wendy K.; Perrimon, Norbert; Bartel, David P.; Berger, Bonnie

2011-01-01

MicroRNAs (miRNAs) regulate numerous biological processes by base-pairing with target messenger RNAs (mRNAs), primarily through sites in 3′ untranslated regions (UTRs), to direct the repression of these targets. Although miRNAs have sometimes been observed to target genes through sites in open reading frames (ORFs), large-scale studies have shown such targeting to be generally less effective than 3′ UTR targeting. Here, we show that several miRNAs each target significant groups of genes through multiple sites within their coding regions. This ORF targeting, which mediates both predictable and effective repression, arises from highly repeated sequences containing miRNA target sites. We show that such sequence repeats largely arise through evolutionary duplications and occur particularly frequently within families of paralogous C2H2 zinc-finger genes, suggesting the potential for their coordinated regulation. Examples of ORFs targeted by miR-181 include both the well-known tumor suppressor RB1 and RBAK, encoding a C2H2 zinc-finger protein and transcriptional binding partner of RB1. Our results indicate a function for repeat-rich coding sequences in mediating post-transcriptional regulation and reveal circumstances in which miRNA-mediated repression through ORF sites can be reliably predicted. PMID:21685129
In Silico Genome Comparison and Distribution Analysis of Simple Sequences Repeats in Cassava

Directory of Open Access Journals (Sweden)

Andrea Vásquez

2014-01-01

Full Text Available We conducted a SSRs density analysis in different cassava genomic regions. The information obtained was useful to establish comparisons between cassava’s SSRs genomic distribution and those of poplar, flax, and Jatropha. In general, cassava has a low SSR density (~50 SSRs/Mbp and has a high proportion of pentanucleotides, (24,2 SSRs/Mbp. It was found that coding sequences have 15,5 SSRs/Mbp, introns have 82,3 SSRs/Mbp, 5′ UTRs have 196,1 SSRs/Mbp, and 3′ UTRs have 50,5 SSRs/Mbp. Through motif analysis of cassava’s genome SSRs, the most abundant motif was AT/AT while in intron sequences and UTRs regions it was AG/CT. In addition, in coding sequences the motif AAG/CTT was also found to occur most frequently; in fact, it is the third most used codon in cassava. Sequences containing SSRs were classified according to their functional annotation of Gene Ontology categories. The identified SSRs here may be a valuable addition for genetic mapping and future studies in phylogenetic analyses and genomic evolution.
RetroTector online, a rational tool for analysis of retroviral elements in small and medium size vertebrate genomic sequences

Directory of Open Access Journals (Sweden)

Benachenhou Farid

2009-06-01

Full Text Available Abstract Background The rapid accumulation of genomic information in databases necessitates rapid and specific algorithms for extracting biologically meaningful information. More or less complete retroviral sequences, also called proviral or endogenous retroviral sequences; ERVs, constitutes at least 5% of vertebrate genomes. After infecting the host, these retroviruses have integrated in germ line cells, and have then been carried in genomes for at least several 100 million years. A better understanding of structure and function of these sequences can have profound biological and medical consequences. Methods RetroTector© (ReTe is a platform-independent Java program for identification and characterization of proviral sequences in vertebrate genomes. The full ReTe requires a local installation with a MySQL database. Although not overly complicated, the installation may take some time. A "light" version of ReTe, (RetroTector online; ROL which does not require specific installation procedures is provided, via the World Wide Web. Results ROL http://www.fysiologi.neuro.uu.se/jbgs/ was implemented under the Batchelor web interface (A Lövgren et al. It allows both GenBank accession number, file and FASTA cut-and-paste admission of sequences (5 to 10 000 kilobases. Up to ten submissions can be done simultaneously, allowing batch analysis of Discussion Proviral sequences can be hard to recognize, especially if the integration occurred many million years ago. Precise delineation of LTR, gag, pro, pol and env can be difficult, requiring manual work. ROL is a way of simplifying these tasks. Conclusion ROL provides 1. annotation and presentation of known retroviral sequences, 2. detection of proviral chains in unknown genomic sequences, with up to 100 Mbase per submission.
The decorin sequence SYIRIADTNIT binds collagen type I

DEFF Research Database (Denmark)

Kalamajski, Sebastian; Aspberg, Anders; Oldberg, Ake

2007-01-01

Decorin belongs to the small leucine-rich repeat proteoglycan family, interacts with fibrillar collagens, and regulates the assembly, structure, and biomechanical properties of connective tissues. The decorin-collagen type I-binding region is located in leucine-rich repeats 5-6. Site......-directed mutagenesis of this 54-residue-long collagen-binding sequence identifies Arg-207 and Asp-210 in leucine-rich repeat 6 as crucial for the binding to collagen. The synthetic peptide SYIRIADTNIT, which includes Arg-207 and Asp-210, inhibits the binding of full-length recombinant decorin to collagen in vitro....... These collagen-binding amino acids are exposed on the exterior of the beta-sheet-loop structure of the leucine-rich repeat. This resembles the location of interacting residues in other leucine-rich repeat proteins....
Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins

Science.gov (United States)

Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.

2016-01-01

Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422
New polymorphisms within the variable number tandem repeat (VNTR) 7 locus of Mycobacterium avium subsp. paratuberculosis.

Science.gov (United States)

Fawzy, Ahmad; Zschöck, Michael; Ewers, Christa; Eisenberg, Tobias

2016-06-01

Variable number tandem repeat (VNTR) is a frequently employed typing method of Mycobacterium avium paratuberculosis (MAP) isolates. Based on whole genome sequencing in a previous study, allelic diversity at some VNTR loci seems to over- or under-estimate the actual phylogenetic variance among isolates. Interestingly, two closely related isolates on one farm showed polymorphism at the VNTR 7 locus, raising concerns about the misleading role that it might play in genotyping. We aimed to investigate the underlying basis of VNTR 7-polymorphism by analyzing sequence data for published genomes and field isolates of MAP and other M. avium complex (MAC) members. In contrast to MAP strains from cattle, strains from sheep displayed an "imperfect" repeat within VNTR 7, which was identical to respective allele types in other MAC genomes. Subspecies- and strain-specific single nucleotide polymorphisms (SNPs) and two novel (16 and 56 bp) repeats were detected. Given the combination of the three existing repeats, there are at least five different patterns for VNTR 7. The present findings highlight a higher polymorphism and probable instability of VNTR 7 locus that needs to be considered and challenged in future studies. Until then, sequencing of this locus in future studies is important to correctly assign the underlying allele types.(1). Copyright © 2016 Elsevier Ltd. All rights reserved.
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Science.gov (United States)

Gill, Navdeep; Buti, Matteo; Kane, Nolan; Bellec, Arnaud; Helmstetter, Nicolas; Berges, Hélène; Rieseberg, Loren H.

2014-01-01

Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence. PMID:24833511
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L. Genome

Directory of Open Access Journals (Sweden)

Navdeep Gill

2014-04-01

Full Text Available Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
Non-LTR R2 element evolutionary patterns: phylogenetic incongruences, rapid radiation and the maintenance of multiple lineages.

Directory of Open Access Journals (Sweden)

Andrea Luchetti

Full Text Available Retrotransposons of the R2 superclade specifically insert within the 28S ribosomal gene. They have been isolated from a variety of metazoan genomes and were found vertically inherited even if their phylogeny does not always agree with that of the host species. This was explained with the diversification/extinction of paralogous lineages, being proved the absence of horizontal transfer. We here analyze the widest available collection of R2 sequences, either newly isolated from recently sequenced genomes or drawn from public databases, in a phylogenetic framework. Results are congruent with previous analyses, but new important issues emerge. First, the N-terminal end of the R2-B clade protein, so far unknown, presents a new zinc fingers configuration. Second, the phylogenetic pattern is consistent with an ancient, rapid radiation of R2 lineages: being the estimated time of R2 origin (850-600 Million years ago placed just before the metazoan Cambrian explosion, the wide element diversity and the incongruence with the host phylogeny could be attributable to the sudden expansion of available niches represented by host's 28S ribosomal genes. Finally, we detect instances of coexisting multiple R2 lineages showing a non-random phylogenetic pattern, strongly similar to that of the "library" model known for tandem repeats: a collection of R2s were present in the ancestral genome and then differentially activated/repressed in the derived species. Models for activation/repression as well as mechanisms for sequence maintenance are also discussed within this framework.
Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

Science.gov (United States)

Spielmann, A; Stutz, E

1983-10-25

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

Science.gov (United States)

Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

2006-12-01

Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Science.gov (United States)

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Identifying uniformly mutated segments within repeats.

Science.gov (United States)

Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

2004-12-01

Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of a n-coin model for given S, subject a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l4logl), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).
Giardia telomeric sequence d(TAGGG)4 forms two intramolecular G-quadruplexes in K+ solution: effect of loop length and sequence on the folding topology.

Science.gov (United States)

Hu, Lanying; Lim, Kah Wai; Bouaziz, Serge; Phan, Anh Tuân

2009-11-25

Recently, it has been shown that in K(+) solution the human telomeric sequence d[TAGGG(TTAGGG)(3)] forms a (3 + 1) intramolecular G-quadruplex, while the Bombyx mori telomeric sequence d[TAGG(TTAGG)(3)], which differs from the human counterpart only by one G deletion in each repeat, forms a chair-type intramolecular G-quadruplex, indicating an effect of G-tract length on the folding topology of G-quadruplexes. To explore the effect of loop length and sequence on the folding topology of G-quadruplexes, here we examine the structure of the four-repeat Giardia telomeric sequence d[TAGGG(TAGGG)(3)], which differs from the human counterpart only by one T deletion within the non-G linker in each repeat. We show by NMR that this sequence forms two different intramolecular G-quadruplexes in K(+) solution. The first one is a novel basket-type antiparallel-stranded G-quadruplex containing two G-tetrads, a G x (A-G) triad, and two A x T base pairs; the three loops are consecutively edgewise-diagonal-edgewise. The second one is a propeller-type parallel-stranded G-quadruplex involving three G-tetrads; the three loops are all double-chain-reversal. Recurrence of several structural elements in the observed structures suggests a "cut and paste" principle for the design and prediction of G-quadruplex topologies, for which different elements could be extracted from one G-quadruplex and inserted into another.
Maedi in slaughtered sheep: a pathology and polymerase chain reaction study in southwestern Iran.

Science.gov (United States)

Azizi, Shahrzad; Tajbakhsh, Elahe; Fathi, Farzad; Oryan, Ahmad; Momtaz, Hassan; Goodarzi, Mehdi

2012-01-01

Maedi-visna (MV) is an important slow viral disease of sheep leading to a progressive lymphoproliferative disease. It affects multiple organs primarily the lungs, where it causes interstitial pneumonia (maedi). In this study, the lungs of 1,000 sheep carcasses were grossly inspected and those suspected to have maedi were studied at histopathological and molecular levels. A polymerase chain reaction (PCR) technique that amplified a 291-base pair DNA in the long terminal repeat (LTR) sequence of MV provirus was conducted on all the 50 suspected lungs together with 10 normal appearing lungs as controls. Amplicons of the expected size were detected in 11 (n=11/50) suspected sheep, and one of the 10 control sheep. Histopathologic study of the pulmonary lesions of all 11 (n=11/11) positive sheep showed MV lesions, including hyperplasia of the perivascular and peribronchiolar lymphoid cells, interstitial lymphoplasmacytic infiltration and smooth muscle hyperplasia and the histopathologic findings were correlated with PCR results. In contrast, the tissue sections of control animals were almost normal at histopathological level; however, PCR technique demonstrated that one of them was affected by maedi. This study showed that the LTR-PCR had high specificity and sensitivity in diagnosis of this viral infection. This study is the first to evaluate the prevalence of MV virus infection in sheep in Iran.

Repeat-containing protein effectors of plant-associated organisms

Directory of Open Access Journals (Sweden)

Carl H. Mesarich

2015-10-01

Full Text Available Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.
LQG/LTR optimal attitude control of small flexible spacecraft using free-free boundary conditions

Science.gov (United States)

Fulton, Joseph M.

Due to the volume and power limitations of a small satellite, careful consideration must be taken while designing an attitude control system for 3-axis stabilization. Placing redundancy in the system proves difficult and utilizing power hungry, high accuracy, active actuators is not a viable option. Thus, it is customary to find dependable, passive actuators used in conjunction with small scale active control components. This document describes the application of Elastic Memory Composite materials in the construction of a flexible spacecraft appendage, such as a gravity gradient boom. Assumed modes methods are used with Finite Element Modeling information to obtain the equations of motion for the system while assuming free-free boundary conditions. A discussion is provided to illustrate how cantilever mode shapes are not always the best assumption when modeling small flexible spacecraft. A key point of interest is first resonant modes may be needed in the system design plant in spite of these modes being greater than one order of magnitude in frequency when compared to the crossover frequency of the controller. LQG/LTR optimal control techniques are implemented to compute attitude control gains while controller robustness considerations determine appropriate reduced order controllers and which flexible modes to include in the design model. Key satellite designer concerns in the areas of computer processor sizing, material uncertainty impacts on the system model, and system performance variations resulting from appendage length modifications are addressed.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

Science.gov (United States)

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
Analysis of genetic diversity and population structure of oil palm (Elaeis guineensis) from China and Malaysia based on species-specific simple sequence repeat markers.

Science.gov (United States)

Zhou, L X; Xiao, Y; Xia, W; Yang, Y D

2015-12-08

Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.
Determination of allele frequencies in nine short tandem repeat loci ...

African Journals Online (AJOL)

SERVER

2008-04-17

Apr 17, 2008 ... out the human genome. These loci are a rich source of highly polymorphic markers that may be detected using the polymerase chain reaction (PCR). PCR is a mimic of the normal cellular process of replication of DNA molecules. Each STR is distinguished by the number of times a sequence is repeated, ...
Complete chloroplast genome of Trachelium caeruleum: extensiverearrangements are associated with repeats and tRNAs

Energy Technology Data Exchange (ETDEWEB)

Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

2006-01-09

Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 and ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies
Application of synthetic DNA probes to the analysis of DNA sequence variants in man

International Nuclear Information System (INIS)

Wallace, R.B.; Petz, L.D.; Yam, P.Y.

1986-01-01

Oligonucleotide probes provide a tool to discriminate between any two alleles on the basis of hybridization. Random sampling of the genome with different oligonucleotide probes should reveal polymorphism in a certain percentage of the cases. In the hope of identifying polymorphic regions more efficiently, we chose to take advantage of the proposed hypermutability of repeated DNA sequences and the specificity of oligonucleotide hybridization. Since, under appropriate conditions, oligonucleotide probes require complete base pairing for hybridization to occur, they will only hybridize to a subset of the members of a repeat family when all members of the family are not identical. The results presented here suggest that oligonucleotide hybridization can be used to extend the genomic sequences that can be tested for the presence of RFLPs. This expands the tools available to human genetics. In addition, the results suggest that repeated DNA sequences are indeed more polymorphic than single-copy sequences. 28 references, 2 figures
On the role of the second coding exon of the HIV-1 Tat protein in virus replication and MHC class I downregulation

NARCIS (Netherlands)

Verhoef, K.; Bauer, M.; Meyerhans, A.; Berkhout, B.

1998-01-01

Tat is an essential protein of human immunodeficiency virus type 1 (HIV-1) and activates transcription from the viral long terminal repeat (LTR) promoter. The tat gene is composed of two coding exons of which the first, corresponding to the N-terminal 72 amino acid residues, has been reported to be
Comparing whole-genome sequencing with Sanger sequencing for spa typing of methicillin-resistant Staphylococcus aureus.

Science.gov (United States)

Bartels, Mette Damkjær; Petersen, Andreas; Worning, Peder; Nielsen, Jesper Boye; Larner-Svensson, Hanna; Johansen, Helle Krogh; Andersen, Leif Percival; Jarløv, Jens Otto; Boye, Kit; Larsen, Anders Rhod; Westh, Henrik

2014-12-01

spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and an in-house analysis pipeline determines the spa types. Due to national surveillance, all MRSA isolates are sent to Statens Serum Institut, where the spa type is determined by PCR and Sanger sequencing. The purpose of this study was to evaluate the reliability of the spa types obtained by 150-bp paired-end Illumina WGS. MRSA isolates from new MRSA patients in 2013 (n = 699) in the capital region of Denmark were included. We found a 97% agreement between spa types obtained by the two methods. All isolates achieved a spa type by both methods. Nineteen isolates differed in spa types by the two methods, in most cases due to the lack of 24-bp repeats in the whole-genome-sequenced isolates. These related but incorrect spa types should have no consequence in outbreak investigations, since all epidemiologically linked isolates, regardless of spa type, will be included in the single nucleotide polymorphism (SNP) analysis. This will reveal the close relatedness of the spa types. In conclusion, our data show that WGS is a reliable method to determine the spa type of MRSA. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Outlier Loci and Selection Signatures of Simple Sequence Repeats (SSRs) in Flax (Linum usitatissimum L.).

Science.gov (United States)

Soto-Cerda, Braulio J; Cloutier, Sylvie

2013-01-01

Genomic microsatellites (gSSRs) and expressed sequence tag-derived SSRs (EST-SSRs) have gained wide application for elucidating genetic diversity and population structure in plants. Both marker systems are assumed to be selectively neutral when making demographic inferences, but this assumption is rarely tested. In this study, three neutrality tests were assessed for identifying outlier loci among 150 SSRs (85 gSSRs and 65 EST-SSRs) that likely influence estimates of population structure in three differentiated flax sub-populations ( F ST = 0.19). Moreover, the utility of gSSRs, EST-SSRs, and the combined sets of SSRs was also evaluated in assessing genetic diversity and population structure in flax. Six outlier loci were identified by at least two neutrality tests showing footprints of balancing selection. After removing the outlier loci, the STRUCTURE analysis and the dendrogram topology of EST-SSRs improved. Conversely, gSSRs and combined SSRs results did not change significantly, possibly as a consequence of the higher number of neutral loci assessed. Taken together, the genetic structure analyses established the superiority of gSSRs to determine the genetic relationships among flax accessions, although the combined SSRs produced the best results. Genetic diversity parameters did not differ statistically ( P > 0.05) between gSSRs and EST-SSRs, an observation partially explained by the similar number of repeat motifs. Our study provides new insights into the ability of gSSRs and EST-SSRs to measure genetic diversity and structure in flax and confirms the importance of testing for the occurrence of outlier loci to properly assess natural and breeding populations, particularly in studies considering only few loci.
The population history of endogenous retroviruses in mule deer (Odocoileus heminous)

Science.gov (United States)

Kamath, Pauline L.; Elleder, Daniel; Bao, Le; Cross, Paul C.; Powell, John H.; Poss, Mary

2013-01-01

Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5′ and 3′ long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations.
The population history of endogenous retroviruses in mule deer (Odocoileus hemionus).

Science.gov (United States)

Kamath, Pauline L; Elleder, Daniel; Bao, Le; Cross, Paul C; Powell, John H; Poss, Mary

2014-01-01

Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5' and 3' long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

Science.gov (United States)

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Estimating Genetic Conformism of Korean Mulberry Cultivars Using Random Amplified Polymorphic DNA and Inter-Simple Sequence Repeat Profiling

Directory of Open Access Journals (Sweden)

Sunirmal Sheet

2018-03-01

Full Text Available Apart from being fed to silkworms in sericulture, the ecologically important Mulberry plant has been used for traditional medicine in Asian countries as well as in manufacturing wine, food, and beverages. Germplasm analysis among Mulberry cultivars originating from South Korea is crucial in the plant breeding program for cultivar development. Hence, the genetic deviations and relations among 8 Morus alba plants, and one Morus lhou plant, of different cultivars collected from South Korea were investigated using 10 random amplified polymorphic DNA (RAPD and 10 inter-simple sequence repeat (ISSR markers in the present study. The ISSR markers exhibited a higher polymorphism (63.42% among mulberry genotypes in comparison to RAPD markers. Furthermore, the similarity coefficient was estimated for both markers and found to be varying between 0.183 and 0.814 for combined pooled data of ISSR and RAPD. The phenogram drawn using the UPGMA cluster method based on combined pooled data of RAPD and ISSR markers divided the nine mulberry genotypes into two divergent major groups and the two individual independent accessions. The distant relationship between Dae-Saug (SM1 and SangchonJo Sang Saeng (SM5 offers a possibility of utilizing them in mulberry cultivar improvement of Morus species of South Korea.
Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

Science.gov (United States)

Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

2012-12-01

In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
An infinitely expandable cloning strategy plus repeat-proof PCR for working with multiple shRNA.

Directory of Open Access Journals (Sweden)

Glen John McIntyre

Full Text Available Vector construction with restriction enzymes (REs typically involves the ligation of a digested donor fragment (insert to a reciprocally digested recipient fragment (vector backbone. Creating a suitable cloning plan becomes increasingly difficult for complex strategies requiring repeated insertions such as constructing multiple short hairpin RNA (shRNA expression vectors for RNA interference (RNAi studies. The problem lies in the reduced availability of suitable RE recognition sites with an increasing number of cloning events and or vector size. This report details a technically simple, directional cloning solution using REs with compatible cohesive ends that are repeatedly destroyed and simultaneously re-introduced with each round of cloning. Donor fragments can be made by PCR or sub-cloned from pre-existing vectors and inserted ad infinitum in any combination. The design incorporates several cloning cores in order to be compatible with as many donor sequences as possible. We show that joining sub-combinations made in parallel is more time-efficient than sequential construction (of one cassette at a time for any combination of 4 or more insertions. Screening for the successful construction of combinations using Taq polymerase based PCR became increasingly difficult with increasing number of repeated sequence elements. A Pfu polymerase based PCR was developed and successfully used to amplify combinations of up to eleven consecutive hairpin expression cassettes. The identified PCR conditions can be beneficial to others working with multiple shRNA or other repeated sequences, and the infinitely expandable cloning strategy serves as a general solution applicable to many cloning scenarios.
Constructs for the expression of repeating triple-helical protein domains

International Nuclear Information System (INIS)

Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M

2009-01-01

The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.
Constructs for the expression of repeating triple-helical protein domains

Energy Technology Data Exchange (ETDEWEB)

Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M, E-mail: jerome.werkmeister@csiro.a [CSIRO Molecular and Health Technologies, Bag 10, Clayton South, VIC 3169 (Australia)

2009-02-15

The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.
detectIR: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation.

Science.gov (United States)

Ye, Congting; Ji, Guoli; Li, Lei; Liang, Chun

2014-01-01

Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures--hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
Long Terminal Repeat CRISPR-CAR-Coupled "Universal" T Cells Mediate Potent Anti-leukemic Effects.

Science.gov (United States)

Georgiadis, Christos; Preece, Roland; Nickolay, Lauren; Etuk, Aniekan; Petrova, Anastasia; Ladon, Dariusz; Danyi, Alexandra; Humphryes-Kirilov, Neil; Ajetunmobi, Ayokunmi; Kim, Daesik; Kim, Jin-Soo; Qasim, Waseem

2018-03-06

Gene editing can be used to overcome allo-recognition, which otherwise limits allogeneic T cell therapies. Initial proof-of-concept applications have included generation of such "universal" T cells expressing chimeric antigen receptors (CARs) against CD19 target antigens combined with transient expression of DNA-targeting nucleases to disrupt the T cell receptor alpha constant chain (TRAC). Although relatively efficient, transgene expression and editing effects were unlinked, yields variable, and resulting T cell populations heterogeneous, complicating dosing strategies. We describe a self-inactivating lentiviral "terminal" vector platform coupling CAR expression with CRISPR/Cas9 effects through incorporation of an sgRNA element into the ΔU3 3' long terminal repeat (LTR). Following reverse transcription and duplication of the hybrid ΔU3-sgRNA, delivery of Cas9 mRNA resulted in targeted TRAC locus cleavage and allowed the enrichment of highly homogeneous (>96%) CAR + (>99%) TCR - populations by automated magnetic separation. Molecular analyses, including NGS, WGS, and Digenome-seq, verified on-target specificity with no evidence of predicted off-target events. Robust anti-leukemic effects were demonstrated in humanized immunodeficient mice and were sustained longer than by conventional CAR + TCR + T cells. Terminal-TRAC (TT) CAR T cells offer the possibility of a pre-manufactured, non-HLA-matched CAR cell therapy and will be evaluated in phase 1 trials against B cell malignancies shortly. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.

Structural analysis of a repetitive protein sequence motif in strepsirrhine primate amelogenin.

Directory of Open Access Journals (Sweden)

Rodrigo S Lacruz

2011-03-01

Full Text Available Strepsirrhines are members of a primate suborder that has a distinctive set of features associated with the development of the dentition. Amelogenin (AMEL, the better known of the enamel matrix proteins, forms 90% of the secreted organic matrix during amelogenesis. Although AMEL has been sequenced in numerous mammalian lineages, the only reported strepsirrhine AMEL sequences are those of the ring-tailed lemur and galago, which contain a set of additional proline-rich tandem repeats absent in all other primates species analyzed to date, but present in some non-primate mammals. Here, we first determined that these repeats are present in AMEL from three additional lemur species and thus are likely to be widespread throughout this group. To evaluate the functional relevance of these repeats in strepsirrhines, we engineered a mutated murine amelogenin sequence containing a similar proline-rich sequence to that of Lemur catta. In the monomeric form, the MQP insertions had no influence on the secondary structure or refolding properties, whereas in the assembled form, the insertions increased the hydrodynamic radii. We speculate that increased AMEL nanosphere size may influence enamel formation in strepsirrhine primates.
Identification and characterization of tandem repeats in exon III of dopamine receptor D4 (DRD4) genes from different mammalian species

DEFF Research Database (Denmark)

Larsen, Svend Arild; Mogensen, Line; Dietz, Rune

2005-01-01

repeat being found. In the domestic cow and gray seal we identified tandem repeats composed of 36-bp modules, each consisting of two closely related 18-bp basic units. A tandem repeat consisting of 9-bp modules was identified in sequences from mink and ferret. In the European otter we detected an 18-bp...
Direct and inverted repeats elicit genetic instability by both exploiting and eluding DNA double-strand break repair systems in mycobacteria.

Directory of Open Access Journals (Sweden)

Ewelina A Wojcik

Full Text Available Repetitive DNA sequences with the potential to form alternative DNA conformations, such as slipped structures and cruciforms, can induce genetic instability by promoting replication errors and by serving as a substrate for DNA repair proteins, which may lead to DNA double-strand breaks (DSBs. However, the contribution of each of the DSB repair pathways, homologous recombination (HR, non-homologous end-joining (NHEJ and single-strand annealing (SSA, to this sort of genetic instability is not fully understood. Herein, we assessed the genome-wide distribution of repetitive DNA sequences in the Mycobacterium smegmatis, Mycobacterium tuberculosis and Escherichia coli genomes, and determined the types and frequencies of genetic instability induced by direct and inverted repeats, both in the presence and in the absence of HR, NHEJ, and SSA. All three genomes are strongly enriched in direct repeats and modestly enriched in inverted repeats. When using chromosomally integrated constructs in M. smegmatis, direct repeats induced the perfect deletion of their intervening sequences ~1,000-fold above background. Absence of HR further enhanced these perfect deletions, whereas absence of NHEJ or SSA had no influence, suggesting compromised replication fidelity. In contrast, inverted repeats induced perfect deletions only in the absence of SSA. Both direct and inverted repeats stimulated excision of the constructs from the attB integration sites independently of HR, NHEJ, or SSA. With episomal constructs, direct and inverted repeats triggered DNA instability by activating nucleolytic activity, and absence of the DSB repair pathways (in the order NHEJ>HR>SSA exacerbated this instability. Thus, direct and inverted repeats may elicit genetic instability in mycobacteria by 1 directly interfering with replication fidelity, 2 stimulating the three main DSB repair pathways, and 3 enticing L5 site-specific recombination.
Direct and inverted repeats elicit genetic instability by both exploiting and eluding DNA double-strand break repair systems in mycobacteria.

Science.gov (United States)

Wojcik, Ewelina A; Brzostek, Anna; Bacolla, Albino; Mackiewicz, Pawel; Vasquez, Karen M; Korycka-Machala, Malgorzata; Jaworski, Adam; Dziadek, Jaroslaw

2012-01-01

Repetitive DNA sequences with the potential to form alternative DNA conformations, such as slipped structures and cruciforms, can induce genetic instability by promoting replication errors and by serving as a substrate for DNA repair proteins, which may lead to DNA double-strand breaks (DSBs). However, the contribution of each of the DSB repair pathways, homologous recombination (HR), non-homologous end-joining (NHEJ) and single-strand annealing (SSA), to this sort of genetic instability is not fully understood. Herein, we assessed the genome-wide distribution of repetitive DNA sequences in the Mycobacterium smegmatis, Mycobacterium tuberculosis and Escherichia coli genomes, and determined the types and frequencies of genetic instability induced by direct and inverted repeats, both in the presence and in the absence of HR, NHEJ, and SSA. All three genomes are strongly enriched in direct repeats and modestly enriched in inverted repeats. When using chromosomally integrated constructs in M. smegmatis, direct repeats induced the perfect deletion of their intervening sequences ~1,000-fold above background. Absence of HR further enhanced these perfect deletions, whereas absence of NHEJ or SSA had no influence, suggesting compromised replication fidelity. In contrast, inverted repeats induced perfect deletions only in the absence of SSA. Both direct and inverted repeats stimulated excision of the constructs from the attB integration sites independently of HR, NHEJ, or SSA. With episomal constructs, direct and inverted repeats triggered DNA instability by activating nucleolytic activity, and absence of the DSB repair pathways (in the order NHEJ>HR>SSA) exacerbated this instability. Thus, direct and inverted repeats may elicit genetic instability in mycobacteria by 1) directly interfering with replication fidelity, 2) stimulating the three main DSB repair pathways, and 3) enticing L5 site-specific recombination.
Quadruplex-forming sequences occupy discrete regions inside plant LTR retrotransposons

Czech Academy of Sciences Publication Activity Database

Lexa, M.; Kejnovský, Eduard; Šteflová, Pavlína; Konvalinová, H.; Vorlíčková, Michaela; Vyskot, Boris

2014-01-01

Roč. 42, č. 2 (2014), s. 968-978 ISSN 0305-1048 R&D Projects: GA ČR(CZ) GAP205/12/0466; GA ČR(CZ) GAP305/10/0930; GA ČR(CZ) GAP501/10/0102; GA ČR(CZ) GA522/09/0083; GA ČR GPP501/10/P483 Institutional support: RVO:68081707 Keywords : INTRAMOLECULAR DNA QUADRUPLEXES * VIRUS TYPE-1 RNA * CIRCULAR-DICHROISM Subject RIV: BO - Biophysics Impact factor: 9.112, year: 2014
CRISPRstrand: predicting repeat orientations to determine the crRNA-encoding strand at CRISPR loci

DEFF Research Database (Denmark)

Alkhnbashi, Omer S.; Costa, Fabrizio; Shah, Shiraz Ali

2014-01-01

Motivation: The discovery of CRISPR-Cas systems almost 20 years ago rapidly changed our perception of the bacterial and archaeal immune systems. CRISPR loci consist of several repetitive DNA sequences called repeats, inter-spaced by stretches of variable length sequences called spacers. This CRISPR...... array is transcribed and processed into multiple mature RNA species (crRNAs). A single crRNA is integrated into an interference complex, together with CRISPR-associated (Cas) proteins, to bind and degrade invading nucleic acids. Although existing bioinformatics tools can recognize CRISPR loci...... by their characteristic repeat-spacer architecture, they generally output CRISPR arrays of ambiguous orientation and thus do not determine the strand from which crRNAs are processed. Knowledge of the correct orientation is crucial for many tasks, including the classification of CRISPR conservation, the detection...
High quality maize centromere 10 sequence reveals evidence of frequent recombination events

Directory of Open Access Journals (Sweden)

Thomas Kai Wolfgruber

2016-03-01

Full Text Available The ancestral centromeres of maize contain long stretches of the tandemly arranged CentC repeat. The abundance of tandem DNA repeats and centromeric retrotransposons (CR have presented a significant challenge to completely assembling centromeres using traditional sequencing methods. Here we report a nearly complete assembly of the 1.85 Mb maize centromere 10 from inbred B73 using PacBio technology and BACs from the reference genome project. The error rates estimated from overlapping BAC sequences are 7 x 10-6 and 5 x 10-5 for mismatches and indels, respectively. The number of gaps in the region covered by the reassembly was reduced from 140 in the reference genome to three. Three expressed genes are located between 92 and 477 kb of the inferred ancestral CentC cluster, which lies within the region of highest centromeric repeat density. The improved assembly increased the count of full-length centromeric retrotransposons from 5 to 55 and revealed a 22.7 kb segmental duplication that occurred approximately 121,000 years ago. Our analysis provides evidence of frequent recombination events in the form of partial retrotransposons, deletions within retrotransposons, chimeric retrotransposons, segmental duplications including higher order CentC repeats, a deleted CentC monomer, centromere-proximal inversions, and insertion of mitochondrial sequences. Double-strand DNA break (DSB repair is the most plausible mechanism for these events and may be the major driver of centromere repeat evolution and diversity. This repair appears to be mediated by microhomology, suggesting that tandem repeats may have evolved to facilitate the repair of frequent DSBs in centromeres.
The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae

Directory of Open Access Journals (Sweden)

Dong-Keun Yi

2016-06-01

Full Text Available The plant chloroplast (cp genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp in length including a pair of short inverted repeat regions (IRa and IRb of 139 bp each separated by a small single copy (SSC region of 54,323 bp (SSC and a large single copy region of 66,735 bp (LSC. It contains 114 genes, 68 of which are protein coding genes, 35 tRNA and four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSR have been detected in A. nephrolepis cp genome. Large IR sequences locate in 42-kb inversion points (1186 bp. The A. nephrolepis cp genome is identical to Abies koreana’s which is closely related to taxa. Pairwise comparison between two cp genomes revealed 140 polymorphic sites in each. Complete cp genome sequence of A. nephrolepis has a significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for development of DNA markers for easy identification and classification.
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

Science.gov (United States)

2012-01-01

Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

Directory of Open Access Journals (Sweden)

Liu Chang

2012-12-01

Full Text Available Abstract Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.
Mapping sequences by parts

Directory of Open Access Journals (Sweden)

Guziolowski Carito

2007-09-01

Full Text Available Abstract Background: We present the N-map method, a pairwise and asymmetrical approach which allows us to compare sequences by taking into account evolutionary events that produce shuffled, reversed or repeated elements. Basically, the optimal N-map of a sequence s over a sequence t is the best way of partitioning the first sequence into N parts and placing them, possibly complementary reversed, over the second sequence in order to maximize the sum of their gapless alignment scores. Results: We introduce an algorithm computing an optimal N-map with time complexity O (|s| × |t| × N using O (|s| × |t| × N memory space. Among all the numbers of parts taken in a reasonable range, we select the value N for which the optimal N-map has the most significant score. To evaluate this significance, we study the empirical distributions of the scores of optimal N-maps and show that they can be approximated by normal distributions with a reasonable accuracy. We test the functionality of the approach over random sequences on which we apply artificial evolutionary events. Practical Application: The method is illustrated with four case studies of pairs of sequences involving non-standard evolutionary events.
DNA dynamics is likely to be a factor in the genomic nucleotide repeats expansions related to diseases.

Directory of Open Access Journals (Sweden)

Boian S Alexandrov

Full Text Available Trinucleotide repeats sequences (TRS represent a common type of genomic DNA motif whose expansion is associated with a large number of human diseases. The driving molecular mechanisms of the TRS ongoing dynamic expansion across generations and within tissues and its influence on genomic DNA functions are not well understood. Here we report results for a novel and notable collective breathing behavior of genomic DNA of tandem TRS, leading to propensity for large local DNA transient openings at physiological temperature. Our Langevin molecular dynamics (LMD and Markov Chain Monte Carlo (MCMC simulations demonstrate that the patterns of openings of various TRSs depend specifically on their length. The collective propensity for DNA strand separation of repeated sequences serves as a precursor for outsized intermediate bubble states independently of the G/C-content. We report that repeats have the potential to interfere with the binding of transcription factors to their consensus sequence by altered DNA breathing dynamics in proximity of the binding sites. These observations might influence ongoing attempts to use LMD and MCMC simulations for TRS-related modeling of genomic DNA functionality in elucidating the common denominators of the dynamic TRS expansion mutation with potential therapeutic applications.
Identification and functional analysis of a second RBF-2 binding site within the HIV-1 promoter

International Nuclear Information System (INIS)

Dahabieh, Matthew S.; Ooms, Marcel; Malcolm, Tom; Simon, Viviana; Sadowski, Ivan

2011-01-01

Transcription from the HIV-1 long terminal repeat (LTR) is mediated by numerous host transcription factors. In this study we characterized an E-box motif (RBE1) within the core promoter that was previously implicated in both transcriptional activation and repression. We show that RBE1 is a binding site for the RBF-2 transcription factor complex (USF1, USF2, and TFII-I), previously shown to bind an upstream viral element, RBE3. The RBE1 and RBE3 elements formed complexes of identical mobility and protein constituents in gel shift assays, both with Jurkat T-cell nuclear extracts and recombinant USF/TFII-I. Furthermore, both elements are regulators of HIV-1 expression; mutations in LTR-luciferase reporters and in HIV-1 molecular clones resulted in decreased transcription, virion production, and proviral expression in infected cells. Collectively, our data indicate that RBE1 is a bona fide RBF-2 binding site and that the RBE1 and RBE3 elements are necessary for mediating proper transcription from the HIV-1 LTR.
Interaction of the phospholipid scramblase 1 with HIV-1 Tat results in the repression of Tat-dependent transcription

International Nuclear Information System (INIS)

Kusano, Shuichi; Eizuru, Yoshito

2013-01-01

Highlights: •PLSCR1 specifically interacted with HIV-1 Tat in vitro and in vivo. •PLSCR1 repressed Tat-dependent transactivation of the HIV-1 LTR. •Suppression of PLSCR1 expression enhanced the levels of HIV-1 transcripts. •PLSCR1 reduced the nuclear localization of Tat. -- Abstract: Human phospholipid scramblase 1 (PLSCR1) is an interferon (IFN)-stimulated gene and possesses an IFN-mediated antiviral function. We show here that PLSCR1 directly interacts with human immunodeficiency virus type-1 (HIV-1) Tat. This interaction occurs both in vitro and in vivo through amino acids 160–250 of PLSCR1. Overexpression of PLSCR1 efficiently represses the Tat-dependent transactivation of the HIV-1 long terminal repeat (LTR) and reduces the nuclear translocation of Tat. In addition, shRNA-mediated suppression of endogenous PLSCR1 expression enhances the levels of gag mRNA in an HIV-1-infected T-cell line. These findings indicate that PLSCR1 negatively regulates the Tat-dependent transactivation of the HIV-1 LTR during HIV-1 infection
Overexpression of octamer transcription factors 1 or 2 alone has no effect on HIV-1 transcription in primary human CD4 T cells

International Nuclear Information System (INIS)

Zhang Mingce; Genin, Anna; Cron, Randy Q.

2004-01-01

We explored the binding of octamer (Oct) transcription factors to the HIV-1 long terminal repeat (LTR) by gel shift assays and showed none of the previously identified four potential Oct binding sites bound Oct-1 or Oct-2. Overexpression of Oct-1 or Oct-2 had no effect on HIV-1 LTR activity in transiently transfected primary human CD4 T cells. Next, primary human CD4 T cells were co-transfected with a green fluorescent protein (GFP)-expression vector and an Oct-1 or Oct-2 expression plasmid. The transfected cells were stimulated for 2 days and then infected with the NL4-3 strain of HIV-1. After 3 days of infection, there were no differences in HIV-1 p24 supernatant levels. Apoptosis of infected or bystander cells overexpressing Oct-1 or Oct-2 compared to control was also unaffected. Our studies demonstrate that Oct-1 and Oct-2 fail to bind to the HIV-1 LTR and have no effect on HIV-1 transcription in primary human CD4 T cells
HIV-1 Promoter Single Nucleotide Polymorphisms Are Associated with Clinical Disease Severity.

Directory of Open Access Journals (Sweden)

Michael R Nonnemacher

Full Text Available The large majority of human immunodeficiency virus type 1 (HIV-1 markers of disease progression/severity previously identified have been associated with alterations in host genetic and immune responses, with few studies focused on viral genetic markers correlate with changes in disease severity. This study presents a cross-sectional/longitudinal study of HIV-1 single nucleotide polymorphisms (SNPs contained within the viral promoter or long terminal repeat (LTR in patients within the Drexel Medicine CNS AIDS Research and Eradication Study (CARES Cohort. HIV-1 LTR SNPs were found to associate with the classical clinical disease parameters CD4+ T-cell count and log viral load. They were found in both defined and undefined transcription factor binding sites of the LTR. A novel SNP identified at position 108 in a known COUP (chicken ovalbumin upstream promoter/AP1 transcription factor binding site was significantly correlated with binding phenotypes that are potentially the underlying cause of the associated clinical outcome (increase in viral load and decrease in CD4+ T-cell count.
Intrinsic Stability of Episomal Circles Formed during Human Immunodeficiency Virus Type 1 Replication

Science.gov (United States)

Pierson, TheodoreC.; Kieffer, Tara L.; Ruff, Christian T.; Buck, Christopher; Gange, Stephen J.; Siliciano, Robert F.

2002-01-01

The development of surrogate markers capable of detecting residual ongoing human immunodeficiency virus type 1 (HIV-1) replication in patients receiving highly active antiretroviral therapy is an important step in understanding viral dynamics and in developing new treatment strategies. In this study, we evaluated the utility of circular forms of the viral genome for the detection of recent infection of cells by HIV-1. We measured the fate of both one-long terminal repeat (1-LTR) and 2-LTR circles following in vitro infection of logarithmically growing CD4+ T cells under conditions in which cell death was not a significant contributing factor. Circular forms of the viral genome were found to be highly stable and to decrease in concentration only as a function of dilution resulting from cell division. We conclude that these DNA circles are not intrinsically unstable in all cell types and suggest that the utility of 2-LTR circle assays in measuring recent HIV-1 infection of susceptible cells in vivo needs to be reevaluated. PMID:11907256
Origin-Dependent Inverted-Repeat Amplification: Tests of a Model for Inverted DNA Amplification.

Directory of Open Access Journals (Sweden)

Bonita J Brewer

2015-12-01

Full Text Available DNA replication errors are a major driver of evolution--from single nucleotide polymorphisms to large-scale copy number variations (CNVs. Here we test a specific replication-based model to explain the generation of interstitial, inverted triplications. While no genetic information is lost, the novel inversion junctions and increased copy number of the included sequences create the potential for adaptive phenotypes. The model--Origin-Dependent Inverted-Repeat Amplification (ODIRA-proposes that a replication error at pre-existing short, interrupted, inverted repeats in genomic sequences generates an extrachromosomal, inverted dimeric, autonomously replicating intermediate; subsequent genomic integration of the dimer yields this class of CNV without loss of distal chromosomal sequences. We used a combination of in vitro and in vivo approaches to test the feasibility of the proposed replication error and its downstream consequences on chromosome structure in the yeast Saccharomyces cerevisiae. We show that the proposed replication error-the ligation of leading and lagging nascent strands to create "closed" forks-can occur in vitro at short, interrupted inverted repeats. The removal of molecules with two closed forks results in a hairpin-capped linear duplex that we show replicates in vivo to create an inverted, dimeric plasmid that subsequently integrates into the genome by homologous recombination, creating an inverted triplication. While other models have been proposed to explain inverted triplications and their derivatives, our model can also explain the generation of human, de novo, inverted amplicons that have a 2:1 mixture of sequences from both homologues of a single parent--a feature readily explained by a plasmid intermediate that arises from one homologue and integrates into the other homologue prior to meiosis. Our tests of key features of ODIRA lend support to this mechanism and suggest further avenues of enquiry to unravel the origins
Origin-Dependent Inverted-Repeat Amplification: Tests of a Model for Inverted DNA Amplification.

Science.gov (United States)

Brewer, Bonita J; Payen, Celia; Di Rienzi, Sara C; Higgins, Megan M; Ong, Giang; Dunham, Maitreya J; Raghuraman, M K

2015-12-01

DNA replication errors are a major driver of evolution--from single nucleotide polymorphisms to large-scale copy number variations (CNVs). Here we test a specific replication-based model to explain the generation of interstitial, inverted triplications. While no genetic information is lost, the novel inversion junctions and increased copy number of the included sequences create the potential for adaptive phenotypes. The model--Origin-Dependent Inverted-Repeat Amplification (ODIRA)-proposes that a replication error at pre-existing short, interrupted, inverted repeats in genomic sequences generates an extrachromosomal, inverted dimeric, autonomously replicating intermediate; subsequent genomic integration of the dimer yields this class of CNV without loss of distal chromosomal sequences. We used a combination of in vitro and in vivo approaches to test the feasibility of the proposed replication error and its downstream consequences on chromosome structure in the yeast Saccharomyces cerevisiae. We show that the proposed replication error-the ligation of leading and lagging nascent strands to create "closed" forks-can occur in vitro at short, interrupted inverted repeats. The removal of molecules with two closed forks results in a hairpin-capped linear duplex that we show replicates in vivo to create an inverted, dimeric plasmid that subsequently integrates into the genome by homologous recombination, creating an inverted triplication. While other models have been proposed to explain inverted triplications and their derivatives, our model can also explain the generation of human, de novo, inverted amplicons that have a 2:1 mixture of sequences from both homologues of a single parent--a feature readily explained by a plasmid intermediate that arises from one homologue and integrates into the other homologue prior to meiosis. Our tests of key features of ODIRA lend support to this mechanism and suggest further avenues of enquiry to unravel the origins of interstitial
Using Next Generation RAD Sequencing to Isolate Multispecies Microsatellites for Pilosocereus (Cactaceae.

Directory of Open Access Journals (Sweden)

Isabel A S Bonatelli

Full Text Available Microsatellite markers (also known as SSRs, Simple Sequence Repeats are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.

Using Next Generation RAD Sequencing to Isolate Multispecies Microsatellites for Pilosocereus (Cactaceae).

Science.gov (United States)

Bonatelli, Isabel A S; Carstens, Bryan C; Moraes, Evandro M

2015-01-01

Microsatellite markers (also known as SSRs, Simple Sequence Repeats) are widely used in plant science and are among the most informative molecular markers for population genetic investigations, but the development of such markers presents substantial challenges. In this report, we discuss how next generation sequencing can replace the cloning, Sanger sequencing, identification of polymorphic loci, and testing cross-amplification that were previously required to develop microsatellites. We report the development of a large set of microsatellite markers for five species of the Neotropical cactus genus Pilosocereus using a restriction-site-associated DNA sequencing (RAD-seq) on a Roche 454 platform. We identified an average of 165 microsatellites per individual, with the absolute numbers across individuals proportional to the sequence reads obtained per individual. Frequency distribution of the repeat units was similar in the five species, with shorter motifs such as di- and trinucleotide being the most abundant repeats. In addition, we provide 72 microsatellites that could be potentially amplified in the sampled species and 22 polymorphic microsatellites validated in two populations of the species Pilosocereus machrisii. Although low coverage sequencing among individuals was observed for most of the loci, which we suggest to be more related to the nature of the microsatellite markers and the possible bias inserted by the restriction enzymes than to the genome size, our work demonstrates that an NGS approach is an efficient method to isolate multispecies microsatellites even in non-model organisms.
Replication error deficient and proficient colorectal cancer gene expression differences caused by 3'UTR polyT sequence deletions

DEFF Research Database (Denmark)

Wilding, Jennifer L; McGowan, Simon; Liu, Ying

2010-01-01

, and have distinct pathologies. Regulatory sequences controlling all aspects of mRNA processing, especially including message stability, are found in the 3'UTR sequence of most genes. The relevant sequences are typically A/U-rich elements or U repeats. Microarray analysis of 14 RER+ (deficient) and 16 RER......- (proficient) colorectal cancer cell lines confirms a striking difference in expression profiles. Analysis of the incidence of mononucleotide repeat sequences in the 3'UTRs, 5'UTRs, and coding sequences of those genes most differentially expressed in RER+ versus RER- cell lines has shown that much...... of this differential expression can be explained by the occurrence of a massive enrichment of genes with 3'UTR T repeats longer than 11 base pairs in the most differentially expressed genes. This enrichment was confirmed by analysis of two published consensus sets of RER differentially expressed probesets for a large...
Repeated fault rupture recorded by paleoenvironmental changes in a wetland sedimentary sequence ponded against the Alpine Fault, New Zealand

Science.gov (United States)

Clark, K.; Berryman, K. R.; Cochran, U. A.; Bartholomew, T.; Turner, G. M.

2010-12-01

At Hokuri Creek, in south Westland, New Zealand, an 18 m thickness of Holocene sediments has accumulated against the upthrown side of the Alpine Fault. Recent fluvial incision has created numerous exposures of this sedimentary sequence. At a decimetre to metre scale there are two dominant types of sedimentary units: clastic-dominated, grey silt packages, and organic-dominated, light brown peaty-silt units. These units represent repeated alternations of the paleoenvironment due to fault rupture over the past 7000 years. We have located the event horizons within the sedimentary sequence, and identified evidence to support earthquake-driven paleoenvironmental change (rather than climatic variability), and developed a model of paleoenvironmental changes over a typical seismic cycle. To quantitatively characterise the sediments we use high resolution photography, x-ray imaging, magnetic-susceptibility and total carbon analysis. To understand the depositional environment we used diatom and pollen studies. The organic-rich units have very low magnetic susceptibility and density values, with high greyscale and high total carbon values. Diatoms indicate these units represent stable wetland environments with standing water and predominantly in-situ organic material deposition. The clastic-rich units are characterised by higher magnetic susceptibility and density values, with low greyscale and total carbon. The clastic-rich units represent environments of flowing water and deep pond settings that received predominantly catchment-derived silt and sand. The event horizon is located at the upper contact of the organic-rich horizons. The event horizon contact marks a drastic change in hydrologic regime as fault rupture changed the stream base level and there was a synchronous influx of clastic sediment as the catchment responded to earthquake shaking. During the interseismic period the flowing-water environment gradually stabilised and returned to an organic-rich wetland. Such
Sequence finishing and mapping of Drosophila melanogasterheterochromatin

Energy Technology Data Exchange (ETDEWEB)

Hoskins, Roger A.; Carlson, Joseph W.; Kennedy, Cameron; Acevedo,David; Evans-Holm, Martha; Frise, Erwin; Wan, Kenneth H.; Park, Soo; Mendez-Lago, Maria; Rossi, Fabrizio; Villasante, Alfredo; Dimitri,Patrizio; Karpen, Gary H.; Celniker, Susan E.

2007-06-15

Genome sequences for most metazoans are incomplete due tothe presence of repeated DNA in the pericentromeric heterochromatin. Theheterochromatic regions of D. melanogaster contain 20 Mb of sequenceamenable to mapping, sequence assembly and finishing. Here we describethe generation of 15 Mb of finished or improved heterochromatic sequenceusing available clone resources and assembly and mapping methods. We alsoconstructed a BAC-based physical map that spans approximately 13 Mb ofthe pericentromeric heterochromatin, and a cytogenetic map that positionsapproximately 11 Mb of BAC contigs and sequence scaffolds in specificchromosomal locations. The integrated sequence assembly and maps greatlyimprove our understanding of the structure and composition of this poorlyunderstood fraction of a metazoan genome and provide a framework forfunctional analyses.
Whole-genome in-silico subtractive hybridization (WISH - using massive sequencing for the identification of unique and repetitive sex-specific sequences: the example of Schistosoma mansoni

Directory of Open Access Journals (Sweden)

Parrinello Hugues

2010-06-01

Full Text Available Abstract Background Emerging methods of massive sequencing that allow for rapid re-sequencing of entire genomes at comparably low cost are changing the way biological questions are addressed in many domains. Here we propose a novel method to compare two genomes (genome-to-genome comparison. We used this method to identify sex-specific sequences of the human blood fluke Schistosoma mansoni. Results Genomic DNA was extracted from male and female (heterogametic S. mansoni adults and sequenced with a Genome Analyzer (Illumina. Sequences are available at the NCBI sequence read archive http://www.ncbi.nlm.nih.gov/Traces/sra/ under study accession number SRA012151.6. Sequencing reads were aligned to the genome, and a pseudogenome composed of known repeats. Straightforward comparative bioinformatics analysis was performed to compare male and female schistosome genomes and identify female-specific sequences. We found that the S. mansoni female W chromosome contains only few specific unique sequences (950 Kb i.e. about 0.2% of the genome. The majority of W-specific sequences are repeats (10.5 Mb i.e. about 2.5% of the genome. Arbitrarily selected W-specific sequences were confirmed by PCR. Primers designed for unique and repetitive sequences allowed to reliably identify the sex of both larval and adult stages of the parasite. Conclusion Our genome-to-genome comparison method that we call "whole-genome in-silico subtractive hybridization" (WISH allows for rapid identification of sequences that are specific for a certain genotype (e.g. the heterogametic sex. It can in principle be used for the detection of any sequence differences between isolates (e.g. strains, pathovars or even closely related species.
Genetic variation in Rhodomyrtus tomentosa (Kemunting) populations from Malaysia as revealed by inter-simple sequence repeat markers.

Science.gov (United States)

Hue, T S; Abdullah, T L; Abdullah, N A P; Sinniah, U R

2015-12-14

Kemunting (Rhodomyrtus tomentosa) from the Myrtaceae family, is native to Malaysia. It is widely used in traditional medicine to treat various illnesses and possesses significant antibacterial properties. In addition, it has great potential as ornamental in landscape design. Genetic variability studies are important for the rational management and conservation of genetic material. In the present study, inter-simple sequence repeat markers were used to assess the genetic diversity of 18 R. tomentosa populations collected from ten states of Peninsular Malaysia. The 11 primers selected generated 173 bands that ranged in size from 1.6 kb to 130 bp, which corresponded to an average of 15.73 bands per primer. Of these bands, 97.69% (169 in total) were polymorphic. High genetic diversity was documented at the species level (H(T) = 0.2705; I = 0.3973; PPB = 97.69%) but there was a low diversity at population level (H(S) = 0.0073; I = 0 .1085; PPB = 20.14%). The high level of genetic differentiation revealed by G(ST) (73%) and analysis of molecular variance (63%), together with the limited gene flow among population (N(m) = 0.1851), suggests that the populations examined are isolated. Results from an unweighted pair group method with arithmetic mean dendrogram and principal coordinate analysis clearly grouped the populations into two geographic groups. This clear grouping can also be demonstrated by the significant Mantel test (r = 0.581, P = 0.001). We recommend that all the R. tomentosa populations be preserved in conservation program.
Mutated N-ras does not induce p19 arf in CO25 cell line | Saleh ...

African Journals Online (AJOL)

The mouse cell line (CO25) used in this study was transfected with a glucocorticoid inducible mutated human N-ras oncogene under transcriptional control of the steroid-sensitive promoter of the mouse mammary tumors virus long terminal repeat MMTV-LTR. This study was aimed to investigate the expression of p19arf and ...
Sequencing, Characterization, and Comparative Analyses of the Plastome of Caragana rosea var. rosea

Directory of Open Access Journals (Sweden)

Mei Jiang

2018-05-01

Full Text Available To exploit the drought-resistant Caragana species, we performed a comparative study of the plastomes from four species: Caragana rosea, C. microphylla, C. kozlowii, and C. Korshinskii. The complete plastome sequence of the C. rosea was obtained using the next generation DNA sequencing technology. The genome is a circular structure of 133,122 bases and it lacks inverted repeat. It contains 111 unique genes, including 76 protein-coding, 30 tRNA, and four rRNA genes. Repeat analyses obtained 239, 244, 258, and 246 simple sequence repeats in C. rosea, C. microphylla, C. kozlowii, and C. korshinskii, respectively. Analyses of sequence divergence found two intergenic regions: trnI-CAU-ycf2 and trnN-GUU-ycf1, exhibiting a high degree of variations. Phylogenetic analyses showed that the four Caragana species belong to a monophyletic clade. Analyses of Ka/Ks ratios revealed that five genes: rpl16, rpl20, rps11, rps7, and ycf1 and several sites having undergone strong positive selection in the Caragana branch. The results lay the foundation for the development of molecular markers and the understanding of the evolutionary process for drought-resistant characteristics.
Genome-wide analysis of tandem repeats in plants and green algae

Science.gov (United States)

Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

2014-01-01

Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
[Clustered regularly interspaced short palindromic repeats (CRISPR) site in Bacillus anthracis].

Science.gov (United States)

Gao, Zhiqi; Wang, Dongshu; Feng, Erling; Wang, Bingxiang; Hui, Yiming; Han, Shaobo; Jiao, Lei; Liu, Xiankai; Wang, Hengliang

2014-11-04

To investigate the polymorphism of clustered regularly interspaced short palindromic repeats (CRISPR) in Bacillu santhracis and the application to molecular typing based on the polymorphism of CRISPR in B. anthracis. We downloaded the whole genome sequence of 6 B. anthracis strains and extracted the CRISPR sites. We designed the primers of CRISPR sites and amplified the CRISPR fragments in 193 B. anthracis strains by PCR and sequenced these fragments. In order to reveal the polymorphism of CRISPR in B. anthracis, wealigned all the extracted sequences and sequenced results by local blasting. At the same time, we also analyzed the CRISPR sites in B. cereus and B. thuringiensis. We did not find any polymorphism of CRISPR in B. anthracis. The molecular typing approach based on CRISPR polymorphism is not suitable for B. anthracis, but it is possible for us to distinguish B. anthracis from B. cereus and B. thuringiensis.
The diversity and evolution of Wolbachia ankyrin repeat domain genes.

Directory of Open Access Journals (Sweden)

Stefanos Siozios

Full Text Available Ankyrin repeat domain-encoding genes are common in the eukaryotic and viral domains of life, but they are rare in bacteria, the exception being a few obligate or facultative intracellular Proteobacteria species. Despite having a reduced genome, the arthropod strains of the alphaproteobacterium Wolbachia contain an unusually high number of ankyrin repeat domain-encoding genes ranging from 23 in wMel to 60 in wPip strain. This group of genes has attracted considerable attention for their astonishing large number as well as for the fact that ankyrin proteins are known to participate in protein-protein interactions, suggesting that they play a critical role in the molecular mechanism that determines host-Wolbachia symbiotic interactions. We present a comparative evolutionary analysis of the wMel-related ankyrin repeat domain-encoding genes present in different Drosophila-Wolbachia associations. Our results show that the ankyrin repeat domain-encoding genes change in size by expansion and contraction mediated by short directly repeated sequences. We provide examples of intra-genic recombination events and show that these genes are likely to be horizontally transferred between strains with the aid of bacteriophages. These results confirm previous findings that the Wolbachia genomes are evolutionary mosaics and illustrate the potential that these bacteria have to generate diversity in proteins potentially involved in the symbiotic interactions.
Characteristics of palindromic sequences in DNA of the sea urchin Stronglyocentrotus intermedius

International Nuclear Information System (INIS)

Brykov, V.A.; Kukhlevskii, A.D.

1986-01-01

The fraction of palindromic sequences in the nuclear DNA of the sea urchin S. intermedius was characterized. Using chromatography on hydroxyapatite and treatment with S1 nuclease, it was shown that the fraction of palindromic sequences more than doubles when the sodium concentration in solution is increased or the temperature of reassociation is lowered. The increase is due to the involvement of inverted repeats in reassociation, which are characterized by a substantial nonhomologous character and/or the presence of an extended intervening DNA sequence. It was found by the method of reassociation of a nicked palindrome fraction with an excess of total homologous DNA that most of the inverted repeats in the sea urchin genome are unique sequences. The complexity of the palindrome fraction was estimated at 8.2 x 10 7 nucleotide pairs, and the number of palindromes per haploid genome ∼ 500,000
Molecular characterization of three common olive (Olea europaea L.) cultivars in Palestine, using simple sequence repeat (SSR) markers.

Science.gov (United States)

Obaid, Ramiz; Abu-Qaoud, Hassan; Arafeh, Rami

2014-09-03

Eight accessions of olive trees from three common varieties in Palestine, Nabali Baladi, Nabali Mohassan and Surri, were genetically evaluated using five simple sequence repeat (SSR) markers. A total of 17 alleles from 5 loci were observed in which 15 (88.2%) were polymorphic and 2 (11.8%) were monomorphic. An average of 3.4 alleles per locus was found ranging from 2.0 alleles with the primers GAPU-103 and DCA-9 to 5.0 alleles with U9932 and DCA-16. The smallest amplicon size observed was 50 bp with the primer DCA-16, whereas the largest one (450 bp) with the primer U9932. Cluster analysis with the unweighted pair group method with arithmetic average (UPGMA) showed three clusters: a cluster with four accessions from the 'Nabali Baladi' cultivar, another cluster with three accessions that represents the 'Nabali Mohassen' cultivar and finally the 'Surri' cultivar. The similarity coefficient for the eight olive tree samples ranged from a maximum of 100% between two accessions from Nabali Baladi and also in two other samples from Nabali Mohassan, to a minimum similarity coefficient (0.315) between the Surri and two Nabali Baladi accessions. The results in this investigation clearly highlight the genetic dissimilarity between the three main olive cultivars that have been misidentified and mixed up in the past, based on conventional morphological characters.
Diversity and genetic stability in banana genotypes in a breeding program using inter simple sequence repeats (ISSR) markers.

Science.gov (United States)

Silva, A V C; Nascimento, A L S; Vitória, M F; Rabbani, A R C; Soares, A N R; Lédo, A S

2017-02-23

Banana (Musa spp) is a fruit species frequently cultivated and consumed worldwide. Molecular markers are important for estimating genetic diversity in germplasm and between genotypes in breeding programs. The objective of this study was to analyze the genetic diversity of 21 banana genotypes (FHIA 23, PA42-44, Maçã, Pacovan Ken, Bucaneiro, YB42-47, Grand Naine, Tropical, FHIA 18, PA94-01, YB42-17, Enxerto, Japira, Pacovã, Prata-Anã, Maravilha, PV79-34, Caipira, Princesa, Garantida, and Thap Maeo), by using inter-simple sequence repeat (ISSR) markers. Material was generated from the banana breeding program of Embrapa Cassava & Fruits and evaluated at Embrapa Coastal Tablelands. The 12 primers used in this study generated 97.5% polymorphism. Four clusters were identified among the different genotypes studied, and the sum of the first two principal components was 48.91%. From the Unweighted Pair Group Method using Arithmetic averages (UPGMA) dendrogram, it was possible to identify two main clusters and subclusters. Two genotypes (Garantida and Thap Maeo) remained isolated from the others, both in the UPGMA clustering and in the principal cordinate analysis (PCoA). Using ISSR markers, we could analyze the genetic diversity of the studied material and state that these markers were efficient at detecting sufficient polymorphism to estimate the genetic variability in banana genotypes.
Comparing Young and Elderly Serial Reaction Time Task Performance on Repeated and Random Conditions

Directory of Open Access Journals (Sweden)

Fatemeh Ehsani

2012-07-01

Full Text Available Objectives: Acquisition motor skill training in elderly is at great importance. The main purpose of this study was to compare young and elderly performance in serial reaction time task on different repeated and random conditions. Methods & Materials: A serial reaction time task by using software was applied for studying motor learning in 30 young and 30 elderly. Each group divided randomly implicitly and explicitly into subgroups. A task 4 squares with different colors appeared on the monitor and subjects were asked to press its defined key immediately after observing it. Subjects practiced 8 motor blocks (4 repeated blocks, then 2 random blocks and 2 repeated blocks. Block time that was dependent variable measured and Independent-samples t- test with repeated ANOVA measures were used in this test. Results: young groups performed both repeated and random sequences significantly faster than elderly (P0.05. Explicit older subgroup performed 7,8 blocks slower than 6 block with a significant difference (P<0.05. Conclusion: Young adults discriminate high level performance than elderly in both repeated and random practice. Elderly performed random practice better than repeated practice.
Abundance, distribution and potential impact of transposable elements in the genome of Mycosphaerella fijiensis.

Science.gov (United States)

Santana, Mateus F; Silva, José C F; Batista, Aline D; Ribeiro, Lílian E; da Silva, Gilvan F; de Araújo, Elza F; de Queiroz, Marisa V

2012-12-22

Mycosphaerella fijiensis is a ascomycete that causes Black Sigatoka in bananas. Recently, the M. fijiensis genome was sequenced. Repetitive sequences are ubiquitous components of fungal genomes. In most genomic analyses, repetitive sequences are associated with transposable elements (TEs). TEs are dispersed repetitive DNA sequences found in a host genome. These elements have the ability to move from one location to another within the genome, and their insertion can cause a wide spectrum of mutations in their hosts. Some of the deleterious effects of TEs may be due to ectopic recombination among TEs of the same family. In addition, some transposons are physically linked to genes and can control their expression. To prevent possible damage caused by the presence of TEs in the genome, some fungi possess TE-silencing mechanisms, such as RIP (Repeat Induced Point mutation). In this study, the abundance, distribution and potential impact of TEs in the genome of M. fijiensis were investigated. A total of 613 LTR-Gypsy and 27 LTR-Copia complete elements of the class I were detected. Among the class II elements, a total of 28 Mariner, five Mutator and one Harbinger complete elements were identified. The results of this study indicate that transposons were and are important ectopic recombination sites. A distribution analysis of a transposable element from each class of the M. fijiensis isolates revealed variable hybridization profiles, indicating the activity of these elements. Several genes encoding proteins involved in important metabolic pathways and with potential correlation to pathogenicity systems were identified upstream and downstream of transposable elements. A comparison of the sequences from different transposon groups suggested the action of the RIP silencing mechanism in the genome of this microorganism. The analysis of TEs in M. fijiensis suggests that TEs play an important role in the evolution of this organism because the activity of these elements, as well
Abundance, distribution and potential impact of transposable elements in the genome of Mycosphaerella fijiensis

Directory of Open Access Journals (Sweden)

Santana Mateus F

2012-12-01

Full Text Available Abstract Background Mycosphaerella fijiensis is a ascomycete that causes Black Sigatoka in bananas. Recently, the M. fijiensis genome was sequenced. Repetitive sequences are ubiquitous components of fungal genomes. In most genomic analyses, repetitive sequences are associated with transposable elements (TEs. TEs are dispersed repetitive DNA sequences found in a host genome. These elements have the ability to move from one location to another within the genome, and their insertion can cause a wide spectrum of mutations in their hosts. Some of the deleterious effects of TEs may be due to ectopic recombination among TEs of the same family. In addition, some transposons are physically linked to genes and can control their expression. To prevent possible damage caused by the presence of TEs in the genome, some fungi possess TE-silencing mechanisms, such as RIP (Repeat Induced Point mutation. In this study, the abundance, distribution and potential impact of TEs in the genome of M. fijiensis were investigated. Results A total of 613 LTR-Gypsy and 27 LTR-Copia complete elements of the class I were detected. Among the class II elements, a total of 28 Mariner, five Mutator and one Harbinger complete elements were identified. The results of this study indicate that transposons were and are important ectopic recombination sites. A distribution analysis of a transposable element from each class of the M. fijiensis isolates revealed variable hybridization profiles, indicating the activity of these elements. Several genes encoding proteins involved in important metabolic pathways and with potential correlation to pathogenicity systems were identified upstream and downstream of transposable elements. A comparison of the sequences from different transposon groups suggested the action of the RIP silencing mechanism in the genome of this microorganism. Conclusions The analysis of TEs in M. fijiensis suggests that TEs play an important role in the evolution of
Memory for sequences of events impaired in typical aging

Science.gov (United States)

Allen, Timothy A.; Morris, Andrea M.; Stark, Shauna M.; Fortin, Norbert J.

2015-01-01

Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18–22 yr) and older adults (62–86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented “in sequence” or “out of sequence.” Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence (“Repeats”; e.g., ABADEF), (ii) skipping ahead in the sequence (“Skips”; e.g., ABDDEF), and (iii) inserting an item from a different sequence into the same ordinal position (“Ordinal Transfers”; e.g., AB3DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. Collectively, these findings provide unambiguous evidence that the
The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

Science.gov (United States)

Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

2014-04-01

Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.
The Complete Chloroplast Genome Sequences of the Medicinal Plant Forsythia suspensa (Oleaceae

Directory of Open Access Journals (Sweden)

Wenbin Wang

2017-10-01

Full Text Available Forsythia suspensa is an important medicinal plant and traditionally applied for the treatment of inflammation, pyrexia, gonorrhea, diabetes, and so on. However, there is limited sequence and genomic information available for F. suspensa. Here, we produced the complete chloroplast genomes of F. suspensa using Illumina sequencing technology. F. suspensa is the first sequenced member within the genus Forsythia (Oleaceae. The gene order and organization of the chloroplast genome of F. suspensa are similar to other Oleaceae chloroplast genomes. The F. suspensa chloroplast genome is 156,404 bp in length, exhibits a conserved quadripartite structure with a large single-copy (LSC; 87,159 bp region, and a small single-copy (SSC; 17,811 bp region interspersed between inverted repeat (IRa/b; 25,717 bp regions. A total of 114 unique genes were annotated, including 80 protein-coding genes, 30 tRNA, and four rRNA. The low GC content (37.8% and codon usage bias for A- or T-ending codons may largely affect gene codon usage. Sequence analysis identified a total of 26 forward repeats, 23 palindrome repeats with lengths >30 bp (identity > 90%, and 54 simple sequence repeats (SSRs with an average rate of 0.35 SSRs/kb. We predicted 52 RNA editing sites in the chloroplast of F. suspensa, all for C-to-U transitions. IR expansion or contraction and the divergent regions were analyzed among several species including the reported F. suspensa in this study. Phylogenetic analysis based on whole-plastome revealed that F. suspensa, as a member of the Oleaceae family, diverged relatively early from Lamiales. This study will contribute to strengthening medicinal resource conservation, molecular phylogenetic, and genetic engineering research investigations of this species.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.