terminal repeat sequences: Topics by WorldWideScience.org

Sample records for terminal repeat sequences

C-terminal low-complexity sequence repeats of Mycobacterium smegmatis Ku modulate DNA binding.

Science.gov (United States)

Kushwaha, Ambuj K; Grove, Anne

2013-01-24

Ku protein is an integral component of the NHEJ (non-homologous end-joining) pathway of DSB (double-strand break) repair. Both eukaryotic and prokaryotic Ku homologues have been characterized and shown to bind DNA ends. A unique feature of Mycobacterium smegmatis Ku is its basic C-terminal tail that contains several lysine-rich low-complexity PAKKA repeats that are absent from homologues encoded by obligate parasitic mycobacteria. Such PAKKA repeats are also characteristic of mycobacterial Hlp (histone-like protein) for which they have been shown to confer the ability to appose DNA ends. Unexpectedly, removal of the lysine-rich extension enhances DNA-binding affinity, but an interaction between DNA and the PAKKA repeats is indicated by the observation that only full-length Ku forms multiple complexes with a short stem-loop-containing DNA previously designed to accommodate only one Ku dimer. The C-terminal extension promotes DNA end-joining by T4 DNA ligase, suggesting that the PAKKA repeats also contribute to efficient end-joining. We suggest that low-complexity lysine-rich sequences have evolved repeatedly to modulate the function of unrelated DNA-binding proteins.
C-terminal sequences of hsp70 and hsp90 as non-specific anchors for tetratricopeptide repeat (TPR) proteins.

Science.gov (United States)

Ramsey, Andrew J; Russell, Lance C; Chinkers, Michael

2009-10-12

Steroid-hormone-receptor maturation is a multi-step process that involves several TPR (tetratricopeptide repeat) proteins that bind to the maturation complex via the C-termini of hsp70 (heat-shock protein 70) and hsp90 (heat-shock protein 90). We produced a random T7 peptide library to investigate the roles played by the C-termini of the two heat-shock proteins in the TPR-hsp interactions. Surprisingly, phages with the MEEVD sequence, found at the C-terminus of hsp90, were not recovered from our biopanning experiments. However, two groups of phages were isolated that bound relatively tightly to HsPP5 (Homo sapiens protein phosphatase 5) TPR. Multiple copies of phages with a C-terminal sequence of LFG were isolated. These phages bound specifically to the TPR domain of HsPP5, although mutation studies produced no evidence that they bound to the domain's hsp90-binding groove. However, the most abundant family obtained in the initial screen had an aspartate residue at the C-terminus. Two members of this family with a C-terminal sequence of VD appeared to bind with approximately the same affinity as the hsp90 C-12 control. A second generation pseudo-random phage library produced a large number of phages with an LD C-terminus. These sequences acted as hsp70 analogues and had relatively low affinities for hsp90-specific TPR domains. Unfortunately, we failed to identify residues near hsp90's C-terminus that impart binding specificity to individual hsp90-TPR interactions. The results suggest that the C-terminal sequences of hsp70 and hsp90 act primarily as non-specific anchors for TPR proteins.
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

Science.gov (United States)

Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

1997-12-01

Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10(4), 6 x 10(3) and 1.5 x 10(6) copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
Novel expressed sequence tag- simple sequence repeats (EST ...

African Journals Online (AJOL)

Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...
Comparison of the carboxy-terminal DP-repeat region in the co-chaperones Hop and Hip.

Science.gov (United States)

Nelson, Gregory M; Huffman, Holly; Smith, David F

2003-01-01

Functional steroid receptor complexes are assembled and maintained by an ordered pathway of interactions involving multiple components of the cellular chaperone machinery. Two of these components, Hop and Hip, serve as co-chaperones to the major heat shock proteins (Hsps), Hsp70 and Hsp90, and participate in intermediate stages of receptor assembly. In an effort to better understand the functions of Hop and Hip in the assembly process, we focused on a region of similarity located near the C-terminus of each co-chaperone. Contained within this region is a repeated sequence motif we have termed the DP repeat. Earlier mutagenesis studies implicated the DP repeat of either Hop or Hip in Hsp70 binding and in normal assembly of the co-chaperones with progesterone receptor (PR) complexes. We report here that the DP repeat lies within a protease-resistant domain that extends to or is near the C-terminus of both co-chaperones. Point mutations in the DP repeats render the C-terminal regions hypersensitive to proteolysis. In addition, a Hop DP mutant displays altered proteolytic digestion patterns, which suggest that the DP-repeat region influences the folding of other Hop domains. Although the respective DP regions of Hop and Hip share sequence and structural similarities, they are not functionally interchangeable. Moreover, a double-point mutation within the second DP-repeat unit of Hop that converts this to the sequence found in Hip disrupts Hop function; however, the corresponding mutation in Hip does not alter its function. We conclude that the DP repeats are important structural elements within a C-terminal domain, which is important for Hop and Hip function.
Repeated DNA sequences in fungi

Energy Technology Data Exchange (ETDEWEB)

Dutta, S K

1974-11-01

Several fungal species, representatives of all broad groups like basidiomycetes, ascomycetes and phycomycetes, were examined for the nature of repeated DNA sequences by DNA:DNA reassociation studies using hydroxyapatite chromatography. All of the fungal species tested contained 10 to 20 percent repeated DNA sequences. There are approximately 100 to 110 copies of repeated DNA sequences of approximately 4 x 10/sup 7/ daltons piece size of each. Repeated DNA sequence homoduplexes showed on average 5/sup 0/C difference of T/sub e/50 (temperature at which 50 percent duplexes dissociate) values from the corresponding homoduplexes of unfractionated whole DNA. It is suggested that a part of repetitive sequences in fungi constitutes mitochondrial DNA and a part of it constitutes nuclear DNA. (auth)
Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

Science.gov (United States)

Tetreault, Hannah M.; Ungerer, Mark C.

2016-01-01

The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667
Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

Science.gov (United States)

Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

2016-09-01

Genomic information about Muscovy duck parvovirus is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strain FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based the protein coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion of ITR may contribute to the genome evolution of MDPV under immunization pressure.
The proviral genome of radiation leukemia virus: Molecular cloning, nucleotide sequence of its long terminal repeat and integration in lymphoma cell DNA

International Nuclear Information System (INIS)

Janowski, M.; Merregaert, J.; Boniver, J.; Maisin, J.R.

1985-01-01

The proviral genome of a thymotropic and leukemogenic C57BL/Ka mouse retrovirus, RadLV/VL/sub 3/(T+L+), was cloned as a biologically active PstI insert in the bacterial plasmid pBR322. Its restriction map was compared to those, already known, of two nonthymotropic and nonleukemogenic viruses of the same mouse strain, the ecotropic BL/Ka(B) and the xenotropic constituent of the radiation leukemia virus complex (RadLV). Differences were observed in the pol gene and in the env gene. Moreover, the nucleotide sequence of the RadLV/VL/sub 3/(T+L+) long terminal repeat revealed the existence of two copies of a 42 bp long sequence, separated by 11 nucleotides and of which BL/Ka(B) possesses only one copy
Optimization of sequence alignment for simple sequence repeat regions

Directory of Open Access Journals (Sweden)

Ogbonnaya Francis C

2011-07-01

Full Text Available Abstract Background Microsatellites, or simple sequence repeats (SSRs, are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs. SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic
Creation and structure determination of an artificial protein with three complete sequence repeats

Energy Technology Data Exchange (ETDEWEB)

Adachi, Motoyasu, E-mail: adachi.motoyasu@jaea.go.jp; Shimizu, Rumi; Kuroki, Ryota [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Blaber, Michael [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Florida State University, Tallahassee, FL 32306-4300 (United States)

2013-11-01

An artificial protein with three complete sequence repeats was created and the structure was determined by X-ray crystallography. The structure showed threefold symmetry even though there is an amino- and carboxy-terminal. The artificial protein with threefold symmetry may be useful as a scaffold to capture small materials with C3 symmetry. Symfoil-4P is a de novo protein exhibiting the threefold symmetrical β-trefoil fold designed based on the human acidic fibroblast growth factor. First three asparagine–glycine sequences of Symfoil-4P are replaced with glutamine–glycine (Symfoil-QG) or serine–glycine (Symfoil-SG) sequences protecting from deamidation, and His-Symfoil-II was prepared by introducing a protease digestion site into Symfoil-QG so that Symfoil-II has three complete repeats after removal of the N-terminal histidine tag. The Symfoil-QG and SG and His-Symfoil-II proteins were expressed in Eschericha coli as soluble protein, and purified by nickel affinity chromatography. Symfoil-II was further purified by anion-exchange chromatography after removing the HisTag by proteolysis. Both Symfoil-QG and Symfoil-II were crystallized in 0.1 M Tris-HCl buffer (pH 7.0) containing 1.8 M ammonium sulfate as precipitant at 293 K; several crystal forms were observed for Symfoil-QG and II. The maximum diffraction of Symfoil-QG and II crystals were 1.5 and 1.1 Å resolution, respectively. The Symfoil-II without histidine tag diffracted better than Symfoil-QG with N-terminal histidine tag. Although the crystal packing of Symfoil-II is slightly different from Symfoil-QG and other crystals of Symfoil derivatives having the N-terminal histidine tag, the refined crystal structure of Symfoil-II showed pseudo-threefold symmetry as expected from other Symfoils. Since the removal of the unstructured N-terminal histidine tag did not affect the threefold structure of Symfoil, the improvement of diffraction quality of Symfoil-II may be caused by molecular characteristics of
Characterization of the env gene and long terminal repeat of molecularly cloned Friend mink cell focus-inducing virus DNA.

OpenAIRE

Adachi, A; Sakai, K; Kitamura, N; Nakanishi, S; Niwa, O; Matsuyama, M; Ishimoto, A

1984-01-01

The highly oncogenic erythroleukemia-inducing Friend mink cell focus-inducing (MCF) virus was molecularly cloned in phage lambda gtWES.lambda B, and the DNA sequences of the env gene and the long terminal repeat were determined. The nucleotide sequences of Friend MCF virus and Friend spleen focus-forming virus were quite homologous, supporting the hypothesis that Friend spleen focus-forming virus might be generated via Friend MCF virus from an ecotropic Friend virus mainly by some deletions. ...
simple sequence repeat (SSR)

African Journals Online (AJOL)

In the present study, 78 mapped simple sequence repeat (SSR) markers representing 11 linkage groups of adzuki bean were evaluated for transferability to mungbean and related Vigna spp. 41 markers amplified characteristic bands in at least one Vigna species. The transferability percentage across the genotypes ranged ...
Cell type-specific termination of transcription by transposable element sequences.

Science.gov (United States)

Conley, Andrew B; Jordan, I King

2012-09-30

Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription
Human immunodeficiency virus long terminal repeat responds to T-cell activation signals

International Nuclear Information System (INIS)

Tong-Starksen, S.E.; Luciw, P.A.; Peterlin, B.M.

1987-01-01

Human immunodeficiency virus (HIV), the causative agent of AIDS, infects and kills lymphoid cells bearing the CD4 antigen. In an infected cell, a number of cellular as well as HIV-encoded gene products determine the levels of viral gene expression and HIV replication. Efficient HIV replication occurs in activated T cells. Utilizing transient expression assays, the authors show that gene expression directed by the HIV long terminal repeat (LTR) increases in response to T-cell activation signals. The effects of T-cell activation and of the HIV-encoded trans-activator (TAT) are multiplicative. Analysis of mutations and deletions in the HIV LTR reveals that the region responding to T-cell activation signals is located at positions -105 to -80. These sequences are composed of two direct repeats, which are homologous to the core transcriptional enhancer elements in the simian virus 40 genome. The studies reveal that these elements function as the HIV enhancer. By acting directly on the HIV LTR, T-cell activation may play an important role in HIV gene expression and in the activation of latent HIV
Cell type-specific termination of transcription by transposable element sequences

Directory of Open Access Journals (Sweden)

Conley Andrew B

2012-09-01

Full Text Available Abstract Background Transposable elements (TEs encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Results Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3′ UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. Conclusions TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are
Not so bad after all: retroviruses and long terminal repeat retrotransposons as a source of new genes in vertebrates.

Science.gov (United States)

Naville, M; Warren, I A; Haftek-Terreau, Z; Chalopin, D; Brunet, F; Levin, P; Galiana, D; Volff, J-N

2016-04-01

Viruses and transposable elements, once considered as purely junk and selfish sequences, have repeatedly been used as a source of novel protein-coding genes during the evolution of most eukaryotic lineages, a phenomenon called 'molecular domestication'. This is exemplified perfectly in mammals and other vertebrates, where many genes derived from long terminal repeat (LTR) retroelements (retroviruses and LTR retrotransposons) have been identified through comparative genomics and functional analyses. In particular, genes derived from gag structural protein and envelope (env) genes, as well as from the integrase-coding and protease-coding sequences, have been identified in humans and other vertebrates. Retroelement-derived genes are involved in many important biological processes including placenta formation, cognitive functions in the brain and immunity against retroelements, as well as in cell proliferation, apoptosis and cancer. These observations support an important role of retroelement-derived genes in the evolution and diversification of the vertebrate lineage. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Structure of long terminal repeats of transcriptionally active and inactive copies of Drosophila mobile dispersed genetic elements mdg3

International Nuclear Information System (INIS)

Dzhumagaliev, E.B.; Mazo, A.N.; Baev, A.A. Jr.; Gorelova, T.V.; Arkhipova, I.R.; Shuppe, N.G.; Il'in, Yu.V.

1986-01-01

The authors have determined the nucleotide sequences of long terminal repeats (LTRS) and adjacent regions in the transcribed and nontranscribed variants of the mobile dispersed gene mdg3. In its main characteristics the mdg3 is similar to other mdg. Its integration into chromosomal DNA brings about duplication of the 4 bp of the host DNA, no specificity of the mdg integration at the nucleotide level being detected. The mdg3 is flanked by a 5 bp inverted repeat. The variations in the length of the LTR in different mdg copies is mainly due to duplication of certain sequences in the U3 and R regions. mdg3 copies with a LTR length of 267 bp are the most abundant and are completely conservative in their primary structure. They are transcribed in the cells of the 67J25D culture, but not transcribed in the K/sub c/ line, where another mdg3 variant with a LTR length of 293 bp is transcriptionally active. The SI mapping of transcription initiation and termination sites has shown that in both mdg3 variants they are localized in the same LTR regions, and that the LTR itself has a characteristic U3-R-U5 structure-like retroviral LTRs. The possible factors involved in the regulation of mdg transcription are discussed
Simple sequence repeat marker development and genetic mapping ...

Indian Academy of Sciences (India)

polymorphic SSR (simple sequence repeats) markers from libraries enriched for GA, CAA and AAT repeats, as well as 6 ... ers for quinoa was the development of a genetic linkage map ...... Weber J. L. 1990 Informativeness of human (dC-dA)n.
Discovery and analysis of an active long terminal repeat-retrotransposable element in Aspergillus oryzae.

Science.gov (United States)

Jie Jin, Feng; Hara, Seiichi; Sato, Atsushi; Koyama, Yasuji

2014-01-01

Wild-type Aspergillus oryzae RIB40 contains two copies of the AO090005001597 gene. We previously constructed A. oryzae RIB40 strain, RKuAF8B, with multiple chromosomal deletions, in which the AO090005001597 copy number was found to be increased significantly. Sequence analysis indicated that AO090005001597 is part of a putative 6,000-bp retrotransposable element, flanked by two long terminal repeats (LTRs) of 669 bp, with characteristics of retroviruses and retrotransposons, and thus designated AoLTR (A. oryzae LTR-retrotransposable element). AoLTR comprised putative reverse transcriptase, RNase H, and integrase domains. The deduced amino acid sequence alignment of AoLTR showed 94% overall identity with AFLAV, an A. flavus Tf1/sushi retrotransposon. Quantitative real-time RT-PCR showed that AoLTR gene expression was significantly increased in the RKuAF8B, in accordance with the increased copy number. Inverse PCR indicated that the full-length retrotransposable element was randomly integrated into multiple genomic locations. However, no obvious phenotypic changes were associated with the increased AoLTR gene copy number.

Identification, variation and transcription of pneumococcal repeat sequences

Science.gov (United States)

2011-01-01

Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes

DEFF Research Database (Denmark)

Peng, Xu; Brügger, Kim; Shen, Biao

2003-01-01

terminally modified and corresponds to SSO454, an open reading frame of previously unassigned function. It binds specifically to DNA fragments carrying double and single repeat sequences, binding on one side of the repeat structure, and producing an opening of the opposite side of the DNA structure. It also...... recognizes both main families of repeat sequences in S. solfataricus. The recombinant protein, expressed in Escherichia coli, showed the same binding properties to the SRSR repeat as the native one. The SSO454 protein exhibits a tripartite internal repeat structure which yields a good sequence match...... with a helix-turn-helix DNA-binding motif. Although this putative motif is shared by other archaeal proteins, orthologs of SSO454 were only detected in species within the Sulfolobus genus and in the closely related Acidianus genus. We infer that the genus-specific protein induces an opening of the structure...
Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

Science.gov (United States)

Oggioni, M R; Claverys, J P

1999-10-01

A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Always look on both sides: phylogenetic information conveyed by simple sequence repeat allele sequences.

Directory of Open Access Journals (Sweden)

Stéphanie Barthe

Full Text Available Simple sequence repeat (SSR markers are widely used tools for inferences about genetic diversity, phylogeography and spatial genetic structure. Their applications assume that variation among alleles is essentially caused by an expansion or contraction of the number of repeats and that, accessorily, mutations in the target sequences follow the stepwise mutation model (SMM. Generally speaking, PCR amplicon sizes are used as direct indicators of the number of SSR repeats composing an allele with the data analysis either ignoring the extent of allele size differences or assuming that there is a direct correlation between differences in amplicon size and evolutionary distance. However, without precisely knowing the kind and distribution of polymorphism within an allele (SSR and the associated flanking region (FR sequences, it is hard to say what kind of evolutionary message is conveyed by such a synthetic descriptor of polymorphism as DNA amplicon size. In this study, we sequenced several SSR alleles in multiple populations of three divergent tree genera and disentangled the types of polymorphisms contained in each portion of the DNA amplicon containing an SSR. The patterns of diversity provided by amplicon size variation, SSR variation itself, insertions/deletions (indels, and single nucleotide polymorphisms (SNPs observed in the FRs were compared. Amplicon size variation largely reflected SSR repeat number. The amount of variation was as large in FRs as in the SSR itself. The former contributed significantly to the phylogenetic information and sometimes was the main source of differentiation among individuals and populations contained by FR and SSR regions of SSR markers. The presence of mutations occurring at different rates within a marker's sequence offers the opportunity to analyse evolutionary events occurring on various timescales, but at the same time calls for caution in the interpretation of SSR marker data when the distribution of within
Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats.

Science.gov (United States)

Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

2013-08-01

Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Multineuronal Spike Sequences Repeat with Millisecond Precision

Directory of Open Access Journals (Sweden)

Koki eMatsumoto

2013-06-01

Full Text Available Cortical microcircuits are nonrandomly wired by neurons. As a natural consequence, spikes emitted by microcircuits are also nonrandomly patterned in time and space. One of the prominent spike organizations is a repetition of fixed patterns of spike series across multiple neurons. However, several questions remain unsolved, including how precisely spike sequences repeat, how the sequences are spatially organized, how many neurons participate in sequences, and how different sequences are functionally linked. To address these questions, we monitored spontaneous spikes of hippocampal CA3 neurons ex vivo using a high-speed functional multineuron calcium imaging technique that allowed us to monitor spikes with millisecond resolution and to record the location of spiking and nonspiking neurons. Multineuronal spike sequences were overrepresented in spontaneous activity compared to the statistical chance level. Approximately 75% of neurons participated in at least one sequence during our observation period. The participants were sparsely dispersed and did not show specific spatial organization. The number of sequences relative to the chance level decreased when larger time frames were used to detect sequences. Thus, sequences were precise at the millisecond level. Sequences often shared common spikes with other sequences; parts of sequences were subsequently relayed by following sequences, generating complex chains of multiple sequences.
Development of simple sequence repeat (SSR) markers that are ...

African Journals Online (AJOL)

Simple sequence repeats (SSRs) markers were developed through data mining of 3,803 expressed sequence tags (ESTs) previously published. A total of 144 di- to penta-type SSRs were identified and they were screened for polymorphism between two turnip cultivars, 'Tsuda' and 'Yurugi Akamaru'. Out of 90 EST-SSRs for ...
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Directory of Open Access Journals (Sweden)

Charlotte Rehm

Full Text Available In prokaryotes simple sequence repeats (SSRs with unit sizes of 1-5 nucleotides (nt are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4 structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc, Xanthomonas axonopodis pv. citri str. 306 (Xac, and Nostoc sp. strain PCC7120 (Ana. In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Science.gov (United States)

Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

2015-11-16

Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

Science.gov (United States)

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Simple sequence repeat marker loci discovery using SSR primer.

Science.gov (United States)

Robinson, Andrew J; Love, Christopher G; Batley, Jacqueline; Barker, Gary; Edwards, David

2004-06-12

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. With the increase in the availability of DNA sequence information, an automated process to identify and design PCR primers for amplification of SSR loci would be a useful tool in plant breeding programs. We report an application that integrates SPUTNIK, an SSR repeat finder, with Primer3, a PCR primer design program, into one pipeline tool, SSR Primer. On submission of multiple FASTA formatted sequences, the script screens each sequence for SSRs using SPUTNIK. The results are parsed to Primer3 for locus-specific primer design. The script makes use of a Web-based interface, enabling remote use. This program has been written in PERL and is freely available for non-commercial users by request from the authors. The Web-based version may be accessed at http://hornbill.cspp.latrobe.edu.au/
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular
SeqEntropy: genome-wide assessment of repeats for short read sequencing.

Directory of Open Access Journals (Sweden)

Hsueh-Ting Chu

Full Text Available BACKGROUND: Recent studies on genome assembly from short-read sequencing data reported the limitation of this technology to reconstruct the entire genome even at very high depth coverage. We investigated the limitation from the perspective of information theory to evaluate the effect of repeats on short-read genome assembly using idealized (error-free reads at different lengths. METHODOLOGY/PRINCIPAL FINDINGS: We define a metric H(k to be the entropy of sequencing reads at a read length k and use the relative loss of entropy ΔH(k to measure the impact of repeats for the reconstruction of whole-genome from sequences of length k. In our experiments, we found that entropy loss correlates well with de-novo assembly coverage of a genome, and a score of ΔH(k>1% indicates a severe loss in genome reconstruction fidelity. The minimal read lengths to achieve ΔH(k<1% are different for various organisms and are independent of the genome size. For example, in order to meet the threshold of ΔH(k<1%, a read length of 60 bp is needed for the sequencing of human genome (3.2 10(9 bp and 320 bp for the sequencing of fruit fly (1.8×10(8 bp. We also calculated the ΔH(k scores for 2725 prokaryotic chromosomes and plasmids at several read lengths. Our results indicate that the levels of repeats in different genomes are diverse and the entropy of sequencing reads provides a measurement for the repeat structures. CONCLUSIONS/SIGNIFICANCE: The proposed entropy-based measurement, which can be calculated in seconds to minutes in most cases, provides a rapid quantitative evaluation on the limitation of idealized short-read genome sequencing. Moreover, the calculation can be parallelized to scale up to large euakryotic genomes. This approach may be useful to tune the sequencing parameters to achieve better genome assemblies when a closely related genome is already available.
simple sequence repeat (SSR) markers in genetic analysis of

African Journals Online (AJOL)

Yomi

2012-08-28

1998). Cross- species amplification of soybean (Glycine max) simple sequence repeats (SSRs) within the genus and other legume genera: implications for the transferability of SSRs in plants. Mol. Biol. Evol. 15:1275-1287.
3' terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing.

Science.gov (United States)

Goldfarb, Katherine C; Cech, Thomas R

2013-09-21

Post-transcriptional 3' end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3' RACE coupled with high-throughput sequencing to characterize the 3' terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. The 3' terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3' terminus of an in vitro transcribed MRP RNA control and the differing 3' terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). 3' RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3' terminal sequences of noncoding RNAs.
Effects of integration and replication on transcription of the HIV-1 long terminal repeat

NARCIS (Netherlands)

Jeang, K. T.; Berkhout, B.; Dropulic, B.

1993-01-01

The activity of a promoter is influenced by chromosomal and cell cycle/replication context. We analyzed the influences of integration and replication on transcription of the human immunodeficiency virus (HIV)-1 long terminal repeat (LTR). We found that one requirement for Tat trans-activated
Comparative effectiveness of inter-simple sequence repeat and ...

African Journals Online (AJOL)

A study to compare the effectiveness of inter-simple sequence repeats (ISSR) and randomly amplified polymorphic DNA (RAPD) profiling was carried out with a total of 65 DNA samples using 12 species of Indian Garcinia. ISSR and RAPD profiling were performed with 19 and 12 primers, respectively. ISSR markers ...
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.

Science.gov (United States)

Anwar, Tamanna; Khan, Asad U

2006-02-20

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They are also found at longer lengths (> 6 repeating units) of tracts. Most of the computer programs that find SSRs do not report its exact position. A computer program SSRscanner was written to find out distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search repeats of any length and produce outputs with their exact position on chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
Simple sequence repeat (SSR)-based genetic variability among ...

African Journals Online (AJOL)

The objective of this study was to compare if simple sequence repeat (SSR) markers could correctly identify peanut genotypes with difference in specific leaf weight (SLW) and relative water content (RWC). Four peanut genotypes and two water regimes (FC and 1/3 available water; 1/3 AW) were arranged in factorial ...

Potentials and limitations of histone repeat sequences for phylogenetic reconstruction of Sophophora.

Science.gov (United States)

Baldo, A M; Les, D H; Strausbaugh, L D

1999-11-01

Simplified DNA sequence acquisition has provided many new data sets that are useful for phylogenetic reconstruction, including single- and multiple-copy nuclear and organellar genes. Although transcribed regions receive much attention, nontranscribed regions have recently been added to the repertoire of sequences suitable for phylogenetic studies, especially for closely related taxa. We evaluated the efficacy of a small portion of the histone repeat for phylogenetic reconstruction among Drosophila species. Histone repeats in invertebrates offer distinct advantages similar to those of widely used ribosomal repeats. First, the units are tandemly repeated and undergo concerted evolution. Second, histone repeats include both highly conserved coding and variable intergenic regions. This composition facilitates application of "universal" primers spanning potentially informative sites. We examined a small region of the histone repeat, including the intergenic spacer segments of coding regions from the divergently transcribed H2A and H2B histone genes. The spacer (about 230 bp) exists as a mosaic with highly conserved functional motifs interspersed with rapidly diverging regions; the former aid in alignment of the spacer. There are no ambiguities in alignment of coding regions. Coding and noncoding regions were analyzed together and separately for phylogenetic information. Parsimony, distance, and maximum-likelihood methods successfully retrieve the corroborated phylogeny for the taxa examined. This study demonstrates the resolving power of a small histone region which may now be added to the growing collection of phylogenetically useful DNA sequences.
PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes.

Science.gov (United States)

Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A

2011-01-01

PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are the tandem repeats of nucleotide motifs of the sizes 1-6 bp and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin phase variations and antigenic variations seen in some bacteria. Although SSR-mediated phase variation and antigenic variations have been well-studied in some bacteria there seems a lot of other species of prokaryotes yet to be investigated for SSR mediated adaptive and other evolutionary advantages. As a part of our on-going studies on SSR polymorphism in prokaryotes we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes and extracted a number of SSRs showing length variations and created a relational database called PSSRdb. This database gives useful information such as location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
APE1 incision activity at abasic sites in tandem repeat sequences.

Science.gov (United States)

Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

2014-05-29

Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

Energy Technology Data Exchange (ETDEWEB)

Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

1995-01-20

The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination ({theta} = 0.02, Z{sub max} = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

Science.gov (United States)

Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

2015-05-01

Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
Determinants of Genomic RNA Encapsidation in the Saccharomyces cerevisiae Long Terminal Repeat Retrotransposons Ty1 and Ty3

Directory of Open Access Journals (Sweden)

Katarzyna Pachulska-Wieczorek

2016-07-01

Full Text Available Long-terminal repeat (LTR retrotransposons are transposable genetic elements that replicate intracellularly, and can be considered progenitors of retroviruses. Ty1 and Ty3 are the most extensively characterized LTR retrotransposons whose RNA genomes provide the template for both protein translation and genomic RNA that is packaged into virus-like particles (VLPs and reverse transcribed. Genomic RNAs are not divided into separate pools of translated and packaged RNAs, therefore their trafficking and packaging into VLPs requires an equilibrium between competing events. In this review, we focus on Ty1 and Ty3 genomic RNA trafficking and packaging as essential steps of retrotransposon propagation. We summarize the existing knowledge on genomic RNA sequences and structures essential to these processes, the role of Gag proteins in repression of genomic RNA translation, delivery to VLP assembly sites, and encapsidation.
Association of endogenous retroviruses and long terminal repeats with human disorders

Directory of Open Access Journals (Sweden)

Iyoko eKatoh

2013-09-01

Full Text Available Since the human genome sequences became available in 2001, our knowledge about the human transposable elements which comprise ~40% of the total nucleotides has been expanding. Non- LTR (long terminal repeat retrotransposons are actively transposing in the present-day human genome, and have been found to cause ~100 identified clinical cases of varied disorders. In contrast, almost all of the human endogenous retroviruses (HERVs originating from ancient infectious retroviruses lost their infectivity and transposing activity at various times before the human-chimpanzee speciation (~6 million years ago, and no known HERV is presently infectious. Insertion of HERVs and mammalian apparent LTR retrotransposons (MaLRs into the chromosomal DNA influenced a number of host genes in various modes during human evolution. Apart from the aspect of genome evolution, HERVs and solitary LTRs being suppressed in normal biological processes can potentially act as extra transcriptional apparatuses of cellular genes by re-activation in individuals. There has been a reasonable prediction that aberrant LTR activation could trigger malignant disorders and autoimmune responses if epigenetic changes including DNA hypomethylation occur in somatic cells. Evidence supporting this hypothesis has begun to emerge only recently: a MaLR family LTR activation in the pathogenesis of Hodgkin’s lymphoma and a HERV-E antigen expression in an anti-renal cell carcinoma immune response. This mini review addresses the impacts of the remnant-form LTR retrotransposons on human pathogenesis.
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

Directory of Open Access Journals (Sweden)

Glass John I

2010-07-01

Full Text Available Abstract Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT. Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the
3′ terminal diversity of MRP RNA and other human noncoding RNAs revealed by deep sequencing

Science.gov (United States)

2013-01-01

Background Post-transcriptional 3′ end processing is a key component of RNA regulation. The abundant and essential RNA subunit of RNase MRP has been proposed to function in three distinct cellular compartments and therefore may utilize this mode of regulation. Here we employ 3′ RACE coupled with high-throughput sequencing to characterize the 3′ terminal sequences of human MRP RNA and other noncoding RNAs that form RNP complexes. Results The 3′ terminal sequence of MRP RNA from HEK293T cells has a distinctive distribution of genomically encoded termini (including an assortment of U residues) with a portion of these selectively tagged by oligo(A) tails. This profile contrasts with the relatively homogenous 3′ terminus of an in vitro transcribed MRP RNA control and the differing 3′ terminal profiles of U3 snoRNA, RNase P RNA, and telomerase RNA (hTR). Conclusions 3′ RACE coupled with deep sequencing provides a valuable framework for the functional characterization of 3′ terminal sequences of noncoding RNAs. PMID:24053768
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.

2010-07-12

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
Read length and repeat resolution: exploring prokaryote genomes using next-generation sequencing technologies.

Directory of Open Access Journals (Sweden)

Matt J Cahill

Full Text Available BACKGROUND: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. METHODOLOGY/PRINCIPAL FINDINGS: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. CONCLUSIONS: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length.
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.; Kö ser, Claudio U.; Ross, Nicholas E.; Archer, John A.C.

2010-01-01

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
Tandemly repeated sequence in 5'end of mtDNA control region of ...

African Journals Online (AJOL)

Extensive length variability was observed in 5' end sequence of the mitochondrial DNA control region of the Japanese Spanish mackerel (Scomberomorus niphonius). This length variability was due to the presence of varying numbers of a 56-bp tandemly repeated sequence and a 46-bp insertion/deletion (indel).
Repeat Sequence Proteins as Matrices for Nanocomposites

Energy Technology Data Exchange (ETDEWEB)

Drummy, L.; Koerner, H; Phillips, D; McAuliffe, J; Kumar, M; Farmer, B; Vaia, R; Naik, R

2009-01-01

Recombinant protein-inorganic nanocomposites comprised of exfoliated Na+ montmorillonite (MMT) in a recombinant protein matrix based on silk-like and elastin-like amino acid motifs (silk elastin-like protein (SELP)) were formed via a solution blending process. Charged residues along the protein backbone are shown to dominate long-range interactions, whereas the SELP repeat sequence leads to local protein/MMT compatibility. Up to a 50% increase in room temperature modulus and a comparable decrease in high temperature coefficient of thermal expansion occur for cast films containing 2-10 wt.% MMT.
Ten tandem repeats of β-hCG 109-118 enhance immunogenicity and anti-tumor effects of β-hCG C-terminal peptide carried by mycobacterial heat-shock protein HSP65

International Nuclear Information System (INIS)

Zhang Yankai; Yan Rong; He Yi; Liu Wentao; Cao Rongyue; Yan Ming; Li Taiming; Liu Jingjing; Wu Jie

2006-01-01

The β-subunit of human chorionic gonadotropin (β-hCG) is secreted by many kinds of tumors and it has been used as an ideal target antigen to develop vaccines against tumors. In view of the low immunogenicity of this self-peptide,we designed a method based on isocaudamer technique to repeat tandemly the 10-residue sequence X of β-hCG (109-118), then 10 tandemly repeated copies of the 10-residue sequence combined with β-hCG C-terminal 37 peptides were fused to mycobacterial heat-shock protein 65 to construct a fusion protein HSP65-X10-βhCGCTP37 as an immunogen. In this study, we examined the effect of the tandem repeats of this 10-residue sequence in eliciting an immune by comparing the immunogenicity and anti-tumor effects of the two immunogens, HSP65-X10-βhCGCTP37 and HSP65-βhCGCTP37 (without the 10 tandem repeats). Immunization of mice with the fusion protein HSP65-X10-βhCGCTP37 elicited much higher levels of specific anti-β-hCG antibodies and more effectively inhibited the growth of Lewis lung carcinoma (LLC) in vivo than with HSP65-βhCGCTP37, which should suggest that HSP65-X10-βhCGCTP37 may be an effective protein vaccine for the treatment of β-hCG-dependent tumors and multiple tandem repeats of a certain epitope are an efficient method to overcome the low immunogenicity of self-peptide antigens
The history and advances of reversible terminators used in new generations of sequencing technology.

Science.gov (United States)

Chen, Fei; Dong, Mengxing; Ge, Meng; Zhu, Lingxiang; Ren, Lufeng; Liu, Guocheng; Mu, Rong

2013-02-01

DNA sequencing using reversible terminators, as one sequencing by synthesis strategy, has garnered a great deal of interest due to its popular application in the second-generation high-throughput DNA sequencing technology. In this review, we provided its history of development, classification, and working mechanism of this technology. We also outlined the screening strategies for DNA polymerases to accommodate the reversible terminators as substrates during polymerization; particularly, we introduced the "REAP" method developed by us. At the end of this review, we discussed current limitations of this approach and provided potential solutions to extend its application. Copyright © 2013. Production and hosting by Elsevier Ltd.
Roles of genes and Alu repeats in nonlinear correlations of HUMHBB DNA sequence

International Nuclear Information System (INIS)

Xiao Yi; Huang Yanzhao

2004-01-01

DNA sequences of different species and different portion of the DNA of the same species may have completely different correlation properties, but the origin of these correlations is still not very clear and is currently being investigated, especially in different particular cases. We report here a study of the DNA sequence of human beta globin region (HUMHBB) which has strong linear and nonlinear correlations. We studied the roles of two of the typical elements of DNA sequence, genes and Alu repeats, in the nonlinear correlations of HUMHBB. We find that there exist strong nonlinear correlations between the exons or introns in different genes and between the Alu repeats. They may be one of the major sources of the nonlinear correlations in HUMBHB
Structure and possible function of a G-quadruplex in the long terminal repeat of the proviral HIV-1 genome.

Science.gov (United States)

De Nicola, Beatrice; Lech, Christopher J; Heddi, Brahim; Regmi, Sagar; Frasson, Ilaria; Perrone, Rosalba; Richter, Sara N; Phan, Anh Tuân

2016-07-27

The long terminal repeat (LTR) of the proviral human immunodeficiency virus (HIV)-1 genome is integral to virus transcription and host cell infection. The guanine-rich U3 region within the LTR promoter, previously shown to form G-quadruplex structures, represents an attractive target to inhibit HIV transcription and replication. In this work, we report the structure of a biologically relevant G-quadruplex within the LTR promoter region of HIV-1. The guanine-rich sequence designated LTR-IV forms a well-defined structure in physiological cationic solution. The nuclear magnetic resonance (NMR) structure of this sequence reveals a parallel-stranded G-quadruplex containing a single-nucleotide thymine bulge, which participates in a conserved stacking interaction with a neighboring single-nucleotide adenine loop. Transcription analysis in a HIV-1 replication competent cell indicates that the LTR-IV region may act as a modulator of G-quadruplex formation in the LTR promoter. Consequently, the LTR-IV G-quadruplex structure presented within this work could represent a valuable target for the design of HIV therapeutics. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Myelodysplastic syndromes and acute myeloid leukemia in cats infected with feline leukemia virus clone33 containing a unique long terminal repeat.

Science.gov (United States)

Hisasue, Masaharu; Nagashima, Naho; Nishigaki, Kazuo; Fukuzawa, Isao; Ura, Shigeyoshi; Katae, Hiromi; Tsuchiya, Ryo; Yamada, Takatsugu; Hasegawa, Atsuhiko; Tsujimoto, Hajime

2009-03-01

Feline leukemia virus (FeLV) clone33 was obtained from a domestic cat with acute myeloid leukemia (AML). The long terminal repeat (LTR) of this virus, like the LTRs present in FeLV from other cats with AML, differs from the LTRs of other known FeLV in that it has 3 tandem direct 47-bp repeats in the upstream region of the enhancer (URE). Here, we injected cats with FeLV clone33 and found 41% developed myelodysplastic syndromes (MDS) characterized by peripheral blood cytopenias and dysplastic changes in the bone marrow. Some of the cats with MDS eventually developed AML. The bone marrow of the majority of cats with FeLV clone33 induced MDS produced fewer erythroid and myeloid colonies upon being cultured with erythropoietin or granulocyte-macrophage colony-stimulating factor (GM-SCF) than bone marrow from normal control cats. Furthermore, the bone marrow of some of the cats expressed high-levels of the apoptosis-related genes TNF-alpha and survivin. Analysis of the proviral sequences obtained from 13 cats with naturally occurring MDS reveal they also bear the characteristic URE repeats seen in the LTR of FeLV clone33 and other proviruses from cats with AML. Deletions and mutations within the enhancer elements are frequently observed in naturally occurring MDS as well as AML. These results suggest that FeLV variants that bear URE repeats in their LTR strongly associate with the induction of both MDS and AML in cats.
The effects of multiple UV exposures on HIV-LTR (long terminal repeat) expression

International Nuclear Information System (INIS)

Schreck, S.; Milton, J.; Panozzo, J.; Libertin, C.R.; Woloschak, G.E.; Loyola Univ., Maywood, IL

1995-01-01

Previous studies have shown that cellular stress agents such as UV radiation induce transcription from the long terminal repeat (LTR) of the human immunodeficiency virus (HIV). Using HeLa cells stably transfected with the HIV-LTR sequence, which transcriptionally drives the chloramphenicol acetyl transferase (CAT) reporter gene, we examined the effects of multiple exposures to UVC (254 nm) on HIV-LTR-CAT expression. Low doses (≤ 5 J m -2 ) had no effect on CAT expression, but up to 29-fold induction was observed with 10 J m -2 when cells were harvested 48 h after completion of the exposure. Little difference was noted in induction levels when cells were exposed to one 25 J m -2 dose, viable cells were harvested at 24 h, 48 h or 72 h, and cell lysates were assayed for CAT expression. Two sequential 12.5 J m -2 exposures, given 24 h apart, resulted in an additive effect on CAT expression; these two exposures produced CAT activity equivalent to that induced following a single 25 J m -2 dose. Our data suggest that HIV-LTR requires a specific threshold UV dose in order to elicit induction; a maximal induction dose is also evident; exposures higher than this maximal dose contribute no more to HIV-LTR induction in viable cells. (author)

Dynamic probability of reinforcement for cooperation: Random game termination in the centipede game.

Science.gov (United States)

Krockow, Eva M; Colman, Andrew M; Pulford, Briony D

2018-03-01

Experimental games have previously been used to study principles of human interaction. Many such games are characterized by iterated or repeated designs that model dynamic relationships, including reciprocal cooperation. To enable the study of infinite game repetitions and to avoid endgame effects of lower cooperation toward the final game round, investigators have introduced random termination rules. This study extends previous research that has focused narrowly on repeated Prisoner's Dilemma games by conducting a controlled experiment of two-player, random termination Centipede games involving probabilistic reinforcement and characterized by the longest decision sequences reported in the empirical literature to date (24 decision nodes). Specifically, we assessed mean exit points and cooperation rates, and compared the effects of four different termination rules: no random game termination, random game termination with constant termination probability, random game termination with increasing termination probability, and random game termination with decreasing termination probability. We found that although mean exit points were lower for games with shorter expected game lengths, the subjects' cooperativeness was significantly reduced only in the most extreme condition with decreasing computer termination probability and an expected game length of two decision nodes. © 2018 Society for the Experimental Analysis of Behavior.
Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

Science.gov (United States)

Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

2016-01-01

A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis.
Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

Science.gov (United States)

Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

2014-01-01

Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.
Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

Science.gov (United States)

Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

2016-05-23

Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.
Adenovirus sequences required for replication in vivo.

OpenAIRE

Wang, K; Pearson, G D

1985-01-01

We have studied the in vivo replication properties of plasmids carrying deletion mutations within cloned adenovirus terminal sequences. Deletion mapping located the adenovirus DNA replication origin entirely within the first 67 bp of the adenovirus inverted terminal repeat. This region could be further subdivided into two functional domains: a minimal replication origin and an adjacent auxillary region which boosted the efficiency of replication by more than 100-fold. The minimal origin occup...
Tandemly repeated sequence in 5'end of mtDNA control region of ...

African Journals Online (AJOL)

STORAGESEVER

2008-12-17

Dec 17, 2008 ... chain reaction (PCR). Japanese Spanish ... mainly covered general ecology and fishery biology. No study concerning the ... Conserved sequence blocks and the repeat units are indicated by boxes. performed using the exact ...
Inverted repeats in the promoter as an autoregulatory sequence for TcrX in Mycobacterium tuberculosis

International Nuclear Information System (INIS)

Bhattacharya, Monolekha; Das, Amit Kumar

2011-01-01

Highlights: ► The regulatory sequences recognized by TcrX have been identified. ► The regulatory region comprises of inverted repeats segregated by 30 bp region. ► The mode of binding of TcrX with regulatory sequence is unique. ► In silico TcrX–DNA docked model binds one of the inverted repeats. ► Both phosphorylated and unphosphorylated TcrX binds regulatory sequence in vitro. -- Abstract: TcrY, a histidine kinase, and TcrX, a response regulator, constitute a two-component system in Mycobacterium tuberculosis. tcrX, which is expressed during iron scarcity, is instrumental in the survival of iron-dependent M. tuberculosis. However, the regulator of tcrX/Y has not been fully characterized. Crosslinking studies of TcrX reveal that it can form oligomers in vitro. Electrophoretic mobility shift assays (EMSAs) show that TcrX recognizes two regions in the promoter that are comprised of inverted repeats separated by ∼30 bp. The dimeric in silico model of TcrX predicts binding to one of these inverted repeat regions. Site-directed mutagenesis and radioactive phosphorylation indicate that D54 of TcrX is phosphorylated by H256 of TcrY. However, phosphorylated and unphosphorylated TcrX bind the regulatory sequence with equal efficiency, which was shown with an EMSA using the D54A TcrX mutant.
Automation of C-terminal sequence analysis of 2D-PAGE separated proteins

Directory of Open Access Journals (Sweden)

P.P. Moerman

2014-06-01

Full Text Available Experimental assignment of the protein termini remains essential to define the functional protein structure. Here, we report on the improvement of a proteomic C-terminal sequence analysis method. The approach aims to discriminate the C-terminal peptide in a CNBr-digest where Met-Xxx peptide bonds are cleaved in internal peptides ending at a homoserine lactone (hsl-derivative. pH-dependent partial opening of the lactone ring results in the formation of doublets for all internal peptides. C-terminal peptides are distinguished as singlet peaks by MALDI-TOF MS and MS/MS is then used for their identification. We present a fully automated protocol established on a robotic liquid-handling station.
Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis

DEFF Research Database (Denmark)

Nosek, J.; Novotna, M.; Hlavatovicova, Z.

2004-01-01

The complete sequence of the mitochondrial DNA of the opportunistic yeast pathogen Candida parapsilosis was determined. The mitochondrial genome is represented by linear DNA molecules terminating with tandem repeats of a 738-bp unit. The number of repeats varies, thus generating a population...
TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

Science.gov (United States)

Richard, François D; Kajava, Andrey V

2014-06-01

The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.
Scrutinizing virus genome termini by high-throughput sequencing.

Directory of Open Access Journals (Sweden)

Shasha Li

Full Text Available Analysis of genomic terminal sequences has been a major step in studies on viral DNA replication and packaging mechanisms. However, traditional methods to study genome termini are challenging due to the time-consuming protocols and their inefficiency where critical details are lost easily. Recent advances in next generation sequencing (NGS have enabled it to be a powerful tool to study genome termini. In this study, using NGS we sequenced one iridovirus genome and twenty phage genomes and confirmed for the first time that the high frequency sequences (HFSs found in the NGS reads are indeed the terminal sequences of viral genomes. Further, we established a criterion to distinguish the type of termini and the viral packaging mode. We also obtained additional terminal details such as terminal repeats, multi-termini, asymmetric termini. With this approach, we were able to simultaneously detect details of the genome termini as well as obtain the complete sequence of bacteriophage genomes. Theoretically, this application can be further extended to analyze larger and more complicated genomes of plant and animal viruses. This study proposed a novel and efficient method for research on viral replication, packaging, terminase activity, transcription regulation, and metabolism of the host cell.
Segment-specific terminal sequences of Bunyamwera bunyavirus regulate genome replication

International Nuclear Information System (INIS)

Barr, John N.; Elliott, Richard M.; Dunn, Ewan F.; Wertz, Gail W.

2003-01-01

Bunyamwera virus (BUNV) is the prototype of both the Orthobunyavirus genus and the Bunyaviridae family of segmented negative sense RNA viruses. The tripartite BUNV genome consists of small (S), medium (M), and large (L) segments that are transcribed to give a single mRNA and replicated to generate an antigenome that is the template for synthesis of further genomic RNA strands. We modified an existing cDNA-derived RNA synthesis system to allow identification of BUNV RNA replication and transcription products by direct metabolic labeling. Direct RNA analysis allowed us to distinguish between template activities that affected either RNA replication or mRNA transcription, an ability that was not possible using previous reporter gene expression assays. We generated genome analogs containing the entire nontranslated terminal sequences of the S, M, and L BUNV segments surrounding a common sequence. Analysis of RNAs synthesized from these templates revealed that the relative abilities of BUNV segments to perform RNA replication was M > L > S. Exchange of segment-specific terminal nucleotides identified a 12-nt region located within both the 3' and 5' termini of the M segment that correlated with its high replication ability
Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence

NARCIS (Netherlands)

Semenova, E.V.; Jore, M.M.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

2011-01-01

Prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR)/Cas (CRISPR-associated sequences) systems provide adaptive immunity against viruses when a spacer sequence of small CRISPR RNA (crRNA) matches a protospacer sequence in the viral genome. Viruses that escape CRISPR/Cas
MSDB: A Comprehensive Database of Simple Sequence Repeats.

Science.gov (United States)

Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar

2017-06-01

Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Simple sequence repeat (SSR) markers are effective for identifying ...

African Journals Online (AJOL)

DNA was extracted from newly formed leaves and amplified using 21 simple sequence repeat (SSR) markers (NH001c, NH002b, NH005b, NH007b, NH008b, NH009b, NH011b, NH013b, NH012a, NH014a, NH015a, NH017a, KA4b, KA5, KA14, KA16, KB16, KU10, BGA35, BGT23b and HGA8b). The data was analyzed by ...
Repeated-Sprint Sequences During Female Soccer Matches Using Fixed and Individual Speed Thresholds.

Science.gov (United States)

Nakamura, Fábio Y; Pereira, Lucas A; Loturco, Irineu; Rosseti, Marcelo; Moura, Felipe A; Bradley, Paul S

2017-07-01

Nakamura, FY, Pereira, LA, Loturco, I, Rosseti, M, Moura, FA, and Bradley, PS. Repeated-sprint sequences during female soccer matches using fixed and individual speed thresholds. J Strength Cond Res 31(7): 1802-1810, 2017-The main objective of this study was to characterize the occurrence of single sprint and repeated-sprint sequences (RSS) during elite female soccer matches, using fixed (20 km·h) and individually based speed thresholds (>90% of the mean speed from a 20-m sprint test). Eleven elite female soccer players from the same team participated in the study. All players performed a 20-m linear sprint test, and were assessed in up to 10 official matches using Global Positioning System technology. Magnitude-based inferences were used to test for meaningful differences. Results revealed that irrespective of adopting fixed or individual speed thresholds, female players produced only a few RSS during matches (2.3 ± 2.4 sequences using the fixed threshold and 3.3 ± 3.0 sequences using the individually based threshold), with most sequences composing of just 2 sprints. Additionally, central defenders performed fewer sprints (10.2 ± 4.1) than other positions (fullbacks: 28.1 ± 5.5; midfielders: 21.9 ± 10.5; forwards: 31.9 ± 11.1; with the differences being likely to almost certainly associated with effect sizes ranging from 1.65 to 2.72), and sprinting ability declined in the second half. The data do not support the notion that RSS occurs frequently during soccer matches in female players, irrespective of using fixed or individual speed thresholds to define sprint occurrence. However, repeated-sprint ability development cannot be ruled out from soccer training programs because of its association with match-related performance.
Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

Directory of Open Access Journals (Sweden)

Natalie L. Dillon

2014-01-01

Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.
Genomic organization and developmental fate of adjacent repeated sequences in a foldback DNA clone of Tetrahymena thermophila

International Nuclear Information System (INIS)

Tschunko, A.H.; Loechel, R.H.; McLaren, N.C.; Allen, S.L.

1987-01-01

DNA sequence elimination and rearrangement occurs during the development of somatic cell lineages of eukaryotes and was first discovered over a century ago. However, the significance and mechanism of chromatin elimination are not understood. DNA elimination also occurs during the development of the somatic macronucleus from the germinal micronucleus in unicellular ciliated protozoa such as Tetrahymena thermophila. In this study foldback DNA from the micronucleus was used as a probe to isolate ten clones. All of those tested (4/4) contained sequences that were repetitive in the micronucleus and rearranged in the macronucleus. Inverted repeated sequences were present in one clone. This clone, pTtFBl, was subjected to a detailed analysis of its developmental fate. Subregions were subcloned and used as probes against Southern blots of micronuclear and macronuclear DNA. DNA was labeled with [ 33 P]-labeled dATP. The authors found that all subregions defined repeated sequence families in the micronuclear genome. A minimum of four different families was defined, two of which are retained in the macronucleus and two of which are completely eliminated. The inverted repeat family is retained with little rearrangement. Two of the families, defined by subregions that do not contain parts of the inverted repeat are totally eliminated during macronuclear development-and contain open reading frames. The significance of retained inverted repeats to the process of elimination is discussed
Sequence variability is correlated with weak immunogenicity in Streptococcus pyogenes M protein

DEFF Research Database (Denmark)

Lannergård, Jonas; Kristensen, Bodil M.; Gustafsson, Mattias C. U.

2015-01-01

The M protein of Streptococcus pyogenes, a major bacterial virulence factor, has an amino-terminal hypervariable region (HVR) that is a target for type-specific protective antibodies. Intriguingly, the HVR elicits a weak antibody response, indicating that it escapes host immunity by two mechanisms...... fibrinogen-binding B repeat region exhibits extensive sequence divergence. Analysis of antisera from S. pyogenes-infected patients, infected mice, and immunized mice showed that both the HVR and the B repeat region elicited weak antibody responses, while the conserved carboxy-terminal part was immunodominant...
Capillary electrophoresis of Big-Dye terminator sequencing reactions for human mtDNA Control Region haplotyping in the identification of human remains.

Science.gov (United States)

Montesino, Marta; Prieto, Lourdes

2012-01-01

Cycle sequencing reaction with Big-Dye terminators provides the methodology to analyze mtDNA Control Region amplicons by means of capillary electrophoresis. DNA sequencing with ddNTPs or terminators was developed by (1). The progressive automation of the method by combining the use of fluorescent-dye terminators with cycle sequencing has made it possible to increase the sensibility and efficiency of the method and hence has allowed its introduction into the forensic field. PCR-generated mitochondrial DNA products are the templates for sequencing reactions. Different set of primers can be used to generate amplicons with different sizes according to the quality and quantity of the DNA extract providing sequence data for different ranges inside the Control Region.

RePS: a sequence assembler that masks exact repeats identified from the shotgun data

DEFF Research Database (Denmark)

Wang, Jun; Wong, Gane Ka-Shu; Ni, Peixiang

2002-01-01

We describe a sequence assembler, RePS (repeat-masked Phrap with scaffolding), that explicitly identifies exact 20mer repeats from the shotgun data and removes them prior to the assembly. The established software is used to compute meaningful error probabilities for each base. Clone......-end-pairing information is used to construct scaffolds that order and orient the contigs. We show with real data for human and rice that reasonable assemblies are possible even at coverages of only 4x to 6x, despite having up to 42.2% in exact repeats. Udgivelsesdato: 2002-May...
Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats

OpenAIRE

Gymrek, Melissa

2016-01-01

This was presented as a BitesizeBio Webinar entitled "Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats"Accompanying scripts can be accessed on github:https://github.com/mgymrek/mgymrek-bitesizebio-webinar
In silico analysis of Simple Sequence Repeats from chloroplast genomes of Solanaceae species

Directory of Open Access Journals (Sweden)

Evandro Vagner Tambarussi

2009-01-01

Full Text Available The availability of chloroplast genome (cpDNA sequences of Atropa belladonna, Nicotiana sylvestris, N.tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species,allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs incpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than geniccpSSRs. The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, pentaandhexanucleotide repeats. Multiple alignments of all cpSSRs sequences from Solanaceae species made the identification ofnucleotide variability possible and the phylogeny was estimated by maximum parsimony. Our study showed that the plastomedatabase can be exploited for phylogenetic analysis and biotechnological approaches.
Comparative genomic analysis reveals multiple long terminal repeats, lineage-specific amplification, and frequent interelement recombination for Cassandra retrotransposon in pear (Pyrus bretschneideri Rehd.).

Science.gov (United States)

Yin, Hao; Du, Jianchang; Li, Leiting; Jin, Cong; Fan, Lian; Li, Meng; Wu, Jun; Zhang, Shaoling

2014-06-04

Cassandra transposable elements belong to a specific group of terminal-repeat retrotransposons in miniature (TRIM). Although Cassandra TRIM elements have been found in almost all vascular plants, detailed investigations on the nature, abundance, amplification timeframe, and evolution have not been performed in an individual genome. We therefore conducted a comprehensive analysis of Cassandra retrotransposons using the newly sequenced pear genome along with four other Rosaceae species, including apple, peach, mei, and woodland strawberry. Our data reveal several interesting findings for this particular retrotransposon family: 1) A large number of the intact copies contain three, four, or five long terminal repeats (LTRs) (∼20% in pear); 2) intact copies and solo LTRs with or without target site duplications are both common (∼80% vs. 20%) in each genome; 3) the elements exhibit an overall unbiased distribution among the chromosomes; 4) the elements are most successfully amplified in pear (5,032 copies); and 5) the evolutionary relationships of these elements vary among different lineages, species, and evolutionary time. These results indicate that Cassandra retrotransposons contain more complex structures (elements with multiple LTRs) than what we have known previously, and that frequent interelement unequal recombination followed by transposition may play a critical role in shaping and reshaping host genomes. Thus this study provides insights into the property, propensity, and molecular mechanisms governing the formation and amplification of Cassandra retrotransposons, and enhances our understanding of the structural variation, evolutionary history, and transposition process of LTR retrotransposons in plants. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Identification of succinimide sites in proteins by N-terminal sequence analysis after alkaline hydroxylamine cleavage.

Science.gov (United States)

Kwong, M. Y.; Harris, R. J.

1994-01-01

Under favorable conditions, Asp or Asn residues can undergo rearrangement to a succinimide (cyclic imide), which may also serve as an intermediate for deamidation and/or isoaspartate formation. Direct identification of such succinimides by peptide mapping is hampered by their lability at neutral and alkaline pH. We determined that incubation in 2 M hydroxylamine, 0.2 M Tris buffer, pH 9, for 2 h at 45 degrees C will specifically cleave on the C-terminal side of succinimides without cleavage at Asn-Gly bonds; yields are typically approximately 50%. N-terminal sequence analysis can then be used to identify an internal sequence generated by cleavage of the succinimide, hence identifying the succinimide site. PMID:8142891
Inability of Kaplan radiation leukemia virus to replicate on mouse fibroblasts is conferred by its long terminal repeat

International Nuclear Information System (INIS)

Rassart, E.; Paquette, Y.; Jolicoeur, P.

1988-01-01

The molecularly cloned infectious Kaplan radiation leukemia virus has previously been shown to be unable to replicate on mouse fibroblasts. To map the viral sequences responsible for this, we constructed chimeric viral DNA genomes in vitro with parental cloned infectious viral DNAs from the nonfibrotropic (F-) BL/VL3 V-13 radiation leukemia virus and the fibrotropic (F+) endogenous BALB/c or Moloney murine leukemia viruses (MuLV). Infectious chimeric MuLVs, recovered after transfection of Ti-6 lymphocytes with these recombinant DNAs, were tested for capacity to replicate on mouse fibroblasts in vitro. We found that chimeric MuLVs harboring the long terminal repeat (LTR) of a fibrotropic MuLV replicated well on mouse fibroblasts. Conversely, chimeric MuLVs harboring the LTR of a nonfibrotropic MuLV were restricted on mouse fibroblasts. These results indicate that the LTR of BL/VL3 radiation leukemia virus harbors the primary determinant responsible for its inability to replicate on mouse fibroblasts in vitro. Our results also show that the primary determinant allowing F+ MuLVs (endogenous BALB/c and Moloney MuLVs) to replicate on mouse fibroblasts in vitro resides within the LTR
simple sequence repeats (EST-SSR)

African Journals Online (AJOL)

Yomi

2012-01-19

Jan 19, 2012 ... 212 primer pairs selected, based on repeat patterns of n≥8 for di-, tri-, tetra- and penta-nucleotide repeat ... Cluster analysis revealed a high genetic similarity among the sugarcane (Saccharum spp.) breeding lines which could reduce the genetic gain in ..... The multiple allele characteristic of SSR com-.
Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

Science.gov (United States)

Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...
Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus.

Science.gov (United States)

Biswas, Manosh Kumar; Chai, Lijun; Mayer, Christoph; Xu, Qiang; Guo, Wenwu; Deng, Xiuxin

2012-05-01

The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, discriminating capacity of genotypes and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers were developed from the 46,339 Citrus clementina BAC-end sequences (BES), of them 20.67% contained SSR longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSR are located in the non-coding region, but 1.3% of BES-SSRs were found to be associated with transposable element (TE). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus relative's species. Among these 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification, for genomic study of Citrus, Poncirus and Fortunella sp. Utility of the developed SSR marker was demonstrated by identifying a set of 118 markers each for construction of linkage map of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationship among 40 Citrus and its related species were conducted with the aid of 25 randomly selected SSR primer pairs and results revealed that citrus genomic SSRs are superior to genic SSR for genetic diversity and germplasm characterization of Citrus spp.
ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

Science.gov (United States)

Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

2006-06-16

The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.
ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants.

Science.gov (United States)

Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

2014-01-01

Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of genome. A total of 534 complete chloroplast genome sequences (as on 18 September 2014) of Viridiplantae are available at NCBI organelle genome resource. It provides opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information of all the three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with majority lying in non-coding region. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) being the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
Human TRPA1 is intrinsically cold- and chemosensitive with and without its N-terminal ankyrin repeat domain.

Science.gov (United States)

Moparthi, Lavanya; Survery, Sabeen; Kreir, Mohamed; Simonsen, Charlotte; Kjellbom, Per; Högestätt, Edward D; Johanson, Urban; Zygmunt, Peter M

2014-11-25

We have purified and reconstituted human transient receptor potential (TRP) subtype A1 (hTRPA1) into lipid bilayers and recorded single-channel currents to understand its inherent thermo- and chemosensory properties as well as the role of the ankyrin repeat domain (ARD) of the N terminus in channel behavior. We report that hTRPA1 with and without its N-terminal ARD (Δ1-688 hTRPA1) is intrinsically cold-sensitive, and thus, cold-sensing properties of hTRPA1 reside outside the N-terminal ARD. We show activation of hTRPA1 by the thiol oxidant 2-((biotinoyl)amino)ethyl methanethiosulfonate (MTSEA-biotin) and that electrophilic compounds activate hTRPA1 in the presence and absence of the N-terminal ARD. The nonelectrophilic compounds menthol and the cannabinoid Δ(9)-tetrahydrocannabiorcol (C16) directly activate hTRPA1 at different sites independent of the N-terminal ARD. The TRPA1 antagonist HC030031 inhibited cold and chemical activation of hTRPA1 and Δ1-688 hTRPA1, supporting a direct interaction with hTRPA1 outside the N-terminal ARD. These findings show that hTRPA1 is an intrinsically cold- and chemosensitive ion channel. Thus, second messengers, including Ca(2+), or accessory proteins are not needed for hTRPA1 responses to cold or chemical activators. We suggest that conformational changes outside the N-terminal ARD by cold, electrophiles, and nonelectrophiles are important in hTRPA1 channel gating and that targeting chemical interaction sites outside the N-terminal ARD provides possibilities to fine tune TRPA1-based drug therapies (e.g., for treatment of pain associated with cold hypersensitivity and cardiovascular disease).
A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats.

Science.gov (United States)

Beloglazova, Natalia; Brown, Greg; Zimmerman, Matthew D; Proudfoot, Michael; Makarova, Kira S; Kudritska, Marina; Kochinyan, Samvel; Wang, Shuren; Chruszcz, Maksymilian; Minor, Wladek; Koonin, Eugene V; Edwards, Aled M; Savchenko, Alexei; Yakunin, Alexander F

2008-07-18

Clustered regularly interspaced short palindromic repeats (CRISPRs) together with the associated CAS proteins protect microbial cells from invasion by foreign genetic elements using presently unknown molecular mechanisms. All CRISPR systems contain proteins of the CAS2 family, suggesting that these uncharacterized proteins play a central role in this process. Here we show that the CAS2 proteins represent a novel family of endoribonucleases. Six purified CAS2 proteins from diverse organisms cleaved single-stranded RNAs preferentially within U-rich regions. A representative CAS2 enzyme, SSO1404 from Sulfolobus solfataricus, cleaved the phosphodiester linkage on the 3'-side and generated 5'-phosphate- and 3'-hydroxyl-terminated oligonucleotides. The crystal structure of SSO1404 was solved at 1.6A resolution revealing the first ribonuclease with a ferredoxin-like fold. Mutagenesis of SSO1404 identified six residues (Tyr-9, Asp-10, Arg-17, Arg-19, Arg-31, and Phe-37) that are important for enzymatic activity and suggested that Asp-10 might be the principal catalytic residue. Thus, CAS2 proteins are sequence-specific endoribonucleases, and we propose that their role in the CRISPR-mediated anti-phage defense might involve degradation of phage or cellular mRNAs.
Detection of reverse transcriptase termination sites using cDNA ligation and massive parallel sequencing

DEFF Research Database (Denmark)

Kielpinski, Lukasz J; Boyd, Mette; Sandelin, Albin

2013-01-01

Detection of reverse transcriptase termination sites is important in many different applications, such as structural probing of RNAs, rapid amplification of cDNA 5' ends (5' RACE), cap analysis of gene expression, and detection of RNA modifications and protein-RNA cross-links. The throughput...... of these methods can be increased by applying massive parallel sequencing technologies.Here, we describe a versatile method for detection of reverse transcriptase termination sites based on ligation of an adapter to the 3' end of cDNA with bacteriophage TS2126 RNA ligase (CircLigase™). In the following PCR...
Characterization of sequence diversity in Plasmodium falciparum SERA5 from Indian isolates

Directory of Open Access Journals (Sweden)

Rahul C.N

2015-06-01

Full Text Available Objective: To characterize the sequence diversity of blood-stage Plasmodium falciparum serine repeat antigen-5 (PfSERA5 which is lacking in a malaria-endemic country like India. Methods: In this study, parasitic DNA was obtained from field isolates collected from various geographic regions. Subsequently, PfSERA5 gene sequence was PCR amplified and DNA sequenced. Results: We reported the existence of unique repeat polymorphisms and novel haplotypes for both the octamer repeat (OR and serine repeat (SR regions of the N-terminal fragment of PfSERA5 from Indian isolates. Several isolates from India were identical to low-frequency African haplotypes. Unique finding of our study was an Indian isolate showing deletion in a perfectly conserved 14 mer sequence within octamer repeat. Indian haplotypes reported in this study were found to be distributed into the three earlier classified allelic clusters of FCR3, K1 and Honduras showcasing broad diversity as compared to worldwide haplotypes. Conclusions: This study is the first report on genetic diversity of PfSERA5 antigen from India. Further evaluation of these haplotypes by serotyping would provide useful information for investigating variant-specific immunity and aid in malaria vaccine research.
Long Terminal Repeat CRISPR-CAR-Coupled "Universal" T Cells Mediate Potent Anti-leukemic Effects.

Science.gov (United States)

Georgiadis, Christos; Preece, Roland; Nickolay, Lauren; Etuk, Aniekan; Petrova, Anastasia; Ladon, Dariusz; Danyi, Alexandra; Humphryes-Kirilov, Neil; Ajetunmobi, Ayokunmi; Kim, Daesik; Kim, Jin-Soo; Qasim, Waseem

2018-03-06

Gene editing can be used to overcome allo-recognition, which otherwise limits allogeneic T cell therapies. Initial proof-of-concept applications have included generation of such "universal" T cells expressing chimeric antigen receptors (CARs) against CD19 target antigens combined with transient expression of DNA-targeting nucleases to disrupt the T cell receptor alpha constant chain (TRAC). Although relatively efficient, transgene expression and editing effects were unlinked, yields variable, and resulting T cell populations heterogeneous, complicating dosing strategies. We describe a self-inactivating lentiviral "terminal" vector platform coupling CAR expression with CRISPR/Cas9 effects through incorporation of an sgRNA element into the ΔU3 3' long terminal repeat (LTR). Following reverse transcription and duplication of the hybrid ΔU3-sgRNA, delivery of Cas9 mRNA resulted in targeted TRAC locus cleavage and allowed the enrichment of highly homogeneous (>96%) CAR + (>99%) TCR - populations by automated magnetic separation. Molecular analyses, including NGS, WGS, and Digenome-seq, verified on-target specificity with no evidence of predicted off-target events. Robust anti-leukemic effects were demonstrated in humanized immunodeficient mice and were sustained longer than by conventional CAR + TCR + T cells. Terminal-TRAC (TT) CAR T cells offer the possibility of a pre-manufactured, non-HLA-matched CAR cell therapy and will be evaluated in phase 1 trials against B cell malignancies shortly. Copyright © 2018 The American Society of Gene and Cell Therapy. Published by Elsevier Inc. All rights reserved.
Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

Science.gov (United States)

Raynard, Steven J; Baker, Mark D

2004-01-01

In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARS enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.
Murine mammary tumor virus pol-related sequences in human DNA: characterization and sequence comparison with the complete murine mammary tumor virus pol gene

International Nuclear Information System (INIS)

Deen, K.C.; Sweet, R.W.

1986-01-01

Sequences in the human genome with homology to the murine mammary tumor virus (MMTV) pol gene were isolated from a human phage library. Ten clones with extensive pol homology were shown to define five separate loci. These loci share common sequences immediately adjacent to the pol-like segments and, in addition, contain a related repeat element which bounds this region. This organization is suggestive of a proviral structure. The authors estimate that the human genome contains 30 to 40 copies of these pol-related sequences. The pol region of one of the cloned segments (HM16) and the complete MMTV pol gene were sequenced and compared. The nucleotide homology between these pol sequences is 52% and is concentrated in the terminal regions. The MMTV pol gene contains a single long open reading frame encoding 899 amino acids and is demarcated from the partially overlapping putative gag gene by termination codons and a shift in translational reading frame. The pol sequence of HM16 is multiply terminated but does contain open reading frames which encode 370, 105, and 112 amino acids residues in separate reading frames. The authors deduced a composite pol protein sequence for HM16 by aligning it to the MMTV pol gene and then compared these sequences with other retroviral pol protein sequences. Conserved sequences occur in both the amino and carboxyl regions which lie within the polymerase and endonuclease domains of pol, respectively
Stress-induced rearrangement of Fusarium retrotransposon sequences.

Science.gov (United States)

Anaya, N; Roncero, M I

1996-11-27

Rearrangement of fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.
Intrinsic terminators in Mycoplasma hyopneumoniae transcription.

Science.gov (United States)

Fritsch, Tiago Ebert; Siqueira, Franciele Maboni; Schrank, Irene Silveira

2015-04-08

Mycoplasma hyopneumoniae, an important pathogen of swine, exhibits a low guanine and cytosine (GC) content genome. M. hyopneumoniae genome is organised in long transcriptional units and promoter sequences have been mapped upstream of all transcription units. These analysis provided insights into the gene organisation and transcription initiation at the genome scale. However, the presence of transcriptional terminator sequences in the M. hyopneumoniae genome is poorly understood. In silico analyses demonstrated the presence of putative terminators in 82% of the 33 monocistronic units (mCs) and in 74% of the 116 polycistronic units (pCs) considering different classes of terminators. The functional activity of 23 intrinsic terminators was confirmed by RT-PCR and qPCR. Analysis of all terminators found by three software algorithms, combined with experimental results, allowed us to propose a pattern of RNA hairpin formation during the termination process and to predict the location of terminators in the M. hyopneumoniae genome sequence. The stem-loop structures of intrinsic terminators of mycoplasma diverge from the pattern of terminators found in other bacteria due the low content of guanine and cytosine. In M. hyopneumoniae, transcription can end after a transcriptional unit and before its terminator sequence and can also continue past the terminator sequence with RNA polymerases gradually releasing the RNA.

Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability

Science.gov (United States)

Xiao, Jing; Zhao, Jin; Liu, Mengjun; Liu, Ping; Dai, Li; Zhao, Zhihui

2015-01-01

Chinese jujube (Ziziphus jujuba), an economically important species in the Rhamnaceae family, is a popular fruit tree in Asia. Here, we surveyed and characterized simple sequence repeats (SSRs) in the jujube genome. A total of 436,676 SSR loci were identified, with an average distance of 0.93 Kb between the loci. A large proportion of the SSRs included mononucleotide, dinucleotide and trinucleotide repeat motifs, which accounted for 64.87%, 24.40%, and 8.74% of all repeats, respectively. Among the mononucleotide repeats, A/T was the most common, whereas AT/TA was the most common dinucleotide repeat. A total of 30,565 primer pairs were successfully designed and screened using a series of criteria. Moreover, 725 of 1,000 randomly selected primer pairs were effective among 6 cultivars, and 511 of these primer pairs were polymorphic. Sequencing the amplicons of two SSRs across three jujube cultivars revealed variations in the repeats. The transferability of jujube SSR primers proved that 35/64 SSRs could be transferred across family boundary. Using jujube SSR primers, clustering analysis results from 15 species were highly consistent with the Angiosperm Phylogeny Group (APGIII) System. The genome-wide characterization of SSRs in Chinese jujube is very valuable for whole-genome characterization and marker-assisted selection in jujube breeding. In addition, the transferability of jujube SSR primers could provide a solid foundation for their further utilization. PMID:26000739
Effects of loading sequences and size of repeated stress block of loads on fatigue life calculated using fatigue functions

International Nuclear Information System (INIS)

Schott, G.

1989-01-01

It is well-known that collective form, stress intensity and loading sequence of individual stresses as well as size of repeated stress blocks can influence fatigue life, significantly. The basic variant of the consecutive Woehler curve concept will permit these effects to be involved into fatigue life computation. The paper presented will demonstrate that fatigue life computations using fatigue functions reflect the loading sequence effect with multilevel loading precisely and provide reliable fatigue life data. Effects of size of repeated stress block and loading sequence on fatigue life as observed with block program tests can be reproduced using the new computation method. (orig.) [de
Isolation and N-terminal sequencing of a novel cadmium-binding protein from Boletus edulis

Science.gov (United States)

Collin-Hansen, C.; Andersen, R. A.; Steinnes, E.

2003-05-01

A Cd-binding protein was isolated from the popular edible mushroom Boletus edulis, which is a hyperaccumulator of both Cd and Hg. Wild-growing samples of B. edulis were collected from soils rich in Cd. Cd radiotracer was added to the crude protein preparation obtained from ethanol precipitation of heat-treated cytosol. Proteins were then further separated in two consecutive steps; gel filtration and anion exchange chromatography. In both steps the Cd radiotracer profile showed only one distinct peak, which corresponded well with the profiles of endogenous Cd obtained by atomic absorption spectrophotometry (AAS). Concentrations of the essential elements Cu and Zn were low in the protein fractions high in Cd. N-terminal sequencing performed on the Cd-binding protein fractions revealed a protein with a novel amino acid sequence, which contained aromatic amino acids as well as proline. Both the N-terminal sequencing and spectrofluorimetric analysis with EDTA and ABD-F (4-aminosulfonyl-7-fluoro-2, 1, 3-benzoxadiazole) failed to detect cysteine in the Cd-binding fractions. These findings conclude that the novel protein does not belong to the metallothionein family. The results suggest a role for the protein in Cd transport and storage, and they are of importance in view of toxicology and food chemistry, but also for environmental protection.
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

Science.gov (United States)

Šatović, Eva; Plohl, Miroslav

2017-10-01

Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
Identification of apple cultivars on the basis of simple sequence repeat markers.

Science.gov (United States)

Liu, G S; Zhang, Y G; Tao, R; Fang, J G; Dai, H Y

2014-09-12

DNA markers are useful tools that play an important role in plant cultivar identification. They are usually based on polymerase chain reaction (PCR) and include simple sequence repeats (SSRs), inter-simple sequence repeats, and random amplified polymorphic DNA. However, DNA markers were not used effectively in the complete identification of plant cultivars because of the lack of known DNA fingerprints. Recently, a novel approach called the cultivar identification diagram (CID) strategy was developed to facilitate the use of DNA markers for separate plant individuals. The CID was designed whereby a polymorphic maker was generated from each PCR that directly allowed for cultivar sample separation at each step. Therefore, it could be used to identify cultivars and varieties easily with fewer primers. In this study, 60 apple cultivars, including a few main cultivars in fields and varieties from descendants (Fuji x Telamon) were examined. Of the 20 pairs of SSR primers screened, 8 pairs gave reproducible, polymorphic DNA amplification patterns. The banding patterns obtained from these 8 primers were used to construct a CID map. Each cultivar or variety in this study was distinguished from the others completely, indicating that this method can be used for efficient cultivar identification. The result contributed to studies on germplasm resources and the seedling industry in fruit trees.
The polymorphic integumentary mucin B.1 from Xenopus laevis contains the short consensus repeat.

Science.gov (United States)

Probst, J C; Hauser, F; Joba, W; Hoffmann, W

1992-03-25

The frog integumentary mucin B.1 (FIM-B.1), discovered by molecular cloning, contains a cysteine-rich C-terminal domain which is homologous with von Willebrand factor. With the help of the polymerase chain reaction, we now characterize a contiguous region 5' to the von Willebrand factor domain containing the short consensus repeat typical of many proteins from the complement system. Multiple transcripts have been cloned, which originate from a single animal and differ by a variable number of tandem repeats (rep-33 sequences). These different transcripts probably originate solely from two genes and are generated presumably by alternative splicing of an huge array of functional cassettes. This model is supported by analysis of genomic FIM-B.1 sequences from Xenopus laevis. Here, rep-33 sequences are arranged in an interrupted array of individual units. Additionally, results of Southern analysis revealed genetic polymorphism between different animals which is predicted to be within the tandem repeats. A first investigation of the predicted mucins with the help of a specific antibody against a synthetic peptide determined the molecular mass of FIM-B.1 to greater than 200 kDa. Here again, genetic polymorphism between different animals is detected.
Distribution and evolution of repeated sequences in genomes of Triatominae (Hemiptera-Reduviidae inferred from genomic in situ hybridization.

Directory of Open Access Journals (Sweden)

Sebastian Pita

Full Text Available The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by Genomic in situ Hybridization using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is constituted by highly repeated sequences, which is conserved among 10 species of Triatomini tribe suggesting be an ancestral character for this group. However, the Y chromosome in Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about autosomal repeated sequences distribution but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.
Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

Science.gov (United States)

Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

1999-04-16

A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Simple sequence repeats in Neurospora crassa: distribution, polymorphism and evolutionary inference

Directory of Open Access Journals (Sweden)

Park Jongsun

2008-01-01

Full Text Available Abstract Background Simple sequence repeats (SSRs have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. Results We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST, which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. Conclusion Taking our computational, statistical and experimental data together, we conclude that 1 the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, 2 the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, 3 there are different levels of evolutionary forces in variation of amino acid repeats, and 4 SSRs are stable molecular markers for genetic studies in N. crassa.
Sequence variability is correlated with weak immunogenicity in Streptococcus pyogenes M protein

Science.gov (United States)

Lannergård, Jonas; Kristensen, Bodil M; Gustafsson, Mattias C U; Persson, Jenny J; Norrby-Teglund, Anna; Stålhammar-Carlemalm, Margaretha; Lindahl, Gunnar

2015-01-01

The M protein of Streptococcus pyogenes, a major bacterial virulence factor, has an amino-terminal hypervariable region (HVR) that is a target for type-specific protective antibodies. Intriguingly, the HVR elicits a weak antibody response, indicating that it escapes host immunity by two mechanisms, sequence variability and weak immunogenicity. However, the properties influencing the immunogenicity of regions in an M protein remain poorly understood. Here, we studied the antibody response to different regions of the classical M1 and M5 proteins, in which not only the HVR but also the adjacent fibrinogen-binding B repeat region exhibits extensive sequence divergence. Analysis of antisera from S. pyogenes-infected patients, infected mice, and immunized mice showed that both the HVR and the B repeat region elicited weak antibody responses, while the conserved carboxy-terminal part was immunodominant. Thus, we identified a correlation between sequence variability and weak immunogenicity for M protein regions. A potential explanation for the weak immunogenicity was provided by the demonstration that protease digestion selectively eliminated the HVR-B part from whole M protein-expressing bacteria. These data support a coherent model, in which the entire variable HVR-B part evades antibody attack, not only by sequence variability but also by weak immunogenicity resulting from protease attack. PMID:26175306
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2

International Nuclear Information System (INIS)

Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.; Cohen, G.H.

1984-01-01

Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors caried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared out sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells
Sequence variations in C9orf72 downstream of the hexanucleotide repeat region and its effect on repeat-primed PCR interpretation

DEFF Research Database (Denmark)

Nordin, Angelica; Akimoto, Chizuru; Wuolikainen, Anna

2017-01-01

A large GGGGCC-repeat expansion mutation (HREM) in C9orf72 is the most common known cause of ALS and FTD in European populations. Sequence variations immediately downstream of the HREM region have previously been observed and have been suggested to be one reason for difficulties in interpreting R...
Genetic alterations of the long terminal repeat of an ecotropic porcine endogenous retrovirus during passage in human cells

International Nuclear Information System (INIS)

Denner, Joachim; Specke, Volker; Thiesen, Ulla; Karlas, Alexander; Kurth, Reinhard

2003-01-01

Human-tropic porcine endogenous retroviruses (PERV) such as PERV-A and PERV-B can infect human cells and are therefore a potential risk to recipients of xenotransplants. A similar risk is posed by recombinant viruses containing the receptor-binding site of PERV-A and large parts of the genome of the ecotropic PERV-C including its long terminal repeat (LTR). We describe here the unique organization of the PERV-C LTR and its changes during serial passage of recombinant virus in human cells. An increase in virus titer correlated with an increase in LTR length, caused by multiplication of 37-bp repeats containing nuclear factor Y binding sites. Luciferase dual reporter assays revealed a correlation between the number of repeats and the extent of expression. No alterations have been observed in the receptor-binding site, indicating that the increased titer is due to the changes in the LTR. These data indicate that recombinant PERVs generated during infection of human cells can adapt and subsequently replicate with greater efficiency
Length and repeat-sequence variation in 58 STRs and 94 SNPs in two Spanish populations.

Science.gov (United States)

Casals, Ferran; Anglada, Roger; Bonet, Núria; Rasal, Raquel; van der Gaag, Kristiaan J; Hoogenboom, Jerry; Solé-Morata, Neus; Comas, David; Calafell, Francesc

2017-09-01

We have genotyped the 58 STRs (27 autosomal, 24 Y-STRs and 7 X-STRs) and 94 autosomal SNPs in Illumina ForenSeq™ Primer Mix A in 88 Spanish Roma (Gypsy) samples and 143 Catalans. Since this platform is based in massive parallel sequencing, we have used simple R scripts to uncover the sequence variation in the repeat region. Thus, we have found, across 58 STRs, 541 length-based alleles, which, after considering repeat-sequence variation, became 804 different alleles. All loci in both populations were in Hardy-Weinberg equilibrium. F ST between both populations was 0.0178 for autosomal SNPs, 0.0146 for autosomal STRs, 0.0101 for X-STRs and 0.1866 for Y-STRs. Combined a priori statistics showed quite large; for instance, pooling all the autosomal loci, the a priori probabilities of discriminating a suspect become 1-(2.3×10 -70 ) and 1-(5.9×10 -73 ), for Roma and Catalans respectively, and the chances of excluding a false father in a trio are 1-(2.6×10 -20 ) and 1-(2.0×10 -21 ). Copyright © 2017 Elsevier B.V. All rights reserved.
Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

Science.gov (United States)

Amirhaeri, S; Wohlrab, F; Wells, R D

1995-02-17

The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.
Molecular identification and characterization of clustered regularly interspaced short palindromic repeats (CRISPRs) in a urease-positive thermophilic Campylobacter sp. (UPTC).

Science.gov (United States)

Tasaki, E; Hirayama, J; Tazumi, A; Hayashi, K; Hara, Y; Ueno, H; Moore, J E; Millar, B C; Matsuda, M

2012-02-01

Novel clustered regularly-interspaced short palindromic repeats (CRISPRs) locus [7,500 base pairs (bp) in length] occurred in the urease-positive thermophilic Campylobacter (UPTC) Japanese isolate, CF89-12. The 7,500 bp gene loci consisted of the 5'-methylaminomethyl-2-thiouridylate methyltransferase gene, putative (P) CRISPR associated (p-Cas), putative open reading frames, Cas1 and Cas2, leader sequence region (146 bp), 12 CRISPRs consensus sequence repeats (each 36 bp) separated by a non-repetitive unique spacer region of similar length (26-31 bp) and the phosphatidyl glycerophosphatase A gene. When the CRISPRs loci in the UPTC CF89-12 and five C. jejuni isolates were compared with one another, these six isolates contained p-Cas, Cas1 and Cas2 within the loci. Four to 12 CRISPRs consensus sequence repeats separated by a non-repetitive unique spacer region occurred in six isolates and the nucleotide sequences of those repeats gave approximately 92-100% similarity with each other. However, no sequence similarity occurred in the unique spacer regions among these isolates. The putative σ(70) transcriptional promoter and the hypothetical ρ-independent terminator structures for the CRISPRs and Cas were detected. No in vivo transcription of p-Cas, Cas1 and Cas2 was confirmed in the UPTC cells.
Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

Science.gov (United States)

Spielmann, A; Stutz, E

1983-10-25

The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.
Development of simple sequence repeat markers and diversity analysis in alfalfa (Medicago sativa L.).

Science.gov (United States)

Wang, Zan; Yan, Hongwei; Fu, Xinnian; Li, Xuehui; Gao, Hongwen

2013-04-01

Efficient and robust molecular markers are essential for molecular breeding in plant. Compared to dominant and bi-allelic markers, multiple alleles of simple sequence repeat (SSR) markers are particularly informative and superior in genetic linkage map and QTL mapping in autotetraploid species like alfalfa. The objective of this study was to enrich SSR markers directly from alfalfa expressed sequence tags (ESTs). A total of 12,371 alfalfa ESTs were retrieved from the National Center for Biotechnology Information. Total 774 SSR-containing ESTs were identified from 716 ESTs. On average, one SSR was found per 7.7 kb of EST sequences. Tri-nucleotide repeats (48.8 %) was the most abundant motif type, followed by di-(26.1 %), tetra-(11.5 %), penta-(9.7 %), and hexanucleotide (3.9 %). One hundred EST-SSR primer pairs were successfully designed and 29 exhibited polymorphism among 28 alfalfa accessions. The allele number per marker ranged from two to 21 with an average of 6.8. The PIC values ranged from 0.195 to 0.896 with an average of 0.608, indicating a high level of polymorphism of the EST-SSR markers. Based on the 29 EST-SSR markers, assessment of genetic diversity was conducted and found that Medicago sativa ssp. sativa was clearly different from the other subspecies. The high transferability of those EST-SSR markers was also found for relative species.
Transcription arrest by a G quadruplex forming-trinucleotide repeat sequence from the human c-myb gene.

Science.gov (United States)

Broxson, Christopher; Beckett, Joshua; Tornaletti, Silvia

2011-05-17

Non canonical DNA structures correspond to genomic regions particularly susceptible to genetic instability. The transcription process facilitates formation of these structures and plays a major role in generating the instability associated with these genomic sites. However, little is known about how non canonical structures are processed when encountered by an elongating RNA polymerase. Here we have studied the behavior of T7 RNA polymerase (T7RNAP) when encountering a G quadruplex forming-(GGA)(4) repeat located in the human c-myb proto-oncogene. To make direct correlations between formation of the structure and effects on transcription, we have taken advantage of the ability of the T7 polymerase to transcribe single-stranded substrates and of G4 DNA to form in single-stranded G-rich sequences in the presence of potassium ions. Under physiological KCl concentrations, we found that T7 RNAP transcription was arrested at two sites that mapped to the c-myb (GGA)(4) repeat sequence. The extent of arrest did not change with time, indicating that the c-myb repeat represented an absolute block and not a transient pause to T7 RNAP. Consistent with G4 DNA formation, arrest was not observed in the absence of KCl or in the presence of LiCl. Furthermore, mutations in the c-myb (GGA)(4) repeat, expected to prevent transition to G4, also eliminated the transcription block. We show T7 RNAP arrest at the c-myb repeat in double-stranded DNA under conditions mimicking the cellular concentration of biomolecules and potassium ions, suggesting that the G4 structure formed in the c-myb repeat may represent a transcription roadblock in vivo. Our results support a mechanism of transcription-coupled DNA repair initiated by arrest of transcription at G4 structures.
Sequence context effects on 8-methoxypsoralen photobinding to defined DNA fragments

International Nuclear Information System (INIS)

Sage, E.; Moustacchi, E.

1987-01-01

The photoreaction of 8-methoxypsoralen (8-MOP) with DNA fragments of defined sequence was studied. The authors took advantage of the blockage by bulky adducts of the 3'-5'-exonuclease activity associated with the T4 DNA polymerase. The action of the exonuclease is stopped by biadducts as well as by monoadducts. The termination products were analyzed on sequencing gels. A strong sequence specificity was observed in the DNA photobinding of 8-MOP. The exonuclease terminates its digestion near thymine residues, mainly at potentially cross-linkable sites. There is an increasing reactivity of thymine residues in the order T < TT << TTT in a GC environment. For thymine residues in cross-linkable sites, the reactivity follows the order AT << TA ∼ TAT << ATA < ATAT < ATATAA. Repeated A-T sequences are hot spots for the photochemical reaction of 8-MOP with DNA. Both monoadducts and interstrand cross-links are formed preferentially in 5'-TpA sites. The results highlight the role of the sequence and consequently of the conformation around a potential site in the photobinding of 8-MOP to DNA

Use of short tandem repeat sequences to study Mycobacterium leprae in leprosy patients in Malawi and India.

Directory of Open Access Journals (Sweden)

Saroj K Young

2008-04-01

Full Text Available Inadequate understanding of the transmission of Mycobacterium leprae makes it difficult to predict the impact of leprosy control interventions. Genotypic tests that allow tracking of individual bacterial strains would strengthen epidemiological studies and contribute to our understanding of the disease.Genotyping assays based on variation in the copy number of short tandem repeat sequences were applied to biopsies collected in population-based epidemiological studies of leprosy in northern Malawi, and from members of multi-case households in Hyderabad, India. In the Malawi series, considerable genotypic variability was observed between patients, and also within patients, when isolates were collected at different times or from different tissues. Less within-patient variability was observed when isolates were collected from similar tissues at the same time. Less genotypic variability was noted amongst the closely related Indian patients than in the Malawi series.Lineages of M. leprae undergo changes in their pattern of short tandem repeat sequences over time. Genetic divergence is particularly likely between bacilli inhabiting different (e.g., skin and nerve tissues. Such variability makes short tandem repeat sequences unsuitable as a general tool for population-based strain typing of M. leprae, or for distinguishing relapse from reinfection. Careful use of these markers may provide insights into the development of disease within individuals and for tracking of short transmission chains.
Analysis of sequence diversity through internal transcribed spacers and simple sequence repeats to identify Dendrobium species.

Science.gov (United States)

Liu, Y T; Chen, R K; Lin, S J; Chen, Y C; Chin, S W; Chen, F C; Lee, C Y

2014-04-08

The Orchidaceae is one of the largest and most diverse families of flowering plants. The Dendrobium genus has high economic potential as ornamental plants and for medicinal purposes. In addition, the species of this genus are able to produce large crops. However, many Dendrobium varieties are very similar in outward appearance, making it difficult to distinguish one species from another. This study demonstrated that the 12 Dendrobium species used in this study may be divided into 2 groups by internal transcribed spacer (ITS) sequence analysis. Red and yellow flowers may also be used to separate these species into 2 main groups. In particular, the deciduous characteristic is associated with the ITS genetic diversity of the A group. Of 53 designed simple sequence repeat (SSR) primer pairs, 7 pairs were polymorphic for polymerase chain reaction products that were amplified from a specific band. The results of this study demonstrate that these 7 SSR primer pairs may potentially be used to identify Dendrobium species and their progeny in future studies.
Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

International Nuclear Information System (INIS)

Kudo, Shinichi; Fukuda, Minoru

1989-01-01

Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ∼ 1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggest that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication
Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

Science.gov (United States)

Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

2012-08-01

Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.
Sequence variability is correlated with weak immunogenicity in Streptococcus pyogenes M protein.

Science.gov (United States)

Lannergård, Jonas; Kristensen, Bodil M; Gustafsson, Mattias C U; Persson, Jenny J; Norrby-Teglund, Anna; Stålhammar-Carlemalm, Margaretha; Lindahl, Gunnar

2015-10-01

The M protein of Streptococcus pyogenes, a major bacterial virulence factor, has an amino-terminal hypervariable region (HVR) that is a target for type-specific protective antibodies. Intriguingly, the HVR elicits a weak antibody response, indicating that it escapes host immunity by two mechanisms, sequence variability and weak immunogenicity. However, the properties influencing the immunogenicity of regions in an M protein remain poorly understood. Here, we studied the antibody response to different regions of the classical M1 and M5 proteins, in which not only the HVR but also the adjacent fibrinogen-binding B repeat region exhibits extensive sequence divergence. Analysis of antisera from S. pyogenes-infected patients, infected mice, and immunized mice showed that both the HVR and the B repeat region elicited weak antibody responses, while the conserved carboxy-terminal part was immunodominant. Thus, we identified a correlation between sequence variability and weak immunogenicity for M protein regions. A potential explanation for the weak immunogenicity was provided by the demonstration that protease digestion selectively eliminated the HVR-B part from whole M protein-expressing bacteria. These data support a coherent model, in which the entire variable HVR-B part evades antibody attack, not only by sequence variability but also by weak immunogenicity resulting from protease attack. © 2015 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
Engineered bacterial hydrophobic oligopeptide repeats in a synthetic yeast prion, [REP-PSI+

Directory of Open Access Journals (Sweden)

Fátima eGasset-Rosa

2015-04-01

Full Text Available The yeast translation termination factor Sup35p, by aggregating as the [PSI+] prion, enables ribosomes to read-through stop codons, thus expanding the diversity of the Saccharomyces cerevisiae proteome. Yeast prions are functional amyloids that replicate by templating their conformation on native protein molecules, then assembling as large aggregates and fibers. Prions propagate epigenetically from mother to daughter cells by fragmentation of such assemblies. In the N-terminal prion-forming domain, Sup35p has glutamine/asparagine-rich oligopeptide repeats (OPRs, which enable propagation through chaperone-elicited shearing. We have engineered chimeras by replacing the polar OPRs in Sup35p by up to five repeats of a hydrophobic amyloidogenic sequence from the synthetic bacterial prionoid RepA-WH1. The resulting hybrid, [REP-PSI+], i was functional in a stop codon read-through assay in S. cerevisiae; ii generates weak phenotypic variants upon both its expression or transformation into [psi-] cells; iii these variants correlated with high molecular weight aggregates resistant to SDS during electrophoresis; and iv according to fluorescence microscopy, the fusion of the prion domains from the engineered chimeras to the reporter protein mCherry generated perivacuolar aggregate foci in yeast cells. All these are signatures of bona fide yeast prions. As assessed through biophysical approaches, the chimeras assembled as oligomers rather than as the fibers characteristic of [PSI+]. These results suggest that it is the balance between polar and hydrophobic residues in OPRs what determines prion conformational dynamics. In addition, our findings illustrate the feasibility of enabling new propagation traits in yeast prions by engineering OPRs with heterologous amyloidogenic sequence repeats.
P22 Arc repressor: enhanced expression of unstable mutants by addition of polar C-terminal sequences.

OpenAIRE

Milla, M. E.; Brown, B. M.; Sauer, R. T.

1993-01-01

Many mutant variants of the P22 Arc repressor are subject to intracellular proteolysis in Escherichia coli, which precludes their expression at levels sufficient for purification and subsequent biochemical characterization. Here we examine the effects of several different C-terminal extension sequences on the expression and activity of a set of Arc mutants. We show that two tail sequences, KNQHE (st5) and H6KNQHE (st11), increase the expression levels of most mutants from 10- to 20-fold and, ...
Molecular cloning and sequence analysis of hamster CENP-A cDNA

Directory of Open Access Journals (Sweden)

Valdivia Manuel M

2002-05-01

Full Text Available Abstract Background The centromere is a specialized locus that mediates chromosome movement during mitosis and meiosis. This chromosomal domain comprises a uniquely packaged form of heterochromatin that acts as a nucleus for the assembly of the kinetochore a trilaminar proteinaceous structure on the surface of each chromatid at the primary constriction. Kinetochores mediate interactions with the spindle fibers of the mitotic apparatus. Centromere protein A (CENP-A is a histone H3-like protein specifically located to the inner plate of kinetochore at active centromeres. CENP-A works as a component of specialized nucleosomes at centromeres bound to arrays of repeat satellite DNA. Results We have cloned the hamster homologue of human and mouse CENP-A. The cDNA isolated was found to contain an open reading frame encoding a polypeptide consisting of 129 amino acid residues with a C-terminal histone fold domain highly homologous to those of CENP-A and H3 sequences previously released. However, significant sequence divergence was found at the N-terminal region of hamster CENP-A that is five and eleven residues shorter than those of mouse and human respectively. Further, a human serine 7 residue, a target site for Aurora B kinase phosphorylation involved in the mechanism of cytokinesis, was not found in the hamster protein. A human autoepitope at the N-terminal region of CENP-A described in autoinmune diseases is not conserved in the hamster protein. Conclusions We have cloned the hamster cDNA for the centromeric protein CENP-A. Significant differences on protein sequence were found at the N-terminal tail of hamster CENP-A in comparison with that of human and mouse. Our results show a high degree of evolutionary divergence of kinetochore CENP-A proteins in mammals. This is related to the high diverse nucleotide repeat sequences found at the centromere DNA among species and support a current centromere model for kinetochore function and structural
Cytogenetic Analysis of Populus trichocarpa - Ribosomal DNA, Telomere Repeat Sequence, and Marker-selected BACs

Science.gov (United States)

M.N. lslam-Faridi; C.D. Nelson; S.P. DiFazio; L.E. Gunter; G.A. Tuskan

2009-01-01

The 185-285 rDNA and 55 rDNA loci in Populus trichocarpa were localized using fluorescent in situ hybridization (FISH). Two 185-285 rDNA sites and one 55 rDNA site were identified and located at the ends of 3 different chromosomes. FISH signals from the Arabidopsis-type telomere repeat sequence were observed at the distal ends of each chromosome. Six BAC clones...
Genotyping and Molecular Identification of Date Palm Cultivars Using Inter-Simple Sequence Repeat (ISSR) Markers.

Science.gov (United States)

Ayesh, Basim M

2017-01-01

Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal high polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. The protocol describes 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, with 28 polymorphic loci.
The complete chloroplast genome sequence of Taxus chinensis var. mairei (Taxaceae): loss of an inverted repeat region and comparative analysis with related species.

Science.gov (United States)

Zhang, Yanzhen; Ma, Ji; Yang, Bingxian; Li, Ruyi; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Zhang, Lin

2014-05-01

Taxus chinensis var. mairei (Taxaceae) is a domestic variety of yew species in local China. This plant is one of the sources for paclitaxel, which is a promising antineoplastic chemotherapy drugs during the last decade. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of T. chinensis var. mairei. The T. chinensis var. mairei cp genome is 129,513 bp in length, with 113 single copy genes and two duplicated genes (trnI-CAU, trnQ-UUG). Among the 113 single copy genes, 9 are intron-containing. Compared to other land plant cp genomes, the T. chinensis var. mairei cp genome has lost one of the large inverted repeats (IRs) found in angiosperms, fern, liverwort, and gymnosperm such as Cycas revoluta and Ginkgo biloba L. Compared to related species, the gene order of T. chinensis var. mairei has a large inversion of ~110kb including 91 genes (from rps18 to accD) with gene contents unarranged. Repeat analysis identified 48 direct and 2 inverted repeats 30 bp long or longer with a sequence identity greater than 90%. Repeated short segments were found in genes rps18, rps19 and clpP. Analysis also revealed 22 simple sequence repeat (SSR) loci and almost all are composed of A or T. Copyright © 2014 Elsevier B.V. All rights reserved.
Distinct repeat motifs at the C-terminal region of CagA of Helicobacter pylori strains isolated from diseased patients and asymptomatic individuals in West Bengal, India

Directory of Open Access Journals (Sweden)

Chattopadhyay Santanu

2012-05-01

Full Text Available Abstract Background Infection with Helicobacter pylori strains that express CagA is associated with gastritis, peptic ulcer disease, and gastric adenocarcinoma. The biological function of CagA depends on tyrosine phosphorylation by a cellular kinase. The phosphate acceptor tyrosine moiety is present within the EPIYA motif at the C-terminal region of the protein. This region is highly polymorphic due to variations in the number of EPIYA motifs and the polymorphism found in spacer regions among EPIYA motifs. The aim of this study was to analyze the polymorphism at the C-terminal end of CagA and to evaluate its association with the clinical status of the host in West Bengal, India. Results Seventy-seven H. pylori strains isolated from patients with various clinical statuses were used to characterize the C-ternimal polymorphic region of CagA. Our analysis showed that there is no correlation between the previously described CagA types and various disease outcomes in Indian context. Further analyses of different CagA structures revealed that the repeat units in the spacer sequences within the EPIYA motifs are actually more discrete than the previously proposed models of CagA variants. Conclusion Our analyses suggest that EPIYA motifs as well as the spacer sequence units are present as distinct insertions and deletions, which possibly have arisen from extensive recombination events. Moreover, we have identified several new CagA types, which could not be typed by the existing systems and therefore, we have proposed a new typing system. We hypothesize that a cagA gene encoding higher number EPIYA motifs may perhaps have arisen from cagA genes that encode lesser EPIYA motifs by acquisition of DNA segments through recombination events.
Constructs for the expression of repeating triple-helical protein domains

International Nuclear Information System (INIS)

Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M

2009-01-01

The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.
Constructs for the expression of repeating triple-helical protein domains

Energy Technology Data Exchange (ETDEWEB)

Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M, E-mail: jerome.werkmeister@csiro.a [CSIRO Molecular and Health Technologies, Bag 10, Clayton South, VIC 3169 (Australia)

2009-02-15

The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.
Accessibility of the Shine-Dalgarno sequence dictates N-terminal codon bias in E. coli

OpenAIRE

Shakhnovich, Eugene; Zhang, Wenli; Yan, Jin; Adkar, Bharat; Jacobs, William; Bhattacharyya, Sanchari; Adkar, Bharat

2018-01-01

Despite considerable efforts, no physical mechanism has been shown to explain N-terminal codon bias in prokaryotic genomes. Using a systematic study of synonymous substitutions in two endogenous E. coli genes, we show that interactions between the coding region and the upstream Shine-Dalgarno (SD) sequence modulate the efficiency of translation initiation, affecting both intracellular mRNA and protein levels due to the inherent coupling of transcription and translation in E. coli. We further ...
Inter-simple sequence repeat (ISSR) loci mapping in the genome of perennial ryegrass

DEFF Research Database (Denmark)

Pivorienė, O; Pašakinskienė, I; Brazauskas, G

2008-01-01

The aim of this study was to identify and characterize new ISSR markers and their loci in the genome of perennial ryegrass. A subsample of the VrnA F2 mapping family of perennial ryegrass comprising 92 individuals was used to develop a linkage map including inter-simple sequence repeat markers...... demonstrated a 70% similarity to the Hordeum vulgare germin gene GerA. Inter-SSR mapping will provide useful information for gene targeting, quantitative trait loci mapping and marker-assisted selection in perennial ryegrass....
Evaluation of the Terminal Sequencing and Spacing System for Performance Based Navigation Arrivals

Science.gov (United States)

Thipphavong, Jane; Jung, Jaewoo; Swenson, Harry N.; Martin, Lynne; Lin, Melody; Nguyen, Jimmy

2013-01-01

NASA has developed the Terminal Sequencing and Spacing (TSS) system, a suite of advanced arrival management technologies combining timebased scheduling and controller precision spacing tools. TSS is a ground-based controller automation tool that facilitates sequencing and merging arrivals that have both current standard ATC routes and terminal Performance-Based Navigation (PBN) routes, especially during highly congested demand periods. In collaboration with the FAA and MITRE's Center for Advanced Aviation System Development (CAASD), TSS system performance was evaluated in human-in-the-loop (HITL) simulations with currently active controllers as participants. Traffic scenarios had mixed Area Navigation (RNAV) and Required Navigation Performance (RNP) equipage, where the more advanced RNP-equipped aircraft had preferential treatment with a shorter approach option. Simulation results indicate the TSS system achieved benefits by enabling PBN, while maintaining high throughput rates-10% above baseline demand levels. Flight path predictability improved, where path deviation was reduced by 2 NM on average and variance in the downwind leg length was 75% less. Arrivals flew more fuel-efficient descents for longer, spending an average of 39 seconds less in step-down level altitude segments. Self-reported controller workload was reduced, with statistically significant differences at the p less than 0.01 level. The RNP-equipped arrivals were also able to more frequently capitalize on the benefits of being "Best-Equipped, Best- Served" (BEBS), where less vectoring was needed and nearly all RNP approaches were conducted without interruption.
Effects of GABA[subscript A] Modulators on the Repeated Acquisition of Response Sequences in Squirrel Monkeys

Science.gov (United States)

Campbell, Una C.; Winsauer, Peter J.; Stevenson, Michael W.; Moerschbaecher, Joseph M.

2004-01-01

The present study investigated the effects of positive and negative GABA[subscript A] modulators under three different baselines of repeated acquisition in squirrel monkeys in which the monkeys acquired a three-response sequence on three keys under a second-order fixed-ratio (FR) schedule of food reinforcement. In two of these baselines, the…
Morpholino spin-labeling for base-pair sequencing of a 3'-terminal RNA stem by proton homonuclear Overhauser enhancements: yeast ribosomal 5S RNA

International Nuclear Information System (INIS)

Lee, K.M.; Marshall, A.G.

1987-01-01

Base-pair sequences for 5S and 5.8S RNAs are not readily extracted from proton homonuclear nuclear Overhauser enhancement (NOE) connectivity experiments alone, due to extensive peak overlap in the downfield (11-15 ppm) proton NMR spectrum. In this paper, we introduce a new method for base-pair proton peak assignment for ribosomal RNAs, based upon the distance-dependent broadening of the resonances of base-pair protons spatially proximal to a paramagnetic group. Introduction of a nitroxide spin-label covalently attached to the 3'-terminal ribose provides an unequivocal starting point for base-pair hydrogen-bond proton NMR assignment. Subsequent NOE connectivities then establish the base-pair sequence for the terminal stem of a 5S RNA. Periodate oxidation of yeast 5S RNA, followed by reaction with 4-amino-2,2,6,6-tetramethylpiperidinyl-1-oxy (TEMPO-NH2) and sodium borohydride reduction, produces yeast 5S RNA specifically labeled with a paramagnetic nitroxide group at the 3'-terminal ribose. Comparison of the 500-MHz 1H NMR spectra of native and 3'-terminal spin-labeled yeast 5S RNA serves to identify the terminal base pair (G1 . C120) and its adjacent base pair (G2 . U119) on the basis of their proximity to the 3'-terminal spin-label. From that starting point, we have then identified (G . C, A . U, or G . U) and sequenced eight of the nine base pairs in the terminal helix via primary and secondary NOE's
Development and Characterization of Simple Sequence Repeat (SSR) Markers Based on RNA-Sequencing of Medicago sativa and In silico Mapping onto the M. truncatula Genome

Science.gov (United States)

Wang, Zan; Yu, Guohui; Shi, Binbin; Wang, Xuemin; Qiang, Haiping; Gao, Hongwen

2014-01-01

Sufficient codominant genetic markers are needed for various genetic investigations in alfalfa since the species is an outcrossing autotetraploid. With the newly developed next generation sequencing technology, a large amount of transcribed sequences of alfalfa have been generated and are available for identifying SSR markers by data mining. A total of 54,278 alfalfa non-redundant unigenes were assembled through the Illumina HiSeqTM 2000 sequencing technology. Based on 3,903 unigene sequences, 4,493 SSRs were identified. Tri-nucleotide repeats (56.71%) were the most abundant motif class while AG/CT (21.7%), AGG/CCT (19.8%), AAC/GTT (10.3%), ATC/ATG (8.8%), and ACC/GGT (6.3%) were the subsequent top five nucleotide repeat motifs. Eight hundred and thirty- seven EST-SSR primer pairs were successfully designed. Of these, 527 (63%) primer pairs yielded clear and scored PCR products and 372 (70.6%) exhibited polymorphisms. High transferability was observed for ssp falcata at 99.2% (523) and 71.7% (378) in M. truncatula. In addition, 313 of 527 SSR marker sequences were in silico mapped onto the eight M. truncatula chromosomes. Thirty-six polymorphic SSR primer pairs were used in the genetic relatedness analysis of 30 Chinese alfalfa cultivated accessions generating a total of 199 scored alleles. The mean observed heterozygosity and polymorphic information content were 0.767 and 0.635, respectively. The codominant markers not only enriched the current resources of molecular markers in alfalfa, but also would facilitate targeted investigations in marker-trait association, QTL mapping, and genetic diversity analysis in alfalfa. PMID:24642969

Protection against β-amyloid neurotoxicity by a non-toxic endogenous N-terminal β-amyloid fragment and its active hexapeptide core sequence.

Science.gov (United States)

Forest, Kelly H; Alfulaij, Naghum; Arora, Komal; Taketa, Ruth; Sherrin, Tessi; Todorovic, Cedomir; Lawrence, James L M; Yoshikawa, Gene T; Ng, Ho-Leung; Hruby, Victor J; Nichols, Robert A

2018-01-01

High levels (μM) of beta amyloid (Aβ) oligomers are known to trigger neurotoxic effects, leading to synaptic impairment, behavioral deficits, and apoptotic cell death. The hydrophobic C-terminal domain of Aβ, together with sequences critical for oligomer formation, is essential for this neurotoxicity. However, Aβ at low levels (pM-nM) has been shown to function as a positive neuromodulator and this activity resides in the hydrophilic N-terminal domain of Aβ. An N-terminal Aβ fragment (1-15/16), found in cerebrospinal fluid, was also shown to be a highly active neuromodulator and to reverse Aβ-induced impairments of long-term potentiation. Here, we show the impact of this N-terminal Aβ fragment and a shorter hexapeptide core sequence in the Aβ fragment (Aβcore: 10-15) to protect or reverse Aβ-induced neuronal toxicity, fear memory deficits and apoptotic death. The neuroprotective effects of the N-terminal Aβ fragment and Aβcore on Aβ-induced changes in mitochondrial function, oxidative stress, and apoptotic neuronal death were demonstrated via mitochondrial membrane potential, live reactive oxygen species, DNA fragmentation and cell survival assays using a model neuroblastoma cell line (differentiated NG108-15) and mouse hippocampal neuron cultures. The protective action of the N-terminal Aβ fragment and Aβcore against spatial memory processing deficits in amyloid precursor protein/PSEN1 (5XFAD) mice was demonstrated in contextual fear conditioning. Stabilized derivatives of the N-terminal Aβcore were also shown to be fully protective against Aβ-triggered oxidative stress. Together, these findings indicate an endogenous neuroprotective role for the N-terminal Aβ fragment, while active stabilized N-terminal Aβcore derivatives offer the potential for therapeutic application. © 2017 International Society for Neurochemistry.
Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

Directory of Open Access Journals (Sweden)

Varala Kranthi

2007-05-01

Full Text Available Abstract Background Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis. Conclusion This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.
N-terminal amino acid sequence of Bacillus licheniformis alpha-amylase: comparison with Bacillus amyloliquefaciens and Bacillus subtilis Enzymes.

OpenAIRE

Kuhn, H; Fietzek, P P; Lampen, J O

1982-01-01

The thermostable, liquefying alpha-amylase from Bacillus licheniformis was immunologically cross-reactive with the thermolabile, liquefying alpha-amylase from Bacillus amyloliquefaciens. Their N-terminal amino acid sequences showed extensive homology with each other, but not with the saccharifying alpha-amylases of Bacillus subtilis.
Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

Energy Technology Data Exchange (ETDEWEB)

Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie; Weinberg, Marc S. [Antiviral Gene Therapy Research Unit, University of the Witwatersrand (South Africa); Arbuthnot, Patrick, E-mail: Patrick.Arbuthnot@wits.ac.za [Antiviral Gene Therapy Research Unit, University of the Witwatersrand (South Africa)

2009-11-20

RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.
Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

International Nuclear Information System (INIS)

Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie; Weinberg, Marc S.; Arbuthnot, Patrick

2009-01-01

RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.
The leucine-rich repeat structure.

Science.gov (United States)

Bella, J; Hindle, K L; McEwan, P A; Lovell, S C

2008-08-01

The leucine-rich repeat is a widespread structural motif of 20-30 amino acids with a characteristic repetitive sequence pattern rich in leucines. Leucine-rich repeat domains are built from tandems of two or more repeats and form curved solenoid structures that are particularly suitable for protein-protein interactions. Thousands of protein sequences containing leucine-rich repeats have been identified by automatic annotation methods. Three-dimensional structures of leucine-rich repeat domains determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. As the essential structural principles become well established, the leucine-rich repeat architecture is emerging as an attractive framework for structural prediction and protein engineering. This review presents an update of the current understanding of leucine-rich repeat structure at the primary, secondary, tertiary and quaternary levels and discusses specific examples from recently determined three-dimensional structures.
Nucleotide sequence, transcript mapping, and regulation of the RAD2 gene of Saccharomyces cerevisiae

International Nuclear Information System (INIS)

Madura, K.; Prakash, S.

1986-01-01

The authors determined the nucleotide sequence, mapped the 5' and 3' nRNA termini, and examined the regulation of the RAD2 gene of Saccharomyces cerevisiae. A long open reading frame within the RAD2 transcribed region encodes a protein of 1031 amino acids with a calculated molecular weight of 117,847. A disruption of the RAD2 gene that deletes the 78 carboxyl terminal codons results in loss of RAD2 function. The 5' ends of RAD2 mRNA show considerable heterogeneity, mapping 5 to 62 nucleotides upstream of the first ATG codon of the long RAD2 open reading frame. The longest RAD2 transcripts also contain a short open reading frame of 37 codons that precedes and overlaps the 5' end of the long RAD2 open reading frame. The RAD2 3' nRNA end maps 171 nucleotides downstream of the TAA termination codon and 20 nucleotides downstream from a 12-base-pair inverted repeat that might function in transcript termination. Northern blot analysis showed a ninefold increase in steady-state levels of RAD2 mRNA after treatment of yeast cells with UV light. The 5' flanking region of the RAD2 gene contains several direct and inverted repeats and a 44-nuclotide-long purine-rich tract. The sequence T G G A G G C A T T A A found at position - 167 to -156 in the RAD2 gene is similar to at sequence present in the 5' flanking regions of the RAD7 and RAD10 genes
The sequence of camelpox virus shows it is most closely related to variola virus, the cause of smallpox.

Science.gov (United States)

Gubser, Caroline; Smith, Geoffrey L

2002-04-01

Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.
Comparison of the degree of homology of DNA and quantity of repeated sequences in an intact plant and cell structure

International Nuclear Information System (INIS)

Solov'yan, V.T.; Kunaleh, V.A.; Shumnyl, V.K.; Vershinin, A.V.

1986-01-01

This paper attempts to assess the quantity of repeated sequences and degree of homology of DNA in the intact plant and two lines of callus tissue of Rauwolfia serpentina Benth maintained for 20 years, which differ among themselves in the level of biosynthesis of the pharmacologically valuable alkaloid ajmaline. The tritium-labeled repeats of plants and calli were used in direct and reverse hybridization on nitrocellulose filters. Hybridization of H 3-labeled repeats with phage 17 DNA was used as control. The radioactivity of filters after washing was measured in a liquid scintillation counter
Survey and analysis of simple sequence repeats in the Laccaria bicolor genome, with development of microsatellite markers

Energy Technology Data Exchange (ETDEWEB)

Labbe, Jessy L [ORNL; Murat, Claude [INRA, Nancy, France; Morin, Emmanuelle [INRA, Nancy, France; Le Tacon, F [UMR, France; Martin, Francis [INRA, Nancy, France

2011-01-01

It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
Nucleotide sequences of two cellulase genes from alkalophilic Bacillus sp. strain N-4 and their strong homology.

OpenAIRE

Fukumori, F; Sashihara, N; Kudo, T; Horikoshi, K

1986-01-01

Two genes for cellulases of alkalophilic Bacillus sp. strain N-4 (ATCC 21833) have been sequenced. From the DNA sequences the cellulases encoded in the plasmids pNK1 and pNK2 consist of 488 and 409 amino acids, respectively. The DNA and protein sequences of the pNK1-encoded cellulase are related to those of the pNK2-encoded cellulase. The pNK2-encoded cellulase lacks the direct repeat sequence of a stretch of 60 amino acids near the C-terminal end of the pNK1-encoded cellulase. The duplicatio...
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

Directory of Open Access Journals (Sweden)

Gao Zhihong

2010-07-01

Full Text Available Abstract Background Expressed Sequence Tag (EST has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047, among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65% and low in the peach (46%, and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.
Kaposi's sarcoma herpesvirus C-terminal LANA concentrates at pericentromeric and peri-telomeric regions of a subset of mitotic chromosomes

International Nuclear Information System (INIS)

Kelley-Clarke, Brenna; Ballestas, Mary E.; Komatsu, Takashi; Kaye, Kenneth M.

2007-01-01

The Kaposi's sarcoma-associated herpesvirus (KSHV) latency-associated nuclear antigen (LANA) tethers KSHV terminal repeat (TR) DNA to mitotic chromosomes to efficiently segregate episomes to progeny nuclei. LANA contains N- and C-terminal chromosome binding regions. We now show that C-terminal LANA preferentially concentrates to paired dots at pericentromeric and peri-telomeric regions of a subset of mitotic chromosomes through residues 996-1139. Deletions within C-terminal LANA abolished both self-association and chromosome binding, consistent with a requirement for self-association to bind chromosomes. A deletion abolishing TR DNA binding did not affect chromosome targeting, indicating LANA's localization is not due to binding its recognition sequence in chromosomal DNA. LANA distributed similarly on human and non-human mitotic chromosomes. These results are consistent with C-terminal LANA interacting with a cell factor that concentrates at pericentromeric and peri-telomeric regions of mitotic chromosomes
Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

Science.gov (United States)

Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

2015-10-01

Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.
Linkage of congenital isolated adrenocorticotropic hormone deficiency to the corticotropin releasing hormone locus using simple sequence repeat polymorphisms

Energy Technology Data Exchange (ETDEWEB)

Kyllo, J.H.; Collins, M.M.; Vetter, K.L. [Univ. of Iowa College of Medicine, Iowa City, IA (United States)] [and others

1996-03-29

Genetic screening techniques using simple sequence repeat polymorphisms were applied to investigate the molecular nature of congenital isolated adrenocorticotropic hormone (ACTH) deficiency. We hypothesize that this rare cause of hypocortisolism shared by a brother and sister with two unaffected sibs and unaffected parents is inherited as an autosomal recessive single gene mutation. Genes involved in the hypothalamic-pituitary axis controlling cortisol sufficiency were investigated for a causal role in this disorder. Southern blotting showed no detectable mutations of the gene encoding pro-opiomelanocortin (POMC), the ACTH precursor. Other candidate genes subsequently considered were those encoding neuroendocrine convertase-1, and neuroendocrine convertase-2 (NEC-1, NEC-2), and corticotropin releasing hormone (CRH). Tests for linkage were performed using polymorphic di- and tetranucleotide simple sequence repeat markers flanking the reported map locations for POMC, NEC-1, NEC-2, and CRH. The chromosomal haplotypes determined by the markers flanking the loci for POMC, NEC-1, and NEC-2 were not compatible with linkage. However, 22 individual markers defining the chromosomal haplotypes flanking CRH were compatible with linkage of the disorder to the immediate area of this gene of chromosome 8. Based on these data, we hypothesize that the ACTH deficiency in this family is due to an abnormality of CRH gene structure or expression. These results illustrate the useful application of high density genetic maps constructed with simple sequence repeat markers for inclusion/exclusion studies of candidate genes in even very small nuclear families segregating for unusual phenotypes. 25 refs., 5 figs., 2 tabs.
Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins

Science.gov (United States)

Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.

2016-01-01

Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422
A C-terminal PDZ domain-binding sequence is required for striatal distribution of the dopamine transporter

DEFF Research Database (Denmark)

Rickhag, Karl Mattias; Hansen, Freja Herborg; Sørensen, Gunnar

2013-01-01

believed to bind synaptic scaffolding proteins, but its functional significance is uncertain. Here we demonstrate that two different dopamine transporter knock-in mice with disrupted PDZ-binding motifs (dopamine transporter-AAA and dopamine transporter+Ala) are characterized by dramatic loss of dopamine......The dopamine transporter mediates reuptake of dopamine from the synaptic cleft. The cellular mechanisms controlling dopamine transporter levels in striatal nerve terminals remain poorly understood. The dopamine transporters contain a C-terminal PDZ (PSD-95/Discs-large/ZO-1) domain-binding sequence...... transporter expression in the striatum, causing hyperlocomotion and attenuated response to amphetamine. In cultured dopaminergic neurons and striatal slices from dopamine transporter-AAA mice, we find markedly reduced dopamine transporter surface levels and evidence for enhanced constitutive internalization...
Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple

Science.gov (United States)

2012-01-01

Background Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding
The conserved residue Arg46 in the N-terminal heptad repeat domain of HIV-1 gp41 is critical for viral fusion and entry.

Directory of Open Access Journals (Sweden)

Xiaoyi Wang

Full Text Available During the process of HIV-1 fusion with the target cell, the N-terminal heptad repeat (NHR of gp41 interacts with the C-terminal heptad repeat (CHR to form fusogenic six-helix bundle (6-HB core. We previously identified a crucial residue for 6-HB formation and virus entry--Lys63 (K63 in the C-terminal region of NHR (aa 54-70, which forms a hydrophobic cavity. It can form an important salt bridge with Asp121 (D121 in gp41 CHR. Here, we found another important conserved residue for virus fusion and entry, Arg46 (R46, in the N-terminal region of NHR (aa 35-53, which forms a hydrogen bond with a polar residue, Asn43 (N43, in NHR, as a part of the hydrogen-bond network. R46 can also form a salt bridge with a negatively charged residue, Glu137 (E137, in gp41 CHR. Substitution of R46 with the hydrophobic residue Ala (R46A or the negatively charged residue Glu (R46E resulted in disruption of the hydrogen bond network, breakage of the salt bridge and reduction of 6-HB's stability, leading to impairment of viral fusion and decreased inhibition of N36, an NHR peptide. Similarly, CHR peptide C34 with substitution of E137 for Ala (E137A or Arg (E137R also exhibited reduced inhibitory activity against HIV-1 infection and HIV-1-mediated cell-to-cell fusion. These results suggest that the positively charged residue R46 and its hydrogen bond network, together with the salt bridge between R46 and E137, are important for viral fusion and entry and may therefore serve as a target for designing novel HIV fusion/entry inhibitors.
Nucleotide sequence of a cDNA coding for the amino-terminal region of human prepro. alpha. 1(III) collagen

Energy Technology Data Exchange (ETDEWEB)

Toman, P D; Ricca, G A [Rorer Biotechnology, Inc., Springfield, VA (USA); de Crombrugghe, B [National Institutes of Health, Bethesda, MD (USA)

1988-07-25

Type III Collagen is synthesized in a variety of tissues as a precursor macromolecule containing a leader sequence, a N-propeptide, a N-telopeptide, the triple helical region, a C-telopeptide, and C-propeptide. To further characterize the human type III collagen precursor, a human placental cDNA library was constructed in gt11 using an oligonucleotide derived from a partial cDNA sequence corresponding to the carboxy-terminal part of the 1(III) collagen. A cDNA was identified which contains the leader sequence, the N-propeptide and N-telopeptide regions. The DNA sequence of these regions are presented here. The triple helical, C-telopeptide and C-propeptide amino acid sequence for human type III collagen has been determined previously. A comparison of the human amino acid sequence with mouse, chicken, and calf sequence shows 81%, 81%, and 92% similarity, respectively. At the DNA level, the sequence similarity between human and mouse or chicken type III collagen sequences in this area is 82% and 77%, respectively.

A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

Science.gov (United States)

Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

2012-10-01

One application of next-generation sequencing (NGS) is the targeted resequencing of interested genes which has not been used in viral integration site analysis of gene therapy applications. Here, we combined targeted sequence capture array and next generation sequencing to address the whole genome profiling of viral integration sites. Human 293T and K562 cells were transduced with a HIV-1 derived vector. A custom made DNA probe sets targeted pLVTHM vector used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using GS FLX platform. Seven thousand four hundred and eighty four human genome sequences flanking the long terminal repeats (LTR) of pLVTHM fragment sequences matched with an identity of at least 98% and minimum 50 bp criteria in both cells. In total, 203 unique integration sites were identified. The integrations in both cell lines were totally distant from the CpG islands and from the transcription start sites and preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two different cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.
Comparative molecular cytogenetics of major repetitive sequence families of three Dendrobium species (Orchidaceae) from Bangladesh

Science.gov (United States)

Begum, Rabeya; Alam, Sheikh Shamimul; Menzel, Gerhard; Schmidt, Thomas

2009-01-01

Background and Aims Dendrobium species show tremendous morphological diversity and have broad geographical distribution. As repetitive sequence analysis is a useful tool to investigate the evolution of chromosomes and genomes, the aim of the present study was the characterization of repetitive sequences from Dendrobium moschatum for comparative molecular and cytogenetic studies in the related species Dendrobium aphyllum, Dendrobium aggregatum and representatives from other orchid genera. Methods In order to isolate highly repetitive sequences, a c0t-1 DNA plasmid library was established. Repeats were sequenced and used as probes for Southern hybridization. Sequence divergence was analysed using bioinformatic tools. Repetitive sequences were localized along orchid chromosomes by fluorescence in situ hybridization (FISH). Key Results Characterization of the c0t-1 library resulted in the detection of repetitive sequences including the (GA)n dinucleotide DmoO11, numerous Arabidopsis-like telomeric repeats and the highly amplified dispersed repeat DmoF14. The DmoF14 repeat is conserved in six Dendrobium species but diversified in representative species of three other orchid genera. FISH analyses showed the genome-wide distribution of DmoF14 in D. moschatum, D. aphyllum and D. aggregatum. Hybridization with the telomeric repeats demonstrated Arabidopsis-like telomeres at the chromosome ends of Dendrobium species. However, FISH using the telomeric probe revealed two pairs of chromosomes with strong intercalary signals in D. aphyllum. FISH showed the terminal position of 5S and 18S–5·8S–25S rRNA genes and a characteristic number of rDNA sites in the three Dendrobium species. Conclusions The repeated sequences isolated from D. moschatum c0t-1 DNA constitute major DNA families of the D. moschatum, D. aphyllum and D. aggregatum genomes with DmoF14 representing an ancient component of orchid genomes. Large intercalary telomere-like arrays suggest chromosomal
Clinical Observation on Termination of Early Pregnancy of 213 Cases after Caesarian Section with Repeated Use of Mifepristone and Misoprostol

Institute of Scientific and Technical Information of China (English)

高佩佩; 汪平

1999-01-01

Objective To investigate the efficacy and safety in women after caesarian section for termination of early pregnancies by treatment, or repeated treatment with mifepristone and misoprostot.Subjects and Methods A total of 213 pregnant women with amenorrhea of 34-69d after caesarian section who asked for medical abortion were recruited,including 63 cases undergoing their second medical abortion.A total amount of mi feprisstone of 150 mg given in separate doses(25 mg×4 and 50 mg at the first time)was administered orally within 3d, followed by misoprostot of 0.6 mg orally in the morning of d 3.Results The complete abortion rate was 92.5%,incomplete abortion was 4.7% and failure was 2.8%.Conclusion The sequential use of mifepristone and misoprostol could be successfully and repeatedly used for induced abortion in those women with a caesarian section histo-ry.Its efficacy was similar to that for ordinary population.Its safety and effec-tiveness were satisfactory.
The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

Science.gov (United States)

Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

2014-04-01

Analyzing genome structure in different species allows to gain an insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.
Triplet repeat sequences in human DNA can be detected by hybridization to a synthetic (5'-CGG-3')17 oligodeoxyribonucleotide

DEFF Research Database (Denmark)

Behn-Krappa, A; Mollenhauer, J; Doerfler, W

1993-01-01

The seemingly autonomous amplification of naturally occurring triplet repeat sequences in the human genome has been implicated in the causation of human genetic disease, such as the fragile X (Martin-Bell) syndrome, myotonic dystrophy (Curshmann-Steinert), spinal and bulbar muscular atrophy...
The Pentapeptide Repeat Proteins

OpenAIRE

Vetting, Matthew W.; Hegde, Subray S.; Fajardo, J. Eduardo; Fiser, Andras; Roderick, Steven L.; Takiff, Howard E.; Blanchard, John S.

2006-01-01

The Pentapeptide Repeat Protein (PRP) family has over 500 members in the prokaryotic and eukaryotic kingdoms. These proteins are composed of, or contain domains composed of, tandemly repeated amino acid sequences with a consensus sequence of [S,T,A,V][D,N][L,F]-[S,T,R][G]. The biochemical function of the vast majority of PRP family members is unknown. The three-dimensional structure of the first member of the PRP family was determined for the fluoroquinolone resistance protein (MfpA) from Myc...
Genome-Wide Analysis of Simple Sequence Repeats in Bitter Gourd (Momordica charantia

Directory of Open Access Journals (Sweden)

Junjie Cui

2017-06-01

Full Text Available Bitter gourd (Momordica charantia is widely cultivated as a vegetable and medicinal herb in many Asian and African countries. After the sequencing of the cucumber (Cucumis sativus, watermelon (Citrullus lanatus, and melon (Cucumis melo genomes, bitter gourd became the fourth cucurbit species whose whole genome was sequenced. However, a comprehensive analysis of simple sequence repeats (SSRs in bitter gourd, including a comparison with the three aforementioned cucurbit species has not yet been published. Here, we identified a total of 188,091 and 167,160 SSR motifs in the genomes of the bitter gourd lines ‘Dali-11’ and ‘OHB3-1,’ respectively. Subsequently, the SSR content, motif lengths, and classified motif types were characterized for the bitter gourd genomes and compared among all the cucurbit genomes. Lastly, a large set of 138,727 unique in silico SSR primer pairs were designed for bitter gourd. Among these, 71 primers were selected, all of which successfully amplified SSRs from the two bitter gourd lines ‘Dali-11’ and ‘K44’. To further examine the utilization of unique SSR primers, 21 SSR markers were used to genotype a collection of 211 bitter gourd lines from all over the world. A model-based clustering method and phylogenetic analysis indicated a clear separation among the geographic groups. The genomic SSR markers developed in this study have considerable potential value in advancing bitter gourd research.
De novo Transcriptome Sequencing Reveals a Considerable Bias in the Incidence of Simple Sequence Repeats towards the Downstream of ‘Pre-miRNAs’ of Black Pepper

Science.gov (United States)

Joy, Nisha; Asha, Srinivasan; Mallika, Vijayan; Soniya, Eppurathu Vasudevan

2013-01-01

Next generation sequencing has an advantageon transformational development of species with limited available sequence data as it helps to decode the genome and transcriptome. We carried out the de novo sequencing using illuminaHiSeq™ 2000 to generate the first leaf transcriptome of black pepper (Piper nigrum L.), an important spice variety native to South India and also grown in other tropical regions. Despite the economic and biochemical importance of pepper, a scientifically rigorous study at the molecular level is far from complete due to lack of sufficient sequence information and cytological complexity of its genome. The 55 million raw reads obtained, when assembled using Trinity program generated 2,23,386 contigs and 1,28,157 unigenes. Reports suggest that the repeat-rich genomic regions give rise to small non-coding functional RNAs. MicroRNAs (miRNAs) are the most abundant type of non-coding regulatory RNAs. In spite of the widespread research on miRNAs, little is known about the hair-pin precursors of miRNAs bearing Simple Sequence Repeats (SSRs). We used the array of transcripts generated, for the in silico prediction and detection of ‘43 pre-miRNA candidates bearing different types of SSR motifs’. The analysis identified 3913 different types of SSR motifs with an average of one SSR per 3.04 MB of thetranscriptome. About 0.033% of the transcriptome constituted ‘pre-miRNA candidates bearing SSRs’. The abundance, type and distribution of SSR motifs studied across the hair-pin miRNA precursors, showed a significant bias in the position of SSRs towards the downstream of predicted ‘pre-miRNA candidates’. The catalogue of transcripts identified, together with the demonstration of reliable existence of SSRs in the miRNA precursors, permits future opportunities for understanding the genetic mechanism of black pepper and likely functions of ‘tandem repeats’ in miRNAs. PMID:23469176
Assembly of Repeat Content Using Next Generation Sequencing Data

Energy Technology Data Exchange (ETDEWEB)

labutti, Kurt; Kuo, Alan; Grigoriev, Igor; Copeland, Alex

2014-03-17

Repetitive organisms pose a challenge for short read assembly, and typically only unique regions and repeat regions shorter than the read length, can be accurately assembled. Recently, we have been investigating the use of Pacific Biosciences reads for de novo fungal assembly. We will present an assessment of the quality and degree of repeat reconstruction possible in a fungal genome using long read technology. We will also compare differences in assembly of repeat content using short read and long read technology.
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi and related species

Directory of Open Access Journals (Sweden)

Odvody Gary N

2008-11-01

Full Text Available Abstract Background A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites to detect differences at the DNA level. Results Among the 55 primers pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55% with dinucleotide repeats and 6 (11% with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40% and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR- DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis, sugar cane (P. sacchari, pearl millet (Sclerospora graminicola and rose (Peronospora sparsa indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellites primer combinations, with an average of 4.8 polymorphic markers per SSR across 34
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species.

Science.gov (United States)

Perumal, Ramasamy; Nimmakayala, Padmavathi; Erattaimuthu, Saradha R; No, Eun-Gyu; Reddy, Umesh K; Prom, Louis K; Odvody, Gary N; Luster, Douglas G; Magill, Clint W

2008-11-29

A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Among the 55 primers pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR- DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicate that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellites primer combinations, with an average of 4.8 polymorphic markers per SSR across 34 Peronosclerospora, Peronospora and Sclerospora
Direct repeat sequences are essential for function of the cis-acting locus of transfer (clt) of Streptomyces phaeochromogenes plasmid pJV1.

Science.gov (United States)

Franco, Bernardo; González-Cerón, Gabriela; Servín-González, Luis

2003-11-01

The functionality of direct and inverted repeat sequences inside the cis acting locus of transfer (clt) of the Streptomyces plasmid pJV1 was determined by testing the effect of different deletions on plasmid transfer. The results show that the single most important element for pJV1 clt function is a series of evenly spaced 9 bp long direct repeats which match the consensus CCGCACA(C/G)(C/G), since their deletion caused a dramatic reduction in plasmid transfer. The presence of these repeats in the absence of any other clt sequences allowed plasmid transfer to occur at a frequency that was at least two orders of magnitude higher than that obtained in the complete absence of clt. A database search revealed regions with a similar organization, and in the same position, in Streptomyces plasmids pSN22 and pSLS, which have transfer proteins homologous to those of pJV1.
Generating markers based on biotic stress of protein system in and tandem repeats sequence for Aquilaria sp

International Nuclear Information System (INIS)

Azhar Mohamad; Muhammad Hanif Azhari N; Siti Norhayati Ismail

2014-01-01

Aquilaria sp. belongs to the Thymelaeaceae family and is well distributed in Asia region. The species has multipurpose use from root to shoot and is an economically important crop, which generates wide interest in understanding genetic diversity of the species. Knowledge on DNA-based markers has become a prerequisite for more effective application of molecular marker techniques in breeding and mapping programs. In this work, both targeted genes and tandem repeat sequences were used for DNA fingerprinting in Aquilaria sp. A total of 100 ISSR (inter simple sequence repeat) primers and 50 combination pairs of specific primers derived from conserved region of a specific protein known as system in were optimized. 38 ISSR primers were found affirmative for polymorphism evaluation study and were generated from both specific and degenerate ISSR primers. And one utmost combination of system in primers showed significant results in distinguishing the Aquilaria sp. In conclusion, polymorphism derived from ISSR profiling and targeted stress genes of protein system in proved as a powerful approach for identification and molecular classification of Aquilaria sp. which will be useful for diversification in identifying any mutant lines derived from nature. (author)
Genome-wide identification and validation of simple sequence repeats (SSRs) from Asparagus officinalis.

Science.gov (United States)

Li, Shufen; Zhang, Guojun; Li, Xu; Wang, Lianjun; Yuan, Jinhong; Deng, Chuanliang; Gao, Wujun

2016-06-01

Garden asparagus (Asparagus officinalis), an important vegetable cultivated worldwide, can also serve as a model dioecious plant species in the study of sex determination and sex chromosome evolution. However, limited DNA marker resources have been developed and used for this species. To expand these resources, we examined the DNA sequences for simple sequence repeats (SSRs) in 163,406 scaffolds representing approximately 400 Mbp of the A. officinalis genome. A total of 87,576 SSRs were identified in 59,565 scaffolds. The most abundant SSR repeats were trinucleotide and tetranucleotide, accounting for 29.2 and 29.1% of the total SSRs, respectively, followed by di-, penta-, hexa-, hepta-, and octanucleotides. The AG motif was most common among dinucleotides and was also the most frequent motif in the entire A. officinalis genome, representing 14.7% of all SSRs. A total of 41,917 SSR primers pairs were designed to amplify SSRs. Twenty-two genomic SSR markers were tested in 39 asparagus accessions belonging to ten cultivars and one accession of Asparagus setaceus for determination of genetic diversity. The intra-species polymorphism information content (PIC) values of the 22 genomic SSR markers were intermediate, with an average of 0.41. The genetic diversity between the ten A. officinalis cultivars was low, and the UPGMA dendrogram was largely unrelated to cultivars. It is here suggested that the sex of individuals is an important factor influencing the clustering results. The information reported here provides new information about the organization of the microsatellites in A. officinalis genome and lays a foundation for further genetic studies and breeding applications of A. officinalis and related species. Copyright © 2016 Elsevier Ltd. All rights reserved.
Ex vivo response to histone deacetylase (HDAC inhibitors of the HIV long terminal repeat (LTR derived from HIV-infected patients on antiretroviral therapy.

Directory of Open Access Journals (Sweden)

Hao K Lu

Full Text Available Histone deacetylase inhibitors (HDACi can induce human immunodeficiency virus (HIV transcription from the HIV long terminal repeat (LTR. However, ex vivo and in vivo responses to HDACi are variable and the activity of HDACi in cells other than T-cells have not been well characterised. Here, we developed a novel assay to determine the activity of HDACi on patient-derived HIV LTRs in different cell types. HIV LTRs from integrated virus were amplified using triple-nested Alu-PCR from total memory CD4+ T-cells (CD45RO+ isolated from HIV-infected patients prior to and following suppressive antiretroviral therapy. NL4-3 or patient-derived HIV LTRs were cloned into the chromatin forming episomal vector pCEP4, and the effect of HDACi investigated in the astrocyte and epithelial cell lines SVG and HeLa, respectively. There were no significant differences in the sequence of the HIV LTRs isolated from CD4+ T-cells prior to and after 18 months of combination antiretroviral therapy (cART. We found that in both cell lines, the HDACi panobinostat, trichostatin A, vorinostat and entinostat activated patient-derived HIV LTRs to similar levels seen with NL4-3 and all patient derived isolates had similar sensitivity to maximum HDACi stimulation. We observed a marked difference in the maximum fold induction of luciferase by HDACi in HeLa and SVG, suggesting that the effect of HDACi may be influenced by the cellular environment. Finally, we observed significant synergy in activation of the LTR with vorinostat and the viral protein Tat. Together, our results suggest that the LTR sequence of integrated virus is not a major determinant of a functional response to an HDACi.
A theory that may explain the Hayflick limit--a means to delete one copy of a repeating sequence during each cell cycle in certain human cells such as fibroblasts.

Science.gov (United States)

Naveilhan, P; Baudet, C; Jabbour, W; Wion, D

1994-09-01

A model that may explain the limited division potential of certain cells such as human fibroblasts in culture is presented. The central postulate of this theory is that there exists, prior to certain key exons that code for materials needed for cell division, a unique sequence of specific repeating segments of DNA. One copy of such repeating segments is deleted during each cell cycle in cells that are not protected from such deletion through methylation of their cytosine residues. According to this theory, the means through which such repeated sequences are removed, one per cycle, is through the sequential action of enzymes that act much as bacterial restriction enzymes do--namely to produce scissions in both strands of DNA in areas that correspond to the DNA base sequence recognition specificities of such enzymes. After the first scission early in a replicative cycle, that enzyme becomes inhibited, but the cleavage of the first site exposes the closest site in the repetitive element to the action of a second restriction enzyme after which that enzyme also becomes inhibited. Then repair occurs, regenerating the original first site. Through this sequential activation and inhibition of two different restriction enzymes, only one copy of the repeating sequence is deleted during each cell cycle. In effect, the repeating sequence operates as a precise counter of the numbers of cell doubling that have occurred since the cells involved differentiated during development.
Identification and Mapping of Simple Sequence Repeat Markers from Common Bean (Phaseolus vulgaris L. Bacterial Artificial Chromosome End Sequences for Genome Characterization and Genetic–Physical Map Integration

Directory of Open Access Journals (Sweden)

Juana M. Córdoba

2010-11-01

Full Text Available Microsatellite markers or simple sequence repeat (SSR loci are useful for diversity characterization and genetic–physical mapping. Different in silico microsatellite search methods have been developed for mining bacterial artificial chromosome (BAC end sequences for SSRs. The overall goal of this study was genome characterization based on SSRs in 89,017 BAC end sequences (BESs from the G19833 common bean ( L. library. Another objective was to identify new SSR taking into account three tandem motif identification programs (Automated Microsatellite Marker Development [AMMD], Tandem Repeats Finder [TRF], and SSRLocator [SSRL]. Among the microsatellite search engines, SSRL identified the highest number of SSRs; however, when primer design was attempted, the number dropped due to poor primer design regions. Automated Microsatellite Marker Development software identified many SSRs with valuable AT/TA or AG/TC motifs, while TRF found fewer SSRs and produced no primers. A subgroup of 323 AT-rich, di-, and trinucleotide SSRs were selected from the AMMD results and used in a parental survey with DOR364 and G19833, of which 75 could be mapped in the corresponding population; these represented 4052 BAC clones. Together with 92 previously mapped BES- and 114 non-BES-derived markers, a total of 280 SSRs were included in the polymerase chain reaction (PCR-based map, integrating a total of 8232 BAC clones in 162 contigs from the physical map.
Structure, organization, and sequence of alpha satellite DNA from human chromosome 17: evidence for evolution by unequal crossing-over and an ancestral pentamer repeat shared with the human X chromosome.

Science.gov (United States)

Waye, J S; Willard, H F

1986-09-01

The centromeric regions of all human chromosomes are characterized by distinct subsets of a diverse tandemly repeated DNA family, alpha satellite. On human chromosome 17, the predominant form of alpha satellite is a 2.7-kilobase-pair higher-order repeat unit consisting of 16 alphoid monomers. We present the complete nucleotide sequence of the 16-monomer repeat, which is present in 500 to 1,000 copies per chromosome 17, as well as that of a less abundant 15-monomer repeat, also from chromosome 17. These repeat units were approximately 98% identical in sequence, differing by the exclusion of precisely 1 monomer from the 15-monomer repeat. Homologous unequal crossing-over is suggested as a probable mechanism by which the different repeat lengths on chromosome 17 were generated, and the putative site of such a recombination event is identified. The monomer organization of the chromosome 17 higher-order repeat unit is based, in part, on tandemly repeated pentamers. A similar pentameric suborganization has been previously demonstrated for alpha satellite of the human X chromosome. Despite the organizational similarities, substantial sequence divergence distinguishes these subsets. Hybridization experiments indicate that the chromosome 17 and X subsets are more similar to each other than to the subsets found on several other human chromosomes. We suggest that the chromosome 17 and X alpha satellite subsets may be related components of a larger alphoid subfamily which have evolved from a common ancestral repeat into the contemporary chromosome-specific subsets.
The complete genomic sequence of lytic bacteriophage gh-1 infecting Pseudomonas putida--evidence for close relationship to the T7 group

International Nuclear Information System (INIS)

Kovalyova, Irina V.; Kropinski, Andrew M.

2003-01-01

The genome of the lytic Pseudomonas putida bacteriophage gh-1 is linear double-stranded DNA containing 37,359 bp with 216-bp direct terminal repeats. Like other members of the T7 group, the gh-1 genome contains regions of high homology to T7 interspersed with nonhomologous regions that contain small open reading frames of unknown function. The genome shares 31 genes in common with other members of the T7 group, including RNA polymerase, and an additional 12 unique putative genes. A major difference between gh-1 and other members of this group is the absence of any open reading frames between the left direct terminal repeat and gene 1. Sequence analysis of the gh-1 genome also revealed the presence of 10 putative phage promoters with a consensus sequence similar to the promoters of T3 and phiYeO3-12 (consensus: TAAAAACCCTCACTRTGGCHSCM). P. putida mutants resistant to gh-1 were demonstrated to have an altered lipopolysaccharide structure, indicating that members of this group use lipopolysaccharide as their cellular receptor
Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

LENUS (Irish Health Repository)

Ivanov, Ivaylo P

2011-05-01

In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5\\' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5\\' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.

Peptides derived from human galectin-3 N-terminal tail interact with its carbohydrate recognition domain in a phosphorylation-dependent manner

Energy Technology Data Exchange (ETDEWEB)

Berbís, M. Álvaro [Chemical and Physical Biology Department, Centro de Investigaciones Biológicas, CSIC, 28040 Madrid (Spain); André, Sabine [Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians University, 80539 Munich (Germany); Cañada, F. Javier [Chemical and Physical Biology Department, Centro de Investigaciones Biológicas, CSIC, 28040 Madrid (Spain); Pipkorn, Rüdiger [Central Peptide Synthesis Unit, German Cancer Research Center, 69120 Heidelberg (Germany); Ippel, Hans [Department of Biochemistry, CARIM, University of Maastricht, Maastricht (Netherlands); Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455 (United States); Mayo, Kevin H. [Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455 (United States); Kübler, Dieter [Biomolecular Interactions, German Cancer Research Center, 69120 Heidelberg (Germany); Gabius, Hans-Joachim [Institute of Physiological Chemistry, Faculty of Veterinary Medicine, Ludwig-Maximilians University, 80539 Munich (Germany); Jiménez-Barbero, Jesús, E-mail: jjbarbero@cib.csic.es [Chemical and Physical Biology Department, Centro de Investigaciones Biológicas, CSIC, 28040 Madrid (Spain)

2014-01-03

Highlights: •Galectin-3 is composed of a carbohydrate recognition domain and an N-terminal tail. •Synthetic peptides derived from the tail are shown to interact with the CRD. •This interaction is modulated by Ser- and Tyr-phosphorylation of the peptides. -- Abstract: Galectin-3 (Gal-3) is a multi-functional effector protein that functions in the cytoplasm and the nucleus, as well as extracellularly following non-classical secretion. Structurally, Gal-3 is unique among galectins with its carbohydrate recognition domain (CRD) attached to a rather long N-terminal tail composed mostly of collagen-like repeats (nine in the human protein) and terminating in a short non-collagenous terminal peptide sequence unique in this lectin family and not yet fully explored. Although several Ser and Tyr sites within the N-terminal tail can be phosphorylated, the physiological significance of this post-translational modification remains unclear. Here, we used a series of synthetic (phospho)peptides derived from the tail to assess phosphorylation-mediated interactions with {sup 15}N-labeled Gal-3 CRD. HSQC-derived chemical shift perturbations revealed selective interactions at the backface of the CRD that were attenuated by phosphorylation of Tyr 107 and Tyr 118, while phosphorylation of Ser 6 and Ser 12 was essential. Controls with sequence scrambling underscored inherent specificity. Our studies shed light on how phosphorylation of the N-terminal tail may impact on Gal-3 function and prompt further studies using phosphorylated full-length protein.
Peptides derived from human galectin-3 N-terminal tail interact with its carbohydrate recognition domain in a phosphorylation-dependent manner

International Nuclear Information System (INIS)

Berbís, M. Álvaro; André, Sabine; Cañada, F. Javier; Pipkorn, Rüdiger; Ippel, Hans; Mayo, Kevin H.; Kübler, Dieter; Gabius, Hans-Joachim; Jiménez-Barbero, Jesús

2014-01-01

Highlights: •Galectin-3 is composed of a carbohydrate recognition domain and an N-terminal tail. •Synthetic peptides derived from the tail are shown to interact with the CRD. •This interaction is modulated by Ser- and Tyr-phosphorylation of the peptides. -- Abstract: Galectin-3 (Gal-3) is a multi-functional effector protein that functions in the cytoplasm and the nucleus, as well as extracellularly following non-classical secretion. Structurally, Gal-3 is unique among galectins with its carbohydrate recognition domain (CRD) attached to a rather long N-terminal tail composed mostly of collagen-like repeats (nine in the human protein) and terminating in a short non-collagenous terminal peptide sequence unique in this lectin family and not yet fully explored. Although several Ser and Tyr sites within the N-terminal tail can be phosphorylated, the physiological significance of this post-translational modification remains unclear. Here, we used a series of synthetic (phospho)peptides derived from the tail to assess phosphorylation-mediated interactions with 15 N-labeled Gal-3 CRD. HSQC-derived chemical shift perturbations revealed selective interactions at the backface of the CRD that were attenuated by phosphorylation of Tyr 107 and Tyr 118, while phosphorylation of Ser 6 and Ser 12 was essential. Controls with sequence scrambling underscored inherent specificity. Our studies shed light on how phosphorylation of the N-terminal tail may impact on Gal-3 function and prompt further studies using phosphorylated full-length protein
A novel tandem reporter quantifies RNA polymerase II termination in mammalian cells.

Directory of Open Access Journals (Sweden)

Ayan Banerjee

2009-07-01

Full Text Available Making the correct choice between transcription elongation and transcription termination is essential to the function of RNA polymerase II, and fundamental to gene expression. This choice can be influenced by factors modifying the transcription complex, factors modifying chromatin, or signals mediated by the template or transcript. To aid in the study of transcription elongation and termination we have developed a transcription elongation reporter system that consists of tandem luciferase reporters flanking a test sequence of interest. The ratio of expression from the reporters provides a measure of the relative rates of successful elongation through the intervening sequence.Size matched fragments containing the polyadenylation signal of the human beta-actin gene (ACTB and the human beta-globin gene (HBB were evaluated for transcription termination using this new ratiometric tandem reporter assay. Constructs bearing just 200 base pairs on either side of the consensus poly(A addition site terminated 98% and 86% of transcription for ACTB and HBB sequences, respectively. The nearly 10-fold difference in read-through transcription between the two short poly(A regions was eclipsed when additional downstream poly(A sequence was included for each gene. Both poly(A regions proved very effective at termination when 1100 base pairs were included, stopping 99.6% of transcription. To determine if part of the increased termination was simply due to the increased template length, we inserted several kilobases of heterologous coding sequence downstream of each poly(A region test fragment. Unexpectedly, the additional length reduced the effectiveness of termination of HBB sequences 2-fold and of ACTB sequences 3- to 5-fold.The tandem construct provides a sensitive measure of transcription termination in human cells. Decreased Xrn2 or Senataxin levels produced only a modest release from termination. Our data support overlap in allosteric and torpedo mechanisms
Molecular identification and characterization of clustered regularly interspaced short palindromic repeat (CRISPR) gene cluster in Taylorella equigenitalis.

Science.gov (United States)

Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo

2013-09-01

Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.
t2prhd: a tool to study the patterns of repeat evolution

Directory of Open Access Journals (Sweden)

Pénzes Zsolt

2008-01-01

Full Text Available Abstract Background The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. Results We have developed a novel rapid visual approach to study repeat evolution, that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III domain repeats of three selected mammalian Tenascin sequences. Conclusion Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe, that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families, as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts.
Sequence determinants of human microsatellite variability

Directory of Open Access Journals (Sweden)

Jakobsson Mattias

2009-12-01

Full Text Available Abstract Background Microsatellite loci are frequently used in genomic studies of DNA sequence repeats and in population studies of genetic variability. To investigate the effect of sequence properties of microsatellites on their level of variability we have analyzed genotypes at 627 microsatellite loci in 1,048 worldwide individuals from the HGDP-CEPH cell line panel together with the DNA sequences of these microsatellites in the human RefSeq database. Results Calibrating PCR fragment lengths in individual genotypes by using the RefSeq sequence enabled us to infer repeat number in the HGDP-CEPH dataset and to calculate the mean number of repeats (as opposed to the mean PCR fragment length, under the assumption that differences in PCR fragment length reflect differences in the numbers of repeats in the embedded repeat sequences. We find the mean and maximum numbers of repeats across individuals to be positively correlated with heterozygosity. The size and composition of the repeat unit of a microsatellite are also important factors in predicting heterozygosity, with tetra-nucleotide repeat units high in G/C content leading to higher heterozygosity. Finally, we find that microsatellites containing more separate sets of repeated motifs generally have higher heterozygosity. Conclusions These results suggest that sequence properties of microsatellites have a significant impact in determining the features of human microsatellite variability.
The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

Science.gov (United States)

Nallaseth, Ferez Soli

The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1
The human receptor for urokinase plasminogen activator. NH2-terminal amino acid sequence and glycosylation variants

DEFF Research Database (Denmark)

Behrendt, N; Rønne, E; Ploug, M

1990-01-01

-PA. The purified protein shows a single 55-60 kDa band after sodium dodecyl sulfate-polyacrylamide gel electrophoresis and silver staining. It is a heavily glycosylated protein, the deglycosylated polypeptide chain comprising only 35 kDa. The glycosylated protein contains N-acetyl-D-glucosamine and sialic acid......, but no N-acetyl-D-galactosamine. Glycosylation is responsible for substantial heterogeneity in the receptor on phorbol ester-stimulated U937 cells, and also for molecular weight variations among various cell lines. The amino acid composition and the NH2-terminal amino acid sequence are reported...
Development of Simple Sequence Repeats (SSR) markers in Setaria italica (Poaceae) and cross-amplification in related species.

Science.gov (United States)

Lin, Heng-Sheng; Chiang, Chih-Yun; Chang, Song-Bin; Kuoh, Chang-Sheng

2011-01-01

Foxtail millet is one of the world's oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR) markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21%) and CAT (46.15%). The average number of alleles (N(a)), the average heterozygosities observed (H(o)) and expected (H(e)) are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae.
Detection, characterization and evolution of internal repeats in Chitinases of known 3-D structure.

Directory of Open Access Journals (Sweden)

Manigandan Sivaji

Full Text Available Chitinase proteins have evolved and diversified almost in all organisms ranging from prokaryotes to eukaryotes. During evolution, internal repeats may appear in amino acid sequences of proteins which alter the structural and functional features. Here we deciphered the internal repeats from Chitinase and characterized the structural similarities between them. Out of 24 diverse Chitinase sequences selected, six sequences (2CJL, 2DSK, 2XVP, 2Z37, 3EBV and 3HBE did not contain any internal repeats of amino acid sequences. Ten sequences contained repeats of length <50, and the remaining 8 sequences contained repeat length between 50 and 100 residues. Two Chitinase sequences, 1ITX and 3SIM, were found to be structurally similar when analyzed using secondary structure of Chitinase from secondary and 3-Dimensional structure database of Protein Data Bank. Internal repeats of 3N17 and 1O6I were also involved in the ligand-binding site of those Chitinase proteins, respectively. Our analyses enhance our understanding towards the identification of structural characteristics of internal repeats in Chitinase proteins.
5'-Terminal AUGs in Escherichia coli mRNAs with Shine-Dalgarno Sequences: Identification and Analysis of Their Roles in Non-Canonical Translation Initiation.

Directory of Open Access Journals (Sweden)

Heather J Beck

Full Text Available Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs that contain a conventional untranslated leader and Shine-Dalgarno (SD sequence upstream of the gene's start codon while also containing an AUG triplet at the mRNA's 5'- terminus (5'-uAUG. Fusion of the coding sequence specified by the 5'-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5'-terminal upstream open reading frames (5'-uORFs tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5'-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5'-terminal uORFs impact expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5'-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5'-uORFs may play roles in downstream regulation. Since the 5'-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5'-uAUGs should be examined for their potential to function also as leaderless mRNAs.
Complete genome sequence and architecture of crucian carp Carassius auratus herpesvirus (CaHV).

Science.gov (United States)

Zeng, Xiao-Tao; Chen, Zhong-Yuan; Deng, Yuan-Sheng; Gui, Jian-Fang; Zhang, Qi-Ya

2016-12-01

Crucian carp Carassius auratus herpesvirus (CaHV) was isolated from diseased crucian carp with acute gill hemorrhages and high mortality. The CaHV genome was sequenced and analyzed. The data showed that it consists of 275,348 bp and contains 150 predicted ORFs. The architecture of the CaHV genome differs from those of four cyprinid herpesviruses (CyHV1, CyHV2, SY-C1, CyHV3), with insertions, deletions and the absence of a terminal direct repeat. Phylogenetic analysis of the DNA polymerase sequences of 17 strains of Herpesvirales members, and the concatenated 12 core ORFs from 10 strains of alloherpesviruses showed that CaHV clustered together with members of the genus Cyprinivirus, family Alloherpesviridae.
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

Science.gov (United States)

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

2016-01-07

The sequences of the full set of pepper genomes including nuclear, mitochondrial and chloroplast are now available for use. However, the overall of simple sequence repeats (SSR) distribution in these genomes and their practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes with allele number and polymorphic information content value per marker raging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
Analysis of simple sequence repeats in rice bean (Vigna umbellata using an SSR-enriched library

Directory of Open Access Journals (Sweden)

Lixia Wang

2016-02-01

Full Text Available Rice bean (Vigna umbellata Thunb., a warm-season annual legume, is grown in Asia mainly for dried grain or fodder and plays an important role in human and animal nutrition because the grains are rich in protein and some essential fatty acids and minerals. With the aim of expediting the genetic improvement of rice bean, we initiated a project to develop genomic resources and tools for molecular breeding in this little-known but important crop. Here we report the construction of an SSR-enriched genomic library from DNA extracted from pooled young leaf tissues of 22 rice bean genotypes and developing SSR markers. In 433,562 reads generated by a Roche 454 GS-FLX sequencer, we identified 261,458 SSRs, of which 48.8% were of compound form. Dinucleotide repeats were predominant with an absolute proportion of 81.6%, followed by trinucleotides (17.8%. Other types together accounted for 0.6%. The motif AC/GT accounted for 77.7% of the total, followed by AAG/CTT (14.3%, and all others accounted for 12.0%. Among the flanking sequences, 2928 matched putative genes or gene models in the protein database of Arabidopsis thaliana, corresponding with 608 non-redundant Gene Ontology terms. Of these sequences, 11.2% were involved in cellular components, 24.2% were involved molecular functions, and 64.6% were associated with biological processes. Based on homolog analysis, 1595 flanking sequences were similar to mung bean and 500 to common bean genomic sequences. Comparative mapping was conducted using 350 sequences homologous to both mung bean and common bean sequences. Finally, a set of primer pairs were designed, and a validation test showed that 58 of 220 new primers can be used in rice bean and 53 can be transferred to mung bean. However, only 11 were polymorphic when tested on 32 rice bean varieties. We propose that this study lays the groundwork for developing novel SSR markers and will enhance the mapping of qualitative and quantitative traits and marker
Single Strand Annealing Plays a Major Role in RecA-Independent Recombination between Repeated Sequences in the Radioresistant Deinococcus radiodurans Bacterium.

Directory of Open Access Journals (Sweden)

Solenne Ithurbide

2015-10-01

Full Text Available The bacterium Deinococcus radiodurans is one of the most radioresistant organisms known. It is able to reconstruct a functional genome from hundreds of radiation-induced chromosomal fragments. Our work aims to highlight the genes involved in recombination between 438 bp direct repeats separated by intervening sequences of various lengths ranging from 1,479 bp to 10,500 bp to restore a functional tetA gene in the presence or absence of radiation-induced DNA double strand breaks. The frequency of spontaneous deletion events between the chromosomal direct repeats were the same in recA+ and in ΔrecA, ΔrecF, and ΔrecO bacteria, whereas recombination between chromosomal and plasmid DNA was shown to be strictly dependent on the RecA and RecF proteins. The presence of mutations in one of the repeated sequence reduced, in a MutS-dependent manner, the frequency of the deletion events. The distance between the repeats did not influence the frequencies of deletion events in recA+ as well in ΔrecA bacteria. The absence of the UvrD protein stimulated the recombination between the direct repeats whereas the absence of the DdrB protein, previously shown to be involved in DNA double strand break repair through a single strand annealing (SSA pathway, strongly reduces the frequency of RecA- (and RecO- independent deletions events. The absence of the DdrB protein also increased the lethal sectoring of cells devoid of RecA or RecO protein. γ-irradiation of recA+ cells increased about 10-fold the frequencies of the deletion events, but at a lesser extend in cells devoid of the DdrB protein. Altogether, our results suggest a major role of single strand annealing in DNA repeat deletion events in bacteria devoid of the RecA protein, and also in recA+ bacteria exposed to ionizing radiation.
Genomic Characterization for Parasitic Weeds of the Genus Striga by Sample Sequence Analysis

Directory of Open Access Journals (Sweden)

Matt C. Estep

2012-03-01

Full Text Available Generation of ∼2200 Sanger sequence reads or ∼10,000 454 reads for seven Lour. DNA samples (five species allowed identification of the highly repetitive DNA content in these genomes. The 14 most abundant repeats in these species were identified and partially assembled. Annotation indicated that they represent nine long terminal repeat (LTR retrotransposon families, three tandem satellite repeats, one long interspersed element (LINE retroelement, and one DNA transposon. All of these repeats are most closely related to repetitive elements in other closely related plants and are not products of horizontal transfer from their host species. These repeats were differentially abundant in each species, with the LTR retrotransposons and satellite repeats most responsible for variation in genome size. Each species had some repetitive elements that were more abundant and some less abundant than the other species examined, indicating that no single element or any unilateral growth or decrease trend in genome behavior was responsible for variation in genome size and composition. Genome sizes were determined by flow sorting, and the values of 615 Mb [ (L. Kuntze], 1330 Mb [ (Willd. Vatke], 1425 Mb [ (Delile Benth.] and 2460 Mb ( Benth. suggest a ploidy series, a prediction supported by repetitive DNA sequence analysis. Phylogenetic analysis using six chloroplast loci indicated the ancestral relationships of the five most agriculturally important species, with the unexpected result that the one parasite of dicotyledonous plants ( was found to be more closely related to some of the grass parasites than many of the grass parasites are to each other.
The N-terminal of a heparin-binding sperm membrane mitogen possess lectin-like sequence

International Nuclear Information System (INIS)

Mor, Visesato; Chatterjee, Tapati

2007-01-01

Glycosaminoglycans like heparin and heparin sulfate in follicular fluid induce changes in the intracellular environment during the spermatozoal functional maturation. We previously reported the isolation, purification and partial characterization of a heparin binding sperm membrane protein (HBSM). In the present study, the amino acids analysis provided evidence of a single sequence, which suggest the homogeneity of the purified HBSM. Fourteen amino acids- 1 A D T I V A V E L D T Y P N 14 -correspond to the amino terminal sequence of Concanavalin A (Con A) and contain 45.2% carbohydrate by weight. HBSM possess mitogenic property on lymphocytes with comparable magnitude to the well-known mitogen; Con A, inducing 83% radiolabel thymidine incorporation in growing lymphocytes. Unlike Con A, there was no agglutination of cell by HBSM upto 5 ng/ml concentration. Interestingly, we found that heparin and chondroitin sulfate-conjugated HBSM inhibit the proliferative activity. Similar effect was also found with an in-house isolate sulfated glycans; G-I (28% sulfate). In contrast, there was no inhibition by the desulfated form; G-ID. Altogether, our data suggest that the mechanism of cell proliferative pathway may be different for HBSM and Con A
Automated genotyping of dinucleotide repeat markers

Energy Technology Data Exchange (ETDEWEB)

Perlin, M.W.; Hoffman, E.P. [Carnegie Mellon Univ., Pittsburgh, PA (United States)]|[Univ. of Pittsburgh, PA (United States)

1994-09-01

The dinucleotide repeats (i.e., microsatellites) such as CA-repeats are a highly polymorphic, highly abundant class of PCR-amplifiable markers that have greatly streamlined genetic mapping experimentation. It is expected that over 30,000 such markers (including tri- and tetranucleotide repeats) will be characterized for routine use in the next few years. Since only size determination, and not sequencing, is required to determine alleles, in principle, dinucleotide repeat genotyping is easily performed on electrophoretic gels, and can be automated using DNA sequencers. Unfortunately, PCR stuttering with these markers generates not one band for each allele, but a pattern of bands. Since closely spaced alleles must be disambiguated by human scoring, this poses a key obstacle to full automation. We have developed methods that overcome this obstacle. Our model is that the observed data is generated by arithmetic superposition (i.e., convolution) of multiple allele patterns. By quantitatively measuring the size of each component band, and exploiting the unique stutter pattern associated with each marker, closely spaced alleles can be deconvolved; this unambiguously reconstructs the {open_quotes}true{close_quotes} allele bands, with stutter artifact removed. We used this approach in a system for automated diagnosis of (X-linked) Duchenne muscular dystrophy; four multiplexed CA-repeats within the dystrophin gene were assayed on a DNA sequencer. Our method accurately detected small variations in gel migration that shifted the allele size estimate. In 167 nonmutated alleles, 89% (149/167) showed no size variation, 9% (15/167) showed 1 bp variation, and 2% (3/167) showed 2 bp variation. We are currently developing a library of dinucleotide repeat patterns; together with our deconvolution methods, this library will enable fully automated genotyping of dinucleotide repeats from sizing data.
Identification, isolation, and N-terminal sequencing of style glycoproteins associated with self-incompatibility in Nicotiana alata.

Science.gov (United States)

Jahnen, W; Batterham, M P; Clarke, A E; Moritz, R L; Simpson, R J

1989-05-01

S-Gene-associated glycoproteins (S-glycoproteins) from styles of Nicotiana alata, identified by non-equilibrium two-dimensional electrophoresis, were purified by cation exchange fast protein liquid chromatography with yields of 0.5 to 8 micrograms of protein per style, depending on the S-genotype of the plant. The method relies on the highly basic nature of the S-glycoproteins. The elution profiles of the different S-glycoproteins from the fast protein liquid chromatography column were characteristic of each S-glycoprotein, and could be used to establish the S-genotype of plants in outbreeding populations. In all cases, the S-genotype predicted from the style protein profile corresponded to that predicted from DNA gel blot analysis using S-allele-specific DNA probes and to that established by conventional breeding tests. Amino-terminal sequences of five purified S-glycoproteins showed a high degree of homology with the previously published sequences of N. alata and Lycopersicon esculentum S-glycoproteins.
Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

Science.gov (United States)

Pelsy, F.; Merdinoglu, D.

2002-09-01

A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

Human adenovirus serotype 12 virion precursors pMu and pVI are cleaved at amino-terminal and carboxy-terminal sites that conform to the adenovirus 2 endoproteinase cleavage consensus sequence.

Science.gov (United States)

Freimuth, P; Anderson, C W

1993-03-01

The sequence of a 1158-base pair fragment of the human adenovirus serotype 12 (Ad12) genome was determined. This segment encodes the precursors for virion components Mu and VI. Both Ad12 precursors contain two sequences that conform to a consensus sequence motif for cleavage by the endoproteinase of adenovirus 2 (Ad2). Analysis of the amino terminus of VI and of the peptide fragments found in Ad12 virions demonstrated that these sites are cleaved during Ad12 maturation. This observation suggests that the recognition motif for adenovirus endoproteinases is highly conserved among human serotypes. The adenovirus 2 endoproteinase polypeptide requires additional co-factors for activity (C. W. Anderson, Protein Expression Purif., 1993, 4, 8-15). Synthetic Ad12 or Ad2 pVI carboxy-terminal peptides each permitted efficient cleavage of an artificial endoproteinase substrate by recombinant Ad2 endoproteinase polypeptide.
Structure determination of a peptide model of the repeated helical domain in Samia cynthia ricini silk fibroin before spinning by a combination of advanced solid-state NMR methods.

Science.gov (United States)

Nakazawa, Yasumoto; Asakura, Tetsuo

2003-06-18

Fibrous proteins unlike globular proteins, contain repetitive amino acid sequences, giving rise to very regular secondary protein structures. Silk fibroin from a wild silkworm, Samia cynthia ricini, consists of about 100 repeats of alternating polyalanine (poly-Ala) regions of 12-13 residues in length and Gly-rich regions. In this paper, the precise structure of the model peptide, GGAGGGYGGDGG(A)(12)GGAGDGYGAG, which is a typical repeated sequence of the silk fibroin, was determined using a combination of three kinds of solid-state NMR studies; a quantitative use of (13)C CP/MAS NMR chemical shift with conformation-dependent (13)C chemical shift contour plots, 2D spin diffusion (13)C solid-state NMR under off magic angle spinning and rotational echo double resonance. The structure of the model peptide corresponding to the silk fibroin structure before spinning was determined. The torsion angles of the central Ala residue, Ala(19), in the poly-Ala region were determined to be (phi, psi) = (-59 degrees, -48 degrees ) which are values typically associated with alpha-helical structures. However, the torsion angles of the Gly(25) residue adjacent to the C-terminal side of the poly-Ala chain were determined to be (phi, psi) = (-66 degrees, -22 degrees ) and those of Gly(12) and Ala(13) residues at the N-terminal of the poly-Ala chain to be (phi, psi) = (-70 degrees, -30 degrees ). In addition, REDOR experiments indicate that the torsion angles of the two C-terminal Ala residues, Ala(23) and Ala(24), are (phi, psi) = (-66 degrees, -22 degrees ) and those of N-terminal two Ala residues, Ala(13) and Ala(14) are (phi, psi) = (-70 degrees, -30 degrees ). Thus, the local structure of N-terminal and C-terminal residues, and also the neighboring residues of alpha-helical poly-Ala chain in the model peptide is a more strongly wound structure than found in typical alpha-helix structures.
Development of Simple Sequence Repeats (SSR Markers in Setaria italica (Poaceae and Cross-Amplification in Related Species

Directory of Open Access Journals (Sweden)

Chih-Yun Chiang

2011-11-01

Full Text Available Foxtail millet is one of the world’s oldest cultivated crops. It has been adopted as a model organism for providing a deeper understanding of plant biology. In this study, 45 simple sequence repeats (SSR markers of Setaria italica were developed. These markers showing polymorphism were screened in 223 samples from 12 foxtail millet populations around Taiwan. The most common dinucleotide and trinucleotide repeat motifs are AC/TG (84.21% and CAT (46.15%. The average number of alleles (Na, the average heterozygosities observed (Ho and expected (He are 3.73, 0.714, 0.587, respectively. In addition, 24 SSR markers had shown transferability to six related Poaceae species. These new markers provide tools for examining genetic relatedness among foxtail millet populations and other related species. It is suitable for germplasm management and protection in Poaceae.
Genomic clones of bovine parvovirus: Construction and effect of deletions and terminal sequence inversions on infectivity

International Nuclear Information System (INIS)

Shull, B.C.; Chen, K.C.; Lederman, M.; Stout, E.R.; Bates, R.C.

1988-01-01

Genomic clones of the autonomous parvovirus bovine parvovirus (BPV) were constructed by blunt-end ligation of reannealed virion plus and minus DNA strands into the plasmid pUC8. These clones were stable during propagation in Escherichia coli JM107. All clones tested were found to be infectious by the criteria of plaque titer and progressive cytophathic effect after transfection into bovine fetal lung cells. Sequencing of the recombinant plasmids demonstrated that all of the BPV inserts had left-end (3')-terminal deletions of up to 34 bases. Defective genomes could also be detected in the progeny DNA even though the infection was initiated with homogeneous, cloned DNA. Full-length genomic clones with 3' flip and 3' flop conformations were constructed and were found to have equal infectivity. Expression of capsid proteins from tranfected genomes was demonstrated by hemagglutination, indirect immunofluorescence, and immunoprecipitation of [ 35 S]methionine-labeled cell lysates. Use of appropriate antiserum for immunoprecipitation showed the synthesis of BPV capsid and noncapsid proteins after transfection. Independently, a series of genomic clones with increasingly larger 3'-terminal deletions was prepared from separately subcloned 3'-terminal fragments. Transfection of these clones into bovine fetal lung cells revealed that deletions of up to 34 bases at the 3' end lowered but did not abolish infectivity, while deletions of greater than 52 bases were lethal. End-label analysis showed that the 34-base deletion was repaired to wild-type length in the progeny virus
The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

Science.gov (United States)

Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

2013-10-10

Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae
Roles of repetitive sequences

Energy Technology Data Exchange (ETDEWEB)

Bell, G.I.

1991-12-31

The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.
Nonlinear analysis of sequence repeats of multi-domain proteins

Energy Technology Data Exchange (ETDEWEB)

Huang Yanzhao [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Li Mingfeng [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China); Xiao Yi [Biomolecular Physics and Modeling Group, Department of Physics, Huazhong University of Science and Technology, Wuhan 430074, Hubei (China)]. E-mail: lmf_bill@sina.com

2007-11-15

Many multi-domain proteins have repetitive three-dimensional structures but nearly-random amino acid sequences. In the present paper, by using a modified recurrence plot proposed by us previously, we show that these amino acid sequences have hidden repetitions in fact. These results indicate that the repetitive domain structures are encoded by the repetitive sequences. This also gives a method to detect the repetitive domain structures directly from amino acid sequences.
The triplet repeats of the Sin Nombre hantavirus 5' untranslated region are sufficient in cis for nucleocapsid-mediated translation initiation.

Science.gov (United States)

Mir, Mohammad A; Panganiban, Antonito T

2010-09-01

Hantavirus nucleocapsid protein (N) can replace the cellular cap-binding complex, eukaryotic initiation factor 4F (eIF4F), to mediate translation initiation. Although N can augment translation initiation of nonviral mRNA, initiation of viral mRNA by N is superior. All members of the Bunyaviridae family, including the species of the hantavirus genus, express either three or four primary mRNAs from their tripartite negative-sense genomes. The 5' ends of the mRNAs contain nonviral heterologous oligonucleotides that originate from endonucleolytic cleavage of cellular mRNA during the process of cap snatching. In the hantaviruses these caps terminate with a 3' G residue followed by nucleotides arising from the viral template. Further, the 5' untranslated region (UTR) of viral mRNA uniformly contains, near the 5' end, either two or three copies of the triplet repeat sequence, UAGUAG or UAGUAGUAG. Through analysis of a panel of mutants with mutations in the viral UTR, we found that the sequence GUAGUAG is sufficient for preferential N-mediated translation initiation and for high-affinity binding of N to the UTR. This heptanucleotide sequence is present in viral mRNA containing either two or three copies of the triplet repeat.
Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

Science.gov (United States)

Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

2012-12-01

Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
Local repeat sequence organization of an intergenic spacer

Indian Academy of Sciences (India)

The amplification yielded the same uniquely ``sequence-scrambled” product, whether the template used for PCR was total cellular DNA, chloroplast DNA or a plasmid clone DNA corresponding to that region. The PCR product, a ``unique” new sequence, had lost the repetitive organization of the template genome where it ...
The soybean-Phytophthora resistance locus Rps1-k encompasses coiled coil-nucleotide binding-leucine rich repeat-like genes and repetitive sequences

Directory of Open Access Journals (Sweden)

Bhattacharyya Madan K

2008-03-01

Full Text Available Abstract Background A series of Rps (resistance to Pytophthora sojae genes have been protecting soybean from the root and stem rot disease caused by the Oomycete pathogen, Phytophthora sojae. Five Rps genes were mapped to the Rps1 locus located near the 28 cM map position on molecular linkage group N of the composite genetic soybean map. Among these five genes, Rps1-k was introgressed from the cultivar, Kingwa. Rps1-k has been providing stable and broad-spectrum Phytophthora resistance in the major soybean-producing regions of the United States. Rps1-k has been mapped and isolated. More than one functional Rps1-k gene was identified from the Rps1-k locus. The clustering feature at the Rps1-k locus might have facilitated the expansion of Rps1-k gene numbers and the generation of new recognition specificities. The Rps1-k region was sequenced to understand the possible evolutionary steps that shaped the generation of Phytophthora resistance genes in soybean. Results Here the analyses of sequences of three overlapping BAC clones containing the 184,111 bp Rps1-k region are reported. A shotgun sequencing strategy was applied in sequencing the BAC contig. Sequence analysis predicted a few full-length genes including two Rps1-k genes, Rps1-k-1 and Rps1-k-2. Previously reported Rps1-k-3 from this genomic region 1 was evolved through intramolecular recombination between Rps1-k-1 and Rps1-k-2 in Escherichia coli. The majority of the predicted genes are truncated and therefore most likely they are nonfunctional. A member of a highly abundant retroelement, SIRE1, was identified from the Rps1-k region. The Rps1-k region is primarily composed of repetitive sequences. Sixteen simple repeat and 63 tandem repeat sequences were identified from the locus. Conclusion These data indicate that the Rps1 locus is located in a gene-poor region. The abundance of repetitive sequences in the Rps1-k region suggested that the location of this locus is in or near a
Isolation and sequence analysis of the wheat B genome subtelomeric DNA.

Science.gov (United States)

Salina, Elena A; Sergeeva, Ekaterina M; Adonina, Irina G; Shcherban, Andrey B; Afonnikov, Dmitry A; Belcram, Harry; Huneau, Cecile; Chalhoub, Boulos

2009-09-05

Telomeric and subtelomeric regions are essential for genome stability and regular chromosome replication. In this work, we have characterized the wheat BAC (bacterial artificial chromosome) clones containing Spelt1 and Spelt52 sequences, which belong to the subtelomeric repeats of the B/G genomes of wheats and Aegilops species from the section Sitopsis. The BAC library from Triticum aestivum cv. Renan was screened using Spelt1 and Spelt52 as probes. Nine positive clones were isolated; of them, clone 2050O8 was localized mainly to the distal parts of wheat chromosomes by in situ hybridization. The distribution of the other clones indicated the presence of different types of repetitive sequences in BACs. Use of different approaches allowed us to prove that seven of the nine isolated clones belonged to the subtelomeric chromosomal regions. Clone 2050O8 was sequenced and its sequence of 119,737 bp was annotated. It is composed of 33% transposable elements (TEs), 8.2% Spelt52 (namely, the subfamily Spelt52.2) and five non-TE-related genes. DNA transposons are predominant, making up 24.6% of the entire BAC clone, whereas retroelements account for 8.4% of the clone length. The full-length CACTA transposon Caspar covers 11,666 bp, encoding a transposase and CTG-2 proteins, and this transposon accounts for 40% of the DNA transposons. The in situ hybridization data for 2050O8 derived subclones in combination with the BLAST search against wheat mapped ESTs (expressed sequence tags) suggest that clone 2050O8 is located in the terminal bin 4BL-10 (0.95-1.0). Additionally, four of the predicted 2050O8 genes showed significant homology to four putative orthologous rice genes in the distal part of rice chromosome 3S and confirm the synteny to wheat 4BL. Satellite DNA sequences from the subtelomeric regions of diploid wheat progenitor can be used for selecting the BAC clones from the corresponding regions of hexaploid wheat chromosomes. It has been demonstrated for the first time
Small leucine-rich repeat proteoglycans associated with mature insoluble elastin serve as binding sites for galectins.

Science.gov (United States)

Itoh, Aiko; Nonaka, Yasuhiro; Ogawa, Takashi; Nakamura, Takanori; Nishi, Nozomu

2017-11-01

We previously reported that galectin-9 (Gal-9), an immunomodulatory animal lectin, could bind to insoluble collagen preparations and exerted direct cytocidal effects on immune cells. In the present study, we found that mature insoluble elastin is capable of binding Gal-9 and other members of the human galectin family. Lectin blot analysis of a series of commercial water-soluble elastin preparations, PES-(A) ~ PES-(E), revealed that only PES-(E) contained substances recognized by Gal-9. Gal-9-interacting substances in PES-(E) were affinity-purified, digested with trypsin and then analyzed by reversed-phase HPLC. Peptide fragments derived from five members of the small leucine-rich repeat proteoglycan family, versican, lumican, osteoglycin/mimecan, prolargin, and fibromodulin, were identified by N-terminal amino acid sequence analysis. The results indicate that Gal-9 and possibly other galectins recognize glycans attached to small leucine-rich repeat proteoglycans associated with insoluble elastin and also indicate the possibility that mature insoluble elastin serves as an extracellular reservoir for galectins.
Fingerprinting for discriminating tea germplasm using inter-simple sequence repeat (ISSR) markers

International Nuclear Information System (INIS)

Liu, B.Y.; Li, Y.Y.; Wang, P.S.; Wang, L.Y.; Wang, P.S.

2012-01-01

For the discrimination of tea germplasm at the inter-specific level, 134 tea varieties preserved in the China National Germplasm Tea Repositories (CNGTR) were analyzed using inter simple sequence repeat (ISSR) markers. Eighteen primers were chosen from 60 screened for ISSR amplification, generating 99.4% polymorphic bands. The mean Nei's gene diversity (H) and the overall mean Shannon's Information index (I) were 0.396 and 0.578, respectively, indicating a wide gene pool. Using the presence, sometimes absence of unique ISSR markers, it was possible to discriminate 32 of the genotypes tested. No single primer could discriminate all the 134 genotypes. However, UBC811 provided rich band patterns and it can discriminate 35 genotypes. The combination of two and three primers could discriminate 99 and 121 genotypes, respectively. Furthermore, the combination of band patterns or the DNA fingerprinting based on specific ISSR markers generated by UBC811, UBC835, ISSR2 and ISSR3 could discriminate all 134 genotypes tested. ISSR markers also provide a powerful tool to discriminate tea germplasm at the inter-specific level. (author)
Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures

Directory of Open Access Journals (Sweden)

Hayes Michael L

2012-05-01

Full Text Available Abstract Background Pentatricopeptide repeat (PPR proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82 and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. Results All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3’ UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. Conclusion PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative
Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

Science.gov (United States)

Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

2012-05-14

Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target
RNA sequence determinants of a coupled termination-reinitiation strategy for downstream open reading frame translation in Helminthosporium victoriae virus 190S and other victoriviruses (Family Totiviridae).

Science.gov (United States)

Li, Hua; Havens, Wendy M; Nibert, Max L; Ghabrial, Said A

2011-07-01

The genome-length, dicistronic mRNA of the double-stranded RNA fungal virus Helminthosporium victoriae virus 190S (genus Victorivirus, family Totiviridae) contains two long open reading frames (ORFs) that overlap in the tetranucleotide AUGA. Translation of the downstream ORF, which encodes the RNA-dependent RNA polymerase (RdRp), has been proposed to depend on ribosomal reinitiation following termination of the upstream ORF, which encodes the capsid protein. In the current study, we examined the RNA sequence determinants for RdRp translation in this virus and demonstrated that a coupled termination-reinitiation (stop-restart) strategy is indeed used. Signals for termination-reinitiation are found within a 32-nucleotide stretch of RNA immediately upstream of the AUGA motif, including a predicted pseudoknot structure. The close proximity in which this predicted structure is followed by the upstream ORF's stop codon appears to be especially important for promoting translation of the downstream ORF. The normal strong preferences for an AUG start codon and the canonical sequence context to favor translation initiation appear somewhat relaxed for the downstream ORF. Similar sequence motifs and predicted RNA structures in other victoriviruses suggest that they all share a related stop-restart strategy for RdRp translation. Members of the genus Victorivirus thus provide new and unique opportunities for exploring the molecular mechanisms of translational coupling, which remain only partly understood in this and other systems.
UV-induced transcription from the human immunodeficiency virus type 1 (HIV-1) long terminal repeat and UV-induced secretion of an extracellular factor that induces HIV-1 transcription in nonirradiated cells

International Nuclear Information System (INIS)

Stein, B.; Kraemer, M.R.; Rahmsdorf, H.J.; Ponta, H.; Herrlich, P.

1989-01-01

UV irradiation, but not visible sunlight, induces the transcription of human immunodeficiency virus type 1 (HIV-1). Chimeric constructs carrying all or parts of the HIV-1 long terminal repeat linked to an indicator gene were transfected into HeLa cells or murine and human T-cell lines, and their response to irradiation was tested. The cis-acting element conferring UV responsiveness is identical to the sequence binding transcription factor NF kappa B. UV irradiation enhances NF kappa B binding activity as assayed by gel retardation experiments. Interestingly, the requirement for UV irradiation can be replaced by cocultivation of transfected cells with UV-irradiated nontransfected (HIV-1-negative) cells. A UV-induced extracellular protein factor is detected in the culture medium conditioned by UV-treated cells. The factor is produced upon UV irradiation by several murine and human cell lines, including HeLa, Molt-4, and Jurkat, and acts on several cells. These data suggest that the UV response of keratinocytes in human skin can be magnified and spread to deeper layers that are more shielded, including the Langerhans cells, and that this indirect UV response may contribute to the activation of HIV-1 in humans
Structural plasticity of the N-terminal capping helix of the TPR domain of kinesin light chain.

Directory of Open Access Journals (Sweden)

The Quyen Nguyen

Full Text Available Kinesin1 plays a major role in neuronal transport by recruiting many different cargos through its kinesin light chain (KLC. Various structurally unrelated cargos interact with the conserved tetratricopeptide repeat (TPR domain of KLC. The N-terminal capping helix of the TPR domain exhibits an atypical sequence and structural features that may contribute to the versatility of the TPR domain to bind different cargos. We determined crystal structures of the TPR domain of both KLC1 and KLC2 encompassing the N-terminal capping helix and show that this helix exhibits two distinct and defined orientations relative to the rest of the TPR domain. Such a difference in orientation gives rise, at the N-terminal part of the groove, to the formation of one hydrophobic pocket, as well as to electrostatic variations at the groove surface. We present a comprehensive structural analysis of available KLC1/2-TPR domain structures that highlights that ligand binding into the groove can be specific of one or the other N-terminal capping helix orientations. Further, structural analysis reveals that the N-terminal capping helix is always involved in crystal packing contacts, especially in a TPR1:TPR1' contact which highlights its propensity to be a protein-protein interaction site. Together, these results underline that the structural plasticity of the N-terminal capping helix might represent a structural determinant for TPR domain structural versatility in cargo binding.
Untangling spider silk evolution with spidroin terminal domains

Directory of Open Access Journals (Sweden)

Garb Jessica E

2010-08-01

Full Text Available Abstract Background Spidroins are a unique family of large, structural proteins that make up the bulk of spider silk fibers. Due to the highly variable nature of their repetitive sequences, spidroin evolutionary relationships have principally been determined from their non-repetitive carboxy (C-terminal domains, though they offer limited character data. The few known spidroin amino (N-terminal domains have been difficult to obtain, but potentially contain critical phylogenetic information for reconstructing the diversification of spider silks. Here we used silk gland expression data (ESTs from highly divergent species to evaluate the functional significance and phylogenetic utility of spidroin N-terminal domains. Results We report 11 additional spidroin N-termini found by sequencing ~1,900 silk gland cDNAs from nine spider species that shared a common ancestor > 240 million years ago. In contrast to their hyper-variable repetitive regions, spidroin N-terminal domains have retained striking similarities in sequence identity, predicted secondary structure, and hydrophobicity. Through separate and combined phylogenetic analyses of N-terminal domains and their corresponding C-termini, we find that combined analysis produces the most resolved trees and that N-termini contribute more support and less conflict than the C-termini. These analyses show that paralogs largely group by silk gland type, except for the major ampullate spidroins. Moreover, spidroin structural motifs associated with superior tensile strength arose early in the history of this gene family, whereas a motif conferring greater extensibility convergently evolved in two distantly related paralogs. Conclusions A non-repetitive N-terminal domain appears to be a universal attribute of spidroin proteins, likely retained from the origin of spider silk production. Since this time, spidroin N-termini have maintained several features, consistent with this domain playing a key role in silk

Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster.

Directory of Open Access Journals (Sweden)

Raquel S Linheiro

Full Text Available Transposable elements are mobile DNA sequences that integrate into host genomes using diverse mechanisms with varying degrees of target site specificity. While the target site preferences of some engineered transposable elements are well studied, the natural target preferences of most transposable elements are poorly characterized. Using population genomic resequencing data from 166 strains of Drosophila melanogaster, we identified over 8,000 new insertion sites not present in the reference genome sequence that we used to decode the natural target preferences of 22 families of transposable element in this species. We found that terminal inverted repeat transposon and long terminal repeat retrotransposon families present clade-specific target site duplications and target site sequence motifs. Additionally, we found that the sequence motifs at transposable element target sites are always palindromes that extend beyond the target site duplication. Our results demonstrate the utility of population genomics data for high-throughput inference of transposable element targeting preferences in the wild and establish general rules for terminal inverted repeat transposon and long terminal repeat retrotransposon target site selection in eukaryotic genomes.
Repeat-aware modeling and correction of short read errors.

Science.gov (United States)

Yang, Xiao; Aluru, Srinivas; Dorman, Karin S

2011-02-15

High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id = redeem". We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors
Genetic diversity studies in pea (Pisum sativum L.) using simple sequence repeat markers.

Science.gov (United States)

Kumari, P; Basal, N; Singh, A K; Rai, V P; Srivastava, C P; Singh, P K

2013-03-13

The genetic diversity among 28 pea (Pisum sativum L.) genotypes was analyzed using 32 simple sequence repeat markers. A total of 44 polymorphic bands, with an average of 2.1 bands per primer, were obtained. The polymorphism information content ranged from 0.657 to 0.309 with an average of 0.493. The variation in genetic diversity among these cultivars ranged from 0.11 to 0.73. Cluster analysis based on Jaccard's similarity coefficient using the unweighted pair-group method with arithmetic mean (UPGMA) revealed 2 distinct clusters, I and II, comprising 6 and 22 genotypes, respectively. Cluster II was further differentiated into 2 subclusters, IIA and IIB, with 12 and 10 genotypes, respectively. Principal component (PC) analysis revealed results similar to those of UPGMA. The first, second, and third PCs contributed 21.6, 16.1, and 14.0% of the variation, respectively; cumulative variation of the first 3 PCs was 51.7%.
The C-terminal sequence of several human serine proteases encodes host defense functions.

Science.gov (United States)

Kasetty, Gopinath; Papareddy, Praveen; Kalle, Martina; Rydengård, Victoria; Walse, Björn; Svensson, Bo; Mörgelin, Matthias; Malmsten, Martin; Schmidtchen, Artur

2011-01-01

Serine proteases of the S1 family have maintained a common structure over an evolutionary span of more than one billion years, and evolved a variety of substrate specificities and diverse biological roles, involving digestion and degradation, blood clotting, fibrinolysis and epithelial homeostasis. We here show that a wide range of C-terminal peptide sequences of serine proteases, particularly from the coagulation and kallikrein systems, share characteristics common with classical antimicrobial peptides of innate immunity. Under physiological conditions, these peptides exert antimicrobial effects as well as immunomodulatory functions by inhibiting macrophage responses to bacterial lipopolysaccharide. In mice, selected peptides are protective against lipopolysaccharide-induced shock. Moreover, these S1-derived host defense peptides exhibit helical structures upon binding to lipopolysaccharide and also permeabilize liposomes. The results uncover new and fundamental aspects on host defense functions of serine proteases present particularly in blood and epithelia, and provide tools for the identification of host defense molecules of therapeutic interest. Copyright © 2011 S. Karger AG, Basel.
Initial study of stability and repeatability of measuring R2' and oxygen extraction fraction values in the healthy brain with gradient-echo sampling of spin-echo sequence

International Nuclear Information System (INIS)

Hui Lihong; Zhang Xiaodong; He Chao; Xie Sheng; Xiao Jiangxi; Zhang jue; Wang Xiaoying; Jiang Xuexiang

2010-01-01

Objective: To evaluate the stability and repeatability of gradient-echo sampling of spin- echo (GESSE) sequence in measuring the R 2 ' value in volunteers, by comparison with traditional GRE sequence (T 2 * ]nap and T 2 map). Methods: Eight normal healthy volunteers were enrolled in this study and written informed consents were obtained from all subjects. MR scanning including sequences of GESSE, T 2 map and T 2 * map were performed in these subjects at resting status. The same protocol was repeated one day later. Raw data from GESSE sequence were transferred to PC to conduct postprocessing with the software built in house. R 2 ' map and OEF map were got consequently. To obtain quantitative R 2 ' and OEF values in the brain parenchyma, six ROIs were equally placed in the anterior, middle and posterior part of bilateral hemispheres. Both mean and standard deviation of R 2 ' and OEF were recorded. All images from T 2 * map and T 2 map were transferred to the Workstation for postprocessing. The ROIs were put at the same areas as those for GESSE sequence. R 2 ' is defined as R 2 ' = R 2 * - R 2 , R 2 * = 1/T 2 * . The R 2 ' value of GESSE sequence were compared with that of GRE sequence. Results: The mean R 2 ' values of GESSE at the first and second scan and those of the GRE were (4.21±0.92), (4.45±0.94) Hz and (7.37±1.47), (6.42±2.33) Hz respectively. The mean OEF values of GESSE at the first and second scan is 0.327±0.036 and 0.336± 0.035 respectively. The R 2 ' value and OEF value obtained from GESSE were not significantly different between the first and second scan (t=-0.83, -1.48, P>0.05). The R 2 ' value of first GRE imaging had significantly statistical difference from that of second GRE imaging (t=1.80, P 2 ' value of GESSE sequence was less than that of GRE sequence, and there was significantly statistical difference between them (t=1.71, P<0.05). Conclusion: The GESSE sequence has good stability and repeatability with promising clinical practicability
Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

Science.gov (United States)

Cech, Jennifer N; Peichel, Catherine L

2015-12-01

Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chomatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Directory of Open Access Journals (Sweden)

Huaiyong Luo

Full Text Available The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Science.gov (United States)

Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

2015-01-01

The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers were established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
[Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

Science.gov (United States)

Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

2015-04-01

This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
Yeast eIF4B binds to the head of the 40S ribosomal subunit and promotes mRNA recruitment through its N-terminal and internal repeat domains.

Science.gov (United States)

Walker, Sarah E; Zhou, Fujun; Mitchell, Sarah F; Larson, Victoria S; Valasek, Leos; Hinnebusch, Alan G; Lorsch, Jon R

2013-02-01

Eukaryotic translation initiation factor (eIF)4B stimulates recruitment of mRNA to the 43S ribosomal pre-initiation complex (PIC). Yeast eIF4B (yeIF4B), shown previously to bind single-stranded (ss) RNA, consists of an N-terminal domain (NTD), predicted to be unstructured in solution; an RNA-recognition motif (RRM); an unusual domain comprised of seven imperfect repeats of 26 amino acids; and a C-terminal domain. Although the mechanism of yeIF4B action has remained obscure, most models have suggested central roles for its RRM and ssRNA-binding activity. We have dissected the functions of yeIF4B's domains and show that the RRM and its ssRNA-binding activity are dispensable in vitro and in vivo. Instead, our data indicate that the 7-repeats and NTD are the most critical domains, which mediate binding of yeIF4B to the head of the 40S ribosomal subunit via interaction with Rps20. This interaction induces structural changes in the ribosome's mRNA entry channel that could facilitate mRNA loading. We also show that yeIF4B strongly promotes productive interaction of eIF4A with the 43S•mRNA PIC in a manner required for efficient mRNA recruitment.
75 FR 60258 - Federal Acquisition Regulation; Termination for Default Reporting

Science.gov (United States)

2010-09-29

..., Sequence 1] RIN 9000-AL45 Federal Acquisition Regulation; Termination for Default Reporting AGENCIES... terminations for cause or default and defective cost or pricing data, into the Past Performance Information... defective cost or pricing data and terminations for cause or default into the FAPIIS module of the PPIRS...
A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

Science.gov (United States)

Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

2017-01-01

TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.
Utilization of a cloned alphoid repeating sequence of human DNA in the study of polymorphism of chromosomal heterochromatin regions

International Nuclear Information System (INIS)

Kruminya, A.R.; Kroshkina, V.G.; Yurov, Yu.B.; Aleksandrov, I.A.; Mitkevich, S.P.; Gindilis, V.M.

1988-01-01

The chromosomal distribution of the cloned PHS05 fragment of human alphoid DNA was studied by in situ hybridization in 38 individuals. It was shown that this DNA fraction is primarily localized in the pericentric regions of practically all chromosomes of the set. Significant interchromosomal differences and a weakly expressed interindividual polymorphism were discovered in the copying ability of this class of repeating DNA sequences; associations were not found between the results of hybridization and the pattern of Q-polymorphism
In situ detection of tandem DNA repeat length

Energy Technology Data Exchange (ETDEWEB)

Yaar, R.; Szafranski, P.; Cantor, C.R.; Smith, C.L. [Boston Univ., MA (United States)

1996-11-01

A simple method for scoring short tandem DNA repeats is presented. An oligonucleotide target, containing tandem repeats embedded in a unique sequence, was hybridized to a set of complementary probes, containing tandem repeats of known lengths. Single-stranded loop structures formed on duplexes containing a mismatched (different) number of tandem repeats. No loop structure formed on duplexes containing a matched (identical) number of tandem repeats. The matched and mismatched loop structures were enzymatically distinguished and differentially labeled by treatment with S1 nuclease and the Klenow fragment of DNA polymerase. 7 refs., 4 figs.
In silico reversal of repeat-induced point mutation (RIP identifies the origins of repeat families and uncovers obscured duplicated genes

Directory of Open Access Journals (Sweden)

Hane James K

2010-11-01

Full Text Available Abstract Background Repeat-induced point mutation (RIP is a fungal genome defence mechanism guarding against transposon invasion. RIP mutates the sequence of repeated DNA and over time renders the affected regions unrecognisable by similarity search tools such as BLAST. Results DeRIP is a new software tool developed to predict the original sequence of a RIP-mutated region prior to the occurrence of RIP. In this study, we apply deRIP to the genome of the wheat pathogen Stagonospora nodorum SN15 and predict the origin of several previously uncharacterised classes of repetitive DNA. Conclusions Five new classes of transposon repeats and four classes of endogenous gene repeats were identified after deRIP. The deRIP process is a new tool for fungal genomics that facilitates the identification and understanding of the role and origin of fungal repetitive DNA. DeRIP is open-source and is available as part of the RIPCAL suite at http://www.sourceforge.net/projects/ripcal.
Draft Sequencing of the Heterozygous Diploid Genome of Satsuma (Citrus unshiu Marc. Using a Hybrid Assembly Approach

Directory of Open Access Journals (Sweden)

Tokurou Shimizu

2017-12-01

Full Text Available Satsuma (Citrus unshiu Marc. is one of the most abundantly produced mandarin varieties of citrus, known for its seedless fruit production and as a breeding parent of citrus. De novo assembly of the heterozygous diploid genome of Satsuma (“Miyagawa Wase” was conducted by a hybrid assembly approach using short-read sequences, three mate-pair libraries, and a long-read sequence of PacBio by the PLATANUS assembler. The assembled sequence, with a total size of 359.7 Mb at the N50 length of 386,404 bp, consisted of 20,876 scaffolds. Pseudomolecules of Satsuma constructed by aligning the scaffolds to three genetic maps showed genome-wide synteny to the genomes of Clementine, pummelo, and sweet orange. Gene prediction by modeling with MAKER-P proposed 29,024 genes and 37,970 mRNA; additionally, gene prediction analysis found candidates for novel genes in several biosynthesis pathways for gibberellin and violaxanthin catabolism. BUSCO scores for the assembled scaffold and predicted transcripts, and another analysis by BAC end sequence mapping indicated the assembled genome consistency was close to those of the haploid Clementine, pummel, and sweet orange genomes. The number of repeat elements and long terminal repeat retrotransposon were comparable to those of the seven citrus genomes; this suggested no significant failure in the assembly at the repeat region. A resequencing application using the assembled sequence confirmed that both kunenbo-A and Satsuma are offsprings of Kishu, and Satsuma is a back-crossed offspring of Kishu. These results illustrated the performance of the hybrid assembly approach and its ability to construct an accurate heterozygous diploid genome.
Cloning and characterization of cDNAs encoding the complete sequence of decay-accelerating factor of human complement

International Nuclear Information System (INIS)

Medof, M.E.; Lublin, D.M.; Holers, V.M.; Ayers, D.J.; Getty, R.R.; Leykam, J.F.; Atkinson, J.P.; Tykocinski, M.L.

1987-01-01

cDNAs encoding the complement decay-accelerating factor (DAF) were isolated from HeLa and differentiated HL-60 λgt cDNA libraries by screening with a codon preference oligonucleotide corresponding to DAF NH 2 -terminal amino acids 3-14. The composite cDNA sequence showed a 347-amino acid protein preceded by an NH 2 -terminal leader peptide sequence. The translated sequence beginning at the DAF NH 2 terminus encodes four contiguous ≅ 61-amino acid long repetitive units of internal homology. The repetitive regions contain four conserved cysteines, one proline, one glycine, one glycine/alanine, four leucines/isoleucines/valines, one serine, three tyrosines/phenylalanines, and on tryptophan and show striking homology to similar regions previously identified in factor B, C2, C4 binding protein, factor H, C1r, factor XIII, interleukin 2 receptor, and serum β 2 -glycoprotein I. The consensus repeats are attached to a 70-amino acid long segment rich in serine and threonine (potential O-glycosylation sites), which is in turn followed by a stretch of hydrophobic amino acids. RNA blot analysis of HeLa and HL-60 RNA revealed three DAF mRNA species of 3.1, 2.7, and 2.0 kilobases. The results indicate that portions of the DAF gene may have evolved from a DNA element common to the above proteins, that DAF cDNA predicts a COOH-terminal anchoring polypeptide, and that distinct species of DAF message are elaborated in cells
THE USE OF INTER SIMPLE SEQUENCE REPEATS (ISSR) IN DISTINGUISHING NEIGHBORING DOUGLAS-FIR TREES AS A MEANS TO IDENTIFYING TREE ROOTS WITH ABOVE-GROUND BIOMASS

Science.gov (United States)

We are attempting to identify specific root fragments from soil cores with individual trees. We successfully used Inter Simple Sequence Repeats (ISSR) to distinguish neighboring old-growth Douglas-fir trees from one another, while maintaining identity among each tree's parts. W...
Identification of multiple binding sites for the THAP domain of the Galileo transposase in the long terminal inverted-repeats☆

Science.gov (United States)

Marzo, Mar; Liu, Danxu; Ruiz, Alfredo; Chalmers, Ronald

2013-01-01

Galileo is a DNA transposon responsible for the generation of several chromosomal inversions in Drosophila. In contrast to other members of the P-element superfamily, it has unusually long terminal inverted-repeats (TIRs) that resemble those of Foldback elements. To investigate the function of the long TIRs we derived consensus and ancestral sequences for the Galileo transposase in three species of Drosophilids. Following gene synthesis, we expressed and purified their constituent THAP domains and tested their binding activity towards the respective Galileo TIRs. DNase I footprinting located the most proximal DNA binding site about 70 bp from the transposon end. Using this sequence we identified further binding sites in the tandem repeats that are found within the long TIRs. This suggests that the synaptic complex between Galileo ends may be a complicated structure containing higher-order multimers of the transposase. We also attempted to reconstitute Galileo transposition in Drosophila embryos but no events were detected. Thus, although the limited numbers of Galileo copies in each genome were sufficient to provide functional consensus sequences for the THAP domains, they do not specify a fully active transposase. Since the THAP recognition sequence is short, and will occur many times in a large genome, it seems likely that the multiple binding sites within the long, internally repetitive, TIRs of Galileo and other Foldback-like elements may provide the transposase with its binding specificity. PMID:23648487
Identification of an osteoclast transcription factor that binds to the human T cell leukemia virus type I-long terminal repeat enhancer element.

Science.gov (United States)

Inoue, D; Santiago, P; Horne, W C; Baron, R

1997-10-03

Transgenic mice expressing human T cell leukemia virus type I (HTLV-I)-tax under the control of HTLV-I-long terminal repeat (LTR) promoter develop skeletal abnormalities with high bone turnover and myelofibrosis. In these animals, Tax is highly expressed in bone with a pattern of expression restricted to osteoclasts and spindle-shaped cells within the endosteal myelofibrosis. To test the hypothesis that lineage-specific transcription factors promote transgene expression from the HTLV-I-LTR in osteoclasts, we first examined tax expression in transgenic bone marrow cultures. Expression was dependent on 1alpha,25-dihydroxycholecalciferol and coincided with tartrate-resistant acid phosphatase (TRAP) expression, a marker of osteoclast differentiation. Furthermore, Tax was expressed in vitronectin receptor-positive mononuclear precursors as well as in mature osteoclast-like cells (OCLs). Consistent with our hypothesis, electrophoretic mobility shift assays revealed the presence of an OCL nuclear factor (NFOC-1) that binds to the LTR 21-base pair direct repeat, a region critical for the promoter activity. This binding is further enhanced by Tax. Since NFOC-1 is absent in macrophages and conserved in osteoclasts among species including human, such a factor may play a role in lineage determination and/or in expression of the differentiated osteoclast phenotype.

[open quotes]Cryptic[close quotes] repeating triplets of purines and pyrimidines (cRRY(i)) are frequent and polymorphic: Analysis of coding cRRY(i) in the proopiomelanocortin (POMC) and TATA-binding protein (TBP) genes

Energy Technology Data Exchange (ETDEWEB)

Gostout, B.; Qiang Liu; Sommer, S.S. (Mayo Clinic/Foundation, Rochester, MN (United States))

1993-06-01

Triplets of the form of purine, purine, pyrimidine (RRY(i)) are enhanced in frequency in the genomes of primates, rodents, and bacteria. Some RRY(i) are [open quotes]cryptic[close quotes] repeats (cRRY(i)) in which no one tandem run of a trinucleotide predominates. A search of human GenBank sequence revealed that the sequences of cRRY(i) are highly nonrandom. Three randomly chosen human cRRY(i) were sequenced in search of polymorphic alleles. Multiple polymorphic alleles were found in cRRY(i) in the coding regions of the genes for proopiomelanocortin (POMC) and TATA-binding protein (TBP). The highly polymorphic TBP cRRY(i) was characterized in detail. Direct sequencing of 157 unrelated human alleles demonstrated the presence of 20 different alleles which resulted in 29--40 consecutive glutamines in the amino-terminal region of TBP. These alleles are differently distributed among the races. PCR was used to screen 1,846 additional alleles in order to characterize more fully the range of variation in the population. Three additional alleles were discovered, but there was no example of a substantial sequence amplification as is seen in the repeat sequences associated with X-linked spinal and bulbar muscular atrophy, myotonic dystrophy, or the fragile-X syndrome. The structure of the TBP cRRY(i) is conserved in the five monkey species examined. In the chimpanzee, examination of four individuals revealed that the cRRY(i) was highly polymorphic, but the pattern of polymorphism differed from that in humans. The TBP cRRY(i) displays both similarities with and differences from the previously described RRY(i) in the coding sequence of the androgen receptor. The data suggest how simple tandem repeats could evolve from cryptic repeats. 18 refs., 3 figs., 6 tabs.
Rate-determining Step of Flap Endonuclease 1 (FEN1) Reflects a Kinetic Bias against Long Flaps and Trinucleotide Repeat Sequences.

Science.gov (United States)

Tarantino, Mary E; Bilotti, Katharina; Huang, Ji; Delaney, Sarah

2015-08-21

Flap endonuclease 1 (FEN1) is a structure-specific nuclease responsible for removing 5'-flaps formed during Okazaki fragment maturation and long patch base excision repair. In this work, we use rapid quench flow techniques to examine the rates of 5'-flap removal on DNA substrates of varying length and sequence. Of particular interest are flaps containing trinucleotide repeats (TNR), which have been proposed to affect FEN1 activity and cause genetic instability. We report that FEN1 processes substrates containing flaps of 30 nucleotides or fewer at comparable single-turnover rates. However, for flaps longer than 30 nucleotides, FEN1 kinetically discriminates substrates based on flap length and flap sequence. In particular, FEN1 removes flaps containing TNR sequences at a rate slower than mixed sequence flaps of the same length. Furthermore, multiple-turnover kinetic analysis reveals that the rate-determining step of FEN1 switches as a function of flap length from product release to chemistry (or a step prior to chemistry). These results provide a kinetic perspective on the role of FEN1 in DNA replication and repair and contribute to our understanding of FEN1 in mediating genetic instability of TNR sequences. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
Construction and sequencing of an infectious clone of the human parvovirus B19

International Nuclear Information System (INIS)

Zhi Ning; Zadori, Zoltan; Brown, Kevin E.; Tijssen, Peter

2004-01-01

Human parvovirus B19 has a nonenveloped, icosahedral capsid packaging a linear single-stranded DNA genome of 5.6 kb with long inverted terminal repeats (ITR) at both the 5' and 3' end. Previous attempts to construct a full-length B19 clone were unsuccessful due to deletions in the ITR sequences. We cloned the complete parvovirus B19 genome with intact ITRs from an aplastic crisis patient. Sequence analysis of the complete viral genome indicated that both 5' and 3' ITRs have two sequence configurations and several base changes within the ITRs compared to previous published sequences. After transfection of the plasmid into permissive cells, spliced and non-spliced viral transcripts and viral capsid proteins could be detected. Southern blot analysis of the DNA purified from the plasmid-transfected cells confirmed parvovirus B19 DNA replication. Production of infectious virus by the B19 plasmid was shown by inoculation of cell lysate derived from transfected cells into fresh cells. Together, these results indicate the first successful production of an infectious clone for parvovirus B19 virus
Assessment of Cultivar Distinctness in Alfalfa: A Comparison of Genotyping-by-Sequencing, Simple-Sequence Repeat Marker, and Morphophysiological Observations

Directory of Open Access Journals (Sweden)

Paolo Annicchiarico

2016-07-01

Full Text Available Cultivar registration agencies typically require morphophysiological trait-based distinctness of candidate cultivars. This requirement is difficult to achieve for cultivars of major perennial forages because of their genetic structure and ever-increasing number of registered material, leading to possible rejection of agronomically valuable cultivars. This study aimed to explore the value of molecular markers applied to replicated bulked plants (three bulks of 100 independent plants each per cultivar to assess alfalfa ( L. subsp. cultivar distinctness. We compared genotyping-by-sequencing information based on 2902 polymorphic single-nucleotide polymorphism (SNP markers (>30 reads per DNA sample with morphophysiological information based on 11 traits and with simple-sequence repeat (SSR marker information from 41 polymorphic markers for their ability to distinguish 11 alfalfa landraces representative of the germplasm from northern Italy. Three molecular criteria, one based on cultivar differences for individual SSR bands and two based on overall SNP marker variation assessed either by statistically significant cultivar differences on principal component axes or discriminant analysis, distinctly outperformed the morphophysiological criterion. Combining the morphophysiological criterion with either molecular marker method increased discrimination among cultivars, since morphophysiological diversity was unrelated to SSR marker-based diversity ( = 0.04 and poorly related to SNP marker-based diversity ( = 0.23, < 0.15. The criterion based on statistically significant SNP allele frequency differences was less discriminating than morphophysiological variation. Marker-based distinctness, which can be assessed at low cost and without interactions with testing conditions, could validly substitute for (or complement morphophysiological distinctness in alfalfa cultivar registration schemes. It also has interest in sui generis registration systems aimed at
Agarose gel electrophoresis and polyacrylamide gel electrophoresis for visualization of simple sequence repeats.

Science.gov (United States)

Anderson, James; Wright, Drew; Meksem, Khalid

2013-01-01

In the modern age of genetic research there is a constant search for ways to improve the efficiency of plant selection. The most recent technology that can result in a highly efficient means of selection and still be done at a low cost is through plant selection directed by simple sequence repeats (SSRs or microsatellites). The molecular markers are used to select for certain desirable plant traits without relying on ambiguous phenotypic data. The best way to detect these is the use of gel electrophoresis. Gel electrophoresis is a common technique in laboratory settings which is used to separate deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) by size. Loading DNA and RNA onto gels allows for visualization of the size of fragments through the separation of DNA and RNA fragments. This is achieved through the use of the charge in the particles. As the fragments separate, they form into distinct bands at set sizes. We describe the ability to visualize SSRs on slab gels of agarose and polyacrylamide gel electrophoresis.
Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

Science.gov (United States)

Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

2002-07-01

Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.
Isolation and sequence analysis of the wheat B genome subtelomeric DNA

Directory of Open Access Journals (Sweden)

Huneau Cecile

2009-09-01

Full Text Available Abstract Background Telomeric and subtelomeric regions are essential for genome stability and regular chromosome replication. In this work, we have characterized the wheat BAC (bacterial artificial chromosome clones containing Spelt1 and Spelt52 sequences, which belong to the subtelomeric repeats of the B/G genomes of wheats and Aegilops species from the section Sitopsis. Results The BAC library from Triticum aestivum cv. Renan was screened using Spelt1 and Spelt52 as probes. Nine positive clones were isolated; of them, clone 2050O8 was localized mainly to the distal parts of wheat chromosomes by in situ hybridization. The distribution of the other clones indicated the presence of different types of repetitive sequences in BACs. Use of different approaches allowed us to prove that seven of the nine isolated clones belonged to the subtelomeric chromosomal regions. Clone 2050O8 was sequenced and its sequence of 119 737 bp was annotated. It is composed of 33% transposable elements (TEs, 8.2% Spelt52 (namely, the subfamily Spelt52.2 and five non-TE-related genes. DNA transposons are predominant, making up 24.6% of the entire BAC clone, whereas retroelements account for 8.4% of the clone length. The full-length CACTA transposon Caspar covers 11 666 bp, encoding a transposase and CTG-2 proteins, and this transposon accounts for 40% of the DNA transposons. The in situ hybridization data for 2050O8 derived subclones in combination with the BLAST search against wheat mapped ESTs (expressed sequence tags suggest that clone 2050O8 is located in the terminal bin 4BL-10 (0.95-1.0. Additionally, four of the predicted 2050O8 genes showed significant homology to four putative orthologous rice genes in the distal part of rice chromosome 3S and confirm the synteny to wheat 4BL. Conclusion Satellite DNA sequences from the subtelomeric regions of diploid wheat progenitor can be used for selecting the BAC clones from the corresponding regions of hexaploid wheat
Evaluation of Mammalian Interspersed Repeats to investigate the goat genome

Directory of Open Access Journals (Sweden)

P. Mariani

2010-01-01

Full Text Available Among the repeated sequences present in most eukaryotic genomes, SINEs (Short Interspersed Nuclear Elements are widely used to investigate evolution in the mammalian order (Buchanan et al., 1999. One family of these repetitive sequences, the MIR (Mammalian Interspersed Repeats; Jurka et al., 1995, is ubiquitous in all mammals.MIR elements are tRNA-derived SINEs and are identifiable by a conserved core region of about 70 nucleotides.
Mapping the transcription termination region of the mouse immunoglobulin kappa gene

International Nuclear Information System (INIS)

Xu, M.; Garrard, W.T.

1986-01-01

To define the transcription termination region of the mouse immunoglobulin kappa gene, they have subcloned single copy DNA sequences corresponding to both the template and the non-template strands of this locus. In vitro nuclear transcription with isolated MPC-11 nuclei was performed and the resulting 32 P-labeled RNA was hybridized to slot-blotted, single-stranded M13 probes covering regions within and flanking the kappa gene. The hybridization pattern for the template-strand reveals that transcription terminates within the region between 1.1 to 2.3 kb downstream from the poly(A) site. Ten different short sequences (8-13 bp) reside within 460 bp of this region that exhibit homology with sequences found in the termination regions of mouse β-globin and chicken ovalbumin genes. Transcription of the non-template strand occurs on either side of this termination region. They note that no transcription is detectable on the non-template strand downstream of the enhancer, indicating that if RNA polymerase II enters at this site, it does not initiate transcription during transit to the promoter region. They conclude that transcription of the kappa gene passes the poly(A) addition site and terminates within 2.3 Kb downstream
Regulation of HFE expression by poly(ADP-ribose) polymerase-1 (PARP1) through an inverted repeat DNA sequence in the distal promoter.

Science.gov (United States)

Pelham, Christopher; Jimenez, Tamara; Rodova, Marianna; Rudolph, Angela; Chipps, Elizabeth; Islam, M Rafiq

2013-12-01

Hereditary hemochromatosis (HH) is a common autosomal recessive disorder of iron overload among Caucasians of northern European descent. Over 85% of all cases with HH are due to mutations in the hemochromatosis protein (HFE) involved in iron metabolism. Although the importance in iron homeostasis is well recognized, the mechanism of sensing and regulating iron absorption by HFE, especially in the absence of iron response element in its gene, is not fully understood. In this report, we have identified an inverted repeat sequence (ATGGTcttACCTA) within 1700bp (-1675/+35) of the HFE promoter capable to form cruciform structure that binds PARP1 and strongly represses HFE promoter. Knockdown of PARP1 increases HFE mRNA and protein. Similarly, hemin or FeCl3 treatments resulted in increase in HFE expression by reducing nuclear PARP1 pool via its apoptosis induced cleavage, leading to upregulation of the iron regulatory hormone hepcidin mRNA. Thus, PARP1 binding to the inverted repeat sequence on the HFE promoter may serve as a novel iron sensing mechanism as increased iron level can trigger PARP1 cleavage and relief of HFE transcriptional repression. © 2013.
Rhoptry-associated protein (rap-1) genes in the sheep pathogen Babesia sp. Xinjiang: Multiple transcribed copies differing by 3' end repeated sequences.

Science.gov (United States)

Niu, Qingli; Marchand, Jordan; Yang, Congshan; Bonsergent, Claire; Guan, Guiquan; Yin, Hong; Malandrin, Laurence

2015-07-30

Sheep babesiosis occurs mainly in tropical and subtropical areas. The sheep parasite Babesia sp. Xinjiang is widespread in China, and our goal is to characterize rap-1 (rhoptry-associated protein 1) gene diversity and expression as a first step of a long term goal aiming at developing a recombinant subunit vaccine. Seven different rap-1a genes were amplified in Babesia sp. Xinjiang, using degenerate primers designed from conserved motifs. Rap-1b and rap-1c gene types could not be identified. In all seven rap-1a genes, the 5' regions exhibited identical sequences over 936 nt, and the 3' regions differed at 28 positions over 147 nt, defining two types of genes designated α and β. The remaining 3' part varied from 72 to 360 nt in length, depending on the gene. This region consists of a succession of two to ten 36 nt repeats, which explains the size differences. Even if the nucleotide sequences varied, 6 repeats encoded the same stretch of amino acids. Transcription of at least four α and two β genes was demonstrated by standard RT-PCR. Copyright © 2015 Elsevier B.V. All rights reserved.
R-loops: targets for nuclease cleavage and repeat instability.

Science.gov (United States)

Freudenreich, Catherine H

2018-01-11

R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.
Germ-line CAG repeat instability causes extreme CAG repeat expansion with infantile-onset spinocerebellar ataxia type 2

DEFF Research Database (Denmark)

Vinther-Jensen, Tua; Ek, Jakob; Duno, Morten

2013-01-01

The spinocerebellar ataxias (SCA) are a genetically and clinically heterogeneous group of diseases, characterized by dominant inheritance, progressive cerebellar ataxia and diverse extracerebellar symptoms. A subgroup of the ataxias is caused by unstable CAG-repeat expansions in their respective ...... of paternal germ-line repeat sequence instability of the expanded SCA2 locus.European Journal of Human Genetics advance online publication, 10 October 2012; doi:10.1038/ejhg.2012.231....
A family of DNA repeats in Aspergillus nidulans has assimilated degenerated retrotransposons

DEFF Research Database (Denmark)

Nielsen, M.L.; Hermansen, T.D.; Aleksenko, Alexei Y.

2001-01-01

In the course of a chromosomal walk towards the centromere of chromosome IV of Aspergillus nidulans, several cross- hybridizing genomic cosmid clones were isolated. Restriction mapping of two such clones revealed that their restriction patterns were similar in a region of at least 15 kb, indicati......) phenomenon, first described in Neurospora crassa, may have operated in A. nidulans. The data indicate that this family of repeats has assimilated mobile elements that subsequently degenerated but then underwent further duplications as a part of the host repeats....... the presence of a large repeat. The nature of the repeat was further investigated by sequencing and Southern analysis. The study revealed a family of long dispersed repeats with a high degree of sequence similarity. The number and location of the repeats vary between wild isolates. Two copies of the repeat...
Female Choice Reveals Terminal Investment in Male Mealworm Beetles, Tenebrio molitor, after a Repeated Activation of the Immune System

Science.gov (United States)

Krams, I; Daukšte, J; Kivleniece, I; Krama, T; Rantala, MJ; Ramey, G; Šauša, L

2011-01-01

Increasing evidence suggests that secondary sexual traits reflect immunocompetence of males in many animal species. This study experimentally investigated whether a parasite-like immunological challenge via a nylon implant affects sexual attractiveness of males in Tenebrio molitor L. (Coleoptera: Tenebrionidae) Although a single immunological challenge significantly reduced sexual attractiveness and locomotor activity of males, it had no adverse effect on their survival. A second immune challenge of the same males increased their attractiveness. However, it was found that the repeated challenge significantly reduced locomotor activity of males and caused higher mortality. This result indicates terminal investment on sexual signaling, which is supposedly based on a trade-off between pheromone production and energy expenditures needed for such activities as recovery of immune system and locomotor activity. When the third implantation was carried out in the same group of males, melanization of nylon implants was found to be lower in more attractive than in less attractive males. This suggests that males that became sexually attractive after the second immune challenge did not invest in recovery of their immune system. PMID:21864151
Study of simple sequence repeat (SSR) polymorphism for biotic ...

African Journals Online (AJOL)

home

2013-10-02

Oct 2, 2013 ... G. Siva Kumar1, K. Aruna Kumari1*, Ch. V. Durga Rani1, R. M. Sundaram2, S. Vanisree3, Md. ..... review by Jena and Mackill (2008) provided the list of .... repeat protein and is a member of a resistance gene cluster on rice.
Mononucleotide repeats are asymmetrically distributed in fungal genes

NARCIS (Netherlands)

Passel, van M.W.J.; Graaff, de L.H.

2008-01-01

ABSTRACT: BACKGROUND: Systematic analyses of sequence features have resulted in a better characterisation of the organisation of the genome. A previous study in prokaryotes on the distribution of sequence repeats, which are notoriously variable and can disrupt the reading frame in genes, showed that
Efficient farnesylation of an extended C-terminal C(x)3X sequence motif expands the scope of the prenylated proteome.

Science.gov (United States)

Blanden, Melanie J; Suazo, Kiall F; Hildebrandt, Emily R; Hardgrove, Daniel S; Patel, Meet; Saunders, William P; Distefano, Mark D; Schmidt, Walter K; Hougland, James L

2018-02-23

Protein prenylation is a post-translational modification that has been most commonly associated with enabling protein trafficking to and interaction with cellular membranes. In this process, an isoprenoid group is attached to a cysteine near the C terminus of a substrate protein by protein farnesyltransferase (FTase) or protein geranylgeranyltransferase type I or II (GGTase-I and GGTase-II). FTase and GGTase-I have long been proposed to specifically recognize a four-amino acid C AAX C-terminal sequence within their substrates. Surprisingly, genetic screening reveals that yeast FTase can modify sequences longer than the canonical C AAX sequence, specifically C( x ) 3 X sequences with four amino acids downstream of the cysteine. Biochemical and cell-based studies using both peptide and protein substrates reveal that mammalian FTase orthologs can also prenylate C( x ) 3 X sequences. As the search to identify physiologically relevant C( x ) 3 X proteins begins, this new prenylation motif nearly doubles the number of proteins within the yeast and human proteomes that can be explored as potential FTase substrates. This work expands our understanding of prenylation's impact within the proteome, establishes the biologically relevant reactivity possible with this new motif, and opens new frontiers in determining the impact of non-canonically prenylated proteins on cell function. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Neurospora tryptophan synthase: N-terminal analysis and the sequence of the pyridoxal phosphate active site peptide

International Nuclear Information System (INIS)

Pratt, M.L.; Hsu, P.Y.; DeMoss, J.A.

1986-01-01

Tryptophan synthase (TS), which catalyzes the final step of tryptophan biosynthesis, is a multifunctional protein requiring pyridoxal phosphate (B6P) for two of its three distinct enzyme activities. TS from Neurospora has a blocked N-terminal, is a homodimer of 150 KDa and binds one mole of B6P per mole of subunit. The authors shown the N-terminal residue to be acyl-serine. The B6P-active site of holoenzyme was labelled by reduction of the B6P-Schiff base with [ 3 H]-NaBH 4 , and resulted in a proportionate loss of activity in the two B6P-requiring reactions. SDS-polyacrylamide gel electrophoresis of CNBr-generated peptides showed the labelled, active site peptide to be 6 KDa. The sequence of this peptide, purified to apparent homogeneity by a combination of C-18 reversed phase and TSK gel filtration HPLC is: gly-arg-pro-gly-gln-leu-his-lys-ala-glu-arg-leu-thr-glu-tyr-ala-gly-gly-ala-gln-ile-xxx-leu-lys-arg-glu-asp-leu-asn-his-xxx-gly-xxx-his-/sub ***/-ile-asn-asn-ala-leu. Although four residues (xxx, /sub ***/) are unidentified, this peptide is minimally 78% homologous with the corresponding peptide from yeast TS, in which residue (/sub ***/) is the lysine that binds B6P
Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

Science.gov (United States)

Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

2015-01-01

Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 % and 93.7 % of the sequenced clones contained in microsatellite repeats. Unique primer-pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic with the polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished indicative of the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioca, Momordica cochinchinesis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.

Alu repeats as markers for forensic DNA analyses

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Kass, D.H. [Louisiana State Univ., New Orleans, LA (United States)] [and others

1994-01-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 98.9% nucleotide identity with the HS subfamily consensus sequence, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5 inch and 3 inch unique flanking DNA sequences from each HS Alu that allow the locus to be assayed for the presence or absence of the Alu repeat. The dimorphic HS Alu sequences probably inserted in the human genome after the radiation of modem humans (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project. HS Alu family member insertions differ from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) in that polymorphisms due to Alu insertions arise as a result of a unique event which has occurred only one time in the human population and spread through the population from that point. Therefore, individuals that share HS Alu repeats inherited these elements from a common ancestor. Most VNTR and RFLP polymorphisms may arise multiple times in parallel within a population.
The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

Science.gov (United States)

Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

2017-01-01

Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections
Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana

Directory of Open Access Journals (Sweden)

Yuan Tong

2010-01-01

Full Text Available Abstract Background Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 Leucine-rich repeat receptor-like kinases (LRR-RLKs, representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated. Results As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs of 194 LRR-RLKs were cloned into the GatewayR donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into GatewayR destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC at Ohio State University for full accessibility by the Arabidopsis research community. Conclusions Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.
NMR Analysis of Amide Hydrogen Exchange Rates in a Pentapeptide-Repeat Protein from A. thaliana.

Science.gov (United States)

Xu, Shenyuan; Ni, Shuisong; Kennedy, Michael A

2017-05-23

At2g44920 from Arabidopsis thaliana is a pentapeptide-repeat protein (PRP) composed of 25 repeats capped by N- and C-terminal α-helices. PRP structures are dominated by four-sided right-handed β-helices typically consisting of mixtures of type II and type IV β-turns. PRPs adopt repeated five-residue (Rfr) folds with an Rfr consensus sequence (STAV)(D/N)(L/F)(S/T/R)(X). Unlike other PRPs, At2g44920 consists exclusively of type II β-turns. At2g44920 is predicted to be located in the thylakoid lumen although its biochemical function remains unknown. Given its unusual structure, we investigated the biophysical properties of At2g44920 as a representative of the β-helix family to determine if it had exceptional global stability, backbone dynamics, or amide hydrogen exchange rates. Circular dichroism measurements yielded a melting point of 62.8°C, indicating unexceptional global thermal stability. Nuclear spin relaxation measurements indicated that the Rfr-fold core was rigid with order parameters ranging from 0.7 to 0.9. At2g44920 exhibited a striking range of amide hydrogen exchange rates spanning 10 orders of magnitude, with lifetimes ranging from minutes to several months. A weak correlation was found among hydrogen exchange rates, hydrogen bonding energies, and amino acid solvent-accessible areas. Analysis of contributions from fast (approximately picosecond to nanosecond) backbone dynamics to amide hydrogen exchange rates revealed that the average order parameter of amides undergoing fast exchange was significantly smaller compared to those undergoing slow exchange. Importantly, the activation energies for amide hydrogen exchange were found to be generally higher for the slowest exchanging amides in the central Rfr coil and decreased toward the terminal coils. This could be explained by assuming that the concerted motions of two preceding or following coils required for hydrogen bond disruption and amide hydrogen exchange have a higher activation energy
N-Terminal Domains in Two-Domain Proteins Are Biased to Be Shorter and Predicted to Fold Faster Than Their C-Terminal Counterparts

Directory of Open Access Journals (Sweden)

Etai Jacob

2013-04-01

Full Text Available Computational analysis of proteomes in all kingdoms of life reveals a strong tendency for N-terminal domains in two-domain proteins to have shorter sequences than their neighboring C-terminal domains. Given that folding rates are affected by chain length, we asked whether the tendency for N-terminal domains to be shorter than their neighboring C-terminal domains reflects selection for faster-folding N-terminal domains. Calculations of absolute contact order, another predictor of folding rate, provide additional evidence that N-terminal domains tend to fold faster than their neighboring C-terminal domains. A possible explanation for this bias, which is more pronounced in prokaryotes than in eukaryotes, is that faster folding of N-terminal domains reduces the risk for protein aggregation during folding by preventing formation of nonnative interdomain interactions. This explanation is supported by our finding that two-domain proteins with a shorter N-terminal domain are much more abundant than those with a shorter C-terminal domain.
The DUB/USP17 deubiquitinating enzymes: A gene family within a tandemly repeated sequence, is also embedded within the copy number variable Beta-defensin cluster

Directory of Open Access Journals (Sweden)

Scott Christopher J

2010-04-01

Full Text Available Abstract Background The DUB/USP17 subfamily of deubiquitinating enzymes were originally identified as immediate early genes induced in response to cytokine stimulation in mice (DUB-1, DUB-1A, DUB-2, DUB-2A. Subsequently we have identified a number of human family members and shown that one of these (DUB-3 is also cytokine inducible. We originally showed that constitutive expression of DUB-3 can block cell proliferation and more recently we have demonstrated that this is due to its regulation of the ubiquitination and activity of the 'CAAX' box protease RCE1. Results Here we demonstrate that the human DUB/USP17 family members are found on both chromosome 4p16.1, within a block of tandem repeats, and on chromosome 8p23.1, embedded within the copy number variable beta-defensin cluster. In addition, we show that the multiple genes observed in humans and other distantly related mammals have arisen due to the independent expansion of an ancestral sequence within each species. However, it is also apparent when sequences from humans and the more closely related chimpanzee are compared, that duplication events have taken place prior to these species separating. Conclusions The observation that the DUB/USP17 genes, which can influence cell growth and survival, have evolved from an unstable ancestral sequence which has undergone multiple and varied duplications in the species examined marks this as a unique family. In addition, their presence within the beta-defensin repeat raises the question whether they may contribute to the influence of this repeat on immune related conditions.
Karyological characterization and identification of four repetitive element groups (the 18S – 28S rRNA gene, telomeric sequences, microsatellite repeat motifs, Rex retroelements) of the Asian swamp eel (Monopterus albus)

Science.gov (United States)

Suntronpong, Aorarat; Thapana, Watcharaporn; Twilprawat, Panupon; Prakhongcheep, Ornjira; Somyong, Suthasinee; Muangmai, Narongrit; Surin Peyachoknagul; Srikulnath, Kornsorn

2017-01-01

Abstract Among teleost fishes, Asian swamp eel (Monopterus albus Zuiew, 1793) possesses the lowest chromosome number, 2n = 24. To characterize the chromosome constitution and investigate the genome organization of repetitive sequences in M. albus, karyotyping and chromosome mapping were performed with the 18S – 28S rRNA gene, telomeric repeats, microsatellite repeat motifs, and Rex retroelements. The 18S – 28S rRNA genes were observed to the pericentromeric region of chromosome 4 at the same position with large propidium iodide and C-positive bands, suggesting that the molecular structure of the pericentromeric regions of chromosome 4 has evolved in a concerted manner with amplification of the 18S – 28S rRNA genes. (TTAGGG)n sequences were found at the telomeric ends of all chromosomes. Eight of 19 microsatellite repeat motifs were dispersedly mapped on different chromosomes suggesting the independent amplification of microsatellite repeat motifs in M. albus. Monopterus albus Rex1 (MALRex1) was observed at interstitial sites of all chromosomes and in the pericentromeric regions of most chromosomes whereas MALRex3 was scattered and localized to all chromosomes and MALRex6 to several chromosomes. This suggests that these retroelements were independently amplified or lost in M. albus. Among MALRexs (MALRex1, MALRex3, and MALRex6), MALRex6 showed higher interspecific sequence divergences from other teleost species in comparison. This suggests that the divergence of Rex6 sequences of M. albus might have occurred a relatively long time ago. PMID:29093797
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

Science.gov (United States)

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
DNA Polymerases Drive DNA Sequencing-by-Synthesis Technologies: Both Past and Present

Directory of Open Access Journals (Sweden)

Cheng-Yao eChen

2014-06-01

Full Text Available Next-generation sequencing (NGS technologies have revolutionized modern biological and biomedical research. The engines responsible for this innovation are DNA polymerases; they catalyze the biochemical reaction for deriving template sequence information. In fact, DNA polymerase has been a cornerstone of DNA sequencing from the very beginning. E. coli DNA polymerase I proteolytic (Klenow fragment was originally utilized in Sanger's dideoxy chain terminating DNA sequencing chemistry. From these humble beginnings followed an explosion of organism-specific, genome sequence information accessible via public database. Family A/B DNA polymerases from mesophilic/thermophilic bacteria/archaea were modified and tested in today's standard capillary electrophoresis (CE and NGS sequencing platforms. These enzymes were selected for their efficient incorporation of bulky dye-terminator and reversible dye-terminator nucleotides respectively. Third generation, real-time single molecule sequencing platform requires slightly different enzyme properties. Enterobacterial phage ⱷ29 DNA polymerase copies long stretches of DNA and possesses a unique capability to efficiently incorporate terminal phosphate-labeled nucleoside polyphosphates. Furthermore, ⱷ29 enzyme has also been utilized in emerging DNA sequencing technologies including nanopore-, and protein-transistor-based sequencing. DNA polymerase is, and will continue to be, a crucial component of sequencing technologies.
Revisiting the TALE repeat.

Science.gov (United States)

Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

2014-04-01

Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.
Survey of transposable elements in sugarcane expressed sequence tags (ESTs

Directory of Open Access Journals (Sweden)

Rossi Magdalena

2001-01-01

Full Text Available The sugarcane expressed sequence tag (SUCEST project has produced a large number of cDNA sequences from several plant tissues submitted or not to different conditions of stress. In this paper we report the result of a search for transposable elements (TEs revealing a surprising amount of expressed TEs homologues. Of the 260,781 sequences grouped in 81,223 fragment assembly program (Phrap clusters, a total of 276 clones showed homology to previously reported TEs using a stringent cut-off value of e-50 or better. Homologous clones to Copia/Ty1 and Gypsy/Ty3 groups of long terminal repeat (LTR retrotransposons were found but no non-LTR retroelements were identified. All major transposon families were represented in sugarcane including Activator (Ac, Mutator (MuDR, Suppressor-mutator (En/Spm and Mariner. In order to compare the TE diversity in grasses genomes, we carried out a search for TEs described in sugarcane related species O.sativa, Z. mays and S. bicolor. We also present preliminary results showing the potential use of TEs insertion pattern polymorphism as molecular markers for cultivar identification.
Timeless links replication termination to mitotic kinase activation.

Directory of Open Access Journals (Sweden)

Jayaraju Dheekollu

2011-05-01

Full Text Available The mechanisms that coordinate the termination of DNA replication with progression through mitosis are not completely understood. The human Timeless protein (Tim associates with S phase replication checkpoint proteins Claspin and Tipin, and plays an important role in maintaining replication fork stability at physical barriers, like centromeres, telomeres and ribosomal DNA repeats, as well as at termination sites. We show here that human Tim can be isolated in a complex with mitotic entry kinases CDK1, Auroras A and B, and Polo-like kinase (Plk1. Plk1 bound Tim directly and colocalized with Tim at a subset of mitotic structures in M phase. Tim depletion caused multiple mitotic defects, including the loss of sister-chromatid cohesion, loss of mitotic spindle architecture, and a failure to exit mitosis. Tim depletion caused a delay in mitotic kinase activity in vivo and in vitro, as well as a reduction in global histone H3 S10 phosphorylation during G2/M phase. Tim was also required for the recruitment of Plk1 to centromeric DNA and formation of catenated DNA structures at human centromere alpha satellite repeats. Taken together, these findings suggest that Tim coordinates mitotic kinase activation with termination of DNA replication.
Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

Science.gov (United States)

Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

2012-01-01

Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes.

Science.gov (United States)

Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H; Cavallini, Andrea; Natali, Lucia

2015-11-24

The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes--7 wild accessions and 8 cultivars--of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
[Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

Science.gov (United States)

Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

2009-11-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
Inter- and intra-strain variability of tandem repeats in Mycoplasma pneumoniae based on next-generation sequencing data.

Science.gov (United States)

Zhang, Jing; Song, Xiaohong; Ma, Marella J; Xiao, Li; Kenri, Tsuyoshi; Sun, Hongmei; Ptacek, Travis; Li, Shaoli; Waites, Ken B; Atkinson, T Prescott; Shibayama, Keigo; Dybvig, Kevin; Feng, Yanmei

2017-02-01

To characterize inter- and intra-strain variability of variable-number tandem repeats (VNTRs) in Mycoplasma pneumoniae to determine the optimal multilocus VNTR analysis scheme for improved strain typing. Whole genome assemblies and next-generation sequencing data from diverse M. pneumoniae isolates were used to characterize VNTRs and their variability, and to compare the strain discriminability of new VNTR and existing markers. We identified 13 VNTRs including five reported previously. These VNTRs displayed different levels of inter- and intra-strain copy number variations. All new markers showed similar or higher discriminability compared with existing VNTR markers and the P1 typing system. Our study provides novel insights into VNTR variations and potential new multilocus VNTR analysis schemes for improved genotyping of M. pneumoniae.
Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

Energy Technology Data Exchange (ETDEWEB)

Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others

1994-09-01

Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.
The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

Directory of Open Access Journals (Sweden)

Vergnaud Gilles

2007-05-01

Full Text Available Abstract Background In Archeae and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archeae out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the
The catalytic chain of human complement subcomponent C1r. Purification and N-terminal amino acid sequences of the major cyanogen bromide-cleavage fragments.

Science.gov (United States)

Arlaud, G J; Gagnon, J; Porter, R R

1982-01-01

1. The a- and b-chains of reduced and alkylated human complement subcomponent C1r were separated by high-pressure gel-permeation chromatography and isolated in good yield and in pure form. 2. CNBr cleavage of C1r b-chain yielded eight major peptides, which were purified by gel filtration and high-pressure reversed-phase chromatography. As determined from the sum of their amino acid compositions, these peptides accounted for a minimum molecular weight of 28 000, close to the value 29 100 calculated from the whole b-chain. 3. N-Terminal sequence determinations of C1r b-chain and its CNBr-cleavage peptides allowed the identification of about two-thirds of the amino acids of C1r b-chain. From our results, and on the basis of homology with other serine proteinases, an alignment of the eight CNBr-cleavage peptides from C1r b-chain is proposed. 4. The residues forming the 'charge-relay' system of the active site of serine proteinases (His-57, Asp-102 and Ser-195 in the chymotrypsinogen numbering) are found in the corresponding regions of C1r b-chain, and the amino acid sequence around these residues has been determined. 5. The N-terminal sequence of C1r b-chain has been extended to residue 60 and reveals that C1r b-chain lacks the 'histidine loop', a disulphide bond that is present in all other known serine proteinases.
A viral long terminal repeat expressed in CD4+CD8+ precursors is downregulated in mature peripheral CD4-CD8+ or CD4+CD8- T cells.

OpenAIRE

Paquette, Y; Doyon, L; Laperrière, A; Hanna, Z; Ball, J; Sekaly, R P; Jolicoeur, P

1992-01-01

The long terminal repeat from a thymotropic mouse mammary tumor virus variant, DMBA-LV, was used to drive the expression of two reporter genes, murine c-myc and human CD4, in transgenic mice. Expression was observed specifically in thymic immature cells. Expression of c-myc in these cells induced oligoclonal CD4+ CD8+ T-cell thymomas. Expression of human CD4 was restricted to thymic progenitor CD4- CD8- and CD4+ CD8+ T cells and was shut off in mature CD4+ CD8- and CD4- CD8+ T cells, known to...

The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

Directory of Open Access Journals (Sweden)

Turmel Monique

2007-07-01

Full Text Available Abstract Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales. Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate
Characterization and expression of the maize β-carbonic anhydrase gene repeat regions.

Science.gov (United States)

Tems, Ursula; Burnell, James N

2010-12-01

In maize, carbonic anhydrase (CA; EC 4.2.1.1) catalyzes the first reaction of the C(4) photosynthetic pathway; it catalyzes the hydration of CO(2) to bicarbonate and provides an inorganic carbon source for the primary carboxylation reaction catalyzed by phosphoenolpyruvate (PEP) carboxylase. The β-CA isozymes from maize, as well as other agronomically important NADP-malic enzyme (NADP-ME) type C(4) crops, have remained relatively uncharacterized but differ significantly from the β-CAs of other C(4) monocot species primarily due to transcript length and the presence of repeat sequences. This research confirmed earlier findings of repeat sequences in maize CA transcripts, and demonstrated that the gene encoding these transcripts is also composed of repeat sequences. One of the maize CA genes was sequenced and found to encode two domains, with distinct groups of exons corresponding to the repeat regions of the transcript. We have also shown that expression of a single repeat region of the CA transcript produced active enzyme that associated as a dimer and was composed primarily of α-helices, consistent with that observed for other plant CAs. As the presence of repeat regions in the CA gene is unique to NADP-ME type C(4) monocot species, the implications of these findings in the context of the evolution of the location and function of this C(4) pathway enzyme are strongly suggestive of CA gene duplication resulting in an evolutionary advantage and a higher photosynthetic efficiency. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Expansion of protein domain repeats.

Directory of Open Access Journals (Sweden)

Asa K Björklund

2006-08-01

Full Text Available Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein-protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.
Critical structural and functional roles for the N-terminal insertion sequence in surfactant protein B analogs.

Directory of Open Access Journals (Sweden)

Frans J Walther

2010-01-01

Full Text Available Surfactant protein B (SP-B; 79 residues belongs to the saposin protein superfamily, and plays functional roles in lung surfactant. The disulfide cross-linked, N- and C-terminal domains of SP-B have been theoretically predicted to fold as charged, amphipathic helices, suggesting their participation in surfactant activities. Earlier structural studies with Mini-B, a disulfide-linked construct based on the N- and C-terminal regions of SP-B (i.e., approximately residues 8-25 and 63-78, confirmed that these neighboring domains are helical; moreover, Mini-B retains critical in vitro and in vivo surfactant functions of the native protein. Here, we perform similar analyses on a Super Mini-B construct that has native SP-B residues (1-7 attached to the N-terminus of Mini-B, to test whether the N-terminal sequence is also involved in surfactant activity.FTIR spectra of Mini-B and Super Mini-B in either lipids or lipid-mimics indicated that these peptides share similar conformations, with primary alpha-helix and secondary beta-sheet and loop-turns. Gel electrophoresis demonstrated that Super Mini-B was dimeric in SDS detergent-polyacrylamide, while Mini-B was monomeric. Surface plasmon resonance (SPR, predictive aggregation algorithms, and molecular dynamics (MD and docking simulations further suggested a preliminary model for dimeric Super Mini-B, in which monomers self-associate to form a dimer peptide with a "saposin-like" fold. Similar to native SP-B, both Mini-B and Super Mini-B exhibit in vitro activity with spread films showing near-zero minimum surface tension during cycling using captive bubble surfactometry. In vivo, Super Mini-B demonstrates oxygenation and dynamic compliance that are greater than Mini-B and compare favorably to full-length SP-B.Super Mini-B shows enhanced surfactant activity, probably due to the self-assembly of monomer peptide into dimer Super Mini-B that mimics the functions and putative structure of native SP-B.
Loss and recovery of Arabidopsis-type telomere repeat sequences 5'-(TTTAGGG)(n)-3' in the evolution of a major radiation of flowering plants.

OpenAIRE

Adams, S. P.; Hartman, T. P.; Lim, K. Y.; Chase, M. W.; Bennett, M. D.; Leitch, I. J.; Leitch, A. R.

2001-01-01

Fluorescent in situ hybridization and Southern blotting were used for showing the predominant absence of the Arabidopsis-type telomere repeat sequence (TRS) 5'-(TTTAGGG)(n)-3' (the 'typical' telomere) in a monocot clade which comprises up to 6300 species within Asparagales. Initially, two apparently disparate genera that lacked the typical telomere were identified. Here, we used the new angiosperm phylogenetic classification for predicting in which other related families such telomeres might ...
ASAP: Amplification, sequencing & annotation of plastomes

Directory of Open Access Journals (Sweden)

Folta Kevin M

2005-12-01

Full Text Available Abstract Background Availability of DNA sequence information is vital for pursuing structural, functional and comparative genomics studies in plastids. Traditionally, the first step in mining the valuable information within a chloroplast genome requires sequencing a chloroplast plasmid library or BAC clones. These activities involve complicated preparatory procedures like chloroplast DNA isolation or identification of the appropriate BAC clones to be sequenced. Rolling circle amplification (RCA is being used currently to amplify the chloroplast genome from purified chloroplast DNA and the resulting products are sheared and cloned prior to sequencing. Herein we present a universal high-throughput, rapid PCR-based technique to amplify, sequence and assemble plastid genome sequence from diverse species in a short time and at reasonable cost from total plant DNA, using the large inverted repeat region from strawberry and peach as proof of concept. The method exploits the highly conserved coding regions or intergenic regions of plastid genes. Using an informatics approach, chloroplast DNA sequence information from 5 available eudicot plastomes was aligned to identify the most conserved regions. Cognate primer pairs were then designed to generate ~1 – 1.2 kb overlapping amplicons from the inverted repeat region in 14 diverse genera. Results 100% coverage of the inverted repeat region was obtained from Arabidopsis, tobacco, orange, strawberry, peach, lettuce, tomato and Amaranthus. Over 80% coverage was obtained from distant species, including Ginkgo, loblolly pine and Equisetum. Sequence from the inverted repeat region of strawberry and peach plastome was obtained, annotated and analyzed. Additionally, a polymorphic region identified from gel electrophoresis was sequenced from tomato and Amaranthus. Sequence analysis revealed large deletions in these species relative to tobacco plastome thus exhibiting the utility of this method for structural and
Rapid Acquisition of Choice and Timing and the Provenance of the Terminal-Link Effect

Science.gov (United States)

Kyonka, Elizabeth G. E.; Grace, Randolph C.

2010-01-01

Eight pigeons responded in a concurrent-chains procedure in which terminal-link schedules changed pseudorandomly across sessions. Pairs of terminal-link delays either summed to 15 s or to 45 s. Across sessions, the location of the shorter terminal link changed according to a pseudorandom binary sequence. On some terminal links, food was withheld…
Exact Tandem Repeats Analyzer (E-TRA): A new program for DNA ...

Indian Academy of Sciences (India)

Unknown

Advanced user defined parameters/options let the researchers use different minimum motif repeats ... E-TRA, we used 5,465,605 human EST sequences derived from 18,814,550 ..... repeat rates of T-cells, embryo and testis were higher.
Site-specific Isopeptide Bridge Tethering of Chimeric gp41 N-terminal Heptad Repeat Helical Trimers for the Treatment of HIV-1 Infection

Science.gov (United States)

Wang, Chao; Li, Xue; Yu, Fei; Lu, Lu; Jiang, Xifeng; Xu, Xiaoyu; Wang, Huixin; Lai, Wenqing; Zhang, Tianhong; Zhang, Zhenqing; Ye, Ling; Jiang, Shibo; Liu, Keliang

2016-01-01

Peptides derived from the N-terminal heptad repeat (NHR) of HIV-1 gp41 can be potent inhibitors against viral entry when presented in a nonaggregating trimeric coiled-coil conformation via the introduction of exogenous trimerization motifs and intermolecular disulfide bonds. We recently discovered that crosslinking isopeptide bridges within the de novo helical trimers added exceptional resistance to unfolding. Herein, we attempted to optimize (CCIZN17)3, a representative disulfide bond-stabilized chimeric NHR-trimer, by incorporating site-specific interhelical isopeptide bonds as the redox-sensitive disulfide surrogate. In this process, we systematically examined the effect of isopeptide bond position and molecular sizes of auxiliary trimeric coiled-coil motif and NHR fragments on the antiviral potency of these NHR-trimers. Pleasingly, (IZ14N24N)3 possessed promising inhibitory activity against HIV-1 infection and markedly increased proteolytic stability relative to its disulfide-tethered counterpart, suggesting good potential for further development as an effective antiviral agent for treatment of HIV-1 infection. PMID:27562370
Macro scale models for freight railroad terminals.

Science.gov (United States)

2016-03-02

The project has developed a yard capacity model for macro-level analysis. The study considers the detailed sequence and scheduling in classification yards and their impacts on yard capacities simulate typical freight railroad terminals, and statistic...
Molecular Characterization of Cultivated Bromeliad Accessions with Inter-Simple Sequence Repeat (ISSR Markers

Directory of Open Access Journals (Sweden)

Yongming Yu

2012-05-01

Full Text Available Bromeliads are of great economic importance in flower production; however little information is available with respect to genetic characterization of cultivated bromeliads thus far. In the present study, a selection of cultivated bromeliads was characterized via inter-simple sequence repeat (ISSR markers with an emphasis on genetic diversity and population structure. Twelve ISSR primers produced 342 bands, of which 287 (~84% were polymorphic, with polymorphic bands per primer ranging from 17 to 34. The Jaccard’s similarity ranged from 0.08 to 0.89 and averaged ~0.30 for the investigated bromeliads. The Bayesian-based approach, together with the un-weighted paired group method with arithmetic average (UPGMA-based clustering and the principal coordinate analysis (PCoA, distinctly grouped the bromeliads from Neoregelia, Guzmania, and Vriesea into three separately clusters, well corresponding with their botanical classifications; whereas the bromeliads of Aechmea other than the recently selected hybrids were not well assigned to a cluster. Additionally, ISSR marker was proven efficient for the identification of hybrids and bud sports of cultivated bromeliads. The findings achieved herein will further our knowledge about the genetic variability within cultivated bromeliads and therefore facilitate breeding for new varieties of cultivated bromeliads in future as well.
In vivo activation of human immunodeficiency virus type 1 long terminal repeat by UV type A (UV-A) light plus psoralen and UV-B light in the skin of transgenic mice

OpenAIRE

Morrey, John D; Bourn, S M; Bunch, T D; Jackson, M K; Sidwell, R W; Barrows, L R; Daynes, R A; Rosen, C A

1991-01-01

UV irradiation has been shown to activate the human immunodeficiency virus type 1 (HIV-1) long terminal repeat (LTR) in cell culture; however, only limited studies have been described in vivo. UV light has been categorized as UV-A (400 to 315 nm), -B (315 to 280 nm), or -C (less than 280 nm); the longer wavelengths are less harmful but more penetrative. Highly penetrative UV-A radiation constitutes the vast majority of UV sunlight reaching the earth's surface but is normally harmless. UV-B ir...
Sequencing two cooperating automated stacking cranes in a container terminal

NARCIS (Netherlands)

Vis, I.F.A.; Carlo, H.J.

2010-01-01

The containerized trade market is growing rapidly with the uprising of the Far East. Container ports worldwide should be responsive by developing tools to handle these massive volumes of containers in order to retain their level of competitiveness. One of the areas in a container terminal that is
Analysis of simple sequence repeats in the Gaeumannomyces graminis var. tritici genome and the development of microsatellite markers.

Science.gov (United States)

Li, Wei; Feng, Yanxia; Sun, Haiyan; Deng, Yuanyu; Yu, Hanshou; Chen, Huaigu

2014-11-01

Understanding the genetic structure of Gaeumannomyces graminis var. tritici is essential for the establishment of efficient disease control strategies. It is becoming clear that microsatellites, or simple sequence repeats (SSRs), play an important role in genome organization and phenotypic diversity, and are a large source of genetic markers for population genetics and meiotic maps. In this study, we examined the G. graminis var. tritici genome (1) to analyze its pattern of SSRs, (2) to compare it with other plant pathogenic filamentous fungi, such as Magnaporthe oryzae and M. poae, and (3) to identify new polymorphic SSR markers for genetic diversity. The G. graminis var. tritici genome was rich in SSRs; a total 13,650 SSRs have been identified with mononucleotides being the most common motifs. In coding regions, the densities of tri- and hexanucleotides were significantly higher than in noncoding regions. The di-, tri-, tetra, penta, and hexanucleotide repeats in the G. graminis var. tritici genome were more abundant than the same repeats in M. oryzae and M. poae. From 115 devised primers, 39 SSRs are polymorphic with G. graminis var. tritici isolates, and 8 primers were randomly selected to analyze 116 isolates from China. The number of alleles varied from 2 to 7 and the expected heterozygosity (He) from 0.499 to 0.837. In conclusion, SSRs developed in this study were highly polymorphic, and our analysis indicated that G. graminis var. tritici is a species with high genetic diversity. The results provide a pioneering report for several applications, such as the assessment of population structure and genetic diversity of G. graminis var. tritici.
The binding of TIA-1 to RNA C-rich sequences is driven by its C-terminal RRM domain.

Science.gov (United States)

Cruz-Gallardo, Isabel; Aroca, Ángeles; Gunzburg, Menachem J; Sivakumaran, Andrew; Yoon, Je-Hyun; Angulo, Jesús; Persson, Cecilia; Gorospe, Myriam; Karlsson, B Göran; Wilce, Jacqueline A; Díaz-Moreno, Irene

2014-01-01

T-cell intracellular antigen-1 (TIA-1) is a key DNA/RNA binding protein that regulates translation by sequestering target mRNAs in stress granules (SG) in response to stress conditions. TIA-1 possesses three RNA recognition motifs (RRM) along with a glutamine-rich domain, with the central domains (RRM2 and RRM3) acting as RNA binding platforms. While the RRM2 domain, which displays high affinity for U-rich RNA sequences, is primarily responsible for interaction with RNA, the contribution of RRM3 to bind RNA as well as the target RNA sequences that it binds preferentially are still unknown. Here we combined nuclear magnetic resonance (NMR) and surface plasmon resonance (SPR) techniques to elucidate the sequence specificity of TIA-1 RRM3. With a novel approach using saturation transfer difference NMR (STD-NMR) to quantify protein-nucleic acids interactions, we demonstrate that isolated RRM3 binds to both C- and U-rich stretches with micromolar affinity. In combination with RRM2 and in the context of full-length TIA-1, RRM3 significantly enhanced the binding to RNA, particularly to cytosine-rich RNA oligos, as assessed by biotinylated RNA pull-down analysis. Our findings provide new insight into the role of RRM3 in regulating TIA-1 binding to C-rich stretches, that are abundant at the 5' TOPs (5' terminal oligopyrimidine tracts) of mRNAs whose translation is repressed under stress situations.
Induction of transcription from the long terminal repeat of Moloney murine sarcoma provirus by UV-irradiation, x-irradiation, and phorbol ester

International Nuclear Information System (INIS)

Lin, C.S.; Goldthwait, D.A.; Samols, D.

1990-01-01

The long terminal repeat (LTR) of Moloney murine sarcoma virus (Mo-MuSV) was used as a model system to study the stress response of mammalian cells to physical carcinogens. The chloramphenicol acetyltransferase (CAT) gene was inserted between two Mo-MuSV LTRs, and the LTR-CAT-LTR construct was used for virus production and was integrated into the genome of NIH 3T3 cells in the proviral form. This construct was used to assure that the integrated CAT gene was driven by the promoter of the LTR. Expression of the CAT gene was stimulated 4-fold by UV irradiation, and the peak of activity was observed at 18 hr. In contrast, stimulation of the CAT expression after x-irradiation was 2-fold and occurred at 6 hr. Phorbol myristate acetate also stimulated CAT activity 4-fold with a peak at 6 hr. Down-regulation of protein kinase C blocked totally the response to x-irradiation but only partially the response to UV. The protein kinase inhibitor H7 blocked the response to treatment by UV, x-ray, and phorbol ester
Simple sequence repeats and compositional bias in the bipartite Ralstonia solanacearum GMI1000 genome

Directory of Open Access Journals (Sweden)

Vandamme Peter

2003-03-01

Full Text Available Abstract Background Ralstonia solanacearum is an important plant pathogen. The genome of R. solananearum GMI1000 is organised into two replicons (a 3.7-Mb chromosome and a 2.1-Mb megaplasmid and this bipartite genome structure is characteristic for most R. solanacearum strains. To determine whether the megaplasmid was acquired via recent horizontal gene transfer or is part of an ancestral single chromosome, we compared the abundance, distribution and compositon of simple sequence repeats (SSRs between both replicons and also compared the respective compositional biases. Results Our data show that both replicons are very similar in respect to distribution and composition of SSRs and presence of compositional biases. Minor variations in SSR and compositional biases observed may be attributable to minor differences in gene expression and regulation of gene expression or can be attributed to the small sample numbers observed. Conclusions The observed similarities indicate that both replicons have shared a similar evolutionary history and thus suggest that the megaplasmid was not recently acquired from other organisms by lateral gene transfer but is a part of an ancestral R. solanacearum chromosome.
Highly sensitive detection of individual HEAT and ARM repeats with HHpred and COACH.

Science.gov (United States)

Kippert, Fred; Gerloff, Dietlind L

2009-09-24

HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high
Sequence of the amino-terminal region of rat liver ribosomal proteins S4, S6, S8, L6, L7a, L18, L27, L30, L37, L37a, and L39.

Science.gov (United States)

Wittmann-Liebold, B; Geissler, A W; Lin, A; Wool, I G

1979-01-01

The sequence of the amino-terminal region of eleven rat liver ribosomal proteins--S4, S6, S8, L6, L7a, L18, L27, L30, L37a, and L39--was determined. The analysis confirmed the homogeneity of the proteins and suggests that they are unique, since no extensive common sequences were found. The N-terminal regions of the rat liver proteins were compared with amino acid sequences in Saccharomyces cerevisiae and in Escherichia coli ribosomal proteins. It seems likely that the proteins L37 from rat liver and Y55 from yeast ribosomes are homologous. It is possible that rat liver L7a or L37a or both are related to S cerevisiae Y44, although the similar sequences are at the amino-terminus of the rat liver proteins and in an internal region of Y44. A number of similarities in the sequences of rat liver and E coli ribosomal proteins have been found; however, it is not yet possible to say whether they connote a common ancestry.
The differences in heparin binding for the C-terminal basic-sequence-rich peptides of HPV-16 and HPV-18 capsid protein L1

International Nuclear Information System (INIS)

Sun Jian; Yu Jisheng; Yu Zhiwu; Zha Xiao; Wu Yuqing

2012-01-01

Graphial abstract: The differences in heparin binding for the C-terminal basic-sequence-rich peptides of HPV-16 and HPV-18 capsid protein L1. Highlights: ► Several driving forces contribute to the interaction between heparin and peptides. ► C-terminal of HPV L1 is a potential candidate for the attachment to host cells. ► The C-terminal peptides of HPV-16 and -18 L1 have different heparin-binding. ► The different heparin-binding provides an explanation for the distinct prevalences. - Abstract: The high-risk types of human papillomaviruses (HPV) HPV-16 and -18 are the predominant types associated with cervical cancer. HPV-16 and -18 account for about 50% and 20%, respectively, of cervical cancers worldwide. While the reason and molecular mechanism of the distinct prevalence and distributions between them remain poorly understood, the binding affinity of cell surface receptor with capsid proteins, especially L1, may be involved. We examined heparin binding with two synthetic peptides corresponding to the 14 amino acid C-terminal peptides of HPV-16 and -18 L1 with the goal of comparing the equivalent residues in different HPV types. Using isothermal titration calorimetry (ITC) and static right-angle light scattering (SLS), we determined the binding constant K, reaction enthalpy ΔH, and other thermodynamic parameters in the interaction. Especially, we assessed the role of specific residues in binding with heparin by comparing the NMR spectra of free and heparin-bound peptides.

FRB 121102: A Starquake-induced Repeater?

Science.gov (United States)

Wang, Weiyang; Luo, Rui; Yue, Han; Chen, Xuelei; Lee, Kejia; Xu, Renxin

2018-01-01

Since its initial discovery, the fast radio burst (FRB) FRB 121102 has been found to be repeating with millisecond-duration pulses. Very recently, 14 new bursts were detected by the Green Bank Telescope during its continuous monitoring observations. In this paper, we show that the burst energy distribution has a power-law form which is very similar to the Gutenberg–Richter law of earthquakes. In addition, the distribution of burst waiting time can be described as a Poissonian or Gaussian distribution, which is consistent with earthquakes, while the aftershock sequence exhibits some local correlations. These findings suggest that the repeating FRB pulses may originate from the starquakes of a pulsar. Noting that the soft gamma-ray repeaters (SGRs) also exhibit such distributions, the FRB could be powered by some starquake mechanisms associated with the SGRs, including the crustal activity of a magnetar or solidification-induced stress of a newborn strangeon star. These conjectures could be tested with more repeating samples.
Barley polyamine oxidase: Characterisation and analysis of the cofactor and the N-terminal amino acid sequence

DEFF Research Database (Denmark)

Radova, A.; Sebela, M.; Galuszka, P.

2001-01-01

This paper reports the first purification method developed for the isolation of an homogeneous polyamine oxidase (PAO) from etiolated barley seedlings. The crude enzyme preparation was obtained after initial precipitation of the extract with protamine sulphate and ammonium sulphate. The enzyme...... was further confirmed by measuring the fluorescence spectra, Barley PAO is an acidic protein (pI 5.4) containing 3% of neutral sugars: its molecular mass determined by SDS-PAGE was 56 kDa, whilst gel permeation chromatography revealed the higher value of 76 kDa. The N-terminal amino acid sequence of barley...... PAO shows a high degree of similarity to that of maize PAO and to several other flavoprotein oxidases. The polyamines spermine and spermidine were the only two substrates of the enzyme with K-m values 4 x 10(-5) and 3 x 10(-5) M and pH optima of 5.0 and 6.0, respectively. Barley polyamine oxidase...
Alu repeats as markers for human population genetics

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Bazan, H. [Louisiana State Univ., New Orleans, LA (United States). Medical Center] [and others

1993-09-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 97.9% nucleotide identity with each other and an average of 98.9% nucleotide identity with the HS subfamily consensus sequence. HS Alu family members are thought to be derived from a single source ``master`` gene, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5 in. and 3 in. unique flanking DNA sequences from each HS Alu that allows the locus to be assayed for the presence or absence of an Alu repeat. Individual HS Alu sequences were found to be either monomorphic or dimorphic for the presence or absence of each repeat. The monomorphic HS Alu family members inserted in the human genome after the human/great ape divergence (which is thought to have occurred 4--6 million years ago), but before the radiation of modem man. The dimorphic HS Alu sequences inserted in the human genome after the radiation of modem man (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project as well. HS Alu family member insertion dimorphism differs from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) because individuals share HS Alu family member insertions based upon identity by descent from a common ancestor as a result of a single event which occurred one time within the human population. The VNTR and RFLP polymorphisms may arise multiple times within a population and are identical by state only.
Identification of Variable-Number Tandem-Repeat (VNTR) Sequences in Acinetobacter baumannii and Interlaboratory Validation of an Optimized Multiple-Locus VNTR Analysis Typing Scheme▿†

Science.gov (United States)

Pourcel, Christine; Minandri, Fabrizia; Hauck, Yolande; D'Arezzo, Silvia; Imperi, Francesco; Vergnaud, Gilles; Visca, Paolo

2011-01-01

Acinetobacter baumannii is an important opportunistic pathogen responsible for nosocomial outbreaks, mostly occurring in intensive care units. Due to the multiplicity of infection sources, reliable molecular fingerprinting techniques are needed to establish epidemiological correlations among A. baumannii isolates. Multiple-locus variable-number tandem-repeat analysis (MLVA) has proven to be a fast, reliable, and cost-effective typing method for several bacterial species. In this study, an MLVA assay compatible with simple PCR- and agarose gel-based electrophoresis steps as well as with high-throughput automated methods was developed for A. baumannii typing. Preliminarily, 10 potential polymorphic variable-number tandem repeats (VNTRs) were identified upon bioinformatic screening of six annotated genome sequences of A. baumannii. A collection of 7 reference strains plus 18 well-characterized isolates, including unique types and representatives of the three international A. baumannii lineages, was then evaluated in a two-center study aimed at validating the MLVA assay and comparing it with other genotyping assays, namely, macrorestriction analysis with pulsed-field gel electrophoresis (PFGE) and PCR-based sequence group (SG) profiling. The results showed that MLVA can discriminate between isolates with identical PFGE types and SG profiles. A panel of eight VNTR markers was selected, all showing the ability to be amplified and good amounts of polymorphism in the majority of strains. Independently generated MLVA profiles, composed of an ordered string of allele numbers corresponding to the number of repeats at each VNTR locus, were concordant between centers. Typeability, reproducibility, stability, discriminatory power, and epidemiological concordance were excellent. A database containing information and MLVA profiles for several A. baumannii strains is available from http://mlva.u-psud.fr/. PMID:21147956
Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.

Science.gov (United States)

Ostria-Hernández, Martha Lorena; Sánchez-Vallejo, Carlos Javier; Ibarra, J Antonio; Castro-Escarpulli, Graciela

2015-08-04

In recent years the emergence of multidrug resistant Klebsiella pneumoniae strains has been an increasingly common event. This opportunistic species is one of the five main bacterial pathogens that cause hospital infections worldwide and multidrug resistance has been associated with the presence of high molecular weight plasmids. Plasmids are generally acquired through horizontal transfer and therefore is possible that systems that prevent the entry of foreign genetic material are inactive or absent. One of these systems is CRISPR/Cas. However, little is known regarding the clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) system in K. pneumoniae. The adaptive immune system CRISPR/Cas has been shown to limit the entry of foreign genetic elements into bacterial organisms and in some bacteria it has been shown to be involved in regulation of virulence genes. Thus in this work we used bioinformatics tools to determine the presence or absence of CRISPR/Cas systems in available K. pneumoniae genomes. The complete CRISPR/Cas system was identified in two out of the eight complete K. pneumoniae genomes sequences and in four out of the 44 available draft genomes sequences. The cas genes in these strains comprises eight cas genes similar to those found in Escherichia coli, suggesting they belong to the type I-E group, although their arrangement is slightly different. As for the CRISPR sequences, the average lengths of the direct repeats and spacers were 29 and 33 bp, respectively. BLAST searches demonstrated that 38 of the 116 spacer sequences (33%) are significantly similar to either plasmid, phage or genome sequences, while the remaining 78 sequences (67%) showed no significant similarity to other sequences. The region where the CRISPR/Cas systems were located is the same in all the Klebsiella genomes containing it, it has a syntenic architecture, and is located among genes encoding for proteins likely involved in
Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

Directory of Open Access Journals (Sweden)

Pupko Tal

2008-10-01

Full Text Available Abstract Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data.

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E; Greenwood, Alex D

2015-11-24

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
An Evolutionarily Young Polar Bear (Ursus maritimus) Endogenous Retrovirus Identified from Next Generation Sequence Data

Science.gov (United States)

Tsangaras, Kyriakos; Mayer, Jens; Alquezar-Planas, David E.; Greenwood, Alex D.

2015-01-01

Transcriptome analysis of polar bear (Ursus maritimus) tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV). Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs) of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos) and black bear (Ursus americanus) but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs) and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals. PMID:26610552
Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles

Directory of Open Access Journals (Sweden)

Oussama Ahrazem

2012-05-01

Full Text Available Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups—Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco—while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.
Natural variation of the amino-terminal glutamine-rich domain in Drosophila argonaute2 is not associated with developmental defects.

Directory of Open Access Journals (Sweden)

Daniel Hain

2010-12-01

Full Text Available The Drosophila argonaute2 (ago2 gene plays a major role in siRNA mediated RNA silencing pathways. Unlike mammalian Argonaute proteins, the Drosophila protein has an unusual amino-terminal domain made up largely of multiple copies of glutamine-rich repeats (GRRs. We report here that the ago2 locus produces an alternative transcript that encodes a putative short isoform without this amino-terminal domain. Several ago2 mutations previously reported to be null alleles only abolish expression of the long, GRR-containing isoform. Analysis of drop out (dop mutations had previously suggested that variations in GRR copy number result in defects in RNAi and embryonic development. However, we find that dop mutations genetically complement transcript-null alleles of ago2 and that ago2 alleles with variant GRR copy numbers support normal development. In addition, we show that the assembly of the central RNAi machinery, the RISC (RNA induced silencing complex, is unimpaired in embryos when GRR copy number is altered. In fact, we find that GRR copy number is highly variable in natural D. melanogaster populations as well as in laboratory strains. Finally, while many other insects share an extensive, glutamine-rich Ago2 amino-terminal domain, its primary sequence varies drastically between species. Our data indicate that GRR variation does not modulate an essential function of Ago2 and that the amino-terminal domain of Ago2 is subject to rapid evolution.
Isolation and sequence analysis of a cDNA clone encoding the fifth complement component

DEFF Research Database (Denmark)

Lundwall, Åke B; Wetsel, Rick A; Kristensen, Torsten

1985-01-01

DNA clone of 1.85 kilobase pairs was isolated. Hybridization of the mixed-sequence probe to the complementary strand of the plasmid insert and sequence analysis by the dideoxy method predicted the expected protein sequence of C5a (positions 1-12), amino-terminal to the anticipated priming site. The sequence......, subcloned into M13 mp8, and sequenced at random by the dideoxy technique, thereby generating a contiguous sequence of 1703 base pairs. This clone contained coding sequence for the C-terminal 262 amino acid residues of the beta-chain, the entire C5a fragment, and the N-terminal 98 residues of the alpha......'-chain. The 3' end of the clone had a polyadenylated tail preceded by a polyadenylation recognition site, a 3'-untranslated region, and base pairs homologous to the human Alu concensus sequence. Comparison of the derived partial human C5 protein sequence with that previously determined for murine C3 and human...
Crystal structure of the Xpo1p nuclear export complex bound to the SxFG/PxFG repeats of the nucleoporin Nup42p.

Science.gov (United States)

Koyama, Masako; Hirano, Hidemi; Shirai, Natsuki; Matsuura, Yoshiyuki

2017-10-01

Xpo1p (yeast CRM1) is the major nuclear export receptor that carries a plethora of proteins and ribonucleoproteins from the nucleus to cytoplasm. The passage of the Xpo1p nuclear export complex through nuclear pore complexes (NPCs) is facilitated by interactions with nucleoporins (Nups) containing extensive repeats of phenylalanine-glycine (so-called FG repeats), although the precise role of each Nup in the nuclear export reaction remains incompletely understood. Here we report structural and biochemical characterization of the interactions between the Xpo1p nuclear export complex and the FG repeats of Nup42p, a nucleoporin localized at the cytoplasmic face of yeast NPCs and has characteristic SxFG/PxFG sequence repeat motif. The crystal structure of Xpo1p-PKI-Nup42p-Gsp1p-GTP complex identified three binding sites for the SxFG/PxFG repeats on HEAT repeats 14-20 of Xpo1p. Mutational analyses of Nup42p showed that the conserved serines and prolines in the SxFG/PxFG repeats contribute to Xpo1p-Nup42p binding. Our structural and biochemical data suggest that SxFG/PxFG-Nups such as Nup42p and Nup159p at the cytoplasmic face of NPCs provide high-affinity docking sites for the Xpo1p nuclear export complex in the terminal stage of NPC passage and that subsequent disassembly of the nuclear export complex facilitates recycling of free Xpo1p back to the nucleus. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.
Unique CCT repeats mediate transcription of the TWIST1 gene in mesenchymal cell lines

International Nuclear Information System (INIS)

Ohkuma, Mizue; Funato, Noriko; Higashihori, Norihisa; Murakami, Masanori; Ohyama, Kimie; Nakamura, Masataka

2007-01-01

TWIST1, a basic helix-loop-helix transcription factor, plays critical roles in embryo development, cancer metastasis and mesenchymal progenitor differentiation. Little is known about transcriptional regulation of TWIST1 expression. Here we identified DNA sequences responsible for TWIST1 expression in mesenchymal lineage cell lines. Reporter assays with TWIST1 promoter mutants defined the -102 to -74 sequences that are essential for TWIST1 expression in human and mouse mesenchymal cell lines. Tandem repeats of CCT, but not putative CREB and NF-κB sites in the sequences substantially supported activity of the TWIST1 promoter. Electrophoretic mobility shift assay demonstrated that the DNA sequences with the CCT repeats formed complexes with nuclear factors, containing, at least, Sp1 and Sp3. These results suggest critical implication of the CCT repeats in association with Sp1 and Sp3 factors in sustaining expression of the TWIST1 gene in mesenchymal cells
Comparing Whole-Genome Sequencing with Sanger Sequencing for spa Typing of Methicillin-Resistant Staphylococcus aureus

DEFF Research Database (Denmark)

Bartels, Mette Damkjaer; Petersen, Andreas; Worning, Peder

2014-01-01

spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and ...
Interstitial telomere-like repeats in the Arabidopsis thaliana genome.

Science.gov (United States)

Uchida, Wakana; Matsunaga, Sachihiro; Sugiyama, Ryuji; Kawano, Shigeyuki

2002-02-01

Eukaryotic chromosomal ends are protected by telomeres, which are thought to play an important role in ensuring the complete replication of chromosomes. On the other hand, non-functional telomere-like repeats in the interchromosomal regions (interstitial telomeric repeats; ITRs) have been reported in several eukaryotes. In this study, we identified eight ITRs in the Arabidopsis thaliana genome, each consisting of complete and degenerate 300- to 1200-bp sequences. The ITRs were grouped into three classes (class IA-B, class II, and class IIIA-E) based on the degeneracy of the telomeric repeats in ITRs. The telomeric repeats of the two ITRs in class I were conserved for the most part, whereas the single ITR in class II, and the five ITRs in class III were relatively degenerated. In addition, degenerate ITRs were surrounded by common sequences that shared 70-100% homology to each other; these are named ITR-adjacent sequences (IAS). Although the genomic regions around ITRs in class I lacked IAS, those around ITRs in class II contained IAS (IASa), and those around five ITRs in class III had nine types of IAS (IASb, c, d, e, f, g, h, i, and j). Ten IAS types in classes II and III showed no significant homology to each other. The chromosomal locations of ITRs and IAS were not category-related, but most of them were adjacent to, or part of, a centromere. These results show that the A. thaliana genome has undergone chromosomal rearrangements, such as end-fusions and segmental duplications.
Two-terminal video coding.

Science.gov (United States)

Yang, Yang; Stanković, Vladimir; Xiong, Zixiang; Zhao, Wei

2009-03-01

Following recent works on the rate region of the quadratic Gaussian two-terminal source coding problem and limit-approaching code designs, this paper examines multiterminal source coding of two correlated, i.e., stereo, video sequences to save the sum rate over independent coding of both sequences. Two multiterminal video coding schemes are proposed. In the first scheme, the left sequence of the stereo pair is coded by H.264/AVC and used at the joint decoder to facilitate Wyner-Ziv coding of the right video sequence. The first I-frame of the right sequence is successively coded by H.264/AVC Intracoding and Wyner-Ziv coding. An efficient stereo matching algorithm based on loopy belief propagation is then adopted at the decoder to produce pixel-level disparity maps between the corresponding frames of the two decoded video sequences on the fly. Based on the disparity maps, side information for both motion vectors and motion-compensated residual frames of the right sequence are generated at the decoder before Wyner-Ziv encoding. In the second scheme, source splitting is employed on top of classic and Wyner-Ziv coding for compression of both I-frames to allow flexible rate allocation between the two sequences. Experiments with both schemes on stereo video sequences using H.264/AVC, LDPC codes for Slepian-Wolf coding of the motion vectors, and scalar quantization in conjunction with LDPC codes for Wyner-Ziv coding of the residual coefficients give a slightly lower sum rate than separate H.264/AVC coding of both sequences at the same video quality.
The complete genome sequence and proteomics of Yersinia pestis phage Yep-phi.

Science.gov (United States)

Zhao, Xiangna; Wu, Weili; Qi, Zhizhen; Cui, Yujun; Yan, Yanfeng; Guo, Zhaobiao; Wang, Zuyun; Wang, Hu; Deng, Haijun; Xue, Yan; Chen, Weijun; Wang, Xiaoyi; Yang, Ruifu

2011-01-01

Yep-phi, a lytic phage of Yersinia pestis, was isolated in China and is routinely used as a diagnostic phage for the identification of the plague pathogen. Yep-phi has an isometric hexagonal head containing dsDNA and a short non-contractile conical tail. In this study, we sequenced the Yep-phi genome (GenBank accession no. HQ333270) and performed proteomics analysis. The genome consists of 38 ,616 bp of DNA, including direct terminal repeats of 222 bp, and is predicted to contain 45 ORFs. Most structural proteins were identified by proteomics analysis. Compared with the three available genome sequences of lytic phages for Y. pestis, the phages could be divided into two subgroups. Yep-phi displays marked homology to the bacteriophages Berlin (GenBank accession no. AM183667) and Yepe2 (GenBank accession no. EU734170), and these comprise one subgroup. The other subgroup is represented by bacteriophage ΦA1122 (GenBank accession no. AY247822). Potential recombination was detected among the Yep-phi subgroup.
Molecular characterization of Vulmar1, a complete mariner transposon of sugar beet and diversity of mariner- and En/Spm-like sequences in the genus Beta.

Science.gov (United States)

Jacobs, Gunnar; Dechyeva, Daryna; Menzel, Gerhard; Dombrowski, Cora; Schmidt, Thomas

2004-12-01

Transposons of the Tc1-mariner superfamily are widespread in eukaryotic genomes. We have isolated the mariner element Vulmar1 from Beta vulgaris L., which is 3909 bp long and bordered by perfect terminal inverted repeats of 32 bp with homology to terminal inverted repeats of transposons from soybean and rice. According to a characteristic amino acid signature, Vulmar1 can be assigned to the DD39D group of mariner transposons. Vulmar1 is flanked by a 5'-TA-3' target site duplication that is typical for mariner transposons. Southern hybridization revealed that mariner-like copies are highly abundant in Beta species, and sequence analysis of 10 transposase fragments from representative species of the four Beta sections revealed an identity between 34% and 100% after conceptual translation. By fluorescent in situ hybridization, Vulmar1 was detected in distal euchromatin as well as in some intercalary and pericentromeric regions of all B. vulgaris chromosomes. In addition, using PCR, we were able to amplify fragments of the transposase gene of En/Spm-like transposons in the genus Beta. En/Spm-like transposase sequences are highly amplified in four Beta sections and showed a considerable degree of conservation (88.5-100%) at the protein level, while the homology to corresponding regions of En/Spm transposons of other plant species ranges from 49.5% to 62.5%. By fluorescent in situ hybridization, En/Spm-like transposon signals of strong intensity were detected on all chromosomes of B. vulgaris.
Deletion of Repeats in the Alpha C Protein Enhances the Pathogenicity of Group B Streptococci in Immune Mice

OpenAIRE

Gravekamp, C.; Rosner, Bernard; Madoff, L. C.

1998-01-01

The alpha C protein is a protective surface-associated antigen of group B streptococci (GBS). The prototype alpha C protein of GBS (strain A909) contains nine identical tandem repeats, each comprising 82 amino acids, flanked by N- and C-terminal domains. Clinical isolates of GBS show variable numbers of repeats with a normal distribution and a median of 9 to 10 repeats. Here, we show that escape mutants of GBS expressing one-repeat alpha C protein were 100-fold more pathogenic than GBS expres...
Acquiring a cognitive skill with a new repeating version of the Tower of London task.

Science.gov (United States)

Ouellet, Marie-Christine; Beauchamp, Miriam H; Owen, Adrian M; Doyon, Julien

2004-12-01

A computerized version of the Tower of London task was used to investigate cognitive skill learning. Thirty-six healthy volunteers were assigned to either a random condition (nonrecurring problems), or to a sequence condition in which, unbeknownst to the subjects, a repeating sequence of three problems was presented. Indices of execution, planning, and total time, as well as number of moves performed, were used to measure behavioural change. Subjects' performance improved in both conditions across blocks of practice. A distinct learning effect related to the repeating sequence was also observed. This suggests that a specific skill that reflects procedural learning of the strategies, rules, and procedures pertaining to repeating problems can develop over and above a more general skill at solving cognitive planning problems with practice.

Gene conversion homogenizes the CMT1A paralogous repeats

Directory of Open Access Journals (Sweden)

Hurles Matthew E

2001-12-01

Full Text Available Abstract Background Non-allelic homologous recombination between paralogous repeats is increasingly being recognized as a major mechanism causing both pathogenic microdeletions and duplications, and structural polymorphism in the human genome. It has recently been shown empirically that gene conversion can homogenize such repeats, resulting in longer stretches of absolute identity that may increase the rate of non-allelic homologous recombination. Results Here, a statistical test to detect gene conversion between pairs of non-coding sequences is presented. It is shown that the 24 kb Charcot-Marie-Tooth type 1A paralogous repeats (CMT1A-REPs exhibit the imprint of gene conversion processes whilst control orthologous sequences do not. In addition, Monte Carlo simulations of the evolutionary divergence of the CMT1A-REPs, incorporating two alternative models for gene conversion, generate repeats that are statistically indistinguishable from the observed repeats. Bounds are placed on the rate of these conversion processes, with central values of 1.3 × 10-4 and 5.1 × 10-5 per generation for the alternative models. Conclusions This evidence presented here suggests that gene conversion may have played an important role in the evolution of the CMT1A-REP paralogous repeats. The rates of these processes are such that it is probable that homogenized CMT1A-REPs are polymorphic within modern populations. Gene conversion processes are similarly likely to play an important role in the evolution of other segmental duplications and may influence the rate of non-allelic homologous recombination between them.
Ureaplasma antigenic variation beyond MBA phase variation: DNA inversions generating chimeric structures and switching in expression of the MBA N-terminal paralogue UU172.

Science.gov (United States)

Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim

2011-02-01

Phase variation of the major ureaplasma surface membrane protein, the multiple-banded antigen (MBA), with its counterpart, the UU376 protein, was recently discussed as a result of DNA inversion occurring at specific inverted repeats. Two similar inverted repeats to the ones within the mba locus were found in the genome of Ureaplasma parvum serovar 3; one within the MBA N-terminal paralogue UU172 and another in the adjacent intergenic spacer region. In this report, we demonstrate on both genomic and protein level that DNA inversion at these inverted repeats leads to alternating expression between UU172 and the neighbouring conserved hypothetical ORF UU171. Sequence analysis of this phase-variable 'UU172 element' from both U. parvum and U. urealyticum strains revealed that it is highly conserved among both species and that it also includes the orthologue of UU144. A third inverted repeat region in UU144 is proposed to serve as an additional potential inversion site from which chimeric genes can evolve. Our results indicate that site-specific recombination events in the genome of U. parvum serovar 3 are dynamic and frequent, leading to a broad spectrum of antigenic variation by which the organism may evade host immune responses. © 2010 Blackwell Publishing Ltd.
Ureaplasma antigenic variation beyond MBA phase variation: DNA inversions generating chimeric structures and switching in expression of the MBA N-terminal paralogue UU172

Science.gov (United States)

Zimmerman, Carl-Ulrich R; Rosengarten, Renate; Spergser, Joachim

2011-01-01

Phase variation of the major ureaplasma surface membrane protein, the multiple-banded antigen (MBA), with its counterpart, the UU376 protein, was recently discussed as a result of DNA inversion occurring at specific inverted repeats. Two similar inverted repeats to the ones within the mba locus were found in the genome of Ureaplasma parvum serovar 3; one within the MBA N-terminal paralogue UU172 and another in the adjacent intergenic spacer region. In this report, we demonstrate on both genomic and protein level that DNA inversion at these inverted repeats leads to alternating expression between UU172 and the neighbouring conserved hypothetical ORF UU171. Sequence analysis of this phase-variable ‘UU172 element’ from both U. parvum and U. urealyticum strains revealed that it is highly conserved among both species and that it also includes the orthologue of UU144. A third inverted repeat region in UU144 is proposed to serve as an additional potential inversion site from which chimeric genes can evolve. Our results indicate that site-specific recombination events in the genome of U. parvum serovar 3 are dynamic and frequent, leading to a broad spectrum of antigenic variation by which the organism may evade host immune responses. PMID:21255110
Unprecedented large inverted repeats at the replication terminus of circular bacterial chromosomes suggest a novel mode of chromosome rescue

Science.gov (United States)

El Kafsi, Hela; Loux, Valentin; Mariadassou, Mahendra; Blin, Camille; Chiapello, Hélène; Abraham, Anne-Laure; Maguin, Emmanuelle; van de Guchte, Maarten

2017-01-01

The first Lactobacillus delbrueckii ssp. bulgaricus genome sequence revealed the presence of a very large inverted repeat (IR), a DNA sequence arrangement which thus far seemed inconceivable in a non-manipulated circular bacterial chromosome, at the replication terminus. This intriguing observation prompted us to investigate if similar IRs could be found in other bacteria. IRs with sizes varying from 38 to 76 kbp were found at the replication terminus of all 5 L. delbrueckii ssp. bulgaricus chromosomes analysed, but in none of 1373 other chromosomes. They represent the first naturally occurring very large IRs detected in circular bacterial genomes. A comparison of the L. bulgaricus replication terminus regions and the corresponding regions without IR in 5 L. delbrueckii ssp. lactis genomes leads us to propose a model for the formation and evolution of the IRs. The DNA sequence data are consistent with a novel model of chromosome rescue after premature replication termination or irreversible chromosome damage near the replication terminus, involving mechanisms analogous to those proposed in the formation of very large IRs in human cancer cells. We postulate that the L. delbrueckii ssp. bulgaricus-specific IRs in different strains derive from a single ancestral IR of at least 93 kbp. PMID:28281695
Genome Sequence of the Bacterium Streptomyces davawensis JCM 4913 and Heterologous Production of the Unique Antibiotic Roseoflavin

Science.gov (United States)

Jankowitsch, Frank; Schwarz, Julia; Rückert, Christian; Gust, Bertolt; Szczepanowski, Rafael; Blom, Jochen; Pelzer, Stefan; Kalinowski, Jörn

2012-01-01

Streptomyces davawensis JCM 4913 synthesizes the antibiotic roseoflavin, a structural riboflavin (vitamin B2) analog. Here, we report the 9,466,619-bp linear chromosome of S. davawensis JCM 4913 and a 89,331-bp linear plasmid. The sequence has an average G+C content of 70.58% and contains six rRNA operons (16S-23S-5S) and 69 tRNA genes. The 8,616 predicted protein-coding sequences include 32 clusters coding for secondary metabolites, several of which are unique to S. davawensis. The chromosome contains long terminal inverted repeats of 33,255 bp each and atypical telomeres. Sequence analysis with regard to riboflavin biosynthesis revealed three different patterns of gene organization in Streptomyces species. Heterologous expression of a set of genes present on a subgenomic fragment of S. davawensis resulted in the production of roseoflavin by the host Streptomyces coelicolor M1152. Phylogenetic analysis revealed that S. davawensis is a close relative of Streptomyces cinnabarinus, and much to our surprise, we found that the latter bacterium is a roseoflavin producer as well. PMID:23043000
Tat-dependent repression of human immunodeficiency virus type 1 long terminal repeat promoter activity by fusion of cellular transcription factors

International Nuclear Information System (INIS)

Zhao Cunyou; Chen Yali; Park, Jiyoung; Kim, Jae Bum; Tang Hong

2004-01-01

Transcription initiation from HIV-1 long terminal repeat (LTR) promoter requires the virally encoded transactivator, Tat, and several cellular co-factors to accomplish the Tat-dependent processive transcription elongation. Individual cellular transcription activators, LBP-1b and Oct-1, on the other hand, have been shown to inhibit LTR promoter activities probably via competitive binding against TFIID to the TATA-box in LTR promoter. To explore the genetic interference strategies against the viral replication, we took advantage of the existence of the bipartite DNA binding domains and the repression domains of LBP-1b and Oct-1 factors to generate a chimeric transcription repressor. Our results indicated that the fusion protein of LBP-1b and Oct-1 exhibited higher DNA binding affinity to the viral promoter than the individual factors, and little interference with the host cell gene expression due to its anticipated rare cognate DNA sites in the host cell genome. Moreover, the chimera exerted increased Tat-dependent repression of transcription initiation at the LTR promoter both in vitro and in vivo compared to LBP-1b, Oct-1 or combination of LBP-1b and Oct-1. These results might provide the lead in generating a therapeutic reagent useful to suppress HIV-1 replication
Complete plastid genome sequence of Primula sinensis (Primulaceae: structure comparison, sequence variation and evidence for accD transfer to nucleus

Directory of Open Access Journals (Sweden)

Tong-Jian Liu

2016-06-01

Full Text Available Species-rich genus Primula L. is a typical plant group with which to understand genetic variance between species in different levels of relationships. Chloroplast genome sequences are used to be the information resource for quantifying this difference and reconstructing evolutionary history. In this study, we reported the complete chloroplast genome sequence of Primula sinensis and compared it with other related species. This genome of chloroplast showed a typical circular quadripartite structure with 150,859 bp in sequence length consisting of 37.2% GC base. Two inverted repeated regions (25,535 bp were separated by a large single-copy region (82,064 bp and a small single-copy region (17,725 bp. The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. The accD and infA genes lacking intact open reading frames (ORF were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome of Primula sinensis, comparing with another available plastome of P. poissonii. The four most variable regions, rpl36–rps8, rps16–trnQ, trnH–psbA and ndhC–trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences showed the identical result that was consistent with previous studies. A transcript found from P. sinensis transcriptome showed a high similarity to plastid accD functional region and was identified as a putative plastid transit peptide at the N-terminal region. The result strongly suggested that plastid accD has been functionally transferred to the nucleus in P. sinensis.
Nucleotide sequence analysis of HTLV-I isolated from cerebrospinal fluid of a patient with TSP/HAM: comparison to other HTLV-I isolates.

Science.gov (United States)

Mukhopadhyaya, R; Sadaie, M R

1993-02-01

Human T-cell leukemia virus type I (HTLV-I) has been associated with adult T-cell leukemia/lymphoma and the chronic neurologic disorder tropical spastic paraparesis/HTLV-I-associated myelopathy (TSP/HAM). To study the genetic structure of the virus associated with TSP/HAM, we have obtained and sequenced a partial genomic clone from an HTLV-I-positive cell line established from cerebrospinal fluid (CSF) of a Jamaican patient with TSP/HAM. This clone consisted of a 4.3-kb viral sequence containing the 5' long terminal repeat (LTR), gag, and N-terminal portion of the pol gene, with an overall 1.3% sequence variation resulting from mostly nucleotide substitutions, as compared to the prototype HTLV-I ATK-1. The gag and pol regions showed only 1.4% and 1.2% nucleotide variations, respectively. However, the U3 region of the LTR showed the highest sequence variation (3.6%), where several changes appear to be common among certain TSP/HAM isolates. Several of these changes reside within the 21-bp boundaries and the Tax-responsive element. It would be important to determine if the observed changes are sufficient to cause neurologic disorders similar to the murine leukemia virus system or simply reflect the divergent pool of HTLV-I from different geographic locations. At this time, we cannot rule out the possibility that the observed changes have either direct or indirect significance for the HTLV-I pathogenesis in TSP/HAM.
High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform

DEFF Research Database (Denmark)

Fordyce, Sarah Louise; Avila Arcos, Maria del Carmen; Rockenbauer, Eszter

2011-01-01

repeat units. These methods do not allow for the full resolution of STR base composition that sequencing approaches could provide. Here we present an STR profiling method based on the use of the Roche Genome Sequencer (GS) FLX to simultaneously sequence multiple core STR loci. Using this method...
Complete chloroplast genome sequence of a major economic species, Ziziphus jujuba (Rhamnaceae).

Science.gov (United States)

Ma, Qiuyue; Li, Shuxian; Bi, Changwei; Hao, Zhaodong; Sun, Congrui; Ye, Ning

2017-02-01

Ziziphus jujuba is an important woody plant with high economic and medicinal value. Here, we analyzed and characterized the complete chloroplast (cp) genome of Z. jujuba, the first member of the Rhamnaceae family for which the chloroplast genome sequence has been reported. We also built a web browser for navigating the cp genome of Z. jujuba ( http://bio.njfu.edu.cn/gb2/gbrowse/Ziziphus_jujuba_cp/ ). Sequence analysis showed that this cp genome is 161,466 bp long and has a typical quadripartite structure of large (LSC, 89,120 bp) and small (SSC, 19,348 bp) single-copy regions separated by a pair of inverted repeats (IRs, 26,499 bp). The sequence contained 112 unique genes, including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. The genome structure, gene order, GC content, and codon usage are similar to other typical angiosperm cp genomes. A total of 38 tandem repeats, two forward repeats, and three palindromic repeats were detected in the Z. jujuba cp genome. Simple sequence repeat (SSR) analysis revealed that most SSRs were AT-rich. The homopolymer regions in the cp genome of Z. jujuba were verified and manually corrected by Sanger sequencing. One-third of mononucleotide repeats were found to be erroneously sequenced by the 454 pyrosequencing, which resulted in sequences of 1-4 bases shorter than that by the Sanger sequencing. Analyzing the cp genome of Z. jujuba revealed that the IR contraction and expansion events resulted in ycf1 and rps19 pseudogenes. A phylogenetic analysis based on 64 protein-coding genes showed that Z. jujuba was closely related to members of the Elaeagnaceae family, which will be helpful for phylogenetic studies of other Rosales species. The complete cp genome sequence of Z. jujuba will facilitate population, phylogenetic, and cp genetic engineering studies of this economic plant.
The N-terminal sequence of ribosomal protein L10 from the archaebacterium Halobacterium marismortui and its relationship to eubacterial protein L6 and other ribosomal proteins.

Science.gov (United States)

Dijk, J; van den Broek, R; Nasiulas, G; Beck, A; Reinhardt, R; Wittmann-Liebold, B

1987-08-01

The amino-terminal sequence of ribosomal protein L10 from Halobacterium marismortui has been determined up to residue 54, using both a liquid- and a gas-phase sequenator. The two sequences are in good agreement. The protein is clearly homologous to protein HcuL10 from the related strain Halobacterium cutirubrum. Furthermore, a weaker but distinct homology to ribosomal protein L6 from Escherichia coli and Bacillus stearothermophilus can be detected. In addition to 7 identical amino acids in the first 36 residues in all four sequences a number of conservative replacements occurs, of mainly hydrophobic amino acids. In this common region the pattern of conserved amino acids suggests the presence of a beta-alpha fold as it occurs in ribosomal proteins L12 and L30. Furthermore, several potential cases of homology to other ribosomal components of the three ur-kingdoms have been found.
Estrogen Repression of MicroRNAs Is Associated with High Guanine Content in the Terminal Loop Sequences of Their Precursors

Directory of Open Access Journals (Sweden)

Amit Cohen

2017-08-01

Full Text Available Widespread microRNA (miRNA repression is a phenomenon observed in mammals after exposure to cigarette smoke and in many types of cancer. A comprehensive reduction in miRNA expression after treatment with the hormone estrogen has also previously been described. Here, we reveal a conserved association of miRNA downregulation after estrogen exposure in zebrafish, mouse, and human breast cancer cell line, with a high guanine content in the terminal loop sequences of their precursors, and offer a possible link between estrogen-related miRNA-adducts formation and carcinogenesis. We also show common gene expression patterns shared by breast cancer tumors and estrogen-treated zebrafish, suggesting that this organism can be used as a powerful model system for the study of human breast cancer.
PLE-wu, a new member of piggyBac transposon family from insect, is active in mammalian cells.

Science.gov (United States)

Wu, Chunxiao; Wang, Shu

2014-10-01

piggyBac, a highly active transposon in insect and mammalian cells, is a very useful tool in genome manipulation. A new piggyBac-like element (PLE), named PLE-wu, was identified from a mutant baculovirus cultured in sf9 insect cells. This new transposon is 2931 bp in length and encodes two active forms of transposase, a 708-amino acid-long transposase and a short 576-residue-long transposase translated from a downstream in-frame initiation codon. PLE-wu has asymmetric terminal structures, containing 6-bp inverted terminal repeats, 32-bp imperfect inverted and direct sub-terminal repeats. Similar to piggyBac, PLE-wu exhibits traceless excision activity in both insect and mammalian cells, restoring the original TTAA target sequence upon excision. It also retains the insertion activity in mammalian cells with a plasmid to chromosome transposition rate about 10-fold higher than random integration. Plasmid rescue assays revealed that the TTAA target sequence was duplicated at the junctions of the insertion site. Deletion of the terminal sequences including the sub-terminal repeats decreased the transposition activity of the 708-residue-long transposase, while the transposition activity of the short form of transposase was not affected. With its low sequence similarity to piggyBac, PLE-wu will contribute to the understanding the mechanism of PLE transposition, as well as design of new transposon systems with higher activity. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.

Science.gov (United States)

Militello, Kevin T; Lazatin, Justine C

2017-05-01

Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017. © 2016 The International Union of Biochemistry and Molecular Biology.
Ni(II) and Cu(II) binding with a 14-aminoacid sequence of Cap43 protein, TRSRSHTSEGTRSR.

Science.gov (United States)

Zoroddu, M A; Kowalik-Jankowska, T; Kozlowski, H; Salnikow, K; Costa, M

2001-03-01

The tetradecapeptide containing the 10 aminoacid repeated sequence on the C-terminus of the Ni(II)-induced Cap43 protein, was analyzed for Ni(II) and Cu(II) binding. A combined pH-metric and spectroscopic UV-VIS, EPR, CD and NMR study of Ni(II) and Cu(II) binding to the blocked CH3CO-Thr-Arg-Ser-Arg-Ser-His-Thr-Ser-Glu-Gly-Thr-Arg-Ser-Arg-NH2 (Ac-TRSRSHTSEGTRSR-Am) peptide, modeling a part of the C-terminal sequence of the Cap43 protein, revealed the formation of octahedral complexes involving imidazole nitrogen of histidine, at pH 5.5 and pH 7 for Cu(II) and Ni(II), respectively; a major square planar 4N-Ni(II) complex (about 100% at pH 9, log K* = -28.16) involving imidazole nitrogen of histidine and three deprotonated amide nitrogens of the backbone of the peptide was revealed; a 3N-Cu(II) complex (maximum about 70% at pH 7, log K*=-13.91) and a series of 4N-Cu(II) complexes starting at pH 5.5 (maximum about 90% at pH 8.7, log K* = -21.39 for CuH(-3)L), were revealed. This work supports the existence of a metal binding site at the COOH-terminal part of the Cap43 peptide.
Amyloid fibril formation from sequences of a natural beta-structured fibrous protein, the adenovirus fiber.

Science.gov (United States)

Papanikolopoulou, Katerina; Schoehn, Guy; Forge, Vincent; Forsyth, V Trevor; Riekel, Christian; Hernandez, Jean-François; Ruigrok, Rob W H; Mitraki, Anna

2005-01-28

Amyloid fibrils are fibrous beta-structures that derive from abnormal folding and assembly of peptides and proteins. Despite a wealth of structural studies on amyloids, the nature of the amyloid structure remains elusive; possible connections to natural, beta-structured fibrous motifs have been suggested. In this work we focus on understanding amyloid structure and formation from sequences of a natural, beta-structured fibrous protein. We show that short peptides (25 to 6 amino acids) corresponding to repetitive sequences from the adenovirus fiber shaft have an intrinsic capacity to form amyloid fibrils as judged by electron microscopy, Congo Red binding, infrared spectroscopy, and x-ray fiber diffraction. In the presence of the globular C-terminal domain of the protein that acts as a trimerization motif, the shaft sequences adopt a triple-stranded, beta-fibrous motif. We discuss the possible structure and arrangement of these sequences within the amyloid fibril, as compared with the one adopted within the native structure. A 6-amino acid peptide, corresponding to the last beta-strand of the shaft, was found to be sufficient to form amyloid fibrils. Structural analysis of these amyloid fibrils suggests that perpendicular stacking of beta-strand repeat units is an underlying common feature of amyloid formation.
Genome survey sequencing and genetic background characterization of Gracilariopsis lemaneiformis (Rhodophyta) based on next-generation sequencing.

Science.gov (United States)

Zhou, Wei; Hu, Yiyi; Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon.
Genome Survey Sequencing and Genetic Background Characterization of Gracilariopsis lemaneiformis (Rhodophyta) Based on Next-Generation Sequencing

Science.gov (United States)

Sui, Zhenghong; Fu, Feng; Wang, Jinguo; Chang, Lianpeng; Guo, Weihua; Li, Binbin

2013-01-01

Gracilariopsis lemaneiformis has a high economic value and is one of the most important aquaculture species in China. Despite it is economic importance, it has remained largely unstudied at the genomic level. In this study, we conducted a genome survey of Gp. lemaneiformis using next-generation sequencing (NGS) technologies. In total, 18.70 Gb of high-quality sequence data with an estimated genome size of 97 Mb were obtained by HiSeq 2000 sequencing for Gp. lemaneiformis. These reads were assembled into 160,390 contigs with a N50 length of 3.64 kb, which were further assembled into 125,685 scaffolds with a total length of 81.17 Mb. Genome analysis predicted 3490 genes and a GC% content of 48%. The identified genes have an average transcript length of 1,429 bp, an average coding sequence size of 1,369 bp, 1.36 exons per gene, exon length of 1,008 bp, and intron length of 191 bp. From the initial assembled scaffold, transposable elements constituted 54.64% (44.35 Mb) of the genome, and 7737 simple sequence repeats (SSRs) were identified. Among these SSRs, the trinucleotide repeat type was the most abundant (up to 73.20% of total SSRs), followed by the di- (17.41%), tetra- (5.49%), hexa- (2.90%), and penta- (1.00%) nucleotide repeat type. These characteristics suggest that Gp. lemaneiformis is a model organism for genetic study. This is the first report of genome-wide characterization within this taxon. PMID:23875008
N-terminal nesprin-2 variants regulate β-catenin signalling

Energy Technology Data Exchange (ETDEWEB)

Zhang, Qiuping; Minaisah, Rose-Marie; Ferraro, Elisa; Li, Chen; Porter, Lauren J.; Zhou, Can; Gao, Fang; Zhang, Junyi; Rajgor, Dipen; Autore, Flavia; Shanahan, Catherine M.; Warren, Derek T., E-mail: derek.warren@kcl.ac.uk

2016-07-15

The spatial compartmentalisation of biochemical signalling pathways is essential for cell function. Nesprins are a multi-isomeric family of proteins that have emerged as signalling scaffolds, herein, we investigate the localisation and function of novel nesprin-2 N-terminal variants. We show that these nesprin-2 variants display cell specific distribution and reside in both the cytoplasm and nucleus. Immunofluorescence microscopy revealed that nesprin-2 N-terminal variants colocalised with β-catenin at cell-cell junctions in U2OS cells. Calcium switch assays demonstrated that nesprin-2 and β-catenin are lost from cell-cell junctions in low calcium conditions whereas emerin localisation at the NE remained unaltered, furthermore, an N-terminal fragment of nesprin-2 was sufficient for cell-cell junction localisation and interacted with β-catenin. Disruption of these N-terminal nesprin-2 variants, using siRNA depletion resulted in loss of β-catenin from cell-cell junctions, nuclear accumulation of active β-catenin and augmented β-catenin transcriptional activity. Importantly, we show that U2OS cells lack nesprin-2 giant, suggesting that the N-terminal nesprin-2 variants regulate β-catenin signalling independently of the NE. Together, these data identify N-terminal nesprin-2 variants as novel regulators of β-catenin signalling that tether β-catenin to cell-cell contacts to inhibit β-catenin transcriptional activity. - Highlights: • N-terminal nesprin-2 variants display cell specific expression patterns. • N-terminal spectrin repeats of nesprin-2 interact with β-catenin. • N-terminal nesprin-2 variants scaffold β-catenin at cell-cell junctions.. • Nesprin-2 variants play multiple roles in β-catenin signalling.
Adenovirus fibre shaft sequences fold into the native triple beta-spiral fold when N-terminally fused to the bacteriophage T4 fibritin foldon trimerisation motif.

Science.gov (United States)

Papanikolopoulou, Katerina; Teixeira, Susana; Belrhali, Hassan; Forsyth, V Trevor; Mitraki, Anna; van Raaij, Mark J

2004-09-03

Adenovirus fibres are trimeric proteins that consist of a globular C-terminal domain, a central fibrous shaft and an N-terminal part that attaches to the viral capsid. In the presence of the globular C-terminal domain, which is necessary for correct trimerisation, the shaft segment adopts a triple beta-spiral conformation. We have replaced the head of the fibre by the trimerisation domain of the bacteriophage T4 fibritin, the foldon. Two different fusion constructs were made and crystallised, one with an eight amino acid residue linker and one with a linker of only two residues. X-ray crystallographic studies of both fusion proteins shows that residues 319-391 of the adenovirus type 2 fibre shaft fold into a triple beta-spiral fold indistinguishable from the native structure, although this is now resolved at a higher resolution of 1.9 A. The foldon residues 458-483 also adopt their natural structure. The intervening linkers are not well ordered in the crystal structures. This work shows that the shaft sequences retain their capacity to fold into their native beta-spiral fibrous fold when fused to a foreign C-terminal trimerisation motif. It provides a structural basis to artificially trimerise longer adenovirus shaft segments and segments from other trimeric beta-structured fibre proteins. Such artificial fibrous constructs, amenable to crystallisation and solution studies, can offer tractable model systems for the study of beta-fibrous structure. They can also prove useful for gene therapy and fibre engineering applications.

The genomic sequence of ectromelia virus, the causative agent of mousepox

International Nuclear Information System (INIS)

Chen Nanhai; Danila, Maria I.; Feng Zehua; Buller, R. Mark L.; Wang Chunlin; Han Xiaosi; Lefkowitz, Elliot J.; Upton, Chris

2003-01-01

Ectromelia virus is the causative agent of mousepox, an acute exanthematous disease of mouse colonies in Europe, Japan, China, and the U.S. The Moscow, Hampstead, and NIH79 strains are the most thoroughly studied with the Moscow strain being the most infectious and virulent for the mouse. In the late 1940s mousepox was proposed as a model for the study of the pathogenesis of smallpox and generalized vaccinia in humans. Studies in the last five decades from a succession of investigators have resulted in a detailed description of the virologic and pathologic disease course in genetically susceptible and resistant inbred and out-bred mice. We report the DNA sequence of the left-hand end, the predicted right-hand terminal repeat, and central regions of the genome of the Moscow strain of ectromelia virus (approximately 177,500 bp), which together with the previously sequenced right-hand end, yields a genome of 209,771 bp. We identified 175 potential genes specifying proteins of between 53 and 1924 amino acids, and 29 regions containing sequences related to genes predicted in other poxviruses, but unlikely to encode for functional proteins in ectromelia virus. The translated protein sequences were compared with the protein database for structure/function relationships, and these analyses were used to investigate poxvirus evolution and to attempt to explain at the cellular and molecular level the well-characterized features of the ectromelia virus natural life cycle
Resistance to Change and Preference for Variable versus Fixed Response Sequences

Science.gov (United States)

Arantes, Joana; Berg, Mark E.; Le, Dien; Grace, Randolph C.

2012-01-01

In Experiment 1, 4 pigeons were trained on a multiple chain schedule in which the initial link was a variable-interval (VI) 20-s schedule signalled by a red or green center key, and terminal links required four responses made to the left (L) and/or right (R) keys. In the REPEAT component, signalled by red keylights, only LRLR terminal-link…
An Evolutionarily Young Polar Bear (Ursus maritimus Endogenous Retrovirus Identified from Next Generation Sequence Data

Directory of Open Access Journals (Sweden)

Kyriakos Tsangaras

2015-11-01

Full Text Available Transcriptome analysis of polar bear (Ursus maritimus tissues identified sequences with similarity to Porcine Endogenous Retroviruses (PERV. Based on these sequences, four proviral copies and 15 solo long terminal repeats (LTRs of a newly described endogenous retrovirus were characterized from the polar bear draft genome sequence. Closely related sequences were identified by PCR analysis of brown bear (Ursus arctos and black bear (Ursus americanus but were absent in non-Ursinae bear species. The virus was therefore designated UrsusERV. Two distinct groups of LTRs were observed including a recombinant ERV that contained one LTR belonging to each group indicating that genomic invasions by at least two UrsusERV variants have recently occurred. Age estimates based on proviral LTR divergence and conservation of integration sites among ursids suggest the viral group is only a few million years old. The youngest provirus was polar bear specific, had intact open reading frames (ORFs and could potentially encode functional proteins. Phylogenetic analyses of UrsusERV consensus protein sequences suggest that it is part of a pig, gibbon and koala retrovirus clade. The young age estimates and lineage specificity of the virus suggests UrsusERV is a recent cross species transmission from an unknown reservoir and places the viral group among the youngest of ERVs identified in mammals.
Large scale analysis of small repeats via mining of the human genome

NARCIS (Netherlands)

van den Berg, I.; Bosnacki, D.; Hilbers, P.A.J.

2009-01-01

Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, both in coding and in non-coding regions. Their role is still mostly unknown, but at least 20 of those repetitive sequences have been related to neurodegenerative disorders. The mutational process that is
Genetic diversity among Puccinia melanocephala isolates from Brazil assessed using simple sequence repeat markers.

Science.gov (United States)

Peixoto-Junior, R F; Creste, S; Landell, M G A; Nunes, D S; Sanguino, A; Campos, M F; Vencovsky, R; Tambarussi, E V; Figueira, A

2014-09-26

Brown rust (causal agent Puccinia melanocephala) is an important sugarcane disease that is responsible for large losses in yield worldwide. Despite its importance, little is known regarding the genetic diversity of this pathogen in the main Brazilian sugarcane cultivation areas. In this study, we characterized the genetic diversity of 34 P. melanocephala isolates from 4 Brazilian states using loci identified from an enriched simple sequence repeat (SSR) library. The aggressiveness of 3 isolates from major sugarcane cultivation areas was evaluated by inoculating an intermediately resistant and a susceptible cultivar. From the enriched library, 16 SSR-specific primers were developed, which produced scorable alleles. Of these, 4 loci were polymorphic and 12 were monomorphic for all isolates evaluated. The molecular characterization of the 34 isolates of P. melanocephala conducted using 16 SSR loci revealed the existence of low genetic variability among the isolates. The average estimated genetic distance was 0.12. Phenetic analysis based on Nei's genetic distance clustered the isolates into 2 major groups. Groups I and II included 18 and 14 isolates, respectively, and both groups contained isolates from all 4 geographic regions studied. Two isolates did not cluster with these groups. It was not possible to obtain clusters according to location or state of origin. Analysis of disease severity data revealed that the isolates did not show significant differences in aggressiveness between regions.
Massively parallel sequencing of forensic STRs

DEFF Research Database (Denmark)

Parson, Walther; Ballard, David; Budowle, Bruce

2016-01-01

The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that...
Antral content, secretion and peripheral metabolism of N-terminal progastrin fragments

DEFF Research Database (Denmark)

Goetze, Jens Peter; Hansen, Carsten Palnaes; Rehfeld, Jens F

2006-01-01

OBJECTIVES: In addition to the acid-stimulatory gastrins, progastrin also release N-terminal fragments. In order to examine the cellular content, secretion and peripheral metabolism of these fragments, we developed an immunoassay specific for the N-terminal sequence of human progastrin. RESULTS......-terminal progastrin fragments. The basal concentration of N-terminal fragments in normal human plasma was almost 30-fold higher than that of the amidated, acid-stimulatory gastrins (286 pmol/l versus 9.8 pmol/l, n=26, P...-35 in circulation was 30 min, and a pig model revealed the kidneys and the vasculature to the head as the primary sites of degradation. CONCLUSION: The cellular and circulatory concentration profiles of N-terminal progastrin fragments differ markedly from those of the acid-stimulatory gastrins. The high basal...
Analysis of CR1 Repeats in the Zebra Finch Genome

Directory of Open Access Journals (Sweden)

George E. Liu

2013-06-01

Full Text Available Most bird species have smaller genomes and fewer repeats than mammals. Chicken Repeat 1 (CR1 repeat is one of the most abundant families of repeats, ranging from ~133,000 to ~187,000 copies accounting for ~50 to ~80% of the interspersed repeats in the zebra finch and chicken genomes, respectively. CR1 repeats are believed to have arisen from the retrotransposition of a small number of master elements, which gave rise to multiple CR1 subfamilies in the chicken. In this study, we performed a global assessment of the divergence distributions, phylogenies, and consensus sequences of CR1 repeats in the zebra finch genome. We identified and validated 34 CR1 subfamilies and further analyzed the correlation between these subfamilies. We also discovered 4 novel lineage-specific CR1 subfamilies in the zebra finch when compared to the chicken genome. We built various evolutionary trees of these subfamilies and concluded that CR1 repeats may play an important role in reshaping the structure of bird genomes.
Variable number of tandem repeat markers in the genome sequence of Mycosphaerella fijiensis, the causal agent of black leaf streak disease of banana (Musa spp).

Science.gov (United States)

Garcia, S A L; Van der Lee, T A J; Ferreira, C F; Te Lintel Hekkert, B; Zapater, M-F; Goodwin, S B; Guzmán, M; Kema, G H J; Souza, M T

2010-11-09

We searched the genome of Mycosphaerella fijiensis for molecular markers that would allow population genetics analysis of this plant pathogen. M. fijiensis, the causal agent of banana leaf streak disease, also known as black Sigatoka, is the most devastating pathogen attacking bananas (Musa spp). Recently, the entire genome sequence of M. fijiensis became available. We screened this database for VNTR markers. Forty-two primer pairs were selected for validation, based on repeat type and length and the number of repeat units. Five VNTR markers showing multiple alleles were validated with a reference set of isolates from different parts of the world and a population from a banana plantation in Costa Rica. Polymorphism information content values varied from 0.6414 to 0.7544 for the reference set and from 0.0400 and 0.7373 for the population set. Eighty percent of the polymorphism information content values were above 0.60, indicating that the markers are highly informative. These markers allowed robust scoring of agarose gels and proved to be useful for variability and population genetics studies. In conclusion, the strategy we developed to identify and validate VNTR markers is an efficient means to incorporate markers that can be used for fungicide resistance management and to develop breeding strategies to control banana black leaf streak disease. This is the first report of VNTR-minisatellites from the M. fijiensis genome sequence.
Sequencing and generation of an infectious clone of the pathogenic goose parvovirus strain LH.

Science.gov (United States)

Wang, Jianye; Duan, Jinkun; Zhu, Liqian; Jiang, Zhiwei; Zhu, Guoqiang

2015-03-01

In this study, the complete genome of the virulent strain LH of goose parvovirus (GPV) was sequenced and cloned into the pBluescript II (SK) plasmid vector. Sequence alignments of the inverted terminal repeats (ITR) of GPV strains revealed a common 14-nt-pair deletion in the stem of the palindromic structure in the LH strain and three other strains isolated after 1982 when compared to three GPV strains isolated earlier than that time. Transfection of 11-day-old embryonated goose eggs with the plasmid pLH, which contains the entire genome of strain LH, resulted in successful rescue of the infectious virus. Death of embryos after transfection via the chorioallantoic membrane infiltration route occurred earlier than when transfection was done via the allantoic cavity inoculation route. The rescued virus exhibited virulence similar to that of its parental virus, as evaluated by the mortality rate in goslings. Generation of the pathogenic infectious clone provides us with a powerful tool to elucidate the molecular pathogenesis of GPV in the future.
High-Pressure NMR and SAXS Reveals How Capping Modulates Folding Cooperativity of the pp32 Leucine-rich Repeat Protein.

Science.gov (United States)

Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A

2018-04-27

Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.
Simple Sequence Repeat Analysis of Selected NSIC-registered Coffee Varieties in the Philippines

Directory of Open Access Journals (Sweden)

Daisy May C. Santos

2016-06-01

Full Text Available Coffee (Coffea sp. is an important commercial crop worldwide. Three species of coffee are used as beverage, namely Coffea arabica, C. canephora, and C. liberica. Coffea arabica L. is the most cultivated among the three coffee species due to its taste quality, rich aroma, and low caffeine content. Despite its inferior taste and aroma, C. canephora Pierre ex A. Froehner, which has the highest caffeine content, is the second most widely cultivated because of its resistance to coffee diseases. On the other hand, C. liberica W.Bull ex Hierncomes is characterized by its very strong taste and flavor. The Philippines used to be a leading exporter of coffee until coffee rust destroyed the farms in Batangas, home of the famous Kapeng Barako. The country has been attempting to revive the coffee industry by focusing on the production of specialty coffee with registered varieties on the National Seed Industry Council (NSIC. Correct identification and isolation of pure coffee beans are the main factors that determine coffee’s market value. Local farms usually misidentify and mix coffee beans of different varieties, leading to the depreciation of their value. This study used simple sequence repeat (SSR markers to evaluate and distinguish Philippine NSIC-registered coffee species and varieties. The neighbor-joining tree generated using PAUP showed high bootstrap support, separating C. arabica, C. canephora, and C. liberica from each other. Among the twenty primer pairs used, seven were able to distinguish C. arabica, nine for C. liberica, and one for C. canephora.
In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

Science.gov (United States)

Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

2015-10-01

Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as for example the Fragile-X syndrome. Until now the mechanisms are not clearly understood as to how these expansions develop during cell proliferation. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promotor region was similarly condensed like a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. Due to further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of heredity diseases.
Genetic Diversity Assessment and Identification of New Sour Cherry Genotypes Using Intersimple Sequence Repeat Markers

Directory of Open Access Journals (Sweden)

Roghayeh Najafzadeh

2014-01-01

Full Text Available Iran is one of the chief origins of subgenus Cerasus germplasm. In this study, the genetic variation of new Iranian sour cherries (which had such superior growth characteristics and fruit quality as to be considered for the introduction of new cultivars was investigated and identified using 23 intersimple sequence repeat (ISSR markers. Results indicated a high level of polymorphism of the genotypes based on these markers. According to these results, primers tested in this study specially ISSR-4, ISSR-6, ISSR-13, ISSR-14, ISSR-16, and ISSR-19 produced good and various levels of amplifications which can be effectively used in genetic studies of the sour cherry. The genetic similarity among genotypes showed a high diversity among the genotypes. Cluster analysis separated improved cultivars from promising Iranian genotypes, and the PCoA supported the cluster analysis results. Since the Iranian genotypes were superior to the improved cultivars and were separated from them in most groups, these genotypes can be considered as distinct genotypes for further evaluations in the framework of breeding programs and new cultivar identification in cherries. Results also confirmed that ISSR is a reliable DNA marker that can be used for exact genetic studies and in sour cherry breeding programs.
Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

Science.gov (United States)

Martin, Andrew C R

2014-01-01

The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.
Analysis of proteolytic processes and enzymatic activities in the generation of huntingtin n-terminal fragments in an HEK293 cell model.

Directory of Open Access Journals (Sweden)

Andrew T N Tebbenkamp

Full Text Available N-terminal fragments of mutant huntingtin (htt that terminate between residues 90-115, termed cleavage product A or 1 (cp-A/1, form intracellular and intranuclear inclusion bodies in the brains of patients with Huntington's disease (HD. These fragments appear to be proteolytic products of the full-length protein. Here, we use an HEK293 cell culture model to investigate huntingtin proteolytic processing; previous studies of these cells have demonstrated cleavage of htt to cp-A/1 like htt fragments.Recombinant N-terminal htt fragments, terminating at residue 171 (also referred to as cp-B/2 like, were efficiently cleaved to produce cp-A/1 whereas fragments representing endogenous caspase, calpain, and metalloproteinase cleavage products, terminating between residues 400-600, were inefficiently cleaved. Using cysteine-labeling techniques and antibody binding mapping, we localized the C-terminus of the cp-A/1 fragments produced by HEK293 cells to sequences minimally limited by cysteine 105 and an antibody epitope composed of residues 115-124. A combination of genetic and pharmacologic approaches to inhibit potential proteases, including γ-secretase and calpain, proved ineffective in preventing production of cp-A/1.Our findings indicate that HEK293 cells express a protease that is capable of efficiently cleaving cp-B/2 like fragments of htt with normal or expanded glutamine repeats. For reasons that remain unclear, this protease cleaves longer htt fragments, with normal or expanded glutamine expansions, much less efficiently. The protease in HEK293 cells that is capable of generating a cp-A/1 like htt fragment may be a novel protease with a high preference for a cp-B/2-like htt fragment as substrate.
Structural analyses of the Ankyrin Repeat Domain of TRPV6 and related TRPV ion channels†‡

OpenAIRE

Phelps, Christopher B.; Huang, Robert J.; Lishko, Polina V.; Wang, Ruiqi R.; Gaudet, Rachelle

2008-01-01

Transient Receptor Potential (TRP) proteins are cation channels composed of a transmembrane domain flanked by large N- and C-terminal cytoplasmic domains. All members of the vanilloid family of TRP channels (TRPV) possess an N-terminal ankyrin repeat domain (ARD). The ARD of mammalian TRPV6, an important regulator of calcium uptake and homeostasis, is essential for channel assembly and regulation. The 1.7 Å crystal structure of the TRPV6-ARD reveals conserved structural elements unique to the...
Heterogeneity of rat tropoelastin mRNA revealed by cDNA cloning

International Nuclear Information System (INIS)

Pierce, R.A.; Deak, S.B.; Stolle, C.A.; Boyd, C.D.

1990-01-01

A λgt11 library constructed from poly(A+) RNA isolated from aortic tissue of neonatal rats was screened for rat tropoelastin cDNAs. The first, screen, utilizing a human tropoelastin cDNA clone, provided rat tropoelastin cDNAs spanning 2.3 kb of carboxy-terminal coding sequence and extended into the 3'-untranslated region. A subsequent screen using a 5' rat tropoelastin cDNA clone yielded clones extending into the amino-terminal signal sequence coding region. Sequence analysis of these clones has provided the complete derived amino acid sequence of rat tropoelastin and allowed alignment and comparison with published bovine cDNA sequence. While the overall structure of rat tropoelastin is similar to bovine sequence, numerous substitutions, deletions, and insertions demonstrated considerable heterogeneity between species. In particular, the pentapeptide repeat VPGVG, characteristic of all tropoelastins analyzed to date, is replaced in rat tropoelastin by a repeating pentapeptide, IPGVG. The hexapeptide repeat VGVAPG, the bovine elastin receptor binding peptide, is not encoded by rat tropoelastin cDNAs. Variations in coding sequence between rat tropoelastin CDNA clones were also found which may represent mRNA heterogeneity produced by alternative splicing of the rat tropoelastin pre-mRNA
ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

Science.gov (United States)

Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

2012-11-07

Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40
Capillary electrophoresis fragment analysis and clone sequencing in detection of dynamic mutations of spinocerebellar ataxia

Directory of Open Access Journals (Sweden)

Yuan-yuan CHEN

2018-04-01

Full Text Available Objective To estimate the accuracy and stability of capillary electrophoresis fragment analysis and clone sequencing in detecting dynamic mutations of spinocerebellar ataxia (SCA. Methods Capillary electrophoresis fragment analysis and clone sequencing were used in detecting trinucleotide repeated sequence of 14 SCA patients (3 cases of SCA2, 2 cases of SCA7, 7 cases of SCA8 and 2 cases of SCA17. Results Capillary electrophoresis fragment analysis of 3 SCA2 cases showed the expanded cytosine-adenine-guanine (CAG repeats were 31, 30 and 32, and the copy numbers of 3 clone sequencing for 3 colonies in each case were 37/40/40, 37/38/39 and 38/39/40 respectively. Capillary electrophoresis fragment analysis of 2 SCA7 cases showed the expanded CAG repeats were 57 and 34, and the copy numbers of repeats were 69, 74, 75 in 3 colonies of one case, and was 45 in the other case. For the 7 SCA8 cases with the expanded cytosine-thymine-adenine (CTA/cytosine-thymine-guanine (CTG repeats of 99, 111, 104, 92, 89, 104 and 75, the results of clone sequencing were 97, 116, 104, 90, 90, 102 and 76 respectively. For 2 SCA17 cases with the short/expanded CAG repeats of 37/50 and 36/45, the results of clone sequencing were 51/50/52 and 45/44 for 3 and 2 colonies. Conclusions Although the higher mobility of polymerase chain reaction (PCR products containing dynamic mutation in the capillary electrophoresis fragment analysis might cause the deviation for analysis of copy numbers, the deviation was predictable and the results were repeatable. The clone sequencing results showed obvious instability, especially for SCA2 and SCA7 genes, which might owing to their simple CAG repeats. Consequently, clone sequencing is not suited for detection of dynamic mutation, not to mention the quantitative criteria of dynamic mutation sequencing. DOI: 10.3969/j.issn.1672-6731.2018.03.008

Localization of Daucus carota NMCP1 to the nuclear periphery: the role of the N-terminal region and an NLS-linked sequence motif, RYNLRR, in the tail domain

Directory of Open Access Journals (Sweden)

Yuta eKimura

2014-02-01

Full Text Available Recent ultrastructural studies revealed that a structure similar to the vertebrate nuclear lamina exists in the nuclei of higher plants. However, plant genomes lack genes for lamins and intermediate-type filament proteins, and this suggests that plant-specific nuclear coiled-coil proteins make up the lamina-like structure in plants. NMCP1 is a protein, first identified in Daucus carota cells, that localizes exclusively to the nuclear periphery in interphase cells. It has a tripartite structure comprised of head, rod, and tail domains, and includes putative nuclear localization signal (NLS motifs. We identified the functional NLS of DcNMCP1 (carrot NMCP1 and determined the protein regions required for localizing to the nuclear periphery using EGFP-fused constructs transiently expressed in Apium graveolens epidermal cells. Transcription was driven under a CaMV35S promoter, and the genes were introduced into the epidermal cells by a DNA-coated microprojectile delivery system. Of the NLS motifs, KRRRK and RRHK in the tail domain were highly functional for nuclear localization. Addition of the N-terminal 141 amino acids from DcNMCP1 shifted the localization of a region including these NLSs from the entire nucleus to the nuclear periphery. Using this same construct, the replacement of amino acids in RRHK or its preceding sequence, YNL, with alanine residues abolished localization to the nuclear periphery, while replacement of KRRRK did not affect localization. The sequence R/Q/HYNLRR/H, including YNL and the first part of the sequence of RRHK, is evolutionarily conserved in a subclass of NMCP1 sequences from many plant species. These results show that NMCP1 localizes to the nuclear periphery by a combined action of a sequence composed of R/Q/HYNLRR/H, NLS, and the N-terminal region including the head and a portion of the rod domain, suggesting that more than one binding site is implicated in localization of NMCP1.
Evaluation of genetic diversity amongst Descurainia sophia L. genotypes by inter-simple sequence repeat (ISSR) marker.

Science.gov (United States)

Saki, Sahar; Bagheri, Hedayat; Deljou, Ali; Zeinalabedini, Mehrshad

2016-01-01

Descurainia sophia is a valuable medicinal plant in family of Brassicaceae. To determine the range of diversity amongst D. sophia in Iran, 32 naturally distributed plants belonging to six natural populations of the Iranian plateau were investigated by inter-simple sequence repeat (ISSR) markers. The average percentage of polymorphism produced by 12 ISSR primers was 86 %. The PIC values for primers ranged from 0.22 to 0.40 and Rp values ranged between 6.5 and 19.9. The relative genetic diversity of the populations was not high (Gst =0.32). However, the value of gene flow revealed by the ISSR marker was high (Nm = 1.03). UPGMA clustering method based on Jaccard similarity coefficient grouped the genotypes into two major clusters. Graph results from Neighbor-Net Network generated after a 1000 bootstrap test using Jaccard coefficient, and STRUCTURE analysis confirmed the UPGMA clustering. The first three PCAs represented 57.31 % of the total variation. The high levels of genetic diversity were observed within populations, which is useful in breeding and conservation programs. ISSR is found to be an eligible marker to study genetic diversity of D. sophia.
TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Science.gov (United States)

Pellegrini, Marco; Renda, M Elena; Vecchio, Alessio

2010-06-15

Genomes in higher eukaryotic organisms contain a substantial amount of repeated sequences. Tandem Repeats (TRs) constitute a large class of repetitive sequences that are originated via phenomena such as replication slippage and are characterized by close spatial contiguity. They play an important role in several molecular regulatory mechanisms, and also in several diseases (e.g. in the group of trinucleotide repeat disorders). While for TRs with a low or medium level of divergence the current methods are rather effective, the problem of detecting TRs with higher divergence (fuzzy TRs) is still open. The detection of fuzzy TRs is propaedeutic to enriching our view of their role in regulatory mechanisms and diseases. Fuzzy TRs are also important as tools to shed light on the evolutionary history of the genome, where higher divergence correlates with more remote duplication events. We have developed an algorithm (christened TRStalker) with the aim of detecting efficiently TRs that are hard to detect because of their inherent fuzziness, due to high levels of base substitutions, insertions and deletions. To attain this goal, we developed heuristics to solve a Steiner version of the problem for which the fuzziness is measured with respect to a motif string not necessarily present in the input string. This problem is akin to the 'generalized median string' that is known to be an NP-hard problem. Experiments with both synthetic and biological sequences demonstrate that our method performs better than current state of the art for fuzzy TRs and that the fuzzy TRs of the type we detect are indeed present in important biological sequences. TRStalker will be integrated in the web-based TRs Discovery Service (TReaDS) at bioalgo.iit.cnr.it. Supplementary data are available at Bioinformatics online.
Sequence diversities of serine-aspartate repeat genes among Staphylococcus aureus isolates from different hosts presumably by horizontal gene transfer.

Directory of Open Access Journals (Sweden)

Huping Xue

Full Text Available BACKGROUND: Horizontal gene transfer (HGT is recognized as one of the major forces for bacterial genome evolution. Many clinically important bacteria may acquire virulence factors and antibiotic resistance through HGT. The comparative genomic analysis has become an important tool for identifying HGT in emerging pathogens. In this study, the Serine-Aspartate Repeat (Sdr family has been compared among different sources of Staphylococcus aureus (S. aureus to discover sequence diversities within their genomes. METHODOLOGY/PRINCIPAL FINDINGS: Four sdr genes were analyzed for 21 different S. aureus strains and 218 mastitis-associated S. aureus isolates from Canada. Comparative genomic analyses revealed that S. aureus strains from bovine mastitis (RF122 and mastitis isolates in this study, ovine mastitis (ED133, pig (ST398, chicken (ED98, and human methicillin-resistant S. aureus (MRSA (TCH130, MRSA252, Mu3, Mu50, N315, 04-02981, JH1 and JH9 were highly associated with one another, presumably due to HGT. In addition, several types of insertion and deletion were found in sdr genes of many isolates. A new insertion sequence was found in mastitis isolates, which was presumably responsible for the HGT of sdrC gene among different strains. Moreover, the sdr genes could be used to type S. aureus. Regional difference of sdr genes distribution was also indicated among the tested S. aureus isolates. Finally, certain associations were found between sdr genes and subclinical or clinical mastitis isolates. CONCLUSIONS: Certain sdr gene sequences were shared in S. aureus strains and isolates from different species presumably due to HGT. Our results also suggest that the distributional assay of virulence factors should detect the full sequences or full functional regions of these factors. The traditional assay using short conserved regions may not be accurate or credible. These findings have important implications with regard to animal husbandry practices that may
The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats.

Directory of Open Access Journals (Sweden)

Andrew J Alverson

2011-01-01

Full Text Available The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean, and show that despite its unexceptional size (401,262 nt, the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.
Transferability of simple sequence repeat (SSR) markers developed in guava (Psidium guajava L.) to four Myrtaceae species.

Science.gov (United States)

Rai, Manoj K; Phulwaria, Mahendra; Shekhawat, N S

2013-08-01

Present study demonstrated the cross-genera transferability of 23 simple sequence repeat (SSR) primer pairs developed for guava (Psidium guajava L.) to four new targets, two species of eucalypts (Eucalyptus citriodora, Eucalyptus camaldulensis), bottlebrush (Callistemon lanceolatus) and clove (Syzygium aromaticum), belonging to the family Myrtaceae and subfamily Myrtoideae. Off the 23 SSR loci assayed, 18 (78.2%) gave cross-amplification in E. citriodora, 14 (60.8%) in E. camaldulensis and 17-17 (73.9%) in C. lanceolatus and S. aromaticum. Eight primer pairs were found to be transferable to all four species. The number of alleles detected at each locus ranged from one to nine, with an average of 4.8, 2.6, 4.5 and 4.6 alleles in E. citriodora, E. camaldulensis, C. lanceolatus and S. aromaticum, respectively. The high levels of cross-genera transferability of guava SSRs may be applicable for the analysis of intra- and inter specific genetic diversity of target species, especially in E. citriodora, C. lanceolatus and S. aromaticum, for which till date no information about EST-derived as well as genomic SSR is available.
Plasmid P1 replication: negative control by repeated DNA sequences.

OpenAIRE

Chattoraj, D; Cordes, K; Abeles, A

1984-01-01

The incompatibility locus, incA, of the unit-copy plasmid P1 is contained within a fragment that is essentially a set of nine 19-base-pair repeats. One or more copies of the fragment destabilizes the plasmid when present in trans. Here we show that extra copies of incA interfere with plasmid DNA replication and that a deletion of most of incA increases plasmid copy number. Thus, incA is not essential for replication but is required for its control. When cloned in a high-copy-number vector, pi...
Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells

DEFF Research Database (Denmark)

Rodriguez, Jairo; Vives, Laura; Jordà, Mireia

2008-01-01

Methylation of the cytosine is the most frequent epigenetic modification of DNA in mammalian cells. In humans, most of the methylated cytosines are found in CpG-rich sequences within tandem and interspersed repeats that make up to 45% of the human genome, being Alu repeats the most common family....
Pick-up ion energization at the termination shock

Energy Technology Data Exchange (ETDEWEB)

Gary, S Peter [Los Alamos National Laboratory; Winske, Dan [Los Alamos National Laboratory; Wu, Pin [BOSTON UNIV.; Schwadron, N A [BOSTON UNIV.

2009-01-01

One-dimensional hybrid simulations are used to investigate how pickup ions are energized at the perpendicular termination shock. Contrary to previous models based on pickup ion energy gain by repeated crossings of the shock front (shock surfing) or due to a reforming shock front, the present simulations show that pickup ion energy gain involves a gyro-phasedependent interaction with the inhomogeneous motional electric field at the shock. The process operates at all relative concentrations of pickup ion density.
Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

Science.gov (United States)

de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

2014-06-01

The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers.

Science.gov (United States)

Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian ( Durio zibethinus ) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, H E = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10 -3 . Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
Rendezvous terminal phase automatic braking sequencing and targeting. [for space shuttle orbiter

Science.gov (United States)

Kachmar, P. M.

1973-01-01

The purpose of the rendezvous terminal phase braking program is to provide the means of automatically bringing the primary orbiter within desired station keeping boundaries relative to the target satellite. A detailed discussion is presented on the braking program and its navigation, targeting, and guidance functions.
Genetic characterization of autochthonous grapevine cultivars from Eastern Turkey by simple sequence repeats (SSRs

Directory of Open Access Journals (Sweden)

Sadiye Peral Eyduran

2016-01-01

Full Text Available In this research, two well-recognized standard grape cultivars, Cabernet Sauvignon and Merlot, together with eight historical autochthonous grapevine cultivars from Eastern Anatolia in Turkey, were genetically characterized by using 12 pairs of simple sequence repeat (SSR primers in order to evaluate their genetic diversity and relatedness. All of the used SSR primers produced successful amplifications and revealed DNA polymorphisms, which were subsequently utilized to evaluate the genetic relatedness of the grapevine cultivars. Allele richness was implied by the identification of 69 alleles in 8 autochthonous cultivars with a mean value of 5.75 alleles per locus. The average expected heterozygosity and observed heterozygosity were found to be 0.749 and 0.739, respectively. Taking into account the generated alleles, the highest number was recorded in VVC2C3 and VVS2 loci (nine and eight alleles per locus, respectively, whereas the lowest number was recorded in VrZAG83 (three alleles per locus. Two main clusters were produced by using the unweighted pair-group method with arithmetic mean dendrogram constructed on the basis of the SSR data. Only Cabernet Sauvignon and Merlot cultivars were included in the first cluster. The second cluster involved the rest of the autochthonous cultivars. The results obtained during the study illustrated clearly that SSR markers have verified to be an effective tool for fingerprinting grapevine cultivars and carrying out grapevine biodiversity studies. The obtained data are also meaningful references for grapevine domestication.
On balanced minimal repeated measurements designs

Directory of Open Access Journals (Sweden)

Shakeel Ahmad Mir

2014-10-01

Full Text Available Repeated Measurements designs are concerned with scientific experiments in which each experimental unit is assigned more than once to a treatment either different or identical. This class of designs has the property that the unbiased estimators for elementary contrasts among direct and residual effects are obtainable. Afsarinejad (1983 provided a method of constructing balanced Minimal Repeated Measurements designs p < t , when t is an odd or prime power, one or more than one treatment may occur more than once in some sequences and designs so constructed no longer remain uniform in periods. In this paper an attempt has been made to provide a new method to overcome this drawback. Specifically, two cases have been considered RM[t,n=t(t-t/(p-1,p], λ2=1 for balanced minimal repeated measurements designs and RM[t,n=2t(t-t/(p-1,p], λ2=2 for balanced repeated measurements designs. In addition , a method has been provided for constructing extra-balanced minimal designs for special case RM[t,n=t2/(p-1,p], λ2=1.
Solar Terminator Waves in the Ionosphere Measured by the Wallops Island, VA Dynasonde

Science.gov (United States)

Zabotin, N. A.; Song, H.; Bullett, T. W.

2017-12-01

Solar terminator represents a unique source of atmospheric waves possessing of near-ideal coherent properties: its geometry and magnitude of the impact changes very little from day to day. This feature has been used in [Forbes et al., GRL, 2008] to obtain "snapshots" of terminator waves in the neutral atmosphere at the altitude 400 km by averaging CHAMP accelerometer data over relatively long sequences of the satellite passes. The results were represented in the geographic latitude vs local time coordinates. We apply a similar approach averaging time series of Wallops Island, VA Dynasonde Doppler data to obtain "snapshots" of terminator waves in the ionosphere in the true altitude vs local "terminator time" coordinates. The averaging is performed independently for every month of the yearlong observation period from May 2013 to April 2014. The altitude range covered is 90 km to 400 km with 2 km resolution, representing the entire bottom-side ionosphere. Individual local time segments used for the averaging were 12 hours long and all centered at the times of the sunrise or sunset terminator passing at every specific altitude. This procedure effectively suppresses all kinds of incoherent wave activity and allows one to reveal the perturbation phenomenon mainly caused by the solar terminator. This is an important advantage of this technique compared to multiple "terminator wave" studies based on simple time coincidence. Both sunrise and sunset terminator waves are easily visualized in all of the monthly images. Our results confirm observations of [Forbes et al., GRL, 2008] of the wave structures existing on both sides of the terminator. The phase fronts of the sunset terminator wave are propagating downward indicating upward movement of the terminator-related disturbance and of the wave energy generated by it. The phase fronts of the sunrise terminator waves are propagating upward indicating downward movement of the terminator-related disturbance and of the wave energy
Sequence analysis of two alleles reveals that intra-and intergenic recombination played a role in the evolution of the radish fertility restorer (Rfo

Directory of Open Access Journals (Sweden)

Budar Françoise

2010-02-01

Full Text Available Abstract Background Land plant genomes contain multiple members of a eukaryote-specific gene family encoding proteins with pentatricopeptide repeat (PPR motifs. Some PPR proteins were shown to participate in post-transcriptional events involved in organellar gene expression, and this type of function is now thought to be their main biological role. Among PPR genes, restorers of fertility (Rf of cytoplasmic male sterility systems constitute a peculiar subgroup that is thought to evolve in response to the presence of mitochondrial sterility-inducing genes. Rf genes encoding PPR proteins are associated with very close relatives on complex loci. Results We sequenced a non-restoring allele (L7rfo of the Rfo radish locus whose restoring allele (D81Rfo was previously described, and compared the two alleles and their PPR genes. We identified a ca 13 kb long fragment, likely originating from another part of the radish genome, inserted into the L7rfo sequence. The L7rfo allele carries two genes (PPR-1 and PPR-2 closely related to the three previously described PPR genes of the restorer D81Rfo allele (PPR-A, PPR-B, and PPR-C. Our results indicate that alleles of the Rfo locus have experienced complex evolutionary events, including recombination and insertion of extra-locus sequences, since they diverged. Our analyses strongly suggest that present coding sequences of Rfo PPR genes result from intragenic recombination. We found that the 10 C-terminal PPR repeats in Rfo PPR gene encoded proteins result from the tandem duplication of a 5 PPR repeat block. Conclusions The Rfo locus appears to experience more complex evolution than its flanking sequences. The Rfo locus and PPR genes therein are likely to evolve as a result of intergenic and intragenic recombination. It is therefore not possible to determine which genes on the two alleles are direct orthologs. Our observations recall some previously reported data on pathogen resistance complex loci.
Genetic sequences derived from suppression subtractive ...

African Journals Online (AJOL)

STORAGESEVER

2008-06-17

Jun 17, 2008 ... their possible roles in Xanthomonas albilineans ... Technology, P. O. Box 1334, Durban 4000, Republic of South Africa. Accepted 4 ... Clones selected were sequenced (using a Perkin Elmer ABI PRISM Dye terminator cycle.
Abrupt climate changes during Termination III in Southern Europe.

Science.gov (United States)

Pérez-Mejías, Carlos; Moreno, Ana; Sancho, Carlos; Bartolomé, Miguel; Stoll, Heather; Cacho, Isabel; Cheng, Hai; Edwards, R Lawrence

2017-09-19

The Late Quaternary glacial-interglacial transitions represent the highest amplitude climate changes over the last million years. Unraveling the sequence of events and feedbacks at Termination III (T-III), including potential abrupt climate reversals similar to those of the last Termination, has been particularly challenging due to the scarcity of well-dated records worldwide. Here, we present speleothem data from southern Europe covering the interval from 262.7 to 217.9 kyBP, including the transition from marine isotope stage (MIS) 8 to MIS 7e. High-resolution δ 13 C, δ 18 O, and Mg/Ca profiles reveal major millennial-scale changes in aridity manifested in changing water availability and vegetation productivity. uranium-thorium dates provide a solid chronology for two millennial-scale events (S8.1 and S8.2) which, compared with the last two terminations, has some common features with Heinrich 1 and Heinrich 2 in Termination I (T-I).
Structural features in the HIV-1 repeat region facilitate strand transfer during reverse transcription

NARCIS (Netherlands)

Berkhout, B.; Vastenhouw, N. L.; Klasens, B. I.; Huthoff, H.

2001-01-01

Two obligatory DNA strand transfers take place during reverse transcription of a retroviral RNA genome. The first strand transfer is facilitated by terminal repeat (R) elements in the viral genome. This strand-transfer reaction depends on base pairing between the cDNA of the 5'R and the 3'R. There
Production of carboxy-terminal specific antiserum against glucagon

International Nuclear Information System (INIS)

Liu Yibing; Han Shiquan

1993-01-01

To produce carboxy-terminal specific antisera against glucagon was coupled mainly via its amino terminal histidine to thyroglobulin, using the amino group reactive pentandiol at pH 7.0 for the conjugation procedure. After repeated immunization of guinea pigs and rabbits, the antisera were obtained. The titer of guinea pig antiserum against glucagon was 1:3000-1:35000 and affinity constant was 9.3 x 10 10 -11.4 x 10 10 l · mol -1 . There were no cross reaction with GIP, INS, Copeptide and gastrin. The titer of rabbit antiserum against glucagon was 1:900-1:9000 and affinity constant was 0.36 x 10 10 -3.9 x 10 10 l · mol -1 . There were no cross reaction with INS, C-peptide and gastrin. The cross reaction with GIP was 0.02%

Cloaked similarity between HIV-1 and SARS-CoV suggests an anti-SARS strategy

Directory of Open Access Journals (Sweden)

Kliger Yossef

2003-09-01

Full Text Available Abstract Background Severe acute respiratory syndrome (SARS is a febrile respiratory illness. The disease has been etiologically linked to a novel coronavirus that has been named the SARS-associated coronavirus (SARS-CoV, whose genome was recently sequenced. Since it is a member of the Coronaviridae, its spike protein (S2 is believed to play a central role in viral entry by facilitating fusion between the viral and host cell membranes. The protein responsible for viral-induced membrane fusion of HIV-1 (gp41 differs in length, and has no sequence homology with S2. Results Sequence analysis reveals that the two viral proteins share the sequence motifs that construct their active conformation. These include (1 an N-terminal leucine/isoleucine zipper-like sequence, and (2 a C-terminal heptad repeat located upstream of (3 an aromatic residue-rich region juxtaposed to the (4 transmembrane segment. Conclusions This study points to a similar mode of action for the two viral proteins, suggesting that anti-viral strategy that targets the viral-induced membrane fusion step can be adopted from HIV-1 to SARS-CoV. Recently the FDA approved Enfuvirtide, a synthetic peptide corresponding to the C-terminal heptad repeat of HIV-1 gp41, as an anti-AIDS agent. Enfuvirtide and C34, another anti HIV-1 peptide, exert their inhibitory activity by binding to a leucine/isoleucine zipper-like sequence in gp41, thus inhibiting a conformational change of gp41 required for its activation. We suggest that peptides corresponding to the C-terminal heptad repeat of the S2 protein may serve as inhibitors for SARS-CoV entry.
Characterization of a digestive carboxypeptidase from the insect pest corn earworm (Helicoverpa armigera) with novel specificity towards C-terminal glutamate residues.

Science.gov (United States)

Bown, David P; Gatehouse, John A

2004-05-01

Carboxypeptidases were purified from guts of larvae of corn earworm (Helicoverpa armigera), a lepidopteran crop pest, by affinity chromatography on immobilized potato carboxypeptidase inhibitor, and characterized by N-terminal sequencing. A larval gut cDNA library was screened using probes based on these protein sequences. cDNA HaCA42 encoded a carboxypeptidase with sequence similarity to enzymes of clan MC [Barrett, A. J., Rawlings, N. D. & Woessner, J. F. (1998) Handbook of Proteolytic Enzymes. Academic Press, London.], but with a novel predicted specificity towards C-terminal acidic residues. This carboxypeptidase was expressed as a recombinant proprotein in the yeast Pichia pastoris. The expressed protein could be activated by treatment with bovine trypsin; degradation of bound pro-region, rather than cleavage of pro-region from mature protein, was the rate-limiting step in activation. Activated HaCA42 carboxypeptidase hydrolysed a synthetic substrate for glutamate carboxypeptidases (FAEE, C-terminal Glu), but did not hydrolyse substrates for carboxypeptidase A or B (FAPP or FAAK, C-terminal Phe or Lys) or methotrexate, cleaved by clan MH glutamate carboxypeptidases. The enzyme was highly specific for C-terminal glutamate in peptide substrates, with slow hydrolysis of C-terminal aspartate also observed. Glutamate carboxypeptidase activity was present in larval gut extract from H. armigera. The HaCA42 protein is the first glutamate-specific metallocarboxypeptidase from clan MC to be identified and characterized. The genome of Drosophila melanogaster contains genes encoding enzymes with similar sequences and predicted specificity, and a cDNA encoding a similar enzyme has been isolated from gut tissue in tsetse fly. We suggest that digestive carboxypeptidases with sequence similarity to the classical mammalian enzymes, but with specificity towards C-terminal glutamate, are widely distributed in insects.
Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

Science.gov (United States)

Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

2010-09-16

Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic
Using inter simple sequence repeat (ISSR) markers to study genetic ...

African Journals Online (AJOL)

enoh

2012-04-10

Apr 10, 2012 ... Genetic relationships among the cultivars was assessed by using six inter simple sequence ... polymorphism breeders of this species in order to find the ..... well as the high level of heterozygosity due to the cross- pollinating ...
Effects of Repeated Acute Stress in Obese and Non-Obese Rats

Science.gov (United States)

2008-04-02

level of corticosterone occurs approximately 30 minutes after the stressor terminates (Garcia, Marti, Valles, Dal-Zotto, & Armario , 2000). Some studies...Garcia, Marti, Valles, Oal-Zotto, & Armario , 2000; Schrijver et aI., 2002). This repeated, mild stressor provides a model of daily or frequent...Response in Rats. Physiology and Behavior, 63(4),693-697. Garcia, A., Marti, 0., Valles, A., Dal-Zotto, S., & Armario , A. (2000). Recovery of the
Long Terminal Repeat Circular DNA as Markers of Active Viral Replication of Human T Lymphotropic Virus-1 in Vivo

Directory of Open Access Journals (Sweden)

James M Fox

2016-03-01

Full Text Available Clonal expansion of human T-lymphotropic virus type-1 (HTLV-1 infected cells in vivo is well documented. Unlike human immunodeficiency virus type 1 (HIV-1, HTLV-1 plasma RNA is sparse. The contribution of the “mitotic” spread of HTLV-1 compared with infectious spread of the virus to HTLV-1 viral burden in established infection is uncertain. Since extrachromosomal long terminal repeat (LTR DNA circles are indicators of viral replication in HIV-1 carriers with undetectable plasma HIV RNA, we hypothesised that HTLV-1 LTR circles could indicate reverse transcriptase (RT usage and infectious activity. 1LTR and 2LTR DNA circles were measured in HTLV-1 cell lines and peripheral blood mononuclear cells (PBMC of asymptomatic carriers (ACs and patients with HTLV-1-associated myelopathy/tropical spastic paraparesis (HAM/TSP or adult T cell leukaemia/lymphoma (ATLL. 1LTR DNA circles were detected in 14/20 patients at a mean of 1.38/100 PBMC but did not differentiate disease status nor correlate with HTLV-1 DNA copies. 2LTR DNA circles were detected in 30/31 patients and at higher concentrations in patients with HTLV-1-associated diseases, independent of HTLV-1 DNA load. In an incident case the 2LTR DNA circle concentration increased 2.1 fold at the onset of HAM/TSP compared to baseline. Detectable and fluctuating levels of HTLV-1 DNA circles in patients indicate viral RT usage and virus replication. Our results indicate HTLV-1 viral replication capacity is maintained in chronic infection and may be associated with disease onset.
Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

International Nuclear Information System (INIS)

Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

1987-01-01

Apolipoprotein(a) [apo(a)] is a glycoprotein with M/sub r/ ∼ 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain
Regulation, initiation, and termination of the cenA and cex transcripts of Cellulomonas fimi

International Nuclear Information System (INIS)

Greenberg, N.M.; Warren, R.A.J.; Kilburn, D.G.; Miller, R.C. Jr.

1987-01-01

The authors characterized the in vivo transcripts of two Cellulomonas fimi genes, which encodes an extracellular endo-β-1,4-glucanase. By Northern blot analysis, cenA mRNA was detected in C. fimi RNA preparations from glycerol- and carboxymethyl cellulose-grown cells but not from glucose-grown cells. In contrast, cex mRNA was detected only in the preparations from carboxymethyl cellulose-grown cells. Therefore, the transcription of these genes is subject to regulation by the carbon source provided to C. fimi. By nuclease SI protection studies with unique 5'-labeled DNA probes and C. fimi RNA isolated in vivo, 5' termini were found 51 and 62 bases before the cenA translational initiation codon and 28 bases before the cex translational initiation codon. S1 mapping with unlabeled DNA probes and C. fimi RNA which had been isolated in vivo but which had been 5' labeled in vitro with guanylyltransferase and [α- 32 P]GTP confirmed that true transcription initiation sites for cenA and cex mRNA had been identified. Comparative analysis of the DNA sequences immediately upstream of the initiation sites of the cenA and cex mRNAs revealed a 30-base-pair region where these two sequences display at least 66% homology. S1 mapping was also used to locate the 3' termini of the cenA and cex transcripts. Three 3' termini were found for cenA messages, whereas only one 3' terminus was identified for cex mRNA. The transcripts of both genes terminate in regions where their corresponding DNA sequences contain inverted repeats
Alanine repeats influence protein localization in splicing speckles and paraspeckles.

Science.gov (United States)

Chang, Shuo-Hsiu; Chang, Wei-Lun; Lu, Chia-Chen; Tarn, Woan-Yuh

2014-12-16

Mammalian splicing regulatory protein RNA-binding motif protein 4 (RBM4) has an alanine repeat-containing C-terminal domain (CAD) that confers both nuclear- and splicing speckle-targeting activities. Alanine-repeat expansion has pathological potential. Here we show that the alanine-repeat tracts influence the subnuclear targeting properties of the RBM4 CAD in cultured human cells. Notably, truncation of the alanine tracts redistributed a portion of RBM4 to paraspeckles. The alanine-deficient CAD was sufficient for paraspeckle targeting. On the other hand, alanine-repeat expansion reduced the mobility of RBM4 and impaired its splicing activity. We further took advantage of the putative coactivator activator (CoAA)-RBM4 conjoined splicing factor, CoAZ, to investigate the function of the CAD in subnuclear targeting. Transiently expressed CoAZ formed discrete nuclear foci that emerged and subsequently separated-fully or partially-from paraspeckles. Alanine-repeat expansion appeared to prevent CoAZ separation from paraspeckles, resulting in their complete colocalization. CoAZ foci were dynamic but, unlike paraspeckles, were resistant to RNase treatment. Our results indicate that the alanine-rich CAD, in conjunction with its conjoined RNA-binding domain(s), differentially influences the subnuclear localization and biogenesis of RBM4 and CoAZ. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Characterization of Erwinia amylovora strains from different host plants using repetitive-sequences PCR analysis, and restriction fragment length polymorphism and short-sequence DNA repeats of plasmid pEA29.

Science.gov (United States)

Barionovi, D; Giorgi, S; Stoeger, A R; Ruppitsch, W; Scortichini, M

2006-05-01

The three main aims of the study were the assessment of the genetic relationship between a deviating Erwinia amylovora strain isolated from Amelanchier sp. (Maloideae) grown in Canada and other strains from Maloideae and Rosoideae, the investigation of the variability of the PstI fragment of the pEA29 plasmid using restriction fragment length polymorphism (RFLP) analysis and the determination of the number of short-sequence DNA repeats (SSR) by DNA sequence analysis in representative strains. Ninety-three strains obtained from 12 plant genera and different geographical locations were examined by repetitive-sequences PCR using Enterobacterial Repetitive Intergenic Consensus, BOX and Repetitive Extragenic Palindromic primer sets. Upon the unweighted pair group method with arithmetic mean analysis, a deviating strain from Amelanchier sp. was analysed using amplified ribosomal DNA restriction analysis (ARDRA) analysis and the sequencing of the 16S rDNA gene. This strain showed 99% similarity to other E. amylovora strains in the 16S gene and the same banding pattern with ARDRA. The RFLP analysis of pEA29 plasmid using MspI and Sau3A restriction enzymes showed a higher variability than that previously observed and no clear-cut grouping of the strains was possible. The number of SSR units reiterated two to 12 times. The strains obtained from pear orchards showing for the first time symptoms of fire blight had a low number of SSR units. The strains from Maloideae exhibit a wider genetic variability than previously thought. The RFLP analysis of a fragment of the pEA29 plasmid would not seem a reliable method for typing E. amylovora strains. A low number of SSR units was observed with first epidemics of fire blight. The current detection techniques are mainly based on the genetic similarities observed within the strains from the cultivated tree-fruit crops. For a more reliable detection of the fire blight pathogen also in wild and ornamentals Rosaceous plants the genetic
Local repeat sequence organization of an intergenic spacer in the ...

Indian Academy of Sciences (India)

Unknown

chloroplast genome of Chlamydomonas reinhardtii leads to DNA expansion and sequence ... The discovery of uniparentally inherited streptomycin resistant mutants ... resembles yeast, mitochondrial and phage recombination in that it is typically ...... Sager R and Lane D 1972 Molecular basis of maternal inheritance; Proc.
Chaotic generation of PN sequences : a VLSI implementation

NARCIS (Netherlands)

Dornbusch, A.; Pineda de Gyvez, J.

1999-01-01

Generation of repeatable pseudo-random sequences with chaotic analog electronics is not feasible using standard circuit topologies. Component variation caused by imperfect fabrication causes the same divergence of output sequences as does varying initial conditions. By quantizing the output of a
Deletion of the B-B' and C-C' regions of inverted terminal repeats reduces rAAV productivity but increases transgene expression.

Science.gov (United States)

Zhou, Qingzhang; Tian, Wenhong; Liu, Chunguo; Lian, Zhonghui; Dong, Xiaoyan; Wu, Xiaobing

2017-07-14

Inverted terminal repeats (ITRs) of the adeno-associated virus (AAV) are essential for rescue, replication, packaging, and integration of the viral genome. While ITR mutations have been identified in previous reports, we designed a new truncated ITR lacking the B-B' and C-C' regions named as ITRΔBC and investigated its effects on viral genome replication, packaging, and expression of recombinant AAV (rAAV). The packaging ability was compared between ITRΔBC rAAV and wild-type (wt) ITR rAAV. Our results showed the productivity of ITRΔBC rAAV was reduced 4-fold, which is consistent with the 8-fold decrease in the replication of viral genomic DNA of ITRΔBC rAAV compared with wt ITR rAAV. Surprisingly, transgene expression was significantly higher for ITRΔBC rAAV. A preliminary exploration of the underlying mechanisms was carried out by inhibiting and degrading the ataxia telangiectasia mutated (ATM) protein and the Mre11 complex (MRN), respectively, since the rAAV expression was inhibited by the ATM and/or MRN through cis interaction or binding with wt ITRs. We demonstrated that the inhibitory effects were weakened on ITRΔBC rAAV expression. This study suggests deletion in ITR can affect the transgene expression of AAV, which provides a new way to improve the AAV expression through ITRs modification.
Aberrant splicing in transgenes containing introns, exons, and V5 epitopes: lessons from developing an FSHD mouse model expressing a D4Z4 repeat with flanking genomic sequences.

Directory of Open Access Journals (Sweden)

Eugénie Ansseau

Full Text Available The DUX4 gene, encoded within D4Z4 repeats on human chromosome 4q35, has recently emerged as a key factor in the pathogenic mechanisms underlying Facioscapulohumeral muscular dystrophy (FSHD. This recognition prompted development of animal models expressing the DUX4 open reading frame (ORF alone or embedded within D4Z4 repeats. In the first published model, we used adeno-associated viral vectors (AAV and strong viral control elements (CMV promoter, SV40 poly A to demonstrate that the DUX4 cDNA caused dose-dependent toxicity in mouse muscles. As a follow-up, we designed a second generation of DUX4-expressing AAV vectors to more faithfully genocopy the FSHD-permissive D4Z4 repeat region located at 4q35. This new vector (called AAV.D4Z4.V5.pLAM contained the D4Z4/DUX4 promoter region, a V5 epitope-tagged DUX4 ORF, and the natural 3' untranslated region (pLAM harboring two small introns, DUX4 exons 2 and 3, and the non-canonical poly A signal required for stabilizing DUX4 mRNA in FSHD. AAV.D4Z4.V5.pLAM failed to recapitulate the robust pathology of our first generation vectors following delivery to mouse muscle. We found that the DUX4.V5 junction sequence created an unexpected splice donor in the pre-mRNA that was preferentially utilized to remove the V5 coding sequence and DUX4 stop codon, yielding non-functional DUX4 protein with 55 additional residues on its carboxyl-terminus. Importantly, we further found that aberrant splicing could occur in any expression construct containing a functional splice acceptor and sequences resembling minimal splice donors. Our findings represent an interesting case study with respect to AAV.D4Z4.V5.pLAM, but more broadly serve as a note of caution for designing constructs containing V5 epitope tags and/or transgenes with downstream introns and exons.
A novel rat genomic simple repeat DNA with RNA-homology shows triplex (H-DNA)-like structure and tissue-specific RNA expression

International Nuclear Information System (INIS)

Dey, Indranil; Rath, Pramod C.

2005-01-01

Mammalian genome contains a wide variety of repetitive DNA sequences of relatively unknown function. We report a novel 227 bp simple repeat DNA (3.3 DNA) with a d {(GA) 7 A (AG) 7 } dinucleotide mirror repeat from the rat (Rattus norvegicus) genome. 3.3 DNA showed 75-85% homology with several eukaryotic mRNAs due to (GA/CU) n dinucleotide repeats by nBlast search and a dispersed distribution in the rat genome by Southern blot hybridization with [ 32 P]3.3 DNA. The d {(GA) 7 A (AG) 7 } mirror repeat formed a triplex (H-DNA)-like structure in vitro. Two large RNAs of 9.1 and 7.5 kb were detected by [ 32 P]3.3 DNA in rat brain by Northern blot hybridization indicating expression of such simple sequence repeats at RNA level in vivo. Further, several cDNAs were isolated from a rat cDNA library by [ 32 P]3.3 DNA probe. Three such cDNAs showed tissue-specific RNA expression in rat. pRT 4.1 cDNA showed strong expression of a 2.39 kb RNA in brain and spleen, pRT 5.5 cDNA showed strong expression of a 2.8 kb RNA in brain and a 3.9 kb RNA in lungs, and pRT 11.4 cDNA showed weak expression of a 2.4 kb RNA in lungs. Thus, genomic simple sequence repeats containing d (GA/CT) n dinucleotides are transcriptionally expressed and regulated in rat tissues. Such d (GA/CT) n dinucleotide repeats may form structural elements (e.g., triplex) which may be sites for functional regulation of genomic coding sequences as well as RNAs. This may be a general function of such transcriptionally active simple sequence repeats widely dispersed in mammalian genome
Organelle Simple Sequence Repeat Markers Help to Distinguish Carpelloid Stamen and Normal Cytoplasmic Male Sterile Sources in Broccoli

Science.gov (United States)

Shu, Jinshuai; Liu, Yumei; Li, Zhansheng; Zhang, Lili; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Zhang, Yangyong; Lv, Honghao

2015-01-01

We previously discovered carpelloid stamens when breeding cytoplasmic male sterile lines in broccoli (Brassica oleracea var. italica). In this study, hybrids and multiple backcrosses were produced from different cytoplasmic male sterile carpelloid stamen sources and maintainer lines. Carpelloid stamens caused dysplasia of the flower structure and led to hooked or coiled siliques with poor seed setting, which were inherited in a maternal fashion. Using four distinct carpelloid stamens and twelve distinct normal stamens from cytoplasmic male sterile sources and one maintainer, we used 21 mitochondrial simple sequence repeat (mtSSR) primers and 32 chloroplast SSR primers to identify a mitochondrial marker, mtSSR2, that can differentiate between the cytoplasm of carpelloid and normal stamens. Thereafter, mtSSR2 was used to identify another 34 broccoli accessions, with an accuracy rate of 100%. Analysis of the polymorphic sequences revealed that the mtSSR2 open reading frame of carpelloid stamen sterile sources had a deletion of 51 bases (encoding 18 amino acids) compared with normal stamen materials. The open reading frame is located in the coding region of orf125 and orf108 of the mitochondrial genomes in Brassica crops and had the highest similarity with Raphanus sativus and Brassica carinata. The current study has not only identified a useful molecular marker to detect the cytoplasm of carpelloid stamens during broccoli breeding, but it also provides evidence that the mitochondrial genome is maternally inherited and provides a basis for studying the effect of the cytoplasm on flower organ development in plants. PMID:26407159
Non-radioactive detection of trinucleotide repeat size variability.

Science.gov (United States)

Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve

2014-03-06

Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through sucessive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.
The Asian Rice Gall Midge (Orseolia oryzae Mitogenome Has Evolved Novel Gene Boundaries and Tandem Repeats That Distinguish Its Biotypes.

Directory of Open Access Journals (Sweden)

Isha Atray

Full Text Available The complete mitochondrial genome of the Asian rice gall midge, Orseolia oryzae (Diptera; Cecidomyiidae was sequenced, annotated and analysed in the present study. The circular genome is 15,286 bp with 13 protein-coding genes, 22 tRNAs and 2 ribosomal RNA genes, and a 578 bp non-coding control region. All protein coding genes used conventional start codons and terminated with a complete stop codon. The genome presented many unusual features: (1 rearrangement in the order of tRNAs as well as protein coding genes; (2 truncation and unusual secondary structures of tRNAs; (3 presence of two different repeat elements in separate non-coding regions; (4 presence of one pseudo-tRNA gene; (5 inversion of the rRNA genes; (6 higher percentage of non-coding regions when compared with other insect mitogenomes. Rearrangements of the tRNAs and protein coding genes are explained on the basis of tandem duplication and random loss model and why intramitochondrial recombination is a better model for explaining rearrangements in the O. oryzae mitochondrial genome is discussed. Furthermore, we evaluated the number of iterations of the tandem repeat elements found in the mitogenome. This led to the identification of genetic markers capable of differentiating rice gall midge biotypes and the two Orseolia species investigated.
Analysis of genetic relationships and identification of lily cultivars based on inter-simple sequence repeat markers.

Science.gov (United States)

Cui, G F; Wu, L F; Wang, X N; Jia, W J; Duan, Q; Ma, L L; Jiang, Y L; Wang, J H

2014-07-29

Inter-simple sequence repeat (ISSR) markers were used to discriminate 62 lily cultivars of 5 hybrid series. Eight ISSR primers generated 104 bands in total, which all showed 100% polymorphism, and an average of 13 bands were amplified by each primer. Two software packages, POPGENE 1.32 and NTSYSpc 2.1, were used to analyze the data matrix. Our results showed that the observed number of alleles (NA), effective number of alleles (NE), Nei's genetic diversity (H), and Shannon's information index (I) were 1.9630, 1.4179, 0.2606, and 0.4080, respectively. The highest genetic similarity (0.9601) was observed between the Oriental x Trumpet and Oriental lilies, which indicated that the two hybrids had a close genetic relationship. An unweighted pair-group method with arithmetic means dendrogram showed that the 62 lily cultivars clustered into two discrete groups. The first group included the Oriental and OT cultivars, while the Asiatic, LA, and Longiflorum lilies were placed in the second cluster. The distribution of individuals in the principal component analysis was consistent with the clustering of the dendrogram. Fingerprints of all lily cultivars built from 8 primers could be separated completely. This study confirmed the effect and efficiency of ISSR identification in lily cultivars.
Structure of the N-terminal region of Haemophilus Influenzae HI0017: Implications for function

International Nuclear Information System (INIS)

Yu Liping; Mack, Jamey; Hajduk, Phil; Fesik, Stephen W.

2001-01-01

Haemophilus influenzae is a gram-negative pathogen that causes infections ranging from asymptomatic colonization of the human upper respiratory tract to serious invasive diseases such as meningitis. Although the genome of Haemophilus influenzae has been completely sequenced, the structure and function of many of these proteins are unknown. HI0017 is one of these uncharacterized proteins. Here we describe the three-dimensional solution structure of the N-terminal portion of HI0017 as determined by NMR spectroscopy. The structure consists of a five-stranded antiparallel β-sheet and two short α-helices. It is similar to the C-terminal domain of Diphtheria toxin repressor (DtxR). The C-terminal portion of HI0017 has an amino acid sequence that closely resembles pyruvate formate-lyase - an enzyme that converts pyruvate and CoA into acetyl-CoA and formate by a radical mechanism. Based on structural and sequence comparisons, we propose that the C-terminus of HI0017 functions as an enzyme with a glycyl radical mechanism, while the N-terminus participates in protein/protein interactions involving an activase (iron-sulfur protein) and/or the substrate

Structure of the N-terminal region of Haemophilus Influenzae HI0017: Implications for function

Energy Technology Data Exchange (ETDEWEB)

Yu Liping; Mack, Jamey; Hajduk, Phil; Fesik, Stephen W. [Abbott Laboratories, Pharmaceutical Discovery Division, D46Y, AP10/LL (United States)

2001-06-15

Haemophilus influenzae is a gram-negative pathogen that causes infections ranging from asymptomatic colonization of the human upper respiratory tract to serious invasive diseases such as meningitis. Although the genome of Haemophilus influenzae has been completely sequenced, the structure and function of many of these proteins are unknown. HI0017 is one of these uncharacterized proteins. Here we describe the three-dimensional solution structure of the N-terminal portion of HI0017 as determined by NMR spectroscopy. The structure consists of a five-stranded antiparallel {beta}-sheet and two short {alpha}-helices. It is similar to the C-terminal domain of Diphtheria toxin repressor (DtxR). The C-terminal portion of HI0017 has an amino acid sequence that closely resembles pyruvate formate-lyase - an enzyme that converts pyruvate and CoA into acetyl-CoA and formate by a radical mechanism. Based on structural and sequence comparisons, we propose that the C-terminus of HI0017 functions as an enzyme with a glycyl radical mechanism, while the N-terminus participates in protein/protein interactions involving an activase (iron-sulfur protein) and/or the substrate.
Correlation between fibroin amino acid sequence and physical silk properties.

Science.gov (United States)

Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

2003-09-12

The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet.
Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs).

Science.gov (United States)

Liu, Fenyun; Kariyawasam, Subhashinie; Jayarao, Bhushan M; Barrangou, Rodolphe; Gerner-Smidt, Peter; Ribot, Efrain M; Knabel, Stephen J; Dudley, Edward G

2011-07-01

Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)-CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)-was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.
Instability of (CTGn•(CAGn trinucleotide repeats and DNA synthesis

Directory of Open Access Journals (Sweden)

Liu Guoqi

2012-02-01

Full Text Available Abstract Expansion of (CTGn•(CAGn trinucleotide repeat (TNR microsatellite sequences is the cause of more than a dozen human neurodegenerative diseases. (CTGn and (CAGn repeats form imperfectly base paired hairpins that tend to expand in vivo in a length-dependent manner. Yeast, mouse and human models confirm that (CTGn•(CAGn instability increases with repeat number, and implicate both DNA replication and DNA damage response mechanisms in (CTGn•(CAGn TNR expansion and contraction. Mutation and knockdown models that abrogate the expression of individual genes might also mask more subtle, cumulative effects of multiple additional pathways on (CTGn•(CAGn instability in whole animals. The identification of second site genetic modifiers may help to explain the variability of (CTGn•(CAGn TNR instability patterns between tissues and individuals, and offer opportunities for prognosis and treatment.
Production of carboxy-terminal specific antiserum against glucagon

Energy Technology Data Exchange (ETDEWEB)

Yibing, Liu; Shiquan, Han [Academia Sinica, Beijing, BJ (China). Inst. of Atomic Energy

1993-02-01

To produce carboxy-terminal specific antisera against glucagon was coupled mainly via its amino terminal histidine to thyroglobulin, using the amino group reactive pentandiol at pH 7.0 for the conjugation procedure. After repeated immunization of guinea pigs and rabbits, the antisera were obtained. The titer of guinea pig antiserum against glucagon was 1:3000-1:35000 and affinity constant was 9.3 x 10[sup 10]-11.4 x 10[sup 10] l [center dot] mol[sup -1]. There were no cross reaction with GIP, INS, Copeptide and gastrin. The titer of rabbit antiserum against glucagon was 1:900-1:9000 and affinity constant was 0.36 x 10[sup 10]-3.9 x 10[sup 10] l [center dot] mol[sup -1]. There were no cross reaction with INS, C-peptide and gastrin. The cross reaction with GIP was 0.02%.
The EspF N-Terminal of Enterohemorrhagic Escherichia coli O157:H7 EDL933w Imparts Stronger Toxicity Effects on HT-29 Cells than the C-Terminal

Directory of Open Access Journals (Sweden)

Xiangyu Wang

2017-09-01

Full Text Available Enterohemorrhagic Escherichia coli (EHEC O157:H7 EspF is an important multifunctional protein that destroys the tight junctions of intestinal epithelial cells and promotes host cell apoptosis. However, its molecular mechanism remains elusive. We knocked out the espF sequence (747 bp, ΔespF, N-terminal sequence (219 bp, ΔespFN, and C-terminal sequence (528 bp, ΔespFC separately using the pKD46-mediated λ Red homologous recombination system. Then, we built the corresponding complementation strains, namely, ΔespF/pespF, ΔespFN/pespFN, and ΔespFC/pespFC by overlap PCR, which were used in infecting HT-29 cells and BALB/C mice. The level of reactive oxygen species, cell apoptosis, mitochondrial trans-membrane potential, inflammatory factors, transepithelial electrical resistance (TER, and animal mortality were evaluated by DCFH-DA, double staining of Annexin V-FITC/PI, JC-1 staining, ELISA kit, and a mouse assay. The wild-type (WT, ΔespF, ΔespF/pespF, ΔespFC, ΔespFC/pespFC, ΔespFN, and ΔespFN/pespFN groups exhibited apoptotic rates of 68.3, 27.9, 64.9, 65.7, 73.4, 41.3, and 35.3% respectively, and mean TNF-α expression levels of 428 pg/mL, 342, 466, 446, 381, 383, and 374 pg/mL, respectively. In addition, the apoptotic rates and TNF-α levels of the WT, ΔespF/pespF, and ΔespFC were significantly higher than that of ΔespF, ΔespFN, ΔespFC/pespFC, and ΔespFN/pespFN group (p < 0.05. The N-terminal of EspF resulted in an increase in the number of apoptotic cells, TNF-α secretion, ROS generation, mitochondria apoptosis, and pathogenicity in BalB/c mice. In conclusion, the N-terminal domain of the Enterohemorrhagic E. coli O157:H7 EspF more strongly promotes apoptosis and inflammation than the C-terminal domain.
Amino acid sequences and structures of chicken and turkey beta 2-microglobulin

DEFF Research Database (Denmark)

Welinder, K G; Jespersen, H M; Walther-Rasmussen, J

1991-01-01

The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11...
A Sequence-Specific Interaction between the Saccharomyces cerevisiae rRNA Gene Repeats and a Locus Encoding an RNA Polymerase I Subunit Affects Ribosomal DNA Stability

Science.gov (United States)

Cahyani, Inswasti; Cridge, Andrew G.; Engelke, David R.; Ganley, Austen R. D.

2014-01-01

The spatial organization of eukaryotic genomes is linked to their functions. However, how individual features of the global spatial structure contribute to nuclear function remains largely unknown. We previously identified a high-frequency interchromosomal interaction within the Saccharomyces cerevisiae genome that occurs between the intergenic spacer of the ribosomal DNA (rDNA) repeats and the intergenic sequence between the locus encoding the second largest RNA polymerase I subunit and a lysine tRNA gene [i.e., RPA135-tK(CUU)P]. Here, we used quantitative chromosome conformation capture in combination with replacement mapping to identify a 75-bp sequence within the RPA135-tK(CUU)P intergenic region that is involved in the interaction. We demonstrate that the RPA135-IGS1 interaction is dependent on the rDNA copy number and the Msn2 protein. Surprisingly, we found that the interaction does not govern RPA135 transcription. Instead, replacement of a 605-bp region within the RPA135-tK(CUU)P intergenic region results in a reduction in the RPA135-IGS1 interaction level and fluctuations in rDNA copy number. We conclude that the chromosomal interaction that occurs between the RPA135-tK(CUU)P and rDNA IGS1 loci stabilizes rDNA repeat number and contributes to the maintenance of nucleolar stability. Our results provide evidence that the DNA loci involved in chromosomal interactions are composite elements, sections of which function in stabilizing the interaction or mediating a functional outcome. PMID:25421713
Inter-simple sequence repeat (ISSR) markers in the evaluation of ...

African Journals Online (AJOL)

shawkat

2013-02-13

Feb 13, 2013 ... 666 Afr. J. Biotechnol. Table 1. Number and types of the ISSR bands as well as the total polymorphism percentages generated in six Capsicum hybrids. Primer code. Sequence. Monomorphic band. Polymorphic band. Total band. Polymorphism. (%). Unique. Shared. HB 1. (CAA)5. 4. 0. 1. 5. 20. HB 2. (CAG) ...
Dispersed repetitive sequences in eukaryotic genomes and their possible biological significance

International Nuclear Information System (INIS)

Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.

1983-01-01

In this paper is described the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized an ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B + RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transpositions or do they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table
Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes

NARCIS (Netherlands)

Al-Attar, S.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

2011-01-01

Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences
Amino acid sequence analysis of the annexin super-gene family of proteins.

Science.gov (United States)

Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

1991-06-15

The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of
Terminator Operon Reporter: combining a transcription termination switch with reporter technology for improved gene synthesis and synthetic biology applications.

Science.gov (United States)

Zampini, Massimiliano; Mur, Luis A J; Rees Stevens, Pauline; Pachebat, Justin A; Newbold, C James; Hayes, Finbarr; Kingston-Smith, Alison

2016-05-25

Synthetic biology is characterized by the development of novel and powerful DNA fabrication methods and by the application of engineering principles to biology. The current study describes Terminator Operon Reporter (TOR), a new gene assembly technology based on the conditional activation of a reporter gene in response to sequence errors occurring at the assembly stage of the synthetic element. These errors are monitored by a transcription terminator that is placed between the synthetic gene and reporter gene. Switching of this terminator between active and inactive states dictates the transcription status of the downstream reporter gene to provide a rapid and facile readout of the accuracy of synthetic assembly. Designed specifically and uniquely for the synthesis of protein coding genes in bacteria, TOR allows the rapid and cost-effective fabrication of synthetic constructs by employing oligonucleotides at the most basic purification level (desalted) and without the need for costly and time-consuming post-synthesis correction methods. Thus, TOR streamlines gene assembly approaches, which are central to the future development of synthetic biology.
Prenatal diagnosis of hypoplastic left heart syndrome: impact of counseling patterns on parental perceptions and decisions regarding termination of pregnancy.

Science.gov (United States)

Hilton-Kamm, Debra; Chang, Ruey-Kang; Sklansky, Mark

2012-12-01

An online survey for parents of children with congenital heart disease (CHD) was developed to study parents' experiences at the time of diagnosis. The survey was distributed to online support groups. A total of 841 responses from parents of children with CHD were received during a 4-week period. The current study examined those respondents (211 [25 %]) who reported their child's diagnosis as hypoplastic left heart syndrome (HLHS). Among these, 138 (65 %) reported receiving the diagnosis prenatally. 32 % of those receiving a prenatal diagnosis reported that after they declined to terminate the pregnancy, termination was mentioned again by their physicians. Parents who had termination mentioned again after their initial decline reported significantly lower optimism regarding their child's life expectancy than those who did not have it mentioned again (66 vs. 94 %, p survival" (34 vs. 13 %, p = 0.01); and were more likely to change pediatric cardiologists (PCs) (43 vs. 12 %, p parents, when termination of pregnancy was mentioned after the parents declined it, or if the parents felt pressure to terminate, the parents perceived a lower chance of survival, felt less optimistic about their child's life expectancy, and were more likely to choose another PC for long-term follow-up care. Our study could not determine whether repeated discussions of the possibility for termination of pregnancy independently impacts parental optimism regarding prognosis or whether those who counsel with repeated discussions of termination tend to have more guarded notions of the prognosis of children with HLHS. Further study is warranted to identify the implications of counseling patterns on parental perceptions and decisions regarding termination of pregnancy.
Bromine isotopic signature facilitates de novo sequencing of peptides in free-radical-initiated peptide sequencing (FRIPS) mass spectrometry.

Science.gov (United States)

Nam, Jungjoo; Kwon, Hyuksu; Jang, Inae; Jeon, Aeran; Moon, Jingyu; Lee, Sun Young; Kang, Dukjin; Han, Sang Yun; Moon, Bongjin; Oh, Han Bin

2015-02-01

We recently showed that free-radical-initiated peptide sequencing mass spectrometry (FRIPS MS) assisted by the remarkable thermochemical stability of (2,2,6,6-tetramethyl-piperidin-1-yl)oxyl (TEMPO) is another attractive radical-driven peptide fragmentation MS tool. Facile homolytic cleavage of the bond between the benzylic carbon and the oxygen of the TEMPO moiety in o-TEMPO-Bz-C(O)-peptide and the high reactivity of the benzylic radical species generated in •Bz-C(O)-peptide are key elements leading to extensive radical-driven peptide backbone fragmentation. In the present study, we demonstrate that the incorporation of bromine into the benzene ring, i.e. o-TEMPO-Bz(Br)-C(O)-peptide, allows unambiguous distinction of the N-terminal peptide fragments from the C-terminal fragments through the unique bromine doublet isotopic signature. Furthermore, bromine substitution does not alter the overall radical-driven peptide backbone dissociation pathways of o-TEMPO-Bz-C(O)-peptide. From a practical perspective, the presence of the bromine isotopic signature in the N-terminal peptide fragments in TEMPO-assisted FRIPS MS represents a useful and cost-effective opportunity for de novo peptide sequencing. Copyright © 2015 John Wiley & Sons, Ltd.
The mechanism of vault opening from the high resolution structure of the N-terminal repeats of MVP.

Science.gov (United States)

Querol-Audí, Jordi; Casañas, Arnau; Usón, Isabel; Luque, Daniel; Castón, José R; Fita, Ignasi; Verdaguer, Nuria

2009-11-04

Vaults are ubiquitous ribonucleoprotein complexes involved in a diversity of cellular processes, including multidrug resistance, transport mechanisms and signal transmission. The vault particle shows a barrel-shaped structure organized in two identical moieties, each consisting of 39 copies of the major vault protein MVP. Earlier data indicated that vault halves can dissociate at acidic pH. The crystal structure of the vault particle solved at 8 A resolution, together with the 2.1-A structure of the seven N-terminal domains (R1-R7) of MVP, reveal the interactions governing vault association and provide an explanation for a reversible dissociation induced by low pH. The structural comparison with the recently published 3.5 A model shows major discrepancies, both in the main chain tracing and in the side chain assignment of the two terminal domains R1 and R2.
Estimation of genetic structure of a Mycosphaerella musicola population using inter-simple sequence repeat markers.

Science.gov (United States)

Peixouto, Y S; Dórea Bragança, C A; Andrade, W B; Ferreira, C F; Haddad, F; Oliveira, S A S; Darosci Brito, F S; Miller, R N G; Amorim, E P

2015-07-17

Among the diseases affecting banana (Musa sp), yellow Sigatoka, caused by the fungal pathogen Mycosphaerella musicola Leach, is considered one of the most important in Brazil, causing losses throughout the year. Understanding the genetic structure of pathogen populations will provide insight into the life history of pathogens, including the evolutionary processes occurring in agrosystems. Tools for estimating the possible emergence of pathogen variants with altered pathogenicity, virulence, or aggressiveness, as well as resistance to systemic fungicides, can also be developed from such data. The objective of this study was to analyze the genetic diversity and population genetics of M. musicola in the main banana-producing regions in Brazil. A total of 83 isolates collected from different banana cultivars in the Brazilian states of Bahia, Rio Grande do Norte, and Minas Gerais were evaluated using inter-simple sequence repeat markers. High variability was detected between the isolates, and 85.5% of the haplotypes were singletons in the populations. The highest source of genetic diversity (97.22%) was attributed to variations within populations. Bayesian cluster analysis revealed the presence of 2 probable ancestral groups, however, showed no relationship to population structure in terms of collection site, state of origin, or cultivar. Similarly, we detected noevidence of genetic recombination between individuals within different states, indicating that asexual cycles play a major role in M. musicola reproduction and that long-distance dispersal of the pathogen is the main factor contributing to the lack of population structure in the fungus.
Inferring repeat-protein energetics from evolutionary information.

Directory of Open Access Journals (Sweden)

Rocío Espada

2017-06-01

Full Text Available Natural protein sequences contain a record of their history. A common constraint in a given protein family is the ability to fold to specific structures, and it has been shown possible to infer the main native ensemble by analyzing covariations in extant sequences. Still, many natural proteins that fold into the same structural topology show different stabilization energies, and these are often related to their physiological behavior. We propose a description for the energetic variation given by sequence modifications in repeat proteins, systems for which the overall problem is simplified by their inherent symmetry. We explicitly account for single amino acid and pair-wise interactions and treat higher order correlations with a single term. We show that the resulting evolutionary field can be interpreted with structural detail. We trace the variations in the energetic scores of natural proteins and relate them to their experimental characterization. The resulting energetic evolutionary field allows the prediction of the folding free energy change for several mutants, and can be used to generate synthetic sequences that are statistically indistinguishable from the natural counterparts.
Long-term repeatability of the skin prick test is high when supported by history or allergen-sensitivity tests

DEFF Research Database (Denmark)

Bødtger, Uffe; Jacobsen, C R; Poulsen, L K

2003-01-01

subjects. An SPT was positive when > or =3 mm, and repeatable if either persistently positive or negative. Clinical sensitivity to birch pollen was used as model for inhalation allergy, and was investigated at inclusion and at study termination by challenge tests, intradermal test, titrated SPT and Ig......E measurements. Birch pollen symptoms were confirmed in diaries. RESULTS: The repeatability of a positive SPT was 67%, increasing significantly to 100% when supported by the history. When not supported by history, the presence of specific IgE was significantly associated with a repeatable SPT. Allergen....... CONCLUSION: SPT changes are clinically relevant. Further studies using other allergens are needed. Long-term repeatability of SPT is high in the presence of a supportive history....
Designing a Long Acting Erythropoietin by Fusing Three Carboxyl-Terminal Peptides of Human Chorionic Gonadotropin β Subunit to the N-Terminal and C-Terminal Coding Sequence

Directory of Open Access Journals (Sweden)

Fuad Fares

2011-01-01

Full Text Available A new analog of EPO was designed by fusing one and two CTPs to the N-terminal and C-terminal ends of EPO (EPO-(CTP3, respectively. This analog was expressed and secreted efficiently in CHO cells. The in vitro test shows that the activity of EPO-(CTP3 in TFI-1 cell proliferation assay is similar to that of EPO-WT and commercial rHEPO. However, in vivo studies indicated that treatment once a week with EPO-(CTP3 (15 μg/kg dramatically increased (~8 folds haematocrit as it was compared to rHuEPO. Moreover, it was found that EPO-(CTP3 is more effective than rHuEPO and Aranesp in increasing reticulocyte number in mice blood. The detected circulatory half-lives of rHuEPO, Aranesp, and EPO-(CTP3 following IV injection of 20 IU were 4.4, 10.8, and 13.1 h, respectively. These data established the rational for using this chimera as a long-acting EPO analog in clinics. The therapeutic efficacy of EPO-CTP analog needs to be established in higher animals and in human clinical trials.

The scorpion toxin Bot IX is a potent member of the α-like family and has a unique N-terminal sequence extension.

Science.gov (United States)

Martin-Eauclaire, Marie-France; Salvatierra, Juan; Bosmans, Frank; Bougis, Pierre E

2016-09-01

We report the detailed chemical, immunological and pharmacological characterization of the α-toxin Bot IX from the Moroccan scorpion Buthus occitanus tunetanus venom. Bot IX, which consists of 70 amino acids, is a highly atypical toxin. It carries a unique N-terminal sequence extension and is highly lethal in mice. Voltage clamp recordings on oocytes expressing rat Nav1.2 or insect BgNav1 reveal that, similar to other α-like toxins, Bot IX inhibits fast inactivation of both variants. Moreover, Bot IX belongs to the same structural/immunological group as the α-like toxin Bot I. Remarkably, radioiodinated Bot IX competes efficiently with the classical α-toxin AaH II from Androctonus australis, and displays one of the highest affinities for Nav channels. © 2016 Federation of European Biochemical Societies.
Formation of large viroplasms and virulence of Cauliflower mosaic virus in turnip plants depend on the N-terminal EKI sequence of viral protein TAV.

Directory of Open Access Journals (Sweden)

Angèle Geldreich

Full Text Available Cauliflower mosaic virus (CaMV TAV protein (TransActivator/Viroplasmin plays a pivotal role during the infection cycle since it activates translation reinitiation of viral polycistronic RNAs and suppresses RNA silencing. It is also the major component of cytoplasmic electron-dense inclusion bodies (EDIBs called viroplasms that are particularly evident in cells infected by the virulent CaMV Cabb B-JI isolate. These EDIBs are considered as virion factories, vehicles for CaMV intracellular movement and reservoirs for CaMV transmission by aphids. In this study, focused on different TAV mutants in vivo, we demonstrate that three physically separated domains collectively participate to the formation of large EDIBs: the N-terminal EKI motif, a sequence of the MAV domain involved in translation reinitiation and a C-terminal region encompassing the zinc finger. Surprisingly, EKI mutant TAVm3, corresponding to a substitution of the EKI motif at amino acids 11-13 by three alanines (AAA, which completely abolished the formation of large viroplasms, was not lethal for CaMV but highly reduced its virulence without affecting the rate of systemic infection. Expression of TAVm3 in a viral context led to formation of small irregularly shaped inclusion bodies, mild symptoms and low levels of viral DNA and particles accumulation, despite the production of significant amounts of mature capsid proteins. Unexpectedly, for CaMV-TAVm3 the formation of viral P2-containing electron-light inclusion body (ELIB, which is essential for CaMV aphid transmission, was also altered, thus suggesting an indirect role of the EKI tripeptide in CaMV plant-to-plant propagation. This important functional contribution of the EKI motif in CaMV biology can explain the strict conservation of this motif in the TAV sequences of all CaMV isolates.
Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

Energy Technology Data Exchange (ETDEWEB)

Jackson, P.J.; Walthers, E.A.; Richmond, K.L. [Los Alamos National Lab., NM (United States)] [and others

1997-04-01

PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five Polymorphisms differed by the presence Of two to six copies of the 12-bp tandem repeat 5{prime}-CAATATCAACAA-3{prime}. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats are generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.
Tax relieves transcriptional repression by promoting histone deacetylase 1 release from the human T-cell leukemia virus type 1 long terminal repeat.

Science.gov (United States)

Lu, Hanxin; Pise-Masison, Cynthia A; Linton, Rebecca; Park, Hyeon Ung; Schiltz, R Louis; Sartorelli, Vittorio; Brady, John N

2004-07-01

Expression of human T-cell leukemia virus type 1 (HTLV-1) is regulated by the viral transcriptional activator Tax. Tax activates viral transcription through interaction with the cellular transcription factor CREB and the coactivators CBP/p300. In this study, we have analyzed the role of histone deacetylase 1 (HDAC1) on HTLV-1 gene expression from an integrated template. First we show that trichostatin A, an HDAC inhibitor, enhances Tax expression in HTLV-1-transformed cells. Second, using a cell line containing a single-copy HTLV-1 long terminal repeat, we demonstrate that overexpression of HDAC1 represses Tax transactivation. Furthermore, a chromatin immunoprecipitation assay allowed us to analyze the interaction of transcription factors, coactivators, and HDACs with the basal and activated HTLV-1 promoter. We demonstrate that HDAC1 is associated with the inactive, but not the Tax-transactivated, HTLV-1 promoter. In vitro and in vivo glutathione S-transferase-Tax pull-down and coimmunoprecipitation experiments demonstrated that there is a direct physical association between Tax and HDAC1. Importantly, biotinylated chromatin pull-down assays demonstrated that Tax inhibits and/or dissociates the binding of HDAC1 to the HTLV-1 promoter. Our results provide evidence that Tax interacts directly with HDAC1 and regulates binding of the repressor to the HTLV-1 promoter.
Structural basis for sequence-specific recognition of DNA by TAL effectors

KAUST Repository

Deng, Dong; Yan, Chuangye; Pan, Xiaojing; Mahfouz, Magdy M.; Wang, Jiawei; Zhu, Jiankang; Shi, Yi Gong; Yan, Nieng

2012-01-01

TAL (transcription activator-like) effectors, secreted by phytopathogenic bacteria, recognize host DNA sequences through a central domain of tandem repeats. Each repeat comprises 33 to 35 conserved amino acids and targets a specific base pair
Locus-specific detection of HLA-DQ and -DR antigens by antibodies against synthetic N-terminal octapeptides of the beta chain

DEFF Research Database (Denmark)

Deufel, T; Grove, A; Kofod, Hans

1985-01-01

Antibodies against synthetic peptides representing the class-II antigen HLA-DR and -DQ beta chain N-terminal sequences were prepared in rabbits. The two octapeptides only share two amino acids and enzyme-linked immuno-assays showed the antisera only to bind to its own antigen. Both peptide antisera...... chains of HLA-DR and -DQ have been prepared by the preparation by the production of antibodies against the N-terminal sequences of each polypeptide....
Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis.

Science.gov (United States)

Zhu, Huayu; Song, Pengyao; Koo, Dal-Hoe; Guo, Luqin; Li, Yanman; Sun, Shouru; Weng, Yiqun; Yang, Luming

2016-08-05

Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been difficult and costly. The whole genome sequencing with next-generation sequencing (NGS) technologies provides large amounts of sequence data to develop numerous microsatellite markers at whole genome scale. SSR markers have great advantage in cross-species comparisons and allow investigation of karyotype and genome evolution through highly efficient computation approaches such as in silico PCR. Here we described genome wide development and characterization of SSR markers in the watermelon (Citrullus lanatus) genome, which were then use in comparative analysis with two other important crop species in the Cucurbitaceae family: cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). We further applied these markers in evaluating the genetic diversity and population structure in watermelon germplasm collections. A total of 39,523 microsatellite loci were identified from the watermelon draft genome with an overall density of 111 SSRs/Mbp, and 32,869 SSR primers were designed with suitable flanking sequences. The dinucleotide SSRs were the most common type representing 34.09 % of the total SSR loci and the AT-rich motifs were the most abundant in all nucleotide repeat types. In silico PCR analysis identified 832 and 925 SSR markers with each having a single amplicon in the cucumber and melon draft genome, respectively. Comparative analysis with these cross-species SSR markers revealed complicated mosaic patterns of syntenic blocks among the genomes of three species. In addition, genetic diversity analysis of 134 watermelon accessions with 32 highly informative SSR loci placed these lines into two groups with all accessions of C.lanatus var. citorides and three accessions of C. colocynthis clustered in one group and all accessions of C. lanatus var. lanatus and the remaining accessions of C. colocynthis
Organization and Evolution of Subtelomeric Satellite Repeats in the Potato Genome

Czech Academy of Sciences Publication Activity Database

Torres, A.T.; Gong, Z.; Iovene, M.; Hirsch, C.D.; Buell, C.R.; Bryan, G.J.; Novák, Petr; Macas, Jiří; Jiang, J.

2011-01-01

Roč. 1, July 2011 (2011), s. 85-92 ISSN 2160-1836 R&D Projects: GA MŠk(CZ) LH11058 Institutional research plan: CEZ:AV0Z50510513 Keywords : Satellite sequences * Potato genome * Repeats Subject RIV: EB - Genetics ; Molecular Biology
Magnetic properties of smooth terminating dipole bands in 110,112Te

International Nuclear Information System (INIS)

Evans, A.O.; Paul, E.S.; Boston, A.J.; Chantler, H.J.; Chiara, C.J.; Devlin, M.; Fletcher, A.M.; Fossan, D.B.; LaFosse, D.R.; Lane, G.J.; Lee, I.Y.; Macchiavelli, A.O.; Nolan, P.J.; Sarantites, D.G.; Sears, J.M.; Semple, A.T.; Smith, J.F.; Starosta, K.; Vaman, C.; Afanasjev, A.V.; Ragnarsson, I.

2006-01-01

Three strongly coupled sequences have been established in 110,112 Te up to high spins. They are interpreted in terms of deformed structures built on proton 1-particle-1-hole excitations that reach termination at I∼40-bar . This is the first observation of smooth terminating dipole structures in this mass region. Lifetime measurements have allowed the extraction of experimental B(M1;I->I-1) and B(E2;I->I-2) reduced transition rates for one of the dipole bands in 110 Te. The results support the deformed interpretation
Gene mining a marama bean expressed sequence tags (ESTs ...

African Journals Online (AJOL)

The authors reported the identification of genes associated with embryonic development and microsatellite sequences. The future direction will entail characterization of these genes using gene over-expression and mutant assays. Key words: Namibia, simple sequence repeats (SSR), data mining, homology searches, ...
Two mechanisms coordinate replication termination by the Escherichia coli Tus–Ter complex

KAUST Repository

Pandey, Manjula; Elshenawy, Mohamed; Jergic, Slobodan; Takahashi, Masateru; Dixon, Nicholas E.; Hamdan, Samir; Patel, Smita S.

2015-01-01

The Escherichia coli replication terminator protein (Tus) binds to Ter sequences to block replication forks approaching from one direction. Here, we used single molecule and transient state kinetics to study responses of the heterologous phage T7
Biased distribution of DNA uptake sequences towards genome maintenance genes

DEFF Research Database (Denmark)

Davidsen, T.; Rodland, E.A.; Lagesen, K.

2004-01-01

Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...... in these organisms. Pasteurella multocida also displayed high frequencies of a putative DUS identical to that previously identified in H. influenzae and with a skewed distribution towards genome maintenance genes, indicating that this bacterium might be transformation competent under certain conditions....
Topological disposition of the sequences -QRKIVE- and -KETYY in native (Na+ + K+)-ATPase

International Nuclear Information System (INIS)

Bayer, R.

1990-01-01

The dispositions with respect to the plane of the membrane of lysine-905 in the internal sequence -EQRKIVE- and of lysine-1012 in the carboxy-terminal sequence -RRPGGWVEKETYY of the α-polypeptide of sodium and potassium ion activated adenosinetriphosphatase have been determined. These lysines are found in peptides released from the intact α-polypeptide by the extracellular protease from Staphylococcus aureus strain V8 and by trypsin, respectively. Synthetic peptides containing terminal sequences of these were used to prepare polyclonal antibodies, which were then used to prepare immunoadsorbents directed against the respective peptides. Sealed, right-side-out membrane vesicles containing native (Na + + K + )-ATPase were labeled with pyridoxal phosphate and sodium [ 3 H]borohydride in the absence or presence of saponin. The labeled α-polypeptide was isolated from these vesicles and digested with appropriate proteases. The incorporation of radioactivity into the peptides binding to the immunoadsorbent directed against the sequence pyrERXIVE increased 3-fold int the presence of saponin as a result of the increased accessibility of this portion of the protein to the reagent when the vesicles were breached by saponin; hence, this sequence is located on the cytoplasmic face of the membrane. It was inferred that the carboxy-terminal sequence -KETYY is on the extracytoplasmic face since the incorporation of radioactivity into peptides binding to the immunoadsorbent directed against the sequence -ETYY did not change when the vesicles were breached with saponin
RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

Science.gov (United States)

Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

1990-07-01

The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.
Occurrence of C-Terminal Residue Exclusion in Peptide Fragmentation by ESI and MALDI Tandem Mass Spectrometry

Science.gov (United States)

Dupré, Mathieu; Cantel, Sonia; Martinez, Jean; Enjalbal, Christine

2012-02-01

By screening a data set of 392 synthetic peptides MS/MS spectra, we found that a known C-terminal rearrangement was unexpectedly frequently occurring from monoprotonated molecular ions in both ESI and MALDI tandem mass spectrometry upon low and high energy collision activated dissociations with QqTOF and TOF/TOF mass analyzer configuration, respectively. Any residue localized at the C-terminal carboxylic acid end, even a basic one, was lost, provided that a basic amino acid such arginine and to a lesser extent histidine and lysine was present in the sequence leading to a fragment ion, usually depicted as (bn-1 + H2O) ion, corresponding to a shortened non-scrambled peptide chain. Far from being an epiphenomenon, such a residue exclusion from the peptide chain C-terminal extremity gave a fragment ion that was the base peak of the MS/MS spectrum in certain cases. Within the frame of the mobile proton model, the ionizing proton being sequestered onto the basic amino acid side chain, it is known that the charge directed fragmentation mechanism involved the C-terminal carboxylic acid function forming an anhydride intermediate structure. The same mechanism was also demonstrated from cationized peptides. To confirm such assessment, we have prepared some of the peptides that displayed such C-terminal residue exclusion as a C-terminal backbone amide. As expected in this peptide amide series, the production of truncated chains was completely suppressed. Besides, multiply charged molecular ions of all peptides recorded in ESI mass spectrometry did not undergo such fragmentation validating that any mobile ionizing proton will prevent such a competitive C-terminal backbone rearrangement. Among all well-known nondirect sequence fragment ions issued from non specific loss of neutral molecules (mainly H2O and NH3) and multiple backbone amide ruptures (b-type internal ions), the described C-terminal residue exclusion is highly identifiable giving raise to a single fragment ion in
Structure of antigenetic determinants in the amino-terminal region of bovine fibrinogen Aα chain

International Nuclear Information System (INIS)

Tanswell, P.; Reiter, H.; Timpl, R.

1978-01-01

A radioimmunoassay was developed for peptide F-CB1α from the amino of bovine fibrinogen Aα chain, isolated after reduction and carboxymethylation of the multichain disulfide-linked cyanogen bromide peptide F-CB1. Seven out of twelve different rabbit antisera produced against fibrinogen, peptide F-CB1 or Aα chain showed distinct binding to 125 I-labelled F-CB1α. Thrombin cleavage of F-CB1α yielded two fragments: fibrinopeptide A (residues 1-19) and the carboxy-terminal fragment Th2 (residues 20-54). Antisera could be classified into three groups according to whether they recognized antigenic determinants on fibrinopeptide A, on peptide Th2 or as they showed diminished reactions with both fragments. Only little or no cross-reaction was observed with the amino-terminal cyanogen bromide peptides of Bβ and γ chain. Proteolytic fragments of fibrinopeptide A were isolated and tested for inhibitory activity with two antisera. One antiserum contained anitbodies binding selectively to the amino-terminal sequence (residues 4-11) and did not cross-react with human fibrinopeptide A. Another antiserum showed a specific binding restricted to the carboxy-terminal sequence (residues 11-18) and cross-reacted completely with human fibrinopeptide A. These results correlate well with the primary structures of the two fibrinopeptides. The antigenic activity of the peptide fragment Th2 was localized on a 15-residue tryptic peptide derived from the central portion of the sequence. These and further data indicate that at least six different antigenic determinants are present in peptide F-CB1α. (orig.) 891 AJ [de
Dynamics of termination during in vitro replication of ultraviolet-irradiated DNA with DNA polymerase III holoenzyme of Escherichia coli

International Nuclear Information System (INIS)

Shwartz, H.; Livneh, Z.

1987-01-01

During in vitro replication of UV-irradiated single-stranded DNA with Escherichia coli DNA polymerase III holoenzyme termination frequently occurs at pyrimidine photodimers. The termination stage is dynamic and characterized by at least three different events: repeated dissociation-reinitiation cycles of the polymerase at the blocked termini; extensive hydrolysis of ATP to ADP and inorganic phosphate; turnover of dNTPs into dNMP. The reinitiation events are nonproductive and are not followed by further elongation. The turnover of dNTPs into dNMPs is likely to result from repeated cycles of insertion of dNMP residues opposite the blocking lesions followed by their excision by the 3'----5' exonucleolytic activity of the polymerase. Although all dNTPs are turned over, there is a preference for dATP, indicating that DNA polymerase III holoenzyme has a preference for inserting a dAMP residue opposite blocking pyrimidine photodimers. We suggest that the inability of the polymerase to bypass photodimers during termination is due to the formation of defective initiation-like complexes with reduced stability at the blocked termini
Distribution and sequence homogeneity of an abundant satellite DNA in the beetle, Tenebrio molitor.

Science.gov (United States)

Davis, C A; Wyatt, G R

1989-01-01

The mealworm beetle, Tenebrio molitor, contains an unusually abundant and homogeneous satellite DNA which constitutes up to 60% of its genome. The satellite DNA is shown to be present in all of the chromosomes by in situ hybridization. 18 dimers of the repeat unit were cloned and sequenced. The consensus sequence is 142 nt long and lacks any internal repeat structure. Monomers of the sequence are very similar, showing on average a 2% divergence from the calculated consensus. Variant nucleotides are scattered randomly throughout the sequence although some variants are more common than others. Neighboring repeat units are no more alike than randomly chosen ones. The results suggest that some mechanism, perhaps gene conversion, is acting to maintain the homogeneity of the satellite DNA despite its abundance and distribution on all of the chromosomes. Images PMID:2762148
Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase

Directory of Open Access Journals (Sweden)

Komivi Dossa

2017-08-01

Full Text Available The sequencing of the full nuclear genome of sesame (Sesamum indicum L. provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellites markers or simple sequence repeats (SSR in crop genotyping, genetics, and breeding applications is well established, only a little information exist concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primers pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78% were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/, which provides free access to SSRs data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.
Cyclization of the N-Terminal X-Asn-Gly Motif during Sample Preparation for Bottom-Up Proteomics

DEFF Research Database (Denmark)

Zhang, Xumin; Højrup, Peter

2010-01-01

We, herein, report a novel -17 Da peptide modification corresponding to an N-terminal cyclization of peptides possessing the N-terminal motif of X-Asn-Gly. The cyclization occurs spontaneously during sample preparation for bottom-up proteomics studies. Distinct from the two well-known N-terminal ......We, herein, report a novel -17 Da peptide modification corresponding to an N-terminal cyclization of peptides possessing the N-terminal motif of X-Asn-Gly. The cyclization occurs spontaneously during sample preparation for bottom-up proteomics studies. Distinct from the two well-known N......-terminal cyclizations, cyclization of N-terminal glutamine and S-carbamoylmethylcysteine, it is dependent on pH instead of [NH(4)(+)]. The data set from our recent study on large-scale N(α)-modified peptides revealed a sequence requirement for the cyclization event similar to the well-known deamidation of Asn to iso...

Complete plastid genome sequencing of Trochodendraceae reveals a significant expansion of the inverted repeat and suggests a Paleogene divergence between the two extant species.

Directory of Open Access Journals (Sweden)

Yan-xia Sun

Full Text Available The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots, but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8 that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield an identical tree topology as previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44-30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern.
Mechanical processes with repeated attenuated impacts

CERN Document Server

Nagaev, R F

1999-01-01

This book is devoted to considering in the general case - using typical concrete examples - the motion of machines and mechanisms of impact and vibro-impact action accompanied by a peculiar phenomenon called "impact collapse". This phenomenon is that after the initial collision, a sequence of repeated gradually quickening collisions of decreasing-to-zero intensity occurs, with the final establishment of protracted contact between the interacting bodies. The initiation conditions of the impact collapse are determined and calculation techniques for the quantitative characteristics of the corresp
Genetic Diversity of Arabica Coffee (Coffea arabica L. in Nicaragua as Estimated by Simple Sequence Repeat Markers

Directory of Open Access Journals (Sweden)

Mulatu Geleta

2012-01-01

Full Text Available Coffea arabica L. (arabica coffee, the only tetraploid species in the genus Coffea, represents the majority of the world’s coffee production and has a significant contribution to Nicaragua’s economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei’s gene diversity (HT and the within-population gene diversity (HS were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (FST=0.13; P<0.001. The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety.
Position-dependent termination and widespread obligatory frameshifting in Euplotes translation

Energy Technology Data Exchange (ETDEWEB)

Lobanov, Alexei V.; Heaphy, Stephen M.; Turanov, Anton A.; Gerashchenko, Maxim V.; Pucciarelli, Sandra; Devaraj, Raghul R.; Xie, Fang; Petyuk, Vladislav A.; Smith, Richard D.; Klobutcher, Lawrence A.; Atkins, John F.; Miceli, Cristina; Hatfield, Dolph L.; Baranov, Pavel V.; Gladyshev, Vadim N.

2016-11-21

The ribosome can change its reading frame during translation in a process known as programmed ribosomal frameshifting. These rare events are supported by complex mRNA signals. However, we found that the ciliates Euplotes crassus and Euplotes focardii exhibit widespread frameshifting at stop codons. 47 different codons preceding stop signals resulted in either +1 or +2 frameshifts, and +1 frameshifting at AAA was the most frequent. The frameshifts showed unusual plasticity and rapid evolution, and had little influence on translation rates. The proximity of a stop codon to the 3' mRNA end, rather than its occurrence or sequence context, appeared to designate termination. Thus, a ‘stop codon’ is not a sufficient signal for translation termination, and the default function of stop codons in Euplotes is frameshifting, whereas termination is specific to certain mRNA positions and probably requires additional factors.
Genomic 3' terminal sequence comparison of three isolates of rabbit haemorrhagic disease virus.

Science.gov (United States)

Milton, I D; Vlasak, R; Nowotny, N; Rodak, L; Carter, M J

1992-05-15

Comparison of sequence data is necessary in older to investigate virus origins, identify features common to virulent strains, and characterize genomic organization within virus families. A virulent caliciviral disease of rabbits recently emerged in China. We have sequenced 1100 bases from the 3' ends of two independent European isolates of this virus, and compared these with previously determined calicivirus sequences. Rabbit caliciviruses were closely related, despite the different countries in which isolation was made. This supports the rapid spread of a new virus across Europe. The capsid protein sequences of these rabbit viruses differ markedly from those determined for feline calicivirus, but a hypothetical 3' open reading frame is relatively well conserved between the caliciviruses of these two different hosts and argues for a functional role.
Long-read sequencing and de novo assembly of a Chinese genome

Science.gov (United States)

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...
Structural and sequence diversity of the transposon Galileo in the Drosophila willistoni genome.

Science.gov (United States)

Gonçalves, Juliana W; Valiati, Victor Hugo; Delprat, Alejandra; Valente, Vera L S; Ruiz, Alfredo

2014-09-13

Galileo is one of three members of the P superfamily of DNA transposons. It was originally discovered in Drosophila buzzatii, in which three segregating chromosomal inversions were shown to have been generated by ectopic recombination between Galileo copies. Subsequently, Galileo was identified in six of 12 sequenced Drosophila genomes, indicating its widespread distribution within this genus. Galileo is strikingly abundant in Drosophila willistoni, a neotropical species that is highly polymorphic for chromosomal inversions, suggesting a role for this transposon in the evolution of its genome. We carried out a detailed characterization of all Galileo copies present in the D. willistoni genome. A total of 191 copies, including 133 with two terminal inverted repeats (TIRs), were classified according to structure in six groups. The TIRs exhibited remarkable variation in their length and structure compared to the most complete copy. Three copies showed extended TIRs due to internal tandem repeats, the insertion of other transposable elements (TEs), or the incorporation of non-TIR sequences into the TIRs. Phylogenetic analyses of the transposase (TPase)-encoding and TIR segments yielded two divergent clades, which we termed Galileo subfamilies V and W. Target-site duplications (TSDs) in D. willistoni Galileo copies were 7- or 8-bp in length, with the consensus sequence GTATTAC. Analysis of the region around the TSDs revealed a target site motif (TSM) with a 15-bp palindrome that may give rise to a stem-loop secondary structure. There is a remarkable abundance and diversity of Galileo copies in the D. willistoni genome, although no functional copies were found. The TIRs in particular have a dynamic structure and extend in different ways, but their ends (required for transposition) are more conserved than the rest of the element. The D. willistoni genome harbors two Galileo subfamilies (V and W) that diverged ~9 million years ago and may have descended from an ancestral
Evolutionary analysis of a novel zinc ribbon in the N-terminal region of threonine synthase.

Science.gov (United States)

Kaur, Gurmeet; Subramanian, Srikrishna

2017-10-18

Threonine synthase (TS) catalyzes the terminal reaction in the biosynthetic pathway of threonine and requires pyridoxal phosphate as a cofactor. TSs share a common catalytic domain with other fold type II PALP dependent enzymes. TSs are broadly grouped into two classes based on their sequence, quaternary structure, and enzyme regulation. We report the presence of a novel zinc ribbon domain in the N-terminal region preceding the catalytic core in TS. The zinc ribbon domain is present in TSs belonging to both classes. Our sequence analysis reveals that archaeal TSs possess all zinc chelating residues to bind a metal ion that are lacking in the structurally characterized homologs. Phylogenetic analysis suggests that TSs with an N-terminal zinc ribbon likely represents the ancestral state of the enzyme while TSs without a zinc ribbon must have diverged later in specific lineages. The zinc ribbon and its N- and C-terminal extensions are important for enzyme stability, activity and regulation. It is likely that the zinc ribbon domain is involved in higher order oligomerization or mediating interactions with other biomolecules leading to formation of larger metabolic complexes.
Genetic variability in Brazilian populations of Biomphalaria straminea complex detected by simple sequence repeat anchored polymerase chain reaction amplification

Directory of Open Access Journals (Sweden)

Caldeira Roberta L

2001-01-01

Full Text Available Biomphalaria glabrata, B. tenagophila and B. straminea are intermediate hosts of Schistosoma mansoni, in Brazil. The latter is of epidemiological importance in the northwest of Brazil and, due to morphological similarities, has been grouped with B. intermedia and B. kuhniana in a complex named B. straminea. In the current work, we have standardized the simple sequence repeat anchored polymerase chain reaction (SSR-PCR technique, using the primers (CA8RY and K7, to study the genetic variability of these species. The similarity level was calculated using the Dice coefficient and genetic distance using the Nei and Li coefficient. The trees were obtained by the UPGMA and neighbor-joining methods. We have observed that the most related individuals belong to the same species and locality and that individuals from different localities, but of the same species, present clear heterogeneity. The trees generated using both methods showed similar topologies. The SSR-PCR technique was shown to be very efficient in intrapopulational and intraspecific studies of the B. straminea complex snails.
Sequencing of BAC pools by different next generation sequencing platforms and strategies

Directory of Open Access Journals (Sweden)

Scholz Uwe

2011-10-01

Full Text Available Abstract Background Next generation sequencing of BACs is a viable option for deciphering the sequence of even large and highly repetitive genomes. In order to optimize this strategy, we examined the influence of read length on the quality of Roche/454 sequence assemblies, to what extent Illumina/Solexa mate pairs (MPs improve the assemblies by scaffolding and whether barcoding of BACs is dispensable. Results Sequencing four BACs with both FLX and Titanium technologies revealed similar sequencing accuracy, but showed that the longer Titanium reads produce considerably less misassemblies and gaps. The 454 assemblies of 96 barcoded BACs were improved by scaffolding 79% of the total contig length with MPs from a non-barcoded library. Assembly of the unmasked 454 sequences without separation by barcodes revealed chimeric contig formation to be a major problem, encompassing 47% of the total contig length. Masking the sequences reduced this fraction to 24%. Conclusion Optimal BAC pool sequencing should be based on the longest available reads, with barcoding essential for a comprehensive assessment of both repetitive and non-repetitive sequence information. When interest is restricted to non-repetitive regions and repeats are masked prior to assembly, barcoding is non-essential. In any case, the assemblies can be improved considerably by scaffolding with non-barcoded BAC pool MPs.
Electricity sequence control

International Nuclear Information System (INIS)

Shin, Heung Ryeol

2010-03-01

The contents of the book are introduction of control system, like classification and control signal, introduction of electricity power switch, such as push-button and detection switch sensor for induction type and capacitance type machinery for control, solenoid valve, expression of sequence and type of electricity circuit about using diagram, time chart, marking and term, logic circuit like Yes, No, and, or and equivalence logic, basic electricity circuit, electricity sequence control, added condition, special program control about choice and jump of program, motor control, extra circuit on repeat circuit, pause circuit in a conveyer, safety regulations and rule about classification of electricity disaster and protective device for insulation.
Heterogeneity of the Epstein-Barr Virus (EBV) Major Internal Repeat Reveals Evolutionary Mechanisms of EBV and a Functional Defect in the Prototype EBV Strain B95-8.

Science.gov (United States)

Ba Abdullah, Mohammed M; Palermo, Richard D; Palser, Anne L; Grayson, Nicholas E; Kellam, Paul; Correia, Samantha; Szymula, Agnieszka; White, Robert E

2017-12-01

Epstein-Barr virus (EBV) is a ubiquitous pathogen of humans that can cause several types of lymphoma and carcinoma. Like other herpesviruses, EBV has diversified through both coevolution with its host and genetic exchange between virus strains. Sequence analysis of the EBV genome is unusually challenging because of the large number and lengths of repeat regions within the virus. Here we describe the sequence assembly and analysis of the large internal repeat 1 of EBV (IR1; also known as the BamW repeats) for more than 70 strains. The diversity of the latency protein EBV nuclear antigen leader protein (EBNA-LP) resides predominantly within the exons downstream of IR1. The integrity of the putative BWRF1 open reading frame (ORF) is retained in over 80% of strains, and deletions truncating IR1 always spare BWRF1. Conserved regions include the IR1 latency promoter (Wp) and one zone upstream of and two within BWRF1. IR1 is heterogeneous in 70% of strains, and this heterogeneity arises from sequence exchange between strains as well as from spontaneous mutation, with interstrain recombination being more common in tumor-derived viruses. This genetic exchange often incorporates regions of Epstein-Barr virus (EBV) infects the majority of the world population but causes illness in only a small minority of people. Nevertheless, over 1% of cancers worldwide are attributable to EBV. Recent sequencing projects investigating virus diversity to see if different strains have different disease impacts have excluded regions of repeating sequence, as they are more technically challenging. Here we analyze the sequence of the largest repeat in EBV (IR1). We first characterized the variations in protein sequences encoded across IR1. In studying variations within the repeat of each strain, we identified a mutation in the main laboratory strain of EBV that impairs virus function, and we suggest that tumor-associated viruses may be more likely to contain DNA mixed from two strains. The
Forkhead Box C1 Regulates Human Primary Keratinocyte Terminal Differentiation.

Directory of Open Access Journals (Sweden)

Lianghua Bin

Full Text Available The epidermis serves as a critical protective barrier between the internal and external environment of the human body. Its remarkable barrier function is established through the keratinocyte (KC terminal differentiation program. The transcription factors specifically regulating terminal differentiation remain largely unknown. Using a RNA-sequencing (RNA-seq profiling approach, we found that forkhead box c 1 (FOXC1 was significantly up-regulated in human normal primary KC during the course of differentiation. This observation was validated in human normal primary KC from several different donors and human skin biopsies. Silencing FOXC1 in human normal primary KC undergoing differentiation led to significant down-regulation of late terminal differentiation genes markers including epidermal differentiation complex genes, keratinization genes, sphingolipid/ceramide metabolic process genes and epidermal specific cell-cell adhesion genes. We further demonstrated that FOXC1 works down-stream of ZNF750 and KLF4, and upstream of GRHL3. Thus, this study defines FOXC1 as a regulator specific for KC terminal differentiation and establishes its potential position in the genetic regulatory network.
Report on achievements in fiscal 1998 on research and development of the genome infomatics technology in the industrial and scientific technology research and development project. Research and development of the genome infomatics technology; 1998 nendo genome infomatics gijutsu kenkyu kaihatsu seika hokokusho

Energy Technology Data Exchange (ETDEWEB)

NONE

2000-03-01

This paper describes the achievements in fiscal 1998 on research and development of the genome infomatics technology. First, plasmid DNA was prepared that becomes a mold for sequence reaction; primers were prepared based on the base sequence of terminal groups, which were used to perform the sequence reaction for the next step; and determination was made on the base sequence following the terminal sequence that has been determined previously. This primer walking process was repeated, whereas the obtained data for each base sequence piece were unified to have determined the base sequence in the complete-length cDNA300 clone. The complete-length cDNA853 for homo-sapiens was analyzed by using the primer walking process. The Sanger's sequencing method was used for the reaction. The resultant sequence data was verified to be of a complete-length cDNA containing the actual protein codon from the N terminal to the C terminal. A cDNA database was newly structured. The complete-length cDNA can be retrieved by using as the retrieval condition each organ originating the sequence, the manifestation frequency therein, and the keyword representing the function. (NEDO)
Report on achievements in fiscal 1998 on research and development of the genome infomatics technology in the industrial and scientific technology research and development project. Research and development of the genome infomatics technology; 1998 nendo genome infomatics gijutsu kenkyu kaihatsu seika hokokusho

Energy Technology Data Exchange (ETDEWEB)

NONE

2000-03-01

This paper describes the achievements in fiscal 1998 on research and development of the genome infomatics technology. First, plasmid DNA was prepared that becomes a mold for sequence reaction; primers were prepared based on the base sequence of terminal groups, which were used to perform the sequence reaction for the next step; and determination was made on the base sequence following the terminal sequence that has been determined previously. This primer walking process was repeated, whereas the obtained data for each base sequence piece were unified to have determined the base sequence in the complete-length cDNA300 clone. The complete-length cDNA853 for homo-sapiens was analyzed by using the primer walking process. The Sanger's sequencing method was used for the reaction. The resultant sequence data was verified to be of a complete-length cDNA containing the actual protein codon from the N terminal to the C terminal. A cDNA database was newly structured. The complete-length cDNA can be retrieved by using as the retrieval condition each organ originating the sequence, the manifestation frequency therein, and the keyword representing the function. (NEDO)
Sites of termination of in vitro DNA synthesis on psoralen phototreated single-stranded templates

International Nuclear Information System (INIS)

Piette, J.; Hearst, J.

1985-01-01

Single-stranded DNA has been photochemically induced to react with 4'-hydroxymethyl-4,5',8-trimethylpsoralen (HMT) and used as substrate for DNA replication with E. coli DNA polymerase I large fragment. By using the dideoxy sequencing procedure, it is possible to map the termination sites on the template photoreacted with HMT. These sites occur at the nucleotides preceding each thymine residue (and a few cytosine residues), emphasizing the fact that in a single-stranded stretch of DNA, HMT reacts with each thymine residue without any specificity regarding the flanking base sequence of the thymine residues. In addition, termination of DNA synthesis due to psoralen-adducted thymine is not influenced by the efficiency of the 3'-5' exonuclease proof-reading activity of the DNA polymerase. (author)
Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs

Science.gov (United States)

Schnall-Levin, Michael; Rissland, Olivia S.; Johnston, Wendy K.; Perrimon, Norbert; Bartel, David P.; Berger, Bonnie

2011-01-01

MicroRNAs (miRNAs) regulate numerous biological processes by base-pairing with target messenger RNAs (mRNAs), primarily through sites in 3′ untranslated regions (UTRs), to direct the repression of these targets. Although miRNAs have sometimes been observed to target genes through sites in open reading frames (ORFs), large-scale studies have shown such targeting to be generally less effective than 3′ UTR targeting. Here, we show that several miRNAs each target significant groups of genes through multiple sites within their coding regions. This ORF targeting, which mediates both predictable and effective repression, arises from highly repeated sequences containing miRNA target sites. We show that such sequence repeats largely arise through evolutionary duplications and occur particularly frequently within families of paralogous C2H2 zinc-finger genes, suggesting the potential for their coordinated regulation. Examples of ORFs targeted by miR-181 include both the well-known tumor suppressor RB1 and RBAK, encoding a C2H2 zinc-finger protein and transcriptional binding partner of RB1. Our results indicate a function for repeat-rich coding sequences in mediating post-transcriptional regulation and reveal circumstances in which miRNA-mediated repression through ORF sites can be reliably predicted. PMID:21685129
In Silico Genome Comparison and Distribution Analysis of Simple Sequences Repeats in Cassava

Directory of Open Access Journals (Sweden)

Andrea Vásquez

2014-01-01

Full Text Available We conducted a SSRs density analysis in different cassava genomic regions. The information obtained was useful to establish comparisons between cassava’s SSRs genomic distribution and those of poplar, flax, and Jatropha. In general, cassava has a low SSR density (~50 SSRs/Mbp and has a high proportion of pentanucleotides, (24,2 SSRs/Mbp. It was found that coding sequences have 15,5 SSRs/Mbp, introns have 82,3 SSRs/Mbp, 5′ UTRs have 196,1 SSRs/Mbp, and 3′ UTRs have 50,5 SSRs/Mbp. Through motif analysis of cassava’s genome SSRs, the most abundant motif was AT/AT while in intron sequences and UTRs regions it was AG/CT. In addition, in coding sequences the motif AAG/CTT was also found to occur most frequently; in fact, it is the third most used codon in cassava. Sequences containing SSRs were classified according to their functional annotation of Gene Ontology categories. The identified SSRs here may be a valuable addition for genetic mapping and future studies in phylogenetic analyses and genomic evolution.
The EspF N-Terminal of Enterohemorrhagic Escherichia coli O157:H7 EDL933w Imparts Stronger Toxicity Effects on HT-29 Cells than the C-Terminal.

Science.gov (United States)

Wang, Xiangyu; Du, Yanli; Hua, Ying; Fu, Muqing; Niu, Cong; Zhang, Bao; Zhao, Wei; Zhang, Qiwei; Wan, Chengsong

2017-01-01

Enterohemorrhagic Escherichia coli (EHEC) O157:H7 EspF is an important multifunctional protein that destroys the tight junctions of intestinal epithelial cells and promotes host cell apoptosis. However, its molecular mechanism remains elusive. We knocked out the espF sequence (747 bp, Δ espF ), N-terminal sequence (219 bp, Δ espF N ), and C-terminal sequence (528 bp, Δ espF C ) separately using the pKD46-mediated λ Red homologous recombination system. Then, we built the corresponding complementation strains, namely, Δ espF/pespF , Δ espF N /pespF N , and Δ espF C /pespF C by overlap PCR, which were used in infecting HT-29 cells and BALB/C mice. The level of reactive oxygen species, cell apoptosis, mitochondrial trans-membrane potential, inflammatory factors, transepithelial electrical resistance (TER), and animal mortality were evaluated by DCFH-DA, double staining of Annexin V-FITC/PI, JC-1 staining, ELISA kit, and a mouse assay. The wild-type (WT), Δ espF , Δ espF/pespF , Δ espF C , Δ espF C /pespF C , Δ espF N , and Δ espF N /pespF N groups exhibited apoptotic rates of 68.3, 27.9, 64.9, 65.7, 73.4, 41.3, and 35.3% respectively, and mean TNF-α expression levels of 428 pg/mL, 342, 466, 446, 381, 383, and 374 pg/mL, respectively. In addition, the apoptotic rates and TNF-α levels of the WT, Δ espF/pespF , and Δ espF C were significantly higher than that of Δ espF , Δ espF N , Δ espF C /pespF C , and Δ espF N /pespF N group ( p < 0.05). The N-terminal of EspF resulted in an increase in the number of apoptotic cells, TNF-α secretion, ROS generation, mitochondria apoptosis, and pathogenicity in BalB/c mice. In conclusion, the N-terminal domain of the Enterohemorrhagic E. coli O157:H7 EspF more strongly promotes apoptosis and inflammation than the C-terminal domain.
Characterization of Equine Infectious Anemia Virus Long Terminal Repeat Quasispecies In Vitro and In Vivo.

Science.gov (United States)

Wang, Xue-Feng; Liu, Qiang; Wang, Yu-Hong; Wang, Shuai; Chen, Jie; Lin, Yue-Zhi; Ma, Jian; Zhou, Jian-Hua; Wang, Xiaojun

2018-04-15

The equine infectious anemia virus (EIAV) attenuated vaccine was developed by long-term passaging of a field-isolated virulent strain in cross-species hosts, followed by successive cultivation in cells in vitro To explore the molecular mechanism underlying the evolution of the EIAV attenuated vaccine, a systematic study focusing on long-terminal-repeat (LTR) variation in numerous virus strains ranging from virulent EIAV to attenuated EIAV was performed over time both in vitro and in vivo Two hypervariable regions were identified within the U3 region in the enhancer region (EHR) and the negative regulatory element (NRE) and within the R region in the transcription start site (TSS) and the Tat-activating region (TAR). Among these sites, variation in the U3 region resulted in the formation of additional transcription factor binding sites; this variation of the in vitro -adapted strains was consistent with the loss of pathogenicity. Notably, the same LTR variation pattern was observed both in vitro and in vivo Generally, the LTR variation in both the attenuated virus and the virulent strain fluctuated over time in vivo Interestingly, the attenuated-virus-specific LTR variation was also detected in horses infected with the virulent strain, supporting the hypothesis that the evolution of an attenuated virus might have involved branching from EIAV quasispecies. This hypothesis was verified by phylogenetic analysis. The present systematic study examining the molecular evolution of attenuated EIAV from EIAV quasispecies may provide an informative model reflecting the evolution of similar lentiviruses. IMPORTANCE The attenuated EIAV vaccine was the first lentiviral vaccine used to successfully control for equine infectious anemia in China. This vaccine provides an important reference for studying the relationship between EIAV gene variation and changes in biological characteristics. Importantly, the vaccine provides a model for the investigation of lentiviral quasispecies

The decorin sequence SYIRIADTNIT binds collagen type I

DEFF Research Database (Denmark)

Kalamajski, Sebastian; Aspberg, Anders; Oldberg, Ake

2007-01-01

Decorin belongs to the small leucine-rich repeat proteoglycan family, interacts with fibrillar collagens, and regulates the assembly, structure, and biomechanical properties of connective tissues. The decorin-collagen type I-binding region is located in leucine-rich repeats 5-6. Site......-directed mutagenesis of this 54-residue-long collagen-binding sequence identifies Arg-207 and Asp-210 in leucine-rich repeat 6 as crucial for the binding to collagen. The synthetic peptide SYIRIADTNIT, which includes Arg-207 and Asp-210, inhibits the binding of full-length recombinant decorin to collagen in vitro....... These collagen-binding amino acids are exposed on the exterior of the beta-sheet-loop structure of the leucine-rich repeat. This resembles the location of interacting residues in other leucine-rich repeat proteins....
New polymorphisms within the variable number tandem repeat (VNTR) 7 locus of Mycobacterium avium subsp. paratuberculosis.

Science.gov (United States)

Fawzy, Ahmad; Zschöck, Michael; Ewers, Christa; Eisenberg, Tobias

2016-06-01

Variable number tandem repeat (VNTR) is a frequently employed typing method of Mycobacterium avium paratuberculosis (MAP) isolates. Based on whole genome sequencing in a previous study, allelic diversity at some VNTR loci seems to over- or under-estimate the actual phylogenetic variance among isolates. Interestingly, two closely related isolates on one farm showed polymorphism at the VNTR 7 locus, raising concerns about the misleading role that it might play in genotyping. We aimed to investigate the underlying basis of VNTR 7-polymorphism by analyzing sequence data for published genomes and field isolates of MAP and other M. avium complex (MAC) members. In contrast to MAP strains from cattle, strains from sheep displayed an "imperfect" repeat within VNTR 7, which was identical to respective allele types in other MAC genomes. Subspecies- and strain-specific single nucleotide polymorphisms (SNPs) and two novel (16 and 56 bp) repeats were detected. Given the combination of the three existing repeats, there are at least five different patterns for VNTR 7. The present findings highlight a higher polymorphism and probable instability of VNTR 7 locus that needs to be considered and challenged in future studies. Until then, sequencing of this locus in future studies is important to correctly assign the underlying allele types.(1). Copyright © 2016 Elsevier Ltd. All rights reserved.
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Science.gov (United States)

Gill, Navdeep; Buti, Matteo; Kane, Nolan; Bellec, Arnaud; Helmstetter, Nicolas; Berges, Hélène; Rieseberg, Loren H.

2014-01-01

Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence. PMID:24833511
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L. Genome

Directory of Open Access Journals (Sweden)

Navdeep Gill

2014-04-01

Full Text Available Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
The C-terminal tail of the gp41 transmembrane envelope glycoprotein of HIV-1 clades A, B, C, and D may exist in two conformations: an analysis of sequence, structure, and function

International Nuclear Information System (INIS)

Hollier, Mark J.; Dimmock, Nigel J.

2005-01-01

In addition to the major ectodomain, the gp41 transmembrane glycoprotein of HIV-1 is now known to have a minor ectodomain that is part of the long C-terminal tail. Both ectodomains are highly antigenic, carry neutralizing and non-neutralizing epitopes, and are involved in virus-mediated fusion activity. However, data have so far been biologically based, and derived solely from T cell line-adapted (TCLA), B clade viruses. Here we have carried out sequence and theoretically based structural analyses of 357 gp41 C-terminal sequences of mainly primary isolates of HIV-1 clades A, B, C, and D. Data show that all these viruses have the potential to form a tail loop structure (the minor ectodomain) supported by three, β-sheet, membrane-spanning domains (MSDs). This means that the first (N-terminal) tyrosine-based sorting signal of the gp41 tail is situated outside the cell membrane and is non-functional, and that gp41 that reaches the cell surface may be recycled back into the cytoplasm through the activity of the second tyrosine-sorting signal. However, we suggest that only a minority of cell-associated gp41 molecules - those destined for incorporation into virions - has 3 MSDs and the minor ectodomain. Most intracellular gp41 has the conventional single MSD, no minor ectodomain, a functional first tyrosine-based sorting signal, and in line with current thinking is degraded intracellularly. The gp41 structural diversity suggested here can be viewed as an evolutionary strategy to minimize HIV-1 envelope glycoprotein expression on the cell surface, and hence possible cytotoxicity and immune attack on the infected cell
Solution structure of the twelfth cysteine-rich ligand-binding repeat in rat megalin

International Nuclear Information System (INIS)

Wolf, Christian A.; Dancea, Felician; Shi Meichen; Bade-Noskova, Veronika; Rueterjans, Heinz; Kerjaschki, Dontscho; Luecke, Christian

2007-01-01

Megalin, an approx. 600 kDa transmembrane glycoprotein that acts as multi-ligand transporter, is a member of the low density lipoprotein receptor gene family. Several cysteine-rich repeats, each consisting of about 40 residues, are responsible for the multispecific binding of ligands. The solution structure of the twelfth cysteine-rich ligand-binding repeat with class A motif found in megalin features two short β-strands and two helical turns, yielding the typical fold with a I-III, II-V and IV-VI disulfide bridge connectivity pattern and a calcium coordination site at the C-terminal end. The resulting differences in electrostatic surface potential compared to other ligand-binding modules of this gene family, however, may be responsible for the functional divergence
Regulation of presynaptic Ca2+, synaptic plasticity and contextual fear conditioning by a N-terminal β-amyloid fragment.

Science.gov (United States)

Lawrence, James L M; Tong, Mei; Alfulaij, Naghum; Sherrin, Tessi; Contarino, Mark; White, Michael M; Bellinger, Frederick P; Todorovic, Cedomir; Nichols, Robert A

2014-10-22

Soluble β-amyloid has been shown to regulate presynaptic Ca(2+) and synaptic plasticity. In particular, picomolar β-amyloid was found to have an agonist-like action on presynaptic nicotinic receptors and to augment long-term potentiation (LTP) in a manner dependent upon nicotinic receptors. Here, we report that a functional N-terminal domain exists within β-amyloid for its agonist-like activity. This sequence corresponds to a N-terminal fragment generated by the combined action of α- and β-secretases, and resident carboxypeptidase. The N-terminal β-amyloid fragment is present in the brains and CSF of healthy adults as well as in Alzheimer's patients. Unlike full-length β-amyloid, the N-terminal β-amyloid fragment is monomeric and nontoxic. In Ca(2+) imaging studies using a model reconstituted rodent neuroblastoma cell line and isolated mouse nerve terminals, the N-terminal β-amyloid fragment proved to be highly potent and more effective than full-length β-amyloid in its agonist-like action on nicotinic receptors. In addition, the N-terminal β-amyloid fragment augmented theta burst-induced post-tetanic potentiation and LTP in mouse hippocampal slices. The N-terminal fragment also rescued LTP inhibited by elevated levels of full-length β-amyloid. Contextual fear conditioning was also strongly augmented following bilateral injection of N-terminal β-amyloid fragment into the dorsal hippocampi of intact mice. The fragment-induced augmentation of fear conditioning was attenuated by coadministration of nicotinic antagonist. The activity of the N-terminal β-amyloid fragment appears to reside largely in a sequence surrounding a putative metal binding site, YEVHHQ. These findings suggest that the N-terminal β-amyloid fragment may serve as a potent and effective endogenous neuromodulator. Copyright © 2014 the authors 0270-6474/14/3414210-09$15.00/0.
Comparative genomics and repetitive sequence divergence in the species of diploid Nicotiana section Alatae.

Science.gov (United States)

Lim, K Yoong; Kovarik, Ales; Matyasek, Roman; Chase, Mark W; Knapp, Sandra; McCarthy, Elizabeth; Clarkson, James J; Leitch, Andrew R

2006-12-01

Combining phylogenetic reconstructions of species relationships with comparative genomic approaches is a powerful way to decipher evolutionary events associated with genome divergence. Here, we reconstruct the history of karyotype and tandem repeat evolution in species of diploid Nicotiana section Alatae. By analysis of plastid DNA, we resolved two clades with high bootstrap support, one containing N. alata, N. langsdorffii, N. forgetiana and N. bonariensis (called the n = 9 group) and another containing N. plumbaginifolia and N. longiflora (called the n = 10 group). Despite little plastid DNA sequence divergence, we observed, via fluorescent in situ hybridization, substantial chromosomal repatterning, including altered chromosome numbers, structure and distribution of repeats. Effort was focussed on 35S and 5S nuclear ribosomal DNA (rDNA) and the HRS60 satellite family of tandem repeats comprising the elements HRS60, NP3R and NP4R. We compared divergence of these repeats in diploids and polyploids of Nicotiana. There are dramatic shifts in the distribution of the satellite repeats and complete replacement of intergenic spacers (IGSs) of 35S rDNA associated with divergence of the species in section Alatae. We suggest that sequence homogenization has replaced HRS60 family repeats at sub-telomeric regions, but that this process may not occur, or occurs more slowly, when the repeats are found at intercalary locations. Sequence homogenization acts more rapidly (at least two orders of magnitude) on 35S rDNA than 5S rDNA and sub-telomeric satellite sequences. This rapid rate of divergence is analogous to that found in polyploid species, and is therefore, in plants, not only associated with polyploidy.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Science.gov (United States)

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Cloning of cDNAs coding for the heavy chain region and connecting region of human factor V, a blood coagulation factor with four types of internal repeats

International Nuclear Information System (INIS)

Kane, W.H.; Ichinose, A.; Hagen, F.S.; Davie, E.W.

1987-01-01

Human factor V is a high molecular weight plasma glycoprotein that participates as a cofactor in the conversion of prothrombin to thrombin by factor X/sub a/. Prior to its participation in the coagulation cascade, factor V is converted to factor V/sub a/ by thrombin generating a heavy chain and a light chain, and these two chains are held together by calcium ions. A connecting region originally located between the heavy and light chains is liberated during the activation reaction. In a previous study, a cDNA of 2970 nucleotides that codes for the carboxyl-terminal 938 amino acids of factor V was isolated and characterized from a Hep G2 cDNA library. This cDNA has been used to obtain additional clones from Hep G2 and human liver cDNA libraries. Furthermore, a Hep G2 cDNA library prepared with an oligonucleotide from the 5' end of these cDNAs was screened to obtain overlapping cDNA clones that code for the amino-terminal region of the molecule. The composite sequence of these clones spans 6911 nucleotides and is consistent with the size of the factor V message present in Hep G2 cells (approximately 7 kilobases). The cDNA codes for a leader sequence of 28 amino acids and a mature protein of 2196 amino acids. The amino acid sequence predicted from the cDNA was in complete agreement with 139 amino acid residues that were identified by Edman degradation of cyanogen bromide peptides isolated from the heavy chain region and connecting region of plasma factor V. The domain structure of human factor V is similar to that previously reported for human coagulation factor VIII. Two types of tandem repeats (17 and 9 amino acids) have also been identified in the connecting region of factor V. The present data indicate that the amino acid sequence in the heavy and light chain regions of factor V is ∼ 40% identical with the corresponding regions of factor VIII
Identifying uniformly mutated segments within repeats.

Science.gov (United States)

Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

2004-12-01

Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of a n-coin model for given S, subject a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l4logl), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).
Giardia telomeric sequence d(TAGGG)4 forms two intramolecular G-quadruplexes in K+ solution: effect of loop length and sequence on the folding topology.

Science.gov (United States)

Hu, Lanying; Lim, Kah Wai; Bouaziz, Serge; Phan, Anh Tuân

2009-11-25

Recently, it has been shown that in K(+) solution the human telomeric sequence d[TAGGG(TTAGGG)(3)] forms a (3 + 1) intramolecular G-quadruplex, while the Bombyx mori telomeric sequence d[TAGG(TTAGG)(3)], which differs from the human counterpart only by one G deletion in each repeat, forms a chair-type intramolecular G-quadruplex, indicating an effect of G-tract length on the folding topology of G-quadruplexes. To explore the effect of loop length and sequence on the folding topology of G-quadruplexes, here we examine the structure of the four-repeat Giardia telomeric sequence d[TAGGG(TAGGG)(3)], which differs from the human counterpart only by one T deletion within the non-G linker in each repeat. We show by NMR that this sequence forms two different intramolecular G-quadruplexes in K(+) solution. The first one is a novel basket-type antiparallel-stranded G-quadruplex containing two G-tetrads, a G x (A-G) triad, and two A x T base pairs; the three loops are consecutively edgewise-diagonal-edgewise. The second one is a propeller-type parallel-stranded G-quadruplex involving three G-tetrads; the three loops are all double-chain-reversal. Recurrence of several structural elements in the observed structures suggests a "cut and paste" principle for the design and prediction of G-quadruplex topologies, for which different elements could be extracted from one G-quadruplex and inserted into another.
Repeat-containing protein effectors of plant-associated organisms

Directory of Open Access Journals (Sweden)

Carl H. Mesarich

2015-10-01

Full Text Available Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.
Regulation of the transient receptor potential channel TRPA1 by its N-terminal ankyrin repeat domain

Czech Academy of Sciences Publication Activity Database

Zayats, Vasilina; Samad, Abdul; Minofar, Babak; Roelofs, K. E.; Stockner, T.; Ettrich, Rüdiger

2012-01-01

Roč. 19, č. 11 (2012), s. 4689-4700 ISSN 1610-2940 R&D Projects: GA ČR GAP207/10/1934 Institutional research plan: CEZ:AV0Z60870520 Keywords : ankyrin repeat * EF-hand * familial episodic pain syndrom * TRPA1 Subject RIV: CE - Biochemistry Impact factor: 1.984, year: 2012
CDNA encoding a polypeptide including a hevein sequence

Science.gov (United States)

Raikhel, Natasha V.; Broekaert, Willem F.; Chua, Nam-Hai; Kush, Anil

1995-03-21

A cDNA clone (HEV1) encoding hevein was isolated via polymerase chain reaction (PCR) using mixed oligonucleotides corresponding to two regions of hevein as primers and a Hevea brasiliensis latex cDNA library as a template. HEV1 is 1018 nucleotides long and includes an open reading frame of 204 amino acids. The deduced amino acid sequence contains a putative signal sequence of 17 amino acid residues followed by a 187 amino acid polypeptide. The amino-terminal region (43 amino acids) is identical to hevein and shows homology to several chitin-binding proteins and to the amino-termini of wound-induced genes in potato and poplar. The carboxyl-terminal portion of the polypeptide (144 amino acids) is 74-79% homologous to the carboxyl-terminal region of wound-inducible genes of potato. Wounding, as well as application of the plant hormones abscisic acid and ethylene, resulted in accumulation of hevein transcripts in leaves, stems and latex, but not in roots, as shown by using the cDNA as a probe. A fusion protein was produced in E. coli from the protein of the present invention and maltose binding protein produced by the E. coli.
Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

Science.gov (United States)

Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

2012-01-01

The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697
Analysis of genetic diversity and population structure of oil palm (Elaeis guineensis) from China and Malaysia based on species-specific simple sequence repeat markers.

Science.gov (United States)

Zhou, L X; Xiao, Y; Xia, W; Yang, Y D

2015-12-08

Genetic diversity and patterns of population structure of the 94 oil palm lines were investigated using species-specific simple sequence repeat (SSR) markers. We designed primers for 63 SSR loci based on their flanking sequences and conducted amplification in 94 oil palm DNA samples. The amplification result showed that a relatively high level of genetic diversity was observed between oil palm individuals according a set of 21 polymorphic microsatellite loci. The observed heterozygosity (Ho) was 0.3683 and 0.4035, with an average of 0.3859. The Ho value was a reliable determinant of the discriminatory power of the SSR primer combinations. The principal component analysis and unweighted pair-group method with arithmetic averaging cluster analysis showed the 94 oil palm lines were grouped into one cluster. These results demonstrated that the oil palm in Hainan Province of China and the germplasm introduced from Malaysia may be from the same source. The SSR protocol was effective and reliable for assessing the genetic diversity of oil palm. Knowledge of the genetic diversity and population structure will be crucial for establishing appropriate management stocks for this species.
Determination of allele frequencies in nine short tandem repeat loci ...

African Journals Online (AJOL)

SERVER

2008-04-17

Apr 17, 2008 ... out the human genome. These loci are a rich source of highly polymorphic markers that may be detected using the polymerase chain reaction (PCR). PCR is a mimic of the normal cellular process of replication of DNA molecules. Each STR is distinguished by the number of times a sequence is repeated, ...
Complete chloroplast genome of Trachelium caeruleum: extensiverearrangements are associated with repeats and tRNAs

Energy Technology Data Exchange (ETDEWEB)

Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

2006-01-09

Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 and ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies
Application of synthetic DNA probes to the analysis of DNA sequence variants in man

International Nuclear Information System (INIS)

Wallace, R.B.; Petz, L.D.; Yam, P.Y.

1986-01-01

Oligonucleotide probes provide a tool to discriminate between any two alleles on the basis of hybridization. Random sampling of the genome with different oligonucleotide probes should reveal polymorphism in a certain percentage of the cases. In the hope of identifying polymorphic regions more efficiently, we chose to take advantage of the proposed hypermutability of repeated DNA sequences and the specificity of oligonucleotide hybridization. Since, under appropriate conditions, oligonucleotide probes require complete base pairing for hybridization to occur, they will only hybridize to a subset of the members of a repeat family when all members of the family are not identical. The results presented here suggest that oligonucleotide hybridization can be used to extend the genomic sequences that can be tested for the presence of RFLPs. This expands the tools available to human genetics. In addition, the results suggest that repeated DNA sequences are indeed more polymorphic than single-copy sequences. 28 references, 2 figures

N-terminal domains of human DNA polymerase lambda promote primer realignment during translesion DNA synthesis

Science.gov (United States)

Taggart, David J.; Dayeh, Daniel M.; Fredrickson, Saul W.; Suo, Zucai

2014-01-01

The X-family DNA polymerases λ (Polλ) and β (Polβ) possess similar 5′-2-deoxyribose-5-phosphatelyase (dRPase) and polymerase domains. Besides these domains, Polλ also possesses a BRCA1 C-terminal (BRCT) domain and a proline-rich domain at its N terminus. However, it is unclear how these non-enzymatic domains contribute to the unique biological functions of Polλ. Here, we used primer extension assays and a newly developed high-throughput short oligonucleotide sequencing assay (HT-SOSA) to compare the efficiency of lesion bypass and fidelity of human Polβ, Polλ and two N-terminal deletion constructs of Polλ during the bypass of either an abasic site or a 8-oxo-7,8-dihydro-2′-deoxyguanosine (8-oxodG) lesion. We demonstrate that the BRCT domain of Polλ enhances the efficiency of abasic site bypass by approximately 1.6-fold. In contrast, deletion of the N-terminal domains of Polλ did not affect the efficiency of 8-oxodG bypass relative to nucleotide incorporations opposite undamaged dG. HT-SOSA analysis demonstrated that Polλ and Polβ preferentially generated −1 or −2 frameshift mutations when bypassing an abasic site and the single or double base deletion frequency was highly sequence dependent. Interestingly, the BRCT and proline-rich domains of Polλ cooperatively promoted the generation of −2 frameshift mutations when the abasic site was situated within a sequence context that was susceptible to homology-driven primer realignment. Furthermore, both N-terminal domains of Polλ increased the generation of −1 frameshift mutations during 8-oxodG bypass and influenced the frequency of substitution mutations produced by Polλ opposite the 8-oxodG lesion. Overall, our data support a model wherein the BRCT and proline-rich domains of Polλ act cooperatively to promote primer/template realignment between DNA strands of limited sequence homology. This function of the N-terminal domains may facilitate the role of Polλ as a gap-filling polymerase
Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis.

Science.gov (United States)

Marzo, Mar; Bello, Xabier; Puig, Marta; Maside, Xulio; Ruiz, Alfredo

2013-02-04

Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome.
[Active miniature inverted-repeat transposable elements transposon in plants: a review].

Science.gov (United States)

Hu, Bingjie; Zhou, Mingbing

2018-02-25

Miniature inverted-repeat transposable elements transposon is a special transposon that could transpose by "cut-paste" mechanism, which is one of characteristics of DNA transposons. Otherwise, the copy number of MITEs is very high, which is one of characteristics of RNA transposons. Many MITE families have been reported, but little about active MITEs. We summarize recent advances in studying active MITEs. Most the MITEs belong to the Tourist-like family, such as mPing, mGing, PhTourist1, Tmi1 and PhTst-3. Additionally, DTstu1 and MITE-39 belong to Stowaway-like family, and AhMITEs1 belongs to Mutator-like family. Moreover, we summarize the structure (terminal inverse repeats and target site duplications), copy number, evolution pattern and transposition characteristics of these active MITEs, to provide the foundation for the identification of other active MITEs and subsequent research on MITE transposition and amplification mechanism.
Human pro. cap alpha. 1(III) collagen: cDNA sequence for the 3' end

Energy Technology Data Exchange (ETDEWEB)

Mankoo, B S; Dalgleish, R

1988-03-25

The authors have previously isolated two overlapping cDNA clones, pIII-21 and pIII-33, which encode the C-terminal end of human type III procollagen. They now present the sequence of 2520 bases encoded in these cDNAs which overlaps other previously published sequences for the same gene. The sequence presented differs from previously published sequences at five positions.
Comparing whole-genome sequencing with Sanger sequencing for spa typing of methicillin-resistant Staphylococcus aureus.

Science.gov (United States)

Bartels, Mette Damkjær; Petersen, Andreas; Worning, Peder; Nielsen, Jesper Boye; Larner-Svensson, Hanna; Johansen, Helle Krogh; Andersen, Leif Percival; Jarløv, Jens Otto; Boye, Kit; Larsen, Anders Rhod; Westh, Henrik

2014-12-01

spa typing of methicillin-resistant Staphylococcus aureus (MRSA) has traditionally been done by PCR amplification and Sanger sequencing of the spa repeat region. At Hvidovre Hospital, Denmark, whole-genome sequencing (WGS) of all MRSA isolates has been performed routinely since January 2013, and an in-house analysis pipeline determines the spa types. Due to national surveillance, all MRSA isolates are sent to Statens Serum Institut, where the spa type is determined by PCR and Sanger sequencing. The purpose of this study was to evaluate the reliability of the spa types obtained by 150-bp paired-end Illumina WGS. MRSA isolates from new MRSA patients in 2013 (n = 699) in the capital region of Denmark were included. We found a 97% agreement between spa types obtained by the two methods. All isolates achieved a spa type by both methods. Nineteen isolates differed in spa types by the two methods, in most cases due to the lack of 24-bp repeats in the whole-genome-sequenced isolates. These related but incorrect spa types should have no consequence in outbreak investigations, since all epidemiologically linked isolates, regardless of spa type, will be included in the single nucleotide polymorphism (SNP) analysis. This will reveal the close relatedness of the spa types. In conclusion, our data show that WGS is a reliable method to determine the spa type of MRSA. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
AcmD, a homolog of the major autolysin AcmA of Lactococcus lactis, binds to the cell wall and contributes to cell separation and autolysis

NARCIS (Netherlands)

Visweswaran, Ganesh Ram R; Steen, Anton; Leenhouts, Kees; Szeliga, Monika; Ruban, Beata; Hesseling-Meinders, Anne; Dijkstra, Bauke W; Kuipers, Oscar P; Kok, Jan; Buist, Girbe

2013-01-01

Lactococcus lactis expresses the homologous glucosaminidases AcmB, AcmC, AcmA and AcmD. The latter two have three C-terminal LysM repeats for peptidoglycan binding. AcmD has much shorter intervening sequences separating the LysM repeats and a lower iso-electric point (4.3) than AcmA (10.3). Under
Outlier Loci and Selection Signatures of Simple Sequence Repeats (SSRs) in Flax (Linum usitatissimum L.).

Science.gov (United States)

Soto-Cerda, Braulio J; Cloutier, Sylvie

2013-01-01

Genomic microsatellites (gSSRs) and expressed sequence tag-derived SSRs (EST-SSRs) have gained wide application for elucidating genetic diversity and population structure in plants. Both marker systems are assumed to be selectively neutral when making demographic inferences, but this assumption is rarely tested. In this study, three neutrality tests were assessed for identifying outlier loci among 150 SSRs (85 gSSRs and 65 EST-SSRs) that likely influence estimates of population structure in three differentiated flax sub-populations ( F ST = 0.19). Moreover, the utility of gSSRs, EST-SSRs, and the combined sets of SSRs was also evaluated in assessing genetic diversity and population structure in flax. Six outlier loci were identified by at least two neutrality tests showing footprints of balancing selection. After removing the outlier loci, the STRUCTURE analysis and the dendrogram topology of EST-SSRs improved. Conversely, gSSRs and combined SSRs results did not change significantly, possibly as a consequence of the higher number of neutral loci assessed. Taken together, the genetic structure analyses established the superiority of gSSRs to determine the genetic relationships among flax accessions, although the combined SSRs produced the best results. Genetic diversity parameters did not differ statistically ( P > 0.05) between gSSRs and EST-SSRs, an observation partially explained by the similar number of repeat motifs. Our study provides new insights into the ability of gSSRs and EST-SSRs to measure genetic diversity and structure in flax and confirms the importance of testing for the occurrence of outlier loci to properly assess natural and breeding populations, particularly in studies considering only few loci.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

Science.gov (United States)

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Topological disposition of the sequences -QRKIVE- and -KETYY in native (Na sup + + K sup + )-ATPase

Energy Technology Data Exchange (ETDEWEB)

Bayer, R. (Univ. of California, San Diego, La Jolla (USA))

1990-03-06

The dispositions with respect to the plane of the membrane of lysine-905 in the internal sequence -EQRKIVE- and of lysine-1012 in the carboxy-terminal sequence -RRPGGWVEKETYY of the {alpha}-polypeptide of sodium and potassium ion activated adenosinetriphosphatase have been determined. These lysines are found in peptides released from the intact {alpha}-polypeptide by the extracellular protease from Staphylococcus aureus strain V8 and by trypsin, respectively. Synthetic peptides containing terminal sequences of these were used to prepare polyclonal antibodies, which were then used to prepare immunoadsorbents directed against the respective peptides. Sealed, right-side-out membrane vesicles containing native (Na{sup +} + K{sup +})-ATPase were labeled with pyridoxal phosphate and sodium ({sup 3}H)borohydride in the absence or presence of saponin. The labeled {alpha}-polypeptide was isolated from these vesicles and digested with appropriate proteases. The incorporation of radioactivity into the peptides binding to the immunoadsorbent directed against the sequence pyrERXIVE increased 3-fold int the presence of saponin as a result of the increased accessibility of this portion of the protein to the reagent when the vesicles were breached by saponin; hence, this sequence is located on the cytoplasmic face of the membrane. It was inferred that the carboxy-terminal sequence -KETYY is on the extracytoplasmic face since the incorporation of radioactivity into peptides binding to the immunoadsorbent directed against the sequence -ETYY did not change when the vesicles were breached with saponin.
Estimating Genetic Conformism of Korean Mulberry Cultivars Using Random Amplified Polymorphic DNA and Inter-Simple Sequence Repeat Profiling

Directory of Open Access Journals (Sweden)

Sunirmal Sheet

2018-03-01

Full Text Available Apart from being fed to silkworms in sericulture, the ecologically important Mulberry plant has been used for traditional medicine in Asian countries as well as in manufacturing wine, food, and beverages. Germplasm analysis among Mulberry cultivars originating from South Korea is crucial in the plant breeding program for cultivar development. Hence, the genetic deviations and relations among 8 Morus alba plants, and one Morus lhou plant, of different cultivars collected from South Korea were investigated using 10 random amplified polymorphic DNA (RAPD and 10 inter-simple sequence repeat (ISSR markers in the present study. The ISSR markers exhibited a higher polymorphism (63.42% among mulberry genotypes in comparison to RAPD markers. Furthermore, the similarity coefficient was estimated for both markers and found to be varying between 0.183 and 0.814 for combined pooled data of ISSR and RAPD. The phenogram drawn using the UPGMA cluster method based on combined pooled data of RAPD and ISSR markers divided the nine mulberry genotypes into two divergent major groups and the two individual independent accessions. The distant relationship between Dae-Saug (SM1 and SangchonJo Sang Saeng (SM5 offers a possibility of utilizing them in mulberry cultivar improvement of Morus species of South Korea.
The structure, organization and radiation of Sadhu non-long terminal repeat retroelements in Arabidopsis species

Directory of Open Access Journals (Sweden)

Rangwala Sanjida H

2010-03-01

Full Text Available Abstract Background Sadhu elements are non-autonomous retroposons first recognized in Arabidopsis thaliana. There is a wide degree of divergence among different elements, suggesting that these sequences are ancient in origin. Here we report the results of several lines of investigation into the genomic organization and evolutionary history of this element family. Results We present a classification scheme for Sadhu elements in A. thaliana, describing derivative elements related to the full-length elements we reported previously. We characterized Sadhu5 elements in a set of A. thaliana strains in order to trace the history of radiation in this subfamily. Sequences surrounding the target sites of different Sadhu insertions are consistent with mobilization by LINE retroelements. Finally, we identified Sadhu elements grouping into distinct subfamilies in two related species, Arabidopsis arenosa and Arabidopsis lyrata. Conclusions Our analyses suggest that the Sadhu retroelement family has undergone target primed reverse transcription-driven retrotransposition during the divergence of different A. thaliana strains. In addition, Sadhu elements can be found at moderate copy number in three distinct Arabidopsis species, indicating that the evolutionary history of these sequences can be traced back at least several millions of years.
Structural analysis of complementary DNA and amino acid sequences of human and rat androgen receptors

International Nuclear Information System (INIS)

Chang, C.; Kokontis, J.; Liao, S.

1988-01-01

Structural analysis of cDNAs for human and rat androgen receptors (ARs) indicates that the amino-terminal regions of ARs are rich in oligo- and poly(amino acid) motifs as in some homeotic genes. The human AR has a long stretch of repeated glycines, whereas rat AR has a long stretch of glutamines. There is a considerable sequence similarity among ARs and the receptors for glucocorticoids, progestins, and mineralocorticoids within the steroid-binding domains. The cysteine-rich DNA-binding domains are well conserved. Translation of mRNA transcribed from AR cDNAs yielded 94- and 76-kDa proteins and smaller forms that bind to DNA and have high affinity toward androgens. These rat or human ARs were recognized by human autoantibodies to natural Ars. Molecular hybridization studies, using AR cDNAs as probes, indicated that the ventral prostate and other male accessory organs are rich in AR mRNA and that the production of AR mRNA in the target organs may be autoregulated by androgens
Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

Science.gov (United States)

Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

2012-12-01

In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
An infinitely expandable cloning strategy plus repeat-proof PCR for working with multiple shRNA.

Directory of Open Access Journals (Sweden)

Glen John McIntyre

Full Text Available Vector construction with restriction enzymes (REs typically involves the ligation of a digested donor fragment (insert to a reciprocally digested recipient fragment (vector backbone. Creating a suitable cloning plan becomes increasingly difficult for complex strategies requiring repeated insertions such as constructing multiple short hairpin RNA (shRNA expression vectors for RNA interference (RNAi studies. The problem lies in the reduced availability of suitable RE recognition sites with an increasing number of cloning events and or vector size. This report details a technically simple, directional cloning solution using REs with compatible cohesive ends that are repeatedly destroyed and simultaneously re-introduced with each round of cloning. Donor fragments can be made by PCR or sub-cloned from pre-existing vectors and inserted ad infinitum in any combination. The design incorporates several cloning cores in order to be compatible with as many donor sequences as possible. We show that joining sub-combinations made in parallel is more time-efficient than sequential construction (of one cassette at a time for any combination of 4 or more insertions. Screening for the successful construction of combinations using Taq polymerase based PCR became increasingly difficult with increasing number of repeated sequence elements. A Pfu polymerase based PCR was developed and successfully used to amplify combinations of up to eleven consecutive hairpin expression cassettes. The identified PCR conditions can be beneficial to others working with multiple shRNA or other repeated sequences, and the infinitely expandable cloning strategy serves as a general solution applicable to many cloning scenarios.
detectIR: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation.

Science.gov (United States)

Ye, Congting; Ji, Guoli; Li, Lei; Liang, Chun

2014-01-01

Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures--hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
Sequencing Lys-N Proteolytic Peptides by ESI and MALDI Tandem Mass Spectrometry

Science.gov (United States)

Dupré, Mathieu; Cantel, Sonia; Verdié, Pascal; Martinez, Jean; Enjalbal, Christine

2011-02-01

In this study, we explored the MS/MS behavior of various synthetic peptides that possess a lysine residue at the N-terminal position. These peptides were designed to mimic peptides produced upon proteolysis by the Lys-N enzyme, a metalloendopeptidase issued from a Japanese fungus Grifola frondosa that was recently investigated in proteomic studies as an alternative to trypsin digestion, as a specific cleavage at the amide X-Lys chain is obtained that provides N-terminal lysine peptide fragments. In contrast to tryptic peptides exhibiting a lysine or arginine residue solely at the C-terminal position, and are thus devoid of such basic amino acids within the sequence, these Lys-N proteolytic peptides can contain the highly basic arginine residue anywhere within the peptide chain. The fragmentation patterns of such sequences with the ESI-QqTOF and MALDI-TOF/TOF mass spectrometers commonly used in proteomic bottom-up experiments were investigated.
Coactivator-associated arginine methyltransferase 1 enhances transcriptional activity of the human T-cell lymphotropic virus type 1 long terminal repeat through direct interaction with Tax.

Science.gov (United States)

Jeong, Soo-Jin; Lu, Hanxin; Cho, Won-Kyung; Park, Hyeon Ung; Pise-Masison, Cynthia; Brady, John N

2006-10-01

In this study, we demonstrate that the coactivator-associated arginine methyltransferase 1 (CARM1), which methylates histone H3 and other proteins such as p300/CBP, is positively involved in the regulation of Tax transactivation. First, transfection studies demonstrated that overexpression of CARM1 wild-type protein resulted in increased Tax transactivation of the human T-cell lymphotropic virus type 1 (HTLV-1) long terminal repeat (LTR). In contrast, transfection of a catalytically inactive CARM1 methyltransferase mutant did not enhance Tax transactivation. CARM1 facilitated Tax transactivation of the CREB-dependent cellular GEM promoter. A direct physical interaction between HTLV-1 Tax and CARM1 was demonstrated using in vitro glutathione S-transferase-Tax binding assays, in vivo coimmunoprecipitation, and confocal microscopy experiments. Finally, chromatin immunoprecipitation analysis of the activated HTLV-1 LTR promoter showed the association of CARM1 and methylated histone H3 with the template DNA. In vitro, Tax facilitates the binding of CARM1 to the transcription complex. Together, our data provide evidence that CARM1 enhances Tax transactivation of the HTLV-1 LTR through a direct interaction between CARM1 and Tax and this binding promotes methylation of histone H3 (R2, R17, and R26).
Structural analysis of a repetitive protein sequence motif in strepsirrhine primate amelogenin.

Directory of Open Access Journals (Sweden)

Rodrigo S Lacruz

2011-03-01

Full Text Available Strepsirrhines are members of a primate suborder that has a distinctive set of features associated with the development of the dentition. Amelogenin (AMEL, the better known of the enamel matrix proteins, forms 90% of the secreted organic matrix during amelogenesis. Although AMEL has been sequenced in numerous mammalian lineages, the only reported strepsirrhine AMEL sequences are those of the ring-tailed lemur and galago, which contain a set of additional proline-rich tandem repeats absent in all other primates species analyzed to date, but present in some non-primate mammals. Here, we first determined that these repeats are present in AMEL from three additional lemur species and thus are likely to be widespread throughout this group. To evaluate the functional relevance of these repeats in strepsirrhines, we engineered a mutated murine amelogenin sequence containing a similar proline-rich sequence to that of Lemur catta. In the monomeric form, the MQP insertions had no influence on the secondary structure or refolding properties, whereas in the assembled form, the insertions increased the hydrodynamic radii. We speculate that increased AMEL nanosphere size may influence enamel formation in strepsirrhine primates.
The genomic sequence of cowpea aphid-borne mosaic virus and its similarities with other potyviruses

NARCIS (Netherlands)

Mlotshwa, S.; Verver, J.; Sithole-Niang, I.; Kampen, van T.; Kammen, van A.; Wellink, J.

2002-01-01

The genomic sequence of a Zimbabwe isolate of Cowpea aphid-borne mosaic virus (CABMV-Z) was determined by sequencing overlapping viral cDNA clones generated by RT-PCR using degenerate and/or specific primers. The sequence is 9465 nucleotides in length excluding the 3' terminal poly (A) tail and
Simulation of stevedoring work in the Klaipėda oil terminal

Directory of Open Access Journals (Sweden)

D. Makackas

2002-10-01

Full Text Available The paper presents the approach of developing simulation models of stevedoring work in oil terminal using aggregate approach and simulation system ARENA. Aggregate approach permits to develop formal specification of simulated systems. Aggregate specification of the system is presented as a set of interacting piece-linear aggregates. Controlling sequences are used for formal description of each piece-linear aggregate. Transformation of aggregate formal specification to ARENA system operators permits to develop a simulation model. The developed model permits to evaluate characteristics of the Klaipėda oil terminal which are used both in queuing theory and analysing timing properties of real-time systems.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.