protein coding region: Topics by WorldWideScience.org

Sample records for protein coding region

Discrete Ramanujan transform for distinguishing the protein coding regions from other regions.

Science.gov (United States)

Hua, Wei; Wang, Jiasong; Zhao, Jian

2014-01-01

Based on the study of the Ramanujan sum and Ramanujan coefficient, this paper introduces the concepts of the discrete Ramanujan transform and spectrum. Using the Voss numerical representation, one maps a symbolic DNA strand to a numerical DNA sequence and deduces the discrete Ramanujan spectrum of that numerical sequence. It is well known that the discrete Fourier power spectrum of a protein coding sequence exhibits 3-base periodicity, a feature widely used for DNA sequence analysis with the discrete Fourier transform. That analysis is performed by testing the signal-to-noise ratio at frequency N/3, where N is the length of the sequence. The results presented in this paper show that the property of 3-base periodicity appears as a prominent spike of the discrete Ramanujan spectrum at period 3 only for the protein coding regions. The signal-to-noise ratio for the discrete Ramanujan spectrum is defined for numerical measurement. Therefore, the discrete Ramanujan spectrum and the signal-to-noise ratio of a DNA sequence can be used for distinguishing the protein coding regions from the noncoding regions. All the exon and intron sequences in whole chromosomes 1, 2, 3 and 4 of Caenorhabditis elegans have been tested, and the histograms and tables from the computational results illustrate the reliability of our method. In addition, a theoretical analysis leads to the conclusion that the algorithm for calculating the discrete Ramanujan spectrum has lower computational complexity and higher computational accuracy. The computational experiments show that classifying different DNA sequences with the discrete Ramanujan spectrum is a fast and effective method. Copyright © 2014 Elsevier Ltd. All rights reserved.
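The Fourier baseline mentioned in this abstract (Voss indicator sequences, power at frequency N/3, a signal-to-noise ratio) can be illustrated with a minimal sketch. It is not the paper's Ramanujan method, and the signal-to-noise definition used here (power at N/3 divided by the mean power over the non-zero frequencies) is one common choice assumed for illustration; the function names are hypothetical.

```python
import cmath


def voss_indicators(seq):
    """Voss representation: one binary indicator sequence per nucleotide."""
    seq = seq.upper()
    return {base: [1 if ch == base else 0 for ch in seq] for base in "ACGT"}


def power_at(indicators, k):
    """DFT power at frequency index k, summed over the four indicator sequences."""
    total = 0.0
    for x in indicators.values():
        n = len(x)
        coeff = sum(v * cmath.exp(-2j * cmath.pi * k * t / n)
                    for t, v in enumerate(x))
        total += abs(coeff) ** 2
    return total


def snr_period3(seq):
    """Power at k = N/3 divided by the mean power over k = 1..N-1 (assumed definition)."""
    n = len(seq)
    if n < 6 or n % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3 and at least 6")
    indicators = voss_indicators(seq)
    spectrum = [power_at(indicators, k) for k in range(1, n)]
    mean_power = sum(spectrum) / len(spectrum)
    if mean_power < 1e-12:  # e.g. a homopolymer: no power outside k = 0
        return 0.0
    return spectrum[n // 3 - 1] / mean_power


if __name__ == "__main__":
    print(snr_period3("ATG" * 20))  # strongly period-3 toy sequence
    print(snr_period3("AT" * 30))   # period-2 toy sequence, no peak at N/3
```

A sequence would be called coding when this ratio exceeds a threshold chosen on labelled data; the threshold itself is not given in the abstract.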
An evolutionary model for protein-coding regions with conserved RNA structure

DEFF Research Database (Denmark)

Pedersen, Jakob Skou; Forsberg, Roald; Meyer, Irmtraud Margret

2004-01-01

Here we present a model of nucleotide substitution in protein-coding regions that also encode the formation of conserved RNA structures. In such regions, apparent evolutionary context dependencies exist, both between nucleotides occupying the same codon and between nucleotides forming a base pair in the RNA structure. The overlap of these fundamental dependencies is sufficient to cause "contagious" context dependencies which cascade across many nucleotide sites. Such large-scale dependencies challenge the use of traditional phylogenetic models in evolutionary inference because they explicitly assume... components of traditional phylogenetic models. We applied this to a data set of full-genome sequences from the hepatitis C virus where five RNA structures are mapped within the coding region. This allowed us to partition the effects of selection on different structural elements and to test various hypotheses...
Detecting non-coding selective pressure in coding regions

Directory of Open Access Journals (Sweden)

Blanchette Mathieu

2007-02-01

Background: Comparative genomics approaches, where orthologous DNA regions are compared and inter-species conserved regions are identified, have proven extremely powerful for identifying non-coding regulatory regions located in intergenic or intronic regions. However, non-coding functional elements can also be located within coding regions, as is common for exonic splicing enhancers, some transcription factor binding sites, and RNA secondary structure elements affecting mRNA stability, localization, or translation. Since these functional elements are located in regions that are themselves highly conserved because they code for a protein, they have generally escaped detection by comparative genomics approaches. Results: We introduce a comparative genomics approach for detecting non-coding functional elements located within coding regions. Codon evolution is modeled as a mixture of codon substitution models, where each component of the mixture describes the evolution of codons under a specific type of coding selective pressure. We show how to compute the posterior distribution of the entropy and parsimony scores under this null model of codon evolution. The method is applied to a set of growth hormone 1 orthologous mRNA sequences, and a known exonic splicing element is detected. The analysis of a set of CORTBP2 orthologous genes reveals a region of several hundred base pairs under strong non-coding selective pressure whose function remains unknown. Conclusion: Non-coding functional elements, in particular those involved in post-transcriptional regulation, are likely to be much more prevalent than is currently known. With the numerous genome sequencing projects underway, comparative genomics approaches like that proposed here are likely to become increasingly powerful at detecting such elements.
Influence of the Leader protein coding region of foot-and-mouth disease virus on virus replication

DEFF Research Database (Denmark)

Belsham, Graham

2013-01-01

The foot-and-mouth disease virus (FMDV) Leader (L) protein is produced in two forms, Lab and Lb, differing only at their amino-termini, due to the use of separate initiation codons, usually 84 nt apart. It has been shown previously, and confirmed here, that precise deletion of the Lab coding......, in the context of the virus lacking the Lb coding region, was also tolerated by the virus within BHK cells. However, precise loss of the Lb coding sequence alone blocked FMDV replication in primary bovine thyroid cells. Thus, the requirement for the Leader protein coding sequences is highly dependent...... on the nature and extent of the residual Leader protein sequences and on the host cell system used. FMDVs precisely lacking Lb and with the Lab initiation codon modified may represent safer seed viruses for vaccine production....
Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Science.gov (United States)

Pujar, Shashikant; O'Leary, Nuala A; Farrell, Catherine M; Loveland, Jane E; Mudge, Jonathan M; Wallin, Craig; Girón, Carlos G; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; Martin, Fergal J; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Suner, Marie-Marthe; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bruford, Elspeth A; Bult, Carol J; Frankish, Adam; Murphy, Terence; Pruitt, Kim D

2018-01-04

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.
Cloning and expression of the coding regions of the heat shock proteins HSP10 and HSP16 from Piscirickettsia salmonis

Directory of Open Access Journals (Sweden)

VIVIAN WILHELM

2003-01-01

The genes encoding the heat shock proteins HSP10 and HSP16 of the salmon pathogen Piscirickettsia salmonis have been isolated and sequenced. The HSP10 coding sequence is located in an open reading frame of 291 base pairs encoding 96 amino acids. The HSP16 coding region was isolated as a 471 base pair fragment encoding a protein of 156 amino acids. The deduced amino acid sequences of both proteins show significant homology to the respective proteins from other prokaryotic organisms. Both proteins were expressed in E. coli as fusion proteins with thioredoxin and purified by chromatography on a Ni-column. A rabbit serum against P. salmonis total proteins reacts with the recombinant HSP10 and HSP16 proteins. Similar reactivity was determined by ELISA using serum from salmon infected with P. salmonis. The possibility of formulating a vaccine containing these two proteins is discussed.
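The lengths quoted in this record are consistent if each reported coding region includes its stop codon: 291 / 3 = 97 codons, one of which is the stop, leaves 96 residues, and 471 / 3 = 157 codons leaves 156. The small helper below checks that arithmetic; the function name and the assumption that the stop codon is counted are illustrative, not stated in the abstract.

```python
def encoded_residues(orf_length_bp, includes_stop=True):
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_length_bp <= 0 or orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_length_bp // 3
    # The stop codon terminates translation and adds no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    print(encoded_residues(291))  # HSP10 open reading frame
    print(encoded_residues(471))  # HSP16 coding fragment
```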
The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

Science.gov (United States)

Sarute, Nicolás; Calderón, Marina Gallo; Pérez, Ruben; La Torre, José; Hernández, Martín; Francia, Lourdes; Panzera, Yanina

2013-01-01

Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.
Non-Protein Coding RNAs

CERN Document Server

Walter, Nils G; Batey, Robert T

2009-01-01

This book assembles chapters from experts in the Biophysics of RNA to provide a broadly accessible snapshot of the current status of this rapidly expanding field. The 2006 Nobel Prize in Physiology or Medicine was awarded to the discoverers of RNA interference, highlighting just one example of a large number of non-protein coding RNAs. Because non-protein coding RNAs outnumber protein coding genes in mammals and other higher eukaryotes, it is now thought that the complexity of organisms is correlated with the fraction of their genome that encodes non-protein coding RNAs. Essential biological processes as diverse as cell differentiation, suppression of infecting viruses and parasitic transposons, higher-level organization of eukaryotic chromosomes, and gene expression itself are found to largely be directed by non-protein coding RNAs. The biophysical study of these RNAs employs X-ray crystallography, NMR, ensemble and single molecule fluorescence spectroscopy, optical tweezers, cryo-electron microscopy, and ot...
IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction

Directory of Open Access Journals (Sweden)

Kiran Sree Pokkuluri

2014-01-01

Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000). The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods have been introduced, and existing methods are being refined, for predicting each of these regions separately; still, there is scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata) and MCC (modified clonal classifier) to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992) datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from the DBTSS (Yamashita et al., 2006) dataset and nonpromoters from the EID (Saxonov et al., 2000) and UTRdb (Pesole et al., 2002) datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively.
Annotating non-coding regions of the genome.

Science.gov (United States)

Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

2010-08-01

Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.
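The sequence of steps described in this abstract (a per-base signal, smoothing, segmentation into small blocks, grouping into larger annotations) can be sketched as follows. The moving-average smoother, the fixed threshold, and the gap-based merging are simple stand-ins assumed for illustration; the review does not prescribe these particular methods, and all function names are hypothetical.

```python
def smooth(signal, window=5):
    """Centered moving average; the window shrinks at the sequence ends."""
    half = window // 2
    out = []
    for i in range(len(signal)):
        lo, hi = max(0, i - half), min(len(signal), i + half + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out


def segment(signal, threshold):
    """Return half-open (start, end) blocks where the signal is >= threshold."""
    blocks, start = [], None
    for i, value in enumerate(signal):
        if value >= threshold and start is None:
            start = i
        elif value < threshold and start is not None:
            blocks.append((start, i))
            start = None
    if start is not None:
        blocks.append((start, len(signal)))
    return blocks


def merge_blocks(blocks, max_gap):
    """Join blocks separated by at most max_gap positions into larger annotations."""
    merged = []
    for start, end in blocks:
        if merged and start - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], end)
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    raw = [0, 0, 4, 5, 4, 0, 0, 5, 6, 0, 0, 0]
    blocks = segment(smooth(raw, window=3), threshold=2.0)
    print(merge_blocks(blocks, max_gap=2))
```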
Hidden Structural Codes in Protein Intrinsic Disorder.

Science.gov (United States)

Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo

2017-10-17

Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovered the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could not have been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.
Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes

DEFF Research Database (Denmark)

Lin, Michael F; Kheradpour, Pouya; Washietl, Stefan

2011-01-01

...conservation compared to typical protein-coding genes, especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ~2% of their synonymous sites. We collect numerous lines of evidence that the observed synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian...
High abundance of Serine/Threonine-rich regions predicted to be hyper-O-glycosylated in the secretory proteins coded by eight fungal genomes

Directory of Open Access Journals (Sweden)

González Mario

2012-09-01

Background: O-glycosylation of secretory proteins has been found to be an important factor in fungal biology and virulence. It consists of the addition of short glycosidic chains to Ser or Thr residues in the protein backbone via O-glycosidic bonds. Secretory proteins in fungi frequently display Ser/Thr-rich regions that could be sites of extensive O-glycosylation. We have analyzed in silico the complete sets of putatively secretory proteins coded by eight fungal genomes (Botrytis cinerea, Magnaporthe grisea, Sclerotinia sclerotiorum, Ustilago maydis, Aspergillus nidulans, Neurospora crassa, Trichoderma reesei, and Saccharomyces cerevisiae) in search of Ser/Thr-rich regions as well as regions predicted to be highly O-glycosylated by NetOGlyc (http://www.cbs.dtu.dk). Results: By comparison with experimental data, NetOGlyc was found to overestimate the number of O-glycosylation sites in fungi by a factor of 1.5, but to be quite reliable in the prediction of highly O-glycosylated regions. About half of secretory proteins have at least one Ser/Thr-rich region, with a Ser/Thr content of at least 40% over an average length of 40 amino acids. Most secretory proteins in filamentous fungi were predicted to be O-glycosylated, sometimes at dozens or even hundreds of sites. Residues predicted to be O-glycosylated tend to be grouped together, forming hyper-O-glycosylated regions of varying length. Conclusions: About one fourth of secretory fungal proteins were predicted to have at least one hyper-O-glycosylated region, which consists of 45 amino acids on average and displays at least one O-glycosylated Ser or Thr every four residues. These putative highly O-glycosylated regions can be found anywhere along the proteins but have a slight tendency to be at either one of the two ends.
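A Ser/Thr-rich region of the kind described here can be located with a sliding window over the protein sequence. The sketch below assumes a fixed 40-residue window and a 40% Ser+Thr cutoff, borrowed from the figures quoted in the abstract; the study's actual region-calling procedure is not given in this record, so both parameters and the function name are illustrative.

```python
def ser_thr_rich_windows(protein, window=40, min_percent=40):
    """Return start indices of windows whose Ser+Thr content is >= min_percent."""
    protein = protein.upper()
    if len(protein) < window:
        return []
    # Count S/T in the first window, then update the count incrementally.
    count = sum(1 for aa in protein[:window] if aa in "ST")
    hits = []
    for start in range(len(protein) - window + 1):
        if start > 0:
            count -= int(protein[start - 1] in "ST")
            count += int(protein[start + window - 1] in "ST")
        # Integer comparison avoids floating-point rounding at the cutoff.
        if count * 100 >= min_percent * window:
            hits.append(start)
    return hits


if __name__ == "__main__":
    toy = "A" * 40 + "ST" * 20 + "A" * 40  # made-up sequence, not real data
    hits = ser_thr_rich_windows(toy)
    print(hits[0], hits[-1])
```

Overlapping hits could then be merged into a single region; that step is omitted to keep the sketch short.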
Annotation of the protein coding regions of the equine genome

DEFF Research Database (Denmark)

Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

2015-01-01

Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...
Tuning protein expression using synonymous codon libraries targeted to the 5' mRNA coding region

DEFF Research Database (Denmark)

Goltermann, Lise; Borch Jensen, Martin; Bentin, Thomas

2011-01-01

intermediate expression levels of green fluorescent protein in Escherichia coli. At least in one case, no apparent effect on protein stability was observed, pointing to RNA level effects as the principal reason for the observed expression differences. Targeting a synonymous codon library to the 5' coding...
A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

Science.gov (United States)

Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

2012-01-01

Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.
Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

Directory of Open Access Journals (Sweden)

Chen Xie

2012-09-01

Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis), which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.
Fast rate of evolution in alternatively spliced coding regions of mammalian genes

Directory of Open Access Journals (Sweden)

Nurtdinov Ramil N

2006-04-01

Background: At least half of mammalian genes are alternatively spliced. Alternative isoforms are often genome-specific and it has been suggested that alternative splicing is one of the major mechanisms for generating protein diversity in the course of evolution. Another way of looking at alternative splicing is to consider sequence evolution of constitutive and alternative regions of protein-coding genes. Indeed, it turns out that constitutive and alternative regions evolve in different ways. Results: A set of 3029 orthologous pairs of human and mouse alternatively spliced genes was considered. The rate of nonsynonymous substitutions (dN), the rate of synonymous substitutions (dS), and their ratio (ω = dN/dS) appear to be significantly higher in alternatively spliced coding regions compared to constitutive regions. When N-terminal, internal and C-terminal alternatives are analysed separately, C-terminal alternatives appear to make the main contribution to the observed difference. The effects become even more pronounced in a subset of fast evolving genes. Conclusion: These results provide evidence of weaker purifying selection and/or stronger positive selection in alternative regions and thus one more confirmation of accelerated evolution in alternative regions. This study corroborates the theory that alternative splicing serves as a testing ground for molecular evolution.
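The distinction between synonymous and nonsynonymous change that underlies dN, dS and ω can be illustrated by classifying codon differences between two aligned coding sequences. The sketch below uses the standard genetic code and returns raw counts only; it does not normalize by the number of synonymous and nonsynonymous sites or correct for multiple substitutions, so it is not an estimator of dN/dS and is not the procedure used in the study. The function name is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_differences(cds_a, cds_b):
    """Count codon positions that differ synonymously and nonsynonymously."""
    if len(cds_a) != len(cds_b) or len(cds_a) % 3 != 0:
        raise ValueError("sequences must be aligned, gap-free, and in frame")
    synonymous = nonsynonymous = 0
    for i in range(0, len(cds_a), 3):
        codon_a = cds_a[i:i + 3].upper()
        codon_b = cds_b[i:i + 3].upper()
        if codon_a == codon_b:
            continue
        if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
            synonymous += 1      # same amino acid, different codon
        else:
            nonsynonymous += 1   # amino acid changed
    return synonymous, nonsynonymous


if __name__ == "__main__":
    # Toy sequences: CTT -> CTC keeps Leu; AAA -> AGA changes Lys to Arg.
    print(classify_codon_differences("ATGCTTAAA", "ATGCTCAGA"))
```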

Orion: Detecting regions of the human non-coding genome that are intolerant to variation using population genetics.

Science.gov (United States)

Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B

2017-01-01

There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.
Death of a dogma: eukaryotic mRNAs can code for more than one protein.

Science.gov (United States)

Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier

2016-01-08

mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
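The point that one mRNA can contain more than one translatable ORF is easy to see by scanning all three forward reading frames. The sketch below reports every AUG-to-stop ORF, taking the first AUG after the previous stop in each frame; it is a toy scan under those assumptions, not the annotation method used by any study cited in the review, and the function name is hypothetical.

```python
STOP_CODONS = {"UAA", "UAG", "UGA"}


def find_orfs(mrna, min_codons=2):
    """Return half-open (start, end) coordinates of AUG...stop ORFs in three frames."""
    mrna = mrna.upper().replace("T", "U")
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(mrna) - 2, 3):
            codon = mrna[i:i + 3]
            if start is None and codon == "AUG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                # Length in codons includes the stop codon.
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)


if __name__ == "__main__":
    # Toy transcript with two overlapping ORFs in different frames.
    print(find_orfs("AUGCAUGGCUAAGUGA"))
```

In the toy transcript, the ORF in frame 0 would be the conventionally annotated one, while the frame 1 ORF overlaps it and would be missed under the one-CDS-per-mRNA assumption.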
Ribosome Profiling Reveals Pervasive Translation Outside of Annotated Protein-Coding Genes

Directory of Open Access Journals (Sweden)

Nicholas T. Ingolia

2014-09-01

Ribosome profiling suggests that ribosomes occupy many regions of the transcriptome thought to be noncoding, including 5′ UTRs and long noncoding RNAs (lncRNAs). Apparent ribosome footprints outside of protein-coding regions raise the possibility of artifacts unrelated to translation, particularly when they occupy multiple, overlapping open reading frames (ORFs). Here, we show hallmarks of translation in these footprints: copurification with the large ribosomal subunit, response to drugs targeting elongation, trinucleotide periodicity, and initiation at early AUGs. We develop a metric for distinguishing between 80S footprints and nonribosomal sources using footprint size distributions, which validates the vast majority of footprints outside of coding regions. We present evidence for polypeptide production beyond annotated genes, including the induction of immune responses following human cytomegalovirus (HCMV) infection. Translation is pervasive on cytosolic transcripts outside of conserved reading frames, and direct detection of this expanded universe of translated products enables efforts at understanding how cells manage and exploit its consequences.
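Trinucleotide periodicity, one of the hallmarks listed above, can be checked by asking how footprint positions distribute over the three reading frames of a candidate ORF. The sketch below is a minimal illustration and not the metric developed in the paper: it assumes that each footprint has already been reduced to a single P-site coordinate (the offset assignment is not shown), and `frame_fractions` is a hypothetical helper name.

```python
from collections import Counter

def frame_fractions(p_site_positions, orf_start):
    """Fraction of footprints falling in frames 0, 1 and 2 of an ORF.

    p_site_positions: transcript coordinates of inferred P-sites
    (assumed to be precomputed); orf_start: coordinate of the A of the
    start codon. Strong enrichment of one frame suggests translation.
    """
    counts = Counter((p - orf_start) % 3 for p in p_site_positions)
    total = sum(counts.values())
    if total == 0:
        return [0.0, 0.0, 0.0]
    return [counts[frame] / total for frame in range(3)]

if __name__ == "__main__":
    # Three in-frame footprints and one off-frame footprint.
    print(frame_fractions([100, 103, 106, 107], orf_start=100))  # [0.75, 0.25, 0.0]
```

Footprints from non-ribosomal sources would be expected to spread roughly evenly over the three frames, which is why a skewed distribution is informative.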
Amino acid code of protein secondary structure.

Science.gov (United States)

Shestopalov, B V

2003-01-01

The calculation of protein three-dimensional structure from the amino acid sequence is a fundamental problem to be solved. This paper presents principles of the code theory of protein secondary structure, and their consequence: the amino acid code of protein secondary structure. The doublet code model of protein secondary structure, developed earlier by the author (Shestopalov, 1990), is part of this theory. The theory rests on the following principles: 1) the name secondary structure is assigned to the conformation stabilized only by the nearest (intraresidual) and middle-range (at a distance no more than that between residues i and i + 5) interactions; 2) the secondary structure consists of regular (alpha-helical and beta-structural) and irregular (coil) segments; 3) the alpha-helices, beta-strands and coil segments are encoded, respectively, by residue pairs (i, i + 4), (i, i + 2), (i, i + 1), according to the numbers of residues per period, 3.6, 2, 1; 4) all such pairs in the amino acid sequence are codons for elementary structural elements, or structurons; 5) the codons are divided into 21 types depending on their strength, i.e. their encoding capability; 6) overlaps of structurons of one and the same structure generate the longer segments of this structure; 7) overlapping of structurons of different structures is forbidden, and therefore selection of codons is required; the codon selection is hierarchic; 8) the code theory of protein secondary structure generates six variants of the amino acid code of protein secondary structure. There are two possible kinds of model construction based on the theory: the physical one using physical properties of amino acid residues, and the statistical one using results of statistical analysis of a great body of structural data. Some evident consequences of the theory are: a) the theory can be used for calculating the secondary structure from the amino acid sequence as a partial solution of the problem of calculation of protein three
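Principle 3 can be illustrated by enumerating the residue pairs that would act as candidate codons for each structure type. This is only a sketch of the pairing step: it reads the coil pair as (i, i + 1), which is what the stated periods of 3.6, 2 and 1 residues imply, and it does not attempt the 21 strength classes or the hierarchic codon selection, which the abstract does not specify. The names `SPACING` and `candidate_structurons` are invented for the example.

```python
# Pair spacing per structure type, following principle 3 of the abstract.
SPACING = {"helix": 4, "strand": 2, "coil": 1}

def candidate_structurons(sequence):
    """Map each structure type to its (i, j, residue_pair) candidates.

    Indices are 0-based positions in the amino acid sequence; every pair
    (i, i + k) with k taken from SPACING is listed, without any ranking.
    """
    return {
        name: [(i, i + k, sequence[i] + sequence[i + k])
               for i in range(len(sequence) - k)]
        for name, k in SPACING.items()
    }

if __name__ == "__main__":
    pairs = candidate_structurons("ACDEFG")
    print(pairs["helix"])  # [(0, 4, 'AF'), (1, 5, 'CG')]
```

A full model in the sense of the paper would then score these pairs (physically or statistically) and resolve conflicts between overlapping candidates of different structure types.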
Nucleotide sequence of the Escherichia coli pyrE gene and of the DNA in front of the protein-coding region

DEFF Research Database (Denmark)

Poulsen, Peter; Jensen, Kaj Frank; Valentin-Hansen, Poul

1983-01-01

... leader segment in front of the protein-coding region. This leader contains a structure with features characteristic of a (translated?) rho-independent transcriptional terminator, which is preceded by a cluster of uridylate residues. This indicates that the frequency of pyrE transcription is regulated ... Orotate phosphoribosyltransferase (EC 2.4.2.10) was purified to electrophoretic homogeneity from a strain of Escherichia coli containing the pyrE gene cloned on a multicopy plasmid. The relative molecular masses (Mr) of the native enzyme and its subunit were estimated by means of gel filtration...
High genetic diversity in the coat protein and 3' untranslated regions

Indian Academy of Sciences (India)

The 3′ terminal region consisting of the coat protein (CP) coding sequence and 3′ untranslated region (3′UTR) was cloned and sequenced from seven isolates. Sequence comparisons revealed considerable genetic diversity among the isolates in their CP and 3′UTR, making CdMV one of the highly variable members ...
Discovery of Proteomic Code with mRNA Assisted Protein Folding

Directory of Open Access Journals (Sweden)

Jan C. Biro

2008-12-01

The 3x redundancy of the Genetic Code is usually explained as a necessity to increase the mutation-resistance of the genetic information. However, recent bioinformatic observations indicate that the redundant Genetic Code contains more biological information than previously known, additional to the 64/20 definition of amino acids. It might define the physico-chemical and structural properties of amino acids, the codon boundaries, the amino acid co-locations (interactions) in the coded proteins and the free folding energy of mRNAs. This additional information, which seems to be necessary to determine the 3D structure of coding nucleic acids as well as the coded proteins, is known as the Proteomic Code and mRNA Assisted Protein Folding.
Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium

Energy Technology Data Exchange (ETDEWEB)

Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.; Porwollik, Steffen; Jones, Marcus B.; Yoon, Hyunjin; Payne, Samuel H.; Martin, Jessica L.; Burnet, Meagan C.; Monroe, Matthew E.; Venepally, Pratap; Smith, Richard D.; Peterson, Scott; Heffron, Fred; Mcclelland, Michael; Adkins, Joshua N.

2011-08-25

Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example, systems biology-oriented genome-scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop properly functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify coding regions of a genomic sequence and, importantly, detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provide protein-level experimental confirmation for 44% of predicted protein-coding genes, suggest revisions to 48 genes assigned incorrect translational start sites, and uncover 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regard to the annotation of mature protein products.
Bioinformatic Analysis of Deleterious Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs) in the Coding Regions of Human Prion Protein Gene (PRNP)

Directory of Open Access Journals (Sweden)

Kourosh Bamdad

2016-12-01

Background & Objective: Single nucleotide polymorphisms are a cause of genetic variation in living organisms. Non-synonymous single nucleotide polymorphisms alter residues in the protein sequence. In this investigation, the relationship between prion protein gene polymorphisms and pathogenicity was studied. Material & Method: The amino acid sequence of the main isoform from the human prion protein gene (PRNP) was extracted from the UniProt database and evaluated by the FoldAmyloid and AmylPred servers. All non-synonymous single nucleotide polymorphisms (nsSNPs) from the SNP database (dbSNP) were further analyzed by bioinformatics servers including SIFT, PolyPhen-2, I-Mutant-3.0, PANTHER, SNPs & GO, PHD-SNP, Meta-SNP, and MutPred to determine the most damaging nsSNPs. Results: The results of the first structure analyses by the FoldAmyloid and AmylPred servers implied that regions including 5-15, 174-178, 180-184, 211-217, and 240-252 were the parts of the protein sequence most sensitive to amyloidosis. Screening all nsSNPs of the main protein isoform using bioinformatic servers revealed that substitution of aspartic acid with valine at position 178 (ID code: rs11538766) was the most deleterious nsSNP in the protein structure. Conclusion: Substitution of aspartic acid with valine at position 178 (D178V) was the most pathogenic mutation in the human prion protein gene. Analyses from the MutPred server also showed that an increase in beta-sheet content in the secondary structure was the main molecular mechanism behind prion protein aggregation.
RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

Science.gov (United States)

Grassi, Luigi; Leoni, Guido; Tramontano, Anna

2015-07-14

When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that do or do not produce a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.
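The synonymous/nonsynonymous distinction used above can be sketched in a few lines. The example assumes A-to-I editing, in which the edited adenosine is read as guanosine during translation, and uses the standard genetic code; `classify_edit` and the toy coding sequence are illustrative names and data, not part of the study's pipeline.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def classify_edit(cds, pos, new_base="G"):
    """Classify an edit at 0-based CDS position pos.

    Returns (label, amino_acid_before, amino_acid_after), where label is
    'synonymous' or 'nonsynonymous'. The default models A-to-I editing,
    so the edited base is expected to be an A.
    """
    cds = cds.upper().replace("U", "T")
    if cds[pos] != "A":
        raise ValueError("A-to-I editing requires an adenosine at pos")
    start = pos - pos % 3                      # first base of the codon
    codon = cds[start:start + 3]
    offset = pos % 3
    edited = codon[:offset] + new_base + codon[offset + 1:]
    before, after = CODON_TABLE[codon], CODON_TABLE[edited]
    label = "synonymous" if before == after else "nonsynonymous"
    return label, before, after

if __name__ == "__main__":
    cds = "ATGAAAGCA"                          # Met-Lys-Ala
    print(classify_edit(cds, 5))               # ('synonymous', 'K', 'K')
    print(classify_edit(cds, 3))               # ('nonsynonymous', 'K', 'E')
```

Comparing the observed proportion of each class with the proportion expected when edits are placed at random adenosines is one simple form a "random model" could take.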
Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

KAUST Repository

Alam, Tanvir

2014-10-02

Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.
A photon dominated region code comparison study

NARCIS (Netherlands)

Roellig, M.; Abel, N. P.; Bell, T.; Bensch, F.; Black, J.; Ferland, G. J.; Jonkheid, B.; Kamp, I.; Kaufman, M. J.; Le Bourlot, J.; Le Petit, F.; Meijerink, R.; Morata, O.; Ossenkopf, Volker; Roueff, E.; Shaw, G.; Spaans, M.; Sternberg, A.; Stutzki, J.; Thi, W.-F.; van Dishoeck, E. F.; van Hoof, P. A. M.; Viti, S.; Wolfire, M. G.

Aims. We present a comparison between independent computer codes, modeling the physics and chemistry of interstellar photon dominated regions (PDRs). Our goal was to understand the mutual differences in the PDR codes and their effects on the physical and chemical structure of the model clouds, and
A study on climatic adaptation of dipteran mitochondrial protein coding genes

Directory of Open Access Journals (Sweden)

Debajyoti Kabiraj

2017-10-01

Diptera, the true flies, are frequently found in nature and their habitat extends all over the world, including Antarctica and polar regions. The number of documented species in the order Diptera is quite high and is thought to be 14% of all animals present on Earth [1]. Most studies of Diptera have focused on taxa of economic and medical importance, such as the fruit flies Ceratitis capitata and Bactrocera spp. (Tephritidae), which are serious agricultural pests; the blowflies (Calliphoridae) and oestrid flies (Oestridae), which can cause myiasis; the anopheles mosquitoes (Culicidae), which are the vectors of malaria; and leaf-miners (Agromyzidae), which are vegetable and horticultural pests [2]. The insect mitochondrial genome, a remnant of an alpha-proteobacterial genome, consists of 13 protein-coding genes, 22 tRNAs and 2 rRNAs. The mitochondrion is responsible for both energy production and thermoregulation of the cell through the bi-genomic system, so adaptation to different climatic conditions might have been compensated by complementary changes in both genomes [3,4]. In this study we have collected complete mitochondrial genomes and occurrence data for one hundred thirteen such dipteran insects from different databases and a literature survey. Our understanding of the genetic basis of climatic adaptation in Diptera is limited to basic information on the occurrence locations of those species and the mitochondrial genetic factors underlying changes in conspicuous phenotypes. To examine this hypothesis, we performed nucleotide substitution analysis of the 13 protein-coding genes of mitochondrial DNA, individually and combined, using different software for monophyletic as well as paraphyletic groups of dipteran species. Moreover, we have also calculated the codon adaptation index for all dipteran mitochondrial protein-coding genes. Following this work, we have classified our sample organisms according to their location data from GBIF (https
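The codon adaptation index (CAI) mentioned in this abstract is commonly defined as the geometric mean of the relative adaptiveness w of each codon in a gene, where w is the codon's frequency in a reference gene set divided by the frequency of the most used synonymous codon. The sketch below follows that general definition under stated simplifications: it uses the standard genetic code for brevity (insect mitochondrial genes use a different translation table, so a real analysis would swap in the appropriate table), it skips codons that never occur in the reference set instead of applying a pseudo-count, and the function names and toy sequences are assumptions, not the software used in the study.

```python
import math
from collections import Counter
from itertools import product

# Standard genetic code (simplification; mitochondrial codes differ).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def codons(cds):
    """Split a coding sequence into complete codons."""
    return [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]

def relative_adaptiveness(reference_cds):
    """w[codon] = count / count of the most used synonymous codon."""
    counts = Counter()
    for cds in reference_cds:
        counts.update(codons(cds))
    w = {}
    # Met, Trp (single codon here) and stops are conventionally excluded.
    for aa in set(AMINO) - set("*MW"):
        family = [c for c, a in CODON_TABLE.items() if a == aa]
        best = max(counts[c] for c in family)
        if best:
            for c in family:
                w[c] = counts[c] / best
    return w

def cai(cds, w):
    """Geometric mean of w over the scorable codons of cds."""
    logs = [math.log(w[c]) for c in codons(cds) if w.get(c, 0) > 0]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")

if __name__ == "__main__":
    w = relative_adaptiveness(["AAAAAAAAG"])   # Lys: AAA x2, AAG x1
    print(round(cai("AAAAAG", w), 4))          # 0.7071
```

A gene that uses only the preferred codon of each amino acid scores 1.0, and the score falls as rarer synonymous codons are used.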
New tools to analyze overlapping coding regions.

Science.gov (United States)

Bayegan, Amir H; Garcia-Martin, Juan Antonio; Clote, Peter

2016-12-13

Retroviruses transcribe messenger RNA for the overlapping Gag and Gag-Pol polyproteins, by using a programmed -1 ribosomal frameshift which requires a slippery sequence and an immediate downstream stem-loop secondary structure, together called frameshift stimulating signal (FSS). It follows that the molecular evolution of this genomic region of HIV-1 is highly constrained, since the retroviral genome must contain a slippery sequence (sequence constraint), code appropriate peptides in reading frames 0 and 1 (coding requirements), and form a thermodynamically stable stem-loop secondary structure (structure requirement). We describe a unique computational tool, RNAsampleCDS, designed to compute the number of RNA sequences that code two (or more) peptides p,q in overlapping reading frames, that are identical (or have BLOSUM/PAM similarity that exceeds a user-specified value) to the input peptides p,q. RNAsampleCDS then samples a user-specified number of messenger RNAs that code such peptides; alternatively, RNAsampleCDS can exactly compute the position-specific scoring matrix and codon usage bias for all such RNA sequences. Our software allows the user to stipulate overlapping coding requirements for all 6 possible reading frames simultaneously, even allowing IUPAC constraints on RNA sequences and fixing GC-content. We generalize the notion of codon preference index (CPI) to overlapping reading frames, and use RNAsampleCDS to generate control sequences required in the computation of CPI. Moreover, by applying RNAsampleCDS, we are able to quantify the extent to which the overlapping coding requirement in HIV-1 [resp. HCV] contribute to the formation of the stem-loop [resp. double stem-loop] secondary structure known as the frameshift stimulating signal. Using our software, we confirm that certain experimentally determined deleterious HCV mutations occur in positions for which our software RNAsampleCDS and RNAiFold both indicate a single possible nucleotide. We
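The coding requirement described here, one RNA that encodes peptide p in frame 0 and peptide q in frame 1, can be illustrated by brute force for very short peptides. The following sketch is not RNAsampleCDS and does not reproduce its algorithm; it simply enumerates every RNA of the required length, which is only feasible for toy inputs, and it assumes the standard genetic code, exact peptide identity (no BLOSUM/PAM similarity) and peptides of equal length. The names `translate` and `count_overlapping` are chosen for the example.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(rna, frame=0):
    """Translate all complete codons of rna starting at offset frame."""
    seq = rna.upper().replace("U", "T")
    return "".join(CODON_TABLE[seq[i:i + 3]]
                   for i in range(frame, len(seq) - 2, 3))

def count_overlapping(p, q):
    """Count RNAs coding p in frame 0 and q in frame 1 (equal lengths).

    Exhaustive enumeration over 4**(3*len(p)+1) sequences: toy sizes only.
    """
    if len(p) != len(q):
        raise ValueError("this sketch assumes peptides of equal length")
    n = 3 * len(p) + 1          # one extra nucleotide for the shifted frame
    total = 0
    for bases in product("ACGU", repeat=n):
        rna = "".join(bases)
        if translate(rna, 0) == p and translate(rna, 1) == q:
            total += 1
    return total

if __name__ == "__main__":
    print(translate("AUGCAUGGCUUGAAAUAA", 0))  # MHGLK*
    print(translate("AUGCAUGGCUUGAAAUAA", 1))  # CMA*N
    print(count_overlapping("M", "W"))         # 1  (only AUGG)
    print(count_overlapping("M", "C"))         # 2  (AUGU, AUGC)
```

Even in this toy case the overlap is restrictive: Met in frame 0 fixes the first two bases of the frame-1 codon to UG, so only Cys, Trp or a stop can follow in frame 1.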
De novo origin of human protein-coding genes.

Directory of Open Access Journals (Sweden)

Dong-Dong Wu

2011-11-01

The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA-seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes.
De Novo Origin of Human Protein-Coding Genes

Science.gov (United States)

Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

2011-01-01

The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831
Retrotransposons and non-protein coding RNAs

DEFF Research Database (Denmark)

Mourier, Tobias; Willerslev, Eske

2009-01-01

does not merely represent spurious transcription. We review examples of functional RNAs transcribed from retrotransposons, and address the collection of non-protein coding RNAs derived from transposable element sequences, including numerous human microRNAs and the neuronal BC RNAs. Finally, we review...
The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition

Science.gov (United States)

Štambuk, Nikola

The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.
Paracantor: A two group, two region reactor code

Energy Technology Data Exchange (ETDEWEB)

Stone, Stuart

1956-07-01

Paracantor I is a two energy group, two region, time independent reactor code, which obtains a closed solution for a critical reactor assembly. The code deals with cylindrical reactors of finite length and with a radial reflector of finite thickness. It is programmed for the I.B.M. Magnetic Drum Data-Processing Machine, Type 650. The limited memory space available does not permit a flux solution to be included in the basic Paracantor code. A supplementary code, Paracantor II, has been programmed which computes fluxes, including adjoint fluxes, from the output of Paracantor I.
The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.

Science.gov (United States)

Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas

2014-01-01

For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional
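The purine bias discussed above is a statement about nucleotide composition at each of the three codon positions, so the basic measurement is easy to sketch. The helper below, whose name is an assumption for this example, reports the fraction of purines (A or G) at codon positions 1, 2 and 3 of a coding sequence; it illustrates only the descriptive statistic, not the correlation analysis of the report.

```python
def purine_fraction_by_codon_position(cds):
    """Return [f1, f2, f3], the purine (A/G) fraction at each codon position.

    Only complete codons are counted; trailing bases are ignored.
    """
    seq = cds.upper().replace("U", "T")
    n_codons = len(seq) // 3
    if n_codons == 0:
        raise ValueError("sequence shorter than one codon")
    fractions = []
    for position in range(3):
        bases = seq[position:3 * n_codons:3]   # every third base
        fractions.append(sum(b in "AG" for b in bases) / n_codons)
    return fractions

if __name__ == "__main__":
    # Codons GCT, AAA, GGC: purines at position 1 in all three codons.
    print(purine_fraction_by_codon_position("GCTAAAGGC"))
```

For the toy sequence the purine fraction falls from 1 at the first codon position to 2/3 and 1/3 at the second and third, the kind of position-dependent gradient that the Rrr notation summarizes.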

A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements.

Directory of Open Access Journals (Sweden)

Eugeny A Elisaphenko

2008-06-01

X-chromosome inactivation, which occurs in female eutherian mammals, is controlled by a complex X-linked locus termed the X-inactivation center (XIC). Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons, including those with simple tandem repeats detectable in their structure, have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally, we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.
Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

Directory of Open Access Journals (Sweden)

Pupko Tal

2008-10-01

Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR) sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM) that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers: This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.
Systematic screening for mutations in the promoter and the coding region of the 5-HT1A gene

Energy Technology Data Exchange (ETDEWEB)

Erdmann, J.; Shimron-Abarbanell, D.; Cichon, S. [Univ. of Bonn (Germany)] [and others

1995-10-09

In the present study we sought to identify genetic variation in the 5-HT1A receptor gene which through alteration of protein function or level of expression might contribute to the genetic predisposition to neuropsychiatric diseases. Genomic DNA samples from 159 unrelated subjects (including 45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette's syndrome, as well as 25 healthy controls) were investigated by single-strand conformation analysis. Overlapping PCR (polymerase chain reaction) fragments covered the whole coding sequence as well as the 5′ untranslated region of the 5-HT1A gene. The region upstream of the coding sequence we investigated contains a functional promoter. We found two rare nucleotide sequence variants. Both mutations are located in the coding region of the gene: a coding mutation (A→G) in nucleotide position 82 which leads to an amino acid exchange (Ile→Val) in position 28 of the receptor protein and a silent mutation (C→T) in nucleotide position 549. The occurrence of the Ile-28-Val substitution was studied in an extended sample of patients (n = 352) and controls (n = 210) but was found in similar frequencies in all groups. Thus, this mutation is unlikely to play a significant role in the genetic predisposition to the diseases investigated. In conclusion, our study does not provide evidence that the 5-HT1A gene plays either a major or a minor role in the genetic predisposition to schizophrenia, bipolar affective disorder, or Tourette's syndrome. 29 refs., 4 figs., 1 tab.
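The relation between the reported nucleotide and amino acid positions can be checked with simple codon arithmetic. The sketch below assumes that nucleotide numbering starts at the A of the start codon (an assumption, though it is consistent with position 82 mapping to residue 28) and uses the standard genetic code; `codon_of` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def codon_of(nt_position):
    """Map a 1-based CDS nucleotide position to (codon number, position in codon)."""
    return (nt_position - 1) // 3 + 1, (nt_position - 1) % 3 + 1

if __name__ == "__main__":
    print(codon_of(82))    # (28, 1): first base of codon 28
    print(codon_of(549))   # (183, 3): third base of codon 183

    # A->G at the first position of any isoleucine codon gives valine.
    ile_codons = [c for c, aa in CODON_TABLE.items() if aa == "I"]
    print({c: CODON_TABLE["G" + c[1:]] for c in ile_codons})
    # {'ATT': 'V', 'ATC': 'V', 'ATA': 'V'}
```

Position 549 falls on the third base of codon 183, the position at which many codons tolerate a C→T change without altering the amino acid, which fits the abstract's description of that variant as silent.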
Novel polymorphisms in UTR and coding region of inducible heat shock protein 70.1 gene in tropically adapted Indian zebu cattle (Bos indicus) and riverine buffalo (Bubalus bubalis).

Science.gov (United States)

Sodhi, M; Mukesh, M; Kishore, A; Mishra, B P; Kataria, R S; Joshi, B K

2013-09-25

Due to evolutionary divergence, cattle (taurine and indicine) and buffalo are speculated to have different responses to heat stress conditions. Variation in candidate genes associated with a heat-shock response may provide an insight into the dissimilarity and suggest targets for intervention. The present work was undertaken to characterize the promoter and coding regions of one of the inducible heat shock protein genes in diverse breeds of Indian zebu cattle and buffaloes. The genomic DNA from a panel of 117 unrelated animals representing 14 diversified native cattle breeds and 6 buffalo breeds was utilized to determine the complete sequence and gene diversity of the HSP70.1 gene. The coding region of the HSP70.1 gene in Indian zebu cattle, Bos taurus and buffalo was similar in length (1,926 bp), encoding an HSP70 protein of 641 amino acids with a calculated molecular weight (Mw) of 70.26 kDa. However, buffalo had longer 5' and 3' untranslated regions (UTRs) of 204 and 293 nucleotides, respectively, in comparison to Indian zebu cattle and Bos taurus, wherein the lengths of the 5' and 3'-UTR were 172 and 286 nucleotides, respectively. The increased length of the buffalo HSP70.1 gene compared to the indicine and taurine gene was due to two insertions each in the 5' and 3'-UTR. Comparative sequence analysis of the cattle (taurine and indicine) and buffalo HSP70.1 gene revealed a total of 54 gene variations (50 SNPs and 4 INDELs) among the three species. The minor allele frequencies of these nucleotide variations varied from 0.03 to 0.5 with an average of 0.26. Among the 14 B. indicus cattle breeds studied, a total of 19 polymorphic sites were identified: 4 in the 5'-UTR and 15 in the coding region (of these 2 were non-synonymous). Analysis among buffalo breeds revealed 15 SNPs throughout the gene: 6 at the 5' flanking region and 9 in the coding region. In bubaline 5'-UTR, 2 additional putative transcription factor binding sites (Elk-1 and C-Re1) were identified, other than three common sites
Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins

Directory of Open Access Journals (Sweden)

Escalante Ricardo

2008-01-01

Background: The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Results: Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N), that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87–89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13-nucleotide-long open reading frame and a second exon comprising the remainder of the putative coding region. The expression of these genes is induced at 10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multicellular structures at the mound stage of development. Conclusion: A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.
Origins of gene, genetic code, protein and life

Indian Academy of Sciences (India)

Unknown

have concluded that newly-born genes are products of nonstop frames (NSF) ... research to determine tertiary structures of proteins such ... the present earth, is favourable for new genes to arise, if ..... NGG) in the universal genetic code table, cannot satisfy ..... which has been proposed to explain the development of life on.
Conceptual Approach to Forming the Basic Code of Neo-Industrial Development of a Region

Directory of Open Access Journals (Sweden)

Elena Leonidovna Andreeva

2017-09-01

Full Text Available In the article, the authors propose the conceptual fundamentals of the “code approach” to the regional neo-industrial development. The purpose of the research is to reveal the essence of the transition to a new type of industrial and economic relations through a prism of “genetic codes” of the region. We consider these codes as a system of the “racial memory” of a territory, which determines the specificity and features of neo-industrialization realization. We substantiated the hypothesis about the influence of the “genetic codes” of the region on the effectiveness of the neo-industrialization. We have defined the participants, or else the carriers of the codes in the transformation of regional inheritance for the stimulation of the neoindustrial development of region’s economy. The subject matter of the research is the distinctive features of the functioning of the determinative region’s codes. Their content determines the socio-economic specificity of the region and the features of innovative, informational, value-based and competence-based development of the territory. The determinative codes generate the dynamic codes of the region, which are understood as their derivatives. They have a high probability of occurrence, higher speed of development and distribution, internal forces that make possible the self-development of the region. The scientific contribution is the substantiation of the basic code of the regional neo-industrial development. It represents the evolutionary accumulation of the rapid changes of its innovative, informational, value-based and competence-based codes stimulating the generation and implementation of new ideas regarding to economic entities adapted to the historical and cultural conditions. The article presents the code model of neo-industrial development of the region described by formulas. We applied the system analysis methods, historical and civilization approaches, evolutionary and
Prader-Willi region non-protein coding RNA 1 suppressed gastric cancer growth as a competing endogenous RNA of microRNA-425-5p.

Science.gov (United States)

Chen, Zihao; Ju, Hongping; Yu, Shan; Zhao, Ting; Jing, Xiaojie; Li, Ping; Jia, Jing; Li, Nan; Tan, Bibo; Li, Yong

2018-03-13

Gastric cancer (GC) is a major global health problem, especially in Asia. Long non-coding RNAs (lncRNAs) have gained significant attention in current research on carcinogenesis. This study explored the mechanism by which Prader-Willi region non-protein coding RNA 1 (PWRN1) regulates the GC process. Differentially expressed lncRNAs in GC tissues were screened through microarray analysis. RNA and protein expression levels were detected by qRT-PCR and western blot. Cell proliferation, apoptosis rate, and metastatic ability were determined by CCK8, flow cytometry, and wound healing and transwell assays, respectively. The luciferase reporter system was used to verify the targeting relationships between PWRN1, miR-425-5p and PTEN. A RIP assay was performed to test whether PWRN1 acted as a competitive endogenous RNA (ceRNA) of miR-425-5p. A tumor xenograft model and immunohistochemistry were used to study the influence of PWRN1 on tumor growth in vivo. Microarray analysis determined that PWRN1 was differentially expressed between GC tissues and adjacent tissues. qRT-PCR revealed low PWRN1 expression in GC tissues and cells. Up-regulated PWRN1 reduced proliferation and metastasis and increased apoptosis in GC cells, while miR-425-5p had the reverse effects. The RIP assay indicated that PWRN1 may target an oncogene, miR-425-5p. The tumor xenograft assay found that up-regulated PWRN1 suppressed tumor growth. The bioinformatic analysis, luciferase assay and western blot indicated that PWRN1 affected the PTEN/Akt/MDM2/p53 axis by suppressing miR-425-5p. Our findings suggest that PWRN1 functions as a ceRNA targeting miR-425-5p and suppresses GC development via the p53 signaling pathway. ©2018 The Author(s).
Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

KAUST Repository

Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; Brown, James B.; Lipovich, Leonard; Bajic, Vladimir B.

2014-01-01

raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted
On fuzzy semantic similarity measure for DNA coding.

Science.gov (United States)

Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin

2016-02-01

A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons' clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides, i.e., appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons, have been exploited in FSSM. The nucleotides' fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification, i.e., 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.
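To make the idea of a coding measure scheme concrete, here is a minimal sketch of the simplest fixed mapping, the Voss binary indicator representation, followed by a direct measurement of period-3 signal power. This is not the FSSM scheme, whose details are not given in the abstract; the function names are hypothetical and chosen only for illustration.

```python
import cmath

def voss_indicators(seq):
    """Map a DNA string to four binary indicator signals (Voss representation)."""
    seq = seq.upper()
    return {base: [1.0 if ch == base else 0.0 for ch in seq] for base in "ACGT"}

def power_at_period(signal, period):
    """Squared magnitude of the discrete Fourier sum at frequency 1/period."""
    total = sum(x * cmath.exp(-2j * cmath.pi * n / period)
                for n, x in enumerate(signal))
    return abs(total) ** 2

def period3_power(seq):
    """Sum the period-3 power over the four indicator signals."""
    return sum(power_at_period(sig, 3) for sig in voss_indicators(seq).values())
```

A sequence with strong 3-base periodicity, such as a repeated codon, gives a large value, while a homopolymer whose length is a multiple of 3 gives a value near zero.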
Annotating pathogenic non-coding variants in genic regions.

Science.gov (United States)

Gelfman, Sahar; Wang, Quanli; McSweeney, K Melodi; Ren, Zhong; La Carpia, Francesca; Halvorsen, Matt; Schoch, Kelly; Ratzon, Fanni; Heinzen, Erin L; Boland, Michael J; Petrovski, Slavé; Goldstein, David B

2017-08-09

Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes. While non-coding synonymous and intronic variants are often not under strong selective constraint, they can be pathogenic through affecting splicing or transcription. Here, the authors develop a score that uses sequence context alterations to predict pathogenicity of synonymous and non-coding genetic variants, and provide a web server of pre-computed scores.
Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs

Science.gov (United States)

Schnall-Levin, Michael; Rissland, Olivia S.; Johnston, Wendy K.; Perrimon, Norbert; Bartel, David P.; Berger, Bonnie

2011-01-01

MicroRNAs (miRNAs) regulate numerous biological processes by base-pairing with target messenger RNAs (mRNAs), primarily through sites in 3′ untranslated regions (UTRs), to direct the repression of these targets. Although miRNAs have sometimes been observed to target genes through sites in open reading frames (ORFs), large-scale studies have shown such targeting to be generally less effective than 3′ UTR targeting. Here, we show that several miRNAs each target significant groups of genes through multiple sites within their coding regions. This ORF targeting, which mediates both predictable and effective repression, arises from highly repeated sequences containing miRNA target sites. We show that such sequence repeats largely arise through evolutionary duplications and occur particularly frequently within families of paralogous C2H2 zinc-finger genes, suggesting the potential for their coordinated regulation. Examples of ORFs targeted by miR-181 include both the well-known tumor suppressor RB1 and RBAK, encoding a C2H2 zinc-finger protein and transcriptional binding partner of RB1. Our results indicate a function for repeat-rich coding sequences in mediating post-transcriptional regulation and reveal circumstances in which miRNA-mediated repression through ORF sites can be reliably predicted. PMID:21685129
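As a rough illustration of how repeated target sites inside an ORF might be located, the sketch below scans a coding sequence for perfect matches to the miRNA seed (positions 2 to 8). It assumes a simple 7-mer seed-match rule, which is a common convention and not necessarily the criterion used in the study; the function names and the toy sequences in the usage note are hypothetical.

```python
def seed_site(mirna):
    """Return the DNA-alphabet site complementary to miRNA positions 2-8 (the seed)."""
    seed = mirna.upper().replace("T", "U")[1:8]        # positions 2..8, 1-based
    pair = {"A": "T", "U": "A", "G": "C", "C": "G"}
    return "".join(pair[b] for b in reversed(seed))     # reverse complement

def find_sites(orf, mirna):
    """Return 0-based start positions of every seed match in `orf`, overlaps included."""
    site = seed_site(mirna)
    orf = orf.upper().replace("U", "T")
    hits, start = [], orf.find(site)
    while start != -1:
        hits.append(start)
        start = orf.find(site, start + 1)
    return hits
```

For a made-up miRNA `AACGUACGGAUUUUUUUUUUUU` the seed site is `CGTACGT`, and an ORF that contains that 7-mer twice reports two hits, mimicking the repeat-driven multiplicity of sites described above.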
SINEUPs are modular antisense long-non coding RNAs that increase synthesis of target proteins in cells

Directory of Open Access Journals (Sweden)

Silvia eZucchelli

2015-05-01

Full Text Available Despite recent efforts in discovering novel long non-coding RNAs (lncRNAs) and unveiling their functions in a wide range of biological processes, their applications as biotechnological or therapeutic tools are still in their infancy. We have recently shown that AS Uchl1, a natural lncRNA antisense to the Parkinson’s disease-associated gene Ubiquitin carboxyl-terminal esterase L1 (Uchl1), is able to increase UchL1 protein synthesis at the post-transcriptional level. Its activity requires two RNA elements: an embedded inverted SINEB2 sequence to increase translation and the overlapping region to target its sense mRNA. This functional organization is shared with several mouse lncRNAs antisense to protein coding genes. The potential use of AS Uchl1-derived lncRNAs as enhancers of target mRNA translation remains unexplored. Here we define AS Uchl1 as the representative member of a new functional class of natural and synthetic antisense lncRNAs that activate translation. We named this class of RNAs SINEUPs for their requirement of the inverted SINEB2 sequence to UP-regulate translation in a gene-specific manner. The overlapping region is indicated as the Binding Domain (BD) while the embedded inverted SINEB2 element is the Effector Domain (ED). By swapping the BD, synthetic SINEUPs are designed to target mRNAs of interest. SINEUPs function in an array of cell lines and can be efficiently directed towards N-terminally tagged proteins. Their biological activity is retained in a miniaturized version within the length range of small RNAs. Their modular structure was exploited to successfully design synthetic SINEUPs targeting endogenous Parkinson’s disease-associated DJ-1, which proved to be active in different neuronal cell lines. In summary, SINEUPs represent the first scalable tool to increase synthesis of proteins of interest. We propose SINEUPs as reagents for molecular biology experiments, in protein manufacturing as well as in therapy of haploinsufficiencies.
Emerging putative associations between non-coding RNAs and protein-coding genes in Neuropathic Pain. Added value from re-using microarray data.

Directory of Open Access Journals (Sweden)

Enrico Capobianco

2016-10-01

Full Text Available Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs). This leaves open questions about the potential effects of ncRNAs at the transcriptome level. Due to the limited availability of human neuropathic pain data, we have identified the most comprehensive time-course gene expression profile related to sciatic nerve injury, studied in a rat model using two neuronal tissues, namely dorsal root ganglion (DRG) and sciatic nerve (SN). We have developed a methodology to identify differentially expressed bioentities starting from microarray probes, and re-purposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parent genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to neuropathic pain. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN, and 8 in DRG), antisense RNA (31 asRNA in SN, and 12 in DRG) and pseudogenes (456 in SN, 56 in DRG). In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly
Spectrum of small mutations in the dystrophin coding region

Energy Technology Data Exchange (ETDEWEB)

Prior, T.W.; Bartolo, C.; Pearl, D.K. [Ohio State Univ., Columbus, OH (United States)] [and others

1995-07-01

Duchenne and Becker muscular dystrophies (DMD and BMD) are caused by defects in the dystrophin gene. About two-thirds of the affected patients have large deletions or duplications, which occur in the 5′ and central portion of the gene. The nondeletion/duplication cases are most likely the result of smaller mutations that cannot be identified by current diagnostic screening strategies. We screened approximately 80% of the dystrophin coding sequence for small mutations in 158 patients without deletions or duplications and identified 29 mutations. The study indicates that many of the DMD and the majority of the BMD small mutations lie in noncoding regions of the gene. All of the mutations identified were unique to single patients, and most of the mutations resulted in protein truncation. We did not find a clustering of small mutations similar to the deletion distribution but found >40% of the small mutations 3′ of exon 55. The extent of protein truncation caused by the 3′ mutations did not determine the phenotype, since even the exon 76 nonsense mutation resulted in the severe DMD phenotype. Our study confirms that the dystrophin gene is subject to a high rate of mutation in CpG sequences. As a consequence of not finding any hotspots or prevalent small mutations, we conclude that it is presently not possible to perform direct carrier and prenatal diagnostics for many families without deletions or duplications. 71 refs., 2 figs., 2 tabs.
ProteinSplit: splitting of multi-domain proteins using prediction of ordered and disordered regions in protein sequences for virtual structural genomics

International Nuclear Information System (INIS)

Wyrwicz, Lucjan S; Koczyk, Grzegorz; Rychlewski, Leszek; Plewczynski, Dariusz

2007-01-01

The annotation of protein folds within newly sequenced genomes is the main target for semi-automated protein structure prediction (virtual structural genomics). A large number of automated methods have been developed recently with very good results in the case of single-domain proteins. Unfortunately, most of these automated methods often fail to properly predict the distant homology between a given multi-domain protein query and structural templates. Therefore a multi-domain protein should be split into domains in order to overcome this limitation. ProteinSplit is designed to identify protein domain boundaries using a novel algorithm that predicts disordered regions in protein sequences. The software utilizes various sequence characteristics to assess the local propensity of a protein to be disordered or ordered in terms of local structure stability. These disordered parts of a protein are likely to create interdomain spacers. Because of its speed and portability, the method was successfully applied to several genome-wide fold annotation experiments. The user can run an automated analysis of sets of proteins or perform semi-automated multiple user projects (saving the results on the server). Additionally the sequences of predicted domains can be sent to the Bioinfo.PL Protein Structure Prediction Meta-Server for further protein three-dimensional structure and function prediction. The program is freely accessible as a web service at http://lucjan.bioinfo.pl/proteinsplit together with detailed benchmark results on the critical assessment of a fully automated structure prediction (CAFASP) set of sequences. The source code of the local version of protein domain boundary prediction is available upon request from the authors
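The splitting step described above can be sketched as follows, assuming per-residue disorder scores are already available from some predictor. This is not the ProteinSplit algorithm itself, which the abstract does not specify; the function name, the 0.5 threshold and the minimum linker length are hypothetical parameters.

```python
from itertools import groupby

def split_at_disorder(seq, scores, threshold=0.5, min_linker=3):
    """Split `seq` into putative domains at runs of >= min_linker disordered residues."""
    if len(seq) != len(scores):
        raise ValueError("seq and scores must have the same length")
    domains, current, pos = [], "", 0
    for disordered, run in groupby(scores, key=lambda s: s >= threshold):
        n = len(list(run))
        if disordered and n >= min_linker:
            # A long disordered run is treated as an interdomain spacer.
            if current:
                domains.append(current)
            current = ""
        else:
            # Ordered runs and short disordered loops stay inside the domain.
            current += seq[pos:pos + n]
        pos += n
    if current:
        domains.append(current)
    return domains
```

Each returned fragment could then be submitted separately to a fold-recognition method, which is the motivation the abstract gives for splitting multi-domain queries.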
Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions.

Science.gov (United States)

Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N

2007-05-01

Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes approximately 90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions.
Functional Anthology of Intrinsic Disorder. II. Cellular Components, Domains, Technical Terms, Developmental Processes and Coding Sequence Diversities Correlated with Long Disordered Regions

Science.gov (United States)

Vucetic, Slobodan; Xie, Hongbo; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.

2008-01-01

Biologically active proteins without stable ordered structure (i.e., intrinsically disordered proteins) are attracting increased attention. Functional repertoires of ordered and disordered proteins are very different, and the ability to differentiate whether a given function is associated with intrinsic disorder or with a well-folded protein is crucial for modern protein science. However, there is a large gap between the number of proteins experimentally confirmed to be disordered and their actual number in nature. As a result, studies of functional properties of confirmed disordered proteins, while helpful in revealing the functional diversity of protein disorder, provide only a limited view. To overcome this problem, a bioinformatics approach for comprehensive study of functional roles of protein disorder was proposed in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Applying this novel approach to Swiss-Prot sequences and functional keywords, we found over 238 and 302 keywords to be strongly positively or negatively correlated, respectively, with long intrinsically disordered regions. This paper describes ~90 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions. PMID:17391015
Block-based wavelet transform coding of mammograms with region-adaptive quantization

Science.gov (United States)

Moon, Nam Su; Song, Jun S.; Kwon, Musik; Kim, JongHyo; Lee, ChoongWoong

1998-06-01

To achieve both a high compression ratio and information preservation, it is efficient to combine segmentation with a lossy compression scheme. Microcalcification in mammograms is one of the most significant signs of early-stage breast cancer. Therefore, in coding, detection and segmentation of microcalcifications enable us to preserve them well by allocating more bits to them than to other regions. Segmentation of microcalcifications is performed both in the spatial domain and in the wavelet transform domain. A peak-error-controllable quantization step, designed off-line, is suitable for medical image compression. For region-adaptive quantization, block-based wavelet transform coding is adopted and different peak-error-constrained quantizers are applied to blocks according to the segmentation result. In terms of preserving microcalcifications, the proposed coding scheme shows better performance than JPEG.
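The idea of region-adaptive, peak-error-constrained quantization can be illustrated with a uniform quantizer, whose reconstruction error never exceeds half the step size. The sketch below is an assumption-laden toy, not the authors' coder: the block layout, the step sizes and the function names are hypothetical, and the wavelet transform and segmentation stages are taken as given.

```python
def quantize(value, step):
    """Uniform mid-tread quantizer: reconstruction error is at most step / 2."""
    return round(value / step) * step

def quantize_blocks(blocks, is_roi, fine_step=2, coarse_step=16):
    """Quantize each block of coefficients with a step chosen by its segmentation label."""
    out = []
    for block, roi in zip(blocks, is_roi):
        # Blocks flagged as region of interest get the finer step (more bits).
        step = fine_step if roi else coarse_step
        out.append([quantize(c, step) for c in block])
    return out
```

With these defaults, coefficients in flagged blocks are reproduced to within 1, while background blocks tolerate errors of up to 8 and therefore need fewer bits.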
Sensitivity Study of Regional TDC in MATRA-S code Using PSBT Benchmark Exercise

International Nuclear Information System (INIS)

Kim, Seong Jin; Cha, Jeong Hun; Seo, Kyong Won; Kwon, Hyuk; Hwang, Dae Hyun

2012-01-01

In sub-channel analysis codes, interchannel exchanges between adjacent sub-channels are modeled as diversion cross flow, turbulent mixing, and so on. Turbulent mixing in the MATRA-S code is represented by the TDC (β: thermal diffusion coefficient). The TDC varies with the bundle, grid type, mixing vane, and so on. Generally, a thermal mixing test is conducted to optimize the TDC. In the OECD/NRC PSBT benchmark, a thermal mixing test was conducted and the optimized TDC was analyzed using the MATRA-S code. The exit temperature distribution from the MATRA-S code differed from the experimental result even though the optimized TDC was applied to the code. In this study, the concept of a regional TDC is introduced and a sensitivity analysis of the regional TDC is presented.

Both noncoding and protein-coding RNAs contribute to gene expression evolution in the primate brain.

Science.gov (United States)

Babbitt, Courtney C; Fedrigo, Olivier; Pfefferle, Adam D; Boyle, Alan P; Horvath, Julie E; Furey, Terrence S; Wray, Gregory A

2010-01-18

Despite striking differences in cognition and behavior between humans and our closest primate relatives, several studies have found little evidence for adaptive change in protein-coding regions of genes expressed primarily in the brain. Instead, changes in gene expression may underlie many cognitive and behavioral differences. Here, we used digital gene expression: tag profiling (here called Tag-Seq, also called DGE:tag profiling) to assess changes in global transcript abundance in the frontal cortex of the brains of 3 humans, 3 chimpanzees, and 3 rhesus macaques. A substantial fraction of transcripts we identified as differentially transcribed among species were not assayed in previous studies based on microarrays. Differentially expressed tags within coding regions are enriched for gene functions involved in synaptic transmission, transport, oxidative phosphorylation, and lipid metabolism. Importantly, because Tag-Seq technology provides strand-specific information about all polyadenylated transcripts, we were able to assay expression in noncoding intragenic regions, including both sense and antisense noncoding transcripts (relative to nearby genes). We find that many noncoding transcripts are conserved in both location and expression level between species, suggesting a possible functional role. Lastly, we examined the overlap between differential gene expression and signatures of positive selection within putative promoter regions, a sign that these differences represent adaptations during human evolution. Comparative approaches may provide important insights into genes responsible for differences in cognitive functions between humans and nonhuman primates, as well as highlighting new candidate genes for studies investigating neurological disorders.
Bifurcations in the interplay of messenger RNA, protein and nonprotein coding RNA

International Nuclear Information System (INIS)

Zhdanov, Vladimir P

2008-01-01

The interplay of messenger RNA (mRNA), protein, produced via translation of this RNA, and nonprotein coding RNA (ncRNA) may include regulation of the ncRNA production by protein and (i) ncRNA-protein association resulting in suppression of the protein regulatory activity or (ii) ncRNA-mRNA association resulting in degradation of the miRNA-mRNA complex. The kinetic models describing these two scenarios are found to predict bistability provided that protein suppresses the ncRNA formation
Evidence for gene-specific rather than transcription rate-dependent histone H3 exchange in yeast coding regions.

Science.gov (United States)

Gat-Viks, Irit; Vingron, Martin

2009-02-01

In eukaryotic organisms, histones are dynamically exchanged independently of DNA replication. Recent reports show that different coding regions differ in their amount of replication-independent histone H3 exchange. The current paradigm is that this histone exchange variability among coding regions is a consequence of transcription rate. Here we put forward the idea that this variability might be also modulated in a gene-specific manner independently of transcription rate. To that end, we study transcription rate-independent replication-independent coding region histone H3 exchange. We term such events relative exchange. Our genome-wide analysis shows conclusively that in yeast, relative exchange is a novel consistent feature of coding regions. Outside of replication, each coding region has a characteristic pattern of histone H3 exchange that is either higher or lower than what was expected by its RNAPII transcription rate alone. Histone H3 exchange in coding regions might be a way to add or remove certain histone modifications that are important for transcription elongation. Therefore, our results that gene-specific coding region histone H3 exchange is decoupled from transcription rate might hint at a new epigenetic mechanism of transcription regulation.
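One simple way to picture "relative exchange" is as the part of a gene's histone H3 exchange that a fit on transcription rate does not explain. The sketch below assumes an ordinary least-squares line, which is an illustrative choice and not necessarily the model used in the study; the function name and the toy numbers in the usage note are hypothetical.

```python
def relative_exchange(rates, exchange):
    """Residuals of an ordinary least-squares fit exchange ~ a + b * rate."""
    n = len(rates)
    mean_x = sum(rates) / n
    mean_y = sum(exchange) / n
    sxx = sum((x - mean_x) ** 2 for x in rates)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(rates, exchange))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Positive residual: more exchange than the transcription rate alone predicts.
    return [y - (intercept + slope * x) for x, y in zip(rates, exchange)]
```

For rates `[1, 2, 3, 4]` and exchange values `[1, 3, 2, 5]`, the second gene has a positive residual and the third a negative one, corresponding to exchange that is higher or lower than expected from transcription rate alone.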
A human-specific de novo protein-coding gene associated with human brain functions.

Directory of Open Access Journals (Sweden)

Chuan-Yun Li

2010-03-01

Full Text Available To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203). Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements, especially Alu, contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expression of FLJ33706 was detected in Alzheimer's brain samples, suggesting a role for this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.
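The effect of a substitution that removes an in-frame stop codon can be shown with a toy reading-frame scan. The sequences in the usage note are invented for illustration and are unrelated to FLJ33706; the function name is hypothetical.

```python
STOPS = {"TAA", "TAG", "TGA"}

def orf_codons(dna, start=0):
    """Count codons from `start` up to (not including) the first in-frame stop codon."""
    dna = dna.upper()
    count = 0
    for i in range(start, len(dna) - 2, 3):
        if dna[i:i + 3] in STOPS:
            break
        count += 1
    return count
```

In the toy sequence `ATGGCCTAAGGATGA` the frame stops after two codons, whereas a single T-to-C change giving `ATGGCCCAAGGATGA` extends it to four codons before the next stop.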
Methods for Using Small Non-Coding RNAs to Improve Recombinant Protein Expression in Mammalian Cells

Directory of Open Access Journals (Sweden)

Sarah Inwood

2018-01-01

Full Text Available The ability to produce recombinant proteins by utilizing different “cell factories” revolutionized the biotherapeutic and pharmaceutical industry. Chinese hamster ovary (CHO) cells are the dominant industrial producer, especially for antibodies. Human embryonic kidney (HEK) cells, while not as widely used as CHO cells, are used where CHO cells are unable to meet the needs for expression, such as for growth factors. Therefore, improving recombinant protein expression from mammalian cells is a priority, and continuing effort is being devoted to this topic. Non-coding RNAs are RNA segments that are not translated into a protein and often have a regulatory role. Since their discovery, major progress has been made towards understanding their functions. Non-coding RNAs have been investigated extensively in relation to disease, especially cancer, and recently they have also been used as a method for engineering cells to improve their protein expression capability. In this review, we provide information about methods used to identify non-coding RNAs with the potential of improving recombinant protein expression in mammalian cell lines.
Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

Science.gov (United States)

Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

2017-12-02

The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.
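Two of the basic quantities behind such an analysis, codon counts and GC3 (the fraction of codons with G or C at the third position), can be computed as in the sketch below. It assumes an in-frame coding sequence as input and does not reproduce the ENC or neutrality-plot calculations from the study; the function names are hypothetical.

```python
from collections import Counter

def codons(cds):
    """Split an in-frame coding sequence into complete codons."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]

def codon_counts(cds):
    """Count how often each codon occurs."""
    return Counter(codons(cds))

def gc3(cds):
    """Fraction of codons whose third position is G or C."""
    thirds = [c[2] for c in codons(cds)]
    return sum(b in "GC" for b in thirds) / len(thirds)
```

For the toy sequence `ATGGCCAAATTC`, three of the four codons end in G or C, so GC3 is 0.75.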
Implementation of the International Code of Marketing of Breastmilk Substitutes in the Eastern Mediterranean Region.

Science.gov (United States)

Al Jawaldeh, Ayoub; Sayed, Ghada

2018-04-05

Optimal breastfeeding practices and appropriate complementary feeding improve child health, survival and development. The countries of the Eastern Mediterranean Region have made significant strides in formulation and implementation of legislation to protect and promote breastfeeding based on The International Code of Marketing of Breast-milk Substitutes (the Code) and subsequent relevant World Health Assembly resolutions. To assess the implementation of the Code in the Region. Assessment was conducted by the World Health Organization (WHO) Regional Office for the Eastern Mediterranean using a WHO standard questionnaire. Seventeen countries in the Region have enacted legislation to protect breastfeeding. Only 6 countries have comprehensive legislation or other legal measures reflecting all or most provisions of the Code; 4 countries have legal measures incorporating many provisions of the Code; 7 countries have legal measures that contain a few provisions of the Code; 4 countries are currently studying the issue; and only 1 country has no measures in place. Further analysis of the legislation found that the text of articles in the laws fully reflected the Code articles in only 6 countries. Most countries need to revisit and amend existing national legislation to implement fully the Code and relevant World Health Assembly resolutions, supported by systematic monitoring and reporting. Copyright © World Health Organization (WHO) 2018. Some rights reserved. This work is available under the CC BY-NC-SA 3.0 IGO license (https://creativecommons.org/licenses/by-nc-sa/3.0/igo).
Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

DEFF Research Database (Denmark)

Balinda, Sheila; Siegismund, Hans; Muwanika, Vincent

2010-01-01

from both serotypes A and O. Conclusions Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2), a probable indication that hardly any FMD introductions of this serotype have occurred from outside the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A and O has been obtained. In addition to characterization using the VP1 coding region, analyses involving the non-structural protein coding...
Determining coding CpG islands by identifying regions significant for pattern statistics on Markov chains.

Science.gov (United States)

Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior

2011-09-23

Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.
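As a rough illustration of what a tail probability for a CpG count under a Markov chain means, the sketch below computes P(at least k CpGs) in a sequence of length n by dynamic programming over (last base, count so far). It assumes a plain first-order nucleotide chain; the authors' model accounts for codon usage and is more elaborate, and the function name and the uniform parameters are hypothetical.

```python
def cpg_tail_probability(n: int, k: int, initial: dict, transition: dict) -> float:
    """P(number of CpG dinucleotides >= k) in a length-n sequence.

    initial[b] is the probability of the first base b;
    transition[a][b] is the probability of base b following base a.
    Counts are capped at k, so the state space stays small.
    """
    bases = "ACGT"
    state = {(b, 0): initial[b] for b in bases}
    for _ in range(n - 1):
        nxt = {}
        for (prev, count), p in state.items():
            for b in bases:
                is_cpg = int(prev == "C" and b == "G")
                key = (b, min(k, count + is_cpg))
                nxt[key] = nxt.get(key, 0.0) + p * transition[prev][b]
        state = nxt
    return sum(p for (_, count), p in state.items() if count == k)


if __name__ == "__main__":
    # Hypothetical uniform chain, used only to keep the example checkable by hand.
    uniform = {b: 0.25 for b in "ACGT"}
    chain = {a: dict(uniform) for a in "ACGT"}
    print(cpg_tail_probability(2, 1, uniform, chain))  # 0.0625, i.e. P("CG") = 1/16
```

A region whose observed CpG count has a small tail probability under the fitted chain would be a candidate island; selecting among overlapping candidates and controlling the false discovery rate are separate steps not shown here.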
Regional Atmospheric Transport Code for Hanford Emission Tracking, Version 2 (RATCHET2)

International Nuclear Information System (INIS)

Ramsdell, James V.; Rishel, Jeremy P.

2006-01-01

This manual describes the atmospheric model and computer code for the Atmospheric Transport Module within SAC. The Atmospheric Transport Module, called RATCHET2, calculates the time-integrated air concentration and surface deposition of airborne contaminants to the soil. The RATCHET2 code is an adaptation of the Regional Atmospheric Transport Code for Hanford Emissions Tracking (RATCHET). The original RATCHET code was developed to perform the atmospheric transport for the Hanford Environmental Dose Reconstruction Project. Fundamentally, the two sets of codes are identical; no capabilities have been deleted from the original version of RATCHET. Most modifications are generally limited to revision of the run-specification file to streamline the simulation process for SAC.
Regional Atmospheric Transport Code for Hanford Emission Tracking, Version 2 (RATCHET2)

Energy Technology Data Exchange (ETDEWEB)

Ramsdell, James V.; Rishel, Jeremy P.

2006-07-01

This manual describes the atmospheric model and computer code for the Atmospheric Transport Module within SAC. The Atmospheric Transport Module, called RATCHET2, calculates the time-integrated air concentration and surface deposition of airborne contaminants to the soil. The RATCHET2 code is an adaptation of the Regional Atmospheric Transport Code for Hanford Emissions Tracking (RATCHET). The original RATCHET code was developed to perform the atmospheric transport for the Hanford Environmental Dose Reconstruction Project. Fundamentally, the two sets of codes are identical; no capabilities have been deleted from the original version of RATCHET. Most modifications are generally limited to revision of the run-specification file to streamline the simulation process for SAC.
Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

Science.gov (United States)

Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

2015-12-11

High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.
Cloning of cDNAs coding for the heavy chain region and connecting region of human factor V, a blood coagulation factor with four types of internal repeats

International Nuclear Information System (INIS)

Kane, W.H.; Ichinose, A.; Hagen, F.S.; Davie, E.W.

1987-01-01

Human factor V is a high molecular weight plasma glycoprotein that participates as a cofactor in the conversion of prothrombin to thrombin by factor Xa. Prior to its participation in the coagulation cascade, factor V is converted to factor Va by thrombin, generating a heavy chain and a light chain, and these two chains are held together by calcium ions. A connecting region originally located between the heavy and light chains is liberated during the activation reaction. In a previous study, a cDNA of 2970 nucleotides that codes for the carboxyl-terminal 938 amino acids of factor V was isolated and characterized from a Hep G2 cDNA library. This cDNA has been used to obtain additional clones from Hep G2 and human liver cDNA libraries. Furthermore, a Hep G2 cDNA library prepared with an oligonucleotide from the 5' end of these cDNAs was screened to obtain overlapping cDNA clones that code for the amino-terminal region of the molecule. The composite sequence of these clones spans 6911 nucleotides and is consistent with the size of the factor V message present in Hep G2 cells (approximately 7 kilobases). The cDNA codes for a leader sequence of 28 amino acids and a mature protein of 2196 amino acids. The amino acid sequence predicted from the cDNA was in complete agreement with 139 amino acid residues that were identified by Edman degradation of cyanogen bromide peptides isolated from the heavy chain region and connecting region of plasma factor V. The domain structure of human factor V is similar to that previously reported for human coagulation factor VIII. Two types of tandem repeats (17 and 9 amino acids) have also been identified in the connecting region of factor V. The present data indicate that the amino acid sequence in the heavy and light chain regions of factor V is ∼40% identical with the corresponding regions of factor VIII.
Integration and visualization of non-coding RNA and protein interaction networks

OpenAIRE

Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian; Pan, Xiaoyong; Santos Delgado, Alberto; Anthon, Christian; Alkan, Ferhat; von Mering, Christian; Workman, Christopher; Jensen, Lars Juhl; Gorodkin, Jan

2015-01-01

Non-coding RNAs (ncRNAs) fulfill a diverse set of biological functions relying on interactions with other molecular entities. The advent of new experimental and computational approaches makes it possible to study ncRNAs and their associations on an unprecedented scale. We present RAIN (RNA Association and Interaction Networks) - a database that combines ncRNA-ncRNA, ncRNA-mRNA and ncRNA-protein interactions with large-scale protein association networks available in the STRING database. By int...
Capsid coding region diversity of re-emerging lineage C foot-and-mouth disease virus serotype Asia1 from India.

Science.gov (United States)

Subramaniam, Saravanan; Mohapatra, Jajati K; Das, Biswajit; Sharma, Gaurav K; Biswal, Jitendra K; Mahajan, Sonalika; Misri, Jyoti; Dash, Bana B; Pattnaik, Bramhadev

2015-07-01

Foot-and-mouth disease virus (FMDV) serotype Asia1 was first reported in India in 1951, where three major genetic lineages (B, C and D) of this serotype have been described until now. In this study, the capsid protein coding regions of serotype Asia1 viruses (n = 99) from India were analyzed, giving importance to the viruses circulating since 2007. All of the isolates (n = 50) recovered during 2007-2013 were found to group within the re-emerging cluster of lineage C (designated as sublineage C(R)). The evolutionary rate of sublineage C(R) was estimated to be slightly higher than that of the serotype as a whole, and the time of the most recent common ancestor for this cluster was estimated to be approximately 2001. In comparison to the older isolates of lineage C (1993-2001), the re-emerging viruses showed variation at eight amino acid positions, including substitutions at the antigenically critical residues 79 and 131 of VP2. However, no direct correlation was found between sequence variations and antigenic relationships. The number of codons under positive selection and the nature of the selection pressure varied widely among the structural proteins, implying a heterogeneous pattern of evolution in serotype Asia1. While episodic diversifying selection appears to play a major role in shaping the evolution of VP1 and VP3, selection pressure acting on codons of VP2 is largely pervasive. Further, episodic positive selection appears to be responsible for the early diversification of lineage C. Recombination events identified in the structural protein coding region indicate its probable role in adaptive evolution of serotype Asia1 viruses.
SCOWLP classification: Structural comparison and analysis of protein binding regions

Directory of Open Access Journals (Sweden)

Anders Gerd

2008-01-01

Background: Detailed information about protein interactions is critical for our understanding of the principles governing protein recognition mechanisms. The structures of many proteins have been experimentally determined in complex with different ligands bound either in the same or different binding regions. Thus, the structural interactome requires the development of tools to classify protein binding regions. A proper classification may provide a general view of the regions that a protein uses to bind others and also facilitate a detailed comparative analysis of the interaction information for specific protein binding regions at the atomic level. Such a classification might be of potential use for deciphering protein interaction networks, understanding protein function, and rational engineering and design. Description: Protein binding regions (PBRs) might ideally be described as well-defined separate regions that share no interacting residues with one another. However, PBRs are often irregular and discontinuous, and can share a wide range of interacting residues among them. The criteria to define an individual binding region can often be arbitrary and may differ from other binding regions within a protein family. Therefore, the rationale behind protein interface classification should aim to fulfil the requirements of the analysis to be performed. We extract detailed interaction information of protein domains, peptides and interfacial solvent from the SCOWLP database and we classify the PBRs of each domain family. For this purpose, we define a similarity index based on the overlap of interacting residues mapped in pair-wise structural alignments. We perform our classification with agglomerative hierarchical clustering using the complete-linkage method. Our classification is calculated at different similarity cut-offs to allow flexibility in the analysis of PBRs, a feature especially interesting for those protein families with conflicting binding regions
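The clustering step described in this abstract can be sketched as follows. The abstract does not give the formula of the similarity index, so a Jaccard-style overlap of residue sets is assumed here; the names `overlap_similarity` and `complete_linkage` and the toy residue sets are hypothetical and not taken from SCOWLP.

```python
from itertools import combinations


def overlap_similarity(a: set, b: set) -> float:
    """Assumed similarity index: shared interacting residues over all residues."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def complete_linkage(regions: dict, min_similarity: float) -> list:
    """Agglomerative clustering; two clusters merge only if even their
    least similar pair of members reaches min_similarity."""
    clusters = [[name] for name in regions]

    def linkage(c1, c2):
        return min(overlap_similarity(regions[x], regions[y]) for x in c1 for y in c2)

    while len(clusters) > 1:
        # Pick the pair of clusters with the highest complete-linkage similarity.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]),
        )
        if linkage(clusters[i], clusters[j]) < min_similarity:
            break
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # j > i, so index i is unaffected
    return [sorted(c) for c in clusters]


if __name__ == "__main__":
    # Hypothetical binding regions given as sets of aligned residue positions.
    toy = {"pbr1": {10, 11, 12, 13}, "pbr2": {11, 12, 13, 14}, "pbr3": {40, 41, 42}}
    print(complete_linkage(toy, min_similarity=0.5))
    print(complete_linkage(toy, min_similarity=0.7))
```

Running the clustering at several cut-offs, as in the two calls above, mirrors the flexibility the abstract describes: a lower cut-off groups overlapping regions together, while a higher one keeps them separate.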
The Number, Organization, and Size of Polymorphic Membrane Protein Coding Sequences as well as the Most Conserved Pmp Protein Differ within and across Chlamydia Species.

Science.gov (United States)

Van Lent, Sarah; Creasy, Heather Huot; Myers, Garry S A; Vanrompay, Daisy

2016-01-01

Variation is a central trait of the polymorphic membrane protein (Pmp) family. The number of pmp coding sequences differs between Chlamydia species, but it is unknown whether the number of pmp coding sequences is constant within a Chlamydia species. The level of conservation of the Pmp proteins has previously only been determined for Chlamydia trachomatis. As different Pmp proteins might be indispensable for the pathogenesis of different Chlamydia species, this study investigated the conservation of Pmp proteins both within and across C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci. The pmp coding sequences were annotated in 16 C. trachomatis, 6 C. pneumoniae, 2 C. abortus, and 16 C. psittaci genomes. The number and organization of polymorphic membrane coding sequences differed within and across the analyzed Chlamydia species. The length of coding sequences of pmpA, pmpB, and pmpH was conserved among all analyzed genomes, while the length of pmpE/F and pmpG, and remarkably also of the subtype pmpD, differed among the analyzed genomes. PmpD, PmpA, PmpH, and PmpA were the most conserved Pmp in C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci, respectively. PmpB was the most conserved Pmp across the 4 analyzed Chlamydia species. © 2016 S. Karger AG, Basel.
The Arabidopsis TOR Kinase Specifically Regulates the Expression of Nuclear Genes Coding for Plastidic Ribosomal Proteins and the Phosphorylation of the Cytosolic Ribosomal Protein S6.

Science.gov (United States)

Dobrenel, Thomas; Mancera-Martínez, Eder; Forzani, Céline; Azzopardi, Marianne; Davanture, Marlène; Moreau, Manon; Schepetilnikov, Mikhail; Chicher, Johana; Langella, Olivier; Zivy, Michel; Robaglia, Christophe; Ryabova, Lyubov A; Hanson, Johannes; Meyer, Christian

2016-01-01

Protein translation is an energy consuming process that has to be fine-tuned at both the cell and organism levels to match the availability of resources. The target of rapamycin kinase (TOR) is a key regulator of a large range of biological processes in response to environmental cues. In this study, we have investigated the effects of TOR inactivation on the expression and regulation of Arabidopsis ribosomal proteins at different levels of analysis, namely from transcriptomic to phosphoproteomic. TOR inactivation resulted in a coordinated down-regulation of the transcription and translation of nuclear-encoded mRNAs coding for plastidic ribosomal proteins, which could explain the chlorotic phenotype of the TOR silenced plants. We have identified in the 5' untranslated regions (UTRs) of this set of genes a conserved sequence related to the 5' terminal oligopyrimidine motif, which is known to confer translational regulation by the TOR kinase in other eukaryotes. Furthermore, the phosphoproteomic analysis of the ribosomal fraction following TOR inactivation revealed a lower phosphorylation of the conserved Ser240 residue in the C-terminal region of the 40S ribosomal protein S6 (RPS6). These results were confirmed by Western blot analysis using an antibody that specifically recognizes phosphorylated Ser240 in RPS6. Finally, this antibody was used to follow TOR activity in plants. Our results thus uncover a multi-level regulation of plant ribosomal genes and proteins by the TOR kinase.
Use of fluorescent proteins and color-coded imaging to visualize cancer cells with different genetic properties.

Science.gov (United States)

Hoffman, Robert M

2016-03-01

Fluorescent proteins, which are very bright and available in spectrally distinct colors, enable the imaging of color-coded cancer cells growing in vivo and therefore the distinction of cancer cells with different genetic properties. Non-invasive and intravital imaging of cancer cells with fluorescent proteins allows the visualization of distinct genetic variants of cancer cells down to the cellular level in vivo. Cancer cells with increased or decreased ability to metastasize can be distinguished in vivo. Gene exchange in vivo, which enables low-metastatic cancer cells to convert to high-metastatic cells, can be imaged in vivo with color coding. Cancer stem-like and non-stem cells can be distinguished in vivo by color-coded imaging. These properties also demonstrate the vast superiority of imaging cancer cells in vivo with fluorescent proteins over photon counting of luciferase-labeled cancer cells.
Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms.

Science.gov (United States)

Mattick, John S

2003-10-01

The central dogma of biology holds that genetic information normally flows from DNA to RNA to protein. As a consequence it has been generally assumed that genes generally code for proteins, and that proteins fulfil not only most structural and catalytic but also most regulatory functions, in all cells, from microbes to mammals. However, the latter may not be the case in complex organisms. A number of startling observations about the extent of non-protein-coding RNA (ncRNA) transcription in the higher eukaryotes and the range of genetic and epigenetic phenomena that are RNA-directed suggests that the traditional view of the structure of genetic regulatory systems in animals and plants may be incorrect. ncRNA dominates the genomic output of the higher organisms and has been shown to control chromosome architecture, mRNA turnover and the developmental timing of protein expression, and may also regulate transcription and alternative splicing. This paper re-examines the available evidence and suggests a new framework for considering and understanding the genomic programming of biological complexity, autopoietic development and phenotypic variation. Copyright 2003 Wiley Periodicals, Inc.

Regional Atmospheric Transport Code for Hanford Emission Tracking (RATCHET)

International Nuclear Information System (INIS)

Ramsdell, J.V. Jr.; Simonen, C.A.; Burk, K.W.

1994-02-01

The purpose of the Hanford Environmental Dose Reconstruction (HEDR) Project is to estimate radiation doses that individuals may have received from operations at the Hanford Site since 1944. This report deals specifically with the atmospheric transport model, Regional Atmospheric Transport Code for Hanford Emission Tracking (RATCHET). RATCHET is a major rework of the MESOILT2 model used in the first phase of the HEDR Project; only the bookkeeping framework escaped major changes. Changes to the code include (1) significant changes in the representation of atmospheric processes and (2) incorporation of Monte Carlo methods for representing uncertainty in input data, model parameters, and coefficients. To a large extent, the revisions to the model are based on recommendations of a peer working group that met in March 1991. Technical bases for other portions of the atmospheric transport model are addressed in two other documents. This report has three major sections: a description of the model, a user's guide, and a programmer's guide. These sections discuss RATCHET from three different perspectives. The first provides a technical description of the code with emphasis on details such as the representation of the model domain, the data required by the model, and the equations used to make the model calculations. The technical description is followed by a user's guide to the model with emphasis on running the code. The user's guide contains information about the model input and output. The third section is a programmer's guide to the code. It discusses the hardware and software required to run the code. The programmer's guide also discusses program structure and each of the program elements
Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes.

Science.gov (United States)

Kumar, Dhirendra; Mondal, Anupam Kumar; Kutum, Rintu; Dash, Debasis

2016-01-01

Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; a few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, a substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein-level experimental observations to annotate protein coding genes on a genome-wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. The chances of encountering such genes are higher in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies and their taxonomic distribution, and also urge targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Isolation and expression of the genes coding for the membrane bound transglycosylase B (MltB) and the transferrin binding protein B (TbpB) of the salmon pathogen Piscirickettsia salmonis

Directory of Open Access Journals (Sweden)

VIVIAN WILHELM

2004-01-01

We have isolated and sequenced the genes encoding the membrane bound transglycosylase B (MltB) and the transferrin binding protein B (TbpB) of the salmon pathogen Piscirickettsia salmonis. The sequencing results revealed two open reading frames that encode proteins with calculated molecular weights of 38,830 and 85,140. The deduced amino acid sequences of both proteins show significant homology to the respective proteins from phylogenetically related microorganisms. Partial sequences coding for the amino and carboxyl regions of MltB and a sequence of 761 base pairs encoding the amino region of TbpB have been expressed in E. coli. The strong humoral response elicited by these proteins in mice confirmed the immunogenic properties of the recombinant proteins. A similar response was elicited by both proteins when injected intraperitoneally in Atlantic salmon. The present data indicate that these proteins are good candidates to be used in formulations to study the protective immunity of salmon to infection by P. salmonis.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus

DEFF Research Database (Denmark)

Hansen, T S; Andreasen, P H; Dreisig, H

1991-01-01

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.
Discovery of rare protein-coding genes in model methylotroph Methylobacterium extorquens AM1.

Science.gov (United States)

Kumar, Dhirendra; Mondal, Anupam Kumar; Yadav, Amit Kumar; Dash, Debasis

2014-12-01

Proteogenomics involves the use of MS to refine annotation of protein-coding genes and discover genes in a genome. We carried out a comprehensive proteogenomic analysis of Methylobacterium extorquens AM1 (ME-AM1) from publicly available proteomics data with the aim of improving annotation for methylotrophs, organisms capable of surviving on reduced carbon compounds such as methanol. Besides identifying 2482 (50%) proteins, 29 new genes were discovered and 66 annotated gene models were revised in the ME-AM1 genome. One such novel gene is identified with 75 peptides and lacks a homolog in other methylobacteria, but has glycosyl transferase and lipopolysaccharide biosynthesis protein domains, indicating its potential role in outer membrane synthesis. Many novel genes are present only in ME-AM1 among methylobacteria. Distant homologs of these genes in unrelated taxonomic classes and the low GC-content of a few genes suggest lateral gene transfer as a potential mode of their origin. Annotations of methylotrophy-related genes were also improved by the discovery of a short gene in the methylotrophy gene island and by redefining a gene important for the synthesis of pyrroloquinoline quinone, which is essential for methylotrophy. The combined use of proteogenomics and rigorous bioinformatics analysis greatly enhanced the annotation of protein-coding genes in the model methylotroph ME-AM1 genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Evolutionarily conserved regions of the human c-myc protein can be uncoupled from transforming activity

International Nuclear Information System (INIS)

Sarid, J.; Halazonetis, T.D.; Murphy, W.; Leder, P.

1987-01-01

The myc family of oncogenes contains coding sequences that have been preserved in different species for over 400 million years. This conservation (which implies functional selection) is broadly represented throughout the C-terminal portion of the human c-myc protein but is largely restricted to three clusters of amino acid sequences in the N-terminal region. The authors have examined the role that the latter three regions of the c-myc protein might play in the transforming function of the c-myc gene. Several mutations, deletions and frameshifts were introduced into the c-myc gene, and these mutant genes were tested for their ability to collaborate with the EJ-ras oncogene to transform rat embryo fibroblasts. Complete elimination of the first two N-terminal conserved segments abolished transforming activity. In contrast, genes altered in a portion of the second or the entire third conserved segment retained their transforming activity. Thus, the latter two segments are not required for the transformation process, suggesting that they serve another function related only to the normal expression of the c-myc gene.
Development of TIGER code for radionuclide transport in a geochemically evolving region

International Nuclear Information System (INIS)

Mihara, Morihiro; Ooi, Takao

2004-01-01

In a transuranic (TRU) waste geological disposal facility, the use of cementitious materials is being considered. Cementitious materials will gradually dissolve in groundwater over the long term. In the already published performance assessment report of a TRU waste repository in Japan, the most conservative radionuclide migration parameter set was selected considering the evolving cementitious material. Therefore, a tool for calculating radionuclide transport that accounts for long-term geochemically evolving cementitious materials, named the TIGER code (Transport In Geochemically Evolving Region), was developed to support a more realistic performance assessment. It can calculate radionuclide transport in engineered and natural barrier systems. In this report, the mathematical equations of this code are described and validated against analytical solutions and the results of other codes for radionuclide transport. With the TIGER code, a more realistic calculation of radionuclide transport for a TRU waste geological disposal system could be performed. (author)
Regional and temporal variations in coding of hospital diagnoses referring to upper gastrointestinal and oesophageal bleeding in Germany

Directory of Open Access Journals (Sweden)

Garbe Edeltraut

2011-08-01

Background: Health insurance claims data are increasingly used for health services research in Germany. Hospital diagnoses in these data are coded according to the International Classification of Diseases, German modification (ICD-10-GM). Due to the historical division into West and East Germany, different coding practices might persist in both former parts. Additionally, the introduction of Diagnosis Related Groups (DRGs) in Germany in 2003/2004 might have changed the coding. The aim of this study was to investigate regional and temporal variations in coding of hospitalisation diagnoses in Germany. Methods: We analysed hospitalisation diagnoses for oesophageal bleeding (OB) and upper gastrointestinal bleeding (UGIB) from the official German Hospital Statistics provided by the Federal Statistical Office. Bleeding diagnoses were classified as "specific" (origin of bleeding provided) or "unspecific" (origin of bleeding not provided) coding. We studied regional (former East versus West Germany) differences in incidence of hospitalisations with specific or unspecific coding for OB and UGIB and temporal variations between 2000 and 2005. For each year, incidence ratios of hospitalisations for former East versus West Germany were estimated with log-linear regression models adjusting for age, gender and population density. Results: Significant differences in specific and unspecific coding between East and West Germany and over time were found for both OB and UGIB hospitalisation diagnoses. For example, in 2002, incidence ratios of hospitalisations for East versus West Germany were 1.24 (95% CI 1.16-1.32) for specific and 0.67 (95% CI 0.60-0.74) for unspecific OB diagnoses, and 1.43 (95% CI 1.36-1.51) for specific and 0.83 (95% CI 0.80-0.87) for unspecific UGIB diagnoses. Regional differences nearly disappeared and time trends were less marked when using combined specific and unspecific diagnoses of OB or UGIB, respectively. Conclusions: During the study
Cloud prediction of protein structure and function with PredictProtein for Debian.

Science.gov (United States)

Kaján, László; Yachdav, Guy; Vicedo, Esmeralda; Steinegger, Martin; Mirdita, Milot; Angermüller, Christof; Böhm, Ariane; Domke, Simon; Ertl, Julia; Mertes, Christian; Reisinger, Eva; Staniewski, Cedric; Rost, Burkhard

2013-01-01

We report the release of PredictProtein for the Debian operating system and derivatives, such as Ubuntu, Bio-Linux, and Cloud BioLinux. The PredictProtein suite is available as a standard set of open source Debian packages. The release covers the most popular prediction methods from the Rost Lab, including methods for the prediction of secondary structure and solvent accessibility (profphd), nuclear localization signals (predictnls), and intrinsically disordered regions (norsnet). We also present two case studies that successfully utilize PredictProtein packages for high performance computing in the cloud: the first analyzes protein disorder for whole organisms, and the second analyzes the effect of all possible single sequence variants in protein coding regions of the human genome.
MARG1D: One dimensional outer region matching data code

International Nuclear Information System (INIS)

Tokuda, Shinji; Watanabe, Tomoko.

1995-08-01

A code MARG1D has been developed which computes outer region matching data of the one dimensional Newcomb equation. Matching data play an important role in the resistive (and non ideal) Magneto-hydrodynamic (MHD) stability analysis in a tokamak plasma. The MARG1D code computes matching data by using the boundary value method or by the eigenvalue method. Variational principles are derived for the problems to be solved and a finite element method is applied. Except for the case of marginal stability, the eigenvalue method is equivalent to the boundary value method. However, the eigenvalue method has several advantages: it is a new method of ideal MHD stability analysis for which the marginally stable state can be identified, and it guarantees numerical stability in computing matching data close to marginal stability. We perform detailed numerical experiments for a model equation with analytical solutions and for the Newcomb equation in the m=1 mode theory. Numerical experiments show that the MARG1D code gives the matching data with numerical stability and high accuracy. (author)
MICROX-2: an improved two-region flux spectrum code for the efficient calculation of group cross sections

International Nuclear Information System (INIS)

Mathews, D.; Koch, P.

1979-12-01

The MICROX-2 code is an improved version of the MICROX code. The improvements allow MICROX-2 to be used for the efficient and rigorous preparation of broad group neutron cross sections for poorly moderated systems such as fast breeder reactors in addition to the well moderated thermal reactors for which MICROX was designed. MICROX-2 is an integral transport theory code which solves the neutron slowing down and thermalization equations on a detailed energy grid for two-region lattice cells. The fluxes in the two regions are coupled by transport corrected collision probabilities. The inner region may include two different types of grains (particles). Neutron leakage effects are treated by performing B1 slowing down and P0 plus DB² thermalization calculations in each region. Cell averaged diffusion coefficients are prepared with the Benoist cell homogenization prescription.
A HYDROCHEMICAL HYBRID CODE FOR ASTROPHYSICAL PROBLEMS. I. CODE VERIFICATION AND BENCHMARKS FOR A PHOTON-DOMINATED REGION (PDR)

International Nuclear Information System (INIS)

Motoyama, Kazutaka; Morata, Oscar; Hasegawa, Tatsuhiko; Shang, Hsien; Krasnopolsky, Ruben

2015-01-01

A two-dimensional hydrochemical hybrid code, KM2, is constructed to deal with astrophysical problems that would require coupled hydrodynamical and chemical evolution. The code assumes axisymmetry in a cylindrical coordinate system and consists of two modules: a hydrodynamics module and a chemistry module. The hydrodynamics module solves hydrodynamics using a Godunov-type finite volume scheme and treats included chemical species as passively advected scalars. The chemistry module implicitly solves nonequilibrium chemistry and change of energy due to thermal processes with transfer of external ultraviolet radiation. Self-shielding effects on photodissociation of CO and H2 are included. In this introductory paper, the adopted numerical method is presented, along with code verifications using the hydrodynamics module and a benchmark on the chemistry module with reactions specific to a photon-dominated region (PDR). Finally, as an example of the expected capability, the hydrochemical evolution of a PDR is presented based on the PDR benchmark.
Modeling disordered regions in proteins using Rosetta.

Directory of Open Access Journals (Sweden)

Ray Yu-Ruei Wang

Full Text Available Protein structure prediction methods such as Rosetta search for the lowest energy conformation of the polypeptide chain. However, the experimentally observed native state is at a minimum of the free energy, rather than the energy. The neglect of the missing configurational entropy contribution to the free energy can be partially justified by the assumption that the entropies of alternative folded states, while very much less than unfolded states, are not too different from one another, and hence can be to a first approximation neglected when searching for the lowest free energy state. The shortcomings of current structure prediction methods may be due in part to the breakdown of this assumption. Particularly problematic are proteins with significant disordered regions which do not populate single low energy conformations even in the native state. We describe two approaches within the Rosetta structure modeling methodology for treating such regions. The first does not require advance knowledge of the regions likely to be disordered; instead these are identified by minimizing a simple free energy function used previously to model protein folding landscapes and transition states. In this model, residues can be either completely ordered or completely disordered; they are considered disordered if the gain in entropy outweighs the loss of favorable energetic interactions with the rest of the protein chain. The second approach requires identification in advance of the disordered regions either from sequence alone using for example the DISOPRED server or from experimental data such as NMR chemical shifts. During Rosetta structure prediction calculations the disordered regions make only unfavorable repulsive contributions to the total energy. We find that the second approach has greater practical utility and illustrate this with examples from de novo structure prediction, NMR structure calculation, and comparative modeling.
Structural organization of poliovirus RNA replication is mediated by viral proteins of the P2 genomic region

International Nuclear Information System (INIS)

Bienz, K.; Egger, D.; Troxler, M.; Pasamontes, L.

1990-01-01

Transcriptionally active replication complexes bound to smooth membrane vesicles were isolated from poliovirus-infected cells. In electron microscopic, negatively stained preparations, the replication complex appeared as an irregularly shaped, oblong structure attached to several virus-induced vesicles of a rosettelike arrangement. Electron microscopic immunocytochemistry of such preparations demonstrated that the poliovirus replication complex contains the proteins coded by the P2 genomic region (P2 proteins) in a membrane-associated form. In addition, the P2 proteins are also associated with viral RNA, and they can be cross-linked to viral RNA by UV irradiation. Guanidine hydrochloride prevented the P2 proteins from becoming membrane bound but did not change their association with viral RNA. The findings allow the conclusion that the protein 2C or 2C-containing precursor(s) is responsible for the attachment of the viral RNA to the vesicular membrane and for the spatial organization of the replication complex necessary for its proper functioning in viral transcription. A model for the structure of the viral replication complex and for the function of the 2C-containing P2 protein(s) and the vesicular membranes is proposed
Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.

Science.gov (United States)

Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir

2017-08-01

Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions. We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle. An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Rampant adaptive evolution in regions of proteins with unknown function in Drosophila simulans.

Directory of Open Access Journals (Sweden)

Alisha K Holloway

2007-10-01

Full Text Available Adaptive protein evolution is pervasive in Drosophila. Genomic studies, thus far, have analyzed each protein as a single entity. However, the targets of adaptive events may be localized to particular parts of proteins, such as protein domains or regions involved in protein folding. We compared the population genetic mechanisms driving sequence polymorphism and divergence in defined protein domains and non-domain regions. Interestingly, we find that non-domain regions of proteins are more frequent targets of directional selection. Protein domains are also evolving under directional selection, but appear to be under stronger purifying selection than non-domain regions. Non-domain regions of proteins clearly play a major role in adaptive protein evolution on a genomic scale and merit future investigations of their functional properties.
Bioinformatics analysis identify novel OB fold protein coding genes in C. elegans.

Directory of Open Access Journals (Sweden)

Daryanaz Dargahi

Full Text Available BACKGROUND: The C. elegans genome has been extensively annotated by the WormBase consortium that uses state of the art bioinformatics pipelines, functional genomics and manual curation approaches. As a result, the identification of novel genes in silico in this model organism is becoming more challenging requiring new approaches. The Oligonucleotide-oligosaccharide binding (OB) fold is a highly divergent protein family, in which protein sequences, in spite of having the same fold, share very little sequence identity (5-25%). Therefore, evidence from sequence-based annotation may not be sufficient to identify all the members of this family. In C. elegans, the number of OB-fold proteins reported is remarkably low (n=46) compared to other evolutionary-related eukaryotes, such as yeast S. cerevisiae (n=344) or fruit fly D. melanogaster (n=84). Gene loss during evolution or differences in the level of annotation for this protein family, may explain these discrepancies. METHODOLOGY/PRINCIPAL FINDINGS: This study examines the possibility that novel OB-fold coding genes exist in the worm. We developed a bioinformatics approach that uses the most sensitive sequence-sequence, sequence-profile and profile-profile similarity search methods followed by 3D-structure prediction as a filtering step to eliminate false positive candidate sequences. We have predicted 18 coding genes containing the OB-fold that have remarkably partially been characterized in C. elegans. CONCLUSIONS/SIGNIFICANCE: This study raises the possibility that the annotation of highly divergent protein fold families can be improved in C. elegans. Similar strategies could be implemented for large scale analysis by the WormBase consortium when novel versions of the genome sequence of C. elegans, or other evolutionary related species are being released. This approach is of general interest to the scientific community since it can be used to annotate any genome.
Formation of a unique cluster of G-quadruplex structures in the HIV-1 Nef coding region: implications for antiviral activity.

Directory of Open Access Journals (Sweden)

Rosalba Perrone

Full Text Available G-quadruplexes are tetraplex structures of nucleic acids that can form in G-rich sequences. Their presence and functional role have been established in telomeres, oncogene promoters and coding regions of the human chromosome. In particular, they have been proposed to be directly involved in gene regulation at the level of transcription. Because the HIV-1 Nef protein is a fundamental factor for efficient viral replication, infectivity and pathogenesis in vitro and in vivo, we investigated G-quadruplex formation in the HIV-1 nef gene to assess the potential for viral inhibition through G-quadruplex stabilization. A comprehensive computational analysis of the nef coding region of available strains showed the presence of three conserved sequences that were uniquely clustered. Biophysical testing proved that G-quadruplex conformations were efficiently stabilized or induced by G-quadruplex ligands in all three sequences. Upon incubation with a G-quadruplex ligand, Nef expression was reduced in a reporter gene assay and Nef-dependent enhancement of HIV-1 infectivity was significantly repressed in an antiviral assay. These data constitute the first evidence of the possibility to regulate HIV-1 gene expression and infectivity through G-quadruplex targeting and therefore open a new avenue for viral treatment.
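The abstract above does not say how the computational analysis of the nef coding region was carried out. As a minimal sketch only, putative G-quadruplex-forming sequences are often located by scanning for four runs of guanines separated by short loops. The pattern (runs of at least 3 G, loops of 1-7 nt), the function name find_g4_motifs and the toy sequence are assumptions for illustration, not the authors' pipeline, and only the given strand is scanned.

```python
import re

# Assumed motif: four runs of >= 3 guanines separated by loops of 1-7 nucleotides.
# This is a commonly used heuristic, not the criterion stated in the abstract.
G4_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}?G{3,}){3}")


def find_g4_motifs(seq):
    """Return (start, end, motif) for each non-overlapping putative G4 motif."""
    seq = seq.upper()
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(seq)]


# Made-up toy sequence, not taken from any HIV-1 strain.
toy = "ATGGGTAGGGCTGGGAAGGGTTT"
print(find_g4_motifs(toy))
```

Motifs on the complementary strand would appear as runs of C in this sequence, so a fuller scan would also search the reverse complement.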
Selection on Coding and Regulatory Variation Maintains Individuality in Major Urinary Protein Scent Marks in Wild Mice.

Directory of Open Access Journals (Sweden)

Michael J Sheehan

2016-03-01

Full Text Available Recognition of individuals by scent is widespread across animal taxa. Though animals can often discriminate chemical blends based on many compounds, recent work shows that specific protein pheromones are necessary and sufficient for individual recognition via scent marks in mice. The genetic nature of individuality in scent marks (e.g. coding versus regulatory variation) and the evolutionary processes that maintain diversity are poorly understood. The individual signatures in scent marks of house mice are the protein products of a group of highly similar paralogs in the major urinary protein (Mup) gene family. Using the offspring of wild-caught mice, we examine individuality in the major urinary protein (MUP) scent marks at the DNA, RNA and protein levels. We show that individuality arises through a combination of variation at amino acid coding sites and differential transcription of central Mup genes across individuals, and we identify eSNPs in promoters. There is no evidence of post-transcriptional processes influencing phenotypic diversity as transcripts accurately predict the relative abundance of proteins in urine samples. The match between transcripts and urine samples taken six months earlier also emphasizes that the proportional relationships across central MUP isoforms in urine is stable. Balancing selection maintains coding variants at moderate frequencies, though pheromone diversity appears limited by interactions with vomeronasal receptors. We find that differential transcription of the central Mup paralogs within and between individuals significantly increases the individuality of pheromone blends. Balancing selection on gene regulation allows for increased individuality via combinatorial diversity in a limited number of pheromones.
Tetrahymena thermophila acidic ribosomal protein L37 contains an archaebacterial type of C-terminus.

Science.gov (United States)

Hansen, T S; Andreasen, P H; Dreisig, H; Højrup, P; Nielsen, H; Engberg, J; Kristiansen, K

1991-09-15

We have cloned and characterized a Tetrahymena thermophila macronuclear gene (L37) encoding the acidic ribosomal protein (A-protein) L37. The gene contains a single intron located in the 3'-part of the coding region. Two major and three minor transcription start points (tsp) were mapped 39 to 63 nucleotides upstream from the translational start codon. The uppermost tsp mapped to the first T in a putative T. thermophila RNA polymerase II initiator element, TATAA. The coding region of L37 predicts a protein of 109 amino acid (aa) residues. A substantial part of the deduced aa sequence was verified by protein sequencing. The T. thermophila L37 clearly belongs to the P1-type family of eukaryotic A-proteins, but the C-terminal region has the hallmarks of archaebacterial A-proteins.

Photoactivatable Mussel-Based Underwater Adhesive Proteins by an Expanded Genetic Code.

Science.gov (United States)

Hauf, Matthias; Richter, Florian; Schneider, Tobias; Faidt, Thomas; Martins, Berta M; Baumann, Tobias; Durkin, Patrick; Dobbek, Holger; Jacobs, Karin; Möglich, Andreas; Budisa, Nediljko

2017-09-19

Marine mussels exhibit potent underwater adhesion abilities under hostile conditions by employing 3,4-dihydroxyphenylalanine (DOPA)-rich mussel adhesive proteins (MAPs). However, their recombinant production is a major biotechnological challenge. Herein, a novel strategy based on genetic code expansion has been developed by engineering efficient aminoacyl-transfer RNA synthetases (aaRSs) for the photocaged noncanonical amino acid ortho-nitrobenzyl DOPA (ONB-DOPA). The engineered ONB-DOPARS enables in vivo production of MAP type 5 site-specifically equipped with multiple instances of ONB-DOPA to yield photocaged, spatiotemporally controlled underwater adhesives. Upon exposure to UV light, these proteins feature elevated wet adhesion properties. This concept offers new perspectives for the production of recombinant bioadhesives. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Arabidopsis RNASE THREE LIKE2 Modulates the Expression of Protein-Coding Genes via 24-Nucleotide Small Interfering RNA-Directed DNA Methylation.

Science.gov (United States)

Elvira-Matelot, Emilie; Hachet, Mélanie; Shamandi, Nahid; Comella, Pascale; Sáez-Vásquez, Julio; Zytnicki, Matthias; Vaucheret, Hervé

2016-02-01

RNaseIII enzymes catalyze the cleavage of double-stranded RNA (dsRNA) and have diverse functions in RNA maturation. Arabidopsis thaliana RNASE THREE LIKE2 (RTL2), which carries one RNaseIII and two dsRNA binding (DRB) domains, is a unique Arabidopsis RNaseIII enzyme resembling the budding yeast small interfering RNA (siRNA)-producing Dcr1 enzyme. Here, we show that RTL2 modulates the production of a subset of small RNAs and that this activity depends on both its RNaseIII and DRB domains. However, the mode of action of RTL2 differs from that of Dcr1. Whereas Dcr1 directly cleaves dsRNAs into 23-nucleotide siRNAs, RTL2 likely cleaves dsRNAs into longer molecules, which are subsequently processed into small RNAs by the DICER-LIKE enzymes. Depending on the dsRNA considered, RTL2-mediated maturation either improves (RTL2-dependent loci) or reduces (RTL2-sensitive loci) the production of small RNAs. Because the vast majority of RTL2-regulated loci correspond to transposons and intergenic regions producing 24-nucleotide siRNAs that guide DNA methylation, RTL2 depletion modifies DNA methylation in these regions. Nevertheless, 13% of RTL2-regulated loci correspond to protein-coding genes. We show that changes in 24-nucleotide siRNA levels also affect DNA methylation levels at such loci and inversely correlate with mRNA steady state levels, thus implicating RTL2 in the regulation of protein-coding gene expression. © 2016 American Society of Plant Biologists. All rights reserved.
Isotope coded protein labeling coupled immunoprecipitation (ICPL-IP): a novel approach for quantitative protein complex analysis from native tissue.

Science.gov (United States)

Vogt, Andreas; Fuerholzner, Bettina; Kinkl, Norbert; Boldt, Karsten; Ueffing, Marius

2013-05-01

High confidence definition of protein interactions is an important objective toward the understanding of biological systems. Isotope labeling in combination with affinity-based isolation of protein complexes has increased in accuracy and reproducibility, yet larger organisms, including humans, are hardly accessible to metabolic labeling and thus, a major limitation has been its restriction to small animals, cell lines, and yeast. As composition as well as the stoichiometry of protein complexes can significantly differ in primary tissues, there is a great demand for methods capable of combining the selectivity of affinity-based isolation as well as the accuracy and reproducibility of isotope-based labeling with its application toward analysis of protein interactions from intact tissue. Toward this goal, we combined isotope coded protein labeling (ICPL) with immunoprecipitation (IP) and quantitative mass spectrometry (MS). ICPL-IP allows sensitive and accurate analysis of protein interactions from primary tissue. We applied ICPL-IP to immuno-isolate protein complexes from bovine retinal tissue. Protein complexes of immunoprecipitated β-tubulin, a highly abundant protein with known interactors as well as the lowly expressed small GTPase RhoA were analyzed. The results of both analyses demonstrate sensitive and selective identification of known as well as new protein interactions by our method.
Isotope Coded Protein Labeling Coupled Immunoprecipitation (ICPL-IP): A Novel Approach for Quantitative Protein Complex Analysis From Native Tissue

Science.gov (United States)

Vogt, Andreas; Fuerholzner, Bettina; Kinkl, Norbert; Boldt, Karsten; Ueffing, Marius

2013-01-01

High confidence definition of protein interactions is an important objective toward the understanding of biological systems. Isotope labeling in combination with affinity-based isolation of protein complexes has increased in accuracy and reproducibility, yet larger organisms, including humans, are hardly accessible to metabolic labeling and thus, a major limitation has been its restriction to small animals, cell lines, and yeast. As composition as well as the stoichiometry of protein complexes can significantly differ in primary tissues, there is a great demand for methods capable of combining the selectivity of affinity-based isolation as well as the accuracy and reproducibility of isotope-based labeling with its application toward analysis of protein interactions from intact tissue. Toward this goal, we combined isotope coded protein labeling (ICPL) with immunoprecipitation (IP) and quantitative mass spectrometry (MS). ICPL-IP allows sensitive and accurate analysis of protein interactions from primary tissue. We applied ICPL-IP to immuno-isolate protein complexes from bovine retinal tissue. Protein complexes of immunoprecipitated β-tubulin, a highly abundant protein with known interactors as well as the lowly expressed small GTPase RhoA were analyzed. The results of both analyses demonstrate sensitive and selective identification of known as well as new protein interactions by our method. PMID:23268931
The nucleotide sequence of human transition protein 1 cDNA

Energy Technology Data Exchange (ETDEWEB)

Luerssen, H; Hoyer-Fender, S; Engel, W [Universitaet Goettingen (West Germany)]

1988-08-11

The authors have screened a human testis cDNA library with an oligonucleotide of 81 mer prepared according to a part of the published nucleotide sequence of the rat transition protein TP 1. They have isolated a cDNA clone with the length of 441 bp containing the coding region of 162 bp for human transition protein 1. There is about 84% homology in the coding region of the sequence compared to rat. The human cDNA-clone encodes a polypeptide of 54 amino acids of which 7 are different to that of rat.
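The figures in this record can be cross-checked with simple arithmetic. The short sketch below uses only the numbers quoted in the abstract (162 bp, 54 amino acids, 7 differences); the variable names are illustrative, and the resulting amino acid identity is a derived value that the abstract itself does not state.

```python
# Numbers quoted in the abstract above.
coding_region_bp = 162   # length of the coding region
amino_acids = 54         # length of the encoded polypeptide
differing_residues = 7   # residues that differ between human and rat

# Each amino acid is specified by one codon of three nucleotides.
codons = coding_region_bp // 3

# Derived value (not stated in the abstract): percent amino acid identity.
aa_identity_percent = 100 * (amino_acids - differing_residues) / amino_acids

print(codons, round(aa_identity_percent, 1))
```

The 162 bp therefore correspond exactly to the 54 codons of the polypeptide, and the amino acid identity to rat comes out close to the roughly 84% nucleotide homology reported for the coding region.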
Identification of coding and non-coding mutational hotspots in cancer genomes.

Science.gov (United States)

Piraino, Scott W; Furney, Simon J

2017-01-05

The identification of mutations that play a causal role in tumour development, so-called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutational hotspots and can be used to differentiate candidate driver regions from
The isoelectric region of proteins: a systematic analysis.

Directory of Open Access Journals (Sweden)

Michael Widmann

Full Text Available BACKGROUND: Binding of proteins in ion exchange chromatography is dominated by electrostatic interactions and can be tuned by adjusting pH and ionic strength of the solvent. Therefore, the isoelectric region (IER), the pH region of almost zero charge near the pI, has been used to predict the binding properties of proteins. PRINCIPAL FINDINGS: Usually the IER is small and binding and elution are carried out at pH values near to the pI. However, some proteins with an extended IER have been shown to bind and elute far away from their pI. To analyze factors that mediate the size of the IER and to identify proteins with an extended IER, two protein families consisting of more than 7000 proteins were systematically investigated. Most proteins were found to have a small IER and thus are expected to bind or elute near to their pI, while only a small fraction of less than 2% had a large IER. CONCLUSIONS: Only four factors, the number of histidines, the pI, the number of titratable amino acids and the ratio of acidic to basic residues, are sufficient to reliably classify proteins by their IER based on their sequence only, and thus to predict their binding and elution behaviour in ion exchange chromatography.
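To make the idea of an isoelectric region concrete, the sketch below estimates the net charge of a sequence as a function of pH with the Henderson-Hasselbalch equation and reports the pH range in which the absolute charge stays below a threshold. The pKa values, the threshold of 0.5 charge units, the 0.01 pH grid and the function names are all assumptions chosen for illustration; the abstract does not give the authors' charge model or their definition of "almost zero charge".

```python
# Assumed, illustrative pKa values; published pKa sets differ from one another.
POSITIVE = {"K": 10.5, "R": 12.5, "H": 6.0, "Nterm": 9.0}
NEGATIVE = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1, "Cterm": 2.0}


def net_charge(seq, ph):
    """Henderson-Hasselbalch estimate of the net charge of a sequence at a given pH."""
    counts = {aa: seq.count(aa) for aa in "KRHDECY"}
    counts["Nterm"] = counts["Cterm"] = 1  # one free terminus of each kind
    pos = sum(counts[g] / (1 + 10 ** (ph - pka)) for g, pka in POSITIVE.items())
    neg = sum(counts[g] / (1 + 10 ** (pka - ph)) for g, pka in NEGATIVE.items())
    return pos - neg


def isoelectric_region(seq, threshold=0.5):
    """Return (low_pH, high_pH) where |net charge| < threshold on a 0.01 pH grid.

    The threshold is an assumed stand-in for "almost zero charge".
    """
    grid = [i / 100 for i in range(1401)]  # pH 0.00 to 14.00
    inside = [ph for ph in grid if abs(net_charge(seq, ph)) < threshold]
    return (min(inside), max(inside)) if inside else None


# Made-up peptide, used only to exercise the functions.
print(isoelectric_region("MKDEHHRAYC"))
```

Because the estimated charge decreases monotonically with pH, the pH values that satisfy the threshold form a single interval around the pI, and its width is one simple measure of the size of the IER.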
A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

Science.gov (United States)

Kress, W John; Erickson, David L

2007-06-06

A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
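The comparison of single loci against locus pairs can be illustrated with a small sketch. It assumes, purely for illustration, that a locus "discriminates" a species pair whenever their aligned sequences differ at one or more sites, and that a combination of loci discriminates when any of its loci does; the abstract does not state the authors' exact criterion. The function names and the toy sequences are made up.

```python
def p_distance(a, b):
    """Proportion of differing sites between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def discrimination_rate(pairs, loci):
    """Fraction of genera whose species pair differs at one or more of the given loci."""
    hits = sum(
        any(p_distance(*genus[locus]) > 0 for locus in loci)
        for genus in pairs.values()
    )
    return hits / len(pairs)


# Toy, made-up aligned sequences: genus -> locus -> (species 1, species 2).
toy = {
    "GenusA": {"rbcL": ("ACGTAC", "ACGTAC"), "trnH-psbA": ("TTGACA", "TTGGCA")},
    "GenusB": {"rbcL": ("ACGTAC", "ACGTTC"), "trnH-psbA": ("TTGACA", "TTGACA")},
    "GenusC": {"rbcL": ("ACGTAC", "ACGTAC"), "trnH-psbA": ("TTGACA", "TTGACA")},
}

print(discrimination_rate(toy, ["rbcL"]))
print(discrimination_rate(toy, ["rbcL", "trnH-psbA"]))
```

In the toy data each locus alone separates one of the three genera, while the pair separates two, which mirrors the qualitative pattern reported above without reproducing any of its figures.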
Determination and application of immunodominant regions of SARS coronavirus spike and nucleocapsid proteins recognized by sera from different animal species.

Science.gov (United States)

Yu, Meng; Stevens, Vicky; Berry, Jody D; Crameri, Gary; McEachern, Jennifer; Tu, Changchun; Shi, Zhengli; Liang, Guodong; Weingartl, Hana; Cardosa, Jane; Eaton, Bryan T; Wang, Lin-Fa

2008-02-29

Knowledge of immunodominant regions in major viral antigens is important for rational design of effective vaccines and diagnostic tests. Although there have been many reports of such work done for SARS-CoV, these were mainly focused on the immune responses of humans and mice. In this study, we aim to search for and compare immunodominant regions of the spike (S) and nucleocapsid (N) proteins which are recognized by sera from different animal species, including mouse, rat, rabbit, civet, pig and horse. Twelve overlapping recombinant protein fragments were produced in Escherichia coli, six each for the S and N proteins, which covered the entire coding region of the two proteins. Using a membrane-strip based Western blot approach, the reactivity of each antigen fragment against a panel of animal sera was determined. Immunodominant regions containing linear epitopes, which reacted with sera from all the species tested, were identified for both proteins. The S3 fragment (aa 402-622) and the N4 fragment (aa 220-336) were the most immunodominant among the six S and N fragments, respectively. Antibodies raised against the S3 fragment were able to block the binding of a panel of S-specific monoclonal antibodies (mAb) to SARS-CoV in ELISA, further demonstrating the immunodominance of this region. Based on these findings, one-step competition ELISAs were established which were able to detect SARS-CoV antibodies from human and at least seven different animal species. Considering that a large number of animal species are known to be susceptible to SARS-CoV, these assays will be a useful tool to trace the origin and transmission of SARS-CoV and to minimise the risk of animal-to-human transmission.
The neurovirulence and neuroinvasiveness of chimeric tick-borne encephalitis/dengue virus can be attenuated by introducing defined mutations into the envelope and NS5 protein genes and the 3' non-coding region of the genome

International Nuclear Information System (INIS)

Engel, Amber R.; Rumyantsev, Alexander A.; Maximova, Olga A.; Speicher, James M.; Heiss, Brian; Murphy, Brian R.; Pletnev, Alexander G.

2010-01-01

Tick-borne encephalitis (TBE) is a severe disease affecting thousands of people throughout Eurasia. Despite the use of formalin-inactivated vaccines in endemic areas, an increasing incidence of TBE emphasizes the need for an alternative vaccine that will induce a more durable immunity against TBE virus (TBEV). The chimeric attenuated virus vaccine candidate containing the structural protein genes of TBEV on a dengue virus genetic background (TBEV/DEN4) retains a high level of neurovirulence in both mice and monkeys. Therefore, attenuating mutations were introduced into the envelope (E315) and NS5 (NS5 654,655) proteins, and into the 3' non-coding region (Δ30) of TBEV/DEN4. The variant that contained all three mutations (vΔ30/E315/NS5 654,655) was significantly attenuated for neuroinvasiveness and neurovirulence and displayed a reduced level of replication and virus-induced histopathology in the brains of mice. The high level of safety in the central nervous system indicates that vΔ30/E315/NS5 654,655 should be further evaluated as a TBEV vaccine.
Isolation and expression of a novel chick G-protein cDNA coding for a G alpha i3 protein with a G alpha 0 N-terminus.

OpenAIRE

Kilbourne, E J; Galper, J B

1994-01-01

We have cloned cDNAs coding for G-protein alpha subunits from a chick brain cDNA library. Based on sequence similarity to G-protein alpha subunits from other eukaryotes, one clone was designated G alpha i3. A second clone, G alpha i3-o, was identical to the G alpha i3 clone over 932 bases on the 3' end. The 5' end of G alpha i3-o, however, contained an alternative sequence in which the first 45 amino acids coded for are 100% identical to the conserved N-terminus of G alpha o from species such...
nocoRNAc: Characterization of non-coding RNAs in prokaryotes

Directory of Open Access Journals (Sweden)

Nieselt Kay

2011-01-01

Background: Interest in non-coding RNAs (ncRNAs) has risen constantly during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed, but most of them predict a genomic locus with no indication of whether the element is transcribed or not. Results: We present NOCORNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. NOCORNAc incorporates various procedures for the detection of transcriptional features, which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and NOCORNAc to the genome of Streptomyces coelicolor and detected more than 800 putative ncRNA transcripts, most of them located antisense to protein-coding regions. Using a custom-designed microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed; 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are as complex as those of the protein-coding genes; in particular, many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions: We have developed NOCORNAc, a framework that facilitates the automated characterization of functional ncRNAs. NOCORNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. NOCORNAc is not restricted to
Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

KAUST Repository

Zhang, Zhang; Yu, Jun

2010-11-08

Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution, and a large number of publicly available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge. Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences. Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms. Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.
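The two compositional parameters named in this abstract, GC content and purine content at each of the three codon positions, can be measured directly from a coding sequence. The following is a minimal sketch of that measurement only, not of the authors' models; the function name and the toy sequence are hypothetical.

```python
from collections import Counter


def positional_composition(cds: str) -> list[dict[str, float]]:
    """Return GC and purine (A+G) fractions at codon positions 1, 2 and 3.

    Hypothetical helper for illustration; it assumes the sequence starts
    in frame and ignores any trailing partial codon.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop a trailing partial codon
    results = []
    for pos in range(3):
        bases = cds[pos:usable:3]  # every third base, starting at this codon position
        counts = Counter(bases)
        n = len(bases)
        gc = (counts["G"] + counts["C"]) / n if n else 0.0
        purine = (counts["A"] + counts["G"]) / n if n else 0.0
        results.append({"gc": gc, "purine": purine})
    return results


# Toy sequence (not real data): codons ATG GCC AAG TGA
composition = positional_composition("ATGGCCAAGTGA")
for position, values in enumerate(composition, start=1):
    print(position, values)
```

Reporting the three positions separately reflects the abstract's point that mutation and selection act differently at each codon position.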
Single nucleotide polymorphisms (SNPs) in coding regions of canine dopamine- and serotonin-related genes

Directory of Open Access Journals (Sweden)

Lingaas Frode

2008-01-01

Background: Polymorphisms in genes of regulating enzymes, transporters and receptors of the neurotransmitters of the central nervous system have been associated with altered behaviour, and single nucleotide polymorphisms (SNPs) represent the most frequent type of genetic variation. The serotonin and dopamine signalling systems have a central influence on different behavioural phenotypes, both of invertebrates and vertebrates, and this study was undertaken in order to explore genetic variation that may be associated with variation in behaviour. Results: Single nucleotide polymorphisms in canine genes related to behaviour were identified by individually sequencing eight dogs (Canis familiaris) of different breeds. Eighteen genes from the dopamine and the serotonin systems were screened, revealing 34 SNPs distributed in 14 of the 18 selected genes. A total of 24,895 bp of coding sequence was sequenced, yielding an average frequency of one SNP per 732 bp (1/732). A total of 11 non-synonymous SNPs (nsSNPs), which may be involved in alteration of protein function, were detected. Of these 11 nsSNPs, six resulted in a substitution of an amino acid residue with a concomitant change in structural parameters. Conclusion: We have identified a number of coding SNPs in behaviour-related genes, several of which change the amino acids of the proteins. Some of the canine SNPs exist in codons that are evolutionarily conserved between five compared species, and predictions indicate that they may have a functional effect on the protein. The reported coding SNP frequency of the studied genes falls within the range of SNP frequencies reported earlier in the dog and other mammalian species. Novel SNPs are presented and the results show a significant genetic variation in expressed sequences in this group of genes. The results can contribute to an improved understanding of the genetics of behaviour.
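The reported density of one SNP per 732 bp follows from the two totals given in the abstract (24,895 bp sequenced, 34 SNPs found). A small sketch of that arithmetic, with a hypothetical helper name:

```python
def bp_per_snp(sequenced_bp: int, snp_count: int) -> float:
    """Average number of sequenced base pairs per SNP (hypothetical helper)."""
    if snp_count <= 0:
        raise ValueError("snp_count must be positive")
    return sequenced_bp / snp_count


# Totals taken from the abstract above
spacing = bp_per_snp(24_895, 34)
nonsynonymous_share = 11 / 34  # 11 of the 34 SNPs were non-synonymous

print(f"one SNP per {round(spacing)} bp")
print(f"non-synonymous share: {nonsynonymous_share:.1%}")
```

The same two totals also show that roughly a third of the coding SNPs found were non-synonymous.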
Mutational analysis of the promoter and the coding region of the 5-HT1A gene

Energy Technology Data Exchange (ETDEWEB)

Erdmann, J.; Noethen, M.M.; Shimron-Abarbanell, D. [Univ. of Bonn (Germany)] [and others]

1994-09-01

Disturbances of serotonergic pathways have been implicated in many neuropsychiatric disorders. Serotonin (5HT) receptors can be subdivided into at least three major families (5HT1, 5HT2, and 5HT3). Five human 5HT1 receptor subtypes have been cloned, namely 1A, 1Dα, 1Dβ, 1E, and 1F. Of these, the 5HT1A receptor is the best characterized subtype. In the present study we sought to identify genetic variation in the 5HT1A receptor gene which, through alteration of protein function or level of expression, might contribute to the genetics of neuropsychiatric diseases. The coding region and the 5′ promoter region of the 5HT1A gene from 159 unrelated subjects (45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette's syndrome, as well as 25 controls) were analyzed using SSCA. SSCA revealed the presence of two mutations, both located in the coding region of the 5HT1A receptor gene. The first mutation is a rare silent C→T substitution at nucleotide position 549. The second mutation is characterized by a base pair substitution (A→G) at the first position of codon 28 and results in an amino acid exchange (Ile→Val). Since Val28 was found only in a single schizophrenic patient and in none of the other patients or controls, we decided to extend our samples and to use a restriction assay for screening a further 74 schizophrenic, 95 bipolar affective, and 49 patients with Tourette's syndrome, as well as 185 controls, for the presence of the mutation. In total, the mutation was found in 2 schizophrenic patients, in 3 bipolars, in 1 Tourette patient, and in 5 controls. To our knowledge the Ile-28-Val substitution reported here is the first naturally occurring molecular variant identified for a serotonin receptor so far.
The structural analysis of protein sequences based on the quasi-amino acids code

International Nuclear Information System (INIS)

Ping, Zhu; Xu-Qing, Tang; Zhen-Yuan, Xu

2009-01-01

Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project comes the postgenome era, in which proteomics technology is emerging. This paper studies the protein molecule from the algebraic point of view. The algebraic system (Σ, +, *) is introduced, where Σ is the set of 64 codons. According to the characteristics of (Σ, +, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation among the quasi-amino acids is revealed. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlations may play an important part in establishing the logical relationship between codons and the quasi-amino acids during the origin of life. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties of the amino acids code are very difficult to observe. The present paper shows that (ZU, ⊕, ) is a field. Furthermore, the operational results show that the codon tga has a different property from the other stop codons. In fact, in the human and ox mitochondrial genetic code, tga codes for tryptophan rather than acting as a stop codon as in other genetic codes, as reported by Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code; in other words, it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3). (cross-disciplinary physics and related areas of science and technology)
Protein-protein docking using region-based 3D Zernike descriptors.

Science.gov (United States)

Venkatraman, Vishwesh; Yang, Yifeng D; Sael, Lee; Kihara, Daisuke

2009-12-09

Protein-protein interactions are a pivotal component of many biological processes and mediate a variety of functions. Knowing the tertiary structure of a protein complex is therefore essential for understanding the interaction mechanism. However, experimental techniques to solve the structure of the complex are often found to be difficult. To this end, computational protein-protein docking approaches can provide a useful alternative to address this issue. Prediction of docking conformations relies on methods that effectively capture shape features of the participating proteins while giving due consideration to conformational changes that may occur. We present a novel protein docking algorithm based on the use of 3D Zernike descriptors as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. Docking decoys are generated using geometric hashing, which are then ranked by a scoring function that incorporates a buried surface area and a novel geometric complementarity term based on normals associated with the 3D Zernike shape description. Our docking algorithm was tested on both bound and unbound cases in the ZDOCK benchmark 2.0 dataset. In 74% of the bound docking predictions, our method was able to find a near-native solution (interface C-alphaRMSD 3D Zernike descriptors are adept in capturing shape complementarity at the protein-protein interface and useful for protein docking prediction. Rigorous benchmark studies show that our docking approach has a superior performance compared to existing methods.
Numerical code to determine the particle trapping region in the LISA machine

International Nuclear Information System (INIS)

Azevedo, M.T. de; Raposo, C.C. de; Tomimura, A.

1984-01-01

A numerical code is constructed to determine the trapping region in machines like LISA. The variable magnetic field is two-dimensional and is coupled to the Runge-Kutta method through the Tchebichev polynomial. Various particle orbits, including particle interactions, were analysed. Besides this, a strong electric field is introduced to see the possible effects inside the plasma. (Author) [pt]
A Bipartite Network-based Method for Prediction of Long Non-coding RNA–protein Interactions

Directory of Open Access Journals (Sweden)

Mengqu Ge

2016-02-01

As one large class of non-coding RNAs (ncRNAs), long ncRNAs (lncRNAs) have gained considerable attention in recent years. Mutations and dysfunction of lncRNAs have been implicated in human disorders. Many lncRNAs exert their effects through interactions with the corresponding RNA-binding proteins. Several computational approaches have been developed, but only a few are able to predict these interactions from a network-based point of view. Here, we introduce a computational method named lncRNA–protein bipartite network inference (LPBNI). LPBNI aims to identify potential lncRNA–interacting proteins by making full use of the known lncRNA–protein interactions. A leave-one-out cross validation (LOOCV) test shows that LPBNI significantly outperforms other network-based methods, including random walk (RWR) and protein-based collaborative filtering (ProCF). Furthermore, a case study was performed to demonstrate the performance of LPBNI using real data in predicting potential lncRNA–interacting proteins.
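The abstract does not spell out how the bipartite network is used for inference. One common approach on bipartite graphs is two-step resource allocation, sketched below on a toy network; whether LPBNI matches this in detail is an assumption, and the function name, variable names and interaction data are all hypothetical.

```python
def bipartite_scores(interactions: dict[str, set[str]], lncrna: str) -> dict[str, float]:
    """Score every protein for `lncrna` by two-step resource spreading.

    `interactions` maps each lncRNA to the set of proteins it is known to bind.
    Illustrative sketch only, not the published LPBNI implementation.
    """
    # Degree of each protein = number of lncRNAs known to bind it
    protein_degree: dict[str, int] = {}
    for proteins in interactions.values():
        for p in proteins:
            protein_degree[p] = protein_degree.get(p, 0) + 1

    # Step 1: each protein bound by the query lncRNA holds one unit of
    # resource and splits it equally among all lncRNAs that bind it.
    lnc_resource = {l: 0.0 for l in interactions}
    for p in interactions[lncrna]:
        share = 1.0 / protein_degree[p]
        for l, proteins in interactions.items():
            if p in proteins:
                lnc_resource[l] += share

    # Step 2: each lncRNA splits what it received equally among its proteins.
    scores = {p: 0.0 for p in protein_degree}
    for l, proteins in interactions.items():
        if proteins:
            share = lnc_resource[l] / len(proteins)
            for p in proteins:
                scores[p] += share
    return scores


# Toy network (not real data)
known = {"lncA": {"P1", "P2"}, "lncB": {"P2", "P3"}, "lncC": {"P3"}}
scores = bipartite_scores(known, "lncA")
candidates = {p: s for p, s in scores.items() if p not in known["lncA"]}
print(candidates)
```

Proteins not yet linked to the query lncRNA are ranked by their final score. In this toy case P3 gets a nonzero score because it shares a binding partner (lncB) with a protein that lncA already binds.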

Cloning and identification of the gene coding for the 140-kd subunit of Drosophila RNA polymerase II

OpenAIRE

Faust, Daniela M.; Renkawitz-Pohl, Renate; Falkenburg, Dieter; Gasch, Alexander; Bialojan, Siegfried; Young, Richard A.; Bautz, Ekkehard K. F.

1986-01-01

Genomic clones of Drosophila melanogaster were isolated from a λ library by cross-hybridization with the yeast gene coding for the 150-kd subunit of RNA polymerase II. Clones containing a region of ∼2.0 kb with strong homology to the yeast gene were shown to code for a 3.9-kb poly(A)+-RNA. Part of the coding region was cloned into an expression vector. A fusion protein was obtained which reacted with an antibody directed against RNA polymerase II of Drosophila. Peptide mapping of the fusion p...
Protein Structure and the Sequential Structure of mRNA

DEFF Research Database (Denmark)

Brunak, Søren; Engelbrecht, Jacob

1996-01-01

A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from...
Combinatorial Control of mRNA Fates by RNA-Binding Proteins and Non-Coding RNAs

Directory of Open Access Journals (Sweden)

Valentina Iadevaia

2015-09-01

Post-transcriptional control of gene expression is mediated by RNA-binding proteins (RBPs) and small non-coding RNAs (e.g., microRNAs) that bind to distinct elements in their mRNA targets. Here, we review recent examples describing the synergistic and/or antagonistic effects mediated by RBPs and miRNAs to determine the localisation, stability and translation of mRNAs in mammalian cells. From these studies, it is becoming increasingly apparent that dynamic rearrangements of RNA-protein complexes could have profound implications in human cancer, in synaptic plasticity, and in cellular differentiation.
Deciphering systemic wound responses of the pumpkin extrafascicular phloem by metabolomics and stable isotope-coded protein labeling.

Science.gov (United States)

Gaupels, Frank; Sarioglu, Hakan; Beckmann, Manfred; Hause, Bettina; Spannagl, Manuel; Draper, John; Lindermayr, Christian; Durner, Jörg

2012-12-01

In cucurbits, phloem latex exudes from cut sieve tubes of the extrafascicular phloem (EFP), serving in defense against herbivores. We analyzed inducible defense mechanisms in the EFP of pumpkin (Cucurbita maxima) after leaf damage. As an early systemic response, wounding elicited transient accumulation of jasmonates and a decrease in exudation probably due to partial sieve tube occlusion by callose. The energy status of the EFP was enhanced as indicated by increased levels of ATP, phosphate, and intermediates of the citric acid cycle. Gas chromatography coupled to mass spectrometry also revealed that sucrose transport, gluconeogenesis/glycolysis, and amino acid metabolism were up-regulated after wounding. Combining ProteoMiner technology for the enrichment of low-abundance proteins with stable isotope-coded protein labeling, we identified 51 wound-regulated phloem proteins. Two Sucrose-Nonfermenting1-related protein kinases and a 32-kD 14-3-3 protein are candidate central regulators of stress metabolism in the EFP. Other proteins, such as the Silverleaf Whitefly-Induced Protein1, Mitogen Activated Protein Kinase6, and Heat Shock Protein81, have known defensive functions. Isotope-coded protein labeling and western-blot analyses indicated that Cyclophilin18 is a reliable marker for stress responses of the EFP. As a hint toward the induction of redox signaling, we have observed delayed oxidation-triggered polymerization of the major Phloem Protein1 (PP1) and PP2, which correlated with a decline in carbonylation of PP2. In sum, wounding triggered transient sieve tube occlusion, enhanced energy metabolism, and accumulation of defense-related proteins in the pumpkin EFP. The systemic wound response was mediated by jasmonate and redox signaling.
Cloning of a human insulin-stimulated protein kinase (ISPK-1) gene and analysis of coding regions and mRNA levels of the ISPK-1 and the protein phosphatase-1 genes in muscle from NIDDM patients

DEFF Research Database (Denmark)

Bjørbaek, C; Vik, T A; Echwald, S M

1995-01-01

with non-insulin-dependent diabetes mellitus (NIDDM). The human ISPK-1 cDNA was cloned from T-cell leukemia and placental cDNA libraries and mapped to the short arm of the human X chromosome. Single-strand conformation polymorphism (SSCP) analysis identified a total of six variations in the coding regions...
Predicting binding within disordered protein regions to structurally characterised peptide-binding domains.

Directory of Open Access Journals (Sweden)

Waqasuddin Khan

Disordered regions of proteins often bind to structured domains, mediating interactions within and between proteins. However, it is difficult to identify a priori the short disordered regions involved in binding. We set out to determine if docking such peptide regions to peptide binding domains would assist in these predictions. We assembled a redundancy-reduced dataset of SLiM (Short Linear Motif)-containing proteins from the ELM database. We selected 84 sequences which had an associated PDB structure showing the SLiM bound to a protein receptor, where the SLiM was found within a 50-residue region of the protein sequence which was predicted to be disordered. First, we investigated the Vina docking scores of overlapping tripeptides from the 50-residue SLiM-containing disordered regions of the protein sequence to the corresponding PDB domain. We found only weak discrimination of docking scores between peptides involved in binding and adjacent non-binding peptides in this context (AUC 0.58). Next, we trained a bidirectional recurrent neural network (BRNN) using as input the protein sequence, predicted secondary structure, Vina docking score and predicted disorder score. The results were very promising (AUC 0.72), showing that multiple sources of information can be combined to produce results which are clearly superior to any single source. We conclude that the Vina docking score alone has only modest power to define the location of a peptide within a larger protein region known to contain it. However, combining this information with other knowledge (using machine learning methods) clearly improves the identification of peptide binding regions within a protein sequence. This approach combining docking with machine learning is primarily a predictor of binding to peptide-binding sites, and is not intended as a predictor of specificity of binding to particular receptors.
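Two mechanical steps in this abstract lend themselves to a short sketch: cutting a region into overlapping tripeptides, and summarising how well a score separates binding from non-binding peptides as an AUC. The code below is illustrative only; the function names, the toy sequence and the toy scores are hypothetical, and it assumes scores are oriented so that higher means "more likely binding" (raw docking energies, where lower is better, would need their sign flipped first).

```python
def overlapping_windows(seq: str, size: int = 3) -> list[str]:
    """All overlapping windows of `size` residues, e.g. tripeptides."""
    return [seq[i:i + size] for i in range(len(seq) - size + 1)]


def auc(scores: list[float], labels: list[int]) -> float:
    """Rank-based AUC: the probability that a randomly chosen positive
    outscores a randomly chosen negative, with ties counted as half."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy inputs (not real data)
peptides = overlapping_windows("ACDEF")
toy_auc = auc([0.9, 0.4, 0.6, 0.2], [1, 0, 1, 0])
print(peptides, toy_auc)
```

On this scale, 0.5 corresponds to no discrimination and 1.0 to perfect separation, which places the reported values of 0.58 and 0.72 in context.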
Revisiting the missing protein-coding gene catalog of the domestic dog

Directory of Open Access Journals (Sweden)

Galibert Francis

2009-02-01

Background: Among mammals for which there is high sequence coverage, the whole genome assembly of the dog is unique in that it predicts a low number of protein-coding genes, ~19,000, compared to the over 20,000 reported for other mammalian species. Of particular interest are the more than 400 genes annotated in primate and rodent genomes but missing in dog. Results: Using over 14,000 orthologous genes between human, chimpanzee, mouse, rat and dog, we built multiple pairwise synteny maps to infer short orthologous intervals that were targeted for characterizing the canine missing genes. Based on gene prediction and a functionality test using the ratio of replacement to silent nucleotide substitution rates (dN/dS), we provide compelling structural and functional evidence for the identification of 232 new protein-coding genes in the canine genome and 69 gene losses, characterized as undetected genes or pseudogenes. Gene loss phyletic pattern analysis using ten species from chicken to human allowed us to characterize 28 canine-specific gene losses that have functional orthologs continuously from chicken or marsupials through human, and 10 genes that arose specifically in the evolutionary lineage leading to rodents and primates. Conclusion: This study demonstrates the central role of comparative genomics for refining gene catalogs and exploring the evolutionary history of gene repertoires, particularly as applied to the characterization of species-specific gene gains and losses.
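The functionality test mentioned here rests on the dN/dS ratio: replacement (non-synonymous) substitutions per non-synonymous site divided by silent (synonymous) substitutions per synonymous site. The sketch below uses uncorrected proportions, without the multiple-substitution corrections that real analyses normally apply, and is not the authors' pipeline; the function name and the counts are hypothetical.

```python
def dn_ds(nonsyn_subs: float, nonsyn_sites: float,
          syn_subs: float, syn_sites: float) -> float:
    """Uncorrected dN/dS ratio from substitution and site counts.

    Simplified for illustration: no correction for multiple hits.
    """
    if nonsyn_sites <= 0 or syn_sites <= 0:
        raise ValueError("site counts must be positive")
    p_n = nonsyn_subs / nonsyn_sites  # replacement changes per replacement site
    p_s = syn_subs / syn_sites        # silent changes per silent site
    if p_s == 0:
        raise ValueError("ratio undefined when there are no silent changes")
    return p_n / p_s


# Hypothetical counts for one gene pair (not real data)
omega = dn_ds(nonsyn_subs=6, nonsyn_sites=300.0, syn_subs=12, syn_sites=100.0)
print(f"dN/dS = {omega:.3f}")
```

A ratio well below 1 is conventionally read as purifying selection acting on a functional protein-coding gene, whereas a ratio near 1 is what a pseudogene evolving without constraint would be expected to show.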
Non-coding RNA in Deinococcus radiodurans

International Nuclear Information System (INIS)

Chen Zhongzhong; Wang Liangyan; Lin Jun; Tian Bing; Hua Yuejin

2006-01-01

Research on the DNA damage and repair pathways of Deinococcus radiodurans shows its extreme resistance to ionizing radiation, ultraviolet radiation and reactive oxygen species. Non-coding RNAs (ncRNAs) are involved in a variety of processes such as transcriptional regulation, RNA processing and modification, mRNA translation, and protein transport and stability. The conserved secondary structures of intergenic regions of Deinococcus radiodurans R1 were predicted using a Stochastic Context Free Grammar (SCFG) scan strategy. Results showed that 28 ncRNA families were present in the non-coding regions of the genome of Deinococcus radiodurans R1. Among these families, IRE is the largest, followed by Histone3, tRNA and SECIS. DicF, ctRNA-pGA1 and tmRNA are ones discovered in bacteria. Results from the comparison with other organisms showed that these ncRNAs can be applied to the study of the biological function of Deinococcus radiodurans and provide a reference for the further study of the DNA damage and repair mechanisms of this bacterium. (authors)
Predictions of RNA-binding ability and aggregation propensity of proteins

OpenAIRE

Agostini, Federico, 1985-

2014-01-01

RNA-binding proteins (RBPs) control the fate of a multitude of coding and non-coding transcripts. Formation of ribonucleoprotein (RNP) complexes fine-tunes regulation of post-transcriptional events and influences gene expression. Recently, it has been observed that non-canonical proteins with RNA-binding ability are enriched in structurally disordered and low-complexity regions that are generally involved in functional and dysfunctional associations. Therefore, it is possible that interaction...
Expected packing density allows prediction of both amyloidogenic and disordered regions in protein chains

Energy Technology Data Exchange (ETDEWEB)

Galzitskaya, Oxana V; Garbuzynskiy, Sergiy O; Lobanov, Michail Yu [Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region (Russian Federation)

2007-07-18

The determination of factors that influence conformational changes in proteins is very important for the identification of potentially amyloidogenic and disordered regions in polypeptide chains. In our work we introduce a new parameter, mean packing density, to detect both amyloidogenic and disordered regions in a protein sequence. It has been shown that regions with strong expected packing density are responsible for amyloid formation. Our predictions are consistent with known disease-related amyloidogenic regions for 9 of 12 amyloid-forming proteins and peptides in which the positions of amyloidogenic regions have been revealed experimentally. Our findings support the concept that the mechanism of formation of amyloid fibrils is similar for different peptides and proteins. Moreover, we have demonstrated that regions with weak expected packing density are responsible for the appearance of disordered regions. Our method has been tested on datasets of globular proteins and long disordered protein segments, and it shows improved performance over other widely used methods. Thus, we demonstrate that the expected packing density is a useful value for predicting both disordered and amyloidogenic regions of a protein based on sequence alone. Our results are important for understanding the structural characteristics of protein folding and misfolding.
Recombinant Vaccinia Viruses Coding Transgenes of Apoptosis-Inducing Proteins Enhance Apoptosis But Not Immunogenicity of Infected Tumor Cells

Science.gov (United States)

Tkachenko, Anastasiya; Richter, Vladimir

2017-01-01

Genetic modifications of the oncolytic vaccinia virus (VV) improve selective tumor cell infection and death, as well as activation of antitumor immunity. We have engineered a double recombinant VV coding human GM-CSF and the apoptosis-inducing protein apoptin (VV-GMCSF-Apo) for comparison with the earlier constructed double recombinant VV-GMCSF-Lact, coding another apoptosis-inducing protein, lactaptin, which activates different cell death pathways than apoptin. We showed that both these recombinant VVs activated a set of critical apoptosis markers in infected cells more strongly than the recombinant VV coding GM-CSF alone (VV-GMCSF-dGF): these were phosphatidylserine externalization, caspase-3 and caspase-7 activation, DNA fragmentation, and upregulation of the proapoptotic protein BAX. However, only VV-GMCSF-Lact efficiently decreased the mitochondrial membrane potential of infected cancer cells. Investigating immunogenic cell death markers in cancer cells infected with recombinant VVs, we demonstrated that all tested recombinant VVs were efficient in calreticulin and HSP70 externalization, decrease of cellular HMGB1, and ATP secretion. The comparison of antitumor activity against advanced MDA-MB-231 tumor revealed that both recombinants, VV-GMCSF-Lact and VV-GMCSF-Apo, efficiently delay tumor growth. Our results demonstrate that the combination of GM-CSF and apoptosis-inducing proteins in the VV genome is a very efficient tool for specific killing of cancer cells and for activation of antitumor immunity. PMID:28951871
Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

Directory of Open Access Journals (Sweden)

Maggi Giorgio P

2008-06-01

Background: The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately, it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (and thus dependent) on the availability of annotated proteins. Results: In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high-throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences, the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly, we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion: Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and to identify potential gene loci. Our system does not require the availability of any annotated sequence and is thus suitable for the analysis of new or poorly annotated genomes.
Amino acid codes in mitochondria as possible clues to primitive codes

Science.gov (United States)

Jukes, T. H.

1981-01-01

Differences between mitochondrial codes and the universal code indicate that an evolutionary simplification has taken place, rather than a return to a more primitive code. However, these differences make it evident that the universal code is not the only code possible, and therefore earlier codes may have differed markedly from the present code. The present universal code is probably a 'frozen accident.' The change in CUN codons from leucine to threonine (Neurospora vs. yeast mitochondria) indicates that neutral or near-neutral changes occurred in the corresponding proteins when this code change took place, caused presumably by a mutation in a tRNA gene.
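As a minimal illustration of the codon reassignment described above, the following Python sketch translates the same DNA under the standard code and under a variant in which CUN (CTN in DNA) is read as threonine. Only the CUN change named in the abstract is modelled; the names `STANDARD`, `CUN_AS_THR` and `translate`, and the toy sequence, are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical override table: CUN (CTN in DNA) read as threonine, not leucine.
CUN_AS_THR = {"CT" + b: "T" for b in BASES}


def translate(dna, overrides=None):
    """Translate complete codons of `dna`, applying optional codon overrides."""
    table = {**STANDARD, **(overrides or {})}
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


toy = "ATGCTTCTAGGA"  # made-up sequence: ATG CTT CTA GGA
print(translate(toy))              # MLLG
print(translate(toy, CUN_AS_THR))  # MTTG
```

The same nucleotide sequence yields a different protein under each table, which is why a code change of this kind is expected to be tolerated only when the affected residues are neutral or near-neutral.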
Do prion protein gene polymorphisms induce apoptosis in non ...

Indian Academy of Sciences (India)

2016-08-26

Genetic variations such as single nucleotide polymorphisms (SNPs) in the prion protein coding gene, Prnp, greatly affect susceptibility to prion diseases in mammals. Here, the coding region of Prnp was screened for polymorphisms in the red-eared turtle, Trachemys scripta. Four polymorphisms, L203V, N205I, ...
Two human cDNA molecules coding for the Duchenne muscular dystrophy (DMD) locus are highly homologous

Energy Technology Data Exchange (ETDEWEB)

Rosenthal, A.; Speer, A.; Billwitz, H. (Zentralinstitut fuer Molekularbiologie, Berlin-Buch (Germany Democratic Republic)); Cross, G.S.; Forrest, S.M.; Davies, K.E. (Univ. of Oxford (England))

1989-07-11

Recently the complete sequence of the human fetal cDNA coding for the Duchenne muscular dystrophy (DMD) locus was reported and a 3,685 amino acid long, rod-shaped cytoskeletal protein (dystrophin) was predicted as the protein product. Independently, the authors have isolated and sequenced different DMD cDNA molecules from human adult and fetal muscle. The complete 12.5 kb long sequence of all their cDNA clones has now been determined and they report here the nucleotide (nt) and amino acid (aa) differences between the sequences of both groups. The cDNA sequence comprises the whole coding region but lacks the first 110 nt from the 5'-untranslated region and the last 1,417 nt of the 3'-untranslated region. They have found 11 nt differences (approximately 99.9% homology), of which 7 occurred at the aa level.
Quantitative Profiling of Peptides from RNAs classified as non-coding

Science.gov (United States)

Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.

2014-01-01

Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences, including introns and several classes of non-coding RNAs (ncRNAs), do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355
Investigation of Polymorphisms in Coding Region of OsHKT1 in Relation to Salinity in Rice

Directory of Open Access Journals (Sweden)

Pham Quynh-Hoa

2016-11-01

Rice (Oryza sativa) is sensitive to salinity, but the salt tolerance level differs among cultivars, which might result from natural variations in the genes that are responsible for salt tolerance. High-affinity potassium transporters (HKTs) have been proven to be involved in salt tolerance in plants. Therefore, we screened for natural nucleotide polymorphisms in the coding sequence of OsHKT1, which encodes the HKT protein, in eight Vietnamese rice cultivars differing in salt tolerance level. In total, seven nucleotide substitutions in the coding sequence of OsHKT1 were found, including two non-synonymous and five synonymous substitutions. Further analysis revealed that the two non-synonymous nucleotide substitutions (G50T and T1209A) caused changes in amino acids (Gly17Val and Asp403Glu) at the signal peptide and the loop of the sixth transmembrane domain, respectively. To assess the potential effect of these substitutions on protein function, the 3D structures of the HKT protein variants were modelled using the PHYRE2 webserver. No difference was observed when the predicted 3D structures of the HKT protein variants were compared with each other. In addition, the codon bias of the synonymous substitutions did not show a clear correlation with salt tolerance level. It might be interesting to further investigate the functional roles of the detected non-synonymous substitutions, as they might correlate with salt tolerance in rice.
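The correspondence between the nucleotide and amino acid changes reported above can be checked with simple arithmetic. The sketch below, written under the assumption that positions are 1-based and counted from the first base of the start codon, maps a coding-sequence position to its codon and classifies a codon change under the standard genetic code. The function names are illustrative, and the codon GGT used for glycine is a hypothetical choice (the abstract does not give the third base of codon 17).

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_position(nt_pos):
    """Map a 1-based CDS position to (codon number, position within codon)."""
    return (nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1


def classify(ref_codon, alt_codon):
    """Return (reference aa, altered aa, 'synonymous' or 'non-synonymous')."""
    ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
    kind = "synonymous" if ref_aa == alt_aa else "non-synonymous"
    return ref_aa, alt_aa, kind


print(codon_position(50))      # (17, 2)
print(codon_position(1209))    # (403, 3)
print(classify("GGT", "GTT"))  # ('G', 'V', 'non-synonymous'); GGT is hypothetical
print(classify("GAT", "GAA"))  # ('D', 'E', 'non-synonymous')
```

Position 50 falls on the second base of codon 17 and position 1209 on the third base of codon 403, which agrees with the reported Gly17Val and Asp403Glu changes.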
LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.

Science.gov (United States)

Filatov, Gleb; Bauwens, Bruno; Kertész-Farkas, Attila

2018-05-07

Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Here, we present a new convolutional kernel function for protein sequences called the LZW-Kernel. It is based on code words identified with the Lempel-Ziv-Welch (LZW) universal text compressor. The LZW-Kernel is an alignment-free method; it is always symmetric and positive, always provides 1.0 for self-similarity, and can be used directly with Support Vector Machines (SVMs) in classification problems, contrary to the normalized compression distance (NCD), which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly suitable for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with BLAST scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel, and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed: three times faster than BLAST and several orders of magnitude faster than SW or LAK in our tests. LZW-Kernel is implemented as standalone C code and is a free open-source program distributed under the GPLv3 license; it can be downloaded from https://github.com/kfattila/LZW-Kernel. Contact: akerteszfarkas@hse.ru. Supplementary data are available at Bioinformatics Online.
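To make the notion of LZW code words concrete, the following Python sketch performs a one-pass LZW parse of a sequence and collects the multi-symbol phrases added to the dictionary. The similarity function that follows is a deliberately simple toy (shared code words normalized by the geometric mean of the set sizes) and is an assumption for illustration only; it is not the kernel defined by the authors, whose exact formula is not given in the abstract.

```python
import math


def lzw_code_words(seq):
    """Return the multi-symbol phrases that an LZW parse adds to its dictionary."""
    dictionary = set(seq)  # single symbols are pre-loaded, as in standard LZW
    added = set()
    w = ""
    for c in seq:
        if w + c in dictionary:
            w += c                   # keep extending the current phrase
        else:
            dictionary.add(w + c)    # new code word: longest match plus one symbol
            added.add(w + c)
            w = c
    return added


def toy_similarity(a, b):
    """Toy normalized overlap of LZW code-word sets (NOT the published kernel)."""
    words_a, words_b = lzw_code_words(a), lzw_code_words(b)
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / math.sqrt(len(words_a) * len(words_b))


print(sorted(lzw_code_words("ABABAB")))    # ['AB', 'ABA', 'BA']
print(toy_similarity("ABABAB", "ABABAB"))  # 1.0
```

Like the kernel described above, this toy measure is symmetric and returns 1.0 for self-similarity on any sequence that produces at least one code word, but no claim is made that it is positive definite.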
Regional TEC model under quiet geomagnetic conditions and low-to-moderate solar activity based on CODE GIMs

Science.gov (United States)

Feng, Jiandi; Jiang, Weiping; Wang, Zhengtao; Zhao, Zhenzhen; Nie, Linjuan

2017-08-01

Global empirical total electron content (TEC) models based on TEC maps effectively describe the average behavior of the ionosphere. However, the accuracy of these global models for a certain region may not be ideal. Due to the number and distribution of the International GNSS Service (IGS) stations, the accuracy of TEC maps is geographically different. The modeling database derived from global TEC maps with different accuracy is likely one of the main reasons that limit the accuracy of the new models. Moreover, many anomalies in the ionosphere are geographically or geomagnetically dependent, and as such the accuracy of global models can deteriorate if these anomalies are not fully incorporated into the modeling approach. For regional models built in small areas, these influences on modeling are greatly weakened. Thus, regional TEC models may better reflect the temporal and spatial variations of TEC. In our previous work (Feng et al., 2016), a regional TEC model, TECM-NEC, was proposed for northeast China. However, that model targets only the typical region of Mid-latitude Summer Nighttime Anomaly (MSNA) occurrence and is not meaningful in regions without MSNA. Following the technique of the TECM-NEC model, this study proposes another regional empirical TEC model for other regions in mid-latitudes. Taking the small Beijing-Tianjin-Tangshan (JJT) region (37.5°-42.5° N, 115°-120° E) in China as an example, a regional empirical TEC model (TECM-JJT) is proposed using the TEC grid data from January 1, 1999 to June 30, 2015 provided by the Center for Orbit Determination in Europe (CODE) under quiet geomagnetic conditions. The TECM-JJT model fits the input CODE TEC data with a bias of 0.11 TECU and a root mean square error of 3.26 TECU. The results show that the regional model TECM-JJT is consistent with CODE TEC data and GPS-TEC data.
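The two fit statistics quoted above can be computed as in the short Python sketch below. It assumes the common convention that the residual is model minus observation (the abstract does not state the sign convention), and the TEC values are made-up numbers used only to exercise the function.

```python
import math


def bias_and_rmse(model, observed):
    """Return (mean residual, root mean square error), residual = model - observed."""
    residuals = [m - o for m, o in zip(model, observed)]
    n = len(residuals)
    bias = sum(residuals) / n
    rmse = math.sqrt(sum(r * r for r in residuals) / n)
    return bias, rmse


# Made-up TEC values in TECU, for illustration only.
model_tec = [10.0, 12.0, 14.0]
code_tec = [9.0, 13.0, 13.0]
bias, rmse = bias_and_rmse(model_tec, code_tec)
print(round(bias, 3), round(rmse, 3))  # 0.333 1.0
```

A small bias together with a larger root mean square error, as reported for TECM-JJT, means the model is nearly unbiased on average while individual grid values still scatter around it.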
GUMAP: A GUPIXWIN-compatible code for extracting regional spectra from nuclear microbeam list mode files

Science.gov (United States)

Russell, John L.; Campbell, John L.; Boyd, Nicholas I.; Dias, Johnny F.

2018-02-01

The newly developed GUMAP software creates element maps from OMDAQ list mode files, displays these maps individually or collectively, and facilitates on-screen definitions of specified regions from which a PIXE spectrum can be built. These include a free-hand region defined by moving the cursor. The regional charge is entered automatically into the spectrum file in a new GUPIXWIN-compatible format, enabling a GUPIXWIN analysis of the spectrum. The code defaults to the OMDAQ dead time treatment but also facilitates two other methods for dead time correction in sample regions with count rates different from the average.

Polypeptide structure and encoding location of the adenovirus serotype 2 late, nonstructural 33K protein

International Nuclear Information System (INIS)

Oosterom-Dragon, E.A.; Anderson, C.W.

1983-01-01

Radiochemical microsequence analysis of selected tryptic peptides of the adenovirus type 2 33K nonstructural protein has revealed the precise region of the genomic nucleotide sequence that encodes this protein. The initiation codon for the 33K protein lies 606 nucleotides to the right of the EcoRI restriction site at 70.7 map units and 281 nucleotides to the left of the postulated carboxyterminal codon of the adenovirus 100K protein. The coding regions for these two proteins thus overlap; however, the 33K protein is derived from the +1 frame with respect to the postulated 100K reading frame. Our results contradict an earlier published report suggesting that these two proteins share extensive amino acid sequence homology. The published nucleotide sequence of the Ad2 EcoRI-F fragment (70.7 to 75.9 map units) cannot accommodate in a single reading frame the peptide sequences of the 33K protein that we have determined. Sequence analysis of DNA fragments derived from virus has confirmed the published nucleotide sequence in all critical regions with respect to the coding region for the 33K protein. Consequently, our data are only consistent with the existence of an mRNA splice within the coding region for 33K. Consensus donor and acceptor splice sequences have been located that would predict the removal of 202 nucleotides from the transcripts for the 33K protein. Removal of these nucleotides would explain the structure of a peptide that cannot otherwise be directly encoded by the EcoRI-F fragment. Identification of the precise splice points by peptide sequencing has permitted a prediction of the complete amino acid sequence for the 33K protein.
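One way to see why a 202-nucleotide splice is compatible with peptides that do not fit a single genomic reading frame is frame arithmetic: 202 is not a multiple of 3, so sequence downstream of the splice is read in a different frame relative to the genomic DNA. The Python sketch below illustrates this on a made-up toy sequence; it is offered as an illustration of the arithmetic, not as the authors' analysis, and the function names are assumptions.

```python
def frame_shift(excised_length):
    """Shift (0, 1 or 2) of the downstream reading frame after an excision."""
    return excised_length % 3


def splice(seq, start, length):
    """Remove `length` nucleotides beginning at 0-based index `start`."""
    return seq[:start] + seq[start + length:]


def codons(seq):
    """Split a sequence into consecutive triplets (the last may be partial)."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


print(frame_shift(202))  # 1

# Toy example: excising 4 nt (4 % 3 == 1) shifts every downstream codon.
toy = "ATGAAACCCGGGTTT"
print(codons(toy))                # ['ATG', 'AAA', 'CCC', 'GGG', 'TTT']
print(codons(splice(toy, 3, 4)))  # ['ATG', 'CCG', 'GGT', 'TT']
```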
ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

Science.gov (United States)

Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

2017-01-04

The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) contributes to various biological processes and is linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represents about a 20-fold expansion compared with the previously released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed a 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulation of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Protein-protein docking using region-based 3D Zernike descriptors

Directory of Open Access Journals (Sweden)

Sael Lee

2009-12-01

Background: Protein-protein interactions are a pivotal component of many biological processes and mediate a variety of functions. Knowing the tertiary structure of a protein complex is therefore essential for understanding the interaction mechanism. However, experimental techniques to solve the structure of the complex are often found to be difficult. Computational protein-protein docking approaches can provide a useful alternative to address this issue. Prediction of docking conformations relies on methods that effectively capture shape features of the participating proteins while giving due consideration to conformational changes that may occur. Results: We present a novel protein docking algorithm based on the use of 3D Zernike descriptors as regional features of molecular shape. The key motivation for using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. Docking decoys are generated using geometric hashing and are then ranked by a scoring function that incorporates a buried surface area and a novel geometric complementarity term based on normals associated with the 3D Zernike shape description. Our docking algorithm was tested on both bound and unbound cases in the ZDOCK benchmark 2.0 dataset. In 74% of the bound docking predictions, our method was able to find a near-native solution (interface Cα RMSD ≤ 2.5 Å) within the top 1000 ranks. For unbound docking, among the 60 complexes for which our algorithm returned at least one hit, 60% of the cases were ranked within the top 2000. Comparison with existing shape-based docking algorithms shows that our method has a better performance than the others in unbound docking while remaining competitive for bound docking cases. Conclusion: We show for the first time that the 3D Zernike descriptors are adept in capturing shape complementarity at the protein-protein interface and useful for
JJ1017 committee report: image examination order codes--standardized codes for imaging modality, region, and direction with local expansion: an extension of DICOM.

Science.gov (United States)

Kimura, Michio; Kuranishi, Makoto; Sukenobu, Yoshiharu; Watanabe, Hiroki; Tani, Shigeki; Sakusabe, Takaya; Nakajima, Takashi; Morimura, Shinya; Kabata, Shun

2002-06-01

The digital imaging and communications in medicine (DICOM) standard includes parts regarding nonimage data information, such as image study ordering data and performed procedure data, and is used for sharing information between HIS/RIS and modality systems, which is essential for IHE. To bring such parts of the DICOM standard into force in Japan, a joint committee of JIRA and JAHIS established the JJ1017 management guideline, specifying, for example, which items are legally required in Japan while remaining optional in the DICOM standard. In Japan, the contents of orders from referring physicians for radiographic examinations include details of the examination. Such details are not typically used by referring physicians requesting radiographic examinations in the United States, because radiologists in the United States often determine the examination protocol. The DICOM standard has code tables for examination type, region, and direction for image examination orders. However, this investigation found that, for the reason given above, it does not include items that are sufficiently detailed for use in Japan. To overcome these drawbacks, we have defined the JJ1017 code for these 3 items, for use under the JJ1017 guidelines. This report introduces the JJ1017 code. These codes (the study type codes in particular) must be expandable to keep up with technical advances in equipment. Expansion has 2 directions: width, for covering more categories, and depth, for specifying the information in more detail (finer categories). The JJ1017 code takes these requirements into consideration and clearly distinguishes between the stem part as the common term and the expansion. The stem part of the JJ1017 code partially utilizes the DICOM codes to remain in line with the DICOM standard. This work is an example of how local requirements can be met by using the DICOM standard and extending it.
Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

Directory of Open Access Journals (Sweden)

Balinda Sheila N

2010-08-01

Background: Foot-and-mouth disease (FMD) is endemic in East Africa, with the majority of the reported outbreaks attributed to serotype O virus. In this study, phylogenetic analyses of the polyprotein coding region of serotype O FMD viruses from Kenya and Uganda have been undertaken to infer evolutionary relationships and processes responsible for the generation and maintenance of diversity within this serotype. FMD virus RNA was obtained from six samples following virus isolation in cell culture and in one case by direct extraction from an oropharyngeal sample. Following RT-PCR, the single long open reading frame, encoding the polyprotein, was sequenced. Results: Phylogenetic comparisons of the VP1 coding region showed that the recent East African viruses belong to one lineage within the EA-2 topotype, while an older Kenyan strain, K/52/1992, is a representative of the topotype EA-1. Evolutionary relationships between the coding regions for the leader protease (L), the capsid region and almost the entire coding region are monophyletic except for K/52/1992, which is distinct. Furthermore, phylogenetic relationships for the P2 and P3 regions suggest that K/52/1992 is a probable recombinant between serotypes A and O. A bootscan analysis of K/52/1992 with East African FMD serotype A viruses (A21/KEN/1964 and A23/KEN/1965) and a serotype O viral isolate (K/117/1999) revealed that the P2 region is probably derived from a serotype A strain while the P3 region appears to be a mosaic derived from both serotypes A and O. Conclusions: Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2), a probable indication that hardly any FMD introductions of this serotype have occurred from outside the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A
Investigation of the N-terminal coding region of MUC7 alterations in dentistry students with and without caries

Directory of Open Access Journals (Sweden)

Koç Öztürk L

2016-06-01

Human low-molecular weight salivary mucin (MUC7) is a small, secreted glycoprotein coded by MUC7. In the oral cavity, these mucins inhibit the colonization of oral bacteria, including cariogenic ones, by masking their surface adhesins, thus helping saliva to prevent dental caries. The N-terminal domain is important for low-molecular weight (MG2) mucins to contact oral microorganisms. In this study, we aimed to identify alterations in the N-terminal coding region of the MUC7 gene in individuals with and without caries. Forty-four healthy dental students were enrolled in this study; 24 of them were classified as having caries [decayed, missing, filled teeth (DMFT) = 5.6] according to the World Health Organization (WHO) criteria, and 20 of them were caries-free (DMFT = 0). The simplified oral hygiene index (OHI-S) and gingival index (GI) were used to determine the oral hygiene and gingival conditions. Salivary total protein levels and salivary buffer capacity (SBC) were determined by the Lowry and Ericsson methods. DNA was extracted from peripheral blood cells of all the participants and genotyping was carried out by a polymerase chain reaction (PCR)-sequencing method. No statistical differences were found between the two groups in terms of salivary parameters, oral hygiene and gingival conditions. We detected one common single nucleotide polymorphism (SNP) that leads to a change of asparagine to lysine at codon 80. This substitution was found in 29.0 and 40.0%, respectively, of the groups with and without caries. No other sequence variations were detected. The SNP found in this study may be a specific polymorphism affecting the Turkish population. Further studies with larger samples are necessary in order to clarify this finding.
The 75-kilodalton cytoplasmic Chlamydia trachomatis L2 polypeptide is a DnaK-like protein

DEFF Research Database (Denmark)

Birkelund, Svend; Lundemose, AG; Christiansen, Gunna

1990-01-01

The gene coding for the 75-kilodalton cytoplasmic Chlamydia trachomatis L2 polypeptide has been cloned in Escherichia coli, and the nucleotide sequence has been determined. The cloned DNA fragment contained the coding region as well as the putative promoter. The deduced amino acid sequence of the 1,980-base-pair open reading frame revealed 94% homology with a 75-kilodalton protein from C. trachomatis serovar D and 57% homology with the DnaK proteins of E. coli and of Bacillus megaterium, while amino acid homology with human heat shock protein 70 (hsp70) was 42%. The promoter region was identified by computer search and by primer extension of mRNA synthesized in recombinant E. coli. The promoter region which differed from the putative promoter region in serovar D was shown to be a mixed promoter type in which the -10 region showed a regular TATA box configuration while the -35 region showed high...
Two-stage sparse coding of region covariance via Log-Euclidean kernels to detect saliency.

Science.gov (United States)

Zhang, Ying-Ying; Yang, Cai; Zhang, Ping

2017-05-01

In this paper, we present a novel bottom-up saliency detection algorithm from the perspective of covariance matrices on a Riemannian manifold. Each superpixel is described by a region covariance matrix on a Riemannian manifold. We carry out a two-stage sparse coding scheme via Log-Euclidean kernels to extract salient objects efficiently. In the first stage, given a background dictionary built on image borders, sparse coding of each region covariance via Log-Euclidean kernels is performed. The reconstruction error on the background dictionary is regarded as the initial saliency of each superpixel. In the second stage, the initial result is improved by calculating reconstruction errors of the superpixels on a foreground dictionary, which is extracted from the first-stage saliency map. The sparse coding in the second stage is similar to that in the first stage, but is able to effectively highlight the salient objects uniformly from the background. Finally, three post-processing methods (a highlight-inhibition function, context-based saliency weighting, and the graph cut) are adopted to further refine the saliency map. Experiments on four public benchmark datasets show that the proposed algorithm outperforms the state-of-the-art methods in terms of precision, recall and mean absolute error, and demonstrate the robustness and efficiency of the proposed method. Copyright © 2017 Elsevier Ltd. All rights reserved.
Specificity Protein (Sp) Transcription Factors and Metformin Regulate Expression of the Long Non-coding RNA HULC

Science.gov (United States)

There is evidence that specificity protein 1 (Sp1) transcription factor (TF) regulates expression of long non-coding RNAs (lncRNAs) in hepatocellular carcinoma (HCC) cells. RNA interference (RNAi) studies showed that among several lncRNAs expressed in HepG2, SNU-449 and SK-Hep-1...
AMPA: an automated web server for prediction of protein antimicrobial regions.

Science.gov (United States)

Torrent, Marc; Di Tommaso, Paolo; Pulido, David; Nogués, M Victòria; Notredame, Cedric; Boix, Ester; Andreu, David

2012-01-01

AMPA is a web application for assessing the antimicrobial domains of proteins, with a focus on the design of new antimicrobial drugs. The application provides fast discovery of antimicrobial patterns in proteins that can be used to develop new peptide-based drugs against pathogens. Results are shown in a user-friendly graphical interface and can be downloaded as raw data for later examination. AMPA is freely available on the web at http://tcoffee.crg.cat/apps/ampa. The source code is also available on the web. Contact: marc.torrent@upf.edu; david.andreu@upf.edu. Supplementary data are available at Bioinformatics online.
Genetic diversity of the HLA-G coding region in Amerindian populations from the Brazilian Amazon: a possible role of natural selection.

Science.gov (United States)

Mendes-Junior, C T; Castelli, E C; Meyer, D; Simões, A L; Donadi, E A

2013-12-01

HLA-G has an important role in the modulation of the maternal immune system during pregnancy, and evidence that balancing selection acts in the promoter and 3'UTR regions has been previously reported. To determine whether selection acts on the HLA-G coding region in the Amazon Rainforest, exons 2, 3 and 4 were analyzed in a sample of 142 Amerindians from nine villages of five isolated tribes that inhabit the Central Amazon. Six previously described single-nucleotide polymorphisms (SNPs) were identified and the Expectation-Maximization (EM) and PHASE algorithms were used to computationally reconstruct SNP haplotypes (HLA-G alleles). A new HLA-G allele, which originated in Amerindian populations by a crossing-over event between two widespread HLA-G alleles, was identified in 18 individuals. Neutrality tests evidenced that natural selection has a complex part in the HLA-G coding region. Although balancing selection is the type of selection that shapes variability at a local level (Native American populations), we have also shown that purifying selection may occur on a worldwide scale. Moreover, the balancing selection does not seem to act on the coding region as strongly as it acts on the flanking regulatory regions, and such coding signature may actually reflect a hitchhiking effect.
Evaluation of the efficacy of twelve mitochondrial protein-coding genes as barcodes for mollusk DNA barcoding.

Science.gov (United States)

Yu, Hong; Kong, Lingfeng; Li, Qi

2016-01-01

In this study, we evaluated the efficacy of 12 mitochondrial protein-coding genes from 238 mitochondrial genomes of 140 molluscan species as potential DNA barcodes for mollusks. Three barcoding methods (distance, monophyly and character-based methods) were used in species identification. The species recovery rates based on genetic distances for the 12 genes ranged from 70.83 to 83.33%. There were no significant differences in intra- or interspecific variability among the 12 genes. The monophyly and character-based methods provided higher resolution than the distance-based method in species delimitation. The character-based method showed some advantages especially in closely related taxa. The results suggested that, besides the standard COI barcode, the other 11 mitochondrial protein-coding genes could also potentially be used as molecular diagnostics for molluscan species discrimination. Our results also showed that combining mitochondrial genes did not enhance the efficacy of species identification and that a single mitochondrial gene would be fully competent.
Sequence of the intron/exon junctions of the coding region of the human androgen receptor gene and identification of a point mutation in a family with complete androgen insensitivity

International Nuclear Information System (INIS)

Lubahn, D.B.; Simental, J.A.; Higgs, H.N.; Wilson, E.M.; French, F.S.; Brown, T.R.; Migeon, C.J.

1989-01-01

Androgens act through a receptor protein (AR) to mediate sex differentiation and development of the male phenotype. The authors have isolated the eight exons in the amino acid coding region of the AR gene from a human X chromosome library. Nucleotide sequences of the AR gene intron/exon boundaries were determined for use in designing synthetic oligonucleotide primers to bracket coding exons for amplification by the polymerase chain reaction. Genomic DNA was amplified from 46,XY phenotypic female siblings with complete androgen insensitivity syndrome. AR binding affinity for dihydrotestosterone in the affected siblings was lower than in normal males, but the binding capacity was normal. Sequence analysis of amplified exons demonstrated within the AR steroid-binding domain (exon G) a single guanine to adenine mutation, resulting in replacement of valine with methionine at amino acid residue 866. As expected, the carrier mother had both normal and mutant AR genes. Thus, a single point mutation in the steroid-binding domain of the AR gene correlated with the expression of an AR protein ineffective in stimulating male sexual development.
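The reported change can be cross-checked against the standard genetic code. The Python sketch below enumerates every valine codon, applies each possible single G-to-A substitution, and keeps the cases that yield methionine. The abstract does not state the codon involved, so the result is an inference from the code table rather than a reported observation; the names used are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def g_to_a_variants(codon):
    """Yield every codon obtained by changing exactly one G in `codon` to A."""
    for i, base in enumerate(codon):
        if base == "G":
            yield codon[:i] + "A" + codon[i + 1:]


# Valine codons for which a single G-to-A change produces methionine.
val_to_met = [
    (ref, alt)
    for ref, aa in CODON_TABLE.items() if aa == "V"
    for alt in g_to_a_variants(ref)
    if CODON_TABLE[alt] == "M"
]
print(val_to_met)  # [('GTG', 'ATG')]
```

Under the standard code, the only single guanine-to-adenine change that converts valine to methionine is GTG to ATG, at the first codon position.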
Genome-scale prediction of proteins with long intrinsically disordered regions.

Science.gov (United States)

Peng, Zhenling; Mizianty, Marcin J; Kurgan, Lukasz

2014-01-01

Proteins with long disordered regions (LDRs), defined as having 30 or more consecutive disordered residues, are abundant in eukaryotes, and these regions are recognized as a distinct class of biologically functional domains. LDRs facilitate various cellular functions and are important for target selection in structural genomics. Motivated by the lack of methods that directly predict proteins with LDRs, we designed Super-fast predictor of proteins with Long Intrinsically DisordERed regions (SLIDER). SLIDER utilizes logistic regression that takes as its inputs an empirically chosen set of numerical features, which consider selected physicochemical properties of amino acids, sequence complexity, and amino acid composition. Empirical tests show that SLIDER offers competitive predictive performance combined with low computational cost. It outperforms, by at least a modest margin, a comprehensive set of modern disorder predictors (that can indirectly predict LDRs) and is 16 times faster than the best currently available disorder predictor. Utilizing our time-efficient predictor, we characterized the abundance and functional roles of proteins with LDRs over 110 eukaryotic proteomes. Similar to related studies, we found that eukaryotes have many (on average 30.3%) proteins with LDRs, with the majority of proteomes having between 25 and 40%, where higher abundance is characteristic of proteomes that have larger proteins. Our first-of-its-kind large-scale functional analysis shows that these proteins are enriched in a number of cellular functions and processes including certain binding events, regulation of catalytic activities, cellular component organization, biogenesis, biological regulation, and some metabolic and developmental processes. A webserver that implements SLIDER is available at http://biomine.ece.ualberta.ca/SLIDER/. Copyright © 2013 Wiley Periodicals, Inc.
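The LDR definition quoted above (30 or more consecutive disordered residues) is simple to apply when per-residue disorder annotations are already available. The sketch below illustrates only that definition; it is not SLIDER, which predicts LDRs from sequence-derived features. The label encoding ("1" for disordered, "0" for ordered) and the function names are assumptions made for illustration.

```python
from itertools import groupby

LDR_MIN_LENGTH = 30  # threshold stated in the abstract


def longest_disordered_run(labels):
    """Length of the longest run of '1' (disordered) in a per-residue label string."""
    return max(
        (sum(1 for _ in run) for flag, run in groupby(labels) if flag == "1"),
        default=0,
    )


def has_ldr(labels, min_length=LDR_MIN_LENGTH):
    """True if the protein has at least one disordered run of min_length residues."""
    return longest_disordered_run(labels) >= min_length


if __name__ == "__main__":
    # Synthetic annotation: 10 ordered, 32 disordered, 5 ordered, 12 disordered.
    example = "0" * 10 + "1" * 32 + "0" * 5 + "1" * 12
    print(longest_disordered_run(example), has_ldr(example))
```

Note that two shorter disordered segments separated by even one ordered residue do not add up to an LDR under this definition.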
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

Science.gov (United States)

Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

2017-10-06

Small insertions and deletions (indels) have a significant influence on human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state-of-the-art models for assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact of, and prioritise, small indels throughout the whole non-coding genome.
Aggregation propensity of critical regions of the protein Tau

Science.gov (United States)

Muthee, Micaiah; Ahmed, Azka; Larini, Luca

Alzheimer's disease is an irreversible, progressive brain disorder that slowly destroys memory and thinking skills and eventually leaves patients unable to carry out the simplest tasks. It is characterized by the formation of protein aggregates both within and outside of the brain's cells, the neurons. Within the neurons, the aggregation of the protein tau leads to the destruction of the microtubules in the axon of the neuron. Tau belongs to a group of proteins referred to as Microtubule-Associated Proteins. It is extremely flexible and is classified as an intrinsically unstructured protein due to its low propensity to form secondary structure. Tau promotes tubulin assembly into microtubules, thereby stabilizing the cytoskeleton of the axon of the neurons. The microtubule binding region of tau consists of 4 pseudo-repeats. In this study, we focus on the aggregation propensity of two fragments: the PHF43 fragment, which contains the third pseudo-repeat and has been shown experimentally to aggregate readily, and another fragment that contains the second pseudo-repeat. Mutations in this region are associated with various forms of dementia, and for this reason we also consider the mutant P301L.
Stereoscopic Visual Attention-Based Regional Bit Allocation Optimization for Multiview Video Coding

Directory of Open Access Journals (Sweden)

Dai Qionghai

2010-01-01

We propose a Stereoscopic Visual Attention (SVA)-based regional bit allocation optimization for Multiview Video Coding (MVC) that exploits visual redundancies in human perception. We propose a novel SVA model in which multiple perceptual stimuli, including depth, motion, intensity, color, and orientation contrast, are used to simulate the visual attention mechanisms of the human visual system with stereoscopic perception. Then, a semantic region-of-interest (ROI) is extracted based on the saliency maps of SVA. Both objective and subjective evaluations of the extracted ROIs indicate that the ROI extraction scheme based on the proposed SVA model outperforms schemes that use only spatial and/or temporal visual attention cues. Finally, using the extracted SVA-based ROIs, a regional bit allocation optimization scheme is presented that allocates more bits to SVA-based ROIs for high image quality and fewer bits to background regions for efficient compression. Experimental results on MVC show that the proposed regional bit allocation algorithm can achieve over % bit-rate saving while maintaining the subjective image quality. Meanwhile, the image quality of ROIs is improved by dB at the cost of insensitive image quality degradation of the background image.
Pseudo-polyprotein translated from the full-length ORF1 of capillovirus is important for pathogenicity, but a truncated ORF1 protein without variable and CP regions is sufficient for replication.

Science.gov (United States)

Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou

2010-09-01

The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.
The spatial distribution of fixed mutations within genes coding for proteins

Science.gov (United States)

Holmquist, R.; Goodman, M.; Conroy, T.; Czelusniak, J.

1983-01-01

An examination has been conducted of the extensive amino acid sequence data now available for five protein families - the alpha crystallin A chain, myoglobin, alpha and beta hemoglobin, and the cytochromes c - with the goal of estimating the true spatial distribution of base substitutions within genes that code for proteins. In every case the commonly used Poisson density failed to even approximate the experimental pattern of base substitution. For the 87 species of beta hemoglobin examined, for example, the probability that the observed results were from a Poisson process was the minuscule 10 to the -44th. Analogous results were obtained for the other functional families. All the data were reasonably, but not perfectly, described by the negative binomial density. In particular, most of the data were described by one of the very simple limiting forms of this density, the geometric density. The implications of this for evolutionary inference are discussed. It is evident that most estimates of total base substitutions between genes are badly in need of revision.
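The comparison described above (a Poisson density versus the geometric limiting form of the negative binomial) can be illustrated by comparing maximum log-likelihoods on per-site substitution counts. The sketch below is only an illustration of that kind of comparison, not the authors' procedure: the counts are synthetic, made up for this example, and the function names are hypothetical. The geometric distribution used here is the negative binomial with shape parameter 1, supported on 0, 1, 2, and so on.

```python
import math


def poisson_loglik(counts):
    """Log-likelihood of the counts under a Poisson model at its MLE (lambda = mean)."""
    lam = sum(counts) / len(counts)
    return sum(k * math.log(lam) - lam - math.lgamma(k + 1) for k in counts)


def geometric_loglik(counts):
    """Log-likelihood under P(k) = p * (1 - p)**k at its MLE (p = 1 / (1 + mean))."""
    mean = sum(counts) / len(counts)
    p = 1.0 / (1.0 + mean)
    return sum(k * math.log(1.0 - p) + math.log(p) for k in counts)


if __name__ == "__main__":
    # Synthetic, overdispersed substitution counts per site (NOT data from the paper).
    counts = [0] * 50 + [1] * 25 + [2] * 12 + [3] * 6 + [4] * 3 + [5] * 2 + [6, 8]
    print("Poisson log-likelihood:  ", round(poisson_loglik(counts), 2))
    print("Geometric log-likelihood:", round(geometric_loglik(counts), 2))
```

For overdispersed counts such as these, with many unchanged sites and a long tail of heavily substituted ones, the geometric model attains the higher log-likelihood, which is the qualitative pattern the abstract reports.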
Structural basis of the non-coding RNA RsmZ acting as a protein sponge.

Science.gov (United States)

Duss, Olivier; Michel, Erich; Yulikov, Maxim; Schubert, Mario; Jeschke, Gunnar; Allain, Frédéric H-T

2014-05-29

MicroRNA and protein sequestration by non-coding RNAs (ncRNAs) has recently generated much interest. In the bacterial Csr/Rsm system, which is considered to be the most general global post-transcriptional regulatory system responsible for bacterial virulence, ncRNAs such as CsrB or RsmZ activate translation initiation by sequestering homodimeric CsrA-type proteins from the ribosome-binding site of a subset of messenger RNAs. However, the mechanism of ncRNA-mediated protein sequestration is not understood at the molecular level. Here we show for Pseudomonas fluorescens that RsmE protein dimers assemble sequentially, specifically and cooperatively onto the ncRNA RsmZ within a narrow affinity range. This assembly yields two different native ribonucleoprotein structures. Using a powerful combination of nuclear magnetic resonance and electron paramagnetic resonance spectroscopy we elucidate these 70-kilodalton solution structures, thereby revealing the molecular mechanism of the sequestration process and how RsmE binding protects the ncRNA from RNase E degradation. Overall, our findings suggest that RsmZ is well-tuned to sequester, store and release RsmE and therefore can be viewed as an ideal protein 'sponge'.

Conserved syntenic clusters of protein coding genes are missing in birds.

Science.gov (United States)

Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

2014-01-01

Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.
Identification of a cis-regulatory region of a gene in Arabidopsis thaliana whose induction by dehydration is mediated by abscisic acid and requires protein synthesis.

Science.gov (United States)

Iwasaki, T; Yamaguchi-Shinozaki, K; Shinozaki, K

1995-05-20

In Arabidopsis thaliana, the induction of a dehydration-responsive gene, rd22, is mediated by abscisic acid (ABA) but the gene does not include any sequence corresponding to the consensus ABA-responsive element (ABRE), RYACGTGGYR, in its promoter region. The cis-regulatory region of the rd22 promoter was identified by monitoring the expression of beta-glucuronidase (GUS) activity in leaves of transgenic tobacco plants transformed with chimeric gene fusions constructed between 5'-deleted promoters of rd22 and the coding region of the GUS reporter gene. A 67-bp nucleotide fragment corresponding to positions -207 to -141 of the rd22 promoter conferred responsiveness to dehydration and ABA on a non-responsive promoter. The 67-bp fragment contains the sequences of the recognition sites for some transcription factors, such as MYC, MYB, and GT-1. The fact that accumulation of rd22 mRNA requires protein synthesis raises the possibility that the expression of rd22 might be regulated by one of these trans-acting protein factors whose de novo synthesis is induced by dehydration or ABA. Although the structure of the RD22 protein is very similar to that of a non-storage seed protein, USP, of Vicia faba, the expression of the GUS gene driven by the rd22 promoter in non-stressed transgenic Arabidopsis plants was found mainly in flowers and bolted stems rather than in seeds.
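The consensus ABRE quoted above, RYACGTGGYR, uses IUPAC ambiguity codes (R for A or G, Y for C or T). A promoter sequence can be checked for such a motif by expanding the consensus into a regular expression. This is a minimal sketch with hypothetical function names; it scans only the given strand, and the test sequence is invented for illustration.

```python
import re

# IUPAC codes needed for this consensus: R = purine (A/G), Y = pyrimidine (C/T).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}


def consensus_to_regex(consensus):
    """Compile an IUPAC consensus into a regex; the lookahead allows overlapping hits."""
    body = "".join(IUPAC[base] for base in consensus)
    return re.compile("(?=" + body + ")")


ABRE = consensus_to_regex("RYACGTGGYR")


def find_sites(sequence, pattern=ABRE):
    """Return 0-based start positions of every motif match on the given strand."""
    return [match.start() for match in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    # Invented sequence containing one instance of the consensus (ACACGTGGCA).
    print(find_sites("ttgACACGTGGCAaat"))
```

A promoter that returns an empty list on both strands would, like rd22 in the abstract, lack a sequence corresponding to this consensus; scanning the reverse complement is left out of the sketch.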
Functional intersection of ATM and DNA-dependent protein kinase catalytic subunit in coding end joining during V(D)J recombination

DEFF Research Database (Denmark)

Lee, Baeck-Seung; Gapud, Eric J; Zhang, Shichuan

2013-01-01

V(D)J recombination is initiated by the RAG endonuclease, which introduces DNA double-strand breaks (DSBs) at the border between two recombining gene segments, generating two hairpin-sealed coding ends and two blunt signal ends. ATM and DNA-dependent protein kinase catalytic subunit (DNA-PKcs) are serine-threonine kinases that orchestrate the cellular responses to DNA DSBs. During V(D)J recombination, ATM and DNA-PKcs have unique functions in the repair of coding DNA ends. ATM deficiency leads to instability of postcleavage complexes and the loss of coding ends from these complexes. DNA...... when ATM is present and its kinase activity is intact. The ability of ATM to compensate for DNA-PKcs kinase activity depends on the integrity of three threonines in DNA-PKcs that are phosphorylation targets of ATM, suggesting that ATM can modulate DNA-PKcs activity through direct phosphorylation of DNA...
Analysis of t(9;17)(q33.2;q25.3) chromosomal breakpoint regions and genetic association reveals novel candidate genes for bipolar disorder

DEFF Research Database (Denmark)

Rajkumar, Anto P; Christensen, Jane H; Mattheisen, Manuel

2015-01-01

,856) data. Genetic associations between these disorders and single nucleotide polymorphisms within these breakpoint regions were analysed by BioQ, FORGE, and RegulomeDB programmes. RESULTS: Four protein-coding genes [coding for (endonuclease V (ENDOV), neuronal pentraxin I (NPTX1), ring finger protein 213...
Robust expression of a bioactive mammalian protein in chlamydomonas chloroplast

Science.gov (United States)

Mayfield, Stephen P.

2010-03-16

Methods and compositions are disclosed to engineer chloroplasts comprising heterologous mammalian genes via a direct replacement of chloroplast Photosystem II (PSII) reaction center protein coding regions to achieve expression of recombinant protein above 5% of total protein. When algae are used, the algal-expressed protein is produced predominantly as a soluble protein in which the functional activity of the peptide is intact. As the host algae are edible, production of biologics in this organism for oral delivery of proteins/peptides, especially gut-active proteins, without purification is disclosed.
The PIES2012 Code for Calculating 3D Equilibria with Islands and Stochastic Regions

Science.gov (United States)

Monticello, Donald; Reiman, Allan; Raburn, Daniel

2013-10-01

We have made major modifications to the PIES 3D equilibrium code to produce a new version, PIES2012. The new version uses an adaptive radial grid for calculating equilibrium currents. A subset of the flux surfaces conforms closely to island separatrices, providing an accurate treatment of the effects driving the neoclassical tearing mode. There is now a set of grid surfaces that conform to the flux surfaces in the interiors of the islands, allowing the proper treatment of the current profiles in the islands, which play an important role in tearing phenomena. We have verified that we can introduce appropriate current profiles in the islands to suppress their growth, allowing us to simulate situations where islands are allowed to grow at some rational surfaces but not others. Placement of grid surfaces between islands is guided by the locations of high-order fixed points, allowing us to avoid spectral pollution and providing a more robust and smoother convergence of the code. The code now has an option for turning on a vertical magnetic field to fix the position of the magnetic axis, which models the horizontal feedback positioning of a tokamak plasma. The code has a new option for using a Jacobian-Free Newton Krylov scheme for convergence. The code now also contains a model that properly handles stochastic regions with nonzero pressure gradients. Work supported by DOE contract DE-AC02-09CH11466.
Deciphering Systemic Wound Responses of the Pumpkin Extrafascicular Phloem by Metabolomics and Stable Isotope-Coded Protein Labeling

Science.gov (United States)

Gaupels, Frank; Sarioglu, Hakan; Beckmann, Manfred; Hause, Bettina; Spannagl, Manuel; Draper, John; Lindermayr, Christian; Durner, Jörg

2012-01-01

In cucurbits, phloem latex exudes from cut sieve tubes of the extrafascicular phloem (EFP), serving in defense against herbivores. We analyzed inducible defense mechanisms in the EFP of pumpkin (Cucurbita maxima) after leaf damage. As an early systemic response, wounding elicited transient accumulation of jasmonates and a decrease in exudation probably due to partial sieve tube occlusion by callose. The energy status of the EFP was enhanced as indicated by increased levels of ATP, phosphate, and intermediates of the citric acid cycle. Gas chromatography coupled to mass spectrometry also revealed that sucrose transport, gluconeogenesis/glycolysis, and amino acid metabolism were up-regulated after wounding. Combining ProteoMiner technology for the enrichment of low-abundance proteins with stable isotope-coded protein labeling, we identified 51 wound-regulated phloem proteins. Two Sucrose-Nonfermenting1-related protein kinases and a 32-kD 14-3-3 protein are candidate central regulators of stress metabolism in the EFP. Other proteins, such as the Silverleaf Whitefly-Induced Protein1, Mitogen Activated Protein Kinase6, and Heat Shock Protein81, have known defensive functions. Isotope-coded protein labeling and western-blot analyses indicated that Cyclophilin18 is a reliable marker for stress responses of the EFP. As a hint toward the induction of redox signaling, we have observed delayed oxidation-triggered polymerization of the major Phloem Protein1 (PP1) and PP2, which correlated with a decline in carbonylation of PP2. In sum, wounding triggered transient sieve tube occlusion, enhanced energy metabolism, and accumulation of defense-related proteins in the pumpkin EFP. The systemic wound response was mediated by jasmonate and redox signaling. PMID:23085839
Quantifying the mechanisms of domain gain in animal proteins.

Science.gov (United States)

Buljan, Marija; Frankish, Adam; Bateman, Alex

2010-01-01

Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
Fragile X mental retardation protein participates in non-coding RNA pathways.

Science.gov (United States)

Li, En-Hui; Zhao, Xin; Zhang, Ce; Liu, Wei

2018-02-20

Fragile X syndrome is one of the most common forms of inherited intellectual disability. It is caused by mutations of the Fragile X mental retardation 1 (FMR1) gene, resulting in either the loss or abnormal expression of the Fragile X mental retardation protein (FMRP). Recent research showed that FMRP participates in non-coding RNA pathways and plays various important roles in physiology, thereby extending our knowledge of the pathogenesis of Fragile X syndrome. Initial studies showed that the Drosophila FMRP participates in the siRNA and miRNA pathways by interacting with Dicer, Ago1 and Ago2, and is involved in neural activity and the fate determination of germline stem cells. Subsequent studies showed that the Drosophila FMRP participates in the piRNA pathway by interacting with Aub, Ago1 and Piwi in the maintenance of normal chromatin structures and genomic stability. More recent studies showed that FMRP is associated with the lncRNA pathway, suggesting a potential role for this pathway in the clinical manifestations. In this review, we summarize the novel findings and explore the relationship between FMRP and non-coding RNA pathways, particularly the piRNA pathway, thereby providing critical insights into the molecular pathogenesis of Fragile X syndrome and potential translational applications in clinical management of the disease.
Aminotryptophan-containing barstar: structure--function tradeoff in protein design and engineering with an expanded genetic code.

Science.gov (United States)

Rubini, Marina; Lepthien, Sandra; Golbik, Ralph; Budisa, Nediljko

2006-07-01

The indole ring of the canonical amino acid tryptophan (Trp) possesses distinctive features, such as steric bulk, hydrophobicity and a nitrogen atom that is capable of acting as a hydrogen bond donor. The introduction of an amino group into the indole moiety of Trp yields the structural analogs 4-aminotryptophan ((4-NH(2))Trp) and 5-aminotryptophan ((5-NH(2))Trp). Their hydrophobicity and spectral properties are substantially different from those of Trp. They resemble the purine bases of DNA and share their capacity for pH-sensitive intramolecular charge transfer. The Trp --> aminotryptophan substitution in proteins during ribosomal translation is expected to result in related protein variants that acquire these features. These expectations have been fulfilled by incorporating (4-NH(2))Trp and (5-NH(2))Trp into barstar, an intracellular inhibitor of the ribonuclease barnase from Bacillus amyloliquefaciens. The crystal structure of (4-NH(2))Trp-barstar is similar to that of the parent protein, whereas its spectral and thermodynamic behavior is found to be remarkably different. The T(m) value of (4-NH(2))Trp- and (5-NH(2))Trp-barstar is lowered by about 20 degrees Celsius, and they exhibit a strongly reduced unfolding cooperativity and substantial loss of free energy in folding. Furthermore, a folding kinetic study of (4-NH(2))Trp-barstar revealed that the denatured state is even preferred over the native one. The combination of structural and thermodynamic analyses clearly shows how the structures of substituted barstar display a typical structure-function tradeoff: the acquisition of unique pH-sensitive charge transfer as a novel function is achieved at the expense of protein stability. These findings provide new insight into the evolution of the amino acid repertoire of the universal genetic code and highlight possible problems regarding protein engineering and design using an expanded genetic code.
Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

Directory of Open Access Journals (Sweden)

Ruan Jishou

2007-04-01

Abstract Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are
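The representation based on frequencies of k-spaced amino acid pairs mentioned above can be sketched in a few lines. This is an illustrative reading, not the FlexRP implementation: it assumes that a k-spaced pair means two residues with exactly k other residues between them (so k = 0 gives adjacent dipeptides), and the function name and normalisation by the number of pairs are assumptions.

```python
from collections import Counter


def k_spaced_pair_frequencies(sequence, k):
    """Relative frequency of each residue pair separated by exactly k residues.

    Assumption: positions i and i + k + 1 form a k-spaced pair, and frequencies
    are normalised by the number of such pairs in the sequence.
    """
    gap = k + 1
    n_pairs = len(sequence) - gap
    if n_pairs <= 0:
        return {}
    counts = Counter(sequence[i] + sequence[i + gap] for i in range(n_pairs))
    return {pair: count / n_pairs for pair, count in counts.items()}


if __name__ == "__main__":
    peptide = "ACDAC"  # toy sequence for illustration
    for k in (0, 1):
        print(k, k_spaced_pair_frequencies(peptide, k))
```

With 20 amino acids each value of k yields up to 400 pair features, which is why a feature selection step such as the one described in the abstract is useful before training a classifier.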
Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes

Directory of Open Access Journals (Sweden)

Zhang Michael Q

2011-05-01

Abstract Background Gene expression is regulated at both the DNA sequence level and through modification of chromatin. However, the effect of chromatin on tissue/cell-type specific gene regulation (TCSR) is largely unknown. In this paper, we present a method to elucidate the relationship between histone modification/variation (HMV) and TCSR. Results A classifier for differentiating CD4+ T cell-specific genes from housekeeping genes using HMV data was built. We found HMV in both promoter and gene body regions to be predictive of genes which are targets of TCSR. For example, the histone modification types H3K4me3 and H3K27ac were identified as the most predictive for CpG-related promoters, whereas H3K4me3 and H3K79me3 were the most predictive for nonCpG-related promoters. However, genes targeted by TCSR can be predicted using other types of HMVs as well. Such redundancy implies that multiple types of underlying regulatory elements, such as enhancers or intragenic alternative promoters, which can regulate gene expression in a tissue/cell-type specific fashion, may be marked by the HMVs. Finally, we show that the predictive power of HMV for TCSR is not limited to protein-coding genes in CD4+ T cells, as we successfully predicted TCSR targeted genes in muscle cells, as well as microRNA genes with expression specific to CD4+ T cells, by the same classifier which was trained on HMV data of protein-coding genes in CD4+ T cells. Conclusion We have begun to understand the HMV patterns that guide gene expression in both a tissue/cell-type specific and a ubiquitous manner.
Distribution of protein components of wheat from different regions

African Journals Online (AJOL)

kesiena

2012-06-07

Jun 7, 2012 ... The distribution of wheat protein components in different regions was researched to ..... properties of wheat gliadins II. effects on dynamic rheological ... fractions properties of wheat dough depending on molecular size and.
Protein functional features are reflected in the patterns of mRNA translation speed.

Science.gov (United States)

López, Daniel; Pazos, Florencio

2015-07-09

The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency also play a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide polymorphisms (SNPs). In this work we perform the first large-scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This supports the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.
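The degeneracy mentioned at the start of the abstract above can be made concrete by counting how many distinct coding sequences specify the same peptide. The sketch below builds the standard genetic code table (written with DNA letters, T in place of U) and multiplies the per-residue codon counts. The function name is hypothetical, stop codons are ignored, and the example peptide is invented.

```python
from collections import Counter
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Number of codons per amino acid, e.g. 1 for Met and Trp, 6 for Leu, Ser and Arg.
DEGENERACY = Counter(CODON_TABLE.values())


def count_synonymous_mrnas(peptide):
    """Number of distinct coding sequences that translate to the given peptide."""
    return prod(DEGENERACY[residue] for residue in peptide)


if __name__ == "__main__":
    print(count_synonymous_mrnas("MLS"))  # Met (1) x Leu (6) x Ser (6)
```

Because the count is a product over residues, it grows very quickly with peptide length, which gives a sense of how large the space of synonymous mRNAs is for any real protein.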
Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse

Directory of Open Access Journals (Sweden)

Lu Lu

2010-11-01

Full Text Available Background: Sexual dimorphism in brain gene expression has been recognized in several animal species. However, the relevant regulatory mechanisms remain poorly understood. To investigate whether sex-biased gene expression in mammalian brain is globally regulated or locally regulated in diverse brain structures, and to study the genomic organisation of brain-expressed sex-biased genes, we performed a large-scale gene expression analysis of distinct brain regions in adult male and female mice. Results: This study revealed spatial specificity in sex-biased transcription in the mouse brain, and identified 173 sex-biased genes in the striatum; 19 in the neocortex; 12 in the hippocampus and 31 in the eye. Genes located on sex chromosomes were consistently over-represented in all brain regions. Analysis of a subset of genes with sex-bias in more than one tissue revealed Y-encoded male-biased transcripts and X-encoded female-biased transcripts known to escape X-inactivation. In addition, we identified novel coding and non-coding X-linked genes with female-biased expression in multiple tissues. Interestingly, the chromosomal positions of all of the female-biased non-coding genes are in close proximity to protein-coding genes that escape X-inactivation. This defines X-chromosome domains each of which contains a coding and a non-coding female-biased gene. Lack of repressive chromatin marks in non-coding transcribed loci supports the possibility that they escape X-inactivation. Moreover, RNA-DNA combined FISH experiments confirmed the biallelic expression of one such novel domain. Conclusion: This study demonstrated that the number of genes with sex-biased expression varies between individual brain regions in mouse. The sex-biased genes identified are localized on many chromosomes. At the same time, sexually dimorphic gene expression that is common to several parts of the brain is mostly restricted to the sex chromosomes. Moreover, the study uncovered
Trans-acting translational regulatory RNA binding proteins.

Science.gov (United States)

Harvey, Robert F; Smith, Tom S; Mulroney, Thomas; Queiroz, Rayner M L; Pizzinga, Mariavittoria; Dezi, Veronica; Villenueva, Eneko; Ramakrishna, Manasa; Lilley, Kathryn S; Willis, Anne E

2018-05-01

The canonical molecular machinery required for global mRNA translation and its control has been well defined, with distinct sets of proteins involved in the processes of translation initiation, elongation and termination. Additionally, noncanonical, trans-acting regulatory RNA-binding proteins (RBPs) are necessary to provide mRNA-specific translation, and these interact with 5' and 3' untranslated regions and coding regions of mRNA to regulate ribosome recruitment and transit. Recently it has also been demonstrated that trans-acting ribosomal proteins direct the translation of specific mRNAs. Importantly, it has been shown that subsets of RBPs often work in concert, forming distinct regulatory complexes upon different cellular perturbation, creating an RBP combinatorial code, which through the translation of specific subsets of mRNAs, dictate cell fate. With the development of new methodologies, a plethora of novel RNA binding proteins have recently been identified, although the function of many of these proteins within mRNA translation is unknown. In this review we will discuss these methodologies and their shortcomings when applied to the study of translation, which need to be addressed to enable a better understanding of trans-acting translational regulatory proteins. Moreover, we discuss the protein domains that are responsible for RNA binding as well as the RNA motifs to which they bind, and the role of trans-acting ribosomal proteins in directing the translation of specific mRNAs. This article is categorized under: RNA Interactions with Proteins and Other Molecules > RNA-Protein Complexes Translation > Translation Regulation Translation > Translation Mechanisms. © 2018 Medical Research Council and University of Cambridge. WIREs RNA published by Wiley Periodicals, Inc.
Allele-Selective Transcriptome Recruitment to Polysomes Primed for Translation: Protein-Coding and Noncoding RNAs, and RNA Isoforms.

Directory of Open Access Journals (Sweden)

Roshan Mascarenhas

Full Text Available mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs), and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs) in lymphoblast cell lines (LCL) and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T) in ABCB1 (MDR1) on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq) in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs) affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.
Genetic coding and gene expression - new Quadruplet genetic coding model

Science.gov (United States)

Shankar Singh, Rama

2012-07-01

Successful demonstration of human genome project has opened the door not only for developing personalized medicine and cure for genetic diseases, but it may also answer the complex and difficult question of the origin of life. It may lead to making 21st century, a century of Biological Sciences as well. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases forming sequence of amino acids leading to a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.
Improved production of membrane proteins in Escherichia coli by selective codon substitutions

DEFF Research Database (Denmark)

Nørholm, Morten H.H.; Toddo, Stephen; Virkki, Minttu T.I.

2013-01-01

Membrane proteins are extremely challenging to produce in sufficient quantities for biochemical and structural analysis and there is a growing demand for solutions to this problem. In this study we attempted to improve expression of two difficult-to-express coding sequences (araH and narK) for membrane transporters. For both coding sequences, synonymous codon substitutions in the region adjacent to the AUG start led to significant improvements in expression, whereas multi-parameter sequence optimization of codons throughout the coding sequence failed. We conclude that coding sequences can be re...
Genetic variants in promoters and coding regions of the muscle glycogen synthase and the insulin-responsive GLUT4 genes in NIDDM

DEFF Research Database (Denmark)

Bjørbaek, C; Echwald, Søren Morgenthaler; Hubricht, P

1994-01-01

To examine the hypothesis that variants in the regulatory or coding regions of the glycogen synthase (GS) and insulin-responsive glucose transporter (GLUT4) genes contribute to insulin-resistant glucose processing of muscle from non-insulin-dependent diabetes mellitus (NIDDM) patients, promoter...... volunteers. By applying inverse polymerase chain reaction and direct DNA sequencing, 532 base pairs (bp) of the GS promoter were identified and the transcriptional start site determined by primer extension. SSCP scanning of the promoter region detected five single nucleotide substitutions, positioned at 42......'-untranslated region, and the coding region of the GLUT4 gene showed four polymorphisms, all single nucleotide substitutions, positioned at -581, 1, 30, and 582. None of the three changes in the regulatory region of the gene had any major influence on expression of the GLUT4 gene in muscle. The variant at 582...

Natural selection in avian protein-coding genes expressed in brain.

Science.gov (United States)

Axelsson, Erik; Hultin-Rosenberg, Lina; Brandström, Mikael; Zwahlén, Martin; Clayton, David F; Ellegren, Hans

2008-06-01

The evolution of birds from theropod dinosaurs took place approximately 150 million years ago, and was associated with a number of specific adaptations that are still evident among extant birds, including feathers, song and extravagant secondary sexual characteristics. Knowledge about the molecular evolutionary background to such adaptations is lacking. Here, we analyse the evolution of > 5000 protein-coding gene sequences expressed in zebra finch brain by comparison to orthologous sequences in chicken. Mean d(N)/d(S) is 0.085 and genes with their maximal expression in the eye and central nervous system have the lowest mean d(N)/d(S) value, while those expressed in digestive and reproductive tissues exhibit the highest. We find that fast-evolving genes (those which have higher than expected rate of nonsynonymous substitution, indicative of adaptive evolution) are enriched for biological functions such as fertilization, muscle contraction, defence response, response to stress, wounding and endogenous stimulus, and cell death. After alignment to mammalian orthologues, we identify a catalogue of 228 genes that show a significantly higher rate of protein evolution in the two bird lineages than in mammals. These accelerated bird genes, representing candidates for avian-specific adaptations, include genes implicated in vocal learning and other cognitive processes. Moreover, colouration genes evolve faster in birds than in mammals, which may have been driven by sexual selection for extravagant plumage characteristics.
GBA manager: an online tool for querying low-complexity regions in proteins.

Science.gov (United States)

Bandyopadhyay, Nirmalya; Kahveci, Tamer

2010-01-01

We developed GBA Manager, an online software tool that facilitates the use of the Graph-Based Algorithm (GBA) we proposed in our earlier work. GBA identifies the low-complexity regions (LCR) of protein sequences. GBA exploits a similarity matrix, such as BLOSUM62, to compute the complexity of the subsequences of the input protein sequence. It uses a graph-based algorithm to accurately compute the regions that have low complexities. GBA Manager is a user-friendly web service that enables online querying of protein sequences using GBA. In addition to the querying capabilities of the existing GBA algorithm, GBA Manager computes the p-values of the LCRs identified. The p-value gives an estimate of the probability that the region appears by chance. GBA Manager presents the output in three different understandable formats. GBA Manager is freely accessible at http://bioinformatics.cise.ufl.edu/GBA/GBA.htm .
Pathway Detection from Protein Interaction Networks and Gene Expression Data Using Color-Coding Methods and A* Search Algorithms

Directory of Open Access Journals (Sweden)

Cheng-Yu Yeh

2012-01-01

Full Text Available With protein interaction networks and microarray data now widely available, identifying linear paths of biological significance when searching for a potential pathway remains a challenging problem. We propose a color-coding method based on the characteristics of biological network topology and apply heuristic search to speed up the color-coding method. In the experiments, we tested our methods on two datasets: yeast and human prostate cancer networks and gene expression data sets. Comparisons of our method with other existing methods on known yeast MAPK pathways, in terms of precision and recall, show that it finds the maximum number of proteins and performs comparably well. Moreover, our method is more efficient than previous ones and detects paths of length 10 within 40 seconds on a 1.73 GHz Intel CPU with 1 GB of main memory running the Windows operating system.
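As a rough illustration of the color-coding idea mentioned in this abstract, the sketch below implements the generic randomized technique: vertices receive random colors, and dynamic programming over (end vertex, set of used colors) finds a path whose vertices all have different colors, which is therefore simple. It is not the authors' method: it ignores edge weights and gene expression data and omits the A* heuristic. The function name `find_colorful_path` and its parameters are assumptions made for this example, and every neighbour is assumed to appear as a key of `adj`.

```python
import random


def find_colorful_path(adj, k, trials=200, seed=0):
    """Search for a simple path on k vertices using random color-coding.

    adj: dict mapping each vertex to an iterable of its neighbours.
    Returns a list of k distinct vertices forming a path, or None if no
    such path was found within `trials` random colorings.
    """
    rng = random.Random(seed)
    vertices = list(adj)
    for _ in range(trials):
        # Assign each vertex one of k colors uniformly at random.
        color = {v: rng.randrange(k) for v in vertices}
        # paths[(v, mask)] holds one colorful path that ends at v and
        # uses exactly the colors recorded in the bitmask `mask`.
        paths = {(v, 1 << color[v]): [v] for v in vertices}
        for _ in range(k - 1):
            extended = {}
            for (v, mask), path in paths.items():
                for w in adj[v]:
                    bit = 1 << color[w]
                    if mask & bit:
                        continue  # color already on the path, skip w
                    extended.setdefault((w, mask | bit), path + [w])
            paths = extended
            if not paths:
                break  # this coloring admits no longer colorful path
        if paths:
            return next(iter(paths.values()))
    return None


if __name__ == "__main__":
    # Hypothetical toy network: a chain of four proteins.
    toy = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}
    print(find_colorful_path(toy, 4))
```

A single random coloring makes a given k-vertex path colorful with probability k!/k^k, which is why the coloring is repeated many times. A scoring function or heuristic search, as described in the abstract, would replace the arbitrary choice of the first path found.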
Reprint of "Two-stage sparse coding of region covariance via Log-Euclidean kernels to detect saliency".

Science.gov (United States)

Zhang, Ying-Ying; Yang, Cai; Zhang, Ping

2017-08-01

In this paper, we present a novel bottom-up saliency detection algorithm from the perspective of covariance matrices on a Riemannian manifold. Each superpixel is described by a region covariance matrix on Riemannian Manifolds. We carry out a two-stage sparse coding scheme via Log-Euclidean kernels to extract salient objects efficiently. In the first stage, given background dictionary on image borders, sparse coding of each region covariance via Log-Euclidean kernels is performed. The reconstruction error on the background dictionary is regarded as the initial saliency of each superpixel. In the second stage, an improvement of the initial result is achieved by calculating reconstruction errors of the superpixels on foreground dictionary, which is extracted from the first stage saliency map. The sparse coding in the second stage is similar to the first stage, but is able to effectively highlight the salient objects uniformly from the background. Finally, three post-processing methods-highlight-inhibition function, context-based saliency weighting, and the graph cut-are adopted to further refine the saliency map. Experiments on four public benchmark datasets show that the proposed algorithm outperforms the state-of-the-art methods in terms of precision, recall and mean absolute error, and demonstrate the robustness and efficiency of the proposed method. Copyright © 2017 Elsevier Ltd. All rights reserved.
Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces.

NARCIS (Netherlands)

Venselaar, H.; Beek, T.A.H. te; Kuipers, R.K.P.; Hekkelman, M.L.; Vriend, G.

2010-01-01

BACKGROUND: Many newly detected point mutations are located in protein-coding regions of the human genome. Knowledge of their effects on the protein's 3D structure provides insight into the protein's mechanism, can aid the design of further experiments, and eventually can lead to the development of
Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces

NARCIS (Netherlands)

Venselaar, H.; Beek, T.A.H.; Kuipers, R.K.P.; Hekkelman, M.L.; Vriend, G.

2010-01-01

Background: Many newly detected point mutations are located in protein-coding regions of the human genome. Knowledge of their effects on the protein's 3D structure provides insight into the protein's mechanism, can aid the design of further experiments, and eventually can lead to the development of
Function and Application Areas in Medicine of Non-Coding RNA

Directory of Open Access Journals (Sweden)

Figen Guzelgul

2009-06-01

Full Text Available RNA is the genetic material that converts the genetic code it receives from DNA into protein. Less than 2% of RNA is converted into protein, whereas more than 98% cannot be converted into protein and is termed non-coding RNA. About 70% of non-coding RNAs consist of introns, while the remainder consist of exons. Non-coding RNAs are classified by size and by function: by size, into long non-coding and small non-coding RNAs; by function, into housekeeping non-coding RNAs and regulatory non-coding RNAs. For many years these non-coding RNAs were considered non-functional. Today, however, it has been shown that they play a role in regulating genes and in the structural, functional and catalytic roles of RNAs that are converted into protein. Because of their role in the gene silencing mechanism, non-coding RNAs have led to significant developments, particularly in medicine. RNAi technology, which is used to design drugs for the treatment of various diseases, is a ray of hope for the medical world. [Archives Medical Review Journal 2009; 18(3): 141-155]
Molecular characterization of amino acid deletion in VP1 (1D) protein and novel amino acid substitutions in 3D polymerase protein of foot and mouth disease virus subtype A/Iran87.

Science.gov (United States)

Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen

2011-12-01

The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
Gene divergence of homeologous regions associated with a major seed protein content QTL in soybean

Directory of Open Access Journals (Sweden)

Puji Lestari

2013-06-01

Full Text Available Understanding the several modes of duplication that contribute to present genome structure is attracting attention because they could be related to numerous agronomically important traits. Since soybean serves as a rich protein source for animal feed and human consumption, breeding efforts in soybean have been directed toward enhancing seed protein content. The publicly available soybean sequences and their annotated genomic elements facilitate the understanding of quantitative trait loci (QTL) for seed protein content in relation to homeologous regions in the soybean genome. Although parts of chromosome (Chr) 20 and Chr 10 show synteny, QTLs for seed protein content are present only on Chr 20. In a comparative analysis of gene content in recently duplicated genomic regions harboring QTL for protein/oil content on Chrs 20 and 10, a total of 27 genes were found in the duplicated regions of both chromosomes. Notably, 4 tandem duplicates of the putative homeobox protein 22 (HB22) are present only on Chr 20, and the Medicago truncatula homolog of this gene is expressed in endosperm at the seed filling stage. These tandem duplicates could contribute to the protein/oil QTL on Chr 20. Our study suggests that non-shared gene content within the duplicated genomic regions might lead to the absence/presence of QTL related to protein/oil content.
The water-borne protein signals (pheromones) of the Antarctic ciliated protozoan Euplotes nobilii: structure of the gene coding for the En-6 pheromone.

Science.gov (United States)

La Terza, Antonietta; Dobri, Nicoleta; Alimenti, Claudio; Vallesi, Adriana; Luporini, Pierangelo

2009-01-01

The marine Antarctic ciliate, Euplotes nobilii, secretes a family of water-borne signal proteins, denoted as pheromones, which control vegetative proliferation and mating in the cell. Based on the knowledge of the amino acid sequences of a set of these pheromones isolated from the culture supernatant of wild-type strains, we designed probes to identify their encoding genes in the cell somatic nucleus (macronucleus). The full-length gene of the pheromone En-6 was determined and found to contain an open-reading frame specific for the synthesis of the En-6 cytoplasmic precursor (pre-pro-En-6), which requires 2 proteolytic cleavages to remove the signal peptide (pre) and the prosegment before secretion of the mature protein. In contrast to the sequence variability that distinguishes the secreted pheromones, the pre- and pro-sequences appear to be tightly conserved and useful for the construction of probes to clone every other E. nobilii pheromone gene. Potential intron sequences in the coding region of the En-6 gene imply the synthesis of more En-6 isoforms.
Fine-tuning of protein domain boundary by minimizing potential coiled coil regions

International Nuclear Information System (INIS)

Iwaya, Naoko; Goda, Natsuko; Unzai, Satoru; Fujiwara, Kenichiro; Tanaka, Toshiki; Tomii, Kentaro; Tochio, Hidehito; Shirakawa, Masahiro; Hiroaki, Hidekazu

2007-01-01

Structural determination of individual protein domains isolated from multidomain proteins is a common approach in the post-genomic era. Novel and thus uncharacterized domains liberated from intact proteins often self-associate due to incorrectly defined domain boundaries. Self-association results in missing signals, poor signal dispersion and a low signal-to-noise ratio in 1H-15N HSQC spectra. We have found that a putative, non-canonical coiled coil region close to a domain boundary can cause transient hydrophobic self-association and monomer-dimer equilibrium in solution. Here we propose a rational method to predict putative coiled coil regions adjacent to the globular core domain using the program COILS. Except for the amino acid sequence, no preexisting knowledge concerning the domain is required. A small number of mutant proteins with a minimized coiled coil region have been rationally designed and tested. The engineered domains exhibit decreased self-association as assessed by 1H-15N HSQC spectra with improved peak dispersion and sharper cross peaks. Two successful examples of isolating novel N-terminal domains from AAA-ATPases are demonstrated. Our method is useful for the experimental determination of domain boundaries suited for structural genomics studies.
Fine-tuning of protein domain boundary by minimizing potential coiled coil regions.

Science.gov (United States)

Iwaya, Naoko; Goda, Natsuko; Unzai, Satoru; Fujiwara, Kenichiro; Tanaka, Toshiki; Tomii, Kentaro; Tochio, Hidehito; Shirakawa, Masahiro; Hiroaki, Hidekazu

2007-01-01

Structural determination of individual protein domains isolated from multidomain proteins is a common approach in the post-genomic era. Novel and thus uncharacterized domains liberated from intact proteins often self-associate due to incorrectly defined domain boundaries. Self-association results in missing signals, poor signal dispersion and a low signal-to-noise ratio in (1)H-(15)N HSQC spectra. We have found that a putative, non-canonical coiled coil region close to a domain boundary can cause transient hydrophobic self-association and monomer-dimer equilibrium in solution. Here we propose a rational method to predict putative coiled coil regions adjacent to the globular core domain using the program COILS. Except for the amino acid sequence, no preexisting knowledge concerning the domain is required. A small number of mutant proteins with a minimized coiled coil region have been rationally designed and tested. The engineered domains exhibit decreased self-association as assessed by (1)H-(15)N HSQC spectra with improved peak dispersion and sharper cross peaks. Two successful examples of isolating novel N-terminal domains from AAA-ATPases are demonstrated. Our method is useful for the experimental determination of domain boundaries suited for structural genomics studies.
Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation

International Nuclear Information System (INIS)

O'Hara, P.J.; Grant, F.J.; Haldeman, B.A.; Gray, C.L.; Insley, M.Y.; Hagen, F.S.; Murray, M.J.

1987-01-01

Activated factor VII (factor VIIa) is a vitamin K-dependent plasma serine protease that participates in a cascade of reactions leading to the coagulation of blood. Two overlapping genomic clones containing sequences encoding human factor VII were isolated and characterized. The complete sequence of the gene was determined and found to span about 12.8 kilobases. The mRNA for factor VII as demonstrated by cDNA cloning is polyadenylylated at multiple sites but contains only one AAUAAA poly(A) signal sequence. The mRNA can undergo alternative splicing, forming one transcript containing eight segments as exons and another with an additional exon that encodes a larger prepro leader sequence. The latter transcript has no known counterpart in the other vitamin K-dependent proteins. The positions of the introns with respect to the amino acid sequence encoded by the eight essential exons of factor VII are the same as those present in factor IX, factor X, protein C, and the first three exons of prothrombin. These exons code for domains generally conserved among members of this gene family. The comparable introns in these genes, however, are dissimilar with respect to size and sequence, with the exception of intron C in factor VII and protein C. The gene for factor VII also contains five regions made up of tandem repeats of oligonucleotide monomer elements. More than a quarter of the intron sequences and more than a third of the 3' untranslated portion of the mRNA transcript consist of these minisatellite tandem repeats
Comparative sequence analysis of acid sensitive/resistance proteins in Escherichia coli and Shigella flexneri

Science.gov (United States)

Manikandan, Selvaraj; Balaji, Seetharaaman; Kumar, Anil; Kumar, Rita

2007-01-01

The molecular basis for the survival of bacteria under extreme conditions in which growth is inhibited is a question of great current interest. A preliminary study was carried out to determine residue pattern conservation among the antiporters of enteric bacteria responsible for extreme acid sensitivity, especially in Escherichia coli and Shigella flexneri. Here we found molecular evidence for the relationship between E. coli and S. flexneri. Multiple sequence alignment of the gadC-coded acid-sensitive antiporter showed many conserved residue patterns at regular intervals in the N-terminal region. The number of conserved residues decreases as the alignment approaches the C-terminus, indicating that the N-terminal region of this protein has a more active role than the carboxyl-terminal region. The motif FHLVFFLLLGG is well conserved at the amino terminus across the gadC-coded proteins. The motif is also partially conserved among other antiporters that are not coded by gadC but are involved in the acid sensitivity/resistance mechanism. Phylogenetic cluster analysis supports the relationship of Escherichia coli and Shigella flexneri. The gadC-coded proteins converge as a clade and diverge from other antiporters belonging to the amino acid-polyamine-organocation (APC) superfamily. PMID:21670792
Evolutionary modeling and prediction of non-coding RNAs in Drosophila.

Directory of Open Access Journals (Sweden)

Robert K Bradley

2009-08-01

Full Text Available We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3' end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA.
Association of sequences in the coat protein/readthrough domain of potato mop-top virus with transmission by Spongospora subterranea.

Science.gov (United States)

Reavy, B; Arif, M; Cowan, G H; Torrance, L

1998-10-01

A monofungal culture of Spongospora subterranea was unable to acquire and transmit the T isolate of potato mop-top pomovirus (PMTV-T), which has been maintained by manual transmission in the laboratory for 30 years. A recently obtained field isolate (PMTV-S) was efficiently acquired and transmitted by the same fungus culture. Sequence analysis of the readthrough (RT) protein-coding region of PMTV-S showed the presence of an additional 543 nt in the 3' half of the coding region relative to that of PMTV-T. These additional nucleotides preserved the reading frame of the RT protein and inserted 181 amino acids into the RT protein. This was confirmed by a comparison by immunoblotting of the sizes of the RT protein of PMTV-T and other recent isolates of PMTV.
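The reading-frame statement above is simple arithmetic: an insertion preserves the frame only if its length is a multiple of three, and each codon adds one residue. A minimal sketch, with the hypothetical helper name `inserted_residues`:

```python
def inserted_residues(n_nucleotides: int):
    """Return the number of amino acids added by an in-frame insertion.

    Returns None when the insertion length is not a multiple of three,
    i.e. when it would shift the reading frame.
    """
    codons, remainder = divmod(n_nucleotides, 3)
    return codons if remainder == 0 else None


if __name__ == "__main__":
    # 543 nt is the insertion size reported for PMTV-S.
    print(inserted_residues(543))
```

For the 543 nt reported here the helper returns 181, matching the 181 amino acids stated in the abstract.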
Identification of the antigenic region of Neospora caninum dense granule protein 7 using ELISA.

Science.gov (United States)

Abdelbaky, Hanan H; Fereig, Ragab M; Nishikawa, Yoshifumi

2018-06-26

Dense granule protein 7 (NcGRA7) is a potent diagnostic antigen of Neospora caninum. Some studies have reported on the difficult expression, low yield, and variable degree of solubility of recombinant NcGRA7. We aimed to unravel the possible causes for these issues and tested NcGRA7 antigenicity in enzyme linked immunosorbent assays (ELISAs). The NcGRA7 coding sequence (217 amino acids) was split into five amino acid regions: NcGRA7m (27-217), NcGRA7m3 (27-160), NcGRA7m4 (27-135), NcGRA7m5 (161-190), and NcGRA7m6 (188-217). Three fragments, NcGRA7m, NcGRA7m3 and NcGRA7m4, exhibited high antigenic properties when tested against experimentally-infected mice and dog sera by ELISA. High levels of IgG2 antibodies against NcGRA7m were observed in field dog sera. In experimentally and naturally-infected cattle, the N. caninum-specific sera only reacted with NcGRA7m, indicating that this antigenic region differs among the three animal species. This study presents valuable information about the antigenic properties and topology of NcGRA7, and highlights its suitability for diagnostic purposes. Copyright © 2018. Published by Elsevier B.V.
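The fragment boundaries quoted above are 1-based inclusive residue ranges. The sketch below shows how such ranges map onto sequence slices and fragment lengths. The dictionary `FRAGMENTS` copies the ranges from the abstract, while the helper names and the placeholder sequence are assumptions for illustration only; the real NcGRA7 sequence is not reproduced here.

```python
# Residue ranges (1-based, inclusive) as quoted in the abstract.
FRAGMENTS = {
    "NcGRA7m": (27, 217),
    "NcGRA7m3": (27, 160),
    "NcGRA7m4": (27, 135),
    "NcGRA7m5": (161, 190),
    "NcGRA7m6": (188, 217),
}


def fragment_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive residue range."""
    return end - start + 1


def extract_fragment(protein: str, start: int, end: int) -> str:
    """Slice a 1-based inclusive residue range out of a protein string."""
    return protein[start - 1:end]


if __name__ == "__main__":
    # Placeholder of the stated full length (217 residues), NOT the
    # actual NcGRA7 sequence.
    placeholder = "X" * 217
    for name, (start, end) in FRAGMENTS.items():
        piece = extract_fragment(placeholder, start, end)
        print(name, start, end, len(piece))
```

Note that NcGRA7m5 (161-190) and NcGRA7m6 (188-217) overlap by three residues, so the fragments do not partition the sequence.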
A retroviral oncogene, akt, encoding a serine-threonine kinase containing an SH2-like region.

Science.gov (United States)

Bellacosa, A; Testa, J R; Staal, S P; Tsichlis, P N

1991-10-11

The v-akt oncogene codes for a 105-kilodalton fusion phosphoprotein containing Gag sequences at its amino terminus. Sequence analysis of v-akt and biochemical characterization of its product revealed that it codes for a protein kinase C-related serine-threonine kinase whose cellular homolog is expressed in most tissues, with the highest amount found in thymus. Although Akt is a serine-threonine kinase, part of its regulatory region is similar to the Src homology-2 domain, a structural motif characteristic of cytoplasmic tyrosine kinases that functions in protein-protein interactions. This suggests that Akt may form a functional link between tyrosine and serine-threonine phosphorylation pathways.
CpG + CpNpG Analysis of Protein-Coding Sequences from Tomato

DEFF Research Database (Denmark)

Hobolth, Asger; Nielsen, Rasmus; Wang, Ying

2006-01-01

We develop codon-based models for simultaneously inferring the mutational effects of CpG and CpNpG methylation in coding regions. In a data set of 369 tomato genes, we show that there is very little effect of CpNpG methylation but a strong effect of CpG methylation affecting almost all genes. We...... further show that the CpNpG and CpG effects are largely uncorrelated. Our results suggest different roles of CpG and CpNpG methylation, with CpNpG methylation possibly playing a specialized role in defense against transposons and RNA viruses....
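To make the two sequence contexts concrete, the sketch below simply counts CpG and CpNpG sites in a coding sequence. It is only an illustration of what the contexts are, not the codon-based mutational model described in the record; the function name and the choice to let N be any base (so that CGG and CCG overlap with CpG sites) are assumptions.

```python
import re


def count_methylation_contexts(seq: str) -> dict:
    """Count CpG and CpNpG sites on the given strand of a DNA sequence.

    Zero-width lookaheads are used so that overlapping sites are all counted.
    N is taken to be any of A, C, G, T (an assumption of this sketch).
    """
    seq = seq.upper()
    cpg = len(re.findall(r"(?=CG)", seq))
    cpnpg = len(re.findall(r"(?=C[ACGT]G)", seq))
    return {"CpG": cpg, "CpNpG": cpnpg}


# Hypothetical toy sequence, not taken from the tomato data set.
counts = count_methylation_contexts("ACGCAGCCGG")
```

Counting both strands, or excluding N = G so that the two contexts do not overlap, would be straightforward variations.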
A central role for ubiquitination within a circadian clock protein modification code

Directory of Open Access Journals (Sweden)

Katarina Stojkovic

2014-08-01

Circadian rhythms, endogenous cycles of about 24 h in physiology, are generated by a master clock located in the suprachiasmatic nucleus of the hypothalamus and other clocks located in the brain and peripheral tissues. Circadian disruption is known to increase the incidence of various illnesses, such as mental disorders, metabolic syndrome and cancer. At the molecular level, periodicity is established by a set of clock genes via autoregulatory translation-transcription feedback loops. This clock mechanism is regulated by post-translational modifications such as phosphorylation and ubiquitination, which set the pace of the clock. Ubiquitination in particular has been found to regulate the stability of core clock components, but also other clock protein functions. Mutation of genes encoding ubiquitin ligases can cause either elongation or shortening of the endogenous circadian period. Recent research has also started to uncover roles for deubiquitination in the molecular clockwork. Here we review the role of the ubiquitin pathway in regulating the circadian clock and we propose that ubiquitination is a key element in a clock protein modification code that orchestrates clock mechanisms and circadian behavior over the daily cycle.

Disease-associated mutations disrupt functionally important regions of intrinsic protein disorder.

Directory of Open Access Journals (Sweden)

Vladimir Vacic

The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome, together with their lack of fixed structure and low sequence conservation, raises a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7-2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with the five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. The results of this study
Evaluation of 10 genes encoding cardiac proteins in Doberman Pinschers with dilated cardiomyopathy.

Science.gov (United States)

O'Sullivan, M Lynne; O'Grady, Michael R; Pyle, W Glen; Dawson, John F

2011-07-01

To identify a causative mutation for dilated cardiomyopathy (DCM) in Doberman Pinschers by sequencing the coding regions of 10 cardiac genes known to be associated with familial DCM in humans. 5 Doberman Pinschers with DCM and congestive heart failure and 5 control mixed-breed dogs that were euthanized or died. RNA was extracted from frozen ventricular myocardial samples from each dog, and first-strand cDNA was synthesized via reverse transcription, followed by PCR amplification with gene-specific primers. Ten cardiac genes were analyzed: cardiac actin, α-actinin, α-tropomyosin, β-myosin heavy chain, metavinculin, muscle LIM protein, myosin-binding protein C, tafazzin, titin-cap (telethonin), and troponin T. Sequences for DCM-affected and control dogs and the published canine genome were compared. None of the coding sequences yielded a common causative mutation among all Doberman Pinscher samples. However, 3 variants were identified in the α-actinin gene in the DCM-affected Doberman Pinschers. One of these variants, identified in 2 of the 5 Doberman Pinschers, resulted in an amino acid change in the rod-forming triple coiled-coil domain. Mutations in the coding regions of several genes associated with DCM in humans did not appear to consistently account for DCM in Doberman Pinschers. However, an α-actinin variant was detected in some Doberman Pinschers that may contribute to the development of DCM given its potential effect on the structure of this protein. Investigation of additional candidate gene coding and noncoding regions and further evaluation of the role of α-actinin in development of DCM in Doberman Pinschers are warranted.
Coding sequence of human rho cDNAs clone 6 and clone 9

Energy Technology Data Exchange (ETDEWEB)

Chardin, P; Madaule, P; Tavitian, A

1988-03-25

The authors have isolated human cDNAs including the complete coding sequence for two rho proteins corresponding to the incomplete isolates previously described as clone 6 and clone 9. The deduced a.a. sequences, when compared to the a.a. sequence deduced from clone 12 cDNA, show that there are at least three highly homologous rho genes in humans. They suggest that clone 12 be named rhoA, clone 6 rhoB and clone 9 rhoC. RhoA, B and C proteins display approx. 30% a.a. identity with ras proteins, mainly clustered in four highly homologous internal regions corresponding to the GTP binding site; however, at least one significant difference is found: the 3 rho proteins have an alanine in the position corresponding to ras glycine 13, suggesting that rho and ras proteins might have slightly different biochemical properties.
Linkage disequilibrium of evolutionarily conserved regions in the human genome

Directory of Open Access Journals (Sweden)

Johnson Todd A

2006-12-01

Background: The strong linkage disequilibrium (LD) recently found in genic or exonic regions of the human genome demonstrated that LD can be increased by evolutionary mechanisms that select for functionally important loci. This suggests that LD might be stronger in regions conserved among species than in non-conserved regions, since regions exposed to natural selection tend to be conserved. To assess this hypothesis, we used genome-wide polymorphism data from the HapMap project and investigated LD within DNA sequences conserved between the human and mouse genomes. Results: Unexpectedly, we observed that LD was significantly weaker in conserved regions than in non-conserved regions. To investigate why, we examined sequence features that may distort the relationship between LD and conserved regions. We found that interspersed repeats, and not other sequence features, were associated with the weak LD tendency in conserved regions. To appropriately understand the relationship between LD and conserved regions, we removed the effect of repetitive elements and found that the high degree of sequence conservation was strongly associated with strong LD in coding regions but not with that in non-coding regions. Conclusion: Our work demonstrates that the degree of sequence conservation does not simply increase LD as predicted by the hypothesis. Rather, it implies that purifying selection changes the polymorphic patterns of coding sequences but has little influence on the patterns of functional units such as regulatory elements present in non-coding regions, since the former are generally restricted by the constraint of maintaining a functional protein product across multiple exons while the latter may exist more as individually isolated units.
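For readers unfamiliar with how pairwise LD is quantified, the sketch below computes the standard statistics D, D' and r^2 for two biallelic loci from one haplotype frequency and two allele frequencies. The record does not say which LD measure the authors used, so this is a generic illustration; the function name and inputs are assumptions.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, D', r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a, p_b: frequencies of allele A and allele B
    D' is returned with its sign (D divided by its maximum attainable magnitude).
    """
    for p in (p_a, p_b):
        if not 0.0 < p < 1.0:
            raise ValueError("both loci must be polymorphic (0 < p < 1)")
    d = p_ab - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


# Hypothetical frequencies: complete association, then independence.
complete = ld_stats(0.5, 0.5, 0.5)
independent = ld_stats(0.25, 0.5, 0.5)
```

In the first toy case both D' and r^2 equal 1; in the second, all three statistics are 0.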
Human growth hormone-related iatrogenic Creutzfeldt-Jakob disease: Search for a genetic susceptibility by analysis of the PRNP coding region

Energy Technology Data Exchange (ETDEWEB)

Jaegly, A.; Boussin, F.; Deslys, J.P. [CEA/CRSSA/DSV/DPTE, Fontenay-aux-Roses (France)] [and others

1995-05-20

The human PRNP gene encoding PrP is located on chromosome 20 and consists of two exons and a single intron. The open reading frame is entirely fitted into the second exon. Genetic studies indicate that all of the familial and several sporadic forms of TSSEs are associated with mutations in the PRNP 759-bp coding region. Moreover, homozygosity at codon 129, a locus harboring a polymorphism among the general population, was proposed as a genetic susceptibility marker for both sporadic and iatrogenic CJD. To assess whether additional genetic predisposition markers exist in the PRNP gene, the authors sequenced the PRNP coding region of 17 of the 32 French patients who developed a hGH-related CJD.
Cloning of human genes encoding novel G protein-coupled receptors

Energy Technology Data Exchange (ETDEWEB)

Marchese, A.; Docherty, J.M.; Heiber, M. [Univ. of Toronto, (Canada)] [and others

1994-10-01

We report the isolation and characterization of several novel human genes encoding G protein-coupled receptors. Each of the receptors contained the familiar seven transmembrane topography and most closely resembled peptide binding receptors. Gene GPR1 encoded a receptor protein that is intronless in the coding region and that shared identity (43% in the transmembrane regions) with the opioid receptors. Northern blot analysis revealed that GPR1 transcripts were expressed in the human hippocampus, and the gene was localized to chromosome 15q21.6. Gene GPR2 encoded a protein that most closely resembled an interleukin-8 receptor (51% in the transmembrane regions), and this gene, not expressed in the six brain regions examined, was localized to chromosome 17q2.1-q21.3. A third gene, GPR3, showed identity (56% in the transmembrane regions) with a previously characterized cDNA clone from rat and was localized to chromosome 1p35-p36.1. 31 refs., 5 figs., 1 tab.
Development of Coolant Radioactivity Interpretation Code

International Nuclear Information System (INIS)

Kim, Kiyoung; Jung, Youngsuk; Kim, Kyounghyun; Kim, Jangwook

2013-01-01

In Korea, coolant radioactivity analysis has been performed using the computer codes of foreign companies, such as CADE (Westinghouse), IODYNE and CESIUM (ABB-CE). However, these codes are too conservative and involve considerable errors. Furthermore, because they are DOS-based programs, they are not easy to operate. An enhanced analysis algorithm is therefore required, applying an analytical method that reflects the changed operational environments of domestic nuclear power plants, together with fuel failure evaluation software designed for user convenience. We have developed a nuclear fuel failure evaluation code that can estimate the number of failed fuel rods and the burn-up of failed fuel during a nuclear power plant operation cycle. The Coolant Radio-activity Interpretation Code (CRIC) for LWRs was developed as an output of the project 'Development of Fuel Reliability Enhanced Technique', organized by the Korea Institute of Energy Technology Evaluation and Planning (KETEP). CRIC is Windows-based software that evaluates the number of failed fuel rods and the burn-up of the failed fuel region by analyzing the coolant radioactivity of an operating LWR. CRIC is based on the fission product release model commonly known as the 'three region model' (pellet region, gap region, and coolant region), and we are verifying its results against cases of domestic fuel failures. CRIC users can estimate the number of failed fuel rods, the burn-up, and the regions of failed fuel, taking into account the enrichment and power distribution of the fuel region, by using operational cycle data, coolant activity data, the fuel loading pattern, and the Cs-134/Cs-137 ratio as a function of burn-up and U-235 enrichment provided in the code. With the development of CRIC, a unique domestic fuel failure evaluation code has been secured, and it is expected to have the following significant meaning. This is that the code reflecting a proprietary technique for quantitatively
Sequencing the GRHL3 Coding Region Reveals Rare Truncating Mutations and a Common Susceptibility Variant for Nonsyndromic Cleft Palate

Science.gov (United States)

Mangold, Elisabeth; Böhmer, Anne C.; Ishorst, Nina; Hoebel, Ann-Kathrin; Gültepe, Pinar; Schuenke, Hannah; Klamt, Johanna; Hofmann, Andrea; Gölz, Lina; Raff, Ruth; Tessmann, Peter; Nowak, Stefanie; Reutter, Heiko; Hemprich, Alexander; Kreusch, Thomas; Kramer, Franz-Josef; Braumann, Bert; Reich, Rudolf; Schmidt, Gül; Jäger, Andreas; Reiter, Rudolf; Brosch, Sibylle; Stavusis, Janis; Ishida, Miho; Seselgyte, Rimante; Moore, Gudrun E.; Nöthen, Markus M.; Borck, Guntram; Aldhorae, Khalid A.; Lace, Baiba; Stanier, Philip; Knapp, Michael; Ludwig, Kerstin U.

2016-01-01

Nonsyndromic cleft lip with/without cleft palate (nsCL/P) and nonsyndromic cleft palate only (nsCPO) are the most frequent subphenotypes of orofacial clefts. A common syndromic form of orofacial clefting is Van der Woude syndrome (VWS) where individuals have CL/P or CPO, often but not always associated with lower lip pits. Recently, ∼5% of VWS-affected individuals were identified with mutations in the grainy head-like 3 gene (GRHL3). To investigate GRHL3 in nonsyndromic clefting, we sequenced its coding region in 576 Europeans with nsCL/P and 96 with nsCPO. Most strikingly, nsCPO-affected individuals had a higher minor allele frequency for rs41268753 (0.099) than control subjects (0.049; p = 1.24 × 10^-2). This association was replicated in nsCPO/control cohorts from Latvia, Yemen, and the UK (p_combined = 2.63 × 10^-5; OR_allelic = 2.46 [95% CI 1.6–3.7]) and reached genome-wide significance in combination with imputed data from a GWAS in nsCPO triads (p = 2.73 × 10^-9). Notably, rs41268753 is not associated with nsCL/P (p = 0.45). rs41268753 encodes the highly conserved p.Thr454Met (c.1361C>T) (GERP = 5.3), which prediction programs denote as deleterious, has a CADD score of 29.6, and increases protein binding capacity in silico. Sequencing also revealed four novel truncating GRHL3 mutations including two that were de novo in four families, where all nine individuals harboring mutations had nsCPO. This is important for genetic counseling: given that VWS is rare compared to nsCPO, our data suggest that dominant GRHL3 mutations are more likely to cause nonsyndromic than syndromic CPO. Thus, with rare dominant mutations and a common risk variant in the coding region, we have identified an important contribution for GRHL3 in nsCPO. PMID:27018475
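As a rough illustration of how an allelic odds ratio follows from minor allele frequencies, the sketch below applies the textbook definition to the two frequencies quoted for the sequenced cohort (0.099 in nsCPO cases, 0.049 in controls). This is an assumption-laden back-of-the-envelope calculation, not the authors' analysis: it ignores sample sizes and confidence intervals, and it is not expected to reproduce the reported combined estimate of 2.46, which the abstract attributes to the replication cohorts.

```python
def allelic_odds_ratio(maf_cases: float, maf_controls: float) -> float:
    """Odds of carrying the minor allele in cases divided by the odds in controls."""
    for maf in (maf_cases, maf_controls):
        if not 0.0 < maf < 1.0:
            raise ValueError("allele frequencies must lie strictly between 0 and 1")
    odds_cases = maf_cases / (1.0 - maf_cases)
    odds_controls = maf_controls / (1.0 - maf_controls)
    return odds_cases / odds_controls


# Frequencies quoted in the abstract for rs41268753.
or_discovery = allelic_odds_ratio(0.099, 0.049)  # roughly 2.13
```

The value of about 2.13 indicates the same direction and a similar magnitude of effect as the reported combined odds ratio.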
Direct Profiling the Post-Translational Modification Codes of a Single Protein Immobilized on a Surface Using Cu-free Click Chemistry.

Science.gov (United States)

Kim, Kyung Lock; Park, Kyeng Min; Murray, James; Kim, Kimoon; Ryu, Sung Ho

2018-05-23

Combinatorial post-translational modifications (PTMs), which can serve as dynamic "molecular barcodes", have been proposed to regulate distinct protein functions. However, studies of combinatorial PTMs on single protein molecules have been hindered by a lack of suitable analytical methods. Here, we describe erasable single-molecule blotting (eSiMBlot) for combinatorial PTM profiling. This assay is performed in a highly multiplexed manner and leverages the benefits of covalent protein immobilization, cyclic probing with different antibodies, and single molecule fluorescence imaging. In particular, facile and efficient covalent immobilization on a surface using Cu-free click chemistry permits multiple rounds (>10) of antibody erasing/reprobing without loss of antigenicity. Moreover, cumulative detection of coregistered multiple data sets for immobilized single-epitope molecules, such as HA peptide, can be used to increase the antibody detection rate. Finally, eSiMBlot enables direct visualization and quantitative profiling of combinatorial PTM codes at the single-molecule level, as we demonstrate by revealing the novel phospho-codes of ligand-induced epidermal growth factor receptor. Thus, eSiMBlot provides an unprecedentedly simple, rapid, and versatile platform for analyzing the vast number of combinatorial PTMs in biological pathways.
Definition of IgG- and albumin-binding regions of streptococcal protein G.

Science.gov (United States)

Akerström, B; Nielsen, E; Björck, L

1987-10-05

Protein G, the immunoglobulin G-binding surface protein of group C and G streptococci, also binds serum albumin. The albumin-binding site on protein G is distinct from the immunoglobulin G-binding site. By mild acid hydrolysis of the papain-liberated protein G fragment (35 kDa), a 28-kDa fragment was produced which retained full immunoglobulin G-binding activity (determined by Scatchard plotting) but had lost all albumin-binding capacity. A protein G (65 kDa), isolated after cloning and expression of the protein G gene in Escherichia coli, had comparable affinity to immunoglobulin G (5-10 × 10^10 M^-1), but much higher affinity to albumin than the 35- and 28-kDa protein G fragments (31, 2.6, and 0 × 10^9 M^-1, respectively). The amino-terminal amino acid sequences of the 65-, 35-, and 28-kDa fragments allowed us to exactly locate the three fragments in an overall sequence map of protein G, based on the partial gene sequences published by Guss et al. (Guss, B., Eliasson, M., Olsson, A., Uhlen, M., Frej, A.-K., Jörnvall, H., Flock, J.-I., and Lindberg, M. (1986) EMBO J. 5, 1567-1575) and Fahnestock et al. (Fahnestock, S. R., Alexander, P., Nagle, J., and Filpula, D. (1986) J. Bacteriol. 167, 870-880). From this map, the locations of three homologous albumin-binding regions and three homologous immunoglobulin G-binding regions could then be deduced.
Region-specific protein misfolding cyclic amplification reproduces brain tropism of prion strains.

Science.gov (United States)

Privat, Nicolas; Levavasseur, Etienne; Yildirim, Serfildan; Hannaoui, Samia; Brandel, Jean-Philippe; Laplanche, Jean-Louis; Béringue, Vincent; Seilhean, Danielle; Haïk, Stéphane

2017-10-06

Human prion diseases such as Creutzfeldt-Jakob disease are transmissible brain proteinopathies, characterized by the accumulation of a misfolded isoform of the host cellular prion protein (PrP) in the brain. According to the prion model, prions are defined as proteinaceous infectious particles composed solely of this abnormal isoform of PrP (PrPSc). Even in the absence of genetic material, various prion strains can be propagated in experimental models. They can be distinguished by the pattern of disease they produce and especially by the localization of PrPSc deposits within the brain and the spongiform lesions they induce. The mechanisms involved in this strain-specific targeting of distinct brain regions still are a fundamental, unresolved question in prion research. To address this question, we exploited a prion conversion in vitro assay, protein misfolding cyclic amplification (PMCA), by using experimental scrapie and human prion strains as seeds and specific brain regions from mice and humans as substrates. We show here that region-specific PMCA in part reproduces the specific brain targeting observed in experimental, acquired, and sporadic Creutzfeldt-Jakob diseases. Furthermore, we provide evidence that, in addition to cellular prion protein, other region- and species-specific molecular factors influence the strain-dependent prion conversion process. This important step toward understanding prion strain propagation in the human brain may impact research on the molecular factors involved in protein misfolding and the development of ultrasensitive methods for diagnosing prion disease. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants.

Science.gov (United States)

Fu, Wenqing; O'Connor, Timothy D; Jun, Goo; Kang, Hyun Min; Abecasis, Goncalo; Leal, Suzanne M; Gabriel, Stacey; Rieder, Mark J; Altshuler, David; Shendure, Jay; Nickerson, Deborah A; Bamshad, Michael J; Akey, Joshua M

2013-01-10

Establishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history and will help to facilitate the development of new approaches for disease-gene discovery. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth, notable for an excess of rare genetic variants, suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European American and African American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that approximately 73% of all protein-coding SNVs and approximately 86% of SNVs predicted to be deleterious arose in the past 5,000-10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs than other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the Out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, show the profound effect of recent human history on the burden of deleterious SNVs segregating in contemporary populations, and provide important practical information that can be used to prioritize variants in disease-gene discovery.
The artificial zinc finger coding gene 'Jazz' binds the utrophin promoter and activates transcription.

Science.gov (United States)

Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C

2000-06-01

Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.
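The nine base pair target described above can be broken into the three triplets that a three-finger peptide would address, one triplet per finger. The sketch below does only that, plus the reverse complement of the site for the opposite strand. Function names are illustrative, and the sketch does not encode the recognition 'code' itself or say which finger contacts which triplet.

```python
TARGET_SITE = "GCTGCTGCG"  # 5'-GCT-GCT-GCG-3', as given in the abstract


def split_into_triplets(site: str) -> list:
    """Split a DNA target site into consecutive 3-bp subsites."""
    if len(site) % 3 != 0:
        raise ValueError("site length must be a multiple of 3")
    return [site[i:i + 3] for i in range(0, len(site), 3)]


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


subsites = split_into_triplets(TARGET_SITE)     # ['GCT', 'GCT', 'GCG']
opposite_strand = reverse_complement(TARGET_SITE)  # 'CGCAGCAGC'
```

Three subsites for three fingers is consistent with the abstract's description of a three-zinc finger peptide engineered against a nine base pair sequence.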
Transport code and nuclear data in intermediate energy region

Energy Technology Data Exchange (ETDEWEB)

Hasegawa, Akira; Odama, Naomitsu [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment; Maekawa, F.; Ueki, K.; Kosaka, K.; Oyama, Y.

1998-11-01

We briefly review the problems of intermediate-energy nuclear data files and transport codes in connection with the processing of the data. This is a summary from our group in the task force on JENDL High Energy File Integral Evaluation (JHEFIE). In this article we stress the necessity of producing an intermediate-energy evaluated nuclear data file up to 3 GeV for application to accelerator-driven transmutation (ADT) systems. We also state the necessity, from a strategic point of view, of having our own transport code system to calculate radiation fields using these evaluated files, so that our development of ADT technology remains completely free from outside conditions such as imported codes and data with poor maintenance or unknown accuracy. (author)
Cloning of the cDNA for U1 small nuclear ribonucleoprotein particle 70K protein from Arabidopsis thaliana

Science.gov (United States)

Reddy, A. S.; Czernik, A. J.; An, G.; Poovaiah, B. W.

1992-01-01

We cloned and sequenced a plant cDNA that encodes U1 small nuclear ribonucleoprotein (snRNP) 70K protein. The plant U1 snRNP 70K protein cDNA is not full length and lacks the coding region for 68 amino acids in the amino-terminal region as compared to human U1 snRNP 70K protein. Comparison of the deduced amino acid sequence of the plant U1 snRNP 70K protein with the amino acid sequence of animal and yeast U1 snRNP 70K protein showed a high degree of homology. The plant U1 snRNP 70K protein is more closely related to the human counterpart than to the yeast 70K protein. The carboxy-terminal half is less well conserved but, like the vertebrate 70K proteins, is rich in charged amino acids. Northern analysis with the RNA isolated from different parts of the plant indicates that the snRNP 70K gene is expressed in all of the parts tested. Southern blotting of genomic DNA using the cDNA indicates that the U1 snRNP 70K protein is coded by a single gene.
Sequencing and Characterization of Novel PII Signaling Protein Gene in Microalga Haematococcus pluvialis

Directory of Open Access Journals (Sweden)

Ruijuan Ma

2017-10-01

The PII signaling protein is a key protein for controlling nitrogen assimilatory reactions in most organisms, but little information is reported on PII proteins of the green microalga Haematococcus pluvialis. Since H. pluvialis cells can produce a large amount of astaxanthin upon nitrogen starvation, its PII protein may represent an important factor in the elevated production of Haematococcus astaxanthin. This study identified and isolated the coding gene (HpGLB1) from this microalga. The full length of HpGLB1 was 1222 bp, including a 621 bp coding sequence (CDS), a 103 bp 5′ untranslated region (5′ UTR), and a 498 bp 3′ untranslated region (3′ UTR). The CDS could encode a protein of 206 amino acids (HpPII). Its calculated molecular weight (Mw) was 22.4 kDa and the theoretical isoelectric point was 9.53. When H. pluvialis cells were exposed to nitrogen starvation, HpGLB1 expression increased 2.46 times in 48 h, concomitant with the rise in astaxanthin content. This study also used phylogenetic analysis to show that HpPII was homologous to the PII proteins of other green microalgae. The results form a fundamental basis for future study of HpPII and its potential physiological function in Haematococcus astaxanthin biosynthesis.
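The lengths reported in this record are mutually consistent, which a short calculation confirms. The sketch below assumes that the 621 bp CDS includes a terminal stop codon (the abstract does not say so explicitly); the function and field names are illustrative.

```python
def transcript_summary(utr5_bp: int, cds_bp: int, utr3_bp: int) -> dict:
    """Summarise a transcript from its 5' UTR, CDS and 3' UTR lengths in bp.

    Assumes the CDS length includes the stop codon, so the encoded protein
    has one residue fewer than the number of codons.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_bp // 3
    return {
        "total_bp": utr5_bp + cds_bp + utr3_bp,
        "codons": codons,
        "residues": codons - 1,  # stop codon does not encode a residue
    }


# Values quoted for HpGLB1: 103 bp 5' UTR, 621 bp CDS, 498 bp 3' UTR.
hpglb1 = transcript_summary(103, 621, 498)
```

Under that assumption the total is 1222 bp and the protein has 206 residues, both matching the figures in the abstract.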
Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

Science.gov (United States)

Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

2018-06-03

Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.
Porcine lung surfactant protein B gene (SFTPB)

DEFF Research Database (Denmark)

Cirera Salicio, Susanna; Fredholm, Merete

2008-01-01

The porcine surfactant protein B (SFTPB) gene is a single-copy gene on chromosome 3. Three different cDNAs for SFTPB have been isolated and sequenced. Nucleotide sequence comparison revealed six nonsynonymous single nucleotide polymorphisms (SNPs), four synonymous SNPs and an in-frame deletion of 69...... bp in the region coding for the active protein. Northern analysis showed lung-specific expression of three different isoforms of the SFTPB transcript. The expression level of the SFTPB gene is low in 50-day-old fetuses and increases during lung development. Quantitative real-time polymerase chain...
Simulations of the broad line region of NGC 5548 with CLOUDY code: Temperature determination

Directory of Open Access Journals (Sweden)

Ilić D.

2007-01-01

In this paper an analysis of the physical properties of the Broad Line Region (BLR) of the active galaxy NGC 5548 is presented. Using the photoionization code CLOUDY and the measurements of Peterson et al. (2002), the physical conditions of the BLR are simulated and the BLR temperature is obtained. This temperature was compared to the temperature estimated with the Boltzmann-Plot (BP) method (Popović et al. 2007). It was shown that the measured variability in the BLR temperature could be due to the change in the hydrogen density.

Functional dissection of Streptococcus pyogenes M5 protein: the hypervariable region is essential for virulence.

Directory of Open Access Journals (Sweden)

Johan Waldemarsson

The surface-localized M protein of Streptococcus pyogenes is a major virulence factor that inhibits phagocytosis, as determined ex vivo. Because little is known about the role of M protein in vivo, we analyzed the contribution of different M protein regions to virulence, using the fibrinogen (Fg)-binding M5 protein and a mouse model of acute invasive infection. This model was suitable, because M5 is required for mouse virulence and binds mouse and human Fg equally well, as shown here. Mixed infection experiments with wild type bacteria demonstrated that mutants lacking the N-terminal hypervariable region (HVR) or the Fg-binding B-repeat region were strongly attenuated, while a mutant lacking the conserved C-repeats was only slightly attenuated. Because the HVR of M5 is not required for phagocytosis resistance, our data imply that this HVR plays a major but unknown role during acute infection. The B-repeat region is required for phagocytosis resistance and specifically binds Fg, suggesting that it promotes virulence by binding Fg. However, B-repeat mutants were attenuated even in Fg-deficient mice, implying that the B-repeats may have a second function, in addition to Fg-binding. These data demonstrate that two distinct M5 regions, including the HVR, are essential to virulence during the early stages of an infection. In particular, our data provide the first in vivo evidence that the HVR of an M protein plays a major role in virulence, focusing interest on the molecular role of this region.
DeepCNF-D: Predicting Protein Order/Disorder Regions by Weighted Deep Convolutional Neural Fields

Directory of Open Access Journals (Sweden)

Sheng Wang

2015-07-01

Intrinsically disordered proteins or protein regions are involved in key biological processes including regulation of transcription, signal transduction, and alternative splicing. Accurately predicting order/disorder regions ab initio from the protein sequence is a prerequisite step for further analysis of functions and mechanisms for these disordered regions. This work presents a learning method, weighted DeepCNF (Deep Convolutional Neural Fields), to improve the accuracy of order/disorder prediction by exploiting the long-range sequential information and the interdependency between adjacent order/disorder labels and by assigning different weights for each label during training and prediction to solve the label imbalance issue. Evaluated on the CASP9 and CASP10 targets, our method obtains 0.855 and 0.898 AUC values, which are higher than the state-of-the-art single ab initio predictors.
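The label-weighting idea in this abstract can be illustrated with a minimal sketch. It is not the DeepCNF-D implementation: the inverse-frequency weighting scheme and the function names `inverse_frequency_weights` and `weighted_log_loss` are assumptions chosen for illustration, and the abstract does not say how the authors set their weights.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Assumed scheme: weight each label by n / (k * count), so rare labels count more."""
    counts = Counter(labels)
    n = len(labels)
    k = len(counts)
    return {label: n / (k * count) for label, count in counts.items()}


def weighted_log_loss(labels, prob_disorder, weights):
    """Weighted negative log-likelihood for binary labels (1 = disorder, 0 = order)."""
    total = 0.0
    norm = 0.0
    for y, p in zip(labels, prob_disorder):
        p_true = p if y == 1 else 1.0 - p  # probability assigned to the true label
        total += -weights[y] * math.log(p_true)
        norm += weights[y]
    return total / norm


# Toy example: 9 ordered residues and 1 disordered residue.
labels = [0] * 9 + [1]
weights = inverse_frequency_weights(labels)
loss = weighted_log_loss(labels, [0.5] * 10, weights)
```

With these weights the single disordered residue contributes as much to the loss as the nine ordered residues combined, which is one common way to counter label imbalance.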
PATACSDB—the database of polyA translational attenuators in coding sequences

Directory of Open Access Journals (Sweden)

Malgorzata Habich

2016-02-01

Recent additions to the repertoire of gene expression regulatory mechanisms are polyadenylate (polyA) tracks encoding poly-lysine runs in protein sequences. Such tracks stall the translation apparatus and induce frameshifting independently of the effects of charged nascent poly-lysine sequence on the ribosome exit channel. As such, they substantially influence the stability of mRNA and the amount of protein produced from a given transcript. Single base changes in these regions are enough to exert a measurable response on both protein and mRNA abundance; this makes each of these sequences a potentially interesting case study for the effects of synonymous mutation, gene dosage balance and natural frameshifting. Here we present PATACSDB, a resource that contains a comprehensive list of polyA tracks from over 250 eukaryotic genomes. Our data are based on the Ensembl genomic database of coding sequences and filtered with the 12A-1 algorithm, which selects polyA tracks with a minimal length of 12 A’s, allowing for one mismatched base. The PATACSDB database is accessible at: http://sysbio.ibb.waw.pl/patacsdb. The source code is available at http://github.com/habich/PATACSDB, and it includes the scripts with which the database can be recreated.
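The 12A-1 criterion described above (at least 12 bases with at most one non-A base) can be sketched as a sliding-window scan. This is an illustrative reading of the abstract, not the code from the PATACSDB repository; the function name `find_polya_tracks` and the decision to merge overlapping windows into one interval are assumptions.

```python
def find_polya_tracks(seq, window=12, max_mismatch=1):
    """Return merged half-open (start, end) intervals covered by windows of
    `window` bases that contain at most `max_mismatch` non-A bases."""
    seq = seq.upper()
    hits = []
    if len(seq) < window:
        return hits
    # Count non-A bases in the first window, then update incrementally.
    non_a = sum(1 for base in seq[:window] if base != "A")
    for start in range(len(seq) - window + 1):
        if start > 0:
            non_a -= seq[start - 1] != "A"           # base leaving the window
            non_a += seq[start + window - 1] != "A"  # base entering the window
        if non_a <= max_mismatch:
            end = start + window
            if hits and start <= hits[-1][1]:
                hits[-1] = (hits[-1][0], end)  # extend the previous interval
            else:
                hits.append((start, end))
    return hits


# Toy coding sequence with one qualifying track (a single G inside a run of A's).
example = "GGC" + "AAAAAGAAAAAA" + "CCT"
tracks = find_polya_tracks(example)
```

In the toy sequence, the only qualifying window starts at index 3; shifting it by one base in either direction brings in a second mismatch, so the track is reported as the interval (3, 15).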
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor

International Nuclear Information System (INIS)

Antalis, T.M.; Clark, M.A.; Barnes, T.; Lehrbach, P.R.; Devine, P.L.; Schevzov, G.; Goss, N.H.; Stephens, R.W.; Tolstoshev, P.

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the λ P_L promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated M_r of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators
Characterisation of silent and active genes for a variable large protein of Borrelia recurrentis

Directory of Open Access Journals (Sweden)

Scragg Ian G

2002-10-01

BACKGROUND: We report the characterisation of the variable large protein (vlp) gene expressed by clinical isolate A1 of Borrelia recurrentis, the agent of the life-threatening disease louse-borne relapsing fever. METHODS: The major vlp protein of this isolate was characterised and a DNA probe created. This probe, together with standard molecular methods, was used to determine the location of the vlp1 (B. recurrentis A1) gene in both this and other isolates. RESULTS: This isolate was found to carry silent and expressed copies of the vlp1 (B. recurrentis A1) gene on plasmids of 54 kbp and 24 kbp respectively, whereas a different isolate, A17, had only the silent vlp1 (B. recurrentis A17) on a 54 kbp plasmid. Silent and expressed vlp1 have identical mature protein coding regions but have different 5' regions, both containing different potential lipoprotein leader sequences. Only one form of vlp1 is transcribed in the A1 isolate of B. recurrentis, yet both 5' upstream sequences of this vlp1 gene possess features of bacterial promoters. CONCLUSION: Taken together these results suggest that antigenic variation in B. recurrentis may result from recombination of variable large and small protein genes at the junction between lipoprotein leader sequence and mature protein coding region. However, this hypothetical model needs to be validated by further identification of expressed and silent variant protein genes in other B. recurrentis isolates.
Direct ATP photolabeling of Escherichia coli recA proteins: identification of regions required for ATP binding

International Nuclear Information System (INIS)

Banks, G.R.; Sedgwick, S.G.

1986-01-01

When the Escherichia coli RecA protein is UV irradiated in the presence of [α-32P]ATP, a labeled protein-ATP adduct is formed. All the experimental evidence indicates that, in forming such an adduct, the ATP becomes specifically immobilized in the catalytically relevant ATP binding site. The adduct can also be identified after irradiation of E. coli cell lysates in a similar manner. This direct ATP photolabeling of RecA proteins has been used to identify regions of the polypeptide chain involved in the binding of ATP. The photolabeling of a RecA protein that lacks wild-type carboxy-terminal amino acids is not detectable. A RecA protein in which the amino-terminal sequence NH2-Ala-Ile-Asp-Glu-Asn- is replaced by NH2-Thr-Met-Ile-Thr-Asn-Ser-Ser-Ser- is only about 5% as efficiently photolabeled as the wild-type protein. Both of these RecA protein constructions, however, contain all the elements previously implicated, directly or indirectly, in the binding of ATP. ATP-photolabeled RecA protein has also been chemically cleaved at specific amino acids in order to identify regions of the polypeptide chain to which the nucleotide becomes covalently photolinked. The evidence is consistent with a region comprising amino acids 116-170. Thus, this work and that of others suggest that several disparate regions of the unfolded polypeptide chain may combine to form the ATP binding site upon protein folding or may influence binding through long-range effects.
New PAH gene promoter KLF1 and 3'-region C/EBPalpha motifs influence transcription in vitro.

Science.gov (United States)

Klaassen, Kristel; Stankovic, Biljana; Kotur, Nikola; Djordjevic, Maja; Zukic, Branka; Nikcevic, Gordana; Ugrin, Milena; Spasovski, Vesna; Srzentic, Sanja; Pavlovic, Sonja; Stojiljkovic, Maja

2017-02-01

Phenylketonuria (PKU) is a metabolic disease caused by mutations in the phenylalanine hydroxylase (PAH) gene. Although the PAH genotype remains the main determinant of PKU phenotype severity, genotype-phenotype inconsistencies have been reported. In this study, we focused on unanalysed sequences in non-coding PAH gene regions to assess their possible influence on the PKU phenotype. We transiently transfected HepG2 cells with various chloramphenicol acetyl transferase (CAT) reporter constructs which included PAH gene non-coding regions. Selected non-coding regions were indicated by in silico prediction to contain transcription factor binding sites. Furthermore, electrophoretic mobility shift assay (EMSA) and supershift assays were performed to identify which transcription factors were engaged in the interaction. We found a novel KLF1 motif in the PAH promoter, which decreases CAT activity by 50% in comparison to basal transcription in vitro. The cytosine at the c.-170 promoter position creates an additional binding site for the protein complex involving the KLF1 transcription factor. Moreover, we assessed for the first time the role of a multivariant variable number tandem repeat (VNTR) region located in the 3'-region of the PAH gene. We found that the VNTR3, VNTR7 and VNTR8 constructs had approximately 60% of CAT activity. The regulation is mediated by the C/EBPalpha transcription factor, present in the protein complex binding to VNTR3. Our study highlighted two novel motifs in the PAH gene, a promoter KLF1 motif and a 3'-region C/EBPalpha motif, which decrease transcription in vitro and, thus, could be considered as PAH expression modifiers. New transcription motifs in non-coding regions will contribute to better understanding of the PKU phenotype complexity and may become important for the optimisation of PKU treatment.
Novel methods for the molecular discrimination of Fasciola spp. on the basis of nuclear protein-coding genes.

Science.gov (United States)

Shoriki, Takuya; Ichikawa-Seki, Madoka; Suganuma, Keisuke; Naito, Ikunori; Hayashi, Kei; Nakao, Minoru; Aita, Junya; Mohanta, Uday Kumar; Inoue, Noboru; Murakami, Kenji; Itagaki, Tadashi

2016-06-01

Fasciolosis is an economically important disease of livestock caused by Fasciola hepatica, Fasciola gigantica, and aspermic Fasciola flukes. The aspermic Fasciola flukes have been discriminated morphologically from the two other species by the absence of sperm in their seminal vesicles. To date, the molecular discrimination of F. hepatica and F. gigantica has relied on the nucleotide sequences of the internal transcribed spacer 1 (ITS1) region. However, ITS1 genotypes of aspermic Fasciola flukes cannot be clearly differentiated from those of F. hepatica and F. gigantica. Therefore, more precise and robust methods are required to discriminate Fasciola spp. In this study, we developed PCR restriction fragment length polymorphism and multiplex PCR methods to discriminate F. hepatica, F. gigantica, and aspermic Fasciola flukes on the basis of the nuclear protein-coding genes, phosphoenolpyruvate carboxykinase and DNA polymerase delta, which are single locus genes in most eukaryotes. All aspermic Fasciola flukes used in this study had a mixed fragment pattern of F. hepatica and F. gigantica for both of these genes, suggesting that the flukes are descended through hybridization between the two species. These molecular methods will facilitate the identification of F. hepatica, F. gigantica, and aspermic Fasciola flukes, and will also prove useful in etiological studies of fasciolosis. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
The PPARγ coding region and its role in visceral obesity

International Nuclear Information System (INIS)

Boon Yin, Khoo; Najimudin, Nazalan; Muhammad, Tengku Sifzizul Tengku

2008-01-01

Peroxisome proliferator-activated receptor gamma (PPARγ) is a ligand-activated transcription factor that plays many essential biological roles in higher organisms. PPARγ is mainly expressed in adipose tissue. It regulates the transcriptional activity of genes by binding with other transcription factors. In our work and that of other research groups, the PPARγ coding region has been found to be closest to that of monkey. Thus, the monkey is a more suitable animal model for future PPARγ studies, although mice and rats are frequently used. PPARγ is involved in regulating alterations of adipose tissue mass that result from changes in mature adipocyte size and/or number through a complex interplay process called adipogenesis. However, the role of PPARγ in negatively regulating the process of adipogenesis remains unclear. This review may help us investigate the differential expression of key transcription factors in adipose tissue in response to a visceral obesity-inducing diet in vivo. The study may also provide valuable information to define a more appropriate physiological condition in adipogenesis, which may help to prevent diseases caused by negative regulation of the transcription factors in adipose tissue.
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

Directory of Open Access Journals (Sweden)

Christian J. Michel

2017-12-01

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

Science.gov (United States)

Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

2017-12-03

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first
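A small sketch may help make the notion of an X motif concrete. The abstract does not list the 20 trinucleotides; the set below is the one usually given in the circular-code literature and should be treated as an assumption to be checked against the paper. The function names are hypothetical, and "cardinality" is taken here to mean the number of distinct trinucleotides in a motif, which is also an assumption about the authors' definition.

```python
# Assumed content of the circular code X (not given in the abstract).
X = frozenset(
    "AAC AAT ACC ATC ATT CAG CTC CTG GAA GAC "
    "GAG GAT GCC GGC GGT GTA GTC GTT TAC TTC".split()
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string over the alphabet ACGT."""
    return seq.translate(COMPLEMENT)[::-1]


def is_self_complementary(code):
    """True if the reverse complement of every trinucleotide is also in the code."""
    return all(reverse_complement(tri) in code for tri in code)


def find_motifs(seq, code, frame=0, min_cardinality=4):
    """Return (start, n_trinucleotides, cardinality) for each maximal run of
    consecutive in-frame trinucleotides that all belong to `code`."""
    motifs = []
    run = []
    run_start = frame

    def flush():
        if run and len(set(run)) >= min_cardinality:
            motifs.append((run_start, len(run), len(set(run))))

    for pos in range(frame, len(seq) - 2, 3):
        tri = seq[pos:pos + 3]
        if tri in code:
            if not run:
                run_start = pos
            run.append(tri)
        else:
            flush()
            run = []
    flush()
    return motifs


# Toy sequence: four distinct X trinucleotides, a non-X trinucleotide, then one more.
toy = "AACGATGCCTTC" + "TTT" + "AAC"
frame0 = find_motifs(toy, X, frame=0)
frame1 = find_motifs(toy, X, frame=1)
```

Scanning the same sequence in a shifted frame finds no motif in this toy case, which mirrors the kind of frame comparison the abstract describes.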
DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach.

Directory of Open Access Journals (Sweden)

Zhiheng Wang

The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological processes, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. DisoMCS is based on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at both termini of an ordered region adjoining a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions, a residue is classified as ordered or disordered according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of accuracy of prediction when compared with well-established publicly available disordered region predictors. They also indicated that our approach was more accurate when a query has higher homology with the knowledge database. DisoMCS is available at http://cal.tongji.edu.cn/disorder/.
MHC class I–associated peptides derive from selective regions of the human genome

Science.gov (United States)

Pearson, Hillary; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Thibault, Pierre

2016-01-01

MHC class I–associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology. PMID:27841757
Interleaved Product LDPC Codes

OpenAIRE

Baldi, Marco; Cancellieri, Giovanni; Chiaraluce, Franco

2011-01-01

Product LDPC codes take advantage of LDPC decoding algorithms and the high minimum distance of product codes. We propose to add suitable interleavers to improve the waterfall performance of LDPC decoding. Interleaving also reduces the number of low-weight codewords, which gives a further advantage in the error floor region.
Natural type 3/type 2 intertypic vaccine-related poliovirus recombinants with the first crossover sites within the VP1 capsid coding region.

Science.gov (United States)

Zhang, Yong; Zhu, Shuangli; Yan, Dongmei; Liu, Guiyan; Bai, Ruyin; Wang, Dongyan; Chen, Li; Zhu, Hui; An, Hongqiu; Kew, Olen; Xu, Wenbo

2010-12-21

Ten uncommon natural type 3/type 2 intertypic poliovirus recombinants were isolated from stool specimens from nine acute flaccid paralysis case patients and one healthy vaccinee in China from 2001 to 2008. Complete genomic sequences revealed their vaccine-related genomic features and showed that their first crossover sites were randomly distributed in the 3' end of the VP1 coding region. The length of donor Sabin 2 sequences ranged from 55 to 136 nucleotides, which is the longest donor sequence reported in the literature for this type of poliovirus recombination. The recombination resulted in the introduction of Sabin 2 neutralizing antigenic site 3a (NAg3a) into a Sabin 3 genomic background in the VP1 coding region, which may have been altered by some of the type 3-specific antigenic properties, but had not acquired any type 2-specific characterizations. NAg3a of the Sabin 3 strain seems atypical; other wild-type poliovirus isolates that have circulated in recent years have sequences of NAg3a more like the Sabin 2 strain. Ten natural type 3/type 2 intertypic VP1 capsid-recombinant polioviruses, in which the first crossover sites were found to be in the VP1 coding region, were isolated and characterized. In spite of the complete replacement of NAg3a by type 2-specific amino acids, the serotypes of the recombinants were not altered, and they were totally neutralized by polyclonal type 3 antisera but not at all by type 2 antisera. It is possible that recent type 3 wild poliovirus isolates may be recombinants having NAg3a sequences derived from another strain between 1967 and 1980, and the type 3/type 2 recombination events in the 3' end of the VP1 coding region may result in a higher fitness.
Natural type 3/type 2 intertypic vaccine-related poliovirus recombinants with the first crossover sites within the VP1 capsid coding region.

Directory of Open Access Journals (Sweden)

Yong Zhang

BACKGROUND: Ten uncommon natural type 3/type 2 intertypic poliovirus recombinants were isolated from stool specimens from nine acute flaccid paralysis case patients and one healthy vaccinee in China from 2001 to 2008. PRINCIPAL FINDINGS: Complete genomic sequences revealed their vaccine-related genomic features and showed that their first crossover sites were randomly distributed in the 3' end of the VP1 coding region. The length of donor Sabin 2 sequences ranged from 55 to 136 nucleotides, which is the longest donor sequence reported in the literature for this type of poliovirus recombination. The recombination resulted in the introduction of Sabin 2 neutralizing antigenic site 3a (NAg3a) into a Sabin 3 genomic background in the VP1 coding region, which may have been altered by some of the type 3-specific antigenic properties, but had not acquired any type 2-specific characterizations. NAg3a of the Sabin 3 strain seems atypical; other wild-type poliovirus isolates that have circulated in recent years have sequences of NAg3a more like the Sabin 2 strain. CONCLUSIONS: Ten natural type 3/type 2 intertypic VP1 capsid-recombinant polioviruses, in which the first crossover sites were found to be in the VP1 coding region, were isolated and characterized. In spite of the complete replacement of NAg3a by type 2-specific amino acids, the serotypes of the recombinants were not altered, and they were totally neutralized by polyclonal type 3 antisera but not at all by type 2 antisera. It is possible that recent type 3 wild poliovirus isolates may be recombinants having NAg3a sequences derived from another strain between 1967 and 1980, and the type 3/type 2 recombination events in the 3' end of the VP1 coding region may result in a higher fitness.
Hypothyroidism coordinately and transiently affects myelin protein gene expression in most rat brain regions during postnatal development.

Science.gov (United States)

Ibarrola, N; Rodríguez-Peña, A

1997-03-28

To assess the role of thyroid hormone on myelin gene expression, we have studied the effect of hypothyroidism on the mRNA steady state levels for the major myelin protein genes: myelin basic protein (MBP), proteolipid protein (PLP), myelin-associated glycoprotein (MAG) and 2':3'-cyclic nucleotide 3'-phosphodiesterase (CNP) in different rat brain regions, during the first postnatal month. We found that hypothyroidism reduces the levels of every myelin protein transcript, with striking differences between the different brain regions. Thus, in the more caudal regions, the effect of hypothyroidism was extremely modest, being only evident at the earlier stages of myelination. In contrast, in the striatum and the cerebral cortex the important decrease in the myelin protein transcripts is maintained beyond the first postnatal month. Therefore, thyroid hormone modulates in a synchronous fashion the expression of the myelin genes and the length of its effect depends on the brain region. On the other hand, hyperthyroidism leads to an increase of the major myelin protein transcripts above control values. Finally, lack of thyroid hormone does not change the expression of the oligodendrocyte progenitor-specific gene, the platelet derived growth factor receptor alpha.
Protein profiles of serum, brain regions and hypophyses of pubertal ...

African Journals Online (AJOL)

The effects of dietary fumonisin B1 (FB1), a toxin produced mainly by Fusarium verticillioides and F. proliferatum that grow on maize worldwide, on protein profiles of serum, brain regions and hypophyses were studied in 24 male Large White weanling pigs randomly divided into four groups (n = 6). In a completely ...
Expression of Tau Pathology-Related Proteins in Different Brain Regions: A Molecular Basis of Tau Pathogenesis.

Science.gov (United States)

Hu, Wen; Wu, Feng; Zhang, Yanchong; Gong, Cheng-Xin; Iqbal, Khalid; Liu, Fei

2017-01-01

Microtubule-associated protein tau is hyperphosphorylated and aggregated in affected neurons in Alzheimer disease (AD) brains. The tau pathology starts from the entorhinal cortex (EC), spreads to the hippocampus and frontal and temporal cortices, and finally to all isocortex areas, but the cerebellum is spared from tau lesions. The molecular basis of differential vulnerability of different brain regions to tau pathology is not understood. In the present study, we analyzed brain regional expressions of tau and tau pathology-related proteins. We found that tau was hyperphosphorylated at multiple sites in the frontal cortex (FC), but not in the cerebellum, from AD brain. The level of tau expression in the cerebellum was about 1/4 of that seen in the frontal and temporal cortices in human brain. In the rat brain, the expression level of tau with three microtubule-binding repeats (3R-tau) was comparable in the hippocampus, EC, FC, parietal-temporal cortex (PTC), occipital-temporal cortex (OTC), striatum, thalamus, olfactory bulb (OB) and cerebellum. However, the expression level of 4R-tau was the highest in the EC and the lowest in the cerebellum. Tau phosphatases, kinases, microtubule-related proteins and other tau pathology-related proteins were also expressed in a region-specific manner in the rat brain. These results suggest that higher levels of tau and tau kinases in the EC and low levels of these proteins in the cerebellum may account for the vulnerability and resistance of these representative brain regions to the development of tau pathology, respectively. The present study provides the regional expression profiles of tau and tau pathology-related proteins in the brain, which may help understand the brain regional vulnerability to tau pathology in neurodegenerative tauopathies.
Protein structural similarity search by Ramachandran codes

Directory of Open Access Journals (Sweden)

Chang Chih-Hung

2007-08-01

Full Text Available Abstract. Background: Protein structural data has increased exponentially, such that fast and accurate tools are necessary for structure similarity searches. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results: We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation). SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to that of Combinatorial Extension (CE), and it works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion: As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools should be applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era.
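The idea of reducing a three-dimensional backbone to a one-dimensional text string can be illustrated with a minimal sketch. This is not the SARST encoding: the abstract describes a Ramachandran map organized by nearest-neighbor clustering, whereas the sketch below assumes a plain regular grid over the (phi, psi) plane, and the function name `encode_backbone`, the 90-degree bin size and the letter assignment are all hypothetical choices made only for illustration.

```python
import string


def encode_backbone(angles, bin_deg=90):
    """Map a list of (phi, psi) pairs in degrees to a text string.

    Assumption: a regular grid over the Ramachandran plane stands in
    for the clustered map described in the abstract.
    """
    bins_per_axis = 360 // bin_deg
    letters = string.ascii_uppercase
    if bins_per_axis ** 2 > len(letters):
        raise ValueError("bin_deg too small for a 26-letter alphabet")
    encoded = []
    for phi, psi in angles:
        # Shift angles from [-180, 180] to [0, 360) and find the grid cell.
        i = int((phi + 180) % 360 // bin_deg)
        j = int((psi + 180) % 360 // bin_deg)
        encoded.append(letters[i * bins_per_axis + j])
    return "".join(encoded)


# Two helix-like residues followed by one strand-like residue.
print(encode_backbone([(-60, -45), (-60, -45), (-120, 130)]))
```

Once structures are strings of this kind, ordinary sequence alignment tools can in principle compare them, which is the design choice the abstract credits for the speed-up.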

Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions.

Science.gov (United States)

Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Uversky, Vladimir N; Obradovic, Zoran

2007-05-01

Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our
Lightweight Object Tracking in Compressed Video Streams Demonstrated in Region-of-Interest Coding

Directory of Open Access Journals (Sweden)

Lerouge Sam

2007-01-01

Full Text Available Video scalability is a recent video coding technology that allows content providers to offer multiple quality versions from a single encoded video file in order to target different kinds of end-user devices and networks. One form of scalability utilizes the region-of-interest concept, that is, the possibility to mark objects or zones within the video as more important than the surrounding area. The scalable video coder ensures that these regions-of-interest are received by an end-user device before the surrounding area and preferably in higher quality. In this paper, novel algorithms are presented making it possible to automatically track the marked objects in the regions of interest. Our methods detect the overall motion of a designated object by retrieving the motion vectors calculated during the motion estimation step of the video encoder. Using this knowledge, the region-of-interest is translated, thus following the objects within. Furthermore, the proposed algorithms allow adequate resizing of the region-of-interest. By using the available information from the video encoder, object tracking can be done in the compressed domain and is suitable for real-time and streaming applications. A time-complexity analysis is given for the algorithms proving the low complexity thereof and the usability for real-time applications. The proposed object tracking methods are generic and can be applied to any codec that calculates the motion vector field. In this paper, the algorithms are implemented within MPEG-4 fine-granularity scalability codec. Different tests on different video sequences are performed to evaluate the accuracy of the methods. Our novel algorithms achieve a precision up to 96.4%.
Lightweight Object Tracking in Compressed Video Streams Demonstrated in Region-of-Interest Coding

Directory of Open Access Journals (Sweden)

Rik Van de Walle

2007-01-01

Full Text Available Video scalability is a recent video coding technology that allows content providers to offer multiple quality versions from a single encoded video file in order to target different kinds of end-user devices and networks. One form of scalability utilizes the region-of-interest concept, that is, the possibility to mark objects or zones within the video as more important than the surrounding area. The scalable video coder ensures that these regions-of-interest are received by an end-user device before the surrounding area and preferably in higher quality. In this paper, novel algorithms are presented making it possible to automatically track the marked objects in the regions of interest. Our methods detect the overall motion of a designated object by retrieving the motion vectors calculated during the motion estimation step of the video encoder. Using this knowledge, the region-of-interest is translated, thus following the objects within. Furthermore, the proposed algorithms allow adequate resizing of the region-of-interest. By using the available information from the video encoder, object tracking can be done in the compressed domain and is suitable for real-time and streaming applications. A time-complexity analysis is given for the algorithms proving the low complexity thereof and the usability for real-time applications. The proposed object tracking methods are generic and can be applied to any codec that calculates the motion vector field. In this paper, the algorithms are implemented within MPEG-4 fine-granularity scalability codec. Different tests on different video sequences are performed to evaluate the accuracy of the methods. Our novel algorithms achieve a precision up to 96.4%.
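The translation step described in this abstract (estimating the overall motion of the object from encoder motion vectors and shifting the region-of-interest accordingly) can be sketched as follows. This is a rough illustration under stated assumptions, not the authors' algorithm: the `Roi` type, the `track_roi` function, the 16-pixel block size, the use of a plain mean, and the convention that each vector gives the forward displacement of a block's content in pixels are all hypothetical. Resizing of the region is not covered.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Roi:
    x: int
    y: int
    width: int
    height: int


def track_roi(roi, motion_vectors, block_size=16):
    """Shift the ROI by the mean motion vector of the blocks inside it.

    motion_vectors maps (block_col, block_row) -> (dx, dy) in pixels.
    Assumption: vectors point in the direction the content moved.
    """
    selected = []
    for (col, row), (dx, dy) in motion_vectors.items():
        # A block belongs to the ROI when its centre lies inside it.
        cx = col * block_size + block_size / 2
        cy = row * block_size + block_size / 2
        if roi.x <= cx < roi.x + roi.width and roi.y <= cy < roi.y + roi.height:
            selected.append((dx, dy))
    if not selected:
        return roi  # no motion information: leave the ROI in place
    mean_dx = sum(dx for dx, _ in selected) / len(selected)
    mean_dy = sum(dy for _, dy in selected) / len(selected)
    return Roi(roi.x + round(mean_dx), roi.y + round(mean_dy),
               roi.width, roi.height)


vectors = {(1, 1): (4, 2), (2, 1): (4, 2), (1, 2): (4, 2),
           (2, 2): (4, 2), (0, 0): (-10, -10)}
print(track_roi(Roi(16, 16, 32, 32), vectors))
```

Because the vectors are already produced by the encoder's motion estimation, the per-frame cost of this step is linear in the number of blocks, which is consistent with the low complexity the abstract claims for compressed-domain tracking.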
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

Science.gov (United States)

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes.

Science.gov (United States)

Hsu, Jacob Shujui; Kwan, Johnny S H; Pan, Zhicheng; Garcia-Barcelo, Maria-Mercè; Sham, Pak Chung; Li, Miaoxin

2016-10-15

Exome sequencing studies have facilitated the detection of causal genetic variants in yet-unsolved Mendelian diseases. However, the identification of disease causal genes among a list of candidates in an exome sequencing study is still not fully settled, and it is often difficult to prioritize candidate genes for follow-up studies. The inheritance mode provides crucial information for understanding Mendelian diseases, but none of the existing gene prioritization tools fully utilize this information. We examined the characteristics of Mendelian disease genes under different inheritance modes. The results suggest that Mendelian disease genes with autosomal dominant (AD) inheritance mode are more haploinsufficiency and de novo mutation sensitive, whereas those autosomal recessive (AR) genes have significantly more non-synonymous variants and regulatory transcript isoforms. In addition, the X-linked (XL) Mendelian disease genes have fewer non-synonymous and synonymous variants. As a result, we derived a new scoring system for prioritizing candidate genes for Mendelian diseases according to the inheritance mode. Our scoring system assigned to each annotated protein-coding gene (N = 18 859) three pathogenic scores according to the inheritance mode (AD, AR and XL). This inheritance mode-specific framework achieved higher accuracy (area under curve = 0.84) in XL mode. The inheritance-mode specific pathogenicity prioritization (ISPP) outperformed other well-known methods including Haploinsufficiency, Recessive, Network centrality, Genic Intolerance, Gene Damage Index and Gene Constraint scores. This systematic study suggests that genes manifesting disease inheritance modes tend to have unique characteristics. ISPP is included in KGGSeq v1.0 (http://grass.cgs.hku.hk/limx/kggseq/), and source code is available from (https://github.com/jacobhsu35/ISPP.git). Contact: mxli@hku.hk. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author
Junction region of EWS-FLI1 fusion protein has a dominant negative effect in Ewing's sarcoma in vitro.

Science.gov (United States)

Jully, Babu; Vijayalakshmi, Ramshankar; Gopal, Gopisetty; Sabitha, Kesavan; Rajkumar, Thangarajan

2012-11-12

Ewing's sarcoma is a malignancy characterized by a specific 11:22 chromosomal translocation which generates a novel EWS-FLI1 fusion protein functioning as an aberrant transcription factor. In the present study, we have further characterized the junction region of the EWS-FLI1 fusion protein. An in-silico model of the EWS-FLI1 fusion protein was analysed for ligand binding sites, and a putative region (amino acid (aa) 251-343 of the type 1 fusion protein) in the vicinity of the fusion junction was cloned and expressed using bacterial expression. The recombinant protein was characterized by Circular Dichroism (CD). We then expressed aa 251-280 ectopically in a Ewing's sarcoma cell line, and its effects on cell proliferation, tumorigenicity and expression of EWS-FLI1 target genes were analysed. Our modelling analysis indicated that the junction region (aa 251-343) encompasses potential ligand binding sites in the EWS-FLI1 protein and, when expressed in bacteria, was present in soluble form. Ectopically expressing this region in Ewing's sarcoma cells inhibited tumorigenicity and the expression of EWS-FLI1 target genes, indicating a dominant negative biological effect. The junction region can be exploited further as a target for drug development to specifically target EWS-FLI1 in Ewing's sarcoma.
OPAL: prediction of MoRF regions in intrinsically disordered protein sequences.

Science.gov (United States)

Sharma, Ronesh; Raicar, Gaurav; Tsunoda, Tatsuhiko; Patil, Ashwini; Sharma, Alok

2018-06-01

Intrinsically disordered proteins lack stable 3-dimensional structure and play a crucial role in performing various biological functions. Key to their biological function are the molecular recognition features (MoRFs) located within long disordered regions. Computationally identifying these MoRFs from disordered protein sequences is a challenging task. In this study, we present a new MoRF predictor, OPAL, to identify MoRFs in disordered protein sequences. OPAL utilizes two independent sources of information computed using different component predictors. The scores are processed and combined using a common averaging method. The first score is computed using a component MoRF predictor which utilizes composition and sequence similarity of MoRF and non-MoRF regions to detect MoRFs. The second score is calculated using half-sphere exposure (HSE), solvent accessible surface area (ASA) and backbone angle information of the disordered protein sequence, using information from the amino acid properties of flanks surrounding the MoRFs to distinguish MoRF and non-MoRF residues. OPAL is evaluated using test sets that were previously used to evaluate MoRF predictors, MoRFpred, MoRFchibi and MoRFchibi-web. The results demonstrate that OPAL outperforms all the available MoRF predictors and is the most accurate predictor available for MoRF prediction. It is available at http://www.alok-ai-lab.com/tools/opal/. Contact: ashwini@hgc.jp or alok.sharma@griffith.edu.au. Supplementary data are available at Bioinformatics online.
RNA-Binding Proteins in Trichomonas vaginalis: Atypical Multifunctional Proteins

Directory of Open Access Journals (Sweden)

Elisa E. Figueroa-Angulo

2015-11-01

Full Text Available Iron homeostasis is highly regulated in vertebrates through a regulatory system mediated by RNA-protein interactions between the iron regulatory proteins (IRPs) that interact with an iron responsive element (IRE) located in certain mRNAs, dubbed the IRE-IRP regulatory system. Trichomonas vaginalis, the causal agent of trichomoniasis, presents high iron dependency to regulate its growth, metabolism, and virulence properties. Although T. vaginalis lacks IRPs or proteins with aconitase activity, it possesses gene expression mechanisms of iron regulation at the transcriptional and posttranscriptional levels. However, only one gene with iron regulation at the transcriptional level has been described. Recently, our research group described an iron posttranscriptional regulatory mechanism in the T. vaginalis tvcp4 and tvcp12 cysteine proteinase mRNAs. The tvcp4 and tvcp12 mRNAs have a stem-loop structure in the 5'-coding region or in the 3'-UTR, respectively, that interacts with T. vaginalis multifunctional proteins HSP70, α-Actinin, and Actin under iron starvation condition, causing translation inhibition or mRNA stabilization similar to the previously characterized IRE-IRP system in eukaryotes. Herein, we summarize recent progress and shed some light on atypical RNA-binding proteins that may participate in the iron posttranscriptional regulation in T. vaginalis.
RNA-Binding Proteins in Trichomonas vaginalis: Atypical Multifunctional Proteins.

Science.gov (United States)

Figueroa-Angulo, Elisa E; Calla-Choque, Jaeson S; Mancilla-Olea, Maria Inocente; Arroyo, Rossana

2015-11-26

Iron homeostasis is highly regulated in vertebrates through a regulatory system mediated by RNA-protein interactions between the iron regulatory proteins (IRPs) that interact with an iron responsive element (IRE) located in certain mRNAs, dubbed the IRE-IRP regulatory system. Trichomonas vaginalis, the causal agent of trichomoniasis, presents high iron dependency to regulate its growth, metabolism, and virulence properties. Although T. vaginalis lacks IRPs or proteins with aconitase activity, it possesses gene expression mechanisms of iron regulation at the transcriptional and posttranscriptional levels. However, only one gene with iron regulation at the transcriptional level has been described. Recently, our research group described an iron posttranscriptional regulatory mechanism in the T. vaginalis tvcp4 and tvcp12 cysteine proteinase mRNAs. The tvcp4 and tvcp12 mRNAs have a stem-loop structure in the 5'-coding region or in the 3'-UTR, respectively, that interacts with T. vaginalis multifunctional proteins HSP70, α-Actinin, and Actin under iron starvation condition, causing translation inhibition or mRNA stabilization similar to the previously characterized IRE-IRP system in eukaryotes. Herein, we summarize recent progress and shed some light on atypical RNA-binding proteins that may participate in the iron posttranscriptional regulation in T. vaginalis.
Region-specific differences in bioenergetic proteins and protein response to acute high fat diet in brains of low and high capacity runner rats.

Science.gov (United States)

Gan, Li; Ma, Delin; Li, Min; Yang, Fu-Chen; Rogers, Robert S; Wheatley, Joshua L; Koch, Lauren G; Britton, Steven L; Thyfault, John P; Geiger, Paige C; Stanford, John A

2018-05-01

Aerobic capacity is a strong predictor of mortality. Low capacity runner (LCR) rats exhibit reduced mitochondrial function in peripheral organs. A high fat diet (HFD) can worsen metabolic phenotype in LCR rats. Little is known about metabolic changes in the brains of these rats, however. This study examined protein markers of mitochondrial function and metabolism as a function of aerobic running capacity and an acute HFD in four brain regions: the striatum, hippocampus, hypothalamus, and substantia nigra. After 3 days on HFD or chow diets, we measured peroxisome proliferator-activated receptor-γ coactivator 1α (PGC1-α), nuclear respiratory factor 1 (Nrf-1), mitochondrial transcription factor A (TFAM), and phosphorylated (activated) AMP-activated protein kinase (p-AMPK) protein levels in the four brain regions. LCR rats exhibited lower levels of mitochondrial proteins (PGC1-α, Nrf-1, TFAM), and greater p-AMPK, in striatum, but not in the other brain regions. Mitochondrial protein levels were greater in HFD LCR striatum, while p-AMPK was lower in this group. Markers of lower mitochondrial biogenesis and increased metabolic demand were limited to the LCR striatum, which nevertheless maintained the capacity to respond to an acute HFD challenge. Copyright © 2018 Elsevier B.V. All rights reserved.
Cloning and sequence analysis of cDNA coding for rat nucleolar protein C23

International Nuclear Information System (INIS)

Ghaffari, S.H.; Olson, M.O.J.

1986-01-01

Using synthetic oligonucleotides as primers and probes, the authors have isolated and sequenced cDNA clones encoding protein C23, a putative nucleolus organizer protein. Poly(A+) RNA was isolated from rat Novikoff hepatoma cells and enriched in C23 mRNA by sucrose density gradient ultracentrifugation. Two deoxyoligonucleotides, a 48- and a 27-mer, were synthesized on the basis of amino acid sequence from the C-terminal half of protein C23 and cDNA sequence data from CHO cell protein. The 48-mer was used as a primer for synthesis of cDNA, which was then inserted into plasmid pUC9. Transformed bacterial colonies were screened by hybridization with 32P-labeled 27-mer. Two clones among 5000 gave a strong positive signal. Plasmid DNAs from these clones were purified and characterized by blotting and nucleotide sequence analysis. The length of C23 mRNA was estimated to be 3200 bases in a northern blot analysis. The sequence of a 267 b.p. insert shows high homology with the CHO cDNA, with only 9 nucleotide differences and an identical amino acid sequence. These studies indicate that this region of the protein is highly conserved.
EXPANDA-75: one-dimensional diffusion code for multi-region plate lattice heterogeneous system

International Nuclear Information System (INIS)

Kikuchi, Yasuyuki; Katsuragi, Satoru; Suzuki, Tomoo; Ogitsu, Makoto.

1975-08-01

An advanced treatment has been developed for analyzing a multi-region plate lattice heterogeneous system using the coarse group constants set provided for a homogeneous system. The essential points of this treatment are modification of effective admixture cross sections and improvement of effective elastic removal cross sections. By this treatment the heterogeneity effects for flux distributions and effective cross sections in the unit cell can be reproduced accurately in comparison with the ultra fine group treatment which consumes huge amounts of computing time. Based on the present treatment and using the JAERI-Fast set, a one-dimensional diffusion code, EXPANDA-75, was developed for extensive use for analyses of fast critical experiments. The user's guide is also presented in this report. (auth.)
Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi.

Science.gov (United States)

Schoch, Conrad L; Seifert, Keith A; Huhndorf, Sabine; Robert, Vincent; Spouge, John L; Levesque, C André; Chen, Wen

2012-04-17

Six DNA regions were evaluated as potential DNA barcodes for Fungi, the second largest kingdom of eukaryotic life, by a multinational, multilaboratory consortium. The region of the mitochondrial cytochrome c oxidase subunit 1 used as the animal barcode was excluded as a potential marker, because it is difficult to amplify in fungi, often includes large introns, and can be insufficiently variable. Three subunits from the nuclear ribosomal RNA cistron were compared together with regions of three representative protein-coding genes (largest subunit of RNA polymerase II, second largest subunit of RNA polymerase II, and minichromosome maintenance protein). Although the protein-coding gene regions often had a higher percent of correct identification compared with ribosomal markers, low PCR amplification and sequencing success eliminated them as candidates for a universal fungal barcode. Among the regions of the ribosomal cistron, the internal transcribed spacer (ITS) region has the highest probability of successful identification for the broadest range of fungi, with the most clearly defined barcode gap between inter- and intraspecific variation. The nuclear ribosomal large subunit, a popular phylogenetic marker in certain groups, had superior species resolution in some taxonomic groups, such as the early diverging lineages and the ascomycete yeasts, but was otherwise slightly inferior to the ITS. The nuclear ribosomal small subunit has poor species-level resolution in fungi. ITS will be formally proposed for adoption as the primary fungal barcode marker to the Consortium for the Barcode of Life, with the possibility that supplementary barcodes may be developed for particular narrowly circumscribed taxonomic groups.
Non-Coding RNAs in Arabidopsis

DEFF Research Database (Denmark)

van Wonterghem, Miranda

This work evolves around elucidating the mechanisms of micro RNAs (miRNAs) in Arabidopsis thaliana. I identified a new class of nuclear non-coding RNAs derived from protein coding genes. The genes are miRNA targets with extensive gene body methylation. The RNA species are nuclear localized and de...
Sequence of a cloned cDNA encoding human ribosomal protein S11

Energy Technology Data Exchange (ETDEWEB)

Lott, J B; Mackie, G A

1988-02-11

The authors have isolated a cloned cDNA that encodes human ribosomal protein (rp) S11 by screening a human fibroblast cDNA library with a labelled 204 bp DNA fragment encompassing residues 212-416 of pRS11, a rat rp S11 cDNA clone. The human rp S11 cloned cDNA consists of 15 residues of the 5' leader, the entire coding sequence and all 51 residues of the 3' untranslated region. The predicted amino acid sequence of 158 residues is identical to rat rp S11. The nucleotide sequence in the coding region differs, however, from that in rat in the first position in two codons and in the third position in 44 codons.
Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

International Nuclear Information System (INIS)

Yu Jia-Feng; Sui Tian-Xiang; Wang Ji-Hua; Wang Hong-Mei; Wang Chun-Ling; Jing Li

2015-01-01

Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. (special topic)
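The two evaluation figures quoted in this abstract, accuracy and the Matthews correlation coefficient (MCC), are both functions of a binary confusion matrix. The sketch below shows the standard definitions only; it does not reproduce the TN curve or Z curve predictors, and the function names and the example counts are hypothetical rather than taken from the paper.

```python
from math import sqrt


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)).

    Returns 0.0 when the denominator is zero, a common convention.
    """
    denominator = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Hypothetical counts: coding vs. non-coding calls on a balanced test set.
print(accuracy(tp=50, tn=50, fp=0, fn=0))
print(matthews_corrcoef(tp=50, tn=50, fp=0, fn=0))
```

Unlike accuracy, MCC stays informative when the coding and non-coding classes are unbalanced, which is one reason the two are often reported together.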
Codon size reduction as the origin of the triplet genetic code.

Directory of Open Access Journals (Sweden)

Pavel V Baranov

Full Text Available The genetic code appears to be optimized in its robustness to missense errors and frameshift errors. In addition, the genetic code is near-optimal in terms of its ability to carry information in addition to the sequences of encoded proteins. As evolution has no foresight, optimality of the modern genetic code suggests that it evolved from less optimal code variants. The length of codons in the genetic code is also optimal, as three is the minimal nucleotide combination that can encode the twenty standard amino acids. The apparent impossibility of transitions between codon sizes in a discontinuous manner during evolution has resulted in an unbending view that the genetic code was always triplet. Yet, recent experimental evidence on quadruplet decoding, as well as the discovery of organisms with ambiguous and dual decoding, suggest that the possibility of the evolution of triplet decoding from living systems with non-triplet decoding merits reconsideration and further exploration. To explore this possibility we designed a mathematical model of the evolution of primitive digital coding systems which can decode nucleotide sequences into protein sequences. These coding systems can evolve their nucleotide sequences via genetic events of Darwinian evolution, such as point-mutations. The replication rates of such coding systems depend on the accuracy of the generated protein sequences. Computer simulations based on our model show that decoding systems with codons of length greater than three spontaneously evolve into predominantly triplet decoding systems. Our findings suggest a plausible scenario for the evolution of the triplet genetic code in a continuous manner. This scenario suggests an explanation of how protein synthesis could be accomplished by means of long RNA-RNA interactions prior to the emergence of the complex decoding machinery, such as the ribosome, that is required for stabilization and discrimination of otherwise weak triplet codon
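The claim that three is the minimal codon length able to encode the twenty standard amino acids follows from simple counting: with a four-letter nucleotide alphabet, codons of length n give 4^n combinations, so 4^2 = 16 is too few and 4^3 = 64 suffices. A minimal sketch of that arithmetic (the function name `min_codon_length` is hypothetical, and this is not the evolutionary model described in the abstract):

```python
def min_codon_length(alphabet_size=4, meanings=20):
    """Smallest n such that alphabet_size ** n >= meanings."""
    n = 1
    while alphabet_size ** n < meanings:
        n += 1
    return n


# Four nucleotides, twenty standard amino acids.
print(min_codon_length(4, 20))
```

The same count holds if a stop signal is added as a twenty-first meaning, since 16 < 21 <= 64.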
Burnup code for fuel assembly by Monte Carlo code. MKENO-BURN

International Nuclear Information System (INIS)

Naito, Yoshitaka; Suyama, Kenya; Masukawa, Fumihiro; Matsumoto, Kiyoshi; Kurosawa, Masayoshi; Kaneko, Toshiyuki.

1996-12-01

The evaluation of the neutron spectrum is very important for burnup calculation of heterogeneous geometries such as recent BWR fuel assemblies. MKENO-BURN is a multi-dimensional burnup code that is based on the three-dimensional Monte Carlo neutron transport code 'MULTI-KENO' and the routine for the burnup calculation of the one-dimensional burnup code 'UNITBURN'. MKENO-BURN analyzes the burnup problem of arbitrary regions after evaluating the neutron spectrum and making one-group cross sections in three-dimensional geometry with MULTI-KENO. It enables us to do three-dimensional burnup calculations. This report consists of a general description of MKENO-BURN and the input data. (author)
Burnup code for fuel assembly by Monte Carlo code. MKENO-BURN

Energy Technology Data Exchange (ETDEWEB)

Naito, Yoshitaka; Suyama, Kenya; Masukawa, Fumihiro; Matsumoto, Kiyoshi; Kurosawa, Masayoshi [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment; Kaneko, Toshiyuki

1996-12-01

The evaluation of the neutron spectrum is very important for burnup calculation of heterogeneous geometries such as recent BWR fuel assemblies. MKENO-BURN is a multi-dimensional burnup code that is based on the three-dimensional Monte Carlo neutron transport code `MULTI-KENO` and the routine for the burnup calculation of the one-dimensional burnup code `UNITBURN`. MKENO-BURN analyzes the burnup problem of arbitrary regions after evaluating the neutron spectrum and making one-group cross sections in three-dimensional geometry with MULTI-KENO. It enables us to do three-dimensional burnup calculations. This report consists of a general description of MKENO-BURN and the input data. (author)
Engineering Aromatic-Aromatic Interactions To Nucleate Folding in Intrinsically Disordered Regions of Proteins.

Science.gov (United States)

Balakrishnan, Swati; Sarma, Siddhartha P

2017-08-22

Aromatic interactions are an important force in protein folding as they combine the stability of a hydrophobic interaction with the selectivity of a hydrogen bond. Much of our understanding of aromatic interactions comes from "bioinformatics" based analyses of protein structures and from the contribution of these interactions to stabilizing secondary structure motifs in model peptides. In this study, the structural consequences of aromatic interactions on protein folding have been explored in engineered mutants of the molten globule protein apo-cytochrome b5. Structural changes from disorder to order due to aromatic interactions in two variants of the protein, viz., WF-cytb5 and FF-cytb5, result in significant long-range secondary and tertiary structure. The results show that 54 and 52% of the residues in WF-cytb5 and FF-cytb5, respectively, occupy ordered regions versus 26% in apo-cytochrome b5. The interactions between the aromatic groups are offset-stacked and edge-to-face for the Trp-Phe and Phe-Phe mutants, respectively. Urea denaturation studies indicate that both mutants have a Cm higher than that of apo-cytochrome b5 and are more stable to chaotropic agents than apo-cytochrome b5. The introduction of these aromatic residues also results in "trimer" interactions with existing aromatic groups, reaffirming the selectivity of the aromatic interactions. These studies provide insights into the aromatic interactions that drive disorder-to-order transitions in intrinsically disordered regions of proteins and will aid in de novo protein design beyond small peptide scaffolds.

Arbitrariness is not enough: towards a functional approach to the genetic code.

Science.gov (United States)

Lacková, Ľudmila; Matlach, Vladimír; Faltýnek, Dan

2017-12-01

Arbitrariness in the genetic code is one of the main reasons for a linguistic approach to molecular biology: the genetic code is usually understood as an arbitrary relation between amino acids and nucleobases. However, from a semiotic point of view, arbitrariness should not be the only condition for the definition of a code; consequently, it is not completely correct to talk about a "code" in this case. Yet we suppose that there exists a code in the process of protein synthesis, but on a higher level than the chains of nucleic bases. Semiotically, a code should always be associated with a function, and we propose to define the genetic code not only relationally (on the basis of the relation between nucleobases and amino acids) but also in terms of function (the function of a protein as the meaning of the code). Even if the functional definition of meaning in the genetic code has been discussed in the field of biosemiotics, its further implications have not been considered. In fact, if the function of a protein represents the meaning of the genetic code (the sign's object), then it is crucial to reconsider the notion of its expression (the sign) as well. In our contribution, we will show that the actual model of the genetic code is not the only possible one and we will propose a more appropriate model from a semiotic point of view.
Structure of the protein core of the glypican Dally-like and localization of a region important for hedgehog signaling

Energy Technology Data Exchange (ETDEWEB)

Kim, Min-Sung; Saunders, Adam M.; Hamaoka, Brent Y.; Beachy, Philip A.; Leahy, Daniel J. (Stanford-MED); (JHU)

2011-09-20

Glypicans are heparan sulfate proteoglycans that modulate the signaling of multiple growth factors active during animal development, and loss of glypican function is associated with widespread developmental abnormalities. Glypicans consist of a conserved, approximately 45-kDa N-terminal protein core region followed by a stalk region that is tethered to the cell membrane by a glycosyl-phosphatidylinositol anchor. The stalk regions are predicted to be random coil but contain a variable number of attachment sites for heparan sulfate chains. Both the N-terminal protein core and the heparan sulfate attachments are important for glypican function. We report here the 2.4-Å crystal structure of the N-terminal protein core region of the Drosophila glypican Dally-like (Dlp). This structure reveals an elongated, α-helical fold for glypican core regions that does not appear homologous to any known structure. The Dlp core protein is required for normal responsiveness to Hedgehog (Hh) signals, and we identify a localized region on the Dlp surface important for mediating its function in Hh signaling. Purified Dlp protein core does not, however, interact appreciably with either Hh or an Hh:Ihog complex.
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes

Directory of Open Access Journals (Sweden)

Wang Lei

2011-02-01

Abstract Background Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics. Results We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka) and synonymous (Ks) substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks). When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins), whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents. Conclusion Our study suggests that Ka calculation, which is less biased compared to Ks and Ka
Molecular analysis of human argininosuccinate lyase: Mutant characterization and alternative splicing of the coding region

International Nuclear Information System (INIS)

Walker, D.C.; McCloskey, D.A.; Simard, L.R.; McInnes, R.R.

1990-01-01

Argininosuccinic acid lyase (ASAL) deficiency is a clinically heterogeneous autosomal recessive urea cycle disorder. The authors previously established by complementation analysis that 29 ASAL-deficient patients have heterogeneous mutations in a single gene. To prove that the ASAL structural gene is the affected locus, they sequenced polymerase chain reaction-amplified ASAL cDNA of a representative mutant from the single complementation group. Fibroblast strain 944, from a late-onset patient who was the product of a consanguineous mating, had only a single base-pair change in the coding region, a C-283→T transition at a CpG dinucleotide in exon 3. This substitution converts Arg-95 to Cys (R95C), occurs in a stretch of 13 residues that is identical in yeast and human ASAL, and was present in both of the patient's alleles but not in 14 other mutant or 10 normal alleles. They observed that amplified cDNA from mutant 944 and normal cells (liver, keratinocytes, lymphoblasts, and fibroblasts) contained, in addition to the expected 5' 513-base-pair band, a prominent 318-base-pair ASAL band formed by the splicing of exon 2 from the transcript. The short transcript maintains the ASAL reading frame but removes Lys-51, a residue that may be essential for catalysis, since it binds the argininosuccinate substrate. They conclude (i) that the identification of the R95C mutation in strain 944 demonstrates that virtually all ASAL deficiency results from defects in the ASAL structural gene and (ii) that minor alternative splicing of the coding region occurs at the ASAL locus.
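
The position arithmetic in the abstract above can be checked with a minimal sketch. It assumes that coding-sequence numbering starts at 1 on the first base of the start codon and that the standard genetic code applies; the function names `codon_of` and `c_to_t_outcomes` are illustrative and do not come from the paper.

```python
# Sketch: check the codon arithmetic reported for the ASAL R95C mutation.
# Assumption: nucleotide 1 is the first base of the start codon.

def codon_of(nt_pos):
    """Return (codon number, position within codon) for a 1-based CDS position."""
    codon_number = (nt_pos - 1) // 3 + 1
    position_in_codon = (nt_pos - 1) % 3 + 1
    return codon_number, position_in_codon

# Slice of the standard genetic code covering the codons used here.
CODON_TABLE = {
    "CGT": "Arg", "CGC": "Arg", "CGA": "Arg", "CGG": "Arg",
    "TGT": "Cys", "TGC": "Cys", "TGA": "Stop", "TGG": "Trp",
}

def c_to_t_outcomes():
    """Effect of a C->T change at the first base of each CGN arginine codon.

    A C at a CpG dinucleotide in codon position 1 implies a CGN codon.
    """
    return {codon: CODON_TABLE["T" + codon[1:]]
            for codon in ("CGT", "CGC", "CGA", "CGG")}

if __name__ == "__main__":
    print(codon_of(283))        # nucleotide 283 falls in codon 95, position 1
    print(c_to_t_outcomes())    # only CGT and CGC yield Cys
    skipped = 513 - 318         # size difference between the two cDNA bands
    print(skipped, skipped % 3 == 0)  # a multiple of 3 keeps the reading frame
```

Under these assumptions, nucleotide 283 is the first base of codon 95, only CGT or CGC can give Arg→Cys through a first-position C→T change, and the 195-base-pair difference between the two bands is a whole number of codons, consistent with the statement that the short transcript stays in frame.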
Data on the evolutionary history of the V(D)J recombination-activating protein 1 – RAG1 coupled with sequence and variant analyses

Directory of Open Access Journals (Sweden)

Abhishek Kumar

2016-09-01

RAG1 protein is one of the key components of the RAG complex regulating V(D)J recombination. There are only a few studies of RAG1 concerning its evolutionary history, detailed sequence and mutational hotspots. Herein, we present our datasets used for the recent comprehensive study of RAG1 based on sequence, phylogenetic and genetic variant analyses (Kumar et al., 2015 [1]). Protein sequence alignment helped in characterizing the conserved domains and regions of RAG1. It also aided in unraveling ancestral RAG1 in the sea urchin. Human genetic variant analyses revealed 751 mutational hotspots, located both in the coding and the non-coding regions. For further analysis and discussion, see Kumar et al., 2015 [1].
Long non-coding RNAs and mRNAs profiling during spleen development in pig.

Science.gov (United States)

Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

2018-01-01

Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.
Atmospheric Transport Modeling with 3D Lagrangian Dispersion Codes Compared with SF6 Tracer Experiments at Regional Scale

Directory of Open Access Journals (Sweden)

François Van Dorpe

2007-01-01

The results of four gas tracer experiments of atmospheric dispersion on a regional scale are used for the benchmarking of two atmospheric dispersion modeling codes, MINERVE-SPRAY (CEA) and NOSTRADAMUS (IBRAE). The main aim of this comparison is to estimate the capability of Lagrangian codes to predict the atmospheric transfer of radionuclides over a large field, for example in the risk assessment of a nuclear power plant. For the four experiments, the results of calculations show a rather good agreement between the two codes, and the order of magnitude of the concentrations measured on the soil is predicted. Simulation is best for sampling points located ten kilometers from the source, while we note a divergence for results at more distant points (difference in concentrations by a factor of 2 to 5). This divergence may be explained by the fact that, for these four experiments, only one weather station (near the point source) was used on a field of 10,000 km², generating the simulation of a uniform wind field throughout the calculation domain.
Origins of gene, genetic code, protein and life

Indian Academy of Sciences (India)

We have further presented the [GADV]-protein world hypothesis of the origin of life as well as a hypothesis of protein production, suggesting that proteins were originally produced by random peptide formation of amino acids restricted in specific amino acid compositions termed as GNC-, SNS- and GC-NSF(a)-0th order ...
Functional Anthology of Intrinsic Disorder. I. Biological Processes and Functions of Proteins with Long Disordered Regions

Science.gov (United States)

Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Uversky, Vladimir N.; Obradovic, Zoran

2008-01-01

Identifying relationships between function, amino acid sequence and protein structure represents a major challenge. In this study we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical
Coded Cooperation for Multiway Relaying in Wireless Sensor Networks.

Science.gov (United States)

Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar

2015-06-29

Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels.
Allegheny County Zip Code Boundaries

Data.gov (United States)

Allegheny County / City of Pittsburgh / Western PA Regional Data Center — This dataset demarcates the zip code boundaries that lie within Allegheny County. If viewing this description on the Western Pennsylvania Regional Data Center’s open...
The θ-γ neural code.

Science.gov (United States)

Lisman, John E; Jensen, Ole

2013-03-20

Theta and gamma frequency oscillations occur in the same brain regions and interact with each other, a process called cross-frequency coupling. Here, we review evidence for the following hypothesis: that the dual oscillations form a code for representing multiple items in an ordered way. This form of coding has been most clearly demonstrated in the hippocampus, where different spatial information is represented in different gamma subcycles of a theta cycle. Other experiments have tested the functional importance of oscillations and their coupling. These involve correlation of oscillatory properties with memory states, correlation with memory performance, and effects of disrupting oscillations on memory. Recent work suggests that this coding scheme coordinates communication between brain regions and is involved in sensory as well as memory processes. Copyright © 2013 Elsevier Inc. All rights reserved.
What Froze the Genetic Code?

Directory of Open Access Journals (Sweden)

Lluís Ribas de Pouplana

2017-04-01

The frozen accident theory of the Genetic Code was a proposal by Francis Crick that attempted to explain the universal nature of the Genetic Code and the fact that it only contains information for twenty amino acids. Fifty years later, it is clear that variations to the universal Genetic Code exist in nature and that translation is not limited to twenty amino acids. However, given the astonishing diversity of life on earth, and the extended evolutionary time that has taken place since the emergence of the extant Genetic Code, the idea that the translation apparatus is for the most part immobile remains true. Here, we will offer a potential explanation to the reason why the code has remained mostly stable for over three billion years, and discuss some of the mechanisms that allow species to overcome the intrinsic functional limitations of the protein synthesis machinery.
What Froze the Genetic Code?

Science.gov (United States)

Ribas de Pouplana, Lluís; Torres, Adrian Gabriel; Rafels-Ybern, Àlbert

2017-04-05

The frozen accident theory of the Genetic Code was a proposal by Francis Crick that attempted to explain the universal nature of the Genetic Code and the fact that it only contains information for twenty amino acids. Fifty years later, it is clear that variations to the universal Genetic Code exist in nature and that translation is not limited to twenty amino acids. However, given the astonishing diversity of life on earth, and the extended evolutionary time that has taken place since the emergence of the extant Genetic Code, the idea that the translation apparatus is for the most part immobile remains true. Here, we will offer a potential explanation to the reason why the code has remained mostly stable for over three billion years, and discuss some of the mechanisms that allow species to overcome the intrinsic functional limitations of the protein synthesis machinery.
On the role of the second coding exon of the HIV-1 Tat protein in virus replication and MHC class I downregulation

NARCIS (Netherlands)

Verhoef, K.; Bauer, M.; Meyerhans, A.; Berkhout, B.

1998-01-01

Tat is an essential protein of human immunodeficiency virus type 1 (HIV-1) and activates transcription from the viral long terminal repeat (LTR) promoter. The tat gene is composed of two coding exons of which the first, corresponding to the N-terminal 72 amino acid residues, has been reported to be
Localization of a Region in the Fusion Protein of Avian Metapneumovirus That Modulates Cell-Cell Fusion

Science.gov (United States)

Wei, Yongwei; Feng, Kurtis; Yao, Xiangjie; Cai, Hui; Li, Junan; Mirza, Anne M.; Iorio, Ronald M.

2012-01-01

The genus Metapneumovirus within the subfamily Pneumovirinae of the family Paramyxoviridae includes two members, human metapneumovirus (hMPV) and avian metapneumovirus (aMPV), causing respiratory tract infections in humans and birds, respectively. Paramyxoviruses enter host cells by fusing the viral envelope with a host cell membrane. Membrane fusion of hMPV appears to be unique, in that fusion of some hMPV strains requires low pH. Here, we show that the fusion (F) proteins of aMPV promote fusion in the absence of the attachment protein and low pH is not required. Furthermore, there are notable differences in cell-cell fusion among aMPV subtypes. Trypsin was required for cell-cell fusion induced by subtype B but not subtypes A and C. The F protein of aMPV subtype A was highly fusogenic, whereas those from subtypes B and C were not. By construction and evaluation of chimeric F proteins composed of domains from the F proteins of subtypes A and B, we localized a region composed of amino acid residues 170 to 338 in the F protein that is responsible for the hyperfusogenic phenotype of the F from subtype A. Further mutagenesis analysis revealed that residues R295, G297, and K323 in this region collectively contributed to the hyperfusogenicity. Taken together, we have identified a region in the aMPV F protein that modulates the extent of membrane fusion. A model for fusion consistent with these data is presented. PMID:22915815
Localization of a region in the fusion protein of avian metapneumovirus that modulates cell-cell fusion.

Science.gov (United States)

Wei, Yongwei; Feng, Kurtis; Yao, Xiangjie; Cai, Hui; Li, Junan; Mirza, Anne M; Iorio, Ronald M; Li, Jianrong

2012-11-01

The genus Metapneumovirus within the subfamily Pneumovirinae of the family Paramyxoviridae includes two members, human metapneumovirus (hMPV) and avian metapneumovirus (aMPV), causing respiratory tract infections in humans and birds, respectively. Paramyxoviruses enter host cells by fusing the viral envelope with a host cell membrane. Membrane fusion of hMPV appears to be unique, in that fusion of some hMPV strains requires low pH. Here, we show that the fusion (F) proteins of aMPV promote fusion in the absence of the attachment protein and low pH is not required. Furthermore, there are notable differences in cell-cell fusion among aMPV subtypes. Trypsin was required for cell-cell fusion induced by subtype B but not subtypes A and C. The F protein of aMPV subtype A was highly fusogenic, whereas those from subtypes B and C were not. By construction and evaluation of chimeric F proteins composed of domains from the F proteins of subtypes A and B, we localized a region composed of amino acid residues 170 to 338 in the F protein that is responsible for the hyperfusogenic phenotype of the F from subtype A. Further mutagenesis analysis revealed that residues R295, G297, and K323 in this region collectively contributed to the hyperfusogenicity. Taken together, we have identified a region in the aMPV F protein that modulates the extent of membrane fusion. A model for fusion consistent with these data is presented.
Junction region of EWS-FLI1 fusion protein has a dominant negative effect in Ewing’s Sarcoma in vitro

International Nuclear Information System (INIS)

Jully, Babu; Vijayalakshmi, Ramshankar; Gopal, Gopisetty; Sabitha, Kesavan; Rajkumar, Thangarajan

2012-01-01

Ewing’s sarcoma is a malignancy characterized by a specific 11:22 chromosomal translocation which generates a novel EWS-FLI1 fusion protein functioning as an aberrant transcription factor. In the present study, we have further characterized the junction region of the EWS-FLI1 fusion protein. An in-silico model of the EWS-FLI1 fusion protein was analysed for ligand binding sites, and a putative region (amino acid (aa) 251–343 of the type 1 fusion protein) in the vicinity of the fusion junction was cloned and expressed using bacterial expression. The recombinant protein was characterized by Circular Dichroism (CD). We then expressed aa 251–280 ectopically in a Ewing’s sarcoma cell line, and its effects on cell proliferation, tumorigenicity and expression of EWS-FLI1 target genes were analysed. Our modelling analysis indicated that the junction region (aa 251–343) encompasses potential ligand binding sites in the EWS-FLI1 protein and, when expressed in bacteria, was present in soluble form. Ectopically expressing this region in Ewing’s sarcoma cells inhibited tumorigenicity and the expression of EWS-FLI1 target genes, indicating a dominant negative biological effect. The junction region can be exploited further as a target for drug development to specifically target EWS-FLI1 in Ewing’s Sarcoma.
Computational Approaches Reveal New Insights into Regulation and Function of Non-coding RNAs and their Targets

KAUST Repository

Alam, Tanvir

2016-01-01

Regulation and function of protein-coding genes are increasingly well-understood, but no comparable evidence exists for non-coding RNA (ncRNA) genes, which appear to be more numerous than protein-coding genes. We developed a novel machine
CPAD, Curated Protein Aggregation Database: A Repository of Manually Curated Experimental Data on Protein and Peptide Aggregation.

Science.gov (United States)

Thangakani, A Mary; Nagarajan, R; Kumar, Sandeep; Sakthivel, R; Velmurugan, D; Gromiha, M Michael

2016-01-01

Accurate distinction between peptide sequences that can form amyloid-fibrils or amorphous β-aggregates, identification of potential aggregation prone regions in proteins, and prediction of change in aggregation rate of a protein upon mutation(s) are critical to research on protein misfolding diseases, such as Alzheimer's and Parkinson's, as well as biotechnological production of protein based therapeutics. We have developed a Curated Protein Aggregation Database (CPAD), which has collected results from experimental studies performed by scientific community aimed at understanding protein/peptide aggregation. CPAD contains more than 2300 experimentally observed aggregation rates upon mutations in known amyloidogenic proteins. Each entry includes numerical values for the following parameters: change in rate of aggregation as measured by fluorescence intensity or turbidity, name and source of the protein, Uniprot and Protein Data Bank codes, single point as well as multiple mutations, and literature citation. The data in CPAD has been supplemented with five different types of additional information: (i) Amyloid fibril forming hexa-peptides, (ii) Amorphous β-aggregating hexa-peptides, (iii) Amyloid fibril forming peptides of different lengths, (iv) Amyloid fibril forming hexa-peptides whose crystal structures are available in the Protein Data Bank (PDB) and (v) Experimentally validated aggregation prone regions found in amyloidogenic proteins. Furthermore, CPAD is linked to other related databases and resources, such as Uniprot, Protein Data Bank, PUBMED, GAP, TANGO, WALTZ etc. We have set up a web interface with different search and display options so that users have the ability to get the data in multiple ways. CPAD is freely available at http://www.iitm.ac.in/bioinfo/CPAD/. The potential applications of CPAD have also been discussed.

Identification and characterization of a gibberellin-regulated protein, which is ASR5, in the basal region of rice leaf sheaths.

Science.gov (United States)

Takasaki, Hironori; Mahmood, Tariq; Matsuoka, Makoto; Matsumoto, Hiroshi; Komatsu, Setsuko

2008-04-01

Gibberellins (GAs) regulate growth and development in higher plants. To identify GA-regulated proteins during rice leaf sheath elongation, a proteomic approach was used. Proteins from the basal region of leaf sheath in rice seedling treated with GA(3) were analyzed by fluorescence two-dimensional difference gel electrophoresis. The levels of abscisic acid-stress-ripening-inducible 5 protein (ASR5), elongation factor-1 beta, translationally controlled tumor protein, fructose-bisphosphate aldolase and a novel protein increased; whereas the level of RuBisCO subunit binding-protein decreased by GA(3) treatment. ASR5 out of these six proteins was significantly regulated by GA(3) at the protein level but not at the mRNA level in the basal region of leaf sheaths. Since this protein is regulated not only by abscisic acid but also by GA(3), these results indicate that ASR5 might be involved in plant growth in addition to stress in the basal regions of leaf sheaths.
Wetting of nonconserved residue-backbones: A feature indicative of aggregation associated regions of proteins.

Science.gov (United States)

Pradhan, Mohan R; Pal, Arumay; Hu, Zhongqiao; Kannan, Srinivasaraghavan; Chee Keong, Kwoh; Lane, David P; Verma, Chandra S

2016-02-01

Aggregation is an irreversible form of protein complexation and often toxic to cells. The process entails partial or major unfolding that is largely driven by hydration. We model the role of hydration in aggregation using "Dehydrons." "Dehydrons" are unsatisfied backbone hydrogen bonds in proteins that seek shielding from water molecules by associating with ligands or proteins. We find that the residues at aggregation interfaces have hydrated backbones, and in contrast to other forms of protein-protein interactions, are under less evolutionary pressure to be conserved. Combining evolutionary conservation of residues and extent of backbone hydration allows us to distinguish regions on proteins associated with aggregation (non-conserved dehydron-residues) from other interaction interfaces (conserved dehydron-residues). This novel feature can complement the existing strategies used to investigate protein aggregation/complexation. © 2015 Wiley Periodicals, Inc.
An unusual internal ribosomal entry site of inverted symmetry directs expression of a potato leafroll polerovirus replication-associated protein

Science.gov (United States)

Jaag, Hannah Miriam; Kawchuk, Lawrence; Rohde, Wolfgang; Fischer, Rainer; Emans, Neil; Prüfer, Dirk

2003-01-01

Potato leafroll polerovirus (PLRV) genomic RNA acts as a polycistronic mRNA for the production of proteins P0, P1, and P2 translated from the 5′-proximal half of the genome. Within the P1 coding region we identified a 5-kDa replication-associated protein 1 (Rap1) essential for viral multiplication. An internal ribosome entry site (IRES) with unusual structure and location was identified that regulates Rap1 translation. Core structural elements for internal ribosome entry include a conserved AUG codon and a downstream GGAGAGAGAGG motif with inverted symmetry. Reporter gene expression in potato protoplasts confirmed the internal ribosome entry function. Unlike known IRES motifs, the PLRV IRES is located completely within the coding region of Rap1 at the center of the PLRV genome. PMID:12835413
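
As a small illustration of the "inverted symmetry" described in the abstract above, the sketch below checks whether the quoted GGAGAGAGAGG motif reads the same in both directions. Interpreting inverted symmetry as mirror symmetry of the sequence string is an assumption made here, not a statement from the paper, and the helper names are hypothetical.

```python
# Sketch: test two kinds of symmetry on the motif quoted in the abstract.
# Assumption: "inverted symmetry" is read here as mirror symmetry of the string.

MOTIF = "GGAGAGAGAGG"

# RNA complement mapping (A<->U, G<->C).
COMPLEMENT = str.maketrans("ACGU", "UGCA")

def is_mirror_symmetric(seq):
    """True if the sequence reads the same 5'->3' and 3'->5'."""
    return seq == seq[::-1]

def is_reverse_complement_palindrome(seq):
    """True if the sequence equals its own reverse complement."""
    return seq == seq.translate(COMPLEMENT)[::-1]

if __name__ == "__main__":
    print(is_mirror_symmetric(MOTIF))               # mirror symmetry check
    print(is_reverse_complement_palindrome(MOTIF))  # classic palindrome check
```

The motif passes the mirror-symmetry check but is not a reverse-complement palindrome, so the two notions of symmetry should not be conflated when scanning sequences for similar elements.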
Two-terminal video coding.

Science.gov (United States)

Yang, Yang; Stanković, Vladimir; Xiong, Zixiang; Zhao, Wei

2009-03-01

Following recent works on the rate region of the quadratic Gaussian two-terminal source coding problem and limit-approaching code designs, this paper examines multiterminal source coding of two correlated, i.e., stereo, video sequences to save the sum rate over independent coding of both sequences. Two multiterminal video coding schemes are proposed. In the first scheme, the left sequence of the stereo pair is coded by H.264/AVC and used at the joint decoder to facilitate Wyner-Ziv coding of the right video sequence. The first I-frame of the right sequence is successively coded by H.264/AVC Intracoding and Wyner-Ziv coding. An efficient stereo matching algorithm based on loopy belief propagation is then adopted at the decoder to produce pixel-level disparity maps between the corresponding frames of the two decoded video sequences on the fly. Based on the disparity maps, side information for both motion vectors and motion-compensated residual frames of the right sequence are generated at the decoder before Wyner-Ziv encoding. In the second scheme, source splitting is employed on top of classic and Wyner-Ziv coding for compression of both I-frames to allow flexible rate allocation between the two sequences. Experiments with both schemes on stereo video sequences using H.264/AVC, LDPC codes for Slepian-Wolf coding of the motion vectors, and scalar quantization in conjunction with LDPC codes for Wyner-Ziv coding of the residual coefficients give a slightly lower sum rate than separate H.264/AVC coding of both sequences at the same video quality.
The C-terminal region of A-kinase anchor protein 350 (AKAP350A) enables formation of microtubule-nucleation centers and interacts with pericentriolar proteins.

Science.gov (United States)

Kolobova, Elena; Roland, Joseph T; Lapierre, Lynne A; Williams, Janice A; Mason, Twila A; Goldenring, James R

2017-12-15

Microtubules in animal cells assemble (nucleate) from both the centrosome and the cis-Golgi cisternae. A-kinase anchor protein 350 kDa (AKAP350A, also called AKAP450/CG-NAP/AKAP9) is a large scaffolding protein located at both the centrosome and Golgi apparatus. Previous findings have suggested that AKAP350 is important for microtubule dynamics at both locations, but how this scaffolding protein assembles microtubule nucleation machinery is unclear. Here, we found that overexpression of the C-terminal third of AKAP350A, enhanced GFP-AKAP350A(2691-3907), induces the formation of multiple microtubule-nucleation centers (MTNCs). Nevertheless, these induced MTNCs lacked "true" centriole proteins, such as Cep135. Mapping analysis with AKAP350A truncations demonstrated that AKAP350A contains discrete regions responsible for promoting or inhibiting the formation of multiple MTNCs. Moreover, GFP-AKAP350A(2691-3907) recruited several pericentriolar proteins to MTNCs, including γ-tubulin, pericentrin, Cep68, Cep170, and Cdk5RAP2. Proteomic analysis indicated that Cdk5RAP2 and Cep170 both interact with the microtubule nucleation-promoting region of AKAP350A, whereas Cep68 interacts with the distal C-terminal AKAP350A region. Yeast two-hybrid assays established a direct interaction of Cep170 with AKAP350A. Super-resolution and deconvolution microscopy analyses were performed to define the association of AKAP350A with centrosomes, and these studies disclosed that AKAP350A spans the bridge between centrioles, co-localizing with rootletin and Cep68 in the linker region. siRNA-mediated depletion of AKAP350A caused displacement of both Cep68 and Cep170 from the centrosome. These results suggest that AKAP350A acts as a scaffold for factors involved in microtubule nucleation at the centrosome and coordinates the assembly of protein complexes associating with the intercentriolar bridge.
R-Matrix Codes for Charged-particle Induced Reactions in the Resolved Resonance Region

Energy Technology Data Exchange (ETDEWEB)

Leeb, Helmut [Technical Univ. of Wien, Vienna (Austria); Dimitriou, Paraskevi [Intl Atomic Energy Agency (IAEA), Vienna (Austria); Thompson, Ian J. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

2017-01-01

A Consultant’s Meeting was held at the IAEA Headquarters, from 5 to 7 December 2016, to discuss the status of R-matrix codes currently used in calculations of charged-particle induced reaction cross sections at low energies. The meeting was a follow-up to the R-matrix Codes meeting held in December 2015, and served the purpose of monitoring progress in: the development of a translation code to enable exchange of input/output parameters between the various codes in different formats, fitting procedures and treatment of uncertainties, the evaluation methodology, and finally dissemination. The details of the presentations and technical discussions, as well as additional actions that were proposed to achieve all the goals of the meeting are summarized in this report.
Chemistry models in the Victoria code

International Nuclear Information System (INIS)

Grimley, A.J. III

1988-01-01

The VICTORIA computer code consists of the fission product release and chemistry models for the MELPROG severe accident analysis code. The chemistry models in VICTORIA are used to treat multi-phase interactions in four separate physical regions: fuel grains, gap/open porosity/clad, coolant/aerosols, and structure surfaces. The physical and chemical environment of each region is very different from the others, and different models are required for each. The common thread in the modelling is the use of a chemical equilibrium assumption. The validity of this assumption, along with a description of the various physical constraints applicable to each region, will be discussed. The models that result from the assumptions and constraints will be presented along with samples of calculations in each region.
Coded Cooperation for Multiway Relaying in Wireless Sensor Networks

Directory of Open Access Journals (Sweden)

Zhongwei Si

2015-06-01

Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels.
PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

Science.gov (United States)

Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

2015-12-01

A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.
Sequential assignment of proline-rich regions in proteins: Application to modular binding domain complexes

International Nuclear Information System (INIS)

Kanelis, Voula; Donaldson, Logan; Muhandiram, D.R.; Rotin, Daniela; Forman-Kay, Julie D.; Kay, Lewis E.

2000-01-01

Many protein-protein interactions involve amino acid sequences containing proline-rich motifs and even poly-proline stretches. The lack of amide protons in such regions complicates assignment, since ¹HN-based triple-resonance assignment strategies cannot be employed. Two such systems that we are currently studying include an SH2 domain from the protein Crk with a region containing 9 prolines in a 14 amino acid sequence, as well as a WW domain that interacts with a proline-rich target. A modified version of the HACAN pulse scheme, originally described by Bax and co-workers [Wang et al. (1995) J. Biomol. NMR, 5, 376-382], and an experiment which correlates the intra-residue ¹Hα, ¹³Cα/¹³Cβ chemical shifts with the ¹⁵N shift of the subsequent residue are presented and applied to the two systems listed above, allowing sequential assignment of the molecules.
Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

Science.gov (United States)

Yu, Jia-Feng; Sui, Tian-Xiang; Wang, Hong-Mei; Wang, Chun-Ling; Jing, Li; Wang, Ji-Hua

2015-12-01

Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. Project supported by the National Natural Science Foundation of China (Grant Nos. 61302186 and 61271378) and the Funding from the State Key Laboratory of Bioelectronics of Southeast University.
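The abstract above reports accuracy and the Matthews correlation coefficient (MCC) as quality measures for a coding/non-coding classifier. As a minimal sketch of how these two measures are computed from confusion-matrix counts, the function names and the example counts below are illustrative assumptions and are not taken from the paper:

```python
import math


def accuracy(tp: int, tn: int, fp: int, fn: int) -> float:
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def matthews_corrcoef(tp: int, tn: int, fp: int, fn: int) -> float:
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)).

    Returns 0.0 when the denominator is zero (a common convention).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical counts, for illustration only.
print(accuracy(tp=50, tn=50, fp=0, fn=0))
print(matthews_corrcoef(tp=50, tn=50, fp=0, fn=0))
```

Unlike accuracy, MCC stays informative when the coding and non-coding classes differ greatly in size, which is one reason it is often reported alongside accuracy.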
Regulation of protein homeostasis in neurodegenerative diseases : the role of coding and non-coding genes

NARCIS (Netherlands)

Alvarenga Fernandes Sin, Olga; Nollen, Ellen A. A.

Protein homeostasis is fundamental for cell function and survival, because proteins are involved in all aspects of cellular function, ranging from cell metabolism and cell division to the cell's response to environmental challenges. Protein homeostasis is tightly regulated by the synthesis, folding,
Construction of a plasmid coding for green fluorescent protein tagged cathepsin L and data on expression in colorectal carcinoma cells

Directory of Open Access Journals (Sweden)

Tripti Tamhane

2015-12-01

The endo-lysosomal cysteine cathepsin L has recently been shown to have moonlighting activities in that its unexpected nuclear localization in colorectal carcinoma cells is involved in cell cycle progression (Tamhane et al., 2015 [1]). Here, we show data on the construction and sequence of a plasmid coding for human cathepsin L tagged with an enhanced green fluorescent protein (phCL-EGFP) in which the fluorescent protein is covalently attached to the C-terminus of the protease. The plasmid was used for transfection of HCT116 colorectal carcinoma cells, while data from non-transfected and pEGFP-N1-transfected cells is also shown. Immunoblotting data of lysates from non-transfected controls and HCT116 cells transfected with pEGFP-N1 and phCL-EGFP showed stable expression of cathepsin L-enhanced green fluorescent protein chimeras, while endogenous cathepsin L protein amounts exceed those of hCL-EGFP chimeras. An effect of phCL-EGFP expression on proliferation and metabolic states of HCT116 cells at 24 h post-transfection was observed.
Interaction study of rice stripe virus proteins reveals a region of the nucleocapsid protein (NP) required for NP self-interaction and nuclear localization.

Science.gov (United States)

Lian, Sen; Cho, Won Kyong; Jo, Yeonhwa; Kim, Sang-Min; Kim, Kook-Hyung

2014-04-01

Rice stripe virus (RSV), which belongs to the genus Tenuivirus, is an emergent virus problem. The RSV genome is composed of four single-strand RNAs (RNA1-RNA4) and encodes seven proteins. We investigated interactions between six of the RSV proteins by yeast two-hybrid (Y2H) assay in vitro and by bimolecular fluorescence complementation (BiFC) in planta. Y2H identified self-interaction of the nucleocapsid protein (NP) and NS3, while BiFC revealed self-interaction of NP, NS3, and NCP. To identify region(s) and/or crucial amino acid (aa) residues required for NP self-interaction, we generated various truncated and aa substitution mutants. Y2H assay showed that the N-terminal region of NP (aa 1-56) is necessary for NP self-interaction. Further analysis with substitution mutants demonstrated that additional aa residues located at 42-47 affected their interaction with full-length NP. These results indicate that the N-terminal region (aa 1-36 and 42-47) is required for NP self-interaction. BiFC and co-localization studies showed that the region required for NP self-interaction is also required for NP localization at the nucleus. Overall, our results indicate that the N-terminal region (aa 1-47) of the NP is important for NP self-interaction and that six aa residues (42-47) are essential for both NP self-interaction and nuclear localization. Copyright © 2014 Elsevier B.V. All rights reserved.
Biophysical properties of regions flanking the bHLH-Zip motif in the p22 Max protein

International Nuclear Information System (INIS)

Pursglove, Sharon E.; Fladvad, Malin; Bellanda, Massimo; Moshref, Ahmad; Henriksson, Marie; Carey, Jannette; Sunnerhagen, Maria

2004-01-01

The Max protein is the central dimerization partner in the Myc-Max-Mad network of transcriptional regulators, and a founding structural member of the family of basic-helix-loop-helix (bHLH)-leucine zipper (Zip) proteins. Biologically important regions flanking its bHLH-Zip motif have been disordered or absent in crystal structures. The present study shows that these regions are resistant to proteolysis in both the presence and absence of DNA, and that Max dimers containing both flanking regions have significantly higher helix content as measured by circular dichroism than that predicted from the crystal structures. Nuclear magnetic resonance measurements in the absence of DNA also support the inferred structural order. Deletion of both flanking regions is required to achieve maximal DNA affinity as measured by EMSA. Thus, the previously observed functionalities of these Max regions in DNA binding, phosphorylation, and apoptosis are suggested to be linked to structural properties
Different regions of the newcastle disease virus fusion protein modulate pathogenicity.

Directory of Open Access Journals (Sweden)

Sandra Heiden

Newcastle disease virus (NDV), also designated as Avian paramyxovirus type 1 (APMV-1), is the causative agent of a notifiable disease of poultry, but it exhibits different pathogenicity dependent on the virus strain. The molecular basis for this variability is not fully understood. The efficiency of activation of the fusion protein (F) is determined by presence or absence of a polybasic amino acid sequence at an internal proteolytic cleavage site which is a major determinant of NDV virulence. However, other determinants of pathogenicity must exist since APMV-1 of high (velogenic), intermediate (mesogenic) and low (lentogenic) virulence specify a polybasic F cleavage site. We aimed at elucidation of additional virulence determinants by constructing a recombinant virus that consists of a lentogenic NDV Clone 30 backbone and the F protein gene from a mesogenic pigeon paramyxovirus-1 (PPMV-1) isolate with an intracerebral pathogenicity index (ICPI) of 1.1 specifying the polybasic sequence R-R-K-K-R*F motif at the cleavage site. The resulting virus was characterized by an ICPI of 0.6, indicating a lentogenic pathotype. In contrast, alteration of the cleavage site G-R-Q-G-R*L of the lentogenic Clone 30 to R-R-K-K-R*F resulted in a recombinant virus with an ICPI of 1.36, which was higher than that of parental PPMV-1. Substitution of different regions of the F protein of Clone 30 by those of PPMV-1, while maintaining the polybasic amino acid sequence at the F cleavage site, resulted in recombinant viruses with ICPIs ranging from 0.59 to 1.36, suggesting that virulence is modulated by regions of the F protein other than the polybasic cleavage site.
Benchmark studies of BOUT++ code and TPSMBI code on neutral transport during SMBI

Energy Technology Data Exchange (ETDEWEB)

Wang, Y.H. [Institute of Plasma Physics, Chinese Academy of Sciences, Hefei 230031 (China); University of Science and Technology of China, Hefei 230026 (China); Center for Magnetic Fusion Theory, Chinese Academy of Sciences, Hefei 230031 (China); Wang, Z.H., E-mail: zhwang@swip.ac.cn [Southwestern Institute of Physics, Chengdu 610041 (China); Guo, W., E-mail: wfguo@ipp.ac.cn [Institute of Plasma Physics, Chinese Academy of Sciences, Hefei 230031 (China); Center for Magnetic Fusion Theory, Chinese Academy of Sciences, Hefei 230031 (China); Ren, Q.L. [Institute of Plasma Physics, Chinese Academy of Sciences, Hefei 230031 (China); Sun, A.P.; Xu, M.; Wang, A.K. [Southwestern Institute of Physics, Chengdu 610041 (China); Xiang, N. [Institute of Plasma Physics, Chinese Academy of Sciences, Hefei 230031 (China); Center for Magnetic Fusion Theory, Chinese Academy of Sciences, Hefei 230031 (China)

2017-06-09

SMBI (supersonic molecule beam injection), which has been widely used in many tokamaks, plays an important role in tokamak plasma fuelling, density control and ELM mitigation in magnetic confinement plasma physics. The trans-neut module of the BOUT++ code is the only large-scale parallel 3D fluid code used to simulate the SMBI fueling process, while the TPSMBI (transport of supersonic molecule beam injection) code is a recently developed 1D fluid code of SMBI. In order to find a method to increase SMBI fueling efficiency in H-mode plasma, especially for ITER, it is important to first verify the codes. The benchmark study between the trans-neut module of the BOUT++ code and the TPSMBI code on radial transport dynamics of neutrals during SMBI has been successfully achieved for the first time in both slab and cylindrical coordinates. The simulation results from the trans-neut module of the BOUT++ code and the TPSMBI code agree very well with each other. Different upwind schemes have been compared to deal with the sharp gradient front region during the inward propagation of SMBI for the code stability. The influence of the WENO3 (weighted essentially non-oscillatory) and the third order upwind schemes on the benchmark results has also been discussed. - Highlights: • A 1D model of SMBI has been developed. • Benchmarks of the BOUT++ and TPSMBI codes have been completed for the first time. • The influence of the WENO3 and the third order upwind schemes on the benchmark results has also been discussed.
High GC content causes orphan proteins to be intrinsically disordered.

Directory of Open Access Journals (Sweden)

Walter Basile

2017-03-01

De novo creation of protein coding genes involves the formation of short ORFs from noncoding regions; some of these ORFs might then become fixed in the population. These orphan proteins need to, at the bare minimum, not cause serious harm to the organism, meaning that they should for instance not aggregate. Therefore, although the creation of short ORFs could be truly random, the fixation should be subjected to some selective pressure. The selective forces acting on orphan proteins have been elusive, and contradictory results have been reported. In Drosophila young proteins are more disordered than ancient ones, while the opposite trend is present in yeast. To the best of our knowledge no valid explanation for this difference has been proposed. To solve this riddle we studied structural properties and age of proteins in 187 eukaryotic organisms. We find that, with the exception of length, there are only small differences in the properties between proteins of different ages. However, when we take the GC content into account we noted that it could explain the opposite trends observed for orphans in yeast (low GC) and Drosophila (high GC). GC content is correlated with codons coding for disorder promoting amino acids. This leads us to propose that intrinsic disorder is not a strong determining factor for fixation of orphan proteins. Instead these proteins largely resemble random proteins given a particular GC level. During evolution the properties of a protein change faster than the GC level, causing the relationship between disorder and GC to gradually weaken.
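The central quantity in the abstract above is the GC content of a coding sequence. A minimal sketch of how it can be computed follows; the function name and the choice to ignore ambiguous symbols such as N are assumptions made for illustration, not details from the paper:

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T) of seq."""
    seq = seq.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return (seq.count("G") + seq.count("C")) / counted


# Hypothetical short sequences, for illustration only.
print(gc_content("ATGC"))
print(gc_content("GGCCGG"))
```

Computed per gene or per genome, a value like this is the covariate against which the abstract compares predicted disorder.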
Error-Rate Bounds for Coded PPM on a Poisson Channel

Science.gov (United States)

Moision, Bruce; Hamkins, Jon

2009-01-01

Equations for computing tight bounds on error rates for coded pulse-position modulation (PPM) on a Poisson channel at high signal-to-noise ratio have been derived. These equations and elements of the underlying theory are expected to be especially useful in designing codes for PPM optical communication systems. The equations and the underlying theory apply, more specifically, to a case in which a) At the transmitter, a linear outer code is concatenated with an inner code that includes an accumulator and a bit-to-PPM-symbol mapping (see figure) [this concatenation is known in the art as "accumulate-PPM" (abbreviated "APPM")]; b) The transmitted signal propagates on a memoryless binary-input Poisson channel; and c) At the receiver, near-maximum-likelihood (ML) decoding is effected through an iterative process. Such a coding/modulation/decoding scheme is a variation on the concept of turbo codes, which have complex structures, such that an exact analytical expression for the performance of a particular code is intractable. However, techniques for accurately estimating the performances of turbo codes have been developed. The performance of a typical turbo code includes (1) a "waterfall" region consisting of a steep decrease of error rate with increasing signal-to-noise ratio (SNR) at low to moderate SNR, and (2) an "error floor" region with a less steep decrease of error rate with increasing SNR at moderate to high SNR. The techniques used heretofore for estimating performance in the waterfall region have differed from those used for estimating performance in the error-floor region. For coded PPM, prior to the present derivations, equations for accurate prediction of the performance of coded PPM at high SNR did not exist, so that it was necessary to resort to time-consuming simulations in order to make such predictions. The present derivation makes it unnecessary to perform such time-consuming simulations.
HOXA1 and TALE proteins display cross-regulatory interactions and form a combinatorial binding code on HOXA1 targets.

Science.gov (United States)

De Kumar, Bony; Parker, Hugo J; Paulson, Ariel; Parrish, Mark E; Pushel, Irina; Singh, Narendra Pratap; Zhang, Ying; Slaughter, Brian D; Unruh, Jay R; Florens, Laurence; Zeitlinger, Julia; Krumlauf, Robb

2017-09-01

Hoxa1 has diverse functional roles in differentiation and development. We identify and characterize properties of regions bound by HOXA1 on a genome-wide basis in differentiating mouse ES cells. HOXA1-bound regions are enriched for clusters of consensus binding motifs for HOX, PBX, and MEIS, and many display co-occupancy of PBX and MEIS. PBX and MEIS are members of the TALE family and genome-wide analysis of multiple TALE members (PBX, MEIS, TGIF, PREP1, and PREP2) shows that nearly all HOXA1 targets display occupancy of one or more TALE members. The combinatorial binding patterns of TALE proteins define distinct classes of HOXA1 targets, which may create functional diversity. Transgenic reporter assays in zebrafish confirm enhancer activities for many HOXA1-bound regions and the importance of HOX-PBX and TGIF motifs for their regulation. Proteomic analyses show that HOXA1 physically interacts on chromatin with PBX, MEIS, and PREP family members, but not with TGIF, suggesting that TGIF may have an independent input into HOXA1-bound regions. Therefore, TALE proteins appear to represent a wide repertoire of HOX cofactors, which may coregulate enhancers through distinct mechanisms. We also discover extensive auto- and cross-regulatory interactions among the Hoxa1 and TALE genes, indicating that the specificity of HOXA1 during development may be regulated through a complex cross-regulatory network of HOXA1 and TALE proteins. This study provides new insight into a regulatory network involving combinatorial interactions between HOXA1 and TALE proteins. © 2017 De Kumar et al.; Published by Cold Spring Harbor Laboratory Press.

Sequence organization and control of transcription in the bacteriophage T4 tRNA region.

Science.gov (United States)

Broida, J; Abelson, J

1985-10-05

Bacteriophage T4 contains genes for eight transfer RNAs and two stable RNAs of unknown function. These are found in two clusters at 70 × 10³ base-pairs on the T4 genetic map. To understand the control of transcription in this region we have completed the sequencing of 5000 base-pairs in this region. The sequence contains a part of gene 3, gene 1, gene 57, internal protein I, the tRNA genes and five open reading frames which most likely code for heretofore unidentified proteins. We have used subclones of the region to investigate the kinetics of transcription in vivo. The results show that transcription in this region consists of overlapping early, middle and late transcripts. Transcription is directed from two early promoters, one or two middle promoters and perhaps two late promoters. This region contains all of the features that are seen in T4 transcription and as such is a good place to study the phenomenon in more detail.
Truncation of the C-terminal region of Toscana Virus NSs protein is critical for interferon-β antagonism and protein stability.

Science.gov (United States)

Gori Savellini, Gianni; Gandolfo, Claudia; Cusi, Maria Grazia

2015-12-01

Toscana Virus (TOSV) is a Phlebovirus responsible for central nervous system (CNS) injury in humans. The TOSV non-structural protein (NSs), which interacting with RIG-I leads to its degradation, was analysed in the C terminus fragment in order to identify its functional domains. To this aim, two C-terminal truncated NSs proteins, Δ1C-NSs (aa 1-284) and Δ2C-NSs (aa 1-287) were tested. Only Δ1C-NSs did not present any inhibitory effect on RIG-I and it showed a greater stability than the whole NSs protein. Moreover, the deletion of the TLQ aa sequence interposed between the two ΔC constructs caused a greater accumulation of the protein with a weak inhibitory effect on RIG-I, indicating some involvement of these amino acids in the NSs activity. Nevertheless, all the truncated proteins were still able to interact with RIG-I, suggesting that the domains responsible for RIG-I signaling and RIG-I interaction are mapped on different regions of the protein. Copyright © 2015 Elsevier Inc. All rights reserved.
Representation mutations from standard genetic codes

Science.gov (United States)

Aisah, I.; Suyudi, M.; Carnia, E.; Suhendi; Supriatna, A. K.

2018-03-01

Graphs are widely used in everyday life, especially to model problems and describe them concretely and clearly. In addition, graphs are used to help solve various kinds of problems that are difficult to solve by calculation. In biology, a graph can be used to describe the process of protein synthesis in DNA. Protein has an important role for DNA (deoxyribonucleic acid) or RNA (ribonucleic acid). Proteins are composed of amino acids. In this study, amino acids are related to genetics, especially the genetic code. The genetic code is also known as the triplet or codon code, which is a three-letter arrangement of DNA nitrogen bases. The bases are adenine (A), thymine (T), guanine (G) and cytosine (C), while in RNA thymine (T) is replaced with uracil (U). The set of all nitrogen bases in RNA is denoted by N = {C, U, A, G}. The codon works at the time of protein synthesis inside the cell, and it also encodes the stop signal that marks the end of the protein synthesis process. This paper examines the process of protein synthesis through mathematical studies and presents it in three-dimensional space or as a graph. The study begins by analysing the set of all codons, denoted by NNN, in order to obtain geometric representations. At this stage the set of all nitrogen bases N is matched with $\mathbb{Z}_2 \times \mathbb{Z}_2$: $C=(\overline{0},\overline{0})$, $U=(\overline{0},\overline{1})$, $A=(\overline{1},\overline{0})$, $G=(\overline{1},\overline{1})$. Through this matching, algebraic structures such as a group, the Klein four-group, quotient groups, etc. are obtained. With the help of GeoGebra software, the set of all codons denoted by NNN can be presented in a three-dimensional space as a multicube NNN and can also be represented as a graph, so that the relationships between the codons can easily be seen.
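The base-to-$\mathbb{Z}_2 \times \mathbb{Z}_2$ matching stated in the abstract above can be sketched in a few lines. The mapping of C, U, A and G is taken from the abstract; the function names and the choice to add codons position by position are illustrative assumptions and may differ from the construction used in the paper:

```python
# Mapping from the abstract: C=(0,0), U=(0,1), A=(1,0), G=(1,1).
BASE_TO_BITS = {"C": (0, 0), "U": (0, 1), "A": (1, 0), "G": (1, 1)}
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def add_bases(x: str, y: str) -> str:
    """Add two bases componentwise modulo 2 (the Klein four-group operation)."""
    (a, b), (c, d) = BASE_TO_BITS[x], BASE_TO_BITS[y]
    return BITS_TO_BASE[((a + c) % 2, (b + d) % 2)]


def codon_to_vector(codon: str) -> tuple:
    """Represent a codon as a 6-bit vector, two bits per base."""
    return tuple(bit for base in codon for bit in BASE_TO_BITS[base])


def add_codons(c1: str, c2: str) -> str:
    """Add two codons position by position (assumed product-group structure)."""
    return "".join(add_bases(x, y) for x, y in zip(c1, c2))


print(add_bases("U", "A"))     # (0,1) + (1,0) = (1,1), i.e. G
print(codon_to_vector("AUG"))  # (1, 0, 0, 1, 1, 1)
print(add_codons("AUG", "AUG"))  # every element is its own inverse: CCC
```

Under this mapping C is the identity element and every base is its own inverse, which is the defining property of the Klein four-group mentioned in the abstract.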
Diverse amino acid changes at specific positions in the N-terminal region of the coat protein allow Plum pox virus to adapt to new hosts.

Science.gov (United States)

Carbonell, Alberto; Maliogka, Varvara I; Pérez, José de Jesús; Salvador, Beatriz; León, David San; García, Juan Antonio; Simón-Mateo, Carmen

2013-10-01

Plum pox virus (PPV)-D and PPV-R are two isolates from strain D of PPV that differ in host specificity. Previous analyses of chimeras originating from PPV-R and PPV-D suggested that the N terminus of the coat protein (CP) includes host-specific pathogenicity determinants. Here, these determinants were mapped precisely by analyzing the infectivity in herbaceous and woody species of chimeras containing a fragment of the 3' region of PPV-D (including the region coding for the CP) in a PPV-R backbone. These chimeras were not infectious in Prunus persica, but systemically infected Nicotiana clevelandii and N. benthamiana when specific amino acids were modified or deleted in a short 30-amino-acid region of the N terminus of the CP. Most of these mutations did not reduce PPV fitness in Prunus spp. although others impaired systemic infection in this host. We propose a model in which the N terminus of the CP, highly relevant for virus systemic movement, is targeted by a host defense mechanism in Nicotiana spp. Mutations in this short region allow PPV to overcome the defense response in this host but can compromise the efficiency of PPV systemic movement in other hosts such as Prunus spp.
SNPs in the coding region of the metastasis-inducing gene MACC1 and clinical outcome in colorectal cancer

Directory of Open Access Journals (Sweden)

Schmid Felicitas

2012-07-01

Abstract. Background: Colorectal cancer is one of the main cancers in the Western world. About 90% of the deaths arise from formation of distant metastasis. The expression of the newly identified gene metastasis associated in colon cancer 1 (MACC1) is a prognostic indicator for colon cancer metastasis. Here, we analyzed for the first time the impact of single nucleotide polymorphisms (SNPs) in the coding region of MACC1 for clinical outcome of colorectal cancer patients. Additionally, we screened met proto-oncogene (Met), the transcriptional target gene of MACC1, for mutations. Methods: We sequenced the coding exons of MACC1 in 154 colorectal tumors (stages I, II and III) and the crucial exons of Met in 60 colorectal tumors (stages I, II and III). We analyzed the association of MACC1 polymorphisms with clinical data, including metachronous metastasis, UICC stages, tumor invasion, lymph node metastasis and patients’ survival (n = 154, stages I, II and III). Furthermore, we performed biological assays in order to evaluate the functional impact of MACC1 SNPs on the motility of colorectal cancer cells. Results: We genotyped three MACC1 SNPs in the coding region. Thirteen percent of the tumors had the genotype cg (rs4721888, L31V), 48% a ct genotype (rs975263, S515L) and 84% a gc or cc genotype (rs3735615, R804T). We found no association of these SNPs with clinicopathological parameters or with patients’ survival, when analyzing the entire patients’ cohort. An increased risk for a shorter metastasis-free survival of patients with a ct genotype (rs975263) was observed in younger colon cancer patients with stage I or II (P = 0.041, n = 18). In cell culture, MACC1 SNPs did not affect MACC1-induced cell motility and proliferation. Conclusion: In summary, the identification of coding MACC1 SNPs in primary colorectal tumors does not improve the prediction for metastasis formation or for patients’ survival compared to MACC1 expression analysis alone. The ct genotype (rs
Selfish DNA in protein-coding genes of Rickettsia.

Science.gov (United States)

Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

2000-10-13

Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.
Circular codes revisited: a statistical approach.

Science.gov (United States)

Gonzalez, D L; Giannerini, S; Rosa, R

2011-04-21

In 1996 Arquès and Michel [1996. A complementary circular code in the protein coding genes. J. Theor. Biol. 182, 45-58] discovered the existence of a common circular code in eukaryote and prokaryote genomes. Since then, circular code theory has provoked great interest and undergone rapid development. In this paper we discuss some theoretical issues related to the synchronization properties of coding sequences and circular codes, with particular emphasis on the problem of retrieval and maintenance of the reading frame. Motivated by the theoretical discussion, we adopt a rigorous statistical approach in order to try to answer different questions. First, we investigate the covering capability of the whole class of 216 self-complementary, C(3) maximal codes with respect to a large set of coding sequences. The results indicate that, on average, the code proposed by Arquès and Michel has the best covering capability but, still, there exists a great variability among sequences. Second, we focus on this code and explore the role played by the proportion of the bases by means of a hierarchy of permutation tests. The results show the existence of a sort of optimization mechanism such that coding sequences are tailored so as to maximize or minimize the coverage of circular codes on specific reading frames. Such optimization clearly relates the function of circular codes to reading frame synchronization. Copyright © 2011 Elsevier Ltd. All rights reserved.
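The notion of "coverage of a circular code on a specific reading frame" can be illustrated with a short sketch. The codon set `X0` below is the 20-trinucleotide set usually quoted for the Arquès and Michel code; it is reproduced here as an assumption and is not taken from this abstract. The function names and the simple coverage measure (fraction of codons in a frame that belong to the code) are hypothetical and are not the statistical procedure of the paper.

```python
# Illustrative sketch only: frame-wise coverage of a trinucleotide code.
# X0 is the codon set commonly quoted for the Arquès-Michel code (assumed here).
X0 = {
    "AAC", "AAT", "ACC", "ATC", "ATT", "CAG", "CTC", "CTG", "GAA", "GAC",
    "GAG", "GAT", "GCC", "GGC", "GGT", "GTA", "GTC", "GTT", "TAC", "TTC",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def is_self_complementary(code):
    """True if the reverse complement of every codon is also in the code."""
    return all(reverse_complement(codon) in code for codon in code)


def frame_coverage(seq, code, frame):
    """Fraction of complete codons in the given frame (0, 1 or 2) that belong to the code."""
    codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
    if not codons:
        return 0.0
    return sum(codon in code for codon in codons) / len(codons)


if __name__ == "__main__":
    toy = "GAAGTCATT"  # hypothetical toy sequence, not real data
    for frame in range(3):
        print(frame, frame_coverage(toy, X0, frame))
```

Comparing the three values returned by `frame_coverage` for a real coding sequence gives a rough picture of how strongly the code marks one frame over the other two.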
Genome analysis of Excretory/Secretory proteins in Taenia solium reveals their Abundance of Antigenic Regions (AAR).

Science.gov (United States)

Gomez, Sandra; Adalid-Peralta, Laura; Palafox-Fonseca, Hector; Cantu-Robles, Vito Adrian; Soberón, Xavier; Sciutto, Edda; Fragoso, Gladis; Bobes, Raúl J; Laclette, Juan P; Yauner, Luis del Pozo; Ochoa-Leyva, Adrián

2015-05-19

Excretory/Secretory (ES) proteins play an important role in host-parasite interactions. Experimental identification of ES proteins is time-consuming and expensive. Alternative bioinformatics approaches are cost-effective and can be used to prioritize the experimental analysis of therapeutic targets for parasitic diseases. Here we predicted and functionally annotated the ES proteins in the T. solium genome using an integration of bioinformatics tools. Additionally, we developed a novel measurement to evaluate the potential antigenicity of the T. solium secretome using the sequence length and number of antigenic regions of ES proteins. This measurement was formalized as the Abundance of Antigenic Regions (AAR) value. The AAR value for the secretome was similar to that obtained for a set of experimentally determined antigenic proteins and differed from the value calculated for the non-ES proteins of the T. solium genome. Furthermore, we calculated the AAR values for known helminth secretomes and they were similar to that obtained for T. solium. The results reveal the utility of the AAR value as a novel genomic measurement to evaluate the potential antigenicity of secretomes. This comprehensive analysis of the T. solium secretome provides functional information for future experimental studies, including the identification of novel ES proteins of therapeutic, diagnostic and immunological interest.
Role of accelerated segment switch in exons to alter targeting (ASSET) in the molecular evolution of snake venom proteins

Directory of Open Access Journals (Sweden)

Kini R Manjunatha

2009-06-01

Full Text Available Abstract Background Snake venom toxins evolve more rapidly than other proteins through accelerated changes in the protein coding regions. Previously we have shown that accelerated segment switch in exons to alter targeting (ASSET) might play an important role in the functional evolution of viperid three-finger toxins. In this phenomenon, short sequences in exons are radically changed to unrelated sequences and hence affect the folding and functional properties of the toxins. Results Here we analyzed other snake venom protein families to elucidate the role of ASSET in their functional evolution. ASSET appears to be involved in the functional evolution of three-finger toxins to a greater extent than in several other venom protein families. ASSET leads to replacement of some of the critical amino acid residues that affect the biological function in three-finger toxins as well as to changes in the conformation of the loop that is involved in binding to specific target sites. Conclusion ASSET could lead to novel functions in snake venom proteins. Among snake venom serine proteases, ASSET contributes to changes in three surface segments. One of these segments near the substrate binding region is known to affect substrate specificity, and its exchange may have significant implications for differences in isoform catalytic activity on specific target protein substrates. ASSET therefore plays an important role in functional diversification of snake venom proteins, in addition to accelerated point mutations in the protein coding regions. Accelerated point mutations lead to fine-tuning of target specificity, whereas ASSET leads to large-scale replacement of multiple functionally important residues, resulting in change or gain of functions.
Assessment of genetic mutations in the XRCC2 coding region by high resolution melting curve analysis and the risk of differentiated thyroid carcinoma in Iran

Directory of Open Access Journals (Sweden)

Shima Fayaz

2012-01-01

Full Text Available Homologous recombination (HR) is the major pathway for repairing double strand breaks (DSBs) in eukaryotes and XRCC2 is an essential component of the HR repair machinery. To evaluate the potential role of mutations in gene repair by HR in individuals susceptible to differentiated thyroid carcinoma (DTC), we used high resolution melting (HRM) analysis, a recently introduced method for detecting mutations, to examine the entire XRCC2 coding region in an Iranian population. HRM analysis was used to screen for mutations in three XRCC2 coding regions in 50 patients and 50 controls. There was no variation in the HRM curves obtained from the analysis of exons 1 and 2 in the case and control groups. In exon 3, an Arg188His polymorphism (rs3218536) was detected as a new melting curve group (OR: 1.46; 95% CI: 0.432-4.969; p = 0.38) compared with the normal melting curve. We also found a new Ser150Arg polymorphism in exon 3 of the control group. These findings suggest that genetic variations in the XRCC2 coding region have no potential effects on susceptibility to DTC. However, further studies with larger populations are required to confirm this conclusion.
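For readers unfamiliar with how an odds ratio and its 95% confidence interval are obtained from a case-control table, the following is a minimal sketch using the standard log-odds (Woolf) approximation. The genotype counts are hypothetical, since the abstract does not report them, so the output is not expected to reproduce the values quoted above; the function name is also an assumption.

```python
import math


def odds_ratio_ci(case_exposed, case_unexposed, ctrl_exposed, ctrl_unexposed, z=1.96):
    """Odds ratio with an approximate 95% CI (Woolf log method) for a 2x2 table.

    All four cells must be non-zero for the approximation to be defined.
    """
    a, b, c, d = case_exposed, case_unexposed, ctrl_exposed, ctrl_unexposed
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical counts for 50 cases and 50 controls (not from the study).
    print(odds_ratio_ci(10, 40, 5, 45))
```

An interval that contains 1, as in the reported result, means the data do not show a statistically significant association at the 5% level.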
Biases in Drosophila melanogaster protein trap screens

Directory of Open Access Journals (Sweden)

Müller Ilka

2009-05-01

Full Text Available Abstract Background The ability to localise or follow endogenous proteins in real time in vivo is of tremendous utility for cell biology or systems biology studies. Protein trap screens utilise the random genomic insertion of a transposon-borne artificial reporter exon (e.g. encoding the green fluorescent protein, GFP) into an intron of an endogenous gene to generate a fluorescent fusion protein. Despite recent efforts aimed at achieving comprehensive coverage of the genes encoded in the Drosophila genome, the repertoire of genes that yield protein traps is still small. Results We analysed the collection of available protein trap lines in Drosophila melanogaster and identified potential biases that are likely to restrict genome coverage in protein trap screens. The protein trap screens investigated here primarily used P-element vectors and thus exhibit some of the same positional biases associated with this transposon that are evident from the comprehensive Drosophila Gene Disruption Project. We further found that protein trap target genes usually exhibit broad and persistent expression during embryonic development, which is likely to facilitate better detection. In addition, we investigated the likely influence of the GFP exon on host protein structure and found that protein trap insertions have a significant bias for exon-exon boundaries that encode disordered protein regions. 38.8% of GFP insertions land in disordered protein regions compared with only 23.4% in the case of non-trapping P-element insertions landing in coding sequence introns (p < 10^-4). Interestingly, even in cases where protein domains are predicted, protein trap insertions frequently occur in regions encoding surface exposed areas that are likely to be functionally neutral. Considering the various biases observed, we predict that less than one third of intron-containing genes are likely to be amenable to trapping by the existing methods. Conclusion Our analyses suggest that the
The generation of recombinant influenza A viruses expressing a PB2 fusion protein requires the conservation of a packaging signal overlapping the coding and noncoding regions at the 5' end of the PB2 segment

International Nuclear Information System (INIS)

Dos Santos Afonso, Emmanuel; Escriou, Nicolas; Leclercq, India; Werf, Sylvie van der; Naffakh, Nadia

2005-01-01

We generated recombinant A/WSN/33 influenza A viruses expressing a PB2 protein fused to a Flag epitope at the N- (Flag-PB2) or C-terminus (PB2-Flag), which replicated efficiently and proved to be stable upon serial passage in vitro on MDCK cells. Rescue of PB2-Flag viruses required that the 5' end of the PB2 segment was kept identical to the wild-type beyond the 34 noncoding terminal nucleotides. This feature was achieved by a duplication of the last 109 nucleotides encoding PB2 between the Flag sequence and the 5'NCR. In PB2 minigenome rescue experiments, both the 5' and 3' coding ends of the PB2 segment were found to promote the incorporation of minigenomes into virions. However, the presence of the Flag sequence at the junction between the 3'NCR and the coding sequence did not prevent the rescue of Flag-PB2 viruses. Our observations define requirements that may be useful for the purpose of engineering influenza RNAs.
Preferential 5-Methylcytosine Oxidation in the Linker Region of Reconstituted Positioned Nucleosomes by Tet1 Protein.

Science.gov (United States)

Kizaki, Seiichiro; Zou, Tingting; Li, Yue; Han, Yong-Woon; Suzuki, Yuki; Harada, Yoshie; Sugiyama, Hiroshi

2016-11-07

Tet (ten-eleven translocation) family proteins oxidize 5-methylcytosine (mC) to 5-hydroxymethylcytosine (hmC), 5-formylcytosine (fC), and 5-carboxycytosine (caC), and are suggested to be involved in the active DNA demethylation pathway. In this study, we reconstituted positioned mononucleosomes using CpG-methylated 382 bp DNA containing the Widom 601 sequence and recombinant histone octamer, and subjected the nucleosome to treatment with Tet1 protein. The sites of oxidized methylcytosine were identified by bisulfite sequencing. We found that, for the oxidation reaction, Tet1 protein prefers mCs located in the linker region of the nucleosome compared with those located in the core region. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Nuclear proteins interacting with the promoter region of the human granulocyte/macrophage colony-stimulating factor gene

International Nuclear Information System (INIS)

Shannon, M.F.; Gamble, J.R.; Vadas, M.A.

1988-01-01

The gene for human granulocyte/macrophage colony-stimulating factor (GM-CSF) is expressed in a tissue-specific as well as an activation-dependent manner. The interaction of nuclear proteins with the promoter region of the GM-CSF gene that is likely to be responsible for this pattern of GM-CSF expression was investigated. The authors show that nuclear proteins interact with DNA fragments from the GM-CSF promoter in a cell-specific manner. A region spanning two cytokine-specific sequences, cytokine 1 (CK-1, 5' GAGATTCCAC 3') and cytokine 2 (CK-2, 5' TCAGGTA 3'), bound two nuclear proteins from GM-CSF-expressing cells in gel retardation assays. NF-GMb was inducible with phorbol 12-myristate 13-acetate and accompanied induction of GM-CSF message. NF-GMb was absent in cell lines not producing GM-CSF, some of which had other distinct binding proteins. NF-GMa and NF-GMb eluted from a heparin-Sepharose column at 0.3 and 0.6 M KCl, respectively. They hypothesize that the sequences CK-1 and CK-2 bind specific proteins and regulate GM-CSF transcription.
Code system BCG for gamma-ray skyshine calculation

International Nuclear Information System (INIS)

Ryufuku, Hiroshi; Numakunai, Takao; Miyasaka, Shun-ichi; Minami, Kazuyoshi.

1979-03-01

A code system, BCG, has been developed for calculating gamma-ray skyshine doses conveniently and efficiently using the transport calculation codes ANISN and DOT and the point-kernel calculation codes G-33 and SPAN. To simplify the input forms to the system, the forms for these codes are unified, twelve geometric patterns are introduced to specify material regions, and standard data are available as a library. To treat complex arrangements of source and shield, it is further possible to use the codes successively, so that the results from one code may be used as input data to the same or another code. (author)
NeuCode Proteomics Reveals Bap1 Regulation of Metabolism

Directory of Open Access Journals (Sweden)

Joshua M. Baughman

2016-07-01

Full Text Available We introduce neutron-encoded (NeuCode) amino acid labeling of mice as a strategy for multiplexed proteomic analysis in vivo. Using NeuCode, we characterize an inducible knockout mouse model of Bap1, a tumor suppressor and deubiquitinase whose in vivo roles outside of cancer are not well established. NeuCode proteomics revealed altered metabolic pathways following Bap1 deletion, including profound elevation of cholesterol biosynthetic machinery coincident with reduced expression of gluconeogenic and lipid homeostasis proteins in liver. Bap1 loss increased pancreatitis biomarkers and reduced expression of mitochondrial proteins. These alterations accompany a metabolic remodeling with hypoglycemia, hypercholesterolemia, hepatic lipid loss, and acinar cell degeneration. Liver-specific Bap1 null mice present with fully penetrant perinatal lethality, severe hypoglycemia, and hepatic lipid deficiency. This work reveals Bap1 as a metabolic regulator in liver and pancreas, and it establishes NeuCode as a reliable proteomic method for deciphering in vivo biology.
Coded Cooperation for Multiway Relaying in Wireless Sensor Networks

Science.gov (United States)

Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar

2015-01-01

Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels. PMID:26131675
A progressive data compression scheme based upon adaptive transform coding: Mixture block coding of natural images

Science.gov (United States)

Rost, Martin C.; Sayood, Khalid

1991-01-01

A method for efficiently coding natural images using a vector-quantized, variable-block-size transform source coder is presented. The method, mixture block coding (MBC), incorporates variable-rate coding by using a mixture of discrete cosine transform (DCT) source coders. The choice of coders used for any given image region is made through a threshold-driven distortion criterion. In this paper, MBC is used in two different applications. The base method is concerned with single-pass low-rate image data compression. The second is a natural extension of the base method which allows for low-rate progressive transmission (PT). Since the base method adapts easily to progressive coding, it offers the aesthetic advantage of progressive coding without incurring extensive channel overhead. Image compression rates of approximately 0.5 bit/pel are demonstrated for both monochrome and color images.
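The threshold-driven choice of block size described above can be sketched as a recursive split: a block is coded as a whole if its distortion is below a threshold, and is otherwise divided into four sub-blocks. This is a minimal illustration only. It assumes a square image whose side is a power of two, and it replaces the DCT source coders and vector quantizer of MBC with a plain block mean, so it shows the selection logic and not the actual coder. All function names are hypothetical.

```python
def block_mean(img, row, col, size):
    """Mean pixel value of the size x size block whose top-left corner is (row, col)."""
    total = sum(img[i][j] for i in range(row, row + size) for j in range(col, col + size))
    return total / (size * size)


def block_mse(img, row, col, size, value):
    """Mean squared error when the whole block is approximated by a single value."""
    err = sum((img[i][j] - value) ** 2
              for i in range(row, row + size) for j in range(col, col + size))
    return err / (size * size)


def code_block(img, row, col, size, threshold, min_size, out):
    """Code the block as one unit if distortion is acceptable, otherwise split into four."""
    mean = block_mean(img, row, col, size)
    if size <= min_size or block_mse(img, row, col, size, mean) <= threshold:
        out.append((row, col, size, mean))  # stand-in for a coded block
        return
    half = size // 2
    for d_row in (0, half):
        for d_col in (0, half):
            code_block(img, row + d_row, col + d_col, half, threshold, min_size, out)


def encode(img, threshold, min_size=1):
    """Return a list of (row, col, size, mean) tuples covering the image."""
    out = []
    code_block(img, 0, 0, len(img), threshold, min_size, out)
    return out


if __name__ == "__main__":
    edge = [[0, 0, 100, 100] for _ in range(4)]  # toy image with a vertical edge
    print(encode(edge, threshold=1.0))
```

Smooth regions end up in a few large blocks and detailed regions in many small ones, which is what makes the resulting bit rate variable.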
Building a Better Campus: An Update on Building Codes.

Science.gov (United States)

Madden, Michael J.

2002-01-01

Discusses the implications for higher education institutions in terms of facility planning, design, construction, and renovation of the move from regionally-developed model-building codes to two international sets of codes. Also addresses the new performance-based design option within the codes. (EV)
Exploration of Protein Conformational Change with PELE and Meta-Dynamics.

Science.gov (United States)

Cossins, Benjamin P; Hosseini, Ali; Guallar, Victor

2012-03-13

Atomistic molecular simulation methods are now able to explore complex protein or protein-ligand dynamical space in a tractable way with methods such as meta-dynamics or adaptive biasing force. However, many of these methods either require a careful selection of reaction coordinates or the knowledge of an initial pathway of some kind. Thus, it is important that effective methods are developed to produce this pathway data in an efficient fashion. PELE, a proven protein-ligand sampling code, has been developed to provide rapid protein sampling in highly flexible cases, using a reduced network model eigen problem approach. The resulting method is able to rapidly sample configuration space with very general driving information. When applied to ubiquitin, PELE was able to reproduce RMSD and average force data found in molecular dynamics simulations. PELE was also applied to explore the opening/closing transition of T4 lysozyme. A meta-dynamics exploration using a low energy pathway validated that the configurations explored by PELE represent the most populated regions of phase space. PELE and meta-dynamics explorations also discovered a low free energy region where a large cross-domain helix of T4 lysozyme is broken in two. There is previous NMR evidence for the validity of this unfolded helix region.

Cloning and nucleotide sequencing of the coat protein gene and 3’UTR (untranslated region) of peanut stripe virus

Directory of Open Access Journals (Sweden)

Hasriadi Mat Akin

2011-10-01

Full Text Available Cloning and sequencing of coat protein gene and 3’UTR (untranslated region) of peanut stripe virus. The cDNA of the 3' terminal of peanut stripe virus (PStV) genomic RNA was cloned and sequenced. The cDNA was ligated with plasmid vector pGEM-T Easy and transformed into competent cells of Escherichia coli. The 3' terminal of PStV genomic RNA contained 1195 nucleotides (nts). The region included the nucleotide sequences of NIb (nuclear inclusion body) (129 nts), CP gene (coat protein gene) (861 nts), and 3'UTR (untranslated region) (205 nts). The nucleotide sequence of the CP gene contained one long uninterrupted open reading frame (ORF) without a start codon, which ended with a UAG stop codon. The 287 amino acid residues of PStV coat protein were predicted from the CP gene. The amino acid sequence was analyzed for the presence of a consensus polyprotein cleavage site for maturation of the potyvirus polyprotein. A putative cleavage site was found at position 43 (Q/S) following the Valine (V) residue at the -4 position. This isolate of PStV can be expected to be aphid transmissible because the coat protein contained a DAG triplet at position 53-55.
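The region lengths quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative; the dictionary and function names are hypothetical. It shows that the three regions add up to the reported 1195 nts and that 861 nts correspond to 287 codons. Note the assumption this implies: the numbers agree only if the 861 nts of the CP gene are counted without the UAG stop codon, which the abstract does not state explicitly.

```python
# Region lengths in nucleotides, as quoted in the abstract.
REGIONS = {"NIb": 129, "CP gene": 861, "3'UTR": 205}


def total_length(regions):
    """Sum of all region lengths in nucleotides."""
    return sum(regions.values())


def residues_encoded(orf_nt, includes_stop_codon=False):
    """Number of amino acid residues encoded by an ORF of the given length.

    If the length includes the stop codon, those 3 nts are not translated.
    """
    coding_nt = orf_nt - 3 if includes_stop_codon else orf_nt
    if coding_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return coding_nt // 3


if __name__ == "__main__":
    print(total_length(REGIONS))                # compare with the reported 1195 nts
    print(residues_encoded(REGIONS["CP gene"]))  # compare with the reported 287 residues
```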
Extracellular vesicle associated long non-coding RNAs functionally enhance cell viability

Directory of Open Access Journals (Sweden)

Chris Hewson

2016-10-01

Full Text Available Cells communicate with one another to create microenvironments and share resources. One avenue by which cells communicate is through the action of exosomes. Exosomes are extracellular vesicles that are released by one cell and taken up by neighbouring cells. But how exosomes instigate communication between cells has remained largely unknown. We present evidence here that particular long non-coding RNA molecules are preferentially packaged into exosomes. We also find that a specific class of these exosome associated non-coding RNAs functionally modulate cell viability by direct interactions with l-lactate dehydrogenase B (LDHB), high-mobility group protein 17 (HMG-17), and CSF2RB, proteins involved in metabolism, nucleosomal architecture and cell signalling respectively. Knowledge of this endogenous cell to cell pathway, those proteins interacting with exosome associated non-coding transcripts and their interacting domains, could lead to a better understanding of not only cell to cell interactions but also the development of exosome targeted approaches in patient specific cell-based therapies. Keywords: Non-coding RNA, Extracellular RNA, Exosomes, Retroelement, Pseudogene
Efficient random access high resolution region-of-interest (ROI) image retrieval using backward coding of wavelet trees (BCWT)

Science.gov (United States)

Corona, Enrique; Nutter, Brian; Mitra, Sunanda; Guo, Jiangling; Karp, Tanja

2008-03-01

Efficient retrieval of high quality Regions-Of-Interest (ROI) from high resolution medical images is essential for reliable interpretation and accurate diagnosis. Random access to high quality ROI from codestreams is becoming an essential feature in many still image compression applications, particularly in viewing diseased areas from large medical images. This feature is easier to implement in block based codecs because of the inherent spatial independency of the code blocks. This independency implies that the decoding order of the blocks is unimportant as long as the position of each is properly identified. In contrast, wavelet-tree based codecs naturally use some interdependency that exploits the decaying spectrum model of the wavelet coefficients. Thus one must keep track of the decoding order from level to level with such codecs. We have developed an innovative multi-rate image subband coding scheme using "Backward Coding of Wavelet Trees (BCWT)" which is fast, memory efficient, and resolution scalable. It offers far less complexity than many other existing codecs, including both wavelet-tree and block based algorithms. The ROI feature in BCWT is implemented through a transcoder stage that generates a new BCWT codestream containing only the information associated with the user-defined ROI. This paper presents an efficient technique that locates a particular ROI within the BCWT coded domain, and decodes it back to the spatial domain. This technique allows better access and proper identification of pathologies in high resolution images since only a small fraction of the codestream is required to be transmitted and analyzed.
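Locating an ROI in a wavelet-coded domain rests on the dyadic relationship between pixel coordinates and coefficient coordinates at each decomposition level. The sketch below illustrates only that index mapping. It is not the BCWT transcoder, it ignores the extra margin a real codec needs to cover the wavelet filter support, and the function names and half-open rectangle convention are assumptions made for the example.

```python
def roi_at_level(x0, y0, x1, y1, level):
    """Map a half-open pixel ROI [x0, x1) x [y0, y1) to coefficient indices at a level.

    Each decomposition level halves the resolution, so coordinates are divided
    by 2**level; the lower bounds are floored and the upper bounds are ceiled
    so that the returned rectangle fully covers the ROI.
    """
    scale = 1 << level
    return (x0 // scale, y0 // scale, -(-x1 // scale), -(-y1 // scale))


def roi_pyramid(x0, y0, x1, y1, levels):
    """Coefficient rectangles for levels 1..levels (finest to coarsest)."""
    return [roi_at_level(x0, y0, x1, y1, level) for level in range(1, levels + 1)]


if __name__ == "__main__":
    # Hypothetical 64 x 64 pixel ROI inside a large image, 3 decomposition levels.
    for level, rect in enumerate(roi_pyramid(100, 60, 164, 124, 3), start=1):
        print(level, rect)
```

Because the rectangles shrink by a factor of four in area per level, the coefficients relevant to a small ROI are a small fraction of the whole codestream, which is the property the abstract relies on.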
IM (Integrity Management) software must show flexibility to local codes

Energy Technology Data Exchange (ETDEWEB)

Brors, Markus [ROSEN Technology and Research Center GmbH (Germany)]; Diggory, Ian [Macaw Engineering Ltd., Northumberland (United Kingdom)]

2009-07-01

There are many internationally recognized codes and standards, such as API 1160 and ASME B31.8S, which help pipeline operators to manage and maintain the integrity of their pipeline networks. However, operators in many countries still use local codes that often reflect the history of pipeline developments in their region and are based on direct experience and research on their pipelines. As pipeline companies come under increasing regulatory and financial pressures to maintain the integrity of their networks, it is important that operators using regional codes are able to benchmark their integrity management schemes against these international standards. Any comprehensive Pipeline Integrity Management System (PIMS) software package should therefore not only incorporate industry standards for pipeline integrity assessment but also be capable of implementing regional codes for comparison purposes. This paper describes the challenges and benefits of incorporating one such set of regional pipeline standards into ROSEN Asset Integrity Management Software (ROAIMS). (author)
An improved version of the MICROX-2 code

Energy Technology Data Exchange (ETDEWEB)

Mathews, D. [Paul Scherrer Inst. (PSI), Villigen (Switzerland)]

1997-11-01

The MICROX-2 code prepares broad group neutron cross sections for use in diffusion- and/or transport-theory codes from an input library of fine group and pointwise cross sections. The neutron weighting spectrum is obtained by solving the B_1 neutron balance equations at about 10000 energies in a one-dimensional (planar, spherical or cylindrical), two-region unit cell. The regions are coupled by collision probabilities based upon spatially flat neutron emission. Energy dependent Dancoff factors and bucklings correct the one-dimensional calculations for multi-dimensional lattice effects. A critical buckling search option is also included. The inner region may include two different types of fuel particles (grains). This report describes the present PSI FORTRAN 90 version of the MICROX-2 code which operates on CRAY computers and IBM PCs. The equations which are solved in the various energy ranges are given along with descriptions of various changes that have been made in the present PSI version of the code. A completely re-written description of the user input is also included. (author) 7 figs., 4 tabs., 59 refs.
Structure-aware Local Sparse Coding for Visual Tracking

KAUST Repository

Qi, Yuankai

2018-01-24

Sparse coding has been applied to visual tracking and related vision problems with demonstrated success in recent years. Existing tracking methods based on local sparse coding sample patches from a target candidate and sparsely encode these using a dictionary consisting of patches sampled from target template images. The discriminative strength of existing methods based on local sparse coding is limited as spatial structure constraints among the template patches are not exploited. To address this problem, we propose a structure-aware local sparse coding algorithm which encodes a target candidate using templates with both global and local sparsity constraints. For robust tracking, we show local regions of a candidate region should be encoded only with the corresponding local regions of the target templates that are the most similar from the global view. Thus, a more precise and discriminative sparse representation is obtained to account for appearance changes. To alleviate the issues with tracking drifts, we design an effective template update scheme. Extensive experiments on challenging image sequences demonstrate the effectiveness of the proposed algorithm against numerous state-of-the-art methods.
Primate-specific spliced PMCHL RNAs are non-protein coding in human and macaque tissues

Directory of Open Access Journals (Sweden)

Delerue-Audegond Audrey

2008-12-01

Full Text Available Abstract Background Brain-expressed genes that were created in primate lineage represent obvious candidates to investigate molecular mechanisms that contributed to neural reorganization and emergence of new behavioural functions in Homo sapiens. PMCHL1 arose from retroposition of a pro-melanin-concentrating hormone (PMCH) antisense mRNA on the ancestral human chromosome 5p14 when platyrrhines and catarrhines diverged. Mutations before divergence of hylobatidae led to creation of new exons and finally PMCHL1 duplicated in an ancestor of hominids to generate PMCHL2 at the human chromosome 5q13. A complex pattern of spliced and unspliced PMCHL RNAs was found in human brain and testis. Results Several novel spliced PMCHL transcripts have been characterized in human testis and fetal brain, identifying an additional exon and novel splice sites. Sequencing of PMCHL genes in several non-human primates allowed phylogenetic analyses, which revealed that the initial retroposition event took place within an intron of the brain cadherin (CDH12) gene, soon after platyrrhine/catarrhine divergence, i.e. 30–35 Mya, and was concomitant with the insertion of an AluSg element. Sequence analysis of the spliced PMCHL transcripts identified only short ORFs of less than 300 bp, with low (VMCH-p8 and protein variants) or no evolutionary conservation. Western blot analyses of human and macaque tissues expressing PMCHL RNA failed to reveal any protein corresponding to VMCH-p8 and protein variants encoded by spliced transcripts. Conclusion Our present results improve our knowledge of the gene structure and the evolutionary history of the primate-specific chimeric PMCHL genes. These genes produce multiple spliced transcripts, bearing short, non-conserved and apparently non-translated ORFs that may function as mRNA-like non-coding RNAs.
The Impact of Regional Differences on Elementary School Teachers’ Attitudes Towards Their Students’ Use of Code Switching in a South Texas School District

Directory of Open Access Journals (Sweden)

Guadalupe Nancy Nava Gómez

2012-01-01

Full Text Available This study focused on investigating whether the teachers' geographical distribution influences their attitudes towards their students' use of code switching. The study was guided by the following research question: Are there differences between teachers' opinions of the north elementary schools and teachers' opinions of the south elementary schools, which are predominantly Hispanic, towards their students' use of code switching? If so, why? A twenty-item structured survey was utilized. The population consisted of 279 elementary school teachers at seven Northern and seven Southern schools in the same South Texas region. The data were analyzed with descriptive statistics. Findings showed that Southern teachers had more prejudices towards code switching than those from the North, who were more receptive to this socio-cultural and linguistic phenomenon due to the ethnic makeup of their classrooms.
PanCoreGen – profiling, detecting, annotating protein-coding genes in microbial genomes

Science.gov (United States)

Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V.

2015-01-01

A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen – a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars – Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. PMID:26456591
Optimization of translation profiles enhances protein expression and solubility.

Directory of Open Access Journals (Sweden)

Anne-Katrin Hess

Full Text Available mRNA is translated with a non-uniform speed that actively coordinates co-translational folding of protein domains. Using structure-based homology we identified the structural domains in epoxide hydrolases (EHs) and introduced slow-translating codons to delineate the translation of single domains. These changes in translation speed dramatically improved the solubility of two EHs of metagenomic origin in Escherichia coli. Conversely, the importance of transient attenuation for the folding, and consequently solubility, of EH was evidenced with a member of the EH family from Agrobacterium radiobacter, which partitions in the soluble fraction when expressed in E. coli. Synonymous substitutions of codons shaping the slow-transiting regions to fast-translating codons render this protein insoluble. Furthermore, we show that low protein yield can be enhanced by decreasing the free folding energy of the initial 5'-coding region, which can disrupt mRNA secondary structure and enhance ribosomal loading. This study provides direct experimental evidence that mRNA is not a mere messenger for translation of codons into amino acids but bears an additional layer of information for folding, solubility and expression level of the encoded protein. Furthermore, it provides a general frame on how to modulate and fine-tune gene expression of a target protein.
Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions

DEFF Research Database (Denmark)

Lavstsen, Thomas; Salanti, Ali; Jensen, Anja T R

2003-01-01

and organization of the 3D7 PfEMP1 repertoire was investigated on the basis of the complete genome sequence. METHODS: Using two tree-building methods we analysed the coding and non-coding sequences of 3D7 var and rif genes as well as var genes of other parasite strains. RESULTS: var genes can be sub...
Functional Anthology of Intrinsic Disorder. III. Ligands, Posttranslational Modifications and Diseases Associated with Intrinsically Disordered Proteins

Science.gov (United States)

Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.

2008-01-01

Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). The second paper of the series was devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic S., Xie H., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. II. Cellular components, domains, technical terms, developmental processes and coding sequence diversities correlated with long disordered regions. J. Proteome Res.). Protein structure and functionality can be modulated by various posttranslational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/ misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes ~80 Swiss-Prot functional keywords that are related to ligands, posttranslational modifications and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins. PMID:17391016
EST2Prot: Mapping EST sequences to proteins

Directory of Open Access Journals (Sweden)

Lin David M

2006-03-01

Full Text Available Background: EST libraries are used in various biological studies, from microarray experiments to proteomic and genetic screens. These libraries usually contain many uncharacterized ESTs that are typically ignored since they cannot be mapped to known genes. Consequently, new discoveries are possibly overlooked. Results: We describe a system (EST2Prot) that uses multiple elements to map EST sequences to their corresponding protein products. EST2Prot uses UniGene clusters, substring analysis, information about protein coding regions in existing DNA sequences and protein database searches to detect protein products related to a query EST sequence. Gene Ontology terms, Swiss-Prot keywords, and protein similarity data are used to map the ESTs to functional descriptors. Conclusion: EST2Prot extends and significantly enriches the popular UniGene mapping by utilizing multiple relations between known biological entities. It produces a mapping between ESTs and proteins in real-time through a simple web-interface. The system is part of the Biozon database and is accessible at http://biozon.org/tools/est/.
Long non-coding RNAs: Mechanism of action and functional utility

OpenAIRE

Bhat, Shakil Ahmad; Ahmad, Syed Mudasir; Mumtaz, Peerzada Tajamul; Malik, Abrar Ahad; Dar, Mashooq Ahmad; Urwat, Uneeb; Shah, Riaz Ahmad; Ganai, Nazir Ahmad

2016-01-01

Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptome has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most long non-coding RNAs (lncRNAs) are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological ...
Distinct timescales of population coding across cortex.

Science.gov (United States)

Runyan, Caroline A; Piasini, Eugenio; Panzeri, Stefano; Harvey, Christopher D

2017-08-03

The cortex represents information across widely varying timescales. For instance, sensory cortex encodes stimuli that fluctuate over few tens of milliseconds, whereas in association cortex behavioural choices can require the maintenance of information over seconds. However, it remains poorly understood whether diverse timescales result mostly from features intrinsic to individual neurons or from neuronal population activity. This question remains unanswered, because the timescales of coding in populations of neurons have not been studied extensively, and population codes have not been compared systematically across cortical regions. Here we show that population codes can be essential to achieve long coding timescales. Furthermore, we find that the properties of population codes differ between sensory and association cortices. We compared coding for sensory stimuli and behavioural choices in auditory cortex and posterior parietal cortex as mice performed a sound localization task. Auditory stimulus information was stronger in auditory cortex than in posterior parietal cortex, and both regions contained choice information. Although auditory cortex and posterior parietal cortex coded information by tiling in time neurons that were transiently informative for approximately 200 milliseconds, the areas had major differences in functional coupling between neurons, measured as activity correlations that could not be explained by task events. Coupling among posterior parietal cortex neurons was strong and extended over long time lags, whereas coupling among auditory cortex neurons was weak and short-lived. Stronger coupling in posterior parietal cortex led to a population code with long timescales and a representation of choice that remained consistent for approximately 1 second. In contrast, auditory cortex had a code with rapid fluctuations in stimulus and choice information over hundreds of milliseconds. Our results reveal that population codes differ across cortex
Modifications in the UNIMUG computer code (Addendum to technical note EAV/NT--003/80)

International Nuclear Information System (INIS)

Alcantara, H.G. de; Rosa, M.A.P.

1983-01-01

The modifications made to the UNIMUG computer code are described. UNIMUG-VERSION 3 has options for condensing macroscopic cross sections in regions and microscopic cross sections of the isotopes contained in each region. The code also calculates: spectral index, ratio between two specified reactions of chosen isotopes in indicated groups and regions, average buckling by region/group and by region, reactor buckling, and average flux of the reactor. The code has no restrictions on the number of regions, number of groups, number of intervals, or table size of macroscopic and microscopic cross sections. (M.C.K.)
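Condensing cross sections from many fine energy groups into a few broad groups is commonly done by flux weighting, which preserves the reaction rate in each broad group. The abstract does not state which weighting UNIMUG uses, so the following is only a sketch under that assumption; the function name and the numbers are hypothetical.

```python
def condense(sigma, flux, groups):
    """Collapse fine-group cross sections into broad groups by flux weighting.

    sigma, flux: per-fine-group cross section and flux (same length).
    groups: one list of fine-group indices per broad group.
    """
    if len(sigma) != len(flux):
        raise ValueError("sigma and flux must have the same length")
    collapsed = []
    for members in groups:
        total_flux = sum(flux[g] for g in members)
        if total_flux == 0:
            raise ValueError("broad group has zero flux")
        # Reaction rate summed over the fine groups in this broad group.
        reaction_rate = sum(sigma[g] * flux[g] for g in members)
        collapsed.append(reaction_rate / total_flux)
    return collapsed


# Hypothetical 4-group data collapsed to 2 groups.
two_group = condense(
    sigma=[1.0, 2.0, 4.0, 8.0],
    flux=[1.0, 3.0, 2.0, 2.0],
    groups=[[0, 1], [2, 3]],
)
```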
Comparing side chain packing in soluble proteins, protein-protein interfaces, and transmembrane proteins.

Science.gov (United States)

Gaines, J C; Acebes, S; Virrueta, A; Butler, M; Regan, L; O'Hern, C S

2018-05-01

We compare side chain prediction and packing of core and non-core regions of soluble proteins, protein-protein interfaces, and transmembrane proteins. We first identified or created comparable databases of high-resolution crystal structures of these 3 protein classes. We show that the solvent-inaccessible cores of the 3 classes of proteins are equally densely packed. As a result, the side chains of core residues at protein-protein interfaces and in the membrane-exposed regions of transmembrane proteins can be predicted by the hard-sphere plus stereochemical constraint model with the same high prediction accuracies (>90%) as core residues in soluble proteins. We also find that for all 3 classes of proteins, as one moves away from the solvent-inaccessible core, the packing fraction decreases as the solvent accessibility increases. However, the side chain predictability remains high (80% within 30°) up to a relative solvent accessibility, rSASA≲0.3, for all 3 protein classes. Our results show that ≈40% of the interface regions in protein complexes are "core", that is, densely packed with side chain conformations that can be accurately predicted using the hard-sphere model. We propose packing fraction as a metric that can be used to distinguish real protein-protein interactions from designed, non-binding, decoys. Our results also show that cores of membrane proteins are the same as cores of soluble proteins. Thus, the computational methods we are developing for the analysis of the effect of hydrophobic core mutations in soluble proteins will be equally applicable to analyses of mutations in membrane proteins. © 2018 Wiley Periodicals, Inc.
Regional distribution of serotonin transporter protein in postmortem human brain

International Nuclear Information System (INIS)

Kish, Stephen J.; Furukawa, Yoshiaki; Chang Lijan; Tong Junchao; Ginovart, Nathalie; Wilson, Alan; Houle, Sylvain; Meyer, Jeffrey H.

2005-01-01

Introduction: The primary approach in assessing the status of brain serotonin neurons in human conditions such as major depression and exposure to the illicit drug ecstasy has been the use of neuroimaging procedures involving radiotracers that bind to the serotonin transporter (SERT). However, there has been no consistency in the selection of a 'SERT-free' reference region for the estimation of free and nonspecific binding, as occipital cortex, cerebellum and white matter have all been employed. Objective and Methods: To identify areas of human brain that might have very low SERT levels, we measured, by a semiquantitative Western blotting procedure, SERT protein immunoreactivity throughout the postmortem brain of seven normal adult subjects. Results: Serotonin transporter could be quantitated in all examined brain areas. However, SERT concentrations in cerebellar cortex and white matter were only at trace values, being approximately 20% of average cerebral cortex and 5% of average striatum values. Conclusion: Although none of the examined brain areas are completely free of SERT, human cerebellar cortex has low SERT binding as compared to other examined brain regions, with the exception of white matter. Since the cerebellar cortical SERT binding is not zero, this region will not be a suitable reference region for SERT radioligands with very low free and nonspecific binding. For SERT radioligands with reasonably high free and nonspecific binding, the cerebellar cortex should be a useful reference region, provided other necessary radioligand assumptions are met.
Regional distribution of serotonin transporter protein in postmortem human brain

Energy Technology Data Exchange (ETDEWEB)

Kish, Stephen J. [Human Neurochemical Pathology Laboratory, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada)]. E-mail: Stephen_Kish@CAMH.net; Furukawa, Yoshiaki [Human Neurochemical Pathology Laboratory, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Chang Lijan [Human Neurochemical Pathology Laboratory, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Tong Junchao [Human Neurochemical Pathology Laboratory, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Ginovart, Nathalie [PET Centre, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Wilson, Alan [PET Centre, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Houle, Sylvain [PET Centre, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada); Meyer, Jeffrey H. [PET Centre, Centre for Addiction and Mental Health, Toronto, ON, M5T 1R8 (Canada)

2005-02-01

Introduction: The primary approach in assessing the status of brain serotonin neurons in human conditions such as major depression and exposure to the illicit drug ecstasy has been the use of neuroimaging procedures involving radiotracers that bind to the serotonin transporter (SERT). However, there has been no consistency in the selection of a 'SERT-free' reference region for the estimation of free and nonspecific binding, as occipital cortex, cerebellum and white matter have all been employed. Objective and Methods: To identify areas of human brain that might have very low SERT levels, we measured, by a semiquantitative Western blotting procedure, SERT protein immunoreactivity throughout the postmortem brain of seven normal adult subjects. Results: Serotonin transporter could be quantitated in all examined brain areas. However, SERT concentrations in cerebellar cortex and white matter were only at trace values, being approximately 20% of average cerebral cortex and 5% of average striatum values. Conclusion: Although none of the examined brain areas are completely free of SERT, human cerebellar cortex has low SERT binding as compared to other examined brain regions, with the exception of white matter. Since the cerebellar cortical SERT binding is not zero, this region will not be a suitable reference region for SERT radioligands with very low free and nonspecific binding. For SERT radioligands with reasonably high free and nonspecific binding, the cerebellar cortex should be a useful reference region, provided other necessary radioligand assumptions are met.
Survey of nuclear fuel-cycle codes

International Nuclear Information System (INIS)

Thomas, C.R.; de Saussure, G.; Marable, J.H.

1981-04-01

A two-month survey of nuclear fuel-cycle models was undertaken. This report presents the information forthcoming from the survey. Of the nearly thirty codes reviewed in the survey, fifteen of these codes have been identified as potentially useful in fulfilling the tasks of the Nuclear Energy Analysis Division (NEAD) as defined in their FY 1981-1982 Program Plan. Six of the fifteen codes are given individual reviews. The individual reviews address such items as the funding agency, the author and organization, the date of completion of the code, adequacy of documentation, computer requirements, history of use, variables that are input and forecast, type of reactors considered, part of fuel cycle modeled and scope of the code (international or domestic, long-term or short-term, regional or national). The report recommends that the Model Evaluation Team perform an evaluation of the EUREKA uranium mining and milling code

Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology

DEFF Research Database (Denmark)

Rossin, Elizabeth J.; Hansen, Kasper Lage; Raychaudhuri, Soumya

2011-01-01

Genome-wide association studies (GWAS) have defined over 150 genomic regions unequivocally containing variation predisposing to immune-mediated disease. Inferring disease biology from these observations, however, hinges on our ability to discover the molecular processes being perturbed by these r... in rheumatoid arthritis (RA) and Crohn's disease (CD) GWAS, we build protein-protein interaction (PPI) networks for genes within associated loci and find abundant physical interactions between protein products of associated genes. We apply multiple permutation approaches to show that these networks are more... that the RA and CD networks have predictive power by demonstrating that proteins in these networks, not encoded in the confirmed list of disease associated loci, are significantly enriched for association to the phenotypes in question in extended GWAS analysis. Finally, we test our method in 3 non...
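The idea of testing whether proteins from associated loci interact more than expected by chance can be sketched with a simple permutation test. The abstract does not describe the authors' permutation schemes, so this is an assumption-laden illustration: it compares the number of interactions among the observed genes with the counts for random gene sets of the same size, ignoring degree and locus structure. All names and the toy network are hypothetical.

```python
import random


def edges_within(genes, edges):
    """Count interactions whose two partners both belong to `genes`."""
    genes = set(genes)
    return sum(1 for a, b in edges if a in genes and b in genes)


def permutation_p_value(observed_genes, all_genes, edges, n_perm=1000, seed=0):
    """Empirical p-value for the connectivity of `observed_genes`."""
    rng = random.Random(seed)
    observed = edges_within(observed_genes, edges)
    size = len(set(observed_genes))
    pool = sorted(set(all_genes))
    at_least = sum(
        1
        for _ in range(n_perm)
        if edges_within(rng.sample(pool, size), edges) >= observed
    )
    # Add-one correction so the estimate is never exactly zero.
    return (at_least + 1) / (n_perm + 1)


# Hypothetical toy network: 20 genes, four of which form a clique.
all_genes = [f"g{i}" for i in range(20)]
clique = all_genes[:4]
edges = [(a, b) for i, a in enumerate(clique) for b in clique[i + 1:]]
p = permutation_p_value(clique, all_genes, edges)
```

A realistic analysis would also need to control for node degree and gene density per locus, which plain random sampling does not do.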
Comparative In silico Study of Sex-Determining Region Y (SRY) Protein Sequences Involved in Sex-Determining.

Science.gov (United States)

Vakili Azghandi, Masoume; Nasiri, Mohammadreza; Shamsa, Ali; Jalali, Mohsen; Shariati, Mohammad Mahdi

2016-04-01

The SRY gene (SRY) provides instructions for making a transcription factor called the sex-determining region Y protein. The sex-determining region Y protein causes a fetus to develop as a male. In this study, SRY sequences from 15 species (human, chimpanzee, dog, pig, rat, cattle, buffalo, goat, sheep, horse, zebra, frog, urial, dolphin and killer whale) were compared to determine their bioinformatic differences. Nucleotide sequences of SRY were retrieved from the NCBI databank. Bioinformatic analysis of SRY was performed with CLC Main Workbench version 5.5, ClustalW (http:/www.ebi.ac.uk/clustalw/) and MEGA6. The multiple sequence alignment results indicated that the SRY protein sequences from Orcinus orca (killer whale) and Tursiops aduncus (dolphin) have the smallest genetic distance (0.33) among these 15 species and are 99.67% identical at the amino acid level. Homo sapiens and Pan troglodytes (chimpanzee) have the next lowest genetic distance (1.35) and are 98.65% identical at the amino acid level. These findings indicate that the SRY proteins are conserved in the 15 species, and their evolutionary relationships are similar.
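The reported distances and identities sum to 100 (0.33 and 99.67, 1.35 and 98.65), which is consistent with a distance expressed as the percentage of differing aligned positions. Under that assumption, and without claiming this is how CLC Main Workbench or MEGA6 computed the values, the calculation can be sketched as follows. The toy sequences are hypothetical and are not SRY sequences.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity over aligned positions where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip alignment columns that contain a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / compared


def percent_distance(seq_a, seq_b, gap="-"):
    """Uncorrected distance as a percentage of differing positions."""
    return 100.0 - percent_identity(seq_a, seq_b, gap)


# Hypothetical aligned fragments differing at one of ten positions.
identity = percent_identity("MQSYASAMLS", "MQSYASAMLT")
distance = percent_distance("MQSYASAMLS", "MQSYASAMLT")
```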
Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins.

Science.gov (United States)

Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N

2007-05-01

devoted to the presentation of 87 Swiss-Prot keywords attributed to the cellular components, domains, technical terms, developmental processes, and coding sequence diversities possessing strong positive and negative correlation with long disordered regions (Vucetic, S.; Xie, H.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V. N. Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. J. Proteome Res. 2007, 5, 1899-1916). Protein structure and functionality can be modulated by various post-translational modifications or/and as a result of binding of specific ligands. Numerous human diseases are associated with protein misfolding/misassembly/misfunctioning. This work concludes the series of papers dedicated to the functional anthology of intrinsic disorder and describes approximately 80 Swiss-Prot functional keywords that are related to ligands, post-translational modifications, and diseases possessing strong positive or negative correlation with the predicted long disordered regions in proteins.
The crystal structure of the Split End protein SHARP adds a new layer of complexity to proteins containing RNA recognition motifs.

Science.gov (United States)

Arieti, Fabiana; Gabus, Caroline; Tambalo, Margherita; Huet, Tiphaine; Round, Adam; Thore, Stéphane

2014-06-01

The Split Ends (SPEN) protein was originally discovered in Drosophila in the late 1990s. Since then, homologous proteins have been identified in eukaryotic species ranging from plants to humans. Every family member contains three predicted RNA recognition motifs (RRMs) in the N-terminal region of the protein. We have determined the crystal structure of the region of the human SPEN homolog that contains these RRMs-the SMRT/HDAC1 Associated Repressor Protein (SHARP), at 2.0 Å resolution. SHARP is a co-regulator of the nuclear receptors. We demonstrate that two of the three RRMs, namely RRM3 and RRM4, interact via a highly conserved interface. Furthermore, we show that the RRM3-RRM4 block is the main platform mediating the stable association with the H12-H13 substructure found in the steroid receptor RNA activator (SRA), a long, non-coding RNA previously shown to play a crucial role in nuclear receptor transcriptional regulation. We determine that SHARP association with SRA relies on both single- and double-stranded RNA sequences. The crystal structure of the SHARP-RRM fragment, together with the associated RNA-binding studies, extend the repertoire of nucleic acid binding properties of RRM domains suggesting a new hypothesis for a better understanding of SPEN protein functions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Detection of protein-protein interactions by ribosome display and protein in situ immobilisation.

Science.gov (United States)

He, Mingyue; Liu, Hong; Turner, Martin; Taussig, Michael J

2009-12-31

We describe a method for identification of protein-protein interactions by combining two cell-free protein technologies, namely ribosome display and protein in situ immobilisation. The method requires only PCR fragments as the starting material, the target proteins being made through cell-free protein synthesis, either associated with their encoding mRNA as ribosome complexes or immobilised on a solid surface. The use of ribosome complexes allows identification of interacting protein partners from their attached coding mRNA. To demonstrate the procedures, we have employed the lymphocyte signalling proteins Vav1 and Grb2 and confirmed the interaction between Grb2 and the N-terminal SH3 domain of Vav1. The method has promise for library screening of pairwise protein interactions, down to the analytical level of individual domain or motif mapping.
Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

Science.gov (United States)

Hoffmann, Robert D; Palmgren, Michael

2016-06-13

Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.
Identification of T1D susceptibility genes within the MHC region by combining protein interaction networks and SNP genotyping data

DEFF Research Database (Denmark)

Brorsson, C.; Hansen, Niclas Tue; Hansen, Kasper Lage

2009-01-01

To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein-protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein...
Targeting the middle region of CP4-EPSPS protein for its traceability in highly processed soy-related products.

Science.gov (United States)

Wu, Honghong; Wang, Xiaofu; Zhou, Xinghu; Zhang, Yihua; Huang, Ming; He, Jian; Shen, Wenbiao

2017-09-01

Transgenic components in genetically modified organisms consist not only of the transgenic genes, but also the transgenic protein. However, compared with transgenic DNA, less attention has been paid to the detection of expressed protein, especially those degraded from genetically modified soybean after food processing. In this study, the full length 5-enolpyruvyl-shikimate-3-phosphate synthase (CP4-EPSPS, 47.6 kD) protein was probed with the SC-16 (S19-R33) and the DC-16 (D219-K233) polyclonal antibodies in immunoblots. Both antibodies were able to detect the full length CP4-EPSPS and its residues in soy powder made from Roundup-Ready soybeans after heating and microwaving treatments which also reduced the molecular weight of the protein to 45.8 and 38.7 kD, respectively. Taken together the immunoblot results suggest that the middle region of the CP4-EPSPS protein possessed better stability than its N-terminal during thermal processing. This deduction was further validated by autoclave treatment, where a 37.4 kD residue of the protein was recognized by DC-16. A similar result was obtained in processed smoked sausage containing Roundup Ready soybean protein isolate (as an extender). The additional use of a further polyclonal antibody CK-17 (C372-K388), showed that compared with only the one signal for CP4-EPSPS detected by the SC-16 and CK-17 antibodies, the DC-16 middle region antibody detected four signals for CP4-EPSPS from five market sourced soy protein concentrates. Taken together, the study suggested that the middle region of CP4-EPSPS was more useful than the N- and C-terminal for tracing transgenic CP4-EPSPS protein and its remnants in highly processed soy-related products.
DNA clasping by mycobacterial HU: the C-terminal region of HupB mediates increased specificity of DNA binding.

Directory of Open Access Journals (Sweden)

Sandeep Kumar

Full Text Available BACKGROUND: HU, a small, basic, histone-like protein, is a major component of the bacterial nucleoid. E. coli has two subunits of HU coded by the hupA and hupB genes, whereas Mycobacterium tuberculosis (Mtb) has only one subunit of HU, coded by ORF Rv2986c (hupB gene). One noticeable feature regarding Mtb HupB, based on sequence alignment of HU orthologs from different bacteria, was that HupB(Mtb) bears a highly basic extension at its C-terminal end, and this prompted an examination of its role in Mtb HupB function. METHODOLOGY/PRINCIPAL FINDINGS: With this objective, two clones of Mtb HupB were generated: one expressing the full-length HupB protein (HupB(Mtb)) and another expressing only the N-terminal region (first 95 amino acids) of hupB (HupB(MtbN)). Gel retardation assays revealed that HupB(MtbN) is almost like E. coli HU (heat-stable nucleoid protein) in terms of its DNA binding, with a binding constant (K(d)) for linear dsDNA greater than 1000 nM, a value comparable to that obtained for the HUαα and HUαβ forms. However, the CTR (C-terminal region) of HupB(Mtb) imparts greater specificity in DNA binding. The HupB(Mtb) protein binds more strongly to supercoiled plasmid DNA than to linear DNA; this binding is also very stable, as it provides DNase I protection even up to 5 minutes. Similar results were obtained when the abilities of both proteins to mediate protection against DNA strand cleavage by hydroxyl radicals generated by the Fenton reaction were compared. It was also observed that both proteins have a DNA binding preference for A:T-rich DNA, which may occur at the regulatory regions of ORFs and the oriC region of Mtb. CONCLUSIONS/SIGNIFICANCE: These data thus indicate that HupB(Mtb) may participate in chromosome organization in vivo; it may also play a passive, possibly architectural, role.
Separate photosensitizers mediate degradation of the 32-kDa photosystem II reaction center protein in the visible and UV spectral regions

International Nuclear Information System (INIS)

Greenberg, B.M.; Gaba, V.; Canaani, O.; Malkin, S.; Mattoo, A.K.; Edelman, M.

1989-01-01

A component of the photosystem II reaction center, the 32-kDa protein, is rapidly turned over in the light. The mechanism of its light-dependent metabolism is largely unknown. We quantified the rate of 32-kDa protein degradation over a broad spectral range (UV, visible, and far red). The quantum yield for degradation was highest in the UVB (280-320 nm) region. Spectral evidence demonstrates two distinctly different photosensitizers for 32-kDa protein degradation. The data implicate the bulk photosynthetic pigments (primarily chlorophyll) in the visible and far red regions, and plastoquinone (in one or more of its redox states) in the UV region. A significant portion of 32-kDa protein degradation in sunlight is attributed to UVB irradiance
Optimal codes as Tanner codes with cyclic component codes

DEFF Research Database (Denmark)

Høholdt, Tom; Pinero, Fernando; Zeng, Peng

2014-01-01

In this article we study a class of graph codes with cyclic code component codes as affine variety codes. Within this class of Tanner codes we find some optimal binary codes. We use a particular subgraph of the point-line incidence plane of A(2,q) as the Tanner graph, and we are able to describe ...
Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

LENUS (Irish Health Repository)

Ivanov, Ivaylo P

2011-05-01

In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized, both for increased coding capacity and potentially also for novel regulatory mechanisms, remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.
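A first step toward finding candidate N-terminal extensions is to list near-cognate codons that lie upstream of, and in frame with, the annotated AUG, with no intervening in-frame stop codon. The sketch below does only that enumeration; it is not the authors' phylogenetic method, and it restricts itself to the six codons named in the abstract. The function name and the example leader sequences are hypothetical.

```python
NEAR_COGNATE = {"CUG", "UUG", "GUG", "ACG", "AUA", "AUU"}
STOPS = {"UAA", "UAG", "UGA"}


def upstream_near_cognates(leader):
    """List in-frame near-cognate codons in a 5' leader.

    `leader` is the sequence ending immediately before the annotated AUG.
    Returns (offset, codon) pairs, where offset is the position of the
    codon's first nucleotide relative to the A of the AUG (negative values).
    Scanning proceeds away from the AUG and stops at the first in-frame stop.
    """
    leader = leader.upper().replace("T", "U")  # accept DNA or RNA input
    hits = []
    for end in range(len(leader), 2, -3):
        codon = leader[end - 3:end]
        if codon in STOPS:
            break  # an extension cannot read through a stop codon
        if codon in NEAR_COGNATE:
            hits.append((end - 3 - len(leader), codon))
    return hits


# Hypothetical leader: UAG | GCC | CUG | AAA | GGG, followed by the AUG.
hits = upstream_near_cognates("UAGGCCCUGAAAGGG")
```

Candidates produced this way would still need conservation or experimental evidence, such as the evolutionary signatures and ribosome profiling data mentioned above, before being treated as real initiation sites.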
HippDB: a database of readily targeted helical protein-protein interactions.

Science.gov (United States)

Bergey, Christina M; Watkins, Andrew M; Arora, Paramjit S

2013-11-01

HippDB catalogs every protein-protein interaction whose structure is available in the Protein Data Bank and which exhibits one or more helices at the interface. The Web site accepts queries on variables such as helix length and sequence, and it provides computational alanine scanning and change in solvent-accessible surface area values for every interfacial residue. HippDB is intended to serve as a starting point for structure-based small molecule and peptidomimetic drug development. HippDB is freely available on the web at http://www.nyu.edu/projects/arora/hippdb. The Web site is implemented in PHP, MySQL and Apache. Source code is freely available for download at http://code.google.com/p/helidb, implemented in Perl and supported on Linux. Contact: arora@nyu.edu.
The N-terminus of porcine circovirus type 2 replication protein is required for nuclear localization and ori binding activities

International Nuclear Information System (INIS)

Lin, W.-L.; Chien, M.-S.; Du, Y.-W.; Wu, P.-C.; Huang Chienjin

2009-01-01

Porcine circovirus type 2 possesses a circular, single-stranded DNA genome that requires the replication protein (Rep) for virus replication. To characterize the DNA binding potential and the significant region that confers the nuclear localization of the Rep protein, the defined coding regions of rep gene were cloned and expressed. All of the recombinant proteins except for the N-terminal 110 residues deletion mutant could bind to the double-stranded minimal binding site of replication origin (ori). In addition, the N-terminal deletion mutant lacking 110 residues exhibited mainly cytoplasmic staining in the transfected cells in contrast to the others, which localized dominantly in the nucleus, suggesting that this N-terminal domain is essential for nuclear localization. Furthermore, a series of green fluorescence proteins (GFP) containing potential nuclear localization signal (NLS) sequences were tested for their cellular distribution. The ability of the utmost 20 residues of the N-terminal region to target the GFP to the nucleus confirmed its role as a functional NLS.
Karyopherin-mediated nuclear import of the homing endonuclease VMA1-derived endonuclease is required for self-propagation of the coding region.

Science.gov (United States)

Nagai, Yuri; Nogami, Satoru; Kumagai-Sano, Fumi; Ohya, Yoshikazu

2003-03-01

VMA1-derived endonuclease (VDE), a site-specific endonuclease in Saccharomyces cerevisiae, enters the nucleus to generate a double-strand break in the VDE-negative allelic locus, mediating the self-propagating gene conversion called homing. Although VDE is excluded from the nucleus in mitotic cells, it relocalizes at premeiosis, becoming localized in both the nucleus and the cytoplasm in meiosis. The nuclear localization of VDE is induced by inactivation of TOR kinases, which constitute central regulators of cell differentiation in S. cerevisiae, and by nutrient depletion. A functional genomic approach revealed that at least two karyopherins, Srp1p and Kap142p, are required for the nuclear localization pattern. Genetic and physical interactions between Srp1p and VDE imply direct involvement of karyopherin-mediated nuclear transport in this process. Inactivation of TOR signaling or acquisition of an extra nuclear localization signal in the VDE coding region leads to artificial nuclear localization of VDE and thereby induces homing even during mitosis. These results serve as evidence that VDE utilizes the host systems of nutrient signal transduction and nucleocytoplasmic transport to ensure the propagation of its coding region.
Tree ferns: monophyletic groups and their relationships as revealed by four protein-coding plastid loci.

Science.gov (United States)

Korall, Petra; Pryer, Kathleen M; Metzgar, Jordan S; Schneider, Harald; Conant, David S

2006-06-01

Tree ferns are a well-established clade within leptosporangiate ferns. Most of the 600 species (in seven families and 13 genera) are arborescent, but considerable morphological variability exists, spanning the giant scaly tree ferns (Cyatheaceae), the low, erect plants (Plagiogyriaceae), and the diminutive endemics of the Guayana Highlands (Hymenophyllopsidaceae). In this study, we investigate phylogenetic relationships within tree ferns based on analyses of four protein-coding, plastid loci (atpA, atpB, rbcL, and rps4). Our results reveal four well-supported clades, with genera of Dicksoniaceae (sensu ) interspersed among them: (A) (Loxomataceae, (Culcita, Plagiogyriaceae)), (B) (Calochlaena, (Dicksonia, Lophosoriaceae)), (C) Cibotium, and (D) Cyatheaceae, with Hymenophyllopsidaceae nested within. How these four groups are related to one another, to Thyrsopteris, or to Metaxyaceae is weakly supported. Our results show that Dicksoniaceae and Cyatheaceae, as currently recognised, are not monophyletic and new circumscriptions for these families are needed.
The N-terminal, polybasic region is critical for prion protein neuroprotective activity.

Directory of Open Access Journals (Sweden)

Jessie A Turnbaugh

Several lines of evidence suggest that the normal form of the prion protein, PrP(C), exerts a neuroprotective activity against cellular stress or toxicity. One of the clearest examples of such activity is the ability of wild-type PrP(C) to suppress the spontaneous neurodegenerative phenotype of transgenic mice expressing a deleted form of PrP (Δ32-134), called F35. To define domains of PrP involved in its neuroprotective activity, we have analyzed the ability of several deletion mutants of PrP (Δ23-31, Δ23-111, and Δ23-134) to rescue the phenotype of Tg(F35) mice. Surprisingly, all of these mutants displayed greatly diminished rescue activity, although Δ23-31 PrP partially suppressed neuronal loss when expressed at very high levels. Our results pinpoint the N-terminal, polybasic domain as a critical determinant of PrP(C) neuroprotective activity, and suggest that identification of molecules interacting with this region will provide important clues regarding the normal function of the protein. Small molecule ligands targeting this region may also represent useful therapeutic agents for treatment of prion diseases.
Crystal Structure Analysis and the Identification of Distinctive Functional Regions of the Protein Elicitor Mohrip2.

Science.gov (United States)

Liu, Mengjie; Duan, Liangwei; Wang, Meifang; Zeng, Hongmei; Liu, Xinqi; Qiu, Dewen

2016-01-01

The protein elicitor MoHrip2, which was extracted from Magnaporthe oryzae as an exocrine protein, triggers the tobacco immune system and enhances blast resistance in rice. However, the detailed mechanisms by which MoHrip2 acts as an elicitor remain unclear. Here, we investigated the structure of MoHrip2 to elucidate its functions based on molecular structure. The three-dimensional structure of MoHrip2 was obtained. Overall, the crystal structure formed a β-barrel structure and showed high similarity to the pathogenesis-related (PR) thaumatin superfamily protein thaumatin-like xylanase inhibitor (TL-XI). To investigate the functional regions responsible for MoHrip2 elicitor activities, the full length and eight truncated proteins were expressed in Escherichia coli and were evaluated for elicitor activity in tobacco. Biological function analysis showed that MoHrip2 triggered the defense system against Botrytis cinerea in tobacco. Moreover, only MoHrip2M14 and other fragments containing the 14 amino acids residues in the middle region of the protein showed the elicitor activity of inducing a hypersensitive response and resistance related pathways, which were similar to that of full-length MoHrip2. These results revealed that the central 14 amino acid residues were essential for anti-pathogenic activity.
Crystal Structure Analysis and the Identification of Distinctive Functional Regions of the Protein Elicitor Mohrip2

Directory of Open Access Journals (Sweden)

Mengjie Liu

2016-07-01

The protein elicitor MoHrip2, which was extracted from Magnaporthe oryzae as an exocrine protein, triggers the tobacco immune system and enhances blast resistance in rice. However, the detailed mechanisms by which MoHrip2 acts as an elicitor remain unclear. Here, we investigated the structure of MoHrip2 to elucidate its functions based on molecular structure. The 3-dimensional structure of MoHrip2 was obtained. Overall, the crystal structure formed a β-barrel structure and showed high similarity to the pathogenesis-related (PR) thaumatin superfamily protein thaumatin-like xylanase inhibitor (TL-XI). To investigate the functional regions responsible for MoHrip2 elicitor activities, the full length and 8 truncated proteins were expressed in Escherichia coli and were evaluated for elicitor activity in tobacco. Biological function analysis showed that MoHrip2 triggered the defense system against Botrytis cinerea in tobacco. Moreover, only MoHrip2M14 and other fragments containing the 14 amino acids residues in the middle region of the protein showed the elicitor activity of inducing a hypersensitive response and resistance related pathways, which were similar to that of full-length MoHrip2. These results revealed that the central 14 amino acid residues were essential for anti-pathogenic activity.
Identifying the adaptive mechanism in globular proteins: Fluctuations in densely packed regions manipulate flexible parts

Science.gov (United States)

Yilmaz, Lutfu Safak; Atilgan, Ali Rana

2000-09-01

A low-resolution structural model based on the packing geometry of α-carbons is utilized to establish a connection between the flexible and rigid parts of a folded protein. The former commonly recognizes a complementing molecule for making a complex, while the latter manipulates the necessary conformational change for binding. We attempt analytically to distinguish this control architecture that intrinsically exists in globular proteins. First with two-dimensional simple models, then for a native protein, bovine pancreatic trypsin inhibitor, we explicitly demonstrate that inserting fluctuations in tertiary contacts supported by the stable core, one can regulate the displacement of residues on loop regions. The positional fluctuations of the flexible regions are annihilated by the rest of the protein in conformity with the Le Chatelier-Braun principle. The results indicate that the distortion of the principal nonbonded contacts between highly packed residues is accompanied by that of the slavery fluctuations that are widely distributed over the native structure. These positional arrangements do not appear in a reciprocal relation between a perturbation and the associated response; the effect of a movement of residue i on residue j is not equal to that of the same movement of residue j on residue i.

CRNDE: a long non-coding RNA involved in CanceR, Neurobiology and DEvelopment

Directory of Open Access Journals (Sweden)

Blake C. Ellis

2012-11-01

CRNDE is the gene symbol for Colorectal Neoplasia Differentially Expressed (non protein-coding), a long non-coding RNA (lncRNA) gene that expresses multiple splice variants and displays a very tissue-specific pattern of expression. CRNDE was initially identified as a lncRNA whose expression is highly elevated in colorectal cancer, but it is also upregulated in many other solid tumors and in leukemias. Indeed, CRNDE is the most upregulated lncRNA in gliomas and here, as in other cancers, it is associated with a stemness signature. CRNDE is expressed in specific regions within the human and mouse brain; the mouse ortholog is high in induced pluripotent stem cells and increases further during neuronal differentiation. We suggest that CRNDE is a multifunctional lncRNA whose different splice forms provide specific functional scaffolds for regulatory complexes, such as the polycomb repressive complex 2 (PRC2) and CoREST chromatin-modifying complexes, which CRNDE helps pilot to target genes.
A novel strategy using MASCOT Distiller for analysis of cleavable isotope-coded affinity tag data to quantify protein changes in plasma.

Science.gov (United States)

Leung, Kit-Yi; Lescuyer, Pierre; Campbell, James; Byers, Helen L; Allard, Laure; Sanchez, Jean-Charles; Ward, Malcolm A

2005-08-01

A novel strategy consisting of cleavable Isotope-Coded Affinity Tag (cICAT) combined with MASCOT Distiller was evaluated as a tool for the quantification of proteins in "abnormal" patient plasma, prepared by pooling samples from patients with acute stroke. Quantification of all light and heavy cICAT-labelled peptide ion pairs was obtained using MASCOT Distiller combined with proprietary software. Peptides displaying differences were selected for identification by MS. These preliminary results show the promise of our approach to identify potential biomarkers.
Decoding the non-coding RNAs in Alzheimer's disease.

Science.gov (United States)

Schonrock, Nicole; Götz, Jürgen

2012-11-01

Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.
Present state of the SOURCES computer code

International Nuclear Information System (INIS)

Shores, Erik F.

2002-01-01

In various stages of development for over two decades, the SOURCES computer code continues to calculate neutron production rates and spectra from four types of problems: homogeneous media, two-region interfaces, three-region interfaces and that of a monoenergetic alpha particle beam incident on a slab of target material. Graduate work at the University of Missouri - Rolla, in addition to user feedback from a tutorial course, provided the impetus for a variety of code improvements. Recently upgraded to version 4B, initial modifications to SOURCES focused on updates to the 'tape5' decay data library. Shortly thereafter, efforts focused on development of a graphical user interface for the code. This paper documents the Los Alamos SOURCES Tape1 Creator and Library Link (LASTCALL) and describes additional library modifications in more detail. Minor improvements and planned enhancements are discussed.
Proteomic analysis of proteins expressing in regions of rat brain by a combination of SDS-PAGE with nano-liquid chromatography-quadrupole-time of flight tandem mass spectrometry

Directory of Open Access Journals (Sweden)

Maekawa Tsuyoshi

2010-07-01

Background: Most biological functions controlled by the brain and their related disorders are closely associated with activation in specific regions of the brain. Neuroproteomics has been applied to the analysis of whole brain, and the general pattern of protein expression in all regions has been elucidated. However, the comprehensive proteome of each brain region remains unclear. Results: In this study, we carried out comparative proteomics of six regions of the adult rat brain: thalamus, hippocampus, frontal cortex, parietal cortex, occipital cortex, and amygdala using semi-quantitative analysis by Mascot Score of the identified proteins. In order to identify efficiently the proteins that are present in the brain, the proteins were separated by a combination of SDS-PAGE on a C18 column-equipped nano-liquid chromatograph, and analyzed by quadrupole-time of flight-tandem-mass spectrometry. The proteomic data show 2,909 peptides in the rat brain, with more than 200 identified as region-abundant proteins by semi-quantitative analysis. The regions containing the identified proteins are membrane (20.0%), cytoplasm (19.5%), mitochondrion (17.1%), cytoskeleton (8.2%), nucleus (4.7%), extracellular region (3.3%), and other (18.0%). Of the identified proteins, the expressions of glial fibrillary acidic protein, GABA transporter 3, Septin 5, heat shock protein 90, synaptotagmin, heat shock protein 70, and pyruvate kinase were confirmed by immunoblotting. We examined the distributions in rat brain of GABA transporter 3, glial fibrillary acidic protein, and heat shock protein 70 by immunohistochemistry, and found that the proteins are localized around the regions observed by proteomic analysis and immunoblotting. IPA analysis indicates that pathways closely related to the biological functions of each region may be activated in rat brain. Conclusions: These observations indicate that proteomics in each region of adult rat brain may provide a novel way to
TOPIC: a debugging code for torus geometry input data of Monte Carlo transport code

International Nuclear Information System (INIS)

Iida, Hiromasa; Kawasaki, Hiromitsu.

1979-06-01

TOPIC has been developed for debugging geometry input data of the Monte Carlo transport code. The code has the following features: (1) It debugs the geometry input data of not only MORSE-GG but also MORSE-I capable of treating torus geometry. (2) Its calculation results are shown in figures drawn by Plotter or COM, and the regions not defined or doubly defined are easily detected. (3) It finds a multitude of input data errors in a single run. (4) The input data required in this code are few, so that it is readily usable in a time sharing system of FACOM 230-60/75 computer. Example TOPIC calculations in design study of tokamak fusion reactors (JXFR, INTOR-J) are presented. (author)
C-terminal region of herpes simplex virus ICP8 protein needed for intranuclear localization

International Nuclear Information System (INIS)

Taylor, Travis J; Knipe, David M.

2003-01-01

The herpes simplex virus single-stranded DNA-binding protein, ICP8, localizes initially to structures in the nucleus called prereplicative sites. As replication proceeds, these sites mature into large globular structures called replication compartments. The details of what signals or proteins are involved in the redistribution of viral and cellular proteins within the nucleus between prereplicative sites and replication compartments are poorly understood; however, we showed previously that the dominant-negative d105 ICP8 does not localize to prereplicative sites and prevents the localization of other viral proteins to prereplicative sites (J. Virol. 74 (2000) 10122). Within the residues deleted in d105 (1083 to 1168), we identified a region between amino acid residues 1080 and 1135 that was predicted by computer models to contain two α-helices, one with considerable amphipathic nature. We used site-specific and random mutagenesis techniques to identify residues or structures within this region that are required for proper ICP8 localization within the nucleus. Proline substitutions in the predicted helix generated ICP8 molecules that did not localize to prereplicative sites and acted as dominant-negative inhibitors. Other substitutions that altered the charged residues in the predicted α-helix to alanine or leucine residues had little or no effect on ICP8 intranuclear localization. The predicted α-helix was dispensable for the interaction of ICP8 with the UL9 origin-binding protein. We propose that this C-terminal α-helix is required for localization of ICP8 to prereplicative sites by binding viral or cellular factors that target or retain ICP8 at specific intranuclear sites.
Structural analysis of human complement protein H: homology with C4b binding protein, beta 2-glycoprotein I, and the Ba fragment of B2

DEFF Research Database (Denmark)

Kristensen, Torsten; Wetsel, R A; Tack, B F

1986-01-01

We report here a partial primary structure for human complement protein H. Tryptic peptides comprising 27% of the H molecule were isolated by conventional techniques and were sequenced (333 amino acid residues). Several mixed-sequence oligonucleotide probes were constructed, based on the peptide sequence data, and were used to screen a human liver cDNA library. The largest recombinant plasmid (pH1050), which hybridized with two probes, was further characterized. The cDNA insert of this plasmid contained coding sequence (672 bp) for 224 amino acids of H. The 3' end of this clone had a polyadenylated tail preceded by a polyadenylation recognition site (ATTAAA) and a 3'-untranslated region (229 bp). Four regions of internal homology, each about 60 amino acids in length, were observed in the derived protein sequence from this cDNA clone, and a further seven from the tryptic peptide sequences...
The InterFrost benchmark of Thermo-Hydraulic codes for cold regions hydrology - first inter-comparison results

Science.gov (United States)

Grenier, Christophe; Roux, Nicolas; Anbergen, Hauke; Collier, Nathaniel; Costard, Francois; Ferry, Michel; Frampton, Andrew; Frederick, Jennifer; Holmen, Johan; Jost, Anne; Kokh, Samuel; Kurylyk, Barret; McKenzie, Jeffrey; Molson, John; Orgogozo, Laurent; Rivière, Agnès; Rühaak, Wolfram; Selroos, Jan-Olof; Therrien, René; Vidstrand, Patrik

2015-04-01

The impacts of climate change in boreal regions have received considerable attention recently due to the warming trends that have been experienced in recent decades and are expected to intensify in the future. Large portions of these regions, corresponding to permafrost areas, are covered by water bodies (lakes, rivers) that interact with the surrounding permafrost. For example, the thermal state of the surrounding soil influences the energy and water budget of the surface water bodies. Also, these water bodies generate taliks (unfrozen zones below) that disturb the thermal regimes of permafrost and may play a key role in the context of climate change. Recent field studies and modeling exercises indicate that a fully coupled 2D or 3D Thermo-Hydraulic (TH) approach is required to understand and model the past and future evolution of landscapes, rivers, lakes and associated groundwater systems in a changing climate. However, there is presently a paucity of 3D numerical studies of permafrost thaw and associated hydrological changes, and the lack of study can be partly attributed to the difficulty in verifying multi-dimensional results produced by numerical models. Numerical approaches can only be validated against analytical solutions for a purely thermic 1D equation with phase change (e.g. Neumann, Lunardini). When it comes to the coupled TH system (coupling two highly non-linear equations), the only possible approach is to compare the results from different codes to provided test cases and/or to have controlled experiments for validation. Such inter-code comparisons can propel discussions to try to improve code performances. A benchmark exercise was initialized in 2014 with a kick-off meeting in Paris in November. Participants from USA, Canada, Germany, Sweden and France convened, representing altogether 13 simulation codes. The benchmark exercises consist of several test cases inspired by existing literature (e.g. McKenzie et al., 2007) as well as new ones. They
TEMP: a computer code to calculate fuel pin temperatures during a transient

International Nuclear Information System (INIS)

Bard, F.E.; Christensen, B.Y.; Gneiting, B.C.

1980-04-01

The computer code TEMP calculates fuel pin temperatures during a transient. It was developed to accommodate temperature calculations in any system of axi-symmetric concentric cylinders. When used to calculate fuel pin temperatures, the code will handle a fuel pin as simple as a solid cylinder or as complex as a central void surrounded by fuel that is broken into three regions by two circumferential cracks. Any fuel situation between these two extremes can be analyzed along with additional cladding, heat sink, coolant or capsule regions surrounding the fuel. The one-region version of the code accurately calculates the solution to two problems having closed-form solutions. The code uses an implicit method, an explicit method and a Crank-Nicolson (implicit-explicit) method.
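The three time-stepping schemes named in the abstract are commonly written as one "theta" scheme: theta = 0 gives the explicit method, theta = 1 the fully implicit method, and theta = 0.5 Crank-Nicolson. The sketch below is not TEMP itself. It assumes a one-dimensional planar slab (not concentric cylinders), constant thermal diffusivity, and fixed end temperatures; the function names and the parameter `r = alpha * dt / dx**2` are introduced here purely for illustration.

```python
def thomas_solve(lower, diag, upper, rhs):
    """Solve a tridiagonal linear system with the Thomas algorithm.

    lower[i] and upper[i] are the sub- and super-diagonal entries of row i.
    """
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def theta_step(temps, r, theta):
    """Advance node temperatures by one time step; end nodes stay fixed.

    r = alpha * dt / dx**2 (assumed constant). theta = 0 is explicit,
    theta = 1 is fully implicit, theta = 0.5 is Crank-Nicolson.
    """
    n = len(temps)
    # Explicit share of the conduction term goes on the right-hand side.
    rhs = list(temps)
    for i in range(1, n - 1):
        second_diff = temps[i - 1] - 2.0 * temps[i] + temps[i + 1]
        rhs[i] = temps[i] + (1.0 - theta) * r * second_diff
    if theta == 0:
        return rhs
    # Implicit share forms a tridiagonal system; boundary rows are identity.
    lower = [0.0] + [-theta * r] * (n - 2) + [0.0]
    diag = [1.0] + [1.0 + 2.0 * theta * r] * (n - 2) + [1.0]
    upper = [0.0] + [-theta * r] * (n - 2) + [0.0]
    return thomas_solve(lower, diag, upper, rhs)


if __name__ == "__main__":
    temps = [100.0, 0.0, 0.0, 0.0, 0.0]
    for _ in range(200):
        temps = theta_step(temps, r=0.4, theta=0.5)
    print([round(t, 3) for t in temps])
```

The explicit variant is only stable for r up to 0.5 in this planar setting, which is one common reason a code would offer implicit and Crank-Nicolson options alongside it.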
Developmental programming of long non-coding RNAs during postnatal liver maturation in mice.

Directory of Open Access Journals (Sweden)

Lai Peng

The liver is a vital organ with critical functions in metabolism, protein synthesis, and immune defense. Most of the liver functions are not mature at birth and many changes happen during postnatal liver development. However, it is unclear what changes occur in liver after birth, at what developmental stages they occur, and how the developmental processes are regulated. Long non-coding RNAs (lncRNAs) are involved in organ development and cell differentiation. Here, we analyzed the transcriptome of lncRNAs in mouse liver from perinatal (day -2) to adult (day 60) by RNA-Sequencing, with an attempt to understand the role of lncRNAs in liver maturation. We found around 15,000 genes expressed, including about 2,000 lncRNAs. Most lncRNAs were expressed at a lower level than coding RNAs. Both coding RNAs and lncRNAs displayed three major ontogenic patterns: enriched at neonatal, adolescent, or adult stages. Neighboring coding and non-coding RNAs showed the trend to exhibit highly correlated ontogenic expression patterns. Gene ontology (GO) analysis revealed that some lncRNAs enriched at neonatal ages have their neighbor protein coding genes also enriched at neonatal ages and associated with cell proliferation, immune activation related processes, tissue organization pathways, and hematopoiesis; other lncRNAs enriched at adolescent ages have their neighbor protein coding genes associated with different metabolic processes. These data reveal significant functional transition during postnatal liver development and imply the potential importance of lncRNAs in liver maturation.
Development of a thermal–hydraulic system code, TAPINS, for 10 MW regional energy reactor

International Nuclear Information System (INIS)

Lee, Yeon-Gun; Kim, Jong-Won; Park, Goon-Cherl

2012-01-01

Highlights: ► A thermal–hydraulic system code named TAPINS is developed for simulations of an integral reactor. ► The TAPINS is based on the one-dimensional momentum integral model. ► A dynamic model for the steam–gas pressurizer with non-condensable gas present is proposed. ► A series of pressurizer insurge test and natural circulation test are simulated by the TAPINS. ► It is proved that the TAPINS can provide reliable prediction of an integral reactor system on natural circulation. - Abstract: Small modular reactors (SMRs) with integral system layout have been drawing a great deal of attention as alternative options to branch out the utilization of nuclear energy as well as to offer the inherent safety features. Serving to confirm the design basis and analyze the transient behavior of an integral reactor such as REX-10, a thermal–hydraulic system code named TAPINS (Thermal–hydraulic Analysis Program for INtegral reactor System) is developed in this study. The TAPINS supports the simple pre-processing to build up the frameworks of node diagram for the typical integral reactor configuration. The TAPINS basically consists of mathematical models for the reactor coolant system, the core, the once-through helical-coil steam generator, and the built-in steam–gas pressurizer. The hydrodynamic model of the TAPINS is formulated using the one-dimensional momentum integral model, which is based on the analytical integration of the momentum equation around the closed loop in the system. As a key contribution of the study, a dynamic model for the steam–gas pressurizer with non-condensable gas present is newly proposed and incorporated into the code. The TAPINS is validated by comparing against the experimental data from the pressurizer insurge tests conducted at MIT (Massachusetts Institute of Technology) and natural circulation tests in the RTF (REX-10 Test Facility) at RERI (Regional Energy Reactor Institute). From the comparison results, it is
Comparative In silico Study of Sex-Determining Region Y (SRY) Protein Sequences Involved in Sex-Determining

Directory of Open Access Journals (Sweden)

Masoume Vakili Azghandi

2016-05-01

Background: The SRY gene (SRY) provides instructions for making a transcription factor called the sex-determining region Y protein. The sex-determining region Y protein causes a fetus to develop as a male. In this study, SRY sequences of 15 species (human, chimpanzee, dog, pig, rat, cattle, buffalo, goat, sheep, horse, zebra, frog, urial, dolphin and killer whale) were used to determine bioinformatic differences. Methods: Nucleotide sequences of SRY were retrieved from the NCBI databank. Bioinformatic analysis of SRY was done with CLC Main Workbench version 5.5, ClustalW (http://www.ebi.ac.uk/clustalw/) and MEGA6 software. Results: The multiple sequence alignment results indicated that SRY protein sequences from Orcinus orca (killer whale) and Tursiops aduncus (dolphin) have the least genetic distance of 0.33 among these 15 species and are 99.67% identical at the amino acid level. Homo sapiens and Pan troglodytes (chimpanzee) have the next lowest genetic distance of 1.35 and are 98.65% identical at the amino acid level. Conclusion: These findings indicate that the SRY proteins are conserved in the 15 species, and their evolutionary relationships are similar.
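The paired figures quoted in the abstract (distance 0.33 with 99.67% identity, distance 1.35 with 98.65% identity) are consistent with a distance expressed as 100 minus percent identity. The sketch below shows that relationship on two already-aligned sequences. It is an assumption that the reported distances were derived this way; the function names and the choice to skip gapped columns are illustrative and are not taken from CLC Main Workbench, ClustalW or MEGA6.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of identical residues over aligned columns without gaps."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # ignore columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def distance_percent(aligned_a, aligned_b):
    """Proportion of differing sites, expressed in percent."""
    return 100.0 - percent_identity(aligned_a, aligned_b)


if __name__ == "__main__":
    # Toy aligned fragments, not real SRY sequences.
    a = "MQSY-AS"
    b = "MQSYTAT"
    print(round(percent_identity(a, b), 2), round(distance_percent(a, b), 2))
```

Tools such as MEGA also offer model-corrected distances, which would not equal 100 minus identity exactly, so this simple relation should be read as one plausible interpretation of the reported numbers.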
Patient Dashboard: the use of a colour-coded computerised clinical reminder in Whanganui regional general practices.

Science.gov (United States)

McMenamin, John; Nicholson, Rick; Leech, Ken

2011-12-01

Clinical reminders have been shown to help general practice achieve an increase in some preventive care items, especially if they identify a patient's eligibility for the target item, prompt clinicians at the right time, provide a fast link to management tools and facilitate clinical recording. WRPHO has introduced the Patient Dashboard clinical reminder and monitored its impact on health targets. This paper reports the impact of a computerised colour-coded clinical reminder on achieving agreed health targets in Whanganui regional practices. Patient Dashboard was developed from previous versions in Auckland and Northland and provided to Whanganui regional practices with Primary Health Organisation (PHO) support. The Dashboard was linked with existing and new clinical management tools which automatically updated clinical records. Data from practices was pooled by Whanganui Regional Primary Health Organisation and target achievement rates reported over 15 months. Over the initial 15 months of Patient Dashboard use, recording of smoking status increased from 74% to 82% and of alcohol use from 15% to 47%. Screening for diabetes increased from 62% to 74%, cardiovascular risk assessment from 20% to 43%, cervical screening from 71% to 79%, and breast screening from 60% to 80%. Patient Dashboard was associated with increased performance indicators both for those targets which were part of a PHO programme and for targets without additional support.
Classifying Coding DNA with Nucleotide Statistics

Directory of Open Access Journals (Sweden)

Nicolas Carels

2009-10-01

In this report, we compared the success rate of classification of coding sequences (CDS) vs. introns by Codon Structure Factor (CSF) and by a method that we called Universal Feature Method (UFM). UFM is based on the scoring of purine bias (Rrr) and stop codon frequency. We show that the success rate of CDS/intron classification by UFM is higher than by CSF. UFM classifies ORFs as coding or non-coding through a score based on (i) the stop codon distribution, (ii) the product of purine probabilities in the three positions of nucleotide triplets, (iii) the product of Cytosine (C), Guanine (G), and Adenine (A) probabilities in the 1st, 2nd, and 3rd positions of triplets, respectively, (iv) the probabilities of G in 1st and 2nd position of triplets and (v) the distance of their GC3 vs. GC2 levels to the regression line of the universal correlation. More than 80% of CDSs (true positives) of Homo sapiens (>250 bp), Drosophila melanogaster (>250 bp) and Arabidopsis thaliana (>200 bp) are successfully classified with a false positive rate lower or equal to 5%. The method releases coding sequences in their coding strand and coding frame, which allows their automatic translation into protein sequences with 95% confidence. The method is a natural consequence of the compositional bias of nucleotides in coding sequences.
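Two of the ingredients listed in the abstract, the stop codon frequency in a given frame and the product of purine probabilities at the three codon positions, are simple enough to sketch. The code below computes only those two per-frame statistics; it is not the UFM score, which the abstract describes as combining five features in a way not spelled out here. All function names are hypothetical, and the toy sequence is not real data.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code, DNA alphabet
PURINES = {"A", "G"}


def codons_in_frame(seq, frame):
    """Split a DNA sequence into complete triplets starting at offset `frame`."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]


def stop_codon_frequency(seq, frame):
    """Fraction of triplets in the given frame that are stop codons."""
    codons = codons_in_frame(seq, frame)
    if not codons:
        return 0.0
    return sum(c in STOP_CODONS for c in codons) / len(codons)


def purine_fraction_by_position(seq, frame):
    """Observed purine (A or G) fraction at codon positions 1, 2 and 3."""
    codons = codons_in_frame(seq, frame)
    if not codons:
        return (0.0, 0.0, 0.0)
    return tuple(
        sum(c[pos] in PURINES for c in codons) / len(codons) for pos in range(3)
    )


def purine_product(seq, frame):
    """Product of the three position-wise purine fractions."""
    p1, p2, p3 = purine_fraction_by_position(seq, frame)
    return p1 * p2 * p3


if __name__ == "__main__":
    toy = "ATGGCAGAATAA"
    for frame in range(3):
        print(frame, stop_codon_frequency(toy, frame), purine_product(toy, frame))
```

In a real coding frame one would expect no internal stop codons, so comparing these statistics across the three frames (and the reverse strand) gives a first rough indication of which frame, if any, looks coding.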
Quantification of non-coding RNA target localization diversity and its application in cancers.

Science.gov (United States)

Cheng, Lixin; Leung, Kwong-Sak

2018-04-01

Subcellular localization is pivotal for RNAs and proteins to implement biological functions. The localization diversity of protein interactions has been studied as a crucial feature of proteins, considering that the protein-protein interactions take place in various subcellular locations. Nevertheless, the localization diversity of non-coding RNA (ncRNA) target proteins has not been systematically studied, especially its characteristics in cancers. In this study, we provide a new algorithm, non-coding RNA target localization coefficient (ncTALENT), to quantify the target localization diversity of ncRNAs based on the ncRNA-protein interaction and protein subcellular localization data. ncTALENT can be used to calculate the target localization coefficient of ncRNAs and measure how diversely their targets are distributed among the subcellular locations in various scenarios. We focus our study on long non-coding RNAs (lncRNAs), and our observations reveal that the target localization diversity is a primary characteristic of lncRNAs in different biotypes. Moreover, we found that lncRNAs in multiple cancers, differentially expressed cancer lncRNAs, and lncRNAs with multiple cancer target proteins are prone to have high target localization diversity. Furthermore, the analysis of gastric cancer helps us to obtain a better understanding that the target localization diversity of lncRNAs is an important feature closely related to clinical prognosis. Overall, we systematically studied the target localization diversity of the lncRNAs and uncovered its association with cancer.
Molecular cloning and biologically active production of IpaD N-terminal region.

Science.gov (United States)

Hesaraki, Mahdi; Saadati, Mojtaba; Honari, Hossein; Olad, Gholamreza; Heiat, Mohammad; Malaei, Fatemeh; Ranjbar, Reza

2013-07-01

Shigella are widely dispersed pathogenic intestinal bacteria whose pathogenicity is due to invasive plasmid antigens (Ipa). So far, a number of Ipa proteins have been studied to introduce a new candidate vaccine. Here, for the first time, we examined whether the N-terminal region of IpaD(72-162) could be a proper candidate for Shigella vaccine. Initially, the DNA sequence coding for the N-terminal region was isolated by PCR from Shigella dysenteriae type I and cloned into the pET-28a expression vector. Then, the heterologous protein was expressed, optimized and purified on a Ni-NTA affinity column. Western blot analysis using His-tag and IpaD(72-162) polyclonal antibodies confirmed the purity and specificity of the recombinant protein, respectively. Subsequently, the high immunogenicity of the antigen was shown by ELISA. The results of the Sereny test in guinea pigs showed that IpaD(72-162) provides a protective system against Shigella flexneri 5a and S. dysenteriae type I. Copyright © 2013. Published by Elsevier Ltd.
RNA-Binding Proteins in Trichomonas vaginalis: Atypical Multifunctional Proteins Involved in a Posttranscriptional Iron Regulatory Mechanism

Science.gov (United States)

Figueroa-Angulo, Elisa E.; Calla-Choque, Jaeson S.; Mancilla-Olea, Maria Inocente; Arroyo, Rossana

2015-01-01

Iron homeostasis is highly regulated in vertebrates through a regulatory system mediated by RNA-protein interactions between the iron regulatory proteins (IRPs) that interact with an iron responsive element (IRE) located in certain mRNAs, dubbed the IRE-IRP regulatory system. Trichomonas vaginalis, the causal agent of trichomoniasis, presents high iron dependency to regulate its growth, metabolism, and virulence properties. Although T. vaginalis lacks IRPs or proteins with aconitase activity, it possesses gene expression mechanisms of iron regulation at the transcriptional and posttranscriptional levels. However, only one gene with iron regulation at the transcriptional level has been described. Recently, our research group described an iron posttranscriptional regulatory mechanism in the T. vaginalis tvcp4 and tvcp12 cysteine proteinase mRNAs. The tvcp4 and tvcp12 mRNAs have a stem-loop structure in the 5'-coding region or in the 3'-UTR, respectively, that interacts with T. vaginalis multifunctional proteins HSP70, α-Actinin, and Actin under iron starvation conditions, causing translation inhibition or mRNA stabilization similar to the previously characterized IRE-IRP system in eukaryotes. Herein, we summarize recent progress and shed some light on atypical RNA-binding proteins that may participate in the iron posttranscriptional regulation in T. vaginalis. PMID:26703754
The "periodic table" of the genetic code: A new way to look at the code and the decoding process.

Science.gov (United States)

Komar, Anton A

2016-01-01

Henri Grosjean and Eric Westhof recently presented an information-rich, alternative view of the genetic code, which takes into account current knowledge of the decoding process, including the complex nature of interactions between mRNA, tRNA and rRNA that take place during protein synthesis on the ribosome, and it also better reflects the evolution of the code. The new asymmetrical circular genetic code has a number of advantages over the traditional codon table and the previous circular diagrams (with a symmetrical/clockwise arrangement of the U, C, A, G bases). Most importantly, all sequence co-variances can be visualized and explained based on the internal logic of the thermodynamics of codon-anticodon interactions.
Proteomic Identification of Altered Cerebral Proteins in the Complex Regional Pain Syndrome Animal Model

Directory of Open Access Journals (Sweden)

Francis Sahngun Nahm

2014-01-01

Full Text Available Background. Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. Materials and Methods. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expressions of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation were increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in CPIP group. Conclusion. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.

Proteomic identification of altered cerebral proteins in the complex regional pain syndrome animal model.

Science.gov (United States)

Nahm, Francis Sahngun; Park, Zee-Yong; Nahm, Sang-Soep; Kim, Yong Chul; Lee, Pyung Bok

2014-01-01

Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expressions of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation were increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in CPIP group. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.
URR [Unresolved Resonance Region] computer code: A code to calculate resonance neutron cross-section probability tables, Bondarenko self-shielding factors, and self-indication ratios for fissile and fertile nuclides

International Nuclear Information System (INIS)

Leal, L.C.; de Saussure, G.; Perez, R.B.

1990-01-01

The URR computer code has been developed to calculate cross-section probability tables, Bondarenko self-shielding factors, and self-indication ratios for fertile and fissile isotopes in the unresolved resonance region. Monte Carlo methods are utilized to select appropriate resonance parameters and to compute the cross sections at the desired reference energy. The neutron cross sections are calculated by the single-level Breit-Wigner formalism with s-, p-, and d-wave contributions. The cross-section probability tables are constructed by sampling the Doppler-broadened cross sections. The various self-shielding factors are computed numerically as Lebesgue integrals over the cross-section probability tables.
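As an illustration of how a self-shielding factor can be evaluated from a probability table, the sketch below uses the usual narrow-resonance weighting 1/(σ_t + σ_0) and replaces the integral over the table with a sum over its bins. It is not taken from the URR code itself: the function name, the argument layout, and the assumption that the table probabilities sum to one are all hypothetical choices made for this example.

```python
# Minimal sketch, assuming a discrete probability table with bins i that carry
# a probability p_i, a total cross section sigma_t[i] and a partial cross
# section sigma_x[i] (all names hypothetical; not the URR code's interface).

def bondarenko_factor(probabilities, sigma_total, sigma_x, sigma_0):
    """Ratio of the self-shielded to the infinitely dilute cross section.

    The shielded value weights each bin by p_i / (sigma_t[i] + sigma_0),
    where sigma_0 is the background (dilution) cross section.
    """
    weights = [p / (st + sigma_0) for p, st in zip(probabilities, sigma_total)]
    shielded = sum(w * sx for w, sx in zip(weights, sigma_x)) / sum(weights)
    # Infinite dilution: plain probability-weighted mean of the partial value.
    infinite_dilution = sum(p * sx for p, sx in zip(probabilities, sigma_x))
    return shielded / infinite_dilution
```

With a very large `sigma_0` the weights become nearly uniform and the factor tends to 1; with a small `sigma_0` the high cross-section bins are suppressed and the factor falls below 1, which is the qualitative behaviour expected of self-shielding.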
MARS Code in Linux Environment

Energy Technology Data Exchange (ETDEWEB)

Hwang, Moon Kyu; Bae, Sung Won; Jung, Jae Joon; Chung, Bub Dong [Korea Atomic Energy Research Institute, Taejon (Korea, Republic of)

2005-07-01

The two-phase system analysis code MARS has been incorporated into Linux system. The MARS code was originally developed based on the RELAP5/MOD3.2 and COBRA-TF. The 1-D module which evolved from RELAP5 alone could be applied for the whole NSSS system analysis. The 3-D module developed based on the COBRA-TF, however, could be applied for the analysis of the reactor core region where 3-D phenomena would be better treated. The MARS code also has several other code units that could be incorporated for more detailed analysis. The separate code units include containment analysis modules and 3-D kinetics module. These code modules could be optionally invoked to be coupled with the main MARS code. The containment code modules (CONTAIN and CONTEMPT), for example, could be utilized for the analysis of the plant containment phenomena in a coupled manner with the nuclear reactor system. The mass and energy interaction during the hypothetical coolant leakage accident could, thereby, be analyzed in a more realistic manner. In a similar way, 3-D kinetics could be incorporated for simulating the three dimensional reactor kinetic behavior, instead of using the built-in point kinetics model. The MARS code system, developed initially for the MS Windows environment, however, would not be adequate enough for the PC cluster system where multiple CPUs are available. When parallelism is to be eventually incorporated into the MARS code, MS Windows environment is not considered as an optimum platform. Linux environment, on the other hand, is generally being adopted as a preferred platform for the multiple codes executions as well as for the parallel application. In this study, MARS code has been modified for the adaptation of Linux platform. For the initial code modification, the Windows system specific features have been removed from the code. Since the coupling code module CONTAIN is originally in a form of dynamic load library (DLL) in the Windows system, a similar adaptation method
MARS Code in Linux Environment

International Nuclear Information System (INIS)

Hwang, Moon Kyu; Bae, Sung Won; Jung, Jae Joon; Chung, Bub Dong

2005-01-01

The two-phase system analysis code MARS has been incorporated into Linux system. The MARS code was originally developed based on the RELAP5/MOD3.2 and COBRA-TF. The 1-D module which evolved from RELAP5 alone could be applied for the whole NSSS system analysis. The 3-D module developed based on the COBRA-TF, however, could be applied for the analysis of the reactor core region where 3-D phenomena would be better treated. The MARS code also has several other code units that could be incorporated for more detailed analysis. The separate code units include containment analysis modules and 3-D kinetics module. These code modules could be optionally invoked to be coupled with the main MARS code. The containment code modules (CONTAIN and CONTEMPT), for example, could be utilized for the analysis of the plant containment phenomena in a coupled manner with the nuclear reactor system. The mass and energy interaction during the hypothetical coolant leakage accident could, thereby, be analyzed in a more realistic manner. In a similar way, 3-D kinetics could be incorporated for simulating the three dimensional reactor kinetic behavior, instead of using the built-in point kinetics model. The MARS code system, developed initially for the MS Windows environment, however, would not be adequate enough for the PC cluster system where multiple CPUs are available. When parallelism is to be eventually incorporated into the MARS code, MS Windows environment is not considered as an optimum platform. Linux environment, on the other hand, is generally being adopted as a preferred platform for the multiple codes executions as well as for the parallel application. In this study, MARS code has been modified for the adaptation of Linux platform. For the initial code modification, the Windows system specific features have been removed from the code. Since the coupling code module CONTAIN is originally in a form of dynamic load library (DLL) in the Windows system, a similar adaptation method
Studies of relationships between variation of the human G protein-coupled receptor 40 Gene and Type 2 diabetes and insulin release

DEFF Research Database (Denmark)

Hamid, Y H; Vissing, H; Holst, B

2005-01-01

AIMS: Recently, a novel human G protein-coupled receptor 40 (GPR40), which is predominantly expressed in pancreatic islets, was shown to mediate an amplifying effect of long-chain fatty acids on glucose-induced insulin secretion. The present aim was to examine the coding region of GPR40 for varia... ...compared with the wild type (P = 0.01). The Arg211His polymorphism had a similar allele frequency among 1384 Type 2 diabetic patients [MAF%; 23.4 (95% CI: 21.8-25.0)] and 4424 middle-aged glucose-tolerant subjects [24.1% (23.2-25.0)]. A genotype-quantitative trait study of 5597 non-diabetic, middle...
Differential DNA methylation profiles of coding and non-coding genes define hippocampal sclerosis in human temporal lobe epilepsy

Science.gov (United States)

Miller-Delaney, Suzanne F.C.; Bryan, Kenneth; Das, Sudipto; McKiernan, Ross C.; Bray, Isabella M.; Reynolds, James P.; Gwinn, Ryder; Stallings, Raymond L.

2015-01-01

Temporal lobe epilepsy is associated with large-scale, wide-ranging changes in gene expression in the hippocampus. Epigenetic changes to DNA are attractive mechanisms to explain the sustained hyperexcitability of chronic epilepsy. Here, through methylation analysis of all annotated C-phosphate-G islands and promoter regions in the human genome, we report a pilot study of the methylation profiles of temporal lobe epilepsy with or without hippocampal sclerosis. Furthermore, by comparative analysis of expression and promoter methylation, we identify methylation-sensitive non-coding RNA in human temporal lobe epilepsy. A total of 146 protein-coding genes exhibited altered DNA methylation in temporal lobe epilepsy hippocampus (n = 9) when compared to control (n = 5), with 81.5% of the promoters of these genes displaying hypermethylation. Unique methylation profiles were evident in temporal lobe epilepsy with or without hippocampal sclerosis, in addition to a common methylation profile regardless of pathology grade. Gene ontology terms associated with development, neuron remodelling and neuron maturation were over-represented in the methylation profile of Watson Grade 1 samples (mild hippocampal sclerosis). In addition to genes associated with neuronal, neurotransmitter/synaptic transmission and cell death functions, differential hypermethylation of genes associated with transcriptional regulation was evident in temporal lobe epilepsy, but overall few genes previously associated with epilepsy were among the differentially methylated. Finally, a panel of 13 methylation-sensitive microRNAs was identified in temporal lobe epilepsy, including MIR27A, miR-193a-5p (MIR193A) and miR-876-3p (MIR876), and the differential methylation of long non-coding RNA was documented for the first time. The present study therefore reports select, genome-wide DNA methylation changes in human temporal lobe epilepsy that may contribute to the molecular architecture of the epileptic brain. PMID
Covalent protein modification with ISG15 via a conserved cysteine in the hinge region.

Directory of Open Access Journals (Sweden)

Veronika N Bade

Full Text Available The ubiquitin-like protein ISG15 (interferon-stimulated gene of 15 kDa) is strongly induced by type I interferons and displays antiviral activity. As other ubiquitin-like proteins (Ubls), ISG15 is post-translationally conjugated to substrate proteins by an isopeptide bond between the C-terminal glycine of ISG15 and the side chains of lysine residues in the substrates (ISGylation). ISG15 consists of two ubiquitin-like domains that are separated by a hinge region. In many orthologs, this region contains a single highly reactive cysteine residue. Several hundred potential substrates for ISGylation have been identified but only a few of them have been rigorously verified. In order to investigate the modification of several ISG15 substrates, we have purified ISG15 conjugates from cell extracts by metal-chelate affinity purification and immunoprecipitations. We found that the levels of proteins modified by human ISG15 can be decreased by the addition of reducing agents. With the help of thiol blocking reagents, a mutational analysis and miRNA mediated knock-down of ISG15 expression, we revealed that this modification occurs in living cells via a disulphide bridge between the substrates and Cys78 in the hinge region of ISG15. While the ISG15 activating enzyme UBE1L is conjugated by ISG15 in the classical way, we show that the ubiquitin conjugating enzyme Ubc13 can either be classically conjugated by ISG15 or can form a disulphide bridge with ISG15 at the active site cysteine 87. The latter modification would interfere with its function as ubiquitin conjugating enzyme. However, we found no evidence for an ISG15 modification of the dynamin-like GTPases MxA and hGBP1. These findings indicate that the analysis of potential substrates for ISG15 conjugation must be performed with great care to distinguish between the two types of modification since many assays such as immunoprecipitation or metal-chelate affinity purification are performed with little or no
3D-SURFER: software for high-throughput protein surface comparison and analysis.

Science.gov (United States)

La, David; Esquivel-Rodríguez, Juan; Venkatraman, Vishwesh; Li, Bin; Sael, Lee; Ueng, Stephen; Ahrendt, Steven; Kihara, Daisuke

2009-11-01

We present 3D-SURFER, a web-based tool designed to facilitate high-throughput comparison and characterization of proteins based on their surface shape. As each protein is effectively represented by a vector of 3D Zernike descriptors, comparison times for a query protein against the entire PDB take, on an average, only a couple of seconds. The web interface has been designed to be as interactive as possible with displays showing animated protein rotations, CATH codes and structural alignments using the CE program. In addition, geometrically interesting local features of the protein surface, such as pockets that often correspond to ligand binding sites as well as protrusions and flat regions can also be identified and visualized. 3D-SURFER is a web application that can be freely accessed from: http://dragon.bio.purdue.edu/3d-surfer. Contact: dkihara@purdue.edu. Supplementary data are available at Bioinformatics online.
Highly conserved non-coding sequences are associated with vertebrate development.

Directory of Open Access Journals (Sweden)

Adam Woolfe

2005-01-01

Full Text Available In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH), in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development
Complete cDNA sequence coding for human docking protein

Energy Technology Data Exchange (ETDEWEB)

Hortsch, M; Labeit, S; Meyer, D I

1988-01-11

Docking protein (DP, or SRP receptor) is a rough endoplasmic reticulum (ER)-associated protein essential for the targeting and translocation of nascent polypeptides across this membrane. It specifically interacts with a cytoplasmic ribonucleoprotein complex, the signal recognition particle (SRP). The nucleotide sequence of cDNA encoding the entire human DP and its deduced amino acid sequence are given.
An operon from Lactobacillus helveticus composed of a proline iminopeptidase gene (pepI) and two genes coding for putative members of the ABC transporter family of proteins.

Science.gov (United States)

Varmanen, P; Rantanen, T; Palva, A

1996-12-01

A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di- and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 degrees C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min-1 mg-1, respectively, and a very high turnover number of 135,000 s-1.
Intron-exon organization of the active human protein S gene PSα and its pseudogene PSβ: Duplication and silencing during primate evolution

Energy Technology Data Exchange (ETDEWEB)

Ploos van Amstel, H.; Reitsma, P.H.; van der Logt, C.P.; Bertina, R.M. (University Hospital, Leiden (Netherlands))

1990-08-28

The human protein S locus on chromosome 3 consists of two protein S genes, PSα and PSβ. Here the authors report the cloning and characterization of both genes. Fifteen exons of the PSα gene were identified that together code for protein S mRNA as derived from the reported protein S cDNAs. Analysis by primer extension of liver protein S mRNA, however, reveals the presence of two mRNA forms that differ in the length of their 5'-noncoding region. Both transcripts contain a 5'-noncoding region longer than found in the protein S cDNAs. The two products may arise from alternative splicing of an additional intron in this region or from the usage of two start sites for transcription. The intron-exon organization of the PSα gene fully supports the hypothesis that the protein S gene is the product of an evolutional assembling process in which gene modules coding for structural/functional protein units also found in other coagulation proteins have been put upstream of the ancestral gene of a steroid hormone binding protein. The PSβ gene is identified as a pseudogene. It contains a large variety of detrimental aberrations, viz., the absence of exon I, a splice site mutation, three stop codons, and a frame shift mutation. Overall the two genes PSα and PSβ show between their exonic sequences 96.5% homology. Southern analysis of primate DNA showed that the duplication of the ancestral protein S gene has occurred after the branching of the orangutan from the African apes. A nonsense mutation that is present in the pseudogene of man also could be identified in one of the two protein S genes of both chimpanzee and gorilla. This implicates that silencing of one of the two protein S genes must have taken place before the divergence of the three African apes.
The small RNA content of human sperm reveals pseudogene-derived piRNAs complementary to protein-coding genes

DEFF Research Database (Denmark)

Pantano, Lorena; Jodar, Meritxell; Bak, Mads

2015-01-01

-specific genes. The most abundant class of small noncoding RNAs in sperm are PIWI-interacting RNAs (piRNAs). Surprisingly, we found that human sperm cells contain piRNAs processed from pseudogenes. Clusters of piRNAs from human testes contain pseudogenes transcribed in the antisense strand and processed...... into small RNAs. Several human protein-coding genes contain antisense predicted targets of pseudogene-derived piRNAs in the male germline and these piRNAs are still found in mature sperm. Our study provides the most extensive data set and annotation of human sperm small RNAs to date and is a resource...... for further functional studies on the roles of sperm small RNAs. In addition, we propose that some of the pseudogene-derived human piRNAs may regulate expression of their parent gene in the male germline....
Functional interrogation of non-coding DNA through CRISPR genome editing.

Science.gov (United States)

Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

2017-05-15

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
Transforming p21 ras protein: flexibility in the major variable region linking the catalytic and membrane-anchoring domains

DEFF Research Database (Denmark)

Willumsen, B M; Papageorge, A G; Hubbert, N

1985-01-01

or increasing it to 50 amino acids has relatively little effect on the capacity of the gene to induce morphological transformation of NIH 3T3 cells. Assays of GTP binding, GTPase and autophosphorylating activities of such mutant v-rasH-encoded proteins synthesized in bacteria indicated that the sequences...... that is required for post-translational processing, membrane localization and transforming activity of the proteins. We have now used the viral oncogene (v-rasH) of Harvey sarcoma virus to study the major variable region by deleting or duplicating parts of the gene. Reducing this region to five amino acids...... that encode these biochemical activities are located upstream from the major variable region. In the context of transformation, we propose that the region of sequence heterogeneity serves principally to connect the N-terminal catalytic domain with amino acids at the C terminus that are required to anchor...
A Legionella pneumophila effector protein encoded in a region of genomic plasticity binds to Dot/Icm-modified vacuoles.

Directory of Open Access Journals (Sweden)

Shira Ninio

2009-01-01

Full Text Available Legionella pneumophila is an opportunistic pathogen that can cause a severe pneumonia called Legionnaires' disease. In the environment, L. pneumophila is found in fresh water reservoirs in a large spectrum of environmental conditions, where the bacteria are able to replicate within a variety of protozoan hosts. To survive within eukaryotic cells, L. pneumophila require a type IV secretion system, designated Dot/Icm, that delivers bacterial effector proteins into the host cell cytoplasm. In recent years, a number of Dot/Icm substrate proteins have been identified; however, the function of most of these proteins remains unknown, and it is unclear why the bacterium maintains such a large repertoire of effectors to promote its survival. Here we investigate a region of the L. pneumophila chromosome that displays a high degree of plasticity among four sequenced L. pneumophila strains. Analysis of GC content suggests that several genes encoded in this region were acquired through horizontal gene transfer. Protein translocation studies establish that this region of genomic plasticity encodes for multiple Dot/Icm effectors. Ectopic expression studies in mammalian cells indicate that one of these substrates, a protein called PieA, has unique effector activities. PieA is an effector that can alter lysosome morphology and associates specifically with vacuoles that support L. pneumophila replication. It was determined that the association of PieA with vacuoles containing L. pneumophila requires modifications to the vacuole mediated by other Dot/Icm effectors. Thus, the localization properties of PieA reveal that the Dot/Icm system has the ability to spatially and temporally control the association of an effector with vacuoles containing L. pneumophila through activities mediated by other effector proteins.
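The GC-content argument for horizontal gene transfer mentioned in this abstract can be illustrated with a sliding-window sketch: windows whose GC fraction departs noticeably from the sequence-wide value are flagged as compositionally atypical. The window size, step, and deviation threshold below are arbitrary assumptions, and the function names are hypothetical; the abstract does not describe how the authors performed their analysis.

```python
# Illustrative sketch only: flag windows whose GC fraction deviates from the
# whole-sequence GC fraction. Parameters are assumed defaults, not values
# reported in the study.

def gc_content(seq):
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def windowed_gc(seq, window=1000, step=500):
    """Return (start, GC fraction) for each window along the sequence."""
    last_start = max(len(seq) - window, 0)
    return [
        (start, gc_content(seq[start:start + window]))
        for start in range(0, last_start + 1, step)
    ]


def atypical_windows(seq, window=1000, step=500, threshold=0.05):
    """Windows whose GC fraction differs from the overall value by more than `threshold`."""
    overall = gc_content(seq)
    return [
        (start, gc)
        for start, gc in windowed_gc(seq, window, step)
        if abs(gc - overall) > threshold
    ]
```

A deviation in GC content is only suggestive of foreign origin; in practice it would be combined with other evidence, such as the strain-to-strain variability the abstract describes.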
Functional Diets Modulate lncRNA-Coding RNAs and Gene Interactions in the Intestine of Rainbow Trout Oncorhynchus mykiss.

Science.gov (United States)

Núñez-Acuña, Gustavo; Détrée, Camille; Gallardo-Escárate, Cristian; Gonçalves, Ana Teresa

2017-06-01

The advent of functional genomics has sparked interest in inferring the function of non-coding regions from the transcriptome in non-model species. However, numerous biological processes remain understudied from this perspective, including intestinal immunity in farmed fish. The aim of this study was to infer long non-coding RNA (lncRNA) expression profiles in rainbow trout (Oncorhynchus mykiss) fed for 30 days with functional diets based on pre- and probiotics. For this, whole transcriptome sequencing was conducted through Illumina technology, and lncRNAs were mined to evaluate transcriptional activity in conjunction with known protein sequences. To detect differentially expressed transcripts, 880 novel and 9067 previously described O. mykiss lncRNAs were used. Expression levels and genome co-localization correlations with coding genes were also analyzed. Significant differences in gene expression were primarily found in the probiotic diet, which had a twofold downregulation of lncRNAs compared to other treatments. Notable differences by diet were also evidenced between the coding genes of distinct metabolic processes. In contrast, genome co-localization of lncRNAs with coding genes was similar for all diets. This study contributes novel knowledge regarding lncRNAs in fish, suggesting key roles in salmonids fed with in-feed additives with the capacity to modulate intestinal homeostasis and host health.
Lateral mobility of plasma membrane proteins in dividing eggs of the loach (Misgurnus fossilis): Regional differences and changes during the cell cycle.

Science.gov (United States)

Bozhkova, V P; Budayova, M; Kvasnicka, P; Cigankova, N; Chorvat, D

1994-12-01

Regional differences in lateral diffusion rates of fluorescence-labeled proteins have been studied in the plasma membrane of dividing eggs of the loach (Misgurnus fossilis) by fluorescence recovery after photobleaching (FRAP). Apparent animal-vegetal differences in fluorescence intensity, lateral diffusion coefficients, and fractions of mobile proteins have been found, with all these quantities being higher in the animal pole region than in the yolk region. Cyclic changes in protein diffusion coefficients and mobile fractions during the first few cell cycles have also been recorded. Soon after the end of a cleavage, the diffusion coefficient reaches its minimal value and increases rapidly before the next cleavage.
XGC developments for a more efficient XGC-GENE code coupling

Science.gov (United States)

Dominski, Julien; Hager, Robert; Ku, Seung-Hoe; Chang, Cs

2017-10-01

In the Exascale Computing Program, the High-Fidelity Whole Device Modeling project initially aims at delivering a tightly-coupled simulation of plasma neoclassical and turbulence dynamics from the core to the edge of the tokamak. To permit such simulations, the gyrokinetic codes GENE and XGC will be coupled together. Numerical efforts are made to improve the agreement of the numerical schemes in the coupling region. One of the difficulties of coupling those codes together is the incompatibility of their grids. GENE is a continuum grid-based code and XGC is a Particle-In-Cell code using an unstructured triangular mesh. A field-aligned filter is thus implemented in XGC. Even though XGC originally had an approximately field-following mesh, this field-aligned filter makes it possible to have a perturbation discretization closer to the one solved in the field-aligned code GENE. Additionally, new XGC gyro-averaging matrices are implemented on a velocity grid adapted to the plasma properties, thus ensuring the same accuracy from the core to the edge regions.
Transmissible familial Creutzfeldt-Jakob disease associated with five, seven, and eight extra octapeptide coding repeats in the PRNP gene

Energy Technology Data Exchange (ETDEWEB)

Goldfarb, L.G.; Brown, P.; McCombie, W.R.; Gibbs, C.J. Jr.; Gajdusek, D.C. (National Inst. of Health, Bethesda, MD (United States)); Goldgaber, D. (State Univ. of New York, Stony Brook (United States)); Swergold, G.D. (National Inst. of Health, Bethesda, MD (United States)); Wills, P.R. (Univ. of Auckland (New Zealand)); Cervenakova, L. (Inst. of Preventive and Clinical Medicine, Bratislava (Czechoslovakia)); Baron, H. (Searle Pharmaceuticals, Paris (France))

1991-12-01

The PRNP gene, encoding the amyloid precursor protein that is centrally involved in Creutzfeldt-Jakob disease (CJD), has an unstable region of five variant tandem octapeptide coding repeats between codons 51 and 91. The authors screened a total of 535 individuals for the presence of extra repeats in this region, including patients with sporadic and familial forms of spongiform encephalopathy, members of their families, other neurological and non-neurological patients, and normal controls. They identified three CJD families (in each of which the proband's disease was neuropathologically confirmed and experimentally transmitted to primates) that were heterozygous for alleles with 10, 12, or 13 repeats, some of which had wobble nucleotide substitutions. They also found one individual with 9 repeats and no nucleotide substitutions who had no evidence of neurological disease. These observations, together with data on published British patients with 11 and 14 repeats, strongly suggest that the occurrence of 10 or more octapeptide repeats in the encoded amyloid precursor protein predisposes to CJD.

Highly selective BSA imprinted polyacrylamide hydrogels facilitated by a metal-coding MIP approach.

Science.gov (United States)

El-Sharif, H F; Yapati, H; Kalluru, S; Reddy, S M

2015-12-01

We report the fabrication of metal-coded molecularly imprinted polymers (MIPs) using hydrogel-based protein imprinting techniques. A Co(II) complex was prepared using (E)-2-((2 hydrazide-(4-vinylbenzyl)hydrazono)methyl)phenol; along with iron(III) chloroprotoporphyrin (Hemin), vinylferrocene (VFc), zinc(II) protoporphyrin (ZnPP) and protoporphyrin (PP), these complexes were introduced into the MIPs as co-monomers for metal-coding of non-metalloprotein imprints. Results indicate a 66% enhancement for bovine serum albumin (BSA) protein binding capacities (Q, mg/g) via metal-ion/ligand exchange properties within the metal-coded MIPs. Specifically, Co(II)-complex-based MIPs exhibited 92 ± 1% specific binding with Q values of 5.7 ± 0.45 mg BSA/g polymer and imprinting factors (IF) of 14.8 ± 1.9 (MIP/non-imprinted (NIP) control). The selectivity of our Co(II)-coded BSA MIPs was also tested using bovine haemoglobin (BHb), lysozyme (Lyz), and trypsin (Tryp). By evaluating imprinting factors (K), each of the latter proteins was found to have lower affinities in comparison to the cognate BSA template. The hydrogels were further characterised by thermal analysis and differential scanning calorimetry (DSC) to assess optimum polymer composition. The development of hydrogel-based molecularly imprinted polymer (HydroMIPs) technology for the memory imprinting of proteins and for protein biosensor development presents many possibilities, including uses in bio-sample clean-up or selective extraction, replacement of biological antibodies in immunoassays and biosensors for medicine and the environment. Biosensors for proteins and viruses are currently expensive to develop because they require the use of expensive antibodies. Because of their biomimicry capabilities (and their potential to act as synthetic antibodies), HydroMIPs potentially offer a route to the development of new low-cost biosensors. Herein, a metal ion-mediated imprinting approach was employed to metal-code our
RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location.

Science.gov (United States)

Serghini, M A; Fuchs, M; Pinck, M; Reinbolt, J; Walter, B; Pinck, L

1990-07-01

The nucleotide sequence of the genomic RNA2 (3774 nucleotides) of grapevine fanleaf virus strain F13 was determined from overlapping cDNA clones and its genetic organization was deduced. Two rapid and efficient methods were used for cDNA cloning of the 5' region of RNA2. The complete sequence contained only one long open reading frame of 3555 nucleotides (1184 codons, 131K product). The analysis of the N-terminal sequence of purified coat protein (CP) and identification of its C-terminal residue have allowed the CP cistron to be precisely positioned within the polyprotein. The CP produced by proteolytic cleavage at the Arg/Gly site between residues 680 and 681 contains 504 amino acids (Mr 56019) and has hydrophobic properties. The Arg/Gly cleavage site deduced by N-terminal amino acid sequence analysis is the first for a nepovirus coat protein and for plant viruses expressing their genomic RNAs by polyprotein synthesis. Comparison of GFLV RNA2 with M RNA of cowpea mosaic comovirus and with RNA2 of two closely related nepoviruses, tomato black ring virus and Hungarian grapevine chrome mosaic virus, showed strong similarities among the 3' non-coding regions but less similarity among the 5' end non-coding sequences than reported among other nepovirus RNAs.
Evidence for a novel coding sequence overlapping the 5'-terminal ~90 codons of the Gill-associated and Yellow head okavirus envelope glycoprotein gene

Directory of Open Access Journals (Sweden)

Atkins John F

2009-12-01

The genus Okavirus (order Nidovirales) includes a number of viruses that infect crustaceans, causing major losses in the shrimp industry. These viruses have a linear positive-sense ssRNA genome of ~26-27 kb, encoding a large replicase polyprotein that is expressed from the genomic RNA, and several additional proteins that are expressed from a nested set of 3'-coterminal subgenomic RNAs. In this brief report, we describe the bioinformatic discovery of a new, apparently coding, ORF that overlaps the 5' end of the envelope glycoprotein encoding sequence, ORF3, in the +2 reading frame. The new ORF has a strong coding signature and, in fact, is more conserved at the amino acid level than the overlapping region of ORF3. We propose that translation of the new ORF initiates at a conserved AUG codon separated by just 2 nt from the ORF3 AUG initiation codon, resulting in a novel 86 amino acid protein.
The PLAC1-homology region of the ZP domain is sufficient for protein polymerisation

Directory of Open Access Journals (Sweden)

Litscher Eveline S

2006-04-01

Background. Hundreds of extracellular proteins polymerise into filaments and matrices by using zona pellucida (ZP) domains. ZP domain proteins perform highly diverse functions, ranging from structural to receptorial, and mutations in their genes are responsible for a number of severe human diseases. Recently, PLAC1, Oosp1-3, Papillote and CG16798 proteins were identified that share sequence homology with the N-terminal half of the ZP domain (ZP-N), but not with its C-terminal half (ZP-C). The functional significance of this partial conservation is unknown. Results. By exploiting a highly engineered bacterial strain, we expressed in soluble form the PLAC1-homology region of mammalian sperm receptor ZP3 as a fusion to maltose binding protein. Mass spectrometry showed that the 4 conserved Cys residues within the ZP-N moiety of the fusion protein adopt the same disulfide bond connectivity as in full-length native ZP3, indicating that it is correctly folded, and electron microscopy and biochemical analyses revealed that it assembles into filaments. Conclusion. These findings provide a function for PLAC1-like proteins and, by showing that ZP-N is a biologically active folding unit, prompt a re-evaluation of the architecture of the ZP domain and its polymers. Furthermore, they suggest that ZP-C might play a regulatory role in the assembly of ZP domain protein complexes.
Non-coding, mRNA-like RNAs database Y2K.

Science.gov (United States)

Erdmann, V A; Szymanski, M; Hochberg, A; Groot, N; Barciszewski, J

2000-01-01

In the last few years, much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They lack protein-coding capacity and appear to work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www.man.poznan.pl/5SData/ncRNA/index.html
Molecular Evolution of the non-coding Eosinophil Granule Ontogeny Transcript EGOT

Directory of Open Access Journals (Sweden)

Dominic Rose

2011-10-01

Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny and analyse patterns of sequence conservation and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionarily conserved and thermodynamically stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element.
The Genomic Code: Genome Evolution and Potential Applications

KAUST Repository

Bernardi, Giorgio

2016-01-25

The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.
CMCpy: Genetic Code-Message Coevolution Models in Python

Science.gov (United States)

Becich, Peter J.; Stark, Brian P.; Bhat, Harish S.; Ardell, David H.

2013-01-01

Code-message coevolution (CMC) models represent coevolution of a genetic code and a population of protein-coding genes (“messages”). Formally, CMC models are sets of quasispecies coupled together for fitness through a shared genetic code. Although CMC models display plausible explanations for the origin of multiple genetic code traits by natural selection, useful modern implementations of CMC models are not currently available. To meet this need we present CMCpy, an object-oriented Python API and command-line executable front-end that can reproduce all published results of CMC models. CMCpy implements multiple solvers for leading eigenpairs of quasispecies models. We also present novel analytical results that extend and generalize applications of perturbation theory to quasispecies models and pioneer the application of a homotopy method for quasispecies with non-unique maximally fit genotypes. Our results therefore facilitate the computational and analytical study of a variety of evolutionary systems. CMCpy is free open-source software available from http://pypi.python.org/pypi/CMCpy/. PMID:23532367
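The phrase "solvers for leading eigenpairs of quasispecies models" in the abstract above can be made concrete with a small sketch. This does not use or reproduce the CMCpy API. The function names, the two-genotype example, and the choice of power iteration are assumptions made for illustration. It assumes a matrix W with entries W[i][j] = Q[i][j] * f[j], where Q[i][j] is the probability that genotype j yields genotype i on replication and f[j] is the fitness of genotype j, with all entries positive.

```python
def quasispecies_matrix(mutation, fitness):
    """Build W[i][j] = mutation[i][j] * fitness[j] (hypothetical helper)."""
    n = len(fitness)
    return [[mutation[i][j] * fitness[j] for j in range(n)] for i in range(n)]


def leading_eigenpair(matrix, tol=1e-12, max_iter=10000):
    """Power iteration for a matrix with positive entries.

    Returns (eigenvalue estimate, eigenvector normalised to sum to 1).
    If max_iter is reached first, the latest estimate is returned.
    """
    n = len(matrix)
    vec = [1.0 / n] * n
    value = 0.0
    for _ in range(max_iter):
        product = [sum(row[j] * vec[j] for j in range(n)) for row in matrix]
        # vec sums to 1, so the sum of W @ vec estimates the leading eigenvalue.
        value = sum(product)
        new_vec = [x / value for x in product]
        converged = max(abs(a - b) for a, b in zip(new_vec, vec)) < tol
        vec = new_vec
        if converged:
            break
    return value, vec


if __name__ == "__main__":
    # Made-up example: two genotypes, symmetric mutation rate 0.1.
    mutation = [[0.9, 0.1], [0.1, 0.9]]
    fitness = [2.0, 1.0]
    mean_fitness, distribution = leading_eigenpair(quasispecies_matrix(mutation, fitness))
    print(mean_fitness, distribution)
```

In this reading the leading eigenvalue is the equilibrium mean fitness and the normalised eigenvector is the equilibrium genotype distribution. Power iteration is only one possible solver; it converges slowly when the two largest eigenvalues are close.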
Identification of Common Epitopes on a Conserved Region of NSs Proteins Among Tospoviruses of Watermelon silver mottle virus Serogroup.

Science.gov (United States)

Chen, Tsung-Chi; Huang, Ching-Wen; Kuo, Yan-Wen; Liu, Fang-Lin; Yuan, Chao-Hsiu Hsuan; Hsu, Hei-Ti; Yeh, Shyi-Dong

2006-12-01

The NSs protein of Watermelon silver mottle virus (WSMoV) was expressed by a Zucchini yellow mosaic virus (ZYMV) vector in squash. The expressed NSs protein with a histidine tag and an additional NIa protease cleavage sequence was isolated by Ni(2+)-NTA resins as a free-form protein and further eluted after sodium dodecyl sulfate-polyacrylamide gel electrophoresis for production of rabbit antiserum and mouse monoclonal antibodies (MAbs). The rabbit antiserum strongly reacted with the NSs crude antigen of WSMoV and weakly reacted with that of a high-temperature-recovered gloxinia isolate (HT-1) of Capsicum chlorosis virus (CaCV), but not with that of Calla lily chlorotic spot virus (CCSV). In contrast, the MAbs reacted strongly with all crude NSs antigens of WSMoV, CaCV, and CCSV. Various deletions of the NSs open reading frame were constructed and expressed by ZYMV vector. Results indicate that all three MAbs target the 89- to 125-amino-acid (aa) region of WSMoV NSs protein. Two indispensable residues of cysteine and lysine were essential for MAbs recognition. Sequence comparison of the deduced MAbs-recognized region with the reported tospoviral NSs proteins revealed the presence of a consensus sequence VRKPGVKNTGCKFTMHNQIFNPN (denoted WNSscon), at the 98- to 120-aa position of NSs proteins, sharing 86 to 100% identities among those of WSMoV, CaCV, CCSV, and Peanut bud necrosis virus. A synthetic WNSscon peptide reacted with the MAbs and verified that the epitopes are present in the 98- to 120-aa region of WSMoV NSs protein. The WSMoV sero-group-specific NSs MAbs provide a means for reliable identification of tospoviruses in this large serogroup.
Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

Directory of Open Access Journals (Sweden)

Guo Xiang

2008-12-01

Background. Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results. Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know
The hypervariable region of Streptococcus pyogenes M protein escapes antibody attack by antigenic variation and weak immunogenicity

DEFF Research Database (Denmark)

Lannergård, Jonas; Gustafsson, Caj Ulrik Mattias; Waldemarsson, Johan

2011-01-01

Sequence variation of antigenic proteins allows pathogens to evade antibody attack. The variable protein commonly includes a hypervariable region (HVR), which represents a key target for antibodies and is therefore predicted to be immunodominant. To understand the mechanism(s) of antibody evasion...
URR [Unresolved Resonance Region] computer code: A code to calculate resonance neutron cross-section probability tables, Bondarenko self-shielding factors, and self-indication ratios for fissile and fertile nuclides

International Nuclear Information System (INIS)

Leal, L.C.; de Saussure, G.; Perez, R.B.

1989-01-01

The URR computer code has been developed to calculate cross-section probability tables, Bondarenko self-shielding factors, and self-indication ratios for fertile and fissile isotopes in the unresolved resonance region. Monte Carlo methods are utilized to select appropriate resonance parameters and to compute the cross sections at the desired reference energy. The neutron cross sections are calculated by the single-level Breit-Wigner formalism with s-, p-, and d-wave contributions. The cross-section probability tables are constructed by sampling the Doppler-broadened cross section. The various self-shielding factors are computed numerically as Lebesgue integrals over the cross-section probability tables. 6 refs
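As a rough illustration of how a self-shielding factor can be obtained from a probability table, the sketch below uses the common textbook form of Bondarenko weighting with a background (dilution) cross section sigma0. It is not taken from the URR code. The table layout, the function names, and the numbers are hypothetical. Each band of the table carries a probability p, a total cross section, and a partial cross section for the reaction of interest, all in barns.

```python
def infinite_dilution(table, reaction):
    """Probability-weighted mean of the partial cross section (the sigma0 -> infinity limit)."""
    return sum(band["p"] * band[reaction] for band in table)


def self_shielded(table, reaction, sigma0):
    """Flux-weighted cross section with weight 1 / (sigma_total + sigma0) in each band."""
    numerator = sum(b["p"] * b[reaction] / (b["total"] + sigma0) for b in table)
    denominator = sum(b["p"] / (b["total"] + sigma0) for b in table)
    return numerator / denominator


def bondarenko_factor(table, reaction, sigma0):
    """Ratio of the self-shielded cross section to its infinite-dilution value."""
    return self_shielded(table, reaction, sigma0) / infinite_dilution(table, reaction)


if __name__ == "__main__":
    # Made-up two-band table; probabilities sum to 1.
    table = [
        {"p": 0.5, "total": 10.0, "capture": 1.0},
        {"p": 0.5, "total": 100.0, "capture": 10.0},
    ]
    for sigma0 in (1.0, 10.0, 1000.0, 1e6):
        print(sigma0, bondarenko_factor(table, "capture", sigma0))
```

The discrete sum over bands plays the role of the integral over the probability table. As sigma0 grows the weighting becomes uniform and the factor tends to 1, which is a quick sanity check on any implementation.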
Flexibility of the genetic code with respect to DNA structure

DEFF Research Database (Denmark)

Baisnée, P. F.; Baldi, Pierre; Brunak, Søren

2001-01-01

Motivation. The primary function of DNA is to carry genetic information through the genetic code. DNA, however, contains a variety of other signals related, for instance, to reading frame, codon bias, pairwise codon bias, splice sites and transcription regulation, nucleosome positioning and DNA...... structure. Here we study the relationship between the genetic code and DNA structure and address two questions. First, to which degree does the degeneracy of the genetic code and the acceptable amino acid substitution patterns allow for the superimposition of DNA structural signals to protein coding...... sequences? Second, is the origin or evolution of the genetic code likely to have been constrained by DNA structure? Results. We develop an index for code flexibility with respect to DNA structure. Using five different di- or tri-nucleotide models of sequence-dependent DNA structure, we show...
Intergenic disease-associated regions are abundant in novel transcripts.

Science.gov (United States)

Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E

2017-12-28

Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.
Targets of DNA-binding proteins in bacterial promoter regions present enhanced probabilities for spontaneous thermal openings

International Nuclear Information System (INIS)

Apostolaki, Angeliki; Kalosakas, George

2011-01-01

We mapped promoter regions of double-stranded DNA with respect to the probabilities of appearance of relatively large bubble openings exclusively due to thermal fluctuations at physiological temperatures. We analyzed five well-studied promoter regions of procaryotic type and found a spatial correlation between the binding sites of transcription factors and the position of peaks in the probability pattern of large thermal openings. Other distinct peaks of the calculated patterns correlate with potential binding sites of DNA-binding proteins. These results suggest that a DNA molecule more frequently exposes the bases that participate in contacts with proteins, which would probably increase the probability that the proteins reach their targets. They also support using this method as a means to analyze DNA sequences based on their intrinsic thermal properties.
Reliability of cause of death coding: an international comparison.

Science.gov (United States)

Antini, Carmen; Rajs, Danuta; Muñoz-Quezada, María Teresa; Mondaca, Boris Andrés Lucero; Heiss, Gerardo

2015-07-01

This study evaluates the agreement of nosologic coding of cardiovascular causes of death between a Chilean coder and one in the United States, in a stratified random sample of death certificates of persons aged ≥ 60, issued in 2008 in the Valparaíso and Metropolitan regions, Chile. All causes of death were converted to ICD-10 codes in parallel by both coders. Concordance was analyzed with inter-coder agreement and Cohen's kappa coefficient by level of specification of the ICD-10 code for the underlying cause and for the total causes of death coding. Inter-coder agreement was 76.4% for all causes of death and 80.6% for the underlying cause (agreement at the four-digit level), with differences by the level of specification of the ICD-10 code, by line of the death certificate, and by number of causes of death per certificate. Cohen's kappa coefficient was 0.76 (95%CI: 0.68-0.84) for the underlying cause and 0.75 (95%CI: 0.74-0.77) for the total causes of death. In conclusion, causes of death coding and inter-coder agreement for cardiovascular diseases in two regions of Chile are comparable to an external benchmark and with reports from other countries.
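The two concordance measures named in the abstract above, raw inter-coder agreement and Cohen's kappa, can be sketched in a few lines of Python. The function name and the six-certificate example are made up for illustration and are not the study's data. Kappa is computed as (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e is the agreement expected by chance from each coder's marginal code frequencies.

```python
from collections import Counter


def cohens_kappa(codes_a, codes_b):
    """Return (observed agreement, Cohen's kappa) for two equal-length lists of codes."""
    if len(codes_a) != len(codes_b) or not codes_a:
        raise ValueError("both coders must rate the same non-empty set of items")
    n = len(codes_a)
    observed = sum(a == b for a, b in zip(codes_a, codes_b)) / n
    freq_a = Counter(codes_a)
    freq_b = Counter(codes_b)
    # Chance agreement: product of the two coders' marginal proportions, summed over codes.
    expected = sum(freq_a[code] * freq_b[code] for code in freq_a) / (n * n)
    if expected == 1.0:
        # Both coders used one identical code throughout; kappa is undefined, report 1.0.
        return observed, 1.0
    return observed, (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Made-up underlying-cause codes assigned by two coders to six certificates.
    coder_a = ["I21", "I21", "I50", "I64", "I64", "I50"]
    coder_b = ["I21", "I25", "I50", "I64", "I64", "I64"]
    agreement, kappa = cohens_kappa(coder_a, coder_b)
    print(f"agreement={agreement:.3f} kappa={kappa:.3f}")
```

Truncating each code to its first three characters before calling the function would give agreement at a coarser level of specification, which is one way to read the abstract's comparison across levels.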
Role of horizontal gene transfer as a control on the coevolution of ribosomal proteins and the genetic code

Energy Technology Data Exchange (ETDEWEB)

Woese, Carl R.; Goldenfeld, Nigel; Luthey-Schulten, Zaida

2011-03-31

Our main goal is to develop the conceptual and computational tools necessary to understand the evolution of the universal processes of translation and replication and to identify events of horizontal gene transfer that occurred within the components. We will attempt to uncover the major evolutionary transitions that accompanied the development of protein synthesis by the ribosome and associated components of the translation apparatus. Our project goes beyond standard genomic approaches to explore homologs that are represented at both the structure and sequence level. Accordingly, use of structural phylogenetic analysis allows us to probe further back into deep evolutionary time than competing approaches, permitting greater resolution of primitive folds and structures. Specifically, our work focuses on the elements of translation, ranging from the emergence of the canonical genetic code to the evolution of specific protein folds, mediated by the predominance of horizontal gene transfer in early life. A unique element of this study is the explicit accounting for the impact of phenotype selection on translation, through a coevolutionary control mechanism. Our work contributes to DOE mission objectives through: (1) sophisticated computer simulation of protein dynamics and evolution, and the further refinement of techniques for structural phylogeny, which complement sequence information, leading to improved annotation of genomic databases; (2) development of evolutionary approaches to exploring cellular function and machinery in an integrated way; and (3) documentation of the phenotype interaction with translation over evolutionary time, reflecting the system response to changing selection pressures through horizontal gene transfer.
Extreme heterogeneity of polyadenylation sites in mRNAs encoding chloroplast RNA-binding proteins in Nicotiana plumbaginifolia.

Science.gov (United States)

Klahre, U; Hemmings-Mieszczak, M; Filipowicz, W

1995-06-01

We have previously characterized nuclear cDNA clones encoding two RNA binding proteins, CP-RBP30 and CP-RBP31, which are targeted to chloroplasts in Nicotiana plumbaginifolia. In this report we describe the analysis of the 3'-untranslated regions (3'-UTRs) in 22 CP-RBP30 and 8 CP-RBP31 clones which reveals that mRNAs encoding both proteins have a very complex polyadenylation pattern. Fourteen distinct poly(A) sites were identified among CP-RBP30 clones and four sites among the CP-RBP31 clones. The authenticity of the sites was confirmed by RNase A/T1 mapping of N. plumbaginifolia RNA. CP-RBP30 provides an extreme example of the heterogeneity known to be a feature of mRNA polyadenylation in higher plants. Using PCR we have demonstrated that CP-RBP genes in N. plumbaginifolia and N. sylvestris, in addition to the previously described introns interrupting the coding region, contain an intron located in the 3' non-coding part of the gene. In the case of CP-RBP31, we have identified one polyadenylation event occurring in this intron.
Extensive in silico analysis of Mimivirus coded Rab GTPase homolog suggests a possible role in virion membrane biogenesis

Directory of Open Access Journals (Sweden)

Amrutraj eZade

2015-09-01

Rab GTPases are the key regulators of intracellular membrane trafficking in eukaryotes. Many viruses and intracellular bacterial pathogens have evolved to hijack the host Rab GTPase functions, mainly through activators and effector proteins, for their benefit. Acanthamoeba polyphaga mimivirus (APMV) is one of the largest viruses and belongs to the monophyletic clade of nucleo-cytoplasmic large DNA viruses (NCLDV). The inner membrane lining is integral to the APMV virion structure. APMV assembly involves extensive host membrane modifications, like vesicle budding and fusion, leading to the formation of a membrane sheet that is incorporated into the virion. Intriguingly, APMV and all group I members of the Mimiviridae family code for a putative Rab GTPase protein. APMV is the first reported virus to code for a Rab GTPase (encoded by the R214 gene). Our thorough in silico analysis of the subfamily specific (SF) region of Mimiviridae Rab GTPase sequences suggests that they are related to Rab5, a member of the group II Rab GTPases, of lower eukaryotes. Because of their high divergence from the existing three isoforms, A, B and C of the Rab5-family, we suggest that Mimiviridae Rabs constitute a new isoform, Rab5D. Phylogenetic analysis indicated probable horizontal acquisition from a lower eukaryotic ancestor followed by selection and divergence. Furthermore, interaction network analysis suggests that vps34 (a Class III PI3K homolog, coded by APMV L615), Atg-8 and dynamin (host proteins) are recruited by APMV Rab GTPase during capsid assembly. Based on these observations, we hypothesize that APMV Rab plays a role in the acquisition of inner membrane during virion assembly.
Measurements in Regions of Shock Wave/Turbulent Boundary Layer Interaction from Mach 3 to 10 for Open and Blind Code Evaluation/Validation

Science.gov (United States)

2013-03-01

"Blind" Code Evaluation/Validation. Michael S. Holden, Timothy P. Wadhams, Matthew G. MacLean, Aaron Dufrene. CUBRC, Inc. March 2013. Final... *Fellow, AIAA, Vice President-Hypersonics, CUBRC, 4455 Genesee Street, Buffalo, NY 14225. **Member, AIAA, Project Engineers... CUBRC, 4455 Genesee Street, Buffalo, NY 14225. This work was supported by AFOSR Grant No. FA9550-11-1-0290. MEASUREMENTS IN REGIONS OF SHOCK WAVE

Transcription Factors Bind Thousands of Active and Inactive Regions in the Drosophila Blastoderm

Energy Technology Data Exchange (ETDEWEB)

Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard; Nix, David; Pollard, Daniel A.; Iyer, Venky N.; Hechmer, Aaron; Simirenko, Lisa; Stapleton, Mark; Luengo Hendriks, Cris L.; Chu, Hou Cheng; Ogawa, Nobuo; Inwood, William; Sementchenko, Victor; Beaton, Amy; Weiszmann, Richard; Celniker, Susan E.; Knowles, David W.; Gingeras, Tom; Speed, Terence P.; Eisen, Michael B.; Biggin, Mark D.

2008-01-10

Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early
A deeper look into transcription regulatory code by preferred pair distance templates for transcription factor binding sites

KAUST Repository

Kulakovskiy, Ivan V.

2011-08-18

Motivation: Modern experimental methods provide substantial information on protein-DNA recognition. Studying arrangements of transcription factor binding sites (TFBSs) of interacting transcription factors (TFs) advances understanding of the transcription regulatory code. Results: We constructed binding motifs for TFs forming a complex with HIF-1α at the erythropoietin 3'-enhancer. Corresponding TFBSs were predicted in the segments around transcription start sites (TSSs) of all human genes. Using the genome-wide set of regulatory regions, we observed several strongly preferred distances between the hypoxia-responsive element (HRE) and binding sites of a particular cofactor protein. The set of preferred distances was called a preferred pair distance template (PPDT). The PPDT depended dramatically on the TF and on the orientation of its binding sites relative to the HRE. The PPDT evaluated from the genome-wide set of regulatory sequences was used to detect significant PPDT-consistent binding site pairs in regulatory regions of hypoxia-responsive genes. We believe PPDT can help to reveal the layout of eukaryotic regulatory segments. © The Author 2011. Published by Oxford University Press. All rights reserved.
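The idea of a preferred pair distance template in the record above can be illustrated with a small counting sketch. This is not the authors' method: the function names, the input layout (lists of site positions per regulatory region) and the simple count threshold are all assumptions made for illustration, whereas the abstract refers to statistically "strongly preferred" distances without giving the test used.

```python
from collections import Counter


def pair_distance_counts(regions, max_dist=100):
    """Count signed distances between anchor sites and cofactor sites.

    `regions` is a hypothetical input: an iterable of
    (anchor_positions, cofactor_positions) pairs, one per regulatory
    region, with positions given as integer coordinates.
    """
    counts = Counter()
    for anchor_positions, cofactor_positions in regions:
        for a in anchor_positions:
            for c in cofactor_positions:
                d = c - a  # signed: negative means the cofactor site is upstream
                if d != 0 and abs(d) <= max_dist:
                    counts[d] += 1
    return counts


def preferred_distances(counts, min_count=3):
    """Return distances seen at least `min_count` times.

    The fixed threshold is an illustrative stand-in for a proper
    significance test against a background distribution.
    """
    return sorted(d for d, n in counts.items() if n >= min_count)


# Toy data: three regions, one anchor (HRE-like) site each.
toy_regions = [([100], [110, 150]), ([200], [210]), ([50], [60, 45])]
toy_counts = pair_distance_counts(toy_regions, max_dist=60)
print(preferred_distances(toy_counts, min_count=3))
```

In this toy input the distance +10 occurs in all three regions, so it is the only distance that passes the threshold. Site orientation, which the abstract says strongly affects the template, would need a separate count per strand combination.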
Performance Comparison of Containment PT analysis between CAP and CONTEMPT Code

Energy Technology Data Exchange (ETDEWEB)

Choo, Yeon Jun; Hong, Soon Joon; Hwang, Su Hyun; Kim, Min Ki; Lee, Byung Chul [FNC Tech., Seoul (Korea, Republic of)]; Ha, Sang Jun; Choi, Hoon [KHNP-CENTRAL RESEARCH INSTITUTE, Daejeon (Korea, Republic of)]

2013-10-15

CAP, in the form that is linked with SPACE, computes the containment back-pressure during a loss-of-coolant accident (LOCA). In the previous safety analysis report (SAR) of Shin-Kori Units 3 and 4, the CONTEMPT series of codes (hereafter referred to as just 'CONTEMPT') was used to evaluate containment safety during the postulated LOCA. In more detail, CONTEMPT-LT/028 was used to calculate the containment maximum PT, while CONTEMPT4/MOD5 was used to calculate the minimum PT. In the minimum PT analysis, CONTEMPT4/MOD5, which provides the back-pressure condition of the containment, was linked with RELAP5/MOD3.3, which calculates the amount of blowdown into the containment. In this analysis, CONTEMPT4/MOD5 was modified based on KREM. The CONTEMPT code was developed to predict the long-term behavior of water-cooled nuclear reactor containment systems subjected to LOCA conditions. It calculates the time variation of compartment pressures, temperatures, mass and energy inventories, heat structure temperature distributions, and energy exchange with adjacent compartments, and the effect of leakage on containment response. Models are provided for fan coolers and cooling sprays as engineered safety systems. Any compartment may have both a liquid pool region and an air-vapor atmosphere region above the pool. Each region is assumed to have a uniform temperature, but the temperatures of the two regions may be different. As mentioned above, CONTEMPT has code features similar to those of CAP and is therefore expected to show similar analysis performance. In this study, the differences between CAP and the two CONTEMPT code versions (CONTEMPT-LT/028 for maximum PT and CONTEMPT4/MOD5 for minimum PT) are identified in detail, and the code performances are compared for the same problem. A code-by-code comparison was carried out to identify the differences in LOCA analysis between the CONTEMPT series and the CAP code. With regard to important factors that affect the transient behavior of compartment thermodynamic
Performance Comparison of Containment PT analysis between CAP and CONTEMPT Code

International Nuclear Information System (INIS)

Choo, Yeon Jun; Hong, Soon Joon; Hwang, Su Hyun; Kim, Min Ki; Lee, Byung Chul; Ha, Sang Jun; Choi, Hoon

2013-01-01

CAP, in the form that is linked with SPACE, computes the containment back-pressure during a loss-of-coolant accident (LOCA). In the previous safety analysis report (SAR) of Shin-Kori Units 3 and 4, the CONTEMPT series of codes (hereafter referred to as just 'CONTEMPT') was used to evaluate containment safety during the postulated LOCA. In more detail, CONTEMPT-LT/028 was used to calculate the containment maximum PT, while CONTEMPT4/MOD5 was used to calculate the minimum PT. In the minimum PT analysis, CONTEMPT4/MOD5, which provides the back-pressure condition of the containment, was linked with RELAP5/MOD3.3, which calculates the amount of blowdown into the containment. In this analysis, CONTEMPT4/MOD5 was modified based on KREM. The CONTEMPT code was developed to predict the long-term behavior of water-cooled nuclear reactor containment systems subjected to LOCA conditions. It calculates the time variation of compartment pressures, temperatures, mass and energy inventories, heat structure temperature distributions, and energy exchange with adjacent compartments, and the effect of leakage on containment response. Models are provided for fan coolers and cooling sprays as engineered safety systems. Any compartment may have both a liquid pool region and an air-vapor atmosphere region above the pool. Each region is assumed to have a uniform temperature, but the temperatures of the two regions may be different. As mentioned above, CONTEMPT has code features similar to those of CAP and is therefore expected to show similar analysis performance. In this study, the differences between CAP and the two CONTEMPT code versions (CONTEMPT-LT/028 for maximum PT and CONTEMPT4/MOD5 for minimum PT) are identified in detail, and the code performances are compared for the same problem. A code-by-code comparison was carried out to identify the differences in LOCA analysis between the CONTEMPT series and the CAP code. With regard to important factors that affect the transient behavior of compartment thermodynamic state in
Estradiol-Induced Transcriptional Regulation of Long Non-Coding RNA, HOTAIR.

Science.gov (United States)

Bhan, Arunoday; Mandal, Subhrangsu S

2016-01-01

HOTAIR (HOX antisense intergenic RNA) is a 2.2 kb long non-coding RNA (lncRNA), transcribed from the antisense strand of homeobox C (HOXC) gene locus in chromosome 12. HOTAIR acts as a scaffolding lncRNA. It interacts and guides various chromatin-modifying complexes such as PRC2 (polycomb-repressive complex 2) and LSD1 (lysine-specific demethylase 1) to the target gene promoters leading to their gene silencing. Various studies have demonstrated that HOTAIR overexpression is associated with breast cancer. Recent studies from our laboratory demonstrate that HOTAIR is required for viability of breast cancer cells and is transcriptionally regulated by estradiol (E2) in vitro and in vivo. This chapter describes protocols for analysis of the HOTAIR promoter, cloning, transfection and dual luciferase assays, knockdown of protein synthesis by antisense oligonucleotides, and chromatin immunoprecipitation (ChIP) assay. These protocols are useful for studying the estrogen-mediated transcriptional regulation of lncRNA HOTAIR, as well as other protein coding genes and non-coding RNAs.
HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

Science.gov (United States)

Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

2017-11-01

The HLA-E gene is characterized by low but wide expression in different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocyte receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and on the receptors with which HLA-E interacts. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. Using this approach, we investigated the variability of 420 samples from a highly admixed population, Brazilians. Considering a segment of about 7 kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which correspond to about 90% of all. In addition, unlike what has been previously observed for other non-classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward, because the same promoter and 3'UTR haplotypes were often associated with different HLA-E coding haplotypes. These data reinforce the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, these data indicate that the distal HLA-E promoter is by
Evidence for widespread degradation of gene control regions in hominid genomes.

Directory of Open Access Journals (Sweden)

Peter D Keightley

2005-02-01

Full Text Available Although sequences containing regulatory elements located close to protein-coding genes are often only weakly conserved during evolution, comparisons of rodent genomes have implied that these sequences are subject to some selective constraints. Evolutionary conservation is particularly apparent upstream of coding sequences and in first introns, regions that are enriched for regulatory elements. By comparing the human and chimpanzee genomes, we show here that there is almost no evidence for conservation in these regions in hominids. Furthermore, we show that gene expression is diverging more rapidly in hominids than in murids per unit of neutral sequence divergence. By combining data on polymorphism levels in human noncoding DNA and the corresponding human-chimpanzee divergence, we show that the proportion of adaptive substitutions in these regions in hominids is very low. It therefore seems likely that the lack of conservation and increased rate of gene expression divergence are caused by a reduction in the effectiveness of natural selection against deleterious mutations because of the low effective population sizes of hominids. This has resulted in the accumulation of a large number of deleterious mutations in sequences containing gene control elements and hence a widespread degradation of the genome during the evolution of humans and chimpanzees.
Identifying Floppy and Rigid Regions in Proteins

Science.gov (United States)

Jacobs, D. J.; Thorpe, M. F.; Kuhn, L. A.

1998-03-01

In proteins it is possible to separate hard covalent forces involving bond lengths and bond angles from other weak forces. We model the microstructure of the protein as a generic bar-joint truss framework, where the hard covalent forces and strong hydrogen bonds are regarded as rigid bar constraints. We study the mechanical stability of proteins using FIRST (Floppy Inclusions and Rigid Substructure Topography), based on a recently developed combinatorial constraint counting algorithm (the 3D Pebble Game), which is a generalization of the 2D pebble game (D. J. Jacobs and M. F. Thorpe, "Generic Rigidity: The Pebble Game", Phys. Rev. Lett. 75, 4051-4054 (1995)) for the special class of bond-bending networks (D. J. Jacobs, "Generic Rigidity in Three Dimensional Bond-bending Networks", Preprint Aug (1997)). This approach is useful in identifying rigid motifs and flexible linkages in proteins, and thereby determines the essential degrees of freedom. We will show some preliminary results from the FIRST analysis on the myohemerythrin and lysozyme proteins.
Lossy to lossless object-based coding of 3-D MRI data.

Science.gov (United States)

Menegaz, Gloria; Thiran, Jean-Philippe

2002-01-01

We propose a fully three-dimensional (3-D) object-based coding system exploiting the diagnostic relevance of the different regions of the volumetric data for rate allocation. The data are first decorrelated via a 3-D discrete wavelet transform. The implementation via the lifting steps scheme allows integer-to-integer mapping, enabling lossless coding, and facilitates the definition of the object-based inverse transform. The coding process assigns disjoint segments of the bitstream to the different objects, which can be independently accessed and reconstructed at any quality up to lossless. Two fully 3-D coding strategies are considered: embedded zerotree coding (EZW-3D) and multidimensional layered zero coding (MLZC), both generalized for region of interest (ROI)-based processing. In order to avoid artifacts along region boundaries, some extra coefficients must be encoded for each object. This gives rise to an overhead in the bitstream with respect to the case where the volume is encoded as a whole. The amount of such extra information depends on both the filter length and the decomposition depth. The system is characterized on a set of head magnetic resonance images. Results show that MLZC and EZW-3D have competitive performances. In particular, the best MLZC mode outperforms the other state-of-the-art techniques on one of the datasets for which results are available in the literature.
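The integer-to-integer property of lifting mentioned in the record above can be seen in a minimal one-dimensional sketch. The filter used here is the integer Haar transform (often called the S-transform), chosen only because it is the shortest lifting example; the abstract does not say which wavelet filter the authors used, and the function names are hypothetical. A 3-D transform would apply such a step along each axis in turn.

```python
def lift_forward(x):
    """One level of the integer Haar (S) transform via lifting.

    Takes a list of integers of even length and returns
    (approx, detail), both lists of integers.
    """
    if len(x) % 2:
        raise ValueError("input length must be even")
    even, odd = x[0::2], x[1::2]
    # Predict step: detail is the odd sample minus its even neighbour.
    detail = [o - e for e, o in zip(even, odd)]
    # Update step: approx is the even sample plus floor(detail / 2),
    # which equals the floored mean of the pair.
    approx = [e + (d >> 1) for e, d in zip(even, detail)]
    return approx, detail


def lift_inverse(approx, detail):
    """Undo lift_forward exactly by reversing the steps in opposite order."""
    even = [s - (d >> 1) for s, d in zip(approx, detail)]
    odd = [d + e for d, e in zip(detail, even)]
    x = [0] * (2 * len(even))
    x[0::2] = even
    x[1::2] = odd
    return x


samples = [5, 7, 3, 0, 10, 10, -4, 9]
approx, detail = lift_forward(samples)
assert lift_inverse(approx, detail) == samples  # lossless round trip
```

Because each lifting step adds or subtracts a rounded function of the other channel, the inverse can subtract or add exactly the same rounded value, so the rounding never loses information. That is what makes lossless reconstruction possible with integer coefficients.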
Implementation of the SAMPO computer code in the Cyber 170-750

International Nuclear Information System (INIS)

Chagas, E.F.; Liguori Neto, R.; Gomes, P.R.S.

1985-01-01

The code SAMPO, in this available version, incorporates algorithms that determine energy, efficiency and peak shape. The code also includes processing subroutines that provide automatic peak searches reporting all peak characteristics. The handling of the code has been improved and its analysing capacity in each region of the spectrum has been extended. Practical information regarding the use of the code is enclosed. Tests made guarantee the good performance of the code SAMPO in the Cyber system-IEAv. (Author) [pt]
From concatenated codes to graph codes

DEFF Research Database (Denmark)

Justesen, Jørn; Høholdt, Tom

2004-01-01

We consider codes based on simple bipartite expander graphs. These codes may be seen as the first step leading from product type concatenated codes to more complex graph codes. We emphasize constructions of specific codes of realistic lengths, and study the details of decoding by message passing...
mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences.

Science.gov (United States)

Links, Matthew G; Chaban, Bonnie; Hemmingsen, Sean M; Muirhead, Kevin; Hill, Janet E

2013-08-15

Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly.
A Novel Polymorphism of VLDLR Signal Peptide Coding Region and Its Association with Growth and Abdominal Fat Traits of Gaoyou Domestic Ducks

Directory of Open Access Journals (Sweden)

C Ming-liang

Full Text Available ABSTRACT The VLDLR gene plays important roles in growth and adiposity in humans and mice. The purpose of this study was to investigate the relationship between VLDLR gene polymorphisms and growth and abdominal fat traits of the Gaoyou domestic duck. A total of 267 Gaoyou ducks were tested. An 18-bp deletion was identified in the VLDLR signal peptide coding region. The results of the χ² test suggested that the genotype frequencies of the VLDLR signal peptide coding region were not in Hardy-Weinberg equilibrium. Least squares analysis showed that body weight (BW) of -18bp/-18bp genotype ducks was significantly higher than those of other genotypes from six (BW6) (p [...] 0.05 and body weight for AFP and different genotypes had a significant effect on AFP (p<0.05). The results of the Bonferroni t-test revealed that the abdominal fat percentage (AFP) of the -18bp/-18bp genotype was significantly lower than that of +18bp/-18bp (p<0.05). Preliminary studies have shown that VLDLR may be a candidate gene for the selection for growth and abdominal fat, and the results of the present study indicate that VLDLR strongly influences carcass abdominal fat content of Gaoyou ducks.
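The Hardy-Weinberg check mentioned in the record above is usually a one-degree-of-freedom chi-square test on the three genotype counts of a biallelic locus. The sketch below shows that standard calculation. The function name is hypothetical and the genotype counts are invented for illustration, since the abstract does not report the observed counts.

```python
def hardy_weinberg_chi2(n_aa, n_ab, n_bb):
    """Chi-square statistic for Hardy-Weinberg equilibrium at a biallelic locus.

    Arguments are observed counts of the two homozygotes and the
    heterozygote. Returns (chi2, expected_counts).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes observed")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    return chi2, expected


# Invented counts, NOT the duck data from the study.
chi2, expected = hardy_weinberg_chi2(50, 0, 50)
# With one degree of freedom, chi2 > 3.841 rejects equilibrium at the 5% level.
print(chi2 > 3.841)
```

For a 267-animal sample such as the one described, the same function would be called with the observed counts of the +18bp/+18bp, +18bp/-18bp and -18bp/-18bp genotypes.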
Long Non-Coding RNAs in Metabolic Organs and Energy Homeostasis

Directory of Open Access Journals (Sweden)

Maude Giroud

2017-11-01

Full Text Available Surprisingly, single-cell organisms can have more protein-coding genes than humans, so the number of protein-coding genes is not at the origin of the complexity of an organism. In contrast, the relative amount of non-protein-coding sequences increases consistently with organismal complexity. Moreover, the mammalian transcriptome predominantly comprises non-(protein)-coding RNAs (ncRNAs), of which the long ncRNAs (lncRNAs) constitute the most abundant part. lncRNAs are highly species- and tissue-specific with very versatile modes of action in accordance with their binding to a large spectrum of molecules and their diverse localization. lncRNAs are transcriptional regulators adding an additional regulatory layer in biological processes and pathophysiological conditions. Here, we review lncRNAs affecting metabolic organs with a focus on the liver, pancreas, skeletal muscle, cardiac muscle, brain, and adipose organ. In addition, we will discuss the impact of lncRNAs on metabolic diseases such as obesity and diabetes. In contrast to the substantial number of lncRNA loci in the human genome, the functionally characterized lncRNAs are just the tip of the iceberg. So far, our knowledge concerning lncRNAs in energy homeostasis is still in its infancy, meaning that the rest of the iceberg is a treasure chest yet to be discovered.
Cloning and expression of gene encoding P23 protein from Cryptosporidium parvum

Directory of Open Access Journals (Sweden)

Dinh Thi Bich Lan

2014-12-01

Full Text Available We cloned the cp23 gene encoding P23 (a glycoprotein) from Cryptosporidium parvum isolated from Thua Thien Hue province, Vietnam. The coding region of the cp23 gene from C. parvum is 99% similar to the cp23 gene deposited in NCBI (accession number: U34390). SDS-PAGE and Western blot analysis showed that the cp23 gene in E. coli BL21 Star™ (DE3) produced polypeptides with molecular weights of approximately 37, 40 and 49 kDa. These molecules may be non-glycosylated or glycosylated P23 fusion polypeptides. Recombinant P23 protein purified by GST (glutathione S-transferase) affinity chromatography can be used as an antigen for C. parvum antibody production as well as to develop a diagnostic kit for C. parvum.
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

Science.gov (United States)

Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) an n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
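The two measurements named in the record above, an n-tuple Zipf ranking and an n-gram entropy, can both be computed from the same table of overlapping n-gram counts. The sketch below is a generic illustration, not the authors' code. The function names are hypothetical, and the redundancy is normalized as one minus the ratio of the measured entropy to the maximum possible entropy, which is one common convention and may differ in detail from the definition used in the paper.

```python
import math
from collections import Counter


def ngram_counts(seq, n):
    """Count overlapping n-grams (n-tuples) in a sequence string."""
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def zipf_ranking(seq, n):
    """Return n-gram counts sorted from most to least frequent (rank order)."""
    return [count for _, count in ngram_counts(seq, n).most_common()]


def ngram_entropy(seq, n):
    """Shannon entropy, in bits, of the empirical n-gram distribution."""
    counts = ngram_counts(seq, n)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def ngram_redundancy(seq, n, alphabet_size=4):
    """Redundancy as 1 - H(n) / (n * log2(alphabet_size)); assumed convention."""
    max_entropy = n * math.log2(alphabet_size)
    return 1.0 - ngram_entropy(seq, n) / max_entropy


toy = "ACGT" * 10
print(ngram_entropy(toy, 1), ngram_redundancy(toy, 1))
```

A Zipf plot is then the log of each count in `zipf_ranking` against the log of its rank. For short sequences and large n the empirical entropy is biased downward because most n-grams are never observed, which is one of the intrinsic limitations the abstract alludes to.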
Evolution of naturally occurring 5'non-coding region variants of Hepatitis C virus in human populations of the South American region

Directory of Open Access Journals (Sweden)

García-Aguirre Laura

2007-08-01

Full Text Available Abstract Background Hepatitis C virus (HCV) has been the subject of intense research and clinical investigation as its major role in human disease has emerged. Previous and recent studies have suggested a diversification of type 1 HCV in the South American region. The degree of genetic variation among HCV strains circulating in Bolivia and Colombia is currently unknown. In order to gain insight into these matters, we performed a phylogenetic analysis of HCV 5' non-coding region (5'NCR) sequences from strains isolated in Bolivia, Colombia and Uruguay, as well as available comparable sequences of HCV strains isolated in South America. Methods Phylogenetic tree analysis was performed using the neighbor-joining method under a matrix of genetic distances established under the Kimura two-parameter model. Signature pattern analysis, which identifies particular sites in nucleic acid alignments of variable sequences that are distinctly representative relative to a background set, was performed using the method of Korber & Myers, as implemented in the VESPA program. Prediction of RNA secondary structures was done by the method of Zuker & Turner, as implemented in the mfold program. Results Phylogenetic tree analysis of HCV strains isolated in the South American region revealed the presence of a distinct genetic lineage inside genotype 1. Signature pattern analysis revealed that the presence of this lineage is consistent with the presence of a sequence signature in the 5'NCR of HCV strains isolated in South America. Comparisons of these results with the ones found for Europe or North America revealed that this sequence signature is characteristic of the South American region. Conclusion Phylogenetic analysis revealed the presence of a sequence signature in the 5'NCR of type 1 HCV strains isolated in South America. This signature is frequent enough in type 1 HCV populations circulating in South America to be detected in a phylogenetic tree analysis as a distinct
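The Kimura two-parameter distance named in the Methods of the record above has a closed form in terms of the proportions of transitions (P) and transversions (Q) between two aligned sequences: d = -(1/2) ln(1 - 2P - Q) - (1/4) ln(1 - 2Q). The sketch below computes it for one pair of sequences. The function name is hypothetical, sequences are assumed to be pre-aligned and written with DNA letters, and columns containing gaps or ambiguity codes are simply skipped, which is an assumption rather than the study's documented procedure.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes (assumed handling)
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1   # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    # Equivalent to -0.5*ln(1 - 2p - q) - 0.25*ln(1 - 2q); math.log raises
    # ValueError when the sequences are too divergent for the model.
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

Filling a symmetric matrix with this value for every pair of sequences gives the kind of distance matrix that a neighbor-joining routine takes as input.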
Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

Science.gov (United States)

Richardson, Dale N.; Wiehe, Thomas

Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.
ProClaT, a new bioinformatics tool for in silico protein reclassification: case study of DraB, a protein coded from the draTGB operon in Azospirillum brasilense.

Science.gov (United States)

Rubel, Elisa Terumi; Raittz, Roberto Tadeu; Coimbra, Nilson Antonio da Rocha; Gehlen, Michelly Alves Coutinho; Pedrosa, Fábio de Oliveira

2016-12-15

Azospirillum brasilense is a plant-growth promoting nitrogen-fixing bacterium that is used as a bio-fertilizer in agriculture. Since nitrogen fixation has a high energy demand, the reduction of N2 to NH4+ by nitrogenase occurs only under limiting conditions of NH4+ and O2. Moreover, the synthesis and activity of nitrogenase are highly regulated to prevent energy waste. In A. brasilense nitrogenase activity is regulated by the products of draG and draT. The product of the draB gene, located downstream in the draTGB operon, may be involved in the regulation of nitrogenase activity by an as yet unknown mechanism. A deep in silico analysis of the product of draB was undertaken, aiming to suggest its possible function and involvement with DraT and DraG in the regulation of nitrogenase activity in A. brasilense. In this work, we present a new artificial intelligence strategy for protein classification, named ProClaT. The features used by the pattern recognition model were derived from the primary structure of the DraB homologous proteins, calculated by a ProClaT internal algorithm. ProClaT was applied to this case study and the results revealed that the A. brasilense draB gene codes for a protein highly similar to the nitrogenase associated NifO protein of Azotobacter vinelandii. This tool allowed the reclassification of DraB/NifO homologous proteins, hypothetical, conserved hypothetical and those annotated as putative arsenate reductase, ArsC, as NifO-like. An analysis of co-occurrence of draB, draT, draG and of other nif genes was performed, suggesting the involvement of draB (nifO) in nitrogen fixation, however, without the definition of a specific function.
Development and characterization of polyclonal antibodies against the linker region of the telomere-binding protein TRF2

Directory of Open Access Journals (Sweden)

Nadya V. Ilicheva

2018-03-01

Full Text Available Background: TRF2 (telomeric repeat binding factor 2) is an essential component of the telomere-binding protein complex shelterin. TRF2 induces the formation of a special structure of telomeric DNA and counteracts activation of DNA damage-response pathways at telomeres. TRF2 has a poorly characterized linker region (udTRF2) between its homodimerization and DNA-binding domains. Some lines of evidence have shown that this region could be involved in TRF2 interaction with nuclear lamina. Results: In this study, the fragment of the TERF2 gene encoding the udTRF2 domain of telomere-binding protein TRF2 was produced by PCR and cloned into the pET32a vector. The resulting plasmid pET32a-udTRF2 was used for the expression of the recombinant udTRF2 in E. coli RosettaBlue (DE3). The protein was isolated and purified using ammonium sulfate precipitation followed by ion-exchange chromatography. The purified recombinant protein udTRF2 was injected into guinea pigs to generate polyclonal antibodies. The ability of anti-udTRF2 antibodies to bind endogenous TRF2 in human skin fibroblasts was tested by western blotting and immunofluorescent staining. Conclusions: In this study, the recombinant protein udTRF2 and antibodies to it were generated. Both protein and antibodies will provide a useful tool for investigation of the functions of the udTRF2 domain and its role in the interaction between TRF2 and nuclear lamina. Keywords: Chromosomes, Molecular cloning, Nuclear lamina, Nucleoprotein complexes, Polyclonal antibodies, Recombinant polypeptide, Shelterin, Telomere-binding protein TRF2, Telomeres, Telomeric DNA, TTAGGG repeats

Bactericidal activity of M protein conserved region antibodies against group A streptococcal isolates from the Northern Thai population

Directory of Open Access Journals (Sweden)

Pruksachatkunakorn Chulabhorn

2006-08-01

Background: Most group A streptococcal (GAS) vaccine strategies have focused on the surface M protein, a major virulence factor of GAS. The amino-terminus of the M protein elicits antibodies that are both opsonic and protective, but which are type specific. J14, a chimeric peptide that contains 14 amino acids from the M protein conserved C-region at the carboxy-terminus, offers the possibility of a vaccine that will elicit protective opsonic antibodies against multiple different GAS strains. In this study, we searched for J14 and J14-like sequences and the number of their repeats in the C-region of the M protein from GAS strains isolated from the Northern Thai population. Then, we examined the bactericidal activity of J14, J14.1, J14-R1 and J14-R2 antisera against multiple Thai GAS strains. Results: The emm genes of GAS isolates were sequenced and grouped as 14 different J14-types. The greatest diversity of J14-types was found in the C1-repeat. The J14.1 type was the major sequence in the C2 and C3-repeats. We have shown that antisera raised against the M protein conserved C-repeat region peptides J14, J14.1, J14-R1 and J14-R2, commonly found in GAS isolates from the Northern Thai population, are able to kill GAS of multiple different emm types derived from an endemic area. The mean percent bactericidal activity of all J14 and J14-like peptide antisera against GAS isolates was more than 70%. The mean percent bactericidal activity was highest for J14 antisera, followed by J14-R2, J14.1 and J14-R1 antisera. Conclusion: Our study demonstrated that antisera raised against the M protein conserved C-repeat region are able to kill multiple different strains of GAS isolated from the Northern Thai population. Therefore, the four conserved "J14" peptides have the potential to be used as GAS vaccine candidates to prevent streptococcal infections in an endemic area.
Characterization of the regions from E. coli 16 S RNA covalently linked to ribosomal proteins S4 and S20 after ultraviolet irradiation

International Nuclear Information System (INIS)

Ehresmann, B.; Backendorf, C.; Ehresmann, C.; Ebel, J.P.

1977-01-01

The use of ultraviolet irradiation to form photochemical covalent bonds between the 16 S RNA and a ribosomal protein is a reliable method to check which RNA regions interact with the protein. This technique has been used successfully to covalently link RNA or DNA to specific proteins in several cases. In the case of the ribosome, it has been shown that irradiation of 30 S and 50 S subunits with high doses of ultraviolet light allowed the covalent binding of almost all of the ribosomal proteins to the 16 S or 23 S RNAs. Under mild conditions, only proteins S7 and L4 could be covalently linked to the 16 S and 23 S RNAs, respectively, and the 16 S RNA region linked to protein S7 has now been characterized. The specificity of the photoreaction was demonstrated earlier, and the tryptic peptides from proteins S4 and S7, photochemically linked to the 16 S RNA complexes, were identified. A report is presented on the sequences of the RNA regions which can be photochemically linked to proteins S4 and S20 after ultraviolet irradiation of the specific S4-16 S RNA and S20-16 S RNA complexes.
Td4IN2: A drought-responsive durum wheat (Triticum durum Desf.) gene coding for a resistance like protein with serine/threonine protein kinase, nucleotide binding site and leucine rich domains.

Science.gov (United States)

Rampino, Patrizia; De Pascali, Mariarosaria; De Caroli, Monica; Luvisi, Andrea; De Bellis, Luigi; Piro, Gabriella; Perrotta, Carla

2017-11-01

Wheat, the main food source for a third of the world population, appears to be strongly under threat because of predicted increasing temperatures coupled with drought. The complex molecular response of plants to drought stress relies on the gene network controlling cell reactions to abiotic stress. In the natural environment, plants are subjected to a combination of abiotic and biotic stresses. The response of plants to biotic stress, to cope with pathogens, also involves the activation of a molecular network. Investigations on the combination of abiotic and biotic stresses indicate the existence of cross-talk between the two networks, and some degree of overlap can be hypothesized. In this work we describe the isolation and characterization of a drought-related durum wheat (Triticum durum Desf.) gene, identified in a previous study, coding for a protein that combines features of an NBS-LRR type resistance protein with a S/TPK domain and is involved in the drought stress response. This is one of the few examples reported where all three domains are present in a single protein and, to our knowledge, it is the first report of a gene with this particular structure that is specifically induced by drought stress and drought-related conditions. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Multiuser Random Coding Techniques for Mismatched Decoding

OpenAIRE

Scarlett, Jonathan; Martinez, Alfonso; Guillén i Fàbregas, Albert

2016-01-01

This paper studies multiuser random coding techniques for channel coding with a given (possibly suboptimal) decoding rule. For the mismatched discrete memoryless multiple-access channel, an error exponent is obtained that is tight with respect to the ensemble average, and positive within the interior of Lapidoth's achievable rate region. This exponent proves the ensemble tightness of the exponent of Liu and Hughes in the case of maximum-likelihood decoding. An equivalent dual form of Lapidoth...
[Promoting the code of ethics for nurses].

Science.gov (United States)

Chamboredon, Patrick; Lecointre, Brigitte

2017-09-01

The publication of the code of ethics for nurses requires the French National Order of Nurses' structures to undertake initiatives with the aim of promoting it as well as implementing the public service missions which have now been attributed to the Order. Each regional and departmental body has its role to play in raising awareness of this code and its application in the field. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Simple method for identification of plasmid-coded proteins

International Nuclear Information System (INIS)

Sancar, A.; Hack, A.M.; Rupp, W.D.

1979-01-01

Proteins encoded by plasmid DNA are specifically labeled in UV-irradiated cells of Escherichia coli carrying recA and uvrA mutations, because extensive degradation of the chromosome DNA occurs concurrently with amplification of plasmid DNA.
Decoding the function of nuclear long non-coding RNAs.

Science.gov (United States)

Chen, Ling-Ling; Carmichael, Gordon G

2010-06-01

Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.
Butane-1,2,3,4-tetraol-based amphiphilic stereoisomers for membrane protein study: importance of chirality in the linker region

DEFF Research Database (Denmark)

Das, Manabendra; Du, Yang; Mortensen, Jonas S.

2017-01-01

of the targeted membrane proteins depending on the chirality of the linker region. These findings indicate an important role for detergent stereochemistry in membrane protein stabilization. In addition, we generally observed enhanced detergent efficacy with increasing alkyl chain length, reinforcing...
Junk DNA and the long non-coding RNA twist in cancer genetics

NARCIS (Netherlands)

H. Ling (Hui); K. Vincent; M. Pichler; R. Fodde (Riccardo); I. Berindan-Neagoe (Ioana); F.J. Slack (Frank); G.A. Calin (George)

2015-01-01

The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs
CFRX, a one-and-a-quarter-dimensional transport code for field-reversed configuration studies

International Nuclear Information System (INIS)

Hsiao Mingyuan

1989-01-01

A one-and-a-quarter-dimensional transport code, which includes radial as well as some two-dimensional effects for field-reversed configurations, is described. The set of transport equations is transformed to a set of new independent and dependent variables and is solved as a coupled initial-boundary value problem. The code simulation includes both the closed and open field regions. The axial effects incorporated include global axial force balance, axial losses in the open field region, and flux surface averaging over the closed field region. A typical example of the code results is also given. (orig.)
Design of ACM system based on non-greedy punctured LDPC codes

Science.gov (United States)

Lu, Zijun; Jiang, Zihong; Zhou, Lin; He, Yucheng

2017-08-01

In this paper, an adaptive coded modulation (ACM) scheme based on rate-compatible LDPC (RC-LDPC) codes was designed. The RC-LDPC codes were constructed by a non-greedy puncturing method which showed good performance in the high code rate region. Moreover, an incremental redundancy scheme for the LDPC-based ACM system over an AWGN channel was proposed. With this scheme, code rates vary from 2/3 to 5/6 and the complexity of the ACM system is lowered. Simulations show that an increasingly pronounced coding gain can be obtained by the proposed ACM system at higher throughput.
Toric Varieties and Codes, Error-correcting Codes, Quantum Codes, Secret Sharing and Decoding

DEFF Research Database (Denmark)

Hansen, Johan Peder

We present toric varieties and associated toric codes and their decoding. Toric codes are applied to construct Linear Secret Sharing Schemes (LSSS) with strong multiplication by the Massey construction. Asymmetric Quantum Codes are obtained from toric codes by the A.R. Calderbank, P.W. Shor and A.M. Steane construction of stabilizer codes (CSS) from linear codes containing their dual codes.
Markov analysis of alpha-helical, beta-sheet and random coil regions of proteins

International Nuclear Information System (INIS)

Macchiato, M.; Tramontano, A.

1983-01-01

The rules used up to now to predict the spatial configuration of proteins from their primary structure are mostly based on recurrence analysis of doublets, triplets and so on of contiguous amino acids, but they do not take into account the correlation characteristics of the whole amino acid sequence. A statistical analysis of amino acid sequences for the alpha-helical, beta-sheet and random coil regions of about twenty proteins with known secondary structure has been carried out, taking correlation effects into account. The results demonstrate that these sequences are at least a second-order Markov chain, i.e. they appear as if they were generated by a source that remembers at least the two amino acids before the one being generated, and these two previous symbols influence the present choice.
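To make the notion of a second-order Markov chain concrete, the following is a minimal sketch of how the probability of a residue given its two predecessors could be estimated by counting. It is not the authors' procedure: the function name `second_order_transitions` and the toy sequence are assumptions introduced here for illustration only.

```python
from collections import Counter, defaultdict


def second_order_transitions(sequence):
    """Estimate P(next residue | two preceding residues) by simple counting."""
    counts = defaultdict(Counter)
    for i in range(len(sequence) - 2):
        context = sequence[i:i + 2]          # the two "remembered" residues
        counts[context][sequence[i + 2]] += 1

    probabilities = {}
    for context, next_counts in counts.items():
        total = sum(next_counts.values())
        probabilities[context] = {
            residue: n / total for residue, n in next_counts.items()
        }
    return probabilities


# Hypothetical toy sequence, not taken from the study.
toy_sequence = "AEAAAKEAAAKA"
model = second_order_transitions(toy_sequence)
print(model["AA"])  # distribution of the residue that follows the pair "AA"
```

In a real analysis the counts would be pooled over many sequences of one structural class (helix, sheet or coil), and a statistical test would be needed to decide whether the second-order dependence is significant.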
Mutations of C19orf12, coding for a transmembrane glycine zipper containing mitochondrial protein, cause mis-localization of the protein, inability to respond to oxidative stress and increased mitochondrial Ca2+.

Directory of Open Access Journals (Sweden)

Paola Venco

2015-05-01

Mutations in C19orf12 have been identified in patients affected by Neurodegeneration with Brain Iron Accumulation (NBIA), a clinical entity characterized by iron accumulation in the basal ganglia. By using western blot analysis with a specific antibody and confocal studies, we showed that wild-type C19orf12 protein was not exclusively present in mitochondria, but also in the Endoplasmic Reticulum (ER) and MAM (Mitochondria Associated Membrane), while mutant C19orf12 variants presented a different localization. Moreover, after induction of oxidative stress, a GFP-tagged C19orf12 wild-type protein was able to relocate to the cytosol. In contrast, mutant isoforms were not able to respond to oxidative stress. High mitochondrial calcium concentration and increased H2O2-induced apoptosis were found in fibroblasts derived from one patient as compared to controls. C19orf12 protein is a 17 kDa mitochondrial membrane-associated protein whose function is still unknown. Our in silico investigation suggests that the glycine zipper motifs of C19orf12 form helical regions spanning the membrane. The N- and C-terminal regions with respect to the transmembrane portion, by contrast, are predicted to rearrange into a structural domain that is homologous to the N-terminal regulatory domain of the magnesium transporter MgtE, suggesting that C19orf12 may act as a regulatory protein for human MgtE transporters. The mutations described here affect, respectively, one glycine residue of the glycine zipper motifs, which are involved in dimerization of transmembrane helices, a change predicted to impair the correct localization of the protein into the membranes, and one residue present in the regulatory domain, which is important for protein-protein interaction.
Novel classes of non-coding RNAs and cancer

Directory of Open Access Journals (Sweden)

Sana Jiri

2012-05-01

For many years, the central dogma of molecular biology has been that RNA functions mainly as an informational intermediate between a DNA sequence and its encoded protein. But one of the great surprises of modern biology was the discovery that protein-coding genes represent less than 2% of the total genome sequence, and subsequently the fact that at least 90% of the human genome is actively transcribed. Thus, the human transcriptome was found to be more complex than a collection of protein-coding genes and their splice variants. Although initially argued to be spurious transcriptional noise or accumulated evolutionary debris arising from the early assembly of genes and/or the insertion of mobile genetic elements, recent evidence suggests that the non-coding RNAs (ncRNAs) may play major biological roles in cellular development, physiology and pathologies. NcRNAs can be grouped into two major classes based on transcript size: small ncRNAs and long ncRNAs. Each of these classes can be further divided, and novel subclasses are still being discovered and characterized. Although the small ncRNAs called microRNAs have been studied most frequently in recent years, with more than ten thousand hits in the PubMed database, evidence has recently begun to accumulate describing the molecular mechanisms by which a wide range of novel RNA species function, providing insight into their functional roles in cellular biology and in human disease. In this review, we summarize newly discovered classes of ncRNAs, and highlight their functioning in cancer biology and potential usage as biomarkers or therapeutic targets.
Molecular mechanics work station for protein conformational studies

International Nuclear Information System (INIS)

Fine, R.; Levinthal, C.; Schoenborn, B.; Dimmier, G.; Rankowitz, C.

1984-01-01

Interest in computational problems in biology has intensified over the last few years, partly due to the development of techniques for the rapid cloning, sequencing, and mutagenesis of genes from organisms ranging from E. coli to man. The central dogma of molecular biology, that DNA codes for mRNA which codes for protein, has been understood in a linear programming sense since the genetic code was cracked. But what is not understood at present is how a protein, once assembled as a long sequence of amino acids, folds back on itself to produce a three-dimensional structure which is unique to that protein and which dictates its chemical and biological activity. This folding process is purely physics, and involves the time evolution of a system of several thousand atoms which interact with each other and with atoms from the surrounding solvent. Molecular dynamics simulations on smaller molecules suggest that approaches which treat the protein as a classical ensemble of atoms interacting with each other via an empirical Hamiltonian can yield the kind of predictive results one would like when applied to proteins.
A long-term, integrated impact assessment of alternative building energy code scenarios in China

International Nuclear Information System (INIS)

Yu, Sha; Eom, Jiyong; Evans, Meredydd; Clarke, Leon

2014-01-01

China is the second largest building energy user in the world, ranking first and third in residential and commercial energy consumption. Beginning in the early 1980s, the Chinese government has developed a variety of building energy codes to improve building energy efficiency and reduce total energy demand. This paper studies the impact of building energy codes on energy use and CO2 emissions by using a detailed building energy model that represents four distinct climate zones, each with three building types, nested in a long-term integrated assessment framework, GCAM. An advanced building stock module, coupled with the building energy model, is developed to reflect the characteristics of future building stock and its interaction with the development of building energy codes in China. This paper also evaluates the impacts of building codes on building energy demand in the presence of economy-wide carbon policy. We find that building energy codes would reduce Chinese building energy use by 13–22% depending on building code scenarios, with a similar effect preserved even under the carbon policy. The impact of building energy codes shows regional and sectoral variation due to regionally differentiated responses of heating and cooling services to shell efficiency improvement. - Highlights: • We assessed long-term impacts of building codes and climate policy using GCAM. • Building energy codes would reduce Chinese building energy use by 13–22%. • The impacts of codes on building energy use vary by climate region and sub-sector.
Automatic coding method of the ACR Code

International Nuclear Information System (INIS)

Park, Kwi Ae; Ihm, Jong Sool; Ahn, Woo Hyun; Baik, Seung Kook; Choi, Han Yong; Kim, Bong Gi

1993-01-01

The authors developed a computer program for automatic coding of the ACR (American College of Radiology) code. Automatic coding of the ACR code is essential for computerization of data in a department of radiology. The program was written in the FoxBASE language and has been used for automatic coding of diagnoses in the Department of Radiology, Wallace Memorial Baptist since May 1992. The ACR dictionary consisted of 11 files, one for the organ code and the others for the pathology code. The organ code was obtained by typing the organ name or the code number itself, and then selecting among the upper- and lower-level codes of the chosen entry, which were displayed simultaneously on the screen. According to the first digit of the selected organ code, the corresponding pathology code file was chosen automatically. The proper pathology code was then obtained in a similar fashion to organ code selection. An example of an obtained ACR code is '131.3661'. This procedure was reproducible regardless of the number of fields of data. Because the program was written in 'User's Defined Function' form, decoding of the stored ACR code was achieved by the same program, and the program could be incorporated into other data processing programs. The program had the merits of simple operation, accurate and detailed coding, and easy adaptation to other programs. Therefore, this program can be used for automation of routine work in the department of radiology.
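The two-step lookup described in this record (an organ code first, then a pathology file selected by the first digit of that organ code) can be sketched as follows. This is only an illustration in Python, not the original FoxBASE program; the dictionaries, their placeholder entries, and the function names `build_acr_code` and `decode_acr_code` are hypothetical and do not reproduce real ACR index terms.

```python
# Hypothetical stand-ins for the dictionary files; the descriptions are
# placeholders, not real ACR index terms.
ORGAN_CODES = {
    "131": "organ entry 131 (placeholder)",
}
# One pathology dictionary per leading digit of the organ code.
PATHOLOGY_FILES = {
    "1": {"3661": "pathology entry 3661 (placeholder)"},
}


def build_acr_code(organ_code, pathology_code):
    """Validate both parts and join them as 'organ.pathology'."""
    if organ_code not in ORGAN_CODES:
        raise KeyError(f"unknown organ code: {organ_code}")
    # The first digit of the organ code selects the pathology file.
    pathology_file = PATHOLOGY_FILES[organ_code[0]]
    if pathology_code not in pathology_file:
        raise KeyError(f"unknown pathology code: {pathology_code}")
    return f"{organ_code}.{pathology_code}"


def decode_acr_code(acr_code):
    """Reverse lookup: return the two descriptions for a stored code."""
    organ_code, pathology_code = acr_code.split(".")
    organ = ORGAN_CODES[organ_code]
    pathology = PATHOLOGY_FILES[organ_code[0]][pathology_code]
    return organ, pathology


code = build_acr_code("131", "3661")
print(code)
print(decode_acr_code(code))
```

Keeping encoding and decoding against the same dictionaries mirrors the record's remark that the stored code could be decoded by the same program.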
UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

Science.gov (United States)

Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

2016-01-04

The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Translational regulation of ribosomal protein S15 drives characteristic patterns of protein-mRNA epistasis.

Science.gov (United States)

Mallik, Saurav; Basu, Sudipto; Hait, Suman; Kundu, Sudip

2018-04-21

Do coding and regulatory segments of a gene co-evolve with each other? Seeking answers to this question, here we analyze the case of Escherichia coli ribosomal protein S15, which represses its own translation by specifically binding its messenger RNA (rpsO mRNA) and stabilizing a pseudoknot structure at the upstream untranslated region, thus trapping the ribosome in an incomplete translation initiation complex. In the absence of S15, ribosomal protein S1 recognizes rpsO and promotes translation by melting this very pseudoknot. We employ a robust statistical method to detect signatures of positive epistasis between residue site pairs and find that biophysical constraints of translational regulation (S15-rpsO and S1-rpsO recognition, S15-mediated rpsO structural rearrangement, and S1-mediated melting) are strong predictors of positive epistasis. Transforming the epistatic pairs into a network, we find that signatures of two different but interconnected regulatory cascades are imprinted in the sequence space and can be captured in terms of two dense network modules that are sparsely connected to each other. This network topology further reflects a general principle of how functionally coupled components of biological networks are interconnected. These results depict a model case where translational regulation drives characteristic residue-level epistasis, not only between a protein and its own mRNA but also between a protein and the mRNA of an entirely different protein. © 2018 Wiley Periodicals, Inc.

Origins of gene, genetic code, protein and life: comprehensive view ...

Indian Academy of Sciences (India)

Unknown

production, suggesting that proteins were originally produced by random peptide formation of amino acids restricted in specific amino acid compositions .... using random numbers by a computer, to confirm whether main chains of ...... world on the origin of life by the pseudo-replication of. [GADV]-proteins in the absence of ...
Code stroke in Asturias.

Science.gov (United States)

Benavente, L; Villanueva, M J; Vega, P; Casado, I; Vidal, J A; Castaño, B; Amorín, M; de la Vega, V; Santos, H; Trigo, A; Gómez, M B; Larrosa, D; Temprano, T; González, M; Murias, E; Calleja, S

2016-04-01

Intravenous thrombolysis with alteplase is an effective treatment for ischaemic stroke when applied during the first 4.5 hours, but less than 15% of patients have access to this technique. Mechanical thrombectomy is more frequently able to recanalise proximal occlusions in large vessels, but the infrastructure it requires makes it even less available. We describe the implementation of code stroke in Asturias, as well as the process of adapting various existing resources for urgent stroke care in the region. By considering these resources, and the demographic and geographic circumstances of our region, we examine ways of reorganising the code stroke protocol that would optimise treatment times and provide the most appropriate treatment for each patient. We distributed the 8 health districts in Asturias so as to permit referral of candidates for reperfusion therapies to either of the 2 hospitals with 24-hour stroke units and on-call neurologists and providing IV fibrinolysis. Hospitals were assigned according to proximity and stroke severity; the most severe cases were immediately referred to the hospital with on-call interventional neurology care. Patient triage was provided by pre-hospital emergency services according to the NIHSS score. Modifications to code stroke in Asturias have allowed us to apply reperfusion therapies with good results, while emphasising equitable care and managing the severity-time ratio to offer the best and safest treatment for each patient as soon as possible. Copyright © 2015 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.
Coding in pigeons: Multiple-coding versus single-code/default strategies.

Science.gov (United States)

Pinto, Carlos; Machado, Armando

2015-05-01

To investigate the coding strategies that pigeons may use in temporal discrimination tasks, pigeons were trained on a matching-to-sample procedure with three sample durations (2 s, 6 s and 18 s) and two comparisons (red and green hues). One comparison was correct following 2-s samples and the other was correct following both 6-s and 18-s samples. Tests were then run to contrast the predictions of two hypotheses concerning the pigeons' coding strategies, the multiple-coding and the single-code/default. According to the multiple-coding hypothesis, three response rules are acquired, one for each sample. According to the single-code/default hypothesis, only two response rules are acquired, one for the 2-s sample and a "default" rule for any other duration. In retention interval tests, pigeons preferred the "default" key, a result predicted by the single-code/default hypothesis. In no-sample tests, pigeons preferred the key associated with the 2-s sample, a result predicted by multiple-coding. Finally, in generalization tests, when the sample duration equaled 3.5 s, the geometric mean of 2 s and 6 s, pigeons preferred the key associated with the 6-s and 18-s samples, a result predicted by the single-code/default hypothesis. The pattern of results suggests the need for models that take into account multiple sources of stimulus control. © Society for the Experimental Analysis of Behavior.
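The 3.5-s test duration quoted in this record is the geometric mean of the 2-s and 6-s samples, rounded. A short check of that arithmetic is sketched below; the helper name `geometric_mean` is introduced here for illustration, and the arithmetic mean is shown only for contrast.

```python
import math


def geometric_mean(a, b):
    """Geometric mean of two positive durations, in the same unit."""
    return math.sqrt(a * b)


short_sample, long_sample = 2.0, 6.0                    # seconds
gm = geometric_mean(short_sample, long_sample)          # sqrt(12)
am = (short_sample + long_sample) / 2                   # arithmetic mean

print(f"geometric mean: {gm:.2f} s, arithmetic mean: {am:.1f} s")
```

The geometric mean is the point that stands in the same ratio to both durations (2 : 3.46 equals 3.46 : 6), which is why it is a common choice of midpoint in temporal discrimination studies.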
Expression of Aleutian mink disease parvovirus capsid proteins in defined segments: localization of immunoreactive sites and neutralizing epitopes to specific regions.

Science.gov (United States)

Bloom, M E; Martin, D A; Oie, K L; Huhtanen, M E; Costello, F; Wolfinbarger, J B; Hayes, S F; Agbandje-McKenna, M

1997-01-01

The capsid proteins of the ADV-G isolate of Aleutian mink disease parvovirus (ADV) were expressed in 10 nonoverlapping segments as fusions with maltose-binding protein in pMAL-C2 (pVP1, pVP2a through pVP2i). The constructs were designed to capture the VP1 unique sequence and the portions analogous to the four variable surface loops of canine parvovirus (CPV) in individual fragments (pVP2b, pVP2d, pVP2e, and pVP2g, respectively). The panel of fusion proteins was immunoblotted with sera from mink infected with ADV. Seropositive mink infected with either ADV-TR, ADV-Utah, or ADV-Pullman reacted preferentially against certain segments, regardless of mink genotype or virus inoculum. The most consistently immunoreactive regions were pVP2g, pVP2e, and pVP2f, the segments that encompassed the analogs of CPV surface loops 3 and 4. The VP1 unique region was also consistently immunoreactive. These findings indicated that infected mink recognize linear epitopes that localized to certain regions of the capsid protein sequence. The segment containing the hypervariable region (pVP2d), corresponding to CPV loop 2, was also expressed from ADV-Utah. An anti-ADV-G monoclonal antibody and a rabbit anti-ADV-G capsid antibody reacted exclusively with the ADV-G pVP2d segment but not with the corresponding segment from ADV-Utah. Mink infected with ADV-TR or ADV-Utah also preferentially reacted with the pVP2d sequence characteristic of that virus. These results suggested that the loop 2 region may contain a type-specific linear epitope and that the epitope may also be specifically recognized by infected mink. Heterologous antisera were prepared against the VP1 unique region and the four segments capturing the variable surface loops of CPV. The antisera against the proteins containing loop 3 or loop 4, as well as the anticapsid antibody, neutralized ADV-G infectivity in vitro and bound to capsids in immune electron microscopy. These results suggested that regions of the ADV capsid proteins
Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region

International Nuclear Information System (INIS)

Riley, L.K.; Morrow, J.K.; Danton, M.J.; Coleman, M.S.

1988-01-01

Human terminal deoxyribonucleotidyltransferase cDNA contains an open reading frame of 1530 base pairs (bp) corresponding to a protein containing 510 amino acids. The encoded protein is a template-independent DNA polymerase found only in a restricted population of normal and malignant prelymphocytes. To begin to investigate the genetic elements responsible for the tissue-specific expression of terminal deoxyribonucleotidyltransferase, genomic clones containing the entire human gene were isolated and characterized. Initially, cDNA clones were isolated from a library generated from the human lymphoblastoid cell line MOLT-4R. A cDNA clone containing the entire coding region of the protein was used to isolate a series of overlapping clones from two human genomic libraries. The gene comprises 11 exons and 10 introns and spans 49.4 kilobases. The 5' flanking region (709 bp) including exon 1 was sequenced. Several putative transcription initiation sites were mapped. Within 500 nucleotides of the translation start site, a series of promoter elements was detected. TATA and CAAT sequences, respectively, were found to start at nucleotides -185 and -204, -328 and -370, and -465 and -505. Start sites were found for a cyclic AMP-dependent promoter analog at nucleotide -121, an eight-base sequence corresponding to the IgG promoter enhancer (cd) at nucleotide -455, and an analog of the IgG promoter (pd) at nucleotide -159. These findings suggest that transcripts coding for terminal deoxyribonucleotidyltransferase may be variable in length and that transcription may be influenced by a variety of genetic elements.
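The relation between the 1530-bp open reading frame and the 510-residue protein in this record is the three-nucleotides-per-codon rule. A minimal check is sketched below; the function name `codons_in_orf` is an assumption introduced here, and the record does not say whether the quoted length includes a stop codon, so the sketch simply divides by three.

```python
def codons_in_orf(orf_length_bp):
    """Number of codons in an open reading frame of the given length."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_length_bp // 3


amino_acids = codons_in_orf(1530)
print(amino_acids)
```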
Sub-nanoscale surface ruggedness provides a water-tight seal for exposed regions in soluble protein structure.

Directory of Open Access Journals (Sweden)

Erica Schulz

2010-09-01

Soluble proteins must keep backbone hydrogen bonds (BHBs) water-tight to ensure structural integrity. This protection is often achieved by burying the BHBs or wrapping them through intermolecular associations. On the other hand, water has low coordination resilience, with loss of hydrogen-bonding partnerships carrying significant thermodynamic cost. Thus, a core problem in structural biology is whether natural design actually exploits the water coordination stiffness to seal the backbone in regions that are exposed to the solvent. This work explores the molecular design features that make this type of seal operative, focusing on the side-chain arrangements that shield the protein backbone. We show that efficient sealing is achieved by adapting the sub-nanoscale surface topography to the stringency of water coordination: an exposed BHB may be kept dry if the local concave curvature is small enough to impede formation of the coordination shell of a penetrating water molecule. Examination of an exhaustive database of uncomplexed proteins reveals that exposed BHBs invariably occur within such sub-nanoscale cavities in native folds, while this level of local ruggedness is absent in other regions. By contrast, BHB exposure in misfolded proteins occurs with larger local curvature, promoting backbone hydration and, consequently, structure disruption. These findings unravel physical constraints fitting a spatially dependent least-action for water coordination, introduce a molecular design concept, and herald the advent of water-tight peptide-based materials with sufficient backbone exposure to remain flexible.
Code Cactus

Energy Technology Data Exchange (ETDEWEB)

Fajeau, M; Nguyen, L T; Saunier, J [Commissariat a l' Energie Atomique, Centre d' Etudes Nucleaires de Saclay, 91 - Gif-sur-Yvette (France)

1966-09-01

This code handles the following problems: (1) analysis of thermal experiments on a water loop at high or low pressure, in steady-state or transient conditions; (2) analysis of the thermal and hydrodynamic behavior of water-cooled and water-moderated reactors, at either high or low pressure, with boiling permitted. Fuel elements are assumed to be flat plates. The flow distribution among parallel channels, coupled or not by conduction across the plates, is treated for imposed pressure-drop or flowrate conditions that may or may not vary with time. The power can be coupled to a reactor kinetics calculation or supplied by the code user. The code, which contains a schematic representation of safety rod behavior, is a one-dimensional, multi-channel code; its complement, FLID, is a one-channel, two-dimensional code. (authors)
Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.

Science.gov (United States)

Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W

2018-05-31

In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. Developments in high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES, based on supervised deep learning approaches, for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates the potential of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.
AHM1, a Novel Type of Nuclear Matrix–Localized, MAR Binding Protein with a Single AT Hook and a J Domain–Homologous Region

Science.gov (United States)

Morisawa, Gaku; Han-yama, Atsushi; Moda, Ichiro; Tamai, Atsushi; Iwabuchi, Masaki; Meshi, Tetsuo

2000-01-01

Interactions between the nuclear matrix and special regions of chromosomal DNA called matrix attachment regions (MARs) have been implicated in various nuclear functions. We have identified a novel protein from wheat, AT hook–containing MAR binding protein1 (AHM1), that binds preferentially to MARs. A multidomain protein, AHM1 has the special combination of a J domain–homologous region and a Zn finger–like motif (a J-Z array) and an AT hook. For MAR binding, the AT hook at the C terminus was essential, and an internal portion containing the Zn finger–like motif was additionally required in vivo. AHM1 was found in the nuclear matrix fraction and was localized in the nucleoplasm. AHM1 fused to green fluorescent protein had a speckled distribution pattern inside the nucleus. AHM1 is most likely a nuclear matrix component that functions between intranuclear framework and MARs. J-Z arrays can be found in a group of (hypothetical) proteins in plants, which may share some functions, presumably to recruit specific Hsp70 partners as co-chaperones. PMID:11041885
Improvement of heterologous protein production in Aspergillus oryzae by RNA interference with alpha-amylase genes.

Science.gov (United States)

Nemoto, Takashi; Maruyama, Jun-ichi; Kitamoto, Katsuhiko

2009-11-01

Aspergillus oryzae RIB40 has three alpha-amylase genes (amyA, amyB, and amyC), and secretes alpha-amylase abundantly. However, large amounts of endogenous secretory proteins such as alpha-amylase can compete with heterologous protein in the secretory pathway and decrease its production yields. In this study, we examined the effects of suppression of alpha-amylase on heterologous protein production in A. oryzae, using the bovine chymosin (CHY) as a reporter heterologous protein. The three alpha-amylase genes in A. oryzae have nearly identical DNA sequences from those promoters to the coding regions. Hence we performed silencing of alpha-amylase genes by RNA interference (RNAi) in the A. oryzae CHY producing strain. The silenced strains exhibited a reduction in alpha-amylase activity and an increase in CHY production in the culture medium. This result suggests that suppression of alpha-amylase is effective in heterologous protein production in A. oryzae.
The Non-Coding Regulatory RNA Revolution in Archaea

Directory of Open Access Journals (Sweden)

Diego Rivera Gelsinger

2018-03-01

Small non-coding RNAs (sRNAs) are ubiquitously found in the three domains of life, playing large-scale roles in gene regulation, transposable element silencing and defense against foreign elements. While a substantial body of experimental work has been done to uncover the function of sRNAs in Bacteria and Eukarya, the functional roles of sRNAs in Archaea are still poorly understood. Recently, high-throughput studies using RNA-sequencing revealed that sRNAs are broadly expressed in the Archaea, comprising thousands of transcripts within the transcriptome during non-challenged and stressed conditions. Antisense sRNAs, which overlap a portion of a gene on the opposite strand (cis-acting), are the most abundantly expressed non-coding RNAs and they can be classified based on their binding patterns to mRNAs (3′ untranslated region (UTR), 5′ UTR, CDS-binding). These antisense sRNAs target many genes and pathways, suggesting extensive roles in gene regulation. Intergenic sRNAs are less abundantly expressed and their targets are difficult to find because of a lack of complete overlap between sRNAs and target mRNAs (trans-acting). While many sRNAs have been validated experimentally, a regulatory role has only been reported for very few of them. Further work is needed to elucidate sRNA-RNA binding mechanisms, the molecular determinants of sRNA-mediated regulation, whether protein components are involved and how sRNAs integrate with complex regulatory networks.
Long- and short-term selective forces on malaria parasite genomes

KAUST Repository

Nygaard, Sanne

2010-09-09

Plasmodium parasites, the causal agents of malaria, result in more than 1 million deaths annually. Plasmodium are unicellular eukaryotes with small ~23 Mb genomes encoding ~5200 protein-coding genes. The protein-coding genes comprise about half of these genomes. Although evolutionary processes have a significant impact on malaria control, the selective pressures within Plasmodium genomes are poorly understood, particularly in the non-protein-coding portion of the genome. We use evolutionary methods to describe selective processes in both the coding and non-coding regions of these genomes. Based on genome alignments of seven Plasmodium species, we show that protein-coding, intergenic and intronic regions are all subject to purifying selection and we identify 670 conserved non-genic elements. We then use genome-wide polymorphism data from P. falciparum to describe short-term selective processes in this species and identify some candidate genes for balancing (diversifying) selection. Our analyses suggest that there are many functional elements in the non-genic regions of these genomes and that adaptive evolution has occurred more frequently in the protein-coding regions of the genome. © 2010 Nygaard et al.
An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

Directory of Open Access Journals (Sweden)

Ru Huang

Imprinted macro non-protein-coding (nc) RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues, differentiated ES cells and fetal head, using an optimized RNA-Seq strategy. The data produced are highly reproducible in different sequencing locations and detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose lengths range from 80 to 118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.
Complete mitochondrial genome of endangered Yellow-shouldered Amazon (Amazona barbadensis): two control region copies in parrot species of the Amazona genus.

Science.gov (United States)

Urantowka, Adam Dawid; Hajduk, Kacper; Kosowska, Barbara

2013-08-01

Amazona barbadensis is an endangered species of parrot living in northern coastal Venezuela and on several Caribbean islands. In this study, we sequenced the full mitochondrial genome of this species. The mitogenome was 18,983 bp long and contained 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, a duplicated control region, and degenerate copies of the ND6 and tRNA (Glu) genes. The high degree of identity between the two copies of the control region suggests their coincident evolution and functionality. Comparative analysis of both control region sequences from four Amazona species revealed 89.1% identity over a region of 1300 bp and indicates the presence of distinctive parts of the two control region copies.
The origins and evolutionary history of human non-coding RNA regulatory networks.

Science.gov (United States)

Sherafatian, Masih; Mowla, Seyed Javad

2017-04-01

The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. The lack of conservation of long non-coding RNAs and the small size of microRNAs have been major obstacles to their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in the eukaryata phylostratum, corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animal evolution. Coding sequence targets of miRNAs showed a delayed rise at the opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. The lncRNA regulatory network was the latest to evolve, at eumetazoa.
Genome-wide conserved non-coding microsatellite (CNMS) marker-based integrative genetical genomics for quantitative dissection of seed weight in chickpea.

Science.gov (United States)

Bajaj, Deepak; Saxena, Maneesha S; Kujur, Alice; Das, Shouvik; Badoni, Saurabh; Tripathi, Shailesh; Upadhyaya, Hari D; Gowda, C L L; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K; Parida, Swarup K

2015-03-01

Phylogenetic footprinting identified 666 genome-wide paralogous and orthologous CNMS (conserved non-coding microsatellite) markers from 5'-untranslated and regulatory regions (URRs) of 603 protein-coding chickpea genes. The (CT)n and (GA)n CNMS carrying CTRMCAMV35S and GAGA8BKN3 regulatory elements, respectively, are abundant in the chickpea genome. The mapped genic CNMS markers with robust amplification efficiencies (94.7%) detected higher intraspecific polymorphic potential (37.6%) among genotypes, implying their immense utility in chickpea breeding and genetic analyses. Seventeen differentially expressed CNMS marker-associated genes showing strong preferential and seed tissue/developmental stage-specific expression in contrasting genotypes were selected to narrow down the gene targets underlying seed weight quantitative trait loci (QTLs)/eQTLs (expression QTLs) through integrative genetical genomics. The integration of transcript profiling with seed weight QTL/eQTL mapping, molecular haplotyping, and association analyses identified potential molecular tags (GAGA8BKN3 and RAV1AAT regulatory elements and alleles/haplotypes) in the LOB-domain-containing protein- and KANADI protein-encoding transcription factor genes controlling the cis-regulated expression for seed weight in the chickpea. This emphasizes the potential of CNMS marker-based integrative genetical genomics for the quantitative genetic dissection of complex seed weight in chickpea. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology.
A novel TaqMan® assay for Nosema ceranae quantification in honey bee, based on the protein coding gene Hsp70.

Science.gov (United States)

Cilia, Giovanni; Cabbri, Riccardo; Maiorana, Giacomo; Cardaio, Ilaria; Dall'Olio, Raffaele; Nanetti, Antonio

2018-04-01

Nosema ceranae is now a widespread honey bee pathogen with high incidence in apiculture. Rapid and reliable detection and quantification methods are a matter of concern for research community, nowadays mainly relying on the use of biomolecular techniques such as PCR, RT-PCR or HRMA. The aim of this technical paper is to provide a new qPCR assay, based on the highly-conserved protein coding gene Hsp70, to detect and quantify the microsporidian Nosema ceranae affecting the western honey bee Apis mellifera. The validation steps to assess efficiency, sensitivity, specificity and robustness of the assay are described also. Copyright © 2018 Elsevier GmbH. All rights reserved.
Primary structure and localization of a conserved immunogenic Plasmodium falciparum glutamate rich protein (GLURP) expressed in both the preerythrocytic and erythrocytic stages of the vertebrate life cycle

DEFF Research Database (Denmark)

Borre, M B; Dziegiel, M; Høgh, B

1991-01-01

A gene coding for a 220-kDa glutamate rich protein (GLURP), an exoantigen of Plasmodium falciparum, was isolated and its nucleotide sequence was determined. The deduced amino acid sequence contains 2 repeat regions. The sequence of one of these was shown to be conserved among geographically...
Non-coding RNAs and epigenome: de novo DNA methylation, allelic exclusion and X-inactivation

Directory of Open Access Journals (Sweden)

V. A. Halytskiy

2013-12-01

Non-coding RNAs are a widespread class of cellular RNAs. They participate in many important processes in cells: signaling, posttranscriptional silencing, protein biosynthesis, splicing, maintenance of genome stability, telomere lengthening, and X-inactivation. Nevertheless, the activity of these RNAs is not restricted to the posttranscriptional sphere, but also covers processes that change or maintain epigenetic information. Non-coding RNAs can directly bind to DNA targets and cause their repression through recruitment of DNA methyltransferases as well as chromatin modifying enzymes. Such events constitute the molecular mechanism of RNA-dependent DNA methylation. It is possible that the RNA-DNA interaction is a universal mechanism triggering de novo DNA methylation. Allelic exclusion can also be based on the described mechanism. This phenomenon takes place when a non-coding RNA, whose precursor is transcribed from one allele, triggers DNA methylation in all other alleles present in the cell. Note that miRNA-mediated transcriptional silencing resembles allelic exclusion, because both the miRNA gene and the genes that can be targeted by this miRNA contain elements with the same sequences. It can be assumed that RNA-dependent DNA methylation and allelic exclusion originated to counteract the activity of mobile genetic elements. Probably, thinning and deregulation of the cellular non-coding RNA pattern allows reactivation of silent mobile genetic elements, resulting in genome instability that leads to ageing and carcinogenesis. In the course of X-inactivation, DNA methylation and subsequent heterochromatinization of the X chromosome can be triggered by direct hybridization of the 5′-end of the large non-coding RNA Xist with DNA targets in remote regions of the X chromosome.
Positive selection neighboring functionally essential sites and disease-implicated regions of mammalian reproductive proteins.

LENUS (Irish Health Repository)

Morgan, Claire C

2010-01-01

ABSTRACT: BACKGROUND: Reproductive proteins are central to the continuation of all mammalian species. The evolution of these proteins has been greatly influenced by environmental pressures induced by pathogens, rival sperm, sexual selection and sexual conflict. Positive selection has been demonstrated in many of these proteins with particular focus on primate lineages. However, the mammalia are a diverse group in terms of mating habits, population sizes and germ line generation times. We have examined the selective pressures at work on a number of novel reproductive proteins across a wide variety of mammalia. RESULTS: We show that selective pressures on reproductive proteins are highly varied. Of the 10 genes analyzed in detail, all contain signatures of positive selection either across specific sites or in specific lineages or a combination of both. Our analysis of SP56 and Col1a1 are entirely novel and the results show positively selected sites present in each gene. Our findings for the Col1a1 gene are suggestive of a link between positive selection and severe disease type. We find evidence in our dataset to suggest that interacting proteins are evolving in symphony: most likely to maintain interacting functionality. CONCLUSION: Our in silico analyses show positively selected sites are occurring near catalytically important regions suggesting selective pressure to maximize efficient fertilization. In those cases where a mechanism of protein function is not fully understood, the sites presented here represent ideal candidates for mutational study. This work has highlighted the widespread rate heterogeneity in mutational rates across the mammalia and specifically has shown that the evolution of reproductive proteins is highly varied depending on the species and interacting partners. We have shown that positive selection and disease are closely linked in the Col1a1 gene.

Construction of a mutagenesis cartridge for poliovirus genome-linked viral protein: isolation and characterization of viable and nonviable mutants

International Nuclear Information System (INIS)

Kuhn, R.J.; Tada, H.; Ypma-Wong, M.F.; Dunn, J.J.; Semler, B.L.; Wimmer, E.

1988-01-01

By following a strategy of genetic analysis of poliovirus, the authors have constructed a synthetic mutagenesis cartridge spanning the genome-linked viral protein coding region and flanking cleavage sites in an infectious cDNA clone of the type I (Mahoney) genome. The insertion of new restriction sites within the infectious clone has allowed them to replace the wild-type sequences with short complementary pairs of synthetic oligonucleotides containing various mutations. A set of mutations have been made that create methionine codons within the genome-linked viral protein region. The resulting viruses have growth characteristics similar to wild type. Experiments that led to an alteration of the tyrosine residue responsible for the linkage to RNA have resulted in nonviable virus. In one mutant, proteolytic processing assayed in vitro appeared unimpaired by the mutation. They suggest that the position of the tyrosine residue is important for genome-linked viral protein function(s)
Safety, codes and standards for hydrogen installations. Metrics development and benchmarking

Energy Technology Data Exchange (ETDEWEB)

Harris, Aaron P. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Dedrick, Daniel E. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); LaFleur, Angela Christine [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); San Marchi, Christopher W. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

2014-04-01

Automakers and fuel providers have made public commitments to commercialize light duty fuel cell electric vehicles and fueling infrastructure in select US regions beginning in 2014. The development, implementation, and advancement of meaningful codes and standards is critical to enable the effective deployment of clean and efficient fuel cell and hydrogen solutions in the energy technology marketplace. Metrics pertaining to the development and implementation of safety knowledge, codes, and standards are important to communicate progress and inform future R&D investments. This document describes the development and benchmarking of metrics specific to the development of hydrogen specific codes relevant for hydrogen refueling stations. These metrics will be most useful as the hydrogen fuel market transitions from pre-commercial to early-commercial phases. The target regions in California will serve as benchmarking case studies to quantify the success of past investments in research and development supporting safety codes and standards R&D.
Accelerator-driven transmutation reactor analysis code system (ATRAS)

Energy Technology Data Exchange (ETDEWEB)

Sasa, Toshinobu; Tsujimoto, Kazufumi; Takizuka, Takakazu; Takano, Hideki [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment

1999-03-01

JAERI is conducting a design study of a hybrid-type minor actinide transmutation system, which mainly consists of an intense proton accelerator and a fast subcritical core. The neutronics and burnup characteristics of the accelerator-driven system are important from the viewpoint of maintaining subcriticality and energy balance during system operation. To determine those characteristics accurately, it is necessary to include reactions in the high-energy region, which are not treated in ordinary reactor analysis codes. The authors developed a code system named ATRAS to analyze the neutronics and burnup characteristics of accelerator-driven subcritical reactor systems. ATRAS provides burnup analysis that takes account of the effect of the spallation neutron source. ATRAS consists of a spallation analysis code, a neutron transport code and a burnup analysis code. Utility programs for fuel exchange, pre-processing and post-processing are also incorporated. (author)
Evaluation Codes from an Affine Variety Code Perspective

DEFF Research Database (Denmark)

Geil, Hans Olav

2008-01-01

Evaluation codes (also called order domain codes) are traditionally introduced as generalized one-point geometric Goppa codes. In the present paper we will give a new point of view on evaluation codes by introducing them instead as particular nice examples of affine variety codes. Our study... includes a reformulation of the usual methods to estimate the minimum distances of evaluation codes into the setting of affine variety codes. Finally we describe the connection to the theory of one-point geometric Goppa codes. Contents: 4.1 Introduction... 4.9 Codes from order domains; 4.10 One-point geometric Goppa codes; 4.11 Bibliographical Notes; References...
A novel bidirectional expression system for simultaneous expression of both the protein-coding genes and short hairpin RNAs in mammalian cells

International Nuclear Information System (INIS)

Hung, C.-F.; Cheng, T.-L.; Wu, R.-H.; Teng, C.-F.; Chang, W.-T.

2006-01-01

RNA interference (RNAi) is an extremely powerful and widely used gene silencing approach for reverse functional genomics and molecular therapeutics. In mammals, the conserved poly(ADP-ribose) polymerase 2 (PARP-2)/RNase P bidirectional control promoter simultaneously expresses both the PARP-2 protein and RNase P RNA by RNA polymerase II- and III-dependent mechanisms, respectively. To explore this unique bidirectional control system in RNAi-mediated gene silencing strategy, we have constructed two novel bidirectional expression vectors, pbiHsH1 and pbiMmH1, which contained the PARP-2/RNase P bidirectional control promoters from human and mouse, for simultaneous expression of both the protein-coding genes and short hairpin RNAs. Analyses of the dual transcriptional activities indicated that these two bidirectional expression vectors could not only express enhanced green fluorescent protein as a functional reporter but also simultaneously transcribe shLuc for inhibiting the firefly luciferase expression. In addition, to extend its utility for the establishment of inherited stable clones, we have also reconstructed this bidirectional expression system with the blasticidin S deaminase gene, an effective dominant drug resistance selectable marker, and examined both the selection and inhibition efficiencies in drug resistance and gene expression. Moreover, we have further demonstrated that this bidirectional expression system could efficiently co-regulate the functionally important genes, such as overexpression of tumor suppressor protein p53 and inhibition of anti-apoptotic protein Bcl-2 at the same time. In summary, the bidirectional expression vectors, pbiHsH1 and pbiMmH1, should provide a simple, convenient, and efficient novel tool for manipulating the gene function in mammalian cells
Genetic Diversity and Natural Selection in 42 kDa Region of Plasmodium vivax Merozoite Surface Protein-1 from China-Myanmar Endemic Border.

Science.gov (United States)

Zhou, Xia; Tambo, Ernest; Su, Jing; Fang, Qiang; Ruan, Wei; Chen, Jun-Hu; Yin, Ming-Bo; Zhou, Xiao-Nong

2017-10-01

The Plasmodium vivax merozoite surface protein-1 (PvMSP1) gene codes for a major malaria vaccine candidate antigen. However, its polymorphic nature represents an obstacle to the design of a protective vaccine. In this study, we analyzed the genetic polymorphism and natural selection of the C-terminal 42 kDa fragment within the PvMSP1 gene (PvMSP142) from 77 P. vivax isolates, collected from imported cases in China-Myanmar border (CMB) areas in Yunnan province and from inland cases in Anhui, Yunnan, and Zhejiang provinces in China during 2009-2012. In total, 41 haplotypes were identified, 30 of which were new. The differences between the rates of non-synonymous and synonymous mutations suggest that PvMSP142 has evolved under natural selection, and that a high selective pressure preferentially acted on the identified regions of PvMSP133. Our results also demonstrated that PvMSP142 of P. vivax isolates collected in China-Myanmar border areas displays higher genetic polymorphism than that of isolates collected from inland China. Such results have significant implications for understanding the dynamics of the P. vivax population and may provide useful information for China's malaria elimination campaign strategies.
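The comparison of non-synonymous and synonymous mutations mentioned above rests on classifying each codon difference by whether it changes the encoded amino acid. The sketch below is a minimal illustration under stated assumptions, not the method used in the study: it uses the standard genetic code, takes two already aligned, gap-free, in-frame sequences, and returns raw counts rather than normalized dN/dS rates. The function name `classify_codon_changes` and the example sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_changes(ref: str, alt: str) -> tuple[int, int]:
    """Count (synonymous, non-synonymous) codon differences between two
    aligned, in-frame coding sequences of equal length (no gaps)."""
    ref, alt = ref.upper(), alt.upper()
    if len(ref) != len(alt) or len(ref) % 3 != 0:
        raise ValueError("sequences must be aligned, in frame, and equal in length")
    syn = nonsyn = 0
    for i in range(0, len(ref), 3):
        a, b = ref[i:i + 3], alt[i:i + 3]
        if a == b:
            continue  # identical codon, nothing to classify
        if CODON_TABLE[a] == CODON_TABLE[b]:
            syn += 1      # same amino acid: synonymous change
        else:
            nonsyn += 1   # different amino acid (or stop): non-synonymous change
    return syn, nonsyn


# Made-up sequences: GCT->GCC keeps Ala, AAA->GAA changes Lys to Glu.
print(classify_codon_changes("ATGGCTAAA", "ATGGCCGAA"))
```

Turning such counts into rates requires dividing by the numbers of synonymous and non-synonymous sites, which this sketch deliberately leaves out.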
Novel overlapping coding sequences in Chlamydia trachomatis

DEFF Research Database (Denmark)

Jensen, Klaus Thorleif; Petersen, Lise; Falk, Søren

2006-01-01

that are in agreement with the primary annotation. Forty two genes from the primary annotation are not predicted by EasyGene. The majority of these genes are listed as hypothetical in the primary annotation. The 15 novel predicted genes all overlap with genes on the complementary strand. We find homologues of several...... of the novel genes in C. trachomatis Serovar A and Chlamydia muridarum. Several of the genes have typical gene-like and protein-like features. Furthermore, we confirm transcriptional activity from 10 of the putative genes. The combined evidence suggests that at least seven of the 15 are protein coding genes...
Genome-wide occupancy profile of mediator and the Srb8-11 module reveals interactions with coding regions

DEFF Research Database (Denmark)

Zhu, Xuefeng; Wirén, Marianna; Sinha, Indranil

2006-01-01

Mediator exists in a free form containing the Med12, Med13, CDK8, and CycC subunits (the Srb8-11 module) and a smaller form, which lacks these four subunits and associates with RNA polymerase II (Pol II), forming a holoenzyme. We use chromatin immunoprecipitation (ChIP) and DNA microarrays...... to investigate genome-wide localization of Mediator and the Srb8-11 module in fission yeast. Mediator and the Srb8-11 module display similar binding patterns, and interactions with promoters and upstream activating sequences correlate with increased transcription activity. Unexpectedly, Mediator also interacts...... with the downstream coding region of many genes. These interactions display a negative bias for positions closer to the 5' ends of open reading frames (ORFs) and appear functionally important, because downregulation of transcription in a temperature-sensitive med17 mutant strain correlates with increased Mediator...
QR Codes: Taking Collections Further

Science.gov (United States)

Ahearn, Caitlin

2014-01-01

With some thought and direction, QR (quick response) codes are a great tool to use in school libraries to enhance access to information. From March through April 2013, Caitlin Ahearn interned at Sanborn Regional High School (SRHS) under the supervision of Pam Harland. As a result of Harland's un-Deweying of the nonfiction collection at SRHS,…
An Optimal Linear Coding for Index Coding Problem

OpenAIRE

Pezeshkpour, Pouya

2015-01-01

An optimal linear coding solution for the index coding problem is established. Instead of a network coding approach focused on graph-theoretic and algebraic methods, a linear coding program for solving both unicast and groupcast index coding problems is presented. The coding is proved to be the optimal solution from the linear perspective and can easily be used for any number of messages. The importance of this work lies mostly in the use of the presented coding in the groupcast index coding ...
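For readers unfamiliar with index coding, the textbook two-receiver case shows why a linear code can save transmissions. The sketch below is that toy case only, not the linear program proposed in the paper: receiver A wants message m1 and already holds m2, receiver B wants m2 and already holds m1, so a single broadcast of m1 XOR m2 serves both. The function names `encode` and `decode` are hypothetical.

```python
def encode(m1: bytes, m2: bytes) -> bytes:
    """Sender: combine two equal-length messages into one broadcast (bytewise XOR)."""
    if len(m1) != len(m2):
        raise ValueError("messages must have equal length")
    return bytes(a ^ b for a, b in zip(m1, m2))


def decode(broadcast: bytes, side_info: bytes) -> bytes:
    """Receiver: recover the wanted message by XORing out the one already held."""
    if len(broadcast) != len(side_info):
        raise ValueError("side information must match the broadcast length")
    return bytes(a ^ b for a, b in zip(broadcast, side_info))


m1, m2 = b"index", b"codes"
tx = encode(m1, m2)       # one transmission instead of two
print(decode(tx, m2))     # receiver A holds m2 and recovers m1
print(decode(tx, m1))     # receiver B holds m1 and recovers m2
```

XOR is addition over GF(2), so this is the simplest instance of a linear index code; the paper's setting generalizes to arbitrary numbers of messages and to groupcast demands.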
Hypoxic-induced stress protein expression in rat cardiac myocytes

International Nuclear Information System (INIS)

Howard, G.; Geoghegan, T.E.

1986-01-01

Mammalian stress proteins can be induced in cells and tissues exposed to a variety of conditions including hyperthermia and diminished O2 supply. The authors have previously shown that the expression of three stress proteins (71, 85, and 95 kDa) was induced in cardiac tissue from mice exposed to hypoxic conditions. The expression of mRNAs coding for the 85 and 95 kDa proteins increases with time of exposure to hypoxia, while the mRNA coding for the 71 kDa protein is transiently induced. The authors extended these studies to investigate the expression of stress proteins in isolated rat cardiac myocytes. Freshly prepared myocytes were exposed to control, hypoxic, anoxic, or heat-shock environments for up to 16 h. The proteins were then labeled for 6 h with [35S]methionine. Analysis of the solubilized proteins by SDS-PAGE and autoradiography showed that there was a 6-fold increase in synthesis of the 85 kDa protein upon exposure to hypoxia but not heat-shock conditions. The 71 kDa protein was present at high levels in both control and treated myocyte protein preparations, and presumably had been induced during the isolation procedure. Total RNA isolated from intact rat heart and isolated myocytes was compared by cell-free translation analysis and showed induction of RNAs coding for several stress proteins in the myocyte preparation. The induced proteins at 85 and 95 kDa have molecular weights similar to reported cell stress and/or glucose-regulated proteins.
Non-coding genomic regions possessing enhancer and silencer potential are associated with healthy aging and exceptional survival.

Science.gov (United States)

Kim, Sangkyu; Welsh, David A; Myers, Leann; Cherry, Katie E; Wyckoff, Jennifer; Jazwinski, S Michal

2015-02-28

We have completed a genome-wide linkage scan for healthy aging using data collected from a family study, followed by fine-mapping by association in a separate population, the first such attempt reported. The family cohort consisted of parents of age 90 or above and their children ranging in age from 50 to 80. As a quantitative measure of healthy aging, we used a frailty index, called FI34, based on 34 health and function variables. The linkage scan found a single significant linkage peak on chromosome 12. Using an independent cohort of unrelated nonagenarians, we carried out a fine-scale association mapping of the region suggestive of linkage and identified three sites associated with healthy aging. These healthy-aging sites (HASs) are located in intergenic regions at 12q13-14. HAS-1 has been previously associated with multiple diseases, and an enhancer was recently mapped and experimentally validated within the site. HAS-2 is a previously uncharacterized site possessing genomic features suggestive of enhancer activity. HAS-3 contains features associated with Polycomb repression. The HASs also contain variants associated with exceptional longevity, based on a separate analysis. Our results provide insight into functional genomic networks involving non-coding regulatory elements that are involved in healthy aging and longevity.
Transfection of Chinese hamster ovary DHFR⁻ cells with the gene coding for heat shock protein 70 from Drosophila melanogaster

International Nuclear Information System (INIS)

Duffy, J.J.; Carper, S.W.; Gerner, E.W.

1987-01-01

Chinese hamster ovary DHFR⁻ cells (CHO-DHFR⁻) were transfected with the plasmid pSV2-dhfr expressing the mouse gene coding for dhfr or with the same plasmid containing the gene coding for the Drosophila melanogaster heat shock protein 70 (hsp70), pSVd-hsp70. Three subcloned cell lines selected for expression of the dhfr gene were shown to contain either the vector sequence (G cells) or varying copies of pSVd-hsp70 (H cells). One line of H cells was shown to contain > 30 copies of the D. melanogaster hsp70 gene and to express the hsp70 RNA at significant levels. No difference between G and H cells was observed in the rate of growth, in the development of thermotolerance, or in the sensitivity of actin microfilament bundles to heat shock. However, H cells containing the transfected hsp70 gene had an altered morphology, being more fibroblastic than the G cells and the parental CHO-DHFR⁻ cells. The adhesion properties of the H cells were also decreased when compared to the G cells. These results show that insertion of the D. melanogaster gene into CHO cells does not affect growth rates or heat shock responses but may alter cell morphology and adhesion.
The coevolution of genes and genetic codes: Crick's frozen accident revisited.

Science.gov (United States)

Sella, Guy; Ardell, David H

2006-09-01

The standard genetic code is the nearly universal system for the translation of genes into proteins. The code exhibits two salient structural characteristics: it possesses a distinct organization that makes it extremely robust to errors in replication and translation, and it is highly redundant. The origin of these properties has intrigued researchers since the code was first discovered. One suggestion, which is the subject of this review, is that the code's organization is the outcome of the coevolution of genes and genetic codes. In 1968, Francis Crick explored the possible implications of coevolution at different stages of code evolution. Although he argues that coevolution was likely to influence the evolution of the code, he concludes that it falls short of explaining the organization of the code we see today. The recent application of mathematical modeling to study the effects of errors on the course of coevolution suggests a different conclusion. It shows that coevolution readily generates genetic codes that are highly redundant and similar in their error-correcting organization to the standard code. We review this recent work and suggest that further affirmation of the role of coevolution can be attained by investigating the extent to which the outcome of coevolution is robust to other influences that were present during the evolution of the code.
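The redundancy mentioned in the record above can be made concrete by counting how many codons map to each amino acid in the standard genetic code. The following is a minimal sketch, not part of the reviewed work: the names `BASES`, `AMINO_ACIDS`, `CODON_TABLE`, and `degeneracy` are illustrative, and the amino-acid string is the standard code written in T, C, A, G codon order with `*` marking stop codons.

```python
from collections import Counter
from itertools import product

# Standard genetic code, one letter per codon, with codons enumerated in
# T, C, A, G order (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each of the 64 codons (e.g. "TTT") to its one-letter amino acid.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons per amino acid (or stop signal).
degeneracy = Counter(CODON_TABLE.values())

for aa, n in sorted(degeneracy.items(), key=lambda item: (-item[1], item[0])):
    print(f"{aa}: {n} codon(s)")
```

With 64 codons shared among 20 amino acids and the stop signal, most amino acids are encoded by more than one codon; only methionine and tryptophan have a single codon each.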
Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions

Energy Technology Data Exchange (ETDEWEB)

MacArthur, Stewart; Li, Xiao-Yong; Li, Jingyi; Brown, James B.; Chu, Hou Cheng; Zeng, Lucy; Grondona, Brandi P.; Hechmer, Aaron; Simirenko, Lisa; Keranen, Soile V.E.; Knowles, David W.; Stapleton, Mark; Bickel, Peter; Biggin, Mark D.; Eisen, Michael B.

2009-05-15

BACKGROUND: We previously established that six sequence-specific transcription factors that initiate anterior/posterior patterning in Drosophila bind to overlapping sets of thousands of genomic regions in blastoderm embryos. While regions bound at high levels include known and probable functional targets, more poorly bound regions are preferentially associated with housekeeping genes and/or genes not transcribed in the blastoderm, and are frequently found in protein coding sequences or in less conserved non-coding DNA, suggesting that many are likely non-functional. RESULTS: Here we show that an additional 15 transcription factors that regulate other aspects of embryo patterning show a similar quantitative continuum of function and binding to thousands of genomic regions in vivo. Collectively, the 21 regulators show a surprisingly high overlap in the regions they bind given that they belong to 11 DNA binding domain families, specify distinct developmental fates, and can act via different cis-regulatory modules. We demonstrate, however, that quantitative differences in relative levels of binding to shared targets correlate with the known biological and transcriptional regulatory specificities of these factors. CONCLUSIONS: It is likely that the overlap in binding of biochemically and functionally unrelated transcription factors arises from the high concentrations of these proteins in nuclei, which, coupled with their broad DNA binding specificities, directs them to regions of open chromatin. We suggest that most animal transcription factors will be found to show a similar broad overlapping pattern of binding in vivo, with specificity achieved by modulating the amount, rather than the identity, of bound factor.
Specificity and affinity quantification of protein-protein interactions.

Science.gov (United States)

Yan, Zhiqiang; Guo, Liyong; Hu, Liang; Wang, Jin

2013-05-01

Most biological processes are mediated by protein-protein interactions. Determination of protein-protein structures and insight into their interactions are vital to understanding the mechanisms of protein function. Currently, compared with isolated protein structures, only a small fraction of protein-protein structures have been solved experimentally. Therefore, computational docking methods play an increasing role in predicting the structures and interactions of protein-protein complexes. The scoring function of protein-protein interactions is the key factor in the accuracy of computational docking. Previous scoring functions were mostly developed by optimizing the binding affinity, which determines the stability of the protein-protein complex, but they often fail to consider specificity, which determines the discrimination of the native protein-protein complex against competing ones. We developed a scoring function (named SPA-PP, specificity and affinity of the protein-protein interactions) by incorporating both specificity and affinity into the optimization strategy. The testing results and comparisons with other scoring functions show that SPA-PP performs remarkably well on predictions of both binding pose and binding affinity. Thus, SPA-PP is a promising quantification of protein-protein interactions, which can be implemented in protein docking tools and applied to the prediction of protein-protein structure and affinity. The algorithm is implemented in C language, and the code can be downloaded from http://dl.dropbox.com/u/1865642/Optimization.cpp.
Role of N-terminal 28-amino-acid region of Rhizopus oryzae lipase in directing proteins to secretory pathway of Aspergillus oryzae.

Science.gov (United States)

Hama, Shinji; Tamalampudi, Sriappareddy; Shindo, Naoki; Numata, Takao; Yamaji, Hideki; Fukuda, Hideki; Kondo, Akihiko

2008-07-01

To develop a new approach for improving heterologous protein production in Aspergillus oryzae, we focused on the functional role of the N-terminal region of Rhizopus oryzae lipase (ROL). Several N-terminal deletion variants of ROL were expressed in A. oryzae. Interestingly, a segment of 28 amino acids from the C-terminal region of the propeptide (N28) was found to be critical for secretion of ROL into the culture medium. To further investigate the role of N28, the ROL secretory process was visualized in vivo using ROL-green fluorescent protein (GFP) fusion proteins. In cells producing ROL with N28, fluorescence observations showed that the fusion proteins are transported through endoplasmic reticulum (ER), Golgi, and cell wall, which is one of the typical secretory processes in a eukaryotic cell. Because the expression of the mature ROL-GFP fusion protein induced fluorescence accumulation without its translocation into the ER, N28 is considered to play a crucial role in protein transport. When N28 was inserted between the secretion signal and GFP, fluorescence observations showed that GFP, which is originally a cytoplasmic protein, was efficiently translocated into the ER of A. oryzae, resulting in an enhanced secretion of mature GFP after proteolytic cleavage of N28. These findings suggest that N28 facilitates protein translocation into ER and can be a promising candidate for improving heterologous protein production in A. oryzae.
Partitioning of genetic variation between regulatory and coding gene segments: the predominance of software variation in genes encoding introvert proteins.

Science.gov (United States)

Mitchison, A

1997-01-01

In considering genetic variation in eukaryotes, a fundamental distinction can be made between variation in regulatory (software) and coding (hardware) gene segments. For quantitative traits the bulk of variation, particularly that near the population mean, appears to reside in regulatory segments. The main exceptions to this rule concern proteins which handle extrinsic substances, here termed extrovert proteins. The immune system includes an unusually large proportion of this exceptional category, but even so its chief source of variation may well be polymorphism in regulatory gene segments. The main evidence for this view emerges from genome scanning for quantitative trait loci (QTL), which in the case of the immune system points to a major contribution of pro-inflammatory cytokine genes. Further support comes from sequencing of major histocompatibility complex (Mhc) class II promoters, where a high level of polymorphism has been detected. These Mhc promoters appear to act, in part at least, by gating the back-signal from T cells into antigen-presenting cells. Both these forms of polymorphism are likely to be sustained by the need for flexibility in the immune response. Future work on promoter polymorphism is likely to benefit from the input from genome informatics.
Lateral Organization of Influenza Virus Proteins in the Budozone Region of the Plasma Membrane.

Science.gov (United States)

Leser, George P; Lamb, Robert A

2017-05-01

between viral proteins in the plasma membrane. Some proteins, such as HA and M2, inherently cocluster within the membrane, although M2 is found mostly at the periphery of regions of HA, consistent with the proposed role of M2 in scission at the end of budding. The association between some pairs of influenza virus proteins, such as M2 and NP, appears to be brokered by additional influenza virus proteins, in this case M1. HA and NA, while raft associated, reside in distinct domains, reflecting their distributions in the viral membrane. Copyright © 2017 American Society for Microbiology.
The development of the code package PERMAK-3D/SC-1

International Nuclear Information System (INIS)

Bolobov, P. A.; Oleksuk, D. A.

2011-01-01

Code package PERMAK-3D/SC-1 was developed for performing pin-by-pin coupled neutronic and thermal hydraulic calculation of a core fragment of seven fuel assemblies and was designed on the basis of the 3D multigroup pin-by-pin code PERMAK-3D and the 3D (subchannel) thermal hydraulic code SC-1. The code package predicts axial and radial pin-by-pin power distribution and coolant parameters in the simulated region (enthalpies, velocities, void fractions, boiling and DNBR margins). The report describes some new steps in code package development. Some PERMAK-3D/SC-1 outcomes of WWER calculations are presented in the report. (Authors)
