WorldWideScience

Sample records for coding gene region

  1. Fast rate of evolution in alternatively spliced coding regions of mammalian genes

    Directory of Open Access Journals (Sweden)

    Nurtdinov Ramil N

    2006-04-01

    Full Text Available Abstract Background At least half of mammalian genes are alternatively spliced. Alternative isoforms are often genome-specific and it has been suggested that alternative splicing is one of the major mechanisms for generating protein diversity in the course of evolution. Another way of looking at alternative splicing is to consider sequence evolution of constitutive and alternative regions of protein-coding genes. Indeed, it turns out that constitutive and alternative regions evolve in different ways. Results A set of 3029 orthologous pairs of human and mouse alternatively spliced genes was considered. The rate of nonsynonymous substitutions (dN, the rate of synonymous substitutions (dS, and their ratio (ω = dN/dS appear to be significantly higher in alternatively spliced coding regions compared to constitutive regions. When N-terminal, internal and C-terminal alternatives are analysed separately, C-terminal alternatives appear to make the main contribution to the observed difference. The effects become even more pronounced in a subset of fast evolving genes. Conclusion These results provide evidence of weaker purifying selection and/or stronger positive selection in alternative regions and thus one more confirmation of accelerated evolution in alternative regions. This study corroborates the theory that alternative splicing serves as a testing ground for molecular evolution.

  2. Evidence for gene-specific rather than transcription rate-dependent histone H3 exchange in yeast coding regions.

    Science.gov (United States)

    Gat-Viks, Irit; Vingron, Martin

    2009-02-01

    In eukaryotic organisms, histones are dynamically exchanged independently of DNA replication. Recent reports show that different coding regions differ in their amount of replication-independent histone H3 exchange. The current paradigm is that this histone exchange variability among coding regions is a consequence of transcription rate. Here we put forward the idea that this variability might be also modulated in a gene-specific manner independently of transcription rate. To that end, we study transcription rate-independent replication-independent coding region histone H3 exchange. We term such events relative exchange. Our genome-wide analysis shows conclusively that in yeast, relative exchange is a novel consistent feature of coding regions. Outside of replication, each coding region has a characteristic pattern of histone H3 exchange that is either higher or lower than what was expected by its RNAPII transcription rate alone. Histone H3 exchange in coding regions might be a way to add or remove certain histone modifications that are important for transcription elongation. Therefore, our results that gene-specific coding region histone H3 exchange is decoupled from transcription rate might hint at a new epigenetic mechanism of transcription regulation.

  3. Detecting non-coding selective pressure in coding regions

    Directory of Open Access Journals (Sweden)

    Blanchette Mathieu

    2007-02-01

    Full Text Available Abstract Background Comparative genomics approaches, where orthologous DNA regions are compared and inter-species conserved regions are identified, have proven extremely powerful for identifying non-coding regulatory regions located in intergenic or intronic regions. However, non-coding functional elements can also be located within coding region, as is common for exonic splicing enhancers, some transcription factor binding sites, and RNA secondary structure elements affecting mRNA stability, localization, or translation. Since these functional elements are located in regions that are themselves highly conserved because they are coding for a protein, they generally escaped detection by comparative genomics approaches. Results We introduce a comparative genomics approach for detecting non-coding functional elements located within coding regions. Codon evolution is modeled as a mixture of codon substitution models, where each component of the mixture describes the evolution of codons under a specific type of coding selective pressure. We show how to compute the posterior distribution of the entropy and parsimony scores under this null model of codon evolution. The method is applied to a set of growth hormone 1 orthologous mRNA sequences and a known exonic splicing elements is detected. The analysis of a set of CORTBP2 orthologous genes reveals a region of several hundred base pairs under strong non-coding selective pressure whose function remains unknown. Conclusion Non-coding functional elements, in particular those involved in post-transcriptional regulation, are likely to be much more prevalent than is currently known. With the numerous genome sequencing projects underway, comparative genomics approaches like that proposed here are likely to become increasingly powerful at detecting such elements.

  4. Genetic variants in promoters and coding regions of the muscle glycogen synthase and the insulin-responsive GLUT4 genes in NIDDM

    DEFF Research Database (Denmark)

    Bjørbaek, C; Echwald, Søren Morgenthaler; Hubricht, P

    1994-01-01

    To examine the hypothesis that variants in the regulatory or coding regions of the glycogen synthase (GS) and insulin-responsive glucose transporter (GLUT4) genes contribute to insulin-resistant glucose processing of muscle from non-insulin-dependent diabetes mellitus (NIDDM) patients, promoter...... volunteers. By applying inverse polymerase chain reaction and direct DNA sequencing, 532 base pairs (bp) of the GS promoter were identified and the transcriptional start site determined by primer extension. SSCP scanning of the promoter region detected five single nucleotide substitutions, positioned at 42......'-untranslated region, and the coding region of the GLUT4 gene showed four polymorphisms, all single nucleotide substitutions, positioned at -581, 1, 30, and 582. None of the three changes in the regulatory region of the gene had any major influence on expression of the GLUT4 gene in muscle. The variant at 582...

  5. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

    Science.gov (United States)

    Hoffmann, Robert D; Palmgren, Michael

    2016-06-13

    Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.

  6. Systematic screening for mutations in the promoter and the coding region of the 5-HT{sub 1A} gene

    Energy Technology Data Exchange (ETDEWEB)

    Erdmann, J.; Shimron-Abarbanell, D.; Cichon, S. [Univ. of Bonn (Germany)] [and others

    1995-10-09

    In the present study we sought to identify genetic variation in the 5-HT{sub 1A} receptor gene which through alteration of protein function or level of expression might contribute to the genetic predisposition to neuropsychiatric diseases. Genomic DNA samples from 159 unrelated subjects (including 45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette`s syndrome, as well as 25 healthy controls) were investigated by single-strand conformation analysis. Overlapping PCR (polymerase chain reaction) fragments covered the whole coding sequence as well as the 5{prime} untranslated region of the 5-HT{sub 1A} gene. The region upstream to the coding sequence we investigated contains a functional promoter. We found two rare nucleotide sequence variants. Both mutations are located in the coding region of the gene: a coding mutation (A{yields}G) in nucleotide position 82 which leads to an amino acid exchange (Ile{yields}Val) in position 28 of the receptor protein and a silent mutation (C{yields}T) in nucleotide position 549. The occurrence of the Ile-28-Val substitution was studied in an extended sample of patients (n = 352) and controls (n = 210) but was found in similar frequencies in all groups. Thus, this mutation is unlikely to play a significant role in the genetic predisposition to the diseases investigated. In conclusion, our study does not provide evidence that the 5-HT{sub 1A} gene plays either a major or a minor role in the genetic predisposition to schizophrenia, bipolar affective disorder, or Tourette`s syndrome. 29 refs., 4 figs., 1 tab.

  7. XGC developments for a more efficient XGC-GENE code coupling

    Science.gov (United States)

    Dominski, Julien; Hager, Robert; Ku, Seung-Hoe; Chang, Cs

    2017-10-01

    In the Exascale Computing Program, the High-Fidelity Whole Device Modeling project initially aims at delivering a tightly-coupled simulation of plasma neoclassical and turbulence dynamics from the core to the edge of the tokamak. To permit such simulations, the gyrokinetic codes GENE and XGC will be coupled together. Numerical efforts are made to improve the numerical schemes agreement in the coupling region. One of the difficulties of coupling those codes together is the incompatibility of their grids. GENE is a continuum grid-based code and XGC is a Particle-In-Cell code using unstructured triangular mesh. A field-aligned filter is thus implemented in XGC. Even if XGC originally had an approximately field-following mesh, this field-aligned filter permits to have a perturbation discretization closer to the one solved in the field-aligned code GENE. Additionally, new XGC gyro-averaging matrices are implemented on a velocity grid adapted to the plasma properties, thus ensuring same accuracy from the core to the edge regions.

  8. Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions

    DEFF Research Database (Denmark)

    Lavstsen, Thomas; Salanti, Ali; Jensen, Anja T R

    2003-01-01

    and organization of the 3D7 PfEMP1 repertoire was investigated on the basis of the complete genome sequence. METHODS: Using two tree-building methods we analysed the coding and non-coding sequences of 3D7 var and rif genes as well as var genes of other parasite strains. RESULTS: var genes can be sub...

  9. A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

    Science.gov (United States)

    Kress, W John; Erickson, David L

    2007-06-06

    A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.

  10. Mutational analysis of the promoter and the coding region of the 5-HT1A gene

    Energy Technology Data Exchange (ETDEWEB)

    Erdmann, J.; Noethen, M.M.; Shimron-Abarbanell, D. [Univ. of Bonn (Germany)] [and others

    1994-09-01

    Disturbances of serotonergic pathways have been implicated in many neuropsychiatric disorders. Serotonin (5HT) receptors can be subdivided into at least three major families (5HT1, 5HT2, and 5HT3). Five human 5HT1 receptor subtypes have been cloned, namely 1A, 1D{alpha}, 1D{beta}, 1E, and 1F. Of these, the 5HT1A receptor is the best characterized subtype. In the present study we sought to identify genetic variation in the 5HT1A receptor gene which through alteration of protein function or level of expression might contribute to the genetics of neuropsychiatric diseases. The coding region and the 5{prime} promoter region of the 5HT1A gene from 159 unrelated subjects (45 schizophrenic, 46 bipolar affective, and 43 patients with Tourette`s syndrome, as well as 25 controls) were analyzed using SSCA. SSCA revealed the presence of two mutations both located in the coding region of the 5HT1A receptor gene. The first mutation is a rare silent C{r_arrow}T substitution at nucleotide position 549. The second mutation is characterized by a base pair substitution (A{r_arrow}G) at the first position of codon 28 and results in an amino acid exchange (Ile{r_arrow}Val). Since Val28 was found only in a single schizophrenic patient and in none of the other patients or controls, we decided to extend our samples and to use a restriction assay for screening a further 74 schizophrenic, 95 bipolar affective, and 49 patients with Tourette`s syndrome, as well as 185 controls, for the presence of the mutation. In total, the mutation was found in 2 schizophrenic patients, in 3 bipolars, in 1 Tourette patient, and in 5 controls. To our knowledge the Ile-28-Val substitution reported here is the first natural occuring molecular variant which has been identified for a serotonin receptor so far.

  11. Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.

    Science.gov (United States)

    Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir

    2017-08-01

    Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how the evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions.We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle.An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  12. Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

    Directory of Open Access Journals (Sweden)

    Chen Xie

    2012-09-01

    Full Text Available Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis, which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.

  13. A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements.

    Directory of Open Access Journals (Sweden)

    Eugeny A Elisaphenko

    2008-06-01

    Full Text Available X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC. Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.

  14. Single nucleotide polymorphisms (SNPs in coding regions of canine dopamine- and serotonin-related genes

    Directory of Open Access Journals (Sweden)

    Lingaas Frode

    2008-01-01

    Full Text Available Abstract Background Polymorphism in genes of regulating enzymes, transporters and receptors of the neurotransmitters of the central nervous system have been associated with altered behaviour, and single nucleotide polymorphisms (SNPs represent the most frequent type of genetic variation. The serotonin and dopamine signalling systems have a central influence on different behavioural phenotypes, both of invertebrates and vertebrates, and this study was undertaken in order to explore genetic variation that may be associated with variation in behaviour. Results Single nucleotide polymorphisms in canine genes related to behaviour were identified by individually sequencing eight dogs (Canis familiaris of different breeds. Eighteen genes from the dopamine and the serotonin systems were screened, revealing 34 SNPs distributed in 14 of the 18 selected genes. A total of 24,895 bp coding sequence was sequenced yielding an average frequency of one SNP per 732 bp (1/732. A total of 11 non-synonymous SNPs (nsSNPs, which may be involved in alteration of protein function, were detected. Of these 11 nsSNPs, six resulted in a substitution of amino acid residue with concomitant change in structural parameters. Conclusion We have identified a number of coding SNPs in behaviour-related genes, several of which change the amino acids of the proteins. Some of the canine SNPs exist in codons that are evolutionary conserved between five compared species, and predictions indicate that they may have a functional effect on the protein. The reported coding SNP frequency of the studied genes falls within the range of SNP frequencies reported earlier in the dog and other mammalian species. Novel SNPs are presented and the results show a significant genetic variation in expressed sequences in this group of genes. The results can contribute to an improved understanding of the genetics of behaviour.

  15. Annotating pathogenic non-coding variants in genic regions.

    Science.gov (United States)

    Gelfman, Sahar; Wang, Quanli; McSweeney, K Melodi; Ren, Zhong; La Carpia, Francesca; Halvorsen, Matt; Schoch, Kelly; Ratzon, Fanni; Heinzen, Erin L; Boland, Michael J; Petrovski, Slavé; Goldstein, David B

    2017-08-09

    Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes.While non-coding synonymous and intronic variants are often not under strong selective constraint, they can be pathogenic through affecting splicing or transcription. Here, the authors develop a score that uses sequence context alterations to predict pathogenicity of synonymous and non-coding genetic variants, and provide a web server of pre-computed scores.

  16. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    Full Text Available A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C 3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X , using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X , in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level, and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R . We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions. This property is true for all cardinalities of X motifs (from 4 to 20 and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non- X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  17. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first

  18. [Variation of CAG repeats in coding region of ATXN2 gene in different ethnic groups].

    Science.gov (United States)

    Chen, Xiao-Chen; Sun, Hao; Mi, Dong-Qing; Huang, Xiao-Qin; Lin, Ke-Qin; Yi, Wen; Yu, Liang; Shi, Lei; Shi, Li; Yang, Zhao-Qing; Chu, Jia-You

    2011-04-01

    Toinvestigate CAG repeats variation of ATXN2 gene coding region in six ethnic groups that live in comparatively different environments, to evaluate whether these variations are under positive selection, and to find factors driving selection effects, 291 unrelated healthy individuals were collected from six ethnic groups and their STR geneotyping was performed. The frequencies of alleles and genotypes were counted and thereby Slatkin's linearized Fst values were calculated. The UPGMA tree against this gene was constructed. The MDS analysis among these groups was carried out as well. The results from the linearized Fst values indicated that there were significant evolutionary differences of the STR in ATXN2 gene between Hui and Yi groups, but not among the other 4 groups. Further analysis was performed by combining our data with published data obtained from other groups. These results indicated that there were significant differences between Japanese and other groups including Hui, Hani, Yunnan Mongolian, and Inner Mongolian. Both Hui and Mongolian from Inner Mongolia were significantly different from Han. In conclusion, the six ethnic groups had their own distribution characterizations of allelic frequencies of ATXN2 STR, and the potential cause of frequency changes in rare alleles could be the consequence of positive selection.

  19. Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

    Directory of Open Access Journals (Sweden)

    Herington Adrian C

    2008-10-01

    Full Text Available Abstract Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS, which spans the promoter and untranslated regions of the ghrelin gene (GHRL. Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2. Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis, as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA genes, including 5' capping, polyadenylation, extensive splicing and short open reading

  20. Cloning and identification of the gene coding for the 140-kd subunit of Drosophila RNA polymerase II

    OpenAIRE

    Faust, Daniela M.; Renkawitz-Pohl, Renate; Falkenburg, Dieter; Gasch, Alexander; Bialojan, Siegfried; Young, Richard A.; Bautz, Ekkehard K. F.

    1986-01-01

    Genomic clones of Drosophila melanogaster were isolated from a λ library by cross-hybridization with the yeast gene coding for the 150-kd subunit of RNA polymerase II. Clones containing a region of ∼2.0 kb with strong homology to the yeast gene were shown to code for a 3.9-kb poly(A)+-RNA. Part of the coding region was cloned into an expression vector. A fusion protein was obtained which reacted with an antibody directed against RNA polymerase II of Drosophila. Peptide mapping of the fusion p...

  1. Rapid sequence divergence rates in the 5 prime regulatory regions of young Drosophila melanogaster duplicate gene pairs

    Directory of Open Access Journals (Sweden)

    Michael H. Kohn

    2008-01-01

    Full Text Available While it remains a matter of some debate, rapid sequence evolution of the coding sequences of duplicate genes is characteristic for early phases past duplication, but long established duplicates generally evolve under constraint, much like the rest of the coding genome. As for coding sequences, it may be possible to infer evolutionary rate, selection, and constraint via contrasts between duplicate gene divergence in the 5 prime regions and in the corresponding synonymous site divergence in the coding regions. Finding elevated rates for the 5 prime regions of duplicated genes, in addition to the coding regions, would enable statements regarding the early processes of duplicate gene evolution. Here, 1 kb of each of the 5 prime regulatory regions of Drosophila melanogaster duplicate gene pairs were mapped onto one another to isolate shared sequence blocks. Genetic distances within shared sequence blocks (d5’ were found to increase as a function of synonymous (dS, and to a lesser extend, amino-acid (dA site divergence between duplicates. The rate d5’/dS was found to rapidly decay from values > 1 in young duplicate pairs (dS 0.8. Such rapid rates of 5 prime evolution exceeding 1 (~neutral predominantly were found to occur in duplicate pairs with low amino-acid site divergence and that tended to be co-regulated when assayed on microarrays. Conceivably, functional redundancy and relaxation of selective constraint facilitates subsequent positive selection on the 5 prime regions of young duplicate genes. This might promote the evolution of new functions (neofunctionalization or division of labor among duplicate genes (subfunctionalization. In contrast, similar to the vast portion of the non-coding genome, the 5 prime regions of long-established gene duplicates appear to evolve under selective constraint, indicating that these long-established gene duplicates have assumed critical functions.

  2. Novel polymorphisms in UTR and coding region of inducible heat shock protein 70.1 gene in tropically adapted Indian zebu cattle (Bos indicus) and riverine buffalo (Bubalus bubalis).

    Science.gov (United States)

    Sodhi, M; Mukesh, M; Kishore, A; Mishra, B P; Kataria, R S; Joshi, B K

    2013-09-25

    Due to evolutionary divergence, cattle (taurine, and indicine) and buffalo are speculated to have different responses to heat stress condition. Variation in candidate genes associated with a heat-shock response may provide an insight into the dissimilarity and suggest targets for intervention. The present work was undertaken to characterize one of the inducible heat shock protein genes promoter and coding regions in diverse breeds of Indian zebu cattle and buffaloes. The genomic DNA from a panel of 117 unrelated animals representing 14 diversified native cattle breeds and 6 buffalo breeds were utilized to determine the complete sequence and gene diversity of HSP70.1 gene. The coding region of HSP70.1 gene in Indian zebu cattle, Bos taurus and buffalo was similar in length (1,926 bp) encoding a HSP70 protein of 641 amino acids with a calculated molecular weight (Mw) of 70.26 kDa. However buffalo had a longer 5' and 3' untranslated region (UTR) of 204 and 293 nucleotides respectively, in comparison to Indian zebu cattle and Bos taurus wherein length of 5' and 3'-UTR was 172 and 286 nucleotides, respectively. The increased length of buffalo HSP70.1 gene compared to indicine and taurine gene was due to two insertions each in 5' and 3'-UTR. Comparative sequence analysis of cattle (taurine and indicine) and buffalo HSP70.1 gene revealed a total of 54 gene variations (50 SNPs and 4 INDELs) among the three species in the HSP70.1 gene. The minor allele frequencies of these nucleotide variations varied from 0.03 to 0.5 with an average of 0.26. Among the 14 B. indicus cattle breeds studied, a total of 19 polymorphic sites were identified: 4 in the 5'-UTR and 15 in the coding region (of these 2 were non-synonymous). Analysis among buffalo breeds revealed 15 SNPs throughout the gene: 6 at the 5' flanking region and 9 in the coding region. In bubaline 5'-UTR, 2 additional putative transcription factor binding sites (Elk-1 and C-Re1) were identified, other than three common sites

  3. SNPs in the coding region of the metastasis-inducing gene MACC1 and clinical outcome in colorectal cancer

    Directory of Open Access Journals (Sweden)

    Schmid Felicitas

    2012-07-01

    Full Text Available Abstract Background Colorectal cancer is one of the main cancers in the Western world. About 90% of the deaths arise from formation of distant metastasis. The expression of the newly identified gene metastasis associated in colon cancer 1 (MACC1 is a prognostic indicator for colon cancer metastasis. Here, we analyzed for the first time the impact of single nucleotide polymorphisms (SNPs in the coding region of MACC1 for clinical outcome of colorectal cancer patients. Additionally, we screened met proto-oncogene (Met, the transcriptional target gene of MACC1, for mutations. Methods We sequenced the coding exons of MACC1 in 154 colorectal tumors (stages I, II and III and the crucial exons of Met in 60 colorectal tumors (stages I, II and III. We analyzed the association of MACC1 polymorphisms with clinical data, including metachronous metastasis, UICC stages, tumor invasion, lymph node metastasis and patients’ survival (n = 154, stages I, II and III. Furthermore, we performed biological assays in order to evaluate the functional impact of MACC1 SNPs on the motility of colorectal cancer cells. Results We genotyped three MACC1 SNPs in the coding region. Thirteen % of the tumors had the genotype cg (rs4721888, L31V, 48% a ct genotype (rs975263, S515L and 84% a gc or cc genotype (rs3735615, R804T. We found no association of these SNPs with clinicopathological parameters or with patients’ survival, when analyzing the entire patients’ cohort. An increased risk for a shorter metastasis-free survival of patients with a ct genotype (rs975263 was observed in younger colon cancer patients with stage I or II (P = 0.041, n = 18. In cell culture, MACC1 SNPs did not affect MACC1-induced cell motility and proliferation. Conclusion In summary, the identification of coding MACC1 SNPs in primary colorectal tumors does not improve the prediction for metastasis formation or for patients’ survival compared to MACC1 expression analysis alone. The ct genotype (rs

  4. De novo origin of human protein-coding genes.

    Directory of Open Access Journals (Sweden)

    Dong-Dong Wu

    2011-11-01

    Full Text Available The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA-seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes.

  5. De Novo Origin of Human Protein-Coding Genes

    Science.gov (United States)

    Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

    2011-01-01

    The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831

  6. Both noncoding and protein-coding RNAs contribute to gene expression evolution in the primate brain.

    Science.gov (United States)

    Babbitt, Courtney C; Fedrigo, Olivier; Pfefferle, Adam D; Boyle, Alan P; Horvath, Julie E; Furey, Terrence S; Wray, Gregory A

    2010-01-18

    Despite striking differences in cognition and behavior between humans and our closest primate relatives, several studies have found little evidence for adaptive change in protein-coding regions of genes expressed primarily in the brain. Instead, changes in gene expression may underlie many cognitive and behavioral differences. Here, we used digital gene expression: tag profiling (here called Tag-Seq, also called DGE:tag profiling) to assess changes in global transcript abundance in the frontal cortex of the brains of 3 humans, 3 chimpanzees, and 3 rhesus macaques. A substantial fraction of transcripts we identified as differentially transcribed among species were not assayed in previous studies based on microarrays. Differentially expressed tags within coding regions are enriched for gene functions involved in synaptic transmission, transport, oxidative phosphorylation, and lipid metabolism. Importantly, because Tag-Seq technology provides strand-specific information about all polyadenlyated transcripts, we were able to assay expression in noncoding intragenic regions, including both sense and antisense noncoding transcripts (relative to nearby genes). We find that many noncoding transcripts are conserved in both location and expression level between species, suggesting a possible functional role. Lastly, we examined the overlap between differential gene expression and signatures of positive selection within putative promoter regions, a sign that these differences represent adaptations during human evolution. Comparative approaches may provide important insights into genes responsible for differences in cognitive functions between humans and nonhuman primates, as well as highlighting new candidate genes for studies investigating neurological disorders.

  7. Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

    Science.gov (United States)

    Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

    2017-10-03

    Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors, one is composed of 45 elements, characterizing the information entropies of the disease genes distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements with the first one to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.

  8. Differential DNA methylation profiles of coding and non-coding genes define hippocampal sclerosis in human temporal lobe epilepsy

    Science.gov (United States)

    Miller-Delaney, Suzanne F.C.; Bryan, Kenneth; Das, Sudipto; McKiernan, Ross C.; Bray, Isabella M.; Reynolds, James P.; Gwinn, Ryder; Stallings, Raymond L.

    2015-01-01

    Temporal lobe epilepsy is associated with large-scale, wide-ranging changes in gene expression in the hippocampus. Epigenetic changes to DNA are attractive mechanisms to explain the sustained hyperexcitability of chronic epilepsy. Here, through methylation analysis of all annotated C-phosphate-G islands and promoter regions in the human genome, we report a pilot study of the methylation profiles of temporal lobe epilepsy with or without hippocampal sclerosis. Furthermore, by comparative analysis of expression and promoter methylation, we identify methylation sensitive non-coding RNA in human temporal lobe epilepsy. A total of 146 protein-coding genes exhibited altered DNA methylation in temporal lobe epilepsy hippocampus (n = 9) when compared to control (n = 5), with 81.5% of the promoters of these genes displaying hypermethylation. Unique methylation profiles were evident in temporal lobe epilepsy with or without hippocampal sclerosis, in addition to a common methylation profile regardless of pathology grade. Gene ontology terms associated with development, neuron remodelling and neuron maturation were over-represented in the methylation profile of Watson Grade 1 samples (mild hippocampal sclerosis). In addition to genes associated with neuronal, neurotransmitter/synaptic transmission and cell death functions, differential hypermethylation of genes associated with transcriptional regulation was evident in temporal lobe epilepsy, but overall few genes previously associated with epilepsy were among the differentially methylated. Finally, a panel of 13, methylation-sensitive microRNA were identified in temporal lobe epilepsy including MIR27A, miR-193a-5p (MIR193A) and miR-876-3p (MIR876), and the differential methylation of long non-coding RNA documented for the first time. The present study therefore reports select, genome-wide DNA methylation changes in human temporal lobe epilepsy that may contribute to the molecular architecture of the epileptic brain. PMID

  9. Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

    KAUST Repository

    Alam, Tanvir

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.

  10. Evidence for widespread degradation of gene control regions in hominid genomes.

    Directory of Open Access Journals (Sweden)

    Peter D Keightley

    2005-02-01

    Full Text Available Although sequences containing regulatory elements located close to protein-coding genes are often only weakly conserved during evolution, comparisons of rodent genomes have implied that these sequences are subject to some selective constraints. Evolutionary conservation is particularly apparent upstream of coding sequences and in first introns, regions that are enriched for regulatory elements. By comparing the human and chimpanzee genomes, we show here that there is almost no evidence for conservation in these regions in hominids. Furthermore, we show that gene expression is diverging more rapidly in hominids than in murids per unit of neutral sequence divergence. By combining data on polymorphism levels in human noncoding DNA and the corresponding human-chimpanzee divergence, we show that the proportion of adaptive substitutions in these regions in hominids is very low. It therefore seems likely that the lack of conservation and increased rate of gene expression divergence are caused by a reduction in the effectiveness of natural selection against deleterious mutations because of the low effective population sizes of hominids. This has resulted in the accumulation of a large number of deleterious mutations in sequences containing gene control elements and hence a widespread degradation of the genome during the evolution of humans and chimpanzees.

  11. New PAH gene promoter KLF1 and 3'-region C/EBPalpha motifs influence transcription in vitro.

    Science.gov (United States)

    Klaassen, Kristel; Stankovic, Biljana; Kotur, Nikola; Djordjevic, Maja; Zukic, Branka; Nikcevic, Gordana; Ugrin, Milena; Spasovski, Vesna; Srzentic, Sanja; Pavlovic, Sonja; Stojiljkovic, Maja

    2017-02-01

    Phenylketonuria (PKU) is a metabolic disease caused by mutations in the phenylalanine hydroxylase (PAH) gene. Although the PAH genotype remains the main determinant of PKU phenotype severity, genotype-phenotype inconsistencies have been reported. In this study, we focused on unanalysed sequences in non-coding PAH gene regions to assess their possible influence on the PKU phenotype. We transiently transfected HepG2 cells with various chloramphenicol acetyl transferase (CAT) reporter constructs which included PAH gene non-coding regions. Selected non-coding regions were indicated by in silico prediction to contain transcription factor binding sites. Furthermore, electrophoretic mobility shift assay (EMSA) and supershift assays were performed to identify which transcriptional factors were engaged in the interaction. We found novel KLF1 motif in the PAH promoter, which decreases CAT activity by 50 % in comparison to basal transcription in vitro. The cytosine at the c.-170 promoter position creates an additional binding site for the protein complex involving KLF1 transcription factor. Moreover, we assessed for the first time the role of a multivariant variable number tandem repeat (VNTR) region located in the 3'-region of the PAH gene. We found that the VNTR3, VNTR7 and VNTR8 constructs had approximately 60 % of CAT activity. The regulation is mediated by the C/EBPalpha transcription factor, present in protein complex binding to VNTR3. Our study highlighted two novel promoter KLF1 and 3'-region C/EBPalpha motifs in the PAH gene which decrease transcription in vitro and, thus, could be considered as PAH expression modifiers. New transcription motifs in non-coding regions will contribute to better understanding of the PKU phenotype complexity and may become important for the optimisation of PKU treatment.

  12. Ribosome Profiling Reveals Pervasive Translation Outside of Annotated Protein-Coding Genes

    Directory of Open Access Journals (Sweden)

    Nicholas T. Ingolia

    2014-09-01

    Full Text Available Ribosome profiling suggests that ribosomes occupy many regions of the transcriptome thought to be noncoding, including 5′ UTRs and long noncoding RNAs (lncRNAs. Apparent ribosome footprints outside of protein-coding regions raise the possibility of artifacts unrelated to translation, particularly when they occupy multiple, overlapping open reading frames (ORFs. Here, we show hallmarks of translation in these footprints: copurification with the large ribosomal subunit, response to drugs targeting elongation, trinucleotide periodicity, and initiation at early AUGs. We develop a metric for distinguishing between 80S footprints and nonribosomal sources using footprint size distributions, which validates the vast majority of footprints outside of coding regions. We present evidence for polypeptide production beyond annotated genes, including the induction of immune responses following human cytomegalovirus (HCMV infection. Translation is pervasive on cytosolic transcripts outside of conserved reading frames, and direct detection of this expanded universe of translated products enables efforts at understanding how cells manage and exploit its consequences.

  13. Nucleotide sequence of the Escherichia coli pyrE gene and of the DNA in front of the protein-coding region

    DEFF Research Database (Denmark)

    Poulsen, Peter; Jensen, Kaj Frank; Valentin-Hansen, Poul

    1983-01-01

    leader segment in front of the protein-coding region. This leader contains a structure with features characteristic for a (translated?) rho-independent transcriptional terminator, which is preceded by a cluster of uridylate residues. This indicates that the frequency of pyrE transcription is regulated......Orotate phosphoribosyltransferase (EC 2.4.2.10) was purified to electrophoretic homogeneity from a strain of Escherichia coli containing the pyrE gene cloned on a multicopy plasmid. The relative molecular masses (Mr) of the native enzyme and its subunit were estimated by means of gel filtration...

  14. A study on climatic adaptation of dipteran mitochondrial protein coding genes

    Directory of Open Access Journals (Sweden)

    Debajyoti Kabiraj

    2017-10-01

    Full Text Available Diptera, the true flies are frequently found in nature and their habitat is found all over the world including Antarctica and Polar Regions. The number of documented species for order diptera is quite high and thought to be 14% of the total animal present in the earth [1]. Most of the study in diptera has focused on the taxa of economic and medical importance, such as the fruit flies Ceratitis capitata and Bactrocera spp. (Tephritidae, which are serious agricultural pests; the blowflies (Calliphoridae and oestrid flies (Oestridae, which can cause myiasis; the anopheles mosquitoes (Culicidae, are the vectors of malaria; and leaf-miners (Agromyzidae, vegetable and horticultural pests [2]. Insect mitochondrion consists of 13 protein coding genes, 22 tRNAs and 2 rRNAs, are the remnant portion of alpha-proteobacteria is responsible for simultaneous function of energy production and thermoregulation of the cell through the bi-genomic system thus different adaptability in different climatic condition might have compensated by complementary changes is the both genomes [3,4]. In this study we have collected complete mitochondrial genome and occurrence data of one hundred thirteen such dipteran insects from different databases and literature survey. Our understanding of the genetic basis of climatic adaptation in diptera is limited to the basic information on the occurrence location of those species and mito genetic factors underlying changes in conspicuous phenotypes. To examine this hypothesis, we have taken an approach of Nucleotide substitution analysis for 13 protein coding genes of mitochondrial DNA individually and combined by different software for monophyletic group as well as paraphyletic group of dipteran species. Moreover, we have also calculated codon adaptation index for all dipteran mitochondrial protein coding genes. Following this work, we have classified our sample organisms according to their location data from GBIF (https

  15. Experimental annotation of post-translational features and translated coding regions in the pathogen Salmonella Typhimurium

    Energy Technology Data Exchange (ETDEWEB)

    Ansong, Charles; Tolic, Nikola; Purvine, Samuel O.; Porwollik, Steffen; Jones, Marcus B.; Yoon, Hyunjin; Payne, Samuel H.; Martin, Jessica L.; Burnet, Meagan C.; Monroe, Matthew E.; Venepally, Pratap; Smith, Richard D.; Peterson, Scott; Heffron, Fred; Mcclelland, Michael; Adkins, Joshua N.

    2011-08-25

    Complete and accurate genome annotation is crucial for comprehensive and systematic studies of biological systems. For example systems biology-oriented genome scale modeling efforts greatly benefit from accurate annotation of protein-coding genes to develop proper functioning models. However, determining protein-coding genes for most new genomes is almost completely performed by inference, using computational predictions with significant documented error rates (> 15%). Furthermore, gene prediction programs provide no information on biologically important post-translational processing events critical for protein function. With the ability to directly measure peptides arising from expressed proteins, mass spectrometry-based proteomics approaches can be used to augment and verify coding regions of a genomic sequence and importantly detect post-translational processing events. In this study we utilized “shotgun” proteomics to guide accurate primary genome annotation of the bacterial pathogen Salmonella Typhimurium 14028 to facilitate a systems-level understanding of Salmonella biology. The data provides protein-level experimental confirmation for 44% of predicted protein-coding genes, suggests revisions to 48 genes assigned incorrect translational start sites, and uncovers 13 non-annotated genes missed by gene prediction programs. We also present a comprehensive analysis of post-translational processing events in Salmonella, revealing a wide range of complex chemical modifications (70 distinct modifications) and confirming more than 130 signal peptide and N-terminal methionine cleavage events in Salmonella. This study highlights several ways in which proteomics data applied during the primary stages of annotation can improve the quality of genome annotations, especially with regards to the annotation of mature protein products.

  16. The artificial zinc finger coding gene 'Jazz' binds the utrophin promoter and activates transcription.

    Science.gov (United States)

    Corbi, N; Libri, V; Fanciulli, M; Tinsley, J M; Davies, K E; Passananti, C

    2000-06-01

    Up-regulation of utrophin gene expression is recognized as a plausible therapeutic approach in the treatment of Duchenne muscular dystrophy (DMD). We have designed and engineered new zinc finger-based transcription factors capable of binding and activating transcription from the promoter of the dystrophin-related gene, utrophin. Using the recognition 'code' that proposes specific rules between zinc finger primary structure and potential DNA binding sites, we engineered a new gene named 'Jazz' that encodes for a three-zinc finger peptide. Jazz belongs to the Cys2-His2 zinc finger type and was engineered to target the nine base pair DNA sequence: 5'-GCT-GCT-GCG-3', present in the promoter region of both the human and mouse utrophin gene. The entire zinc finger alpha-helix region, containing the amino acid positions that are crucial for DNA binding, was specifically chosen on the basis of the contacts more frequently represented in the available list of the 'code'. Here we demonstrate that Jazz protein binds specifically to the double-stranded DNA target, with a dissociation constant of about 32 nM. Band shift and super-shift experiments confirmed the high affinity and specificity of Jazz protein for its DNA target. Moreover, we show that chimeric proteins, named Gal4-Jazz and Sp1-Jazz, are able to drive the transcription of a test gene from the human utrophin promoter.

  17. Gene mutation in ATM/PI3K region of nasopharyngeal carcinoma cell lines

    International Nuclear Information System (INIS)

    Wang Hongmei; Wu Xinyao; Xia Yunfei

    2002-01-01

    Objective: To define the correlation between nasopharyngeal carcinoma (NPC) cell radiosensitivity and gene mutation in the ATM/PI3K coding region. Methods: The gene mutation in the ATM/PI3K region of nasopharyngeal carcinoma cell lines which vary in radiosensitivity, was monitored by reverse transcription-polymerase chain reaction (RT-PCR) and fluorescence-marked ddNTP cycle sequencing technique. Results: No gene mutation was detected in the ATM/PI3K region of either CNE1 or CNE2. Conclusion: Disparity in intrinsic radiosensitivity between different NPC cell lines depends on some other factors and mechanism without being related to ATM/PI3K mutations

  18. Single nucleotide polymorphism in transcriptional regulatory regions and expression of environmentally responsive genes

    International Nuclear Information System (INIS)

    Wang, Xuting; Tomso, Daniel J.; Liu Xuemei; Bell, Douglas A.

    2005-01-01

    Single nucleotide polymorphisms (SNPs) in the human genome are DNA sequence variations that can alter an individual's response to environmental exposure. SNPs in gene coding regions can lead to changes in the biological properties of the encoded protein. In contrast, SNPs in non-coding gene regulatory regions may affect gene expression levels in an allele-specific manner, and these functional polymorphisms represent an important but relatively unexplored class of genetic variation. The main challenge in analyzing these SNPs is a lack of robust computational and experimental methods. Here, we first outline mechanisms by which genetic variation can impact gene regulation, and review recent findings in this area; then, we describe a methodology for bioinformatic discovery and functional analysis of regulatory SNPs in cis-regulatory regions using the assembled human genome sequence and databases on sequence polymorphism and gene expression. Our method integrates SNP and gene databases and uses a set of computer programs that allow us to: (1) select SNPs, from among the >9 million human SNPs in the NCBI dbSNP database, that are similar to cis-regulatory element (RE) consensus sequences; (2) map the selected dbSNP entries to the human genome assembly in order to identify polymorphic REs near gene start sites; (3) prioritize the candidate polymorphic RE containing genes by searching the existing genotype and gene expression data sets. The applicability of this system has been demonstrated through studies on p53 responsive elements and is being extended to additional pathways and environmentally responsive genes

  19. Structural and functional studies of a family of Dictyostelium discoideum developmentally regulated, prestalk genes coding for small proteins

    Directory of Open Access Journals (Sweden)

    Escalante Ricardo

    2008-01-01

    Full Text Available Abstract Background The social amoeba Dictyostelium discoideum executes a multicellular development program upon starvation. This morphogenetic process requires the differential regulation of a large number of genes and is coordinated by extracellular signals. The MADS-box transcription factor SrfA is required for several stages of development, including slug migration and spore terminal differentiation. Results Subtractive hybridization allowed the isolation of a gene, sigN (SrfA-induced gene N, that was dependent on the transcription factor SrfA for expression at the slug stage of development. Homology searches detected the existence of a large family of sigN-related genes in the Dictyostelium discoideum genome. The 13 most similar genes are grouped in two regions of chromosome 2 and have been named Group1 and Group2 sigN genes. The putative encoded proteins are 87–89 amino acids long. All these genes have a similar structure, composed of a first exon containing a 13 nucleotides long open reading frame and a second exon comprising the remaining of the putative coding region. The expression of these genes is induced at10 hours of development. Analyses of their promoter regions indicate that these genes are expressed in the prestalk region of developing structures. The addition of antibodies raised against SigN Group 2 proteins induced disintegration of multi-cellular structures at the mound stage of development. Conclusion A large family of genes coding for small proteins has been identified in D. discoideum. Two groups of very similar genes from this family have been shown to be specifically expressed in prestalk cells during development. Functional studies using antibodies raised against Group 2 SigN proteins indicate that these genes could play a role during multicellular development.

  20. Identification of coding and non-coding mutational hotspots in cancer genomes.

    Science.gov (United States)

    Piraino, Scott W; Furney, Simon J

    2017-01-05

    The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from

  1. Sequence of the intron/exon junctions of the coding region of the human androgen receptor gene and identification of a point mutation in a family with complete androgen insensitivity

    International Nuclear Information System (INIS)

    Lubahn, D.B.; Simental, J.A.; Higgs, H.N.; Wilson, E.M.; French, F.S.; Brown, T.R.; Migeon, C.J.

    1989-01-01

    Androgens act through a receptor protein (AR) to mediate sex differentiation and development of the male phenotype. The authors have isolated the eight exons in the amino acid coding region of the AR gene from a human X chromosome library. Nucleotide sequences of the AR gene intron/exon boundaries were determined for use in designing synthetic oligonucleotide primers to bracket coding exons for amplification by the polymerase chain reaction. Genomic DNA was amplified from 46, XY phenotypic female siblings with complete androgen insensitivity syndrome. AR binding affinity for dihydrotestosterone in the affected siblings was lower than in normal males, but the binding capacity was normal. Sequence analysis of amplified exons demonstrated within the AR steroid-binding domain (exon G) a single guanine to adenine mutation, resulting in replacement of valine with methionine at amino acid residue 866. As expected, the carrier mother had both normal and mutant AR genes. Thus, a single point mutation in the steroid-binding domain of the AR gene correlated with the expression of an AR protein ineffective in stimulating male sexual development

  2. Gene-Auto: Automatic Software Code Generation for Real-Time Embedded Systems

    Science.gov (United States)

    Rugina, A.-E.; Thomas, D.; Olive, X.; Veran, G.

    2008-08-01

    This paper gives an overview of the Gene-Auto ITEA European project, which aims at building a qualified C code generator from mathematical models under Matlab-Simulink and Scilab-Scicos. The project is driven by major European industry partners, active in the real-time embedded systems domains. The Gene- Auto code generator will significantly improve the current development processes in such domains by shortening the time to market and by guaranteeing the quality of the generated code through the use of formal methods. The first version of the Gene-Auto code generator has already been released and has gone thought a validation phase on real-life case studies defined by each project partner. The validation results are taken into account in the implementation of the second version of the code generator. The partners aim at introducing the Gene-Auto results into industrial development by 2010.

  3. Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs

    Science.gov (United States)

    Schnall-Levin, Michael; Rissland, Olivia S.; Johnston, Wendy K.; Perrimon, Norbert; Bartel, David P.; Berger, Bonnie

    2011-01-01

    MicroRNAs (miRNAs) regulate numerous biological processes by base-pairing with target messenger RNAs (mRNAs), primarily through sites in 3′ untranslated regions (UTRs), to direct the repression of these targets. Although miRNAs have sometimes been observed to target genes through sites in open reading frames (ORFs), large-scale studies have shown such targeting to be generally less effective than 3′ UTR targeting. Here, we show that several miRNAs each target significant groups of genes through multiple sites within their coding regions. This ORF targeting, which mediates both predictable and effective repression, arises from highly repeated sequences containing miRNA target sites. We show that such sequence repeats largely arise through evolutionary duplications and occur particularly frequently within families of paralogous C2H2 zinc-finger genes, suggesting the potential for their coordinated regulation. Examples of ORFs targeted by miR-181 include both the well-known tumor suppressor RB1 and RBAK, encoding a C2H2 zinc-finger protein and transcriptional binding partner of RB1. Our results indicate a function for repeat-rich coding sequences in mediating post-transcriptional regulation and reveal circumstances in which miRNA-mediated repression through ORF sites can be reliably predicted. PMID:21685129

  4. Investigation of genes coding for inflammatory components in Parkinson's disease.

    Science.gov (United States)

    Håkansson, Anna; Westberg, Lars; Nilsson, Staffan; Buervenich, Silvia; Carmine, Andrea; Holmberg, Björn; Sydow, Olof; Olson, Lars; Johnels, Bo; Eriksson, Elias; Nissbrandt, Hans

    2005-05-01

    Several findings obtained recently indicate that inflammation may contribute to the pathogenesis in Parkinson's disease (PD). Genetic variants of genes coding for components involved in immune reactions in the brain might therefore influence the risk of developing PD or the age of disease onset. Five single nucleotide polymorphisms (SNPs) in the genes coding for interferon-gamma (IFN-gamma; T874A in intron 1), interferon-gamma receptor 2 (IFN-gamma R2; Gln64Arg), interleukin-10 (IL-10; G1082A in the promoter region), platelet-activating factor acetylhydrolase (PAF-AH; Val379Ala), and intercellular adhesion molecule 1 (ICAM-1; Lys469Glu) were genotyped, using pyrosequencing, in 265 patients with PD and 308 controls. None of the investigated SNPs was found to be associated with PD; however, the G1082A polymorphism in the IL-10 gene promoter was found to be related to the age of disease onset. Linear regression showed a significantly earlier onset with more A-alleles (P = 0.0095; after Bonferroni correction, P = 0.048), resulting in a 5-year delayed age of onset of the disease for individuals having two G-alleles compared with individuals having two A-alleles. The results indicate that the IL-10 G1082A SNP could possibly be related to the age of onset of PD. Copyright 2005 Movement Disorder Society.

  5. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    Science.gov (United States)

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5

  6. Functional Diets Modulate lncRNA-Coding RNAs and Gene Interactions in the Intestine of Rainbow Trout Oncorhynchus mykiss.

    Science.gov (United States)

    Núñez-Acuña, Gustavo; Détrée, Camille; Gallardo-Escárate, Cristian; Gonçalves, Ana Teresa

    2017-06-01

    The advent of functional genomics has sparked the interest in inferring the function of non-coding regions from the transcriptome in non-model species. However, numerous biological processes remain understudied from this perspective, including intestinal immunity in farmed fish. The aim of this study was to infer long non-coding RNA (lncRNAs) expression profiles in rainbow trout (Oncorhynchus mykiss) fed for 30 days with functional diets based on pre- and probiotics. For this, whole transcriptome sequencing was conducted through Illumina technology, and lncRNAs were mined to evaluate transcriptional activity in conjunction with known protein sequences. To detect differentially expressed transcripts, 880 novels and 9067 previously described O. mykiss lncRNAs were used. Expression levels and genome co-localization correlations with coding genes were also analyzed. Significant differences in gene expression were primarily found in the probiotic diet, which had a twofold downregulation of lncRNAs compared to other treatments. Notable differences by diet were also evidenced between the coding genes of distinct metabolic processes. In contrast, genome co-localization of lncRNAs with coding genes was similar for all diets. This study contributes novel knowledge regarding lncRNAs in fish, suggesting key roles in salmons fed with in-feed additives with the capacity to modulate the intestinal homeostasis and host health.

  7. Identification of an ICP27-responsive element in the coding region of a herpes simplex virus type 1 late gene.

    Science.gov (United States)

    Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A

    2010-03-01

    During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.

  8. Human growth hormone-related latrogenic Creutzfeldt-Jakob disease: Search for a genetic susceptibility by analysis of the PRNP coding region

    Energy Technology Data Exchange (ETDEWEB)

    Jaegly, A.; Boussin, F.; Deslys, J.P. [CEA/CRSSA/DSV/DPTE, Fontenay-aux-Roses (France)] [and others

    1995-05-20

    The human PRNP gene encoding PrP is located on chromosome 20 and consists of two exons and a single intron. The open reading frame is entirely fitted into the second exon. Genetic studies indicate that all of the familial and several sporadic forms of TSSEs are associated with mutations in the PRNP 759-bp coding region. Moreover, homozygosity at codon 129, a locus harboring a polymorphism among the general population, was proposed as a genetic susceptibility marker for both sporadic and iatrogenic CJD. To assess whether additional genetic predisposition markers exist in the PRNP gene, the authors sequenced the PRNP coding region of 17 of the 32 French patients who developed a hGH-related CJD.

  9. Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

    Directory of Open Access Journals (Sweden)

    Rachel Caldwell

    2015-01-01

    Full Text Available There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length.

  10. Origins of gene, genetic code, protein and life

    Indian Academy of Sciences (India)

    Unknown

    have concluded that newly-born genes are products of nonstop frames (NSF) ... research to determine tertiary structures of proteins such ... the present earth, is favourable for new genes to arise, if ..... NGG) in the universal genetic code table, cannot satisfy ..... which has been proposed to explain the development of life on.

  11. A photon dominated region code comparison study

    NARCIS (Netherlands)

    Roellig, M.; Abel, N. P.; Bell, T.; Bensch, F.; Black, J.; Ferland, G. J.; Jonkheid, B.; Kamp, I.; Kaufman, M. J.; Le Bourlot, J.; Le Petit, F.; Meijerink, R.; Morata, O.; Ossenkopf, Volker; Roueff, E.; Shaw, G.; Spaans, M.; Sternberg, A.; Stutzki, J.; Thi, W.-F.; van Dishoeck, E. F.; van Hoof, P. A. M.; Viti, S.; Wolfire, M. G.

    Aims. We present a comparison between independent computer codes, modeling the physics and chemistry of interstellar photon dominated regions (PDRs). Our goal was to understand the mutual differences in the PDR codes and their effects on the physical and chemical structure of the model clouds, and

  12. Bistability in self-activating genes regulated by non-coding RNAs

    International Nuclear Information System (INIS)

    Miro-Bueno, Jesus

    2015-01-01

    Non-coding RNA molecules are able to regulate gene expression and play an essential role in cells. On the other hand, bistability is an important behaviour of genetic networks. Here, we propose and study an ODE model in order to show how non-coding RNA can produce bistability in a simple way. The model comprises a single gene with positive feedback that is repressed by non-coding RNA molecules. We show how the values of all the reaction rates involved in the model are able to control the transitions between the high and low states. This new model can be interesting to clarify the role of non-coding RNA molecules in genetic networks. As well, these results can be interesting in synthetic biology for developing new genetic memories and biomolecular devices based on non-coding RNAs

  13. Characterisation of five candidate genes within the ETEC F4ab/ac candidate region in pigs

    DEFF Research Database (Denmark)

    Jacobsen, Mette Juul; Cirera Salicio, Susanna; Joller, David

    2011-01-01

    by haplotype sharing to a 2.5 Mb region on pig chromosome 13, a region containing 18 annotated genes. FINDINGS: The coding regions of five candidate genes for susceptibility to ETEC F4ab/ac infection (TFRC, ACK1, MUC20, MUC4 and KIAA0226), all located in the 2.5 Mb region, were investigated for the presence...... polymorphism in exon 22 of KIAA0226. Transcriptional profiles of the five genes were investigated in a porcine tissue panel including various intestinal tissues. All five genes were expressed in intestinal tissues at different levels but none of the genes were found differentially expressed between ETEC F4ab/ac...... of the amino acids composition. However, we cannot exclude that the five tested genes are bona fide candidate genes for susceptibility to ETEC F4ab/ac infection since the identified polymorphism might affect the translational apparatus, alternative splice forms may exist and post translational mechanisms might...

  14. 5' Region of the human interleukin 4 gene: structure and potential regulatory elements

    Energy Technology Data Exchange (ETDEWEB)

    Eder, A; Krafft-Czepa, H; Krammer, P H

    1988-01-25

    The lymphokine Interleukin 4 (IL-4) is secreted by antigen or mitogen activated T lymphocytes. IL-4 stimulates activation and differentiation of B lymphocytes and growth of T lymphocytes and mast cells. The authors isolated the human IL-4 gene from a lambda EMBL3 genomic library. As a probe they used a synthetic oligonucleotide spanning position 40 to 79 of the published IL-4 cDNA sequence. The 5' promoter region contains several sequence elements which may have a cis-acting regulatory function for IL-4 gene expression. These elements include a TATA-box, three CCAAT-elements (two are on the non-coding strand) and an octamer motif. A comparison of the 5' flanking region of the human murine IL-4 gene (4) shows that the region between position -306 and +44 is highly conserved (83% homology).

  15. New tools to analyze overlapping coding regions.

    Science.gov (United States)

    Bayegan, Amir H; Garcia-Martin, Juan Antonio; Clote, Peter

    2016-12-13

    Retroviruses transcribe messenger RNA for the overlapping Gag and Gag-Pol polyproteins, by using a programmed -1 ribosomal frameshift which requires a slippery sequence and an immediate downstream stem-loop secondary structure, together called frameshift stimulating signal (FSS). It follows that the molecular evolution of this genomic region of HIV-1 is highly constrained, since the retroviral genome must contain a slippery sequence (sequence constraint), code appropriate peptides in reading frames 0 and 1 (coding requirements), and form a thermodynamically stable stem-loop secondary structure (structure requirement). We describe a unique computational tool, RNAsampleCDS, designed to compute the number of RNA sequences that code two (or more) peptides p,q in overlapping reading frames, that are identical (or have BLOSUM/PAM similarity that exceeds a user-specified value) to the input peptides p,q. RNAsampleCDS then samples a user-specified number of messenger RNAs that code such peptides; alternatively, RNAsampleCDS can exactly compute the position-specific scoring matrix and codon usage bias for all such RNA sequences. Our software allows the user to stipulate overlapping coding requirements for all 6 possible reading frames simultaneously, even allowing IUPAC constraints on RNA sequences and fixing GC-content. We generalize the notion of codon preference index (CPI) to overlapping reading frames, and use RNAsampleCDS to generate control sequences required in the computation of CPI. Moreover, by applying RNAsampleCDS, we are able to quantify the extent to which the overlapping coding requirement in HIV-1 [resp. HCV] contribute to the formation of the stem-loop [resp. double stem-loop] secondary structure known as the frameshift stimulating signal. Using our software, we confirm that certain experimentally determined deleterious HCV mutations occur in positions for which our software RNAsampleCDS and RNAiFold both indicate a single possible nucleotide. We

  16. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

    Science.gov (United States)

    Pujar, Shashikant; O'Leary, Nuala A; Farrell, Catherine M; Loveland, Jane E; Mudge, Jonathan M; Wallin, Craig; Girón, Carlos G; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; Martin, Fergal J; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Suner, Marie-Marthe; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bruford, Elspeth A; Bult, Carol J; Frankish, Adam; Murphy, Terence; Pruitt, Kim D

    2018-01-04

    The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

  17. Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

    KAUST Repository

    Alam, Tanvir; Medvedeva, Yulia A.; Jia, Hui; Brown, James B.; Lipovich, Leonard; Bajic, Vladimir B.

    2014-01-01

    raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted

  18. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Directory of Open Access Journals (Sweden)

    Nicolás Sarute

    Full Text Available Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

  19. The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

    Science.gov (United States)

    Sarute, Nicolás; Calderón, Marina Gallo; Pérez, Ruben; La Torre, José; Hernández, Martín; Francia, Lourdes; Panzera, Yanina

    2013-01-01

    Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

  20. Genic regions of a large salamander genome contain long introns and novel genes

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Full Text Available Abstract Background The basis of genome size variation remains an outstanding question because DNA sequence data are lacking for organisms with large genomes. Sixteen BAC clones from the Mexican axolotl (Ambystoma mexicanum: c-value = 32 × 109 bp were isolated and sequenced to characterize the structure of genic regions. Results Annotation of genes within BACs showed that axolotl introns are on average 10× longer than orthologous vertebrate introns and they are predicted to contain more functional elements, including miRNAs and snoRNAs. Loci were discovered within BACs for two novel EST transcripts that are differentially expressed during spinal cord regeneration and skin metamorphosis. Unexpectedly, a third novel gene was also discovered while manually annotating BACs. Analysis of human-axolotl protein-coding sequences suggests there are 2% more lineage specific genes in the axolotl genome than the human genome, but the great majority (86% of genes between axolotl and human are predicted to be 1:1 orthologs. Considering that axolotl genes are on average 5× larger than human genes, the genic component of the salamander genome is estimated to be incredibly large, approximately 2.8 gigabases! Conclusion This study shows that a large salamander genome has a correspondingly large genic component, primarily because genes have incredibly long introns. These intronic sequences may harbor novel coding and non-coding sequences that regulate biological processes that are unique to salamanders.

  1. MicroRNA genes and their target 3'-untranslated regions are infrequently somatically mutated in ovarian cancers.

    Directory of Open Access Journals (Sweden)

    Georgina L Ryland

    Full Text Available MicroRNAs are key regulators of gene expression and have been shown to have altered expression in a variety of cancer types, including epithelial ovarian cancer. MiRNA function is most often achieved through binding to the 3'-untranslated region of the target protein coding gene. Mutation screening using massively-parallel sequencing of 712 miRNA genes in 86 ovarian cancer cases identified only 5 mutated miRNA genes, each in a different case. One mutation was located in the mature miRNA, and three mutations were predicted to alter the secondary structure of the miRNA transcript. Screening of the 3'-untranslated region of 18 candidate cancer genes identified one mutation in each of AKT2, EGFR, ERRB2 and CTNNB1. The functional effect of these mutations is unclear, as expression data available for AKT2 and EGFR showed no increase in gene transcript. Mutations in miRNA genes and 3'-untranslated regions are thus uncommon in ovarian cancer.

  2. The coevolution of genes and genetic codes: Crick's frozen accident revisited.

    Science.gov (United States)

    Sella, Guy; Ardell, David H

    2006-09-01

    The standard genetic code is the nearly universal system for the translation of genes into proteins. The code exhibits two salient structural characteristics: it possesses a distinct organization that makes it extremely robust to errors in replication and translation, and it is highly redundant. The origin of these properties has intrigued researchers since the code was first discovered. One suggestion, which is the subject of this review, is that the code's organization is the outcome of the coevolution of genes and genetic codes. In 1968, Francis Crick explored the possible implications of coevolution at different stages of code evolution. Although he argues that coevolution was likely to influence the evolution of the code, he concludes that it falls short of explaining the organization of the code we see today. The recent application of mathematical modeling to study the effects of errors on the course of coevolution, suggests a different conclusion. It shows that coevolution readily generates genetic codes that are highly redundant and similar in their error-correcting organization to the standard code. We review this recent work and suggest that further affirmation of the role of coevolution can be attained by investigating the extent to which the outcome of coevolution is robust to other influences that were present during the evolution of the code.

  3. Spectrum of small mutations in the dystrophin coding region

    Energy Technology Data Exchange (ETDEWEB)

    Prior, T.W.; Bartolo, C.; Pearl, D.K. [Ohio State Univ., Columbus, OH (United States)] [and others

    1995-07-01

    Duchenne and Becker muscular dystrophies (DMD and BMD) are caused by defects in the dystrophin gene. About two-thirds of the affected patients have large deletions or duplications, which occur in the 5` and central portion of the gene. The nondeletion/duplication cases are most likely the result of smaller mutations that cannot be identified by current diagnostic screening strategies. We screened {approximately} 80% of the dystrophin coding sequence for small mutations in 158 patients without deletions or duplications and identified 29 mutations. The study indicates that many of the DMD and the majority of the BMD small mutations lie in noncoding regions of the gene. All of the mutations identified were unique to single patients, and most of the mutations resulted in protein truncation. We did not find a clustering of small mutations similar to the deletion distribution but found > 40% of the small mutations 3` of exon 55. The extent of protein truncation caused by the 3` mutations did not determine the phenotype, since even the exon 76 nonsense mutation resulted in the severe DMD phenotype. Our study confirms that the dystrophin gene is subject to a high rate of mutation in CpG sequences. As a consequence of not finding any hotspots or prevalent small mutations, we conclude that it is presently not possible to perform direct carrier and prenatal diagnostics for many families without deletions or duplications. 71 refs., 2 figs., 2 tabs.

  4. An evolutionary conserved region (ECR in the human dopamine receptor D4 gene supports reporter gene expression in primary cultures derived from the rat cortex

    Directory of Open Access Journals (Sweden)

    Haddley Kate

    2011-05-01

    Full Text Available Abstract Background Detecting functional variants contributing to diversity of behaviour is crucial for dissecting genetics of complex behaviours. At a molecular level, characterisation of variation in exons has been studied as they are easily identified in the current genome annotation although the functional consequences are less well understood; however, it has been difficult to prioritise regions of non-coding DNA in which genetic variation could also have significant functional consequences. Comparison of multiple vertebrate genomes has allowed the identification of non-coding evolutionary conserved regions (ECRs, in which the degree of conservation can be comparable with exonic regions suggesting functional significance. Results We identified ECRs at the dopamine receptor D4 gene locus, an important gene for human behaviours. The most conserved non-coding ECR (D4ECR1 supported high reporter gene expression in primary cultures derived from neonate rat frontal cortex. Computer aided analysis of the sequence of the D4ECR1 indicated the potential transcription factors that could modulate its function. D4ECR1 contained multiple consensus sequences for binding the transcription factor Sp1, a factor previously implicated in DRD4 expression. Co-transfection experiments demonstrated that overexpression of Sp1 significantly decreased the activity of the D4ECR1 in vitro. Conclusion Bioinformatic analysis complemented by functional analysis of the DRD4 gene locus has identified a a strong enhancer that functions in neurons and b a transcription factor that may modulate the function of that enhancer.

  5. Paracantor: A two group, two region reactor code

    Energy Technology Data Exchange (ETDEWEB)

    Stone, Stuart

    1956-07-01

    Paracantor I a two energy group, two region, time independent reactor code, which obtains a closed solution for a critical reactor assembly. The code deals with cylindrical reactors of finite length and with a radial reflector of finite thickness. It is programmed for the 1.B.M: Magnetic Drum Data-Processing Machine, Type 650. The limited memory space available does not permit a flux solution to be included in the basic Paracantor code. A supplementary code, Paracantor 11, has been programmed which computes fluxes, .including adjoint fluxes, from the .output of Paracamtor I.

  6. DNA rearrangement in human follicular lymphoma can involve the 5' or the 3' region of the bcl-2 gene

    International Nuclear Information System (INIS)

    Tsujimoto, Y.; Bashir, M.M.; Givol, I.; Cossman, J.; Jaffe, E.; Croce, C.M.

    1987-01-01

    In most human lymphomas, the chromosome translocation t(14;18) occurs within two breakpoint clustering regions on chromosome 18, the major one at the 3' untranslated region of the bcl-2 gene and the minor one at 3' of the gene. Analysis of a panel of follicular lymphoma DNAs using probes for the first exon of the bcl-2 gene indicates that DNA rearrangements may also occur 5' to the involved bcl-2 gene. In this case the IgH locus and the bcl-2 gene are found in an order suggesting that an inversion also occurred during the translocation process. The coding region of the bcl-2 gene, however, are left intact in all cases of follicular lymphoma studied to date

  7. MICB gene diversity and balancing selection on its promoter region in Yao population in southern China.

    Science.gov (United States)

    Chen, Xiang; Liu, Xuexiang; Wei, Xiaomou; Meng, Yuming; Liu, Limin; Qin, Shini; Liu, Yanyu; Dai, Shengming

    2016-12-01

    To comprehensively examine the MICB gene polymorphism and identify its differences in Chinese Yao population from other ethnic groups, we investigated the polymorphism in the 5'-upstream regulation region (5'-URR), coding region (exons 2-4), and the 3'-untranslated region (3'-UTR) of MICB gene by using PCR-SBT method in 125 healthy unrelated Yao individuals in Guangxi Zhuang Autonomous Region. Higher polymorphism was observed in the 5'-URR, nine single nucleotide polymorphisms (SNPs) and a two base pairs deletion at position -139/-138 were found in our study. Only five different variation sites, however, were detected in exons 2-4 and three were observed in the 3'-UTR. The minor allele frequencies of all variants were greater than 5%, except for rs3828916, rs3131639, rs45627734, rs113620316, rs779737471, and the variation at position +11803 in the 3'-UTR. The first nine SNPs of 5'-URR and rs1065075, rs1051788 of the coding region showed significant linkage disequilibrium with each other. Ten different MICB extended haplotypes (EH) encompassing the 5'-URR, exons 2-4, and 3'-UTR were found in this population, and the most frequent was EH1 (23.2%). We provided several evidences for balancing selection effect on the 5'-URR of MICB gene in Yao population. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  8. Generation of a gene cassette for genetically engineered Salmonella Enteritidis in the specific region of the sipC gene

    Directory of Open Access Journals (Sweden)

    M Ghasemi

    2017-05-01

    Full Text Available Introduction: Salmonellosis is an infection caused by eating contaminated food with Salmonella, and it can occur in humans and other animals. Salmonella has acquired the ability to create the infection due to the presence of several virulence genes. One of the virulence genes of salmonella is sipC gene that coding the SipC protein. The aim of this study was creating the gene cassette to genetically engineered Salmonella enteritidis in the specific region of the sipC gene. Methods: In this study, after DNA extraction from Salmonella, the upstream and downstream regions of the sipC gene was amplified based on PCR method. The PCR products were cloned with T/A cloning method and they were inserted into the pGEM vector. In order to generate the final gene cassette, each of the upstream and downstream regions of the sipC gene was subcloned into the pET32 vector, and cloning accuracy was assessed by PCR and enzyme digestion methods. Results: Amplification of the 320 bp upstream and 206 bp downstream of sipC gene was successful by PCR method. T/A cloning of these fragments were caused the formation of two pGEM-up and pGEM-down recombinant vectors. Results that were confirmed the sub-cloning accuracy indicate the formation of the final pET32-up-down gene cassette. Conclusion: The generated gene cassette in this study was considered as a multi-purpose cassette that is able to specific gene manipulation of Salmonella sipC gene by homologous recombination matched. This gene cassette has the necessary potential for sipC gene deletion or insertion of any useful gene instead of sipC gene.

  9. Organization and annotation of the Xcat critical region: elimination of seven positional candidate genes.

    Science.gov (United States)

    Huang, Kristen M; Geunes-Boyer, Scarlett; Wu, Sufen; Dutra, Amalia; Favor, Jack; Stambolian, Dwight

    2004-05-01

    Xcat mice display X-linked congenital cataracts and are a mouse model for the human X-linked cataract disease Nance Horan syndrome (NHS). The genetic defect in Xcat mice and NHS patients is not known. We isolated and sequenced a BAC contig representing a portion of the Xcat critical region. We combined our sequencing data with the most recent mouse sequence assemblies from both Celera and public databases. The sequence of the 2.2-Mb Xcat critical region was then analyzed for potential Xcat candidate genes. The coding regions of the seven known genes within this area (Rai2, Rbbp7, Ctps2, Calb3, Grpr, Reps2, and Syap1) were sequenced in Xcat mice and no mutations were detected. The expression of Rai2 was quantitatively identical in wild-type and Xcat mutant eyes. These results indicate that the Xcat mutation is within a novel, undiscovered gene.

  10. Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

    Directory of Open Access Journals (Sweden)

    Maggi Giorgio P

    2008-06-01

    Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.

  11. Maternally Expressed Gene 3, an imprinted non-coding RNA gene, is associated with meningioma pathogenesis and progression

    Science.gov (United States)

    Zhang, Xun; Gejman, Roger; Mahta, Ali; Zhong, Ying; Rice, Kimberley A.; Zhou, Yunli; Cheunsuchon, Pornsuk; Louis, David N.; Klibanski, Anne

    2010-01-01

    Meningiomas are common tumors, representing 15-25% of all central nervous system tumors. NF2 gene inactivation on chromosome 22 has been shown as an early event in tumorigenesis; however, few factors underlying tumor growth and progression have been identified. Chromosomal abnormalities of 14q32 are often associated with meningioma pathogenesis and progression; therefore it has been proposed that an as yet unidentified tumor suppressor is present at this locus. MEG3 is an imprinted gene located at 14q32 that encodes a non-coding RNA with an anti-proliferative function. We found that MEG3 mRNA is highly expressed in normal arachnoidal cells. However, MEG3 is not expressed in the majority of human meningiomas or the human meningioma cell lines IOMM-Lee and CH157-MN. There is a strong association between loss of MEG3 expression and tumor grade. Allelic loss at the MEG3 locus is also observed in meningiomas, with increasing prevalence in higher grade tumors. In addition, there is an increase in CpG methylation within the promoter and the imprinting control region of MEG3 gene in meningiomas. Functionally, MEG3 suppresses DNA synthesis in both IOMM-Lee and CH157-MN cells by approximately 60% in BrdU incorporation assays. Colony-forming efficiency assays show that MEG3 inhibits colony formation in CH157-MN cells by approximately 80%. Furthermore, MEG3 stimulates p53-mediated transactivation in these cell lines. Therefore, these data are consistent with the hypothesis that MEG3, which encodes a non-coding RNA, may be a tumor suppressor gene at chromosome 14q32 involved in meningioma progression via a novel mechanism. PMID:20179190

  12. Orthologous microRNA genes are located in cancer-associated genomic regions in human and mouse.

    Directory of Open Access Journals (Sweden)

    Igor V Makunin

    Full Text Available BACKGROUND: MicroRNAs (miRNAs are short non-coding RNAs that regulate differentiation and development in many organisms and play an important role in cancer. METHODOLOGY/PRINCIPAL FINDINGS: Using a public database of mapped retroviral insertion sites from various mouse models of cancer we demonstrate that MLV-derived retroviral inserts are enriched in close proximity to mouse miRNA loci. Clustered inserts from cancer-associated regions (Common Integration Sites, CIS have a higher association with miRNAs than non-clustered inserts. Ten CIS-associated miRNA loci containing 22 miRNAs are located within 10 kb of known CIS insertions. Only one CIS-associated miRNA locus overlaps a RefSeq protein-coding gene and six loci are located more than 10 kb from any RefSeq gene. CIS-associated miRNAs on average are more conserved in vertebrates than miRNAs associated with non-CIS inserts and their human homologs are also located in regions perturbed in cancer. In addition we show that miRNA genes are enriched around promoter and/or terminator regions of RefSeq genes in both mouse and human. CONCLUSIONS/SIGNIFICANCE: We provide a list of ten miRNA loci potentially involved in the development of blood cancer or brain tumors. There is independent experimental support from other studies for the involvement of miRNAs from at least three CIS-associated miRNA loci in cancer development.

  13. Molecular analysis of human argininosuccinate lyase: Mutant characterization and alternative splicing of the coding region

    International Nuclear Information System (INIS)

    Walker, D.C.; McCloskey, D.A.; Simard, L.R.; McInnes, R.R.

    1990-01-01

    Argininosuccinic acid lyase (ASAL) deficiency is a clinically heterogeneous autosomal recessive urea cycle disorder. The authors previously established by complementation analysis that 29 ASAL-deficient patients have heterogeneous mutations in a single gene. To prove that the ASAL structural gene is the affected locus, they sequenced polymerase chain reaction-amplified ASAL cDNA of a representative mutant from the single complementation group. Fibroblast strain 944 from a late-onset patient who was the product of a consanguineous mating, had only a single base-pair change in the coding region, a C-283→ T transition at a CpG dinucleotide in exon 3. This substitution converts Arg-95 to Cys (R95C), occurs in a stretch of 13 residues that is identical in yeast and human ASAL, and was present in both of the patient's alleles but not in 14 other mutant or 10 normal alleles. They observed that amplified cDNA from mutant 944 and normal cells (liver, keratinocytes, lymphoblasts, and fibroblasts) contained, in addition to the expected 5' 513-base-pair band, a prominent 318-base-pair ASAL band formed by the splicing of exon 2 from the transcript. The short transcript maintains the ASAL reading frame but removes Lys-51, a residue that may be essential for catalysis, since it binds the argininosuccinate substrate. They conclude (i) that the identification of the R95C mutation in strain 944 demonstrates that virtually all ASAL deficiency results from defects in the ASAL structural gene and (ii) that minor alternative splicing of the coding region occurs at the ASAL locus

  14. Formation of a unique cluster of G-quadruplex structures in the HIV-1 Nef coding region: implications for antiviral activity.

    Directory of Open Access Journals (Sweden)

    Rosalba Perrone

    Full Text Available G-quadruplexes are tetraplex structures of nucleic acids that can form in G-rich sequences. Their presence and functional role have been established in telomeres, oncogene promoters and coding regions of the human chromosome. In particular, they have been proposed to be directly involved in gene regulation at the level of transcription. Because the HIV-1 Nef protein is a fundamental factor for efficient viral replication, infectivity and pathogenesis in vitro and in vivo, we investigated G-quadruplex formation in the HIV-1 nef gene to assess the potential for viral inhibition through G-quadruplex stabilization. A comprehensive computational analysis of the nef coding region of available strains showed the presence of three conserved sequences that were uniquely clustered. Biophysical testing proved that G-quadruplex conformations were efficiently stabilized or induced by G-quadruplex ligands in all three sequences. Upon incubation with a G-quadruplex ligand, Nef expression was reduced in a reporter gene assay and Nef-dependent enhancement of HIV-1 infectivity was significantly repressed in an antiviral assay. These data constitute the first evidence of the possibility to regulate HIV-1 gene expression and infectivity through G-quadruplex targeting and therefore open a new avenue for viral treatment.

  15. Understanding Epistatic Interactions between Genes Targeted by Non-coding Regulatory Elements in Complex Diseases

    Directory of Open Access Journals (Sweden)

    Min Kyung Sung

    2014-12-01

    Full Text Available Genome-wide association studies have proven the highly polygenic architecture of complex diseases or traits; therefore, single-locus-based methods are usually unable to detect all involved loci, especially when individual loci exert small effects. Moreover, the majority of associated single-nucleotide polymorphisms resides in non-coding regions, making it difficult to understand their phenotypic contribution. In this work, we studied epistatic interactions associated with three common diseases using Korea Association Resource (KARE data: type 2 diabetes mellitus (DM, hypertension (HT, and coronary artery disease (CAD. We showed that epistatic single-nucleotide polymorphisms (SNPs were enriched in enhancers, as well as in DNase I footprints (the Encyclopedia of DNA Elements [ENCODE] Project Consortium 2012, which suggested that the disruption of the regulatory regions where transcription factors bind may be involved in the disease mechanism. Accordingly, to identify the genes affected by the SNPs, we employed whole-genome multiple-cell-type enhancer data which discovered using DNase I profiles and Cap Analysis Gene Expression (CAGE. Assigned genes were significantly enriched in known disease associated gene sets, which were explored based on the literature, suggesting that this approach is useful for detecting relevant affected genes. In our knowledge-based epistatic network, the three diseases share many associated genes and are also closely related with each other through many epistatic interactions. These findings elucidate the genetic basis of the close relationship between DM, HT, and CAD.

  16. Expression profile of genes coding for carotenoid biosynthetic ...

    Indian Academy of Sciences (India)

    Expression profile of genes coding for carotenoid biosynthetic pathway during ripening and their association with accumulation of lycopene in tomato fruits. Shuchi Smita, Ravi Rajwanshi, Sangram Keshari Lenka, Amit Katiyar, Viswanathan Chinnusamy and. Kailash Chander Bansal. J. Genet. 92, 363–368. Table 1.

  17. HIV1 V3 loop hypermutability is enhanced by the guanine usage bias in the part of env gene coding for it.

    Science.gov (United States)

    Khrustalev, Vladislav Victorovich

    2009-01-01

    Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).

  18. Bioinformatic Analysis of Deleterious Non-Synonymous Single Nucleotide Polymorphisms (nsSNPs in the Coding Regions of Human Prion Protein Gene (PRNP

    Directory of Open Access Journals (Sweden)

    Kourosh Bamdad

    2016-12-01

    Full Text Available Background & Objective: Single nucleotide polymorphisms are the cause of genetic variation to living organisms. Single nucleotide polymorphisms alter residues in the protein sequence. In this investigation, the relationship between prion protein gene polymorphisms and its relevance to pathogenicity was studied. Material & Method: Amino acid sequence of the main isoform from the human prion protein gene (PRNP was extracted from UniProt database and evaluated by FoldAmyloid and AmylPred servers. All non-synonymous single nucleotide polymorphisms (nsSNPs from SNP database (dbSNP were further analyzed by bioinformatics servers including SIFT, PolyPhen-2, I-Mutant-3.0, PANTHER, SNPs & GO, PHD-SNP, Meta-SNP, and MutPred to determine the most damaging nsSNPs. Results: The results of the first structure analyses by FoldAmyloid and AmylPerd servers implied that regions including 5-15, 174-178, 180-184, 211-217, and 240-252 were the most sensitive parts of the protein sequence to amyloidosis. Screening all nsSNPs of the main protein isoform using bioinformatic servers revealed that substitution of Aspartic acid with Valine at position 178 (ID code: rs11538766 was the most deleterious nsSNP in the protein structure. Conclusion:  Substitution of the Aspartic acid with Valine at position 178 (D178V was the most pathogenic mutation in the human prion protein gene. Analyses from the MutPred server also showed that beta-sheets’ increment in the secondary structure was the main reason behind the molecular mechanism of the prion protein aggregation.

  19. Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

    Science.gov (United States)

    Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

    2017-12-02

    The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.

  20. Revisiting the missing protein-coding gene catalog of the domestic dog

    Directory of Open Access Journals (Sweden)

    Galibert Francis

    2009-02-01

    Full Text Available Abstract Background Among mammals for which there is a high sequence coverage, the whole genome assembly of the dog is unique in that it predicts a low number of protein-coding genes, ~19,000, compared to the over 20,000 reported for other mammalian species. Of particular interest are the more than 400 of genes annotated in primates and rodent genomes, but missing in dog. Results Using over 14,000 orthologous genes between human, chimpanzee, mouse rat and dog, we built multiple pairwise synteny maps to infer short orthologous intervals that were targeted for characterizing the canine missing genes. Based on gene prediction and a functionality test using the ratio of replacement to silent nucleotide substitution rates (dN/dS, we provide compelling structural and functional evidence for the identification of 232 new protein-coding genes in the canine genome and 69 gene losses, characterized as undetected gene or pseudogenes. Gene loss phyletic pattern analysis using ten species from chicken to human allowed us to characterize 28 canine-specific gene losses that have functional orthologs continuously from chicken or marsupials through human, and 10 genes that arose specifically in the evolutionary lineage leading to rodent and primates. Conclusion This study demonstrates the central role of comparative genomics for refining gene catalogs and exploring the evolutionary history of gene repertoires, particularly as applied for the characterization of species-specific gene gains and losses.

  1. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    Science.gov (United States)

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression on different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocytes receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and with which receptors HLA-E interacts to. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus, influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a very admixed population such as Brazilians by using this approach. Considering a segment of about 7kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which corresponds to about 90% of all. In addition, differently from what has been previously observed for other non classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were many times associated with different HLA-E coding haplotypes. This data reinforces the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, this data does indicate that the distal HLA-E promoter is by

  2. A "White" Anthocyanin-less Pomegranate (Punica granatum L.) Caused by an Insertion in the Coding Region of the Leucoanthocyanidin Dioxygenase (LDOX; ANS) Gene.

    Science.gov (United States)

    Ben-Simhon, Zohar; Judeinstein, Sylvie; Trainin, Taly; Harel-Beja, Rotem; Bar-Ya'akov, Irit; Borochov-Neori, Hamutal; Holland, Doron

    2015-01-01

    Color is an important determinant of pomegranate fruit quality and commercial value. To understand the genetic factors controlling color in pomegranate, chemical, molecular and genetic characterization of a "white" pomegranate was performed. This unique accession is lacking the typical pomegranate color rendered by anthocyanins in all tissues of the plant, including flowers, fruit (skin and arils) and leaves. Steady-state gene-expression analysis indicated that none of the analyzed "white" pomegranate tissues are able to synthesize mRNA corresponding to the PgLDOX gene (leucoanthocyanidin dioxygenase, also called ANS, anthocyanidin synthase), which is one of the central structural genes in the anthocyanin-biosynthesis pathway. HPLC analysis revealed that none of the "white" pomegranate tissues accumulate anthocyanins, whereas other flavonoids, corresponding to biochemical reactions upstream of LDOX, were present. Molecular analysis of the "white" pomegranate revealed the presence of an insertion and an SNP within the coding region of PgLDOX. It was found that the SNP does not change amino acid sequence and is not fully linked with the "white" phenotype in all pomegranate accessions from the collection. On the other hand, genotyping of pomegranate accessions from the collection and segregating populations for the "white" phenotype demonstrated its complete linkage with the insertion, inherited as a recessive single-gene trait. Taken together, the results indicate that the insertion in PgLDOX is responsible for the "white" anthocyanin-less phenotype. These data provide the first direct molecular, genetic and chemical evidence for the effect of a natural modification in the LDOX gene on color accumulation in a fruit-bearing woody perennial deciduous tree. This modification can be further utilized to elucidate the physiological role of anthocyanins in protecting the tree organs from harmful environmental conditions, such as temperature and UV radiation.

  3. Gene Expression and Polymorphism of Myostatin Gene and its Association with Growth Traits in Chicken.

    Science.gov (United States)

    Dushyanth, K; Bhattacharya, T K; Shukla, R; Chatterjee, R N; Sitaramamma, T; Paswan, C; Guru Vishnu, P

    2016-10-01

    Myostatin is a member of TGF-β super family and is directly involved in regulation of body growth through limiting muscular growth. A study was carried out in three chicken lines to identify the polymorphism in the coding region of the myostatin gene through SSCP and DNA sequencing. A total of 12 haplotypes were observed in myostatin coding region of chicken. Significant associations between haplogroups with body weight at day 1, 14, 28, and 42 days, and carcass traits at 42 days were observed across the lines. It is concluded that the coding region of myostatin gene was polymorphic, with varied levels of expression among lines and had significant effects on growth traits. The expression of MSTN gene varied during embryonic and post hatch development stage.

  4. Influence of Coding Variability in APP-Aβ Metabolism Genes in Sporadic Alzheimer's Disease.

    Directory of Open Access Journals (Sweden)

    Celeste Sassi

    Full Text Available The cerebral deposition of Aβ42, a neurotoxic proteolytic derivate of amyloid precursor protein (APP, is a central event in Alzheimer's disease (AD(Amyloid hypothesis. Given the key role of APP-Aβ metabolism in AD pathogenesis, we selected 29 genes involved in APP processing, Aβ degradation and clearance. We then used exome and genome sequencing to investigate the single independent (single-variant association test and cumulative (gene-based association test effect of coding variants in these genes as potential susceptibility factors for AD, in a cohort composed of 332 sporadic and mainly late-onset AD cases and 676 elderly controls from North America and the UK. Our study shows that common coding variability in these genes does not play a major role for the disease development. In the single-variant association analysis, the main hits, none of which statistically significant after multiple testing correction (1.9e-4coding variants (0.009%genes mainly involved in Aβ extracellular degradation (TTR, ACE, clearance (LRP1 and APP trafficking and recycling (SORL1. These results were partially replicated in the gene-based analysis (c-alpha and SKAT tests, that reports ECE1, LYZ and TTR as nominally associated to AD (1.7e-3 coding variability in APP-Aβ genes is not a critical factor for AD development and 2 Aβ degradation and clearance, rather than Aβ production, may play a key role in the etiology of sporadic AD.

  5. Color differences among feral pigeons (Columba livia) are not attributable to sequence variation in the coding region of the melanocortin-1 receptor gene (MC1R)

    Science.gov (United States)

    2013-01-01

    Background Genetic variation at the melanocortin-1 receptor (MC1R) gene is correlated with melanin color variation in many birds. Feral pigeons (Columba livia) show two major melanin-based colorations: a red coloration due to pheomelanic pigment and a black coloration due to eumelanic pigment. Furthermore, within each color type, feral pigeons display continuous variation in the amount of melanin pigment present in the feathers, with individuals varying from pure white to a full dark melanic color. Coloration is highly heritable and it has been suggested that it is under natural or sexual selection, or both. Our objective was to investigate whether MC1R allelic variants are associated with plumage color in feral pigeons. Findings We sequenced 888 bp of the coding sequence of MC1R among pigeons varying both in the type, eumelanin or pheomelanin, and the amount of melanin in their feathers. We detected 10 non-synonymous substitutions and 2 synonymous substitution but none of them were associated with a plumage type. It remains possible that non-synonymous substitutions that influence coloration are present in the short MC1R fragment that we did not sequence but this seems unlikely because we analyzed the entire functionally important region of the gene. Conclusions Our results show that color differences among feral pigeons are probably not attributable to amino acid variation at the MC1R locus. Therefore, variation in regulatory regions of MC1R or variation in other genes may be responsible for the color polymorphism of feral pigeons. PMID:23915680

  6. Conserved syntenic clusters of protein coding genes are missing in birds.

    Science.gov (United States)

    Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

    2014-01-01

    Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.

  7. Kinetic models of gene expression including non-coding RNAs

    Energy Technology Data Exchange (ETDEWEB)

    Zhdanov, Vladimir P., E-mail: zhdanov@catalysis.r

    2011-03-15

    In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.

  8. Molecular analysis of two genes between let-653 and let-56 in the unc-22(IV) region of Caenorhabditis elegans.

    Science.gov (United States)

    Marra, M A; Prasad, S S; Baillie, D L

    1993-01-01

    A previous study of genomic organization described the identification of nine potential coding regions in 150 kb of genomic DNA from the unc-22(IV) region of Caenorhabditis elegans. In this study, we focus on the genomic organization of a small interval of 0.1 map unit bordered on the right by unc-22 and on the left by the left-hand breakpoints of the deficiencies sDf9, sDf19 and sDf65. This small interval at present contains a single mutagenically defined locus, the essential gene let-56. The cosmid C11F2 has previously been used to rescue let-56. Therefore, at least some of C11F2 must reside in the interval. In this paper, we report the characterization of two coding elements that reside on C11F2. Analysis of nucleotide sequence data obtained from cDNAs and cosmid subclones revealed that one of the coding elements closely resembles aromatic amino acid decarboxylases from several species. The other of these coding elements was found to closely resemble a human growth factor activatable Na+/H+ antiporter. Paris of oligonucleotide primers, predicted from both coding elements, have been used in PCR experiments to position these coding elements between the left breakpoint of sDf19 and the left breakpoint of sDf65, between the essential genes let-653 and let-56.

  9. Discovery of rare protein-coding genes in model methylotroph Methylobacterium extorquens AM1.

    Science.gov (United States)

    Kumar, Dhirendra; Mondal, Anupam Kumar; Yadav, Amit Kumar; Dash, Debasis

    2014-12-01

    Proteogenomics involves the use of MS to refine annotation of protein-coding genes and discover genes in a genome. We carried out comprehensive proteogenomic analysis of Methylobacterium extorquens AM1 (ME-AM1) from publicly available proteomics data with a motive to improve annotation for methylotrophs; organisms capable of surviving in reduced carbon compounds such as methanol. Besides identifying 2482(50%) proteins, 29 new genes were discovered and 66 annotated gene models were revised in ME-AM1 genome. One such novel gene is identified with 75 peptides, lacks homolog in other methylobacteria but has glycosyl transferase and lipopolysaccharide biosynthesis protein domains, indicating its potential role in outer membrane synthesis. Many novel genes are present only in ME-AM1 among methylobacteria. Distant homologs of these genes in unrelated taxonomic classes and low GC-content of few genes suggest lateral gene transfer as a potential mode of their origin. Annotations of methylotrophy related genes were also improved by the discovery of a short gene in methylotrophy gene island and redefining a gene important for pyrroquinoline quinone synthesis, essential for methylotrophy. The combined use of proteogenomics and rigorous bioinformatics analysis greatly enhanced the annotation of protein-coding genes in model methylotroph ME-AM1 genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Analysis of t(9;17)(q33.2;q25.3) chromosomal breakpoint regions and genetic association reveals novel candidate genes for bipolar disorder

    DEFF Research Database (Denmark)

    Rajkumar, Anto P; Christensen, Jane H; Mattheisen, Manuel

    2015-01-01

    ,856) data. Genetic associations between these disorders and single nucleotide polymorphisms within these breakpoint regions were analysed by BioQ, FORGE, and RegulomeDB programmes. RESULTS: Four protein-coding genes [coding for (endonuclease V (ENDOV), neuronal pentraxin I (NPTX1), ring finger protein 213...

  11. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  12. Two rare deletions upstream of the NRXN1 gene (2p16.3) affecting the non-coding mRNA AK127244 segregate with diverse psychopathological phenotypes in a family

    DEFF Research Database (Denmark)

    Duong, L. T. T.; Hoeffding, L. K.; Petersen, K. B.

    2015-01-01

    127244 in addition to the pathogenic 15q11.2 deletion in distinct family members. The two deletions upstream of the NRXN1 gene were found to segregate with psychiatric disorders in the family and further similar deletions have been observed in patients diagnosed with autism spectrum disorder. Thus, we...... susceptibility. In this study, we describe a family affected by a wide range of psychiatric disorders including early onset schizophrenia, schizophreniform disorder, and affective disorders. Microarray analysis identified two rare deletions immediately upstream of the NRXN1 gene affecting the non-coding mRNA AK...... suggest that non-coding regions upstream of the NRXN1 gene affecting AK127244 might (as NRXN1) contain susceptibility regions for a wide spectrum of neuropsychiatric disorders. (C) 2015 Elsevier Masson SAS. All rights reserved....

  13. Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.

    Science.gov (United States)

    Wyszyńska-Koko, J; Kurył, J

    2004-01-01

    MYF6 gene codes for the bHLH transcription factor belonging to MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embriogenesis and continues on a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified and sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. Myf-6 protein shows a high conservation among species of 99 and 97% identity when comparing pig with cow and human, respectively, and of 93% when comparing pig with mouse and rat. The single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be T --> C transition recognized by a MspI restriction enzyme.

  14. Annotation of the protein coding regions of the equine genome

    DEFF Research Database (Denmark)

    Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

    2015-01-01

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...

  15. Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse

    Directory of Open Access Journals (Sweden)

    Lu Lu

    2010-11-01

    Full Text Available Abstract Background Sexual dimorphism in brain gene expression has been recognized in several animal species. However, the relevant regulatory mechanisms remain poorly understood. To investigate whether sex-biased gene expression in mammalian brain is globally regulated or locally regulated in diverse brain structures, and to study the genomic organisation of brain-expressed sex-biased genes, we performed a large scale gene expression analysis of distinct brain regions in adult male and female mice. Results This study revealed spatial specificity in sex-biased transcription in the mouse brain, and identified 173 sex-biased genes in the striatum; 19 in the neocortex; 12 in the hippocampus and 31 in the eye. Genes located on sex chromosomes were consistently over-represented in all brain regions. Analysis on a subset of genes with sex-bias in more than one tissue revealed Y-encoded male-biased transcripts and X-encoded female-biased transcripts known to escape X-inactivation. In addition, we identified novel coding and non-coding X-linked genes with female-biased expression in multiple tissues. Interestingly, the chromosomal positions of all of the female-biased non-coding genes are in close proximity to protein-coding genes that escape X-inactivation. This defines X-chromosome domains each of which contains a coding and a non-coding female-biased gene. Lack of repressive chromatin marks in non-coding transcribed loci supports the possibility that they escape X-inactivation. Moreover, RNA-DNA combined FISH experiments confirmed the biallelic expression of one such novel domain. Conclusion This study demonstrated that the amount of genes with sex-biased expression varies between individual brain regions in mouse. The sex-biased genes identified are localized on many chromosomes. At the same time, sexually dimorphic gene expression that is common to several parts of the brain is mostly restricted to the sex chromosomes. Moreover, the study uncovered

  16. Conceptual Approach to Forming the Basic Code of Neo-Industrial Development of a Region

    Directory of Open Access Journals (Sweden)

    Elena Leonidovna Andreeva

    2017-09-01

    Full Text Available In the article, the authors propose the conceptual fundamentals of the “code approach” to the regional neo-industrial development. The purpose of the research is to reveal the essence of the transition to a new type of industrial and economic relations through a prism of “genetic codes” of the region. We consider these codes as a system of the “racial memory” of a territory, which determines the specificity and features of neo-industrialization realization. We substantiated the hypothesis about the influence of the “genetic codes” of the region on the effectiveness of the neo-industrialization. We have defined the participants, or else the carriers of the codes in the transformation of regional inheritance for the stimulation of the neoindustrial development of region’s economy. The subject matter of the research is the distinctive features of the functioning of the determinative region’s codes. Their content determines the socio-economic specificity of the region and the features of innovative, informational, value-based and competence-based development of the territory. The determinative codes generate the dynamic codes of the region, which are understood as their derivatives. They have a high probability of occurrence, higher speed of development and distribution, internal forces that make possible the self-development of the region. The scientific contribution is the substantiation of the basic code of the regional neo-industrial development. It represents the evolutionary accumulation of the rapid changes of its innovative, informational, value-based and competence-based codes stimulating the generation and implementation of new ideas regarding to economic entities adapted to the historical and cultural conditions. The article presents the code model of neo-industrial development of the region described by formulas. We applied the system analysis methods, historical and civilization approaches, evolutionary and

  17. Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

    Science.gov (United States)

    Spielmann, A; Stutz, E

    1983-10-25

    The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.

  18. Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes.

    Science.gov (United States)

    Hsu, Jacob Shujui; Kwan, Johnny S H; Pan, Zhicheng; Garcia-Barcelo, Maria-Mercè; Sham, Pak Chung; Li, Miaoxin

    2016-10-15

    Exome sequencing studies have facilitated the detection of causal genetic variants in yet-unsolved Mendelian diseases. However, the identification of disease causal genes among a list of candidates in an exome sequencing study is still not fully settled, and it is often difficult to prioritize candidate genes for follow-up studies. The inheritance mode provides crucial information for understanding Mendelian diseases, but none of the existing gene prioritization tools fully utilize this information. We examined the characteristics of Mendelian disease genes under different inheritance modes. The results suggest that Mendelian disease genes with autosomal dominant (AD) inheritance mode are more haploinsufficiency and de novo mutation sensitive, whereas those autosomal recessive (AR) genes have significantly more non-synonymous variants and regulatory transcript isoforms. In addition, the X-linked (XL) Mendelian disease genes have fewer non-synonymous and synonymous variants. As a result, we derived a new scoring system for prioritizing candidate genes for Mendelian diseases according to the inheritance mode. Our scoring system assigned to each annotated protein-coding gene (N = 18 859) three pathogenic scores according to the inheritance mode (AD, AR and XL). This inheritance mode-specific framework achieved higher accuracy (area under curve  = 0.84) in XL mode. The inheritance-mode specific pathogenicity prioritization (ISPP) outperformed other well-known methods including Haploinsufficiency, Recessive, Network centrality, Genic Intolerance, Gene Damage Index and Gene Constraint scores. This systematic study suggests that genes manifesting disease inheritance modes tend to have unique characteristics. ISPP is included in KGGSeq v1.0 (http://grass.cgs.hku.hk/limx/kggseq/), and source code is available from (https://github.com/jacobhsu35/ISPP.git). mxli@hku.hkSupplementary information: Supplementary data are available at Bioinformatics online. © The Author

  19. Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

    Science.gov (United States)

    Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

    2015-12-11

    High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.

  20. Emerging putative associations between non-coding RNAs and protein-coding genes in Neuropathic Pain. Added value from re-using microarray data.

    Directory of Open Access Journals (Sweden)

    Enrico Capobianco

    2016-10-01

    Full Text Available Regeneration of injured nerves is likely occurring in the peripheral nervous system, but not in the central nervous system. Although protein-coding gene expression has been assessed during nerve regeneration, little is currently known about the role of non-coding RNAs (ncRNAs. This leaves open questions about the potential effects of ncRNAs at transcriptome level. Due to the limited availability of human neuropathic pain data, we have identified the most comprehensive time-course gene expression profile referred to sciatic nerve injury, and studied in a rat model, using two neuronal tissues, namely dorsal root ganglion (DRG and sciatic nerve (SN. We have developed a methodology to identify differentially expressed bioentities starting from microarray probes, and re-purposing them to annotate ncRNAs, while analyzing the expression profiles of protein-coding genes. The approach is designed to reuse microarray data and perform first profiling and then meta-analysis through three main steps. First, we used contextual analysis to identify what we considered putative or potential protein coding targets for selected ncRNAs. Relevance was therefore assigned to differential expression of neighbor protein-coding genes, with neighborhood defined by a fixed genomic distance from long or antisense ncRNA loci, and of parent genes associated with pseudogenes. Second, connectivity among putative targets was used to build networks, in turn useful to conduct inference at interactomic scale. Last, network paths were annotated to assess relevance to neuropathic pain. We found significant differential expression in long-intergenic ncRNAs (32 lincRNAs in SN, and 8 in DRG, antisense RNA (31 asRNA in SN, and 12 in DRG and pseudogenes (456 in SN, 56 in DRG. In particular, contextual analysis centered on pseudogenes revealed some targets with known association to neurodegeneration and/or neurogenesis processes. While modules of the olfactory receptors were clearly

  1. A Novel Polymorphism of VLDLR Signal Peptide Coding Region and Its Association with Growth and Abdominal Fat Traits of Gaoyou Domestic Ducks

    Directory of Open Access Journals (Sweden)

    C Ming-liang

    Full Text Available ABSTRACT The VLDLR gene plays important roles in the growth and adiposity in humans and mice. The purpose of this study was to investigate the relationship between VLDLR gene genetic polymorphisms and growth and abdominal fat traits of the Gaoyou domestic duck. A total of 267 Gaoyou ducks were employed for testing. A 18bp deletion was identified in VLDLR signal peptide coding region. The results of c2 test suggested that the genotype frequencies of VLDLR signal peptide coding region were not in Hardy-Weinberg equilibrium. Least squares analysis showed that body weight (BW of -18bp/-18bp genotype ducks was significantly higher than those of other genotypes from six (BW6 (p0.05 and body weight for AFP and different genotypes had a significant effect on AFP (p<0.05. The results of Bonferroni t-test revealed that the abdominal fat percentage (AFP of -18bp/-18bp genotype was significantly lower than those of +18bp/-18bp (p<0.05. Preliminary studies have shown that VLDLR may be a candidate gene for the selection for growth and abdominal fat, and the results of the present study indicate that VLDLR strongly influences carcass abdominal fat content of Gaoyou ducks.

  2. Histone modification profiles are predictive for tissue/cell-type specific expression of both protein-coding and microRNA genes

    Directory of Open Access Journals (Sweden)

    Zhang Michael Q

    2011-05-01

    Full Text Available Abstract Background Gene expression is regulated at both the DNA sequence level and through modification of chromatin. However, the effect of chromatin on tissue/cell-type specific gene regulation (TCSR is largely unknown. In this paper, we present a method to elucidate the relationship between histone modification/variation (HMV and TCSR. Results A classifier for differentiating CD4+ T cell-specific genes from housekeeping genes using HMV data was built. We found HMV in both promoter and gene body regions to be predictive of genes which are targets of TCSR. For example, the histone modification types H3K4me3 and H3K27ac were identified as the most predictive for CpG-related promoters, whereas H3K4me3 and H3K79me3 were the most predictive for nonCpG-related promoters. However, genes targeted by TCSR can be predicted using other type of HMVs as well. Such redundancy implies that multiple type of underlying regulatory elements, such as enhancers or intragenic alternative promoters, which can regulate gene expression in a tissue/cell-type specific fashion, may be marked by the HMVs. Finally, we show that the predictive power of HMV for TCSR is not limited to protein-coding genes in CD4+ T cells, as we successfully predicted TCSR targeted genes in muscle cells, as well as microRNA genes with expression specific to CD4+ T cells, by the same classifier which was trained on HMV data of protein-coding genes in CD4+ T cells. Conclusion We have begun to understand the HMV patterns that guide gene expression in both tissue/cell-type specific and ubiquitous manner.

  3. The in vitro transcription of a rainbow trout (Salmo gairdnerii) protamine gene. II. Controlled mutation of the cap site region.

    Science.gov (United States)

    Jankowski, J M; Dixon, G H

    1985-02-01

    A series of plasmids containing new fusion genes in which the trout protamine gene is placed under the control of the complete herpes virus (HSV-1) tk promoter Pvu II-Bgl II fragment (pM8), or a shortened thymidine kinase (tk) promoter in which the region between the TATA box and the cap site is altered by using the Pvu II-Mlu I fragment (pM7), have been constructed. An additional recombinant plasmid was constructed in which the Bgl II-Ava II fragment of the protamine gene containing the entire protamine promoter but missing the protamine coding region was cloned into pBR322 between the Xho II 1666 and Hind III sites (pP5). For in vitro transcription, a HeLa cell lysate system was prepared and the RNA transcription products, after glyoxalation, were electrophoretically analyzed on 5% polyacrylamide gels. In constructing pM8 the DNA sequence between the tk promoter and the cap site was present while in pM7 it was deleted. Similar multiple transcripts were seen in both cases, indicating that the region between the promoter and the cap site has no effect upon transcription in vitro. The multiple transcripts appear to be due to the presence of a cryptic promoter in the complementary strand of the protamine gene. The activity of this cryptic promoter has been confirmed by comparison of the transcription of plasmid pP5, in which the protamine mRNA coding region has been deleted, with a previously described plasmid, pJBRP (Jankowski JM and Dixon GH (1984) Can. J. Biochem. Cell. Biol. 62, 291-300), containing the intact protamine gene.

  4. A human-specific de novo protein-coding gene associated with human brain functions.

    Directory of Open Access Journals (Sweden)

    Chuan-Yun Li

    2010-03-01

    Full Text Available To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203. Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer's brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

  5. Discrete Ramanujan transform for distinguishing the protein coding regions from other regions.

    Science.gov (United States)

    Hua, Wei; Wang, Jiasong; Zhao, Jian

    2014-01-01

    Based on the study of Ramanujan sum and Ramanujan coefficient, this paper suggests the concepts of discrete Ramanujan transform and spectrum. Using Voss numerical representation, one maps a symbolic DNA strand as a numerical DNA sequence, and deduces the discrete Ramanujan spectrum of the numerical DNA sequence. It is well known that of discrete Fourier power spectrum of protein coding sequence has an important feature of 3-base periodicity, which is widely used for DNA sequence analysis by the technique of discrete Fourier transform. It is performed by testing the signal-to-noise ratio at frequency N/3 as a criterion for the analysis, where N is the length of the sequence. The results presented in this paper show that the property of 3-base periodicity can be only identified as a prominent spike of the discrete Ramanujan spectrum at period 3 for the protein coding regions. The signal-to-noise ratio for discrete Ramanujan spectrum is defined for numerical measurement. Therefore, the discrete Ramanujan spectrum and the signal-to-noise ratio of a DNA sequence can be used for distinguishing the protein coding regions from the noncoding regions. All the exon and intron sequences in whole chromosomes 1, 2, 3 and 4 of Caenorhabditis elegans have been tested and the histograms and tables from the computational results illustrate the reliability of our method. In addition, we have analyzed theoretically and gotten the conclusion that the algorithm for calculating discrete Ramanujan spectrum owns the lower computational complexity and higher computational accuracy. The computational experiments show that the technique by using discrete Ramanujan spectrum for classifying different DNA sequences is a fast and effective method. Copyright © 2014 Elsevier Ltd. All rights reserved.

  6. Sequence analysis of the 3’-untranslated region of HSP70 (type I genes in the genus Leishmania: its usefulness as a molecular marker for species identification

    Directory of Open Access Journals (Sweden)

    Requena Jose M

    2012-04-01

    Full Text Available Abstract Background The Leishmaniases are a group of clinically diverse diseases caused by parasites of the genus Leishmania. To distinguish between species is crucial for correct diagnosis and prognosis as well as for treatment decisions. Recently, sequencing of the HSP70 coding region has been applied in phylogenetic studies and for identifying of Leishmania species with excellent results. Methods In the present study, we analyzed the 3’-untranslated region (UTR of Leishmania HSP70-type I gene from 24 strains representing eleven Leishmania species in the belief that this non-coding region would have a better discriminatory capacity for species typing than coding regions. Results It was observed that there was a remarkable degree of sequence conservation in this region, even between species of the subgenus Leishmania and Viannia. In addition, the presence of many microsatellites was a common feature of the 3´-UTR of HSP70-I genes in the Leishmania genus. Finally, we constructed dendrograms based on global sequence alignments of the analyzed Leishmania species and strains, the results indicated that this particular region of HSP70 genes might be useful for species (or species complex typing, improving for particular species the discrimination capacity of phylogenetic trees based on HSP70 coding sequences. Given the large size variation of the analyzed region between the Leishmania and Viannia subgenera, direct visualization of the PCR amplification product would allow discrimination between subgenera, and a HaeIII-PCR-RFLP analysis might be used for differentiating some species within each subgenera. Conclusions Sequence and phylogenetic analyses indicated that this region, which is readily amplified using a single pair of primers from both Old and New World Leishmania species, might be useful as a molecular marker for species discrimination.

  7. Molecular cloning and construction of the coding region for human acetylcholinesterase reveals a G + C-rich attenuating structure

    International Nuclear Information System (INIS)

    Soreq, H.; Ben-Aziz, R.; Prody, C.A.; Seidman, S.; Gnatt, A.; Neville, L.; Lieman-Hurwitz, J.; Lev-Lehman, E.; Ginzberg, D.; Lapidot-Lifson, Y.; Zakut, H.

    1990-01-01

    To study the primary structure of human acetylcholinesterase and its gene expression and amplification, cDNA libraries from human tissues expressing oocyte-translatable AcChoEase mRNA were constructed and screened with labeled oligodeoxynucleotide probes. Several cDNA clones were isolated that encoded a polypeptide with ≥50% identically aligned amino acids to Torpedo AcChoEase and human butyrylcholinesterase. However, these cDNA clones were all truncated within a 300-nucleotide-long G + C-rich region with a predicted pattern of secondary structure having a high Gibbs free energy downstream from the expected 5' end of the coding region. Screening of a genomic DNA library revealed the missing 5' domain. When ligated to the cDNA and constructed into a transcription vector, this sequence encoded a synthetic mRNA translated in microinjected oocytes into catalytically active AcChoEase with marked preference for acetylthiocholine over butyrylthiocholine as a substrate, susceptibility to inhibition by the AcChoEase inhibitor BW284C51, and resistance to the AcChoEase inhibitor tetraisopropylpyrophosphoramide. Blot hybridization of genomic DNA from different individuals carrying amplified AcChoEase genes revealed variable intensities and restriction patterns with probes from the regions upstream and downstream from the predicted G + C-rich structure. Thus, the human AcChoEase gene includes a putative G + C-rich attenuator domain and is subject to structural alterations in cases of AcChoEase gene amplification

  8. Block-based wavelet transform coding of mammograms with region-adaptive quantization

    Science.gov (United States)

    Moon, Nam Su; Song, Jun S.; Kwon, Musik; Kim, JongHyo; Lee, ChoongWoong

    1998-06-01

    To achieve both high compression ratio and information preserving, it is an efficient way to combine segmentation and lossy compression scheme. Microcalcification in mammogram is one of the most significant sign of early stage of breast cancer. Therefore in coding, detection and segmentation of microcalcification enable us to preserve it well by allocating more bits to it than to other regions. Segmentation of microcalcification is performed both in spatial domain and in wavelet transform domain. Peak error controllable quantization step, which is off-line designed, is suitable for medical image compression. For region-adaptive quantization, block- based wavelet transform coding is adopted and different peak- error-constrained quantizers are applied to blocks according to the segmentation result. In view of preservation of microcalcification, the proposed coding scheme shows better performance than JPEG.

  9. Atypical DNA methylation of genes encoding cysteine-rich peptides in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    You Wanhui

    2012-04-01

    Full Text Available Abstract Background In plants, transposons and non-protein-coding repeats are epigenetically silenced by CG and non-CG methylation. This pattern of methylation is mediated in part by small RNAs and two specialized RNA polymerases, termed Pol IV and Pol V, in a process called RNA-directed DNA methylation. By contrast, many protein-coding genes transcribed by Pol II contain in their gene bodies exclusively CG methylation that is independent of small RNAs and Pol IV/Pol V activities. It is unclear how the different methylation machineries distinguish between transposons and genes. Here we report on a group of atypical genes that display in their coding region a transposon-like methylation pattern, which is associated with gene silencing in sporophytic tissues. Results We performed a methylation-sensitive amplification polymorphism analysis to search for targets of RNA-directed DNA methylation in Arabidopsis thaliana and identified several members of a gene family encoding cysteine-rich peptides (CRPs. In leaves, the CRP genes are silent and their coding regions contain dense, transposon-like methylation in CG, CHG and CHH contexts, which depends partly on the Pol IV/Pol V pathway and small RNAs. Methylation in the coding region is reduced, however, in the synergid cells of the female gametophyte, where the CRP genes are specifically expressed. Further demonstrating that expressed CRP genes lack gene body methylation, a CRP4-GFP fusion gene under the control of the constitutive 35 S promoter remains unmethylated in leaves and is transcribed to produce a translatable mRNA. By contrast, a CRP4-GFP fusion gene under the control of a CRP4 promoter fragment acquires CG and non-CG methylation in the CRP coding region in leaves similar to the silent endogenous CRP4 gene. Conclusions Unlike CG methylation in gene bodies, which does not dramatically affect Pol II transcription, combined CG and non-CG methylation in CRP coding regions is likely to

  10. Sensitivity Study of Regional TDC in MATRA-S code Using PSBT Benchmark Exercise

    International Nuclear Information System (INIS)

    Kim, Seong Jin; Cha, Jeong Hun; Seo, Kyong Won; Kwon, Hyuk; Hwang, Dae Hyun

    2012-01-01

    In the sub-channel analysis code, the modeling of interchannel exchanges between adjacent sub-channels expressed as diversion cross flow, turbulent mixing and so on. The turbulent mixing in MATRA-S code is considered as TDC( β : thermal diffusion coefficient). The TDC becomes different according to the bundle, grid type, mixing vane, and so on. Generally, the thermal mixing test is conducted to optimize the TDC. In the OECD/NRC PSBT benchmark, the thermal mixing test was conducted and the optimized TDC was analyzed using MATRA-S code. It was shown that the exit temperature distribution of MATRA-S code was different from an experimental result even though the optimized TDC was applied to the code. In this study, concept of the regional TDC was introduced and sensitivity analysis of the regional TDC was presented

  11. The PPARγ coding region and its role in visceral obesity

    International Nuclear Information System (INIS)

    Boon Yin, Khoo; Najimudin, Nazalan; Muhammad, Tengku Sifzizul Tengku

    2008-01-01

    Peroxisome proliferator-activated receptor gamma (PPARγ) is a ligand activated transcription factor, plays many essential roles of biological function in higher organisms. The PPARγ is mainly expressed in adipose tissue. It regulates the transcriptional activity of genes by binding with other transcription factor. The PPARγ coding region has been found to be closest to that of monkey in ours and other research groups. Thus, monkey is a more suitable animal model for future PPARγ studying, although mice and rat are frequently being used. The PPARγ is involved in regulating alterations of adipose tissue masses result from changes in mature adipocyte size and/or number through a complex interplay process called adipogenesis. However, the role of PPARγ in negatively regulating the process of adipogenesis remains unclear. This review may help we investigate the differential expression of key transcription factor in adipose tissue in response to visceral obesity-induced diet in vivo. The study may also provide valuable information to define a more appropriate physiological condition in adipogenesis which may help to prevent diseases cause by negative regulation of the transcription factors in adipose tissue

  12. Identification of Differentially Expressed Genes through Integrated Study of Alzheimer's Disease Affected Brain Regions.

    Directory of Open Access Journals (Sweden)

    Nisha Puthiyedth

    identified the presence of 23 non-coding features, including four miRNA precursors (miR-7, miR570, miR-1229 and miR-6821, dysregulated across the brain regions. Furthermore, we compared our results with two popular meta-analysis methods RankProd and GeneMeta to validate our findings and performed a sensitivity analysis by removing one dataset at a time to assess the robustness of our results. These new findings may provide new insights into the disease mechanisms and thus make a significant contribution in the near future towards understanding, prevention and cure of AD.

  13. The low-recombining pericentromeric region of barley restricts gene diversity and evolution but not gene expression

    Science.gov (United States)

    Baker, Katie; Bayer, Micha; Cook, Nicola; Dreißig, Steven; Dhillon, Taniya; Russell, Joanne; Hedley, Pete E; Morris, Jenny; Ramsay, Luke; Colas, Isabelle; Waugh, Robbie; Steffenson, Brian; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J

    2014-01-01

    The low-recombining pericentromeric region of the barley genome contains roughly a quarter of the genes of the species, embedded in low-recombining DNA that is rich in repeats and repressive chromatin signatures. We have investigated the effects of pericentromeric region residency upon the expression, diversity and evolution of these genes. We observe no significant difference in average transcript level or developmental RNA specificity between the barley pericentromeric region and the rest of the genome. In contrast, all of the evolutionary parameters studied here show evidence of compromised gene evolution in this region. First, genes within the pericentromeric region of wild barley show reduced diversity and significantly weakened purifying selection compared with the rest of the genome. Second, gene duplicates (ohnolog pairs) derived from the cereal whole-genome duplication event ca. 60MYa have been completely eliminated from the barley pericentromeric region. Third, local gene duplication in the pericentromeric region is reduced by 29% relative to the rest of the genome. Thus, the pericentromeric region of barley is a permissive environment for gene expression but has restricted gene evolution in a sizeable fraction of barley's genes. PMID:24947331

  14. Trans-acting GC-rich non-coding RNA at var expression site modulates gene counting in malaria parasite.

    Science.gov (United States)

    Guizetti, Julien; Barcons-Simon, Anna; Scherf, Artur

    2016-11-16

    Monoallelic expression of the var multigene family enables immune evasion of the malaria parasite Plasmodium falciparum in its human host. At a given time only a single member of the 60-member var gene family is expressed at a discrete perinuclear region called the 'var expression site'. However, the mechanism of var gene counting remains ill-defined. We hypothesize that activation factors associating specifically with the expression site play a key role in this process. Here, we investigate the role of a GC-rich non-coding RNA (ncRNA) gene family composed of 15 highly homologous members. GC-rich genes are positioned adjacent to var genes in chromosome-central gene clusters but are absent near subtelomeric var genes. Fluorescence in situ hybridization demonstrates that GC-rich ncRNA localizes to the perinuclear expression site of central and subtelomeric var genes in trans. Importantly, overexpression of distinct GC-rich ncRNA members disrupts the gene counting process at the single cell level and results in activation of a specific subset of var genes in distinct clones. We identify the first trans-acting factor targeted to the elusive perinuclear var expression site and open up new avenues to investigate ncRNA function in antigenic variation of malaria and other protozoan pathogens. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. An evolutionary model for protein-coding regions with conserved RNA structure

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Forsberg, Roald; Meyer, Irmtraud Margret

    2004-01-01

    in the RNA structure. The overlap of these fundamental dependencies is sufficient to cause "contagious" context dependencies which cascade across many nucleotide sites. Such large-scale dependencies challenge the use of traditional phylogenetic models in evolutionary inference because they explicitly assume...... components of traditional phylogenetic models. We applied this to a data set of full-genome sequences from the hepatitis C virus where five RNA structures are mapped within the coding region. This allowed us to partition the effects of selection on different structural elements and to test various hypotheses......Here we present a model of nucleotide substitution in protein-coding regions that also encode the formation of conserved RNA structures. In such regions, apparent evolutionary context dependencies exist, both between nucleotides occupying the same codon and between nucleotides forming a base pair...

  16. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets.

    Science.gov (United States)

    Khan, Aziz; Mathelier, Anthony

    2017-05-31

    A common task for scientists relies on comparing lists of genes or genomic regions derived from high-throughput sequencing experiments. While several tools exist to intersect and visualize sets of genes, similar tools dedicated to the visualization of genomic region sets are currently limited. To address this gap, we have developed the Intervene tool, which provides an easy and automated interface for the effective intersection and visualization of genomic region or list sets, thus facilitating their analysis and interpretation. Intervene contains three modules: venn to generate Venn diagrams of up to six sets, upset to generate UpSet plots of multiple sets, and pairwise to compute and visualize intersections of multiple sets as clustered heat maps. Intervene, and its interactive web ShinyApp companion, generate publication-quality figures for the interpretation of genomic region and list sets. Intervene and its web application companion provide an easy command line and an interactive web interface to compute intersections of multiple genomic and list sets. They have the capacity to plot intersections using easy-to-interpret visual approaches. Intervene is developed and designed to meet the needs of both computer scientists and biologists. The source code is freely available at https://bitbucket.org/CBGR/intervene , with the web application available at https://asntech.shinyapps.io/intervene .

  17. Cloning and characterization of stress responsive Glp genes and their promotor regions from rice (abstract)

    International Nuclear Information System (INIS)

    Naqvi, S.M.S.; Mahmood, T.

    2005-01-01

    Plants respond to a number of environmental stimuli by modulating expression of genes. One such family of genes is now known as germin/germin-like protein genes (Glps). In order to detect any Glp gene response in rice, a pair of degenerate primers was designed based on consensus region from Glp sequences in Genbank. Using these primers a DNA fragment of about 550 bp was obtained by PCR amplification from genomic template. This 550 bp DNA was used as probe in Northern analysis. These studies provided evidence pointing to differential response of Glp expression to salt stress. RNA obtained from the roots was used for synthesis of cDNA. This cDNA was amplifiable with sense primer (RGLP1) from above mentioned pair and oligo-(dt) yielding a fragment of approx. 800 bp. Restriction analysis revealed that the PCR product was heterogeneous. After establishing that 800 bp fragment was the desired product, it was cloned in pCRII-TOPO. Five clones were picked up and analyzed by restriction analysis and sequencing. Two different Glp cDNAs were represented by these partial clones. Remaining sequence of the 5' end for clone 4 and 16 was obtained by Rapid Amplification of cDNA ends (RACE). The resultant sequences have been submitted to Genbank as Oryza sativa Rice Germin-like Protein 1 and 2 (osRGLP1 and 2). When full length genes corresponding to these sequences were amplified from genomic templates, resulting fragments were nearly 150 by larger than cDNAs. Cloning of structural genes for osRGLP1 revealed presence of a 162 bp intron in the coding region near 3' end. Preliminary evidence shows that expression of both osRGLP1 and 2 is severely reduced during salt stress. Another approach to establish both osRGLP1 and 2 genes involvement in stress tolerance is to study the ability of their promotor regions to drive expression of some reporter gene during stress. Promotor regions of about 1100 bp has been amplified and cloned and has been confirmed by restriction analysis and nested

  18. Implementation of the International Code of Marketing of Breastmilk Substitutes in the Eastern Mediterranean Region.

    Science.gov (United States)

    Al Jawaldeh, Ayoub; Sayed, Ghada

    2018-04-05

    Optimal breastfeeding practices and appropriate complementary feeding improve child health, survival and development. The countries of the Eastern Mediterranean Region have made significant strides in formulation and implementation of legislation to protect and promote breastfeeding based on The International Code of Marketing of Breast-milk Substitutes (the Code) and subsequent relevant World Health Assembly resolutions. To assess the implementation of the Code in the Region. Assessment was conducted by the World Health Organization (WHO) Regional Office for the Eastern Mediterranean using a WHO standard questionnaire. Seventeen countries in the Region have enacted legislation to protect breastfeeding. Only 6 countries have comprehensive legislation or other legal measures reflecting all or most provisions of the Code; 4 countries have legal measures incorporating many provisions of the Code; 7 countries have legal measures that contain a few provisions of the Code; 4 countries are currently studying the issue; and only 1 country has no measures in place. Further analysis of the legislation found that the text of articles in the laws fully reflected the Code articles in only 6 countries. Most countries need to revisit and amend existing national legislation to implement fully the Code and relevant World Health Assembly resolutions, supported by systematic monitoring and reporting. Copyright © World Health Organization (WHO) 2018. Some rights reserved. This work is available under the CC BY-NC-SA 3.0 IGO license (https://creativecommons.org/licenses/by-nc-sa/3.0/igo).

  19. Determining coding CpG islands by identifying regions significant for pattern statistics on Markov chains.

    Science.gov (United States)

    Singer, Meromit; Engström, Alexander; Schönhuth, Alexander; Pachter, Lior

    2011-09-23

    Recent experimental and computational work confirms that CpGs can be unmethylated inside coding exons, thereby showing that codons may be subjected to both genomic and epigenomic constraint. It is therefore of interest to identify coding CpG islands (CCGIs) that are regions inside exons enriched for CpGs. The difficulty in identifying such islands is that coding exons exhibit sequence biases determined by codon usage and constraints that must be taken into account. We present a method for finding CCGIs that showcases a novel approach we have developed for identifying regions of interest that are significant (with respect to a Markov chain) for the counts of any pattern. Our method begins with the exact computation of tail probabilities for the number of CpGs in all regions contained in coding exons, and then applies a greedy algorithm for selecting islands from among the regions. We show that the greedy algorithm provably optimizes a biologically motivated criterion for selecting islands while controlling the false discovery rate. We applied this approach to the human genome (hg18) and annotated CpG islands in coding exons. The statistical criterion we apply to evaluating islands reduces the number of false positives in existing annotations, while our approach to defining islands reveals significant numbers of undiscovered CCGIs in coding exons. Many of these appear to be examples of functional epigenetic specialization in coding exons.

  20. Co-expression of the Thermotoga neapolitana aglB gene with an upstream 3'-coding fragment of the malG gene improves enzymatic characteristics of recombinant AglB cyclomaltodextrinase.

    Science.gov (United States)

    Lunina, Natalia A; Agafonova, Elena V; Chekanovskaya, Lyudmila A; Dvortsov, Igor A; Berezina, Oksana V; Shedova, Ekaterina N; Kostrov, Sergey V; Velikodvorskaya, Galina A

    2007-07-01

    A cluster of Thermotoga neapolitana genes participating in starch degradation includes the malG gene of sugar transport protein and the aglB gene of cyclomaltodextrinase. The start and stop codons of these genes share a common overlapping sequence, aTGAtg. Here, we compared properties of expression products of three different constructs with aglB from T. neapolitana. The first expression vector contained the aglB gene linked to an upstream 90-bp 3'-terminal region of the malG gene with the stop codon overlapping with the start codon of aglB. The second construct included the isolated coding sequence of aglB with two tandem potential start codons. The expression product of this construct in Escherichia coli had two tandem Met residues at its N terminus and was characterized by low thermostability and high tendency to aggregate. In contrast, co-expression of aglB and the 3'-terminal region of malG (the first construct) resulted in AglB with only one N-terminal Met residue and a much higher specific activity of cyclomaltodextrinase. Moreover, the enzyme expressed by such a construct was more thermostable and less prone to aggregation. The third construct was the same as the second one except that it contained only one ATG start codon. The product of its expression had kinetic and other properties similar to those of the enzyme with only one N-terminal Met residue.

  1. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes.

    Science.gov (United States)

    Kumar, Dhirendra; Mondal, Anupam Kumar; Kutum, Rintu; Dash, Debasis

    2016-01-01

    Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are more in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies, their taxonomic distribution, and also urge for targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  2. Distinctive mitochondrial genome of Calanoid copepod Calanus sinicus with multiple large non-coding regions and reshuffled gene order: Useful molecular markers for phylogenetic and population studies

    Science.gov (United States)

    2011-01-01

    Background Copepods are highly diverse and abundant, resulting in extensive ecological radiation in marine ecosystems. Calanus sinicus dominates continental shelf waters in the northwest Pacific Ocean and plays an important role in the local ecosystem by linking primary production to higher trophic levels. A lack of effective molecular markers has hindered phylogenetic and population genetic studies concerning copepods. As they are genome-level informative, mitochondrial DNA sequences can be used as markers for population genetic studies and phylogenetic studies. Results The mitochondrial genome of C. sinicus is distinct from other arthropods owing to the concurrence of multiple non-coding regions and a reshuffled gene arrangement. Further particularities in the mitogenome of C. sinicus include low A + T-content, symmetrical nucleotide composition between strands, abbreviated stop codons for several PCGs and extended lengths of the genes atp6 and atp8 relative to other copepods. The monophyletic Copepoda should be placed within the Vericrustacea. The close affinity between Cyclopoida and Poecilostomatoida suggests reassigning the latter as subordinate to the former. Monophyly of Maxillopoda is rejected. Within the alignment of 11 C. sinicus mitogenomes, there are 397 variable sites harbouring three 'hotspot' variable sites and three microsatellite loci. Conclusion The occurrence of the circular subgenomic fragment during laboratory assays suggests that special caution should be taken when sequencing mitogenomes using long PCR. Such a phenomenon may provide additional evidence of mitochondrial DNA recombination, which appears to have been a prerequisite for shaping the present mitochondrial profile of C. sinicus during its evolution. The lack of synapomorphic gene arrangements among copepods has cast doubt on the utility of gene order as a useful molecular marker for deep phylogenetic analysis. However, mitochondrial genomic sequences have been valuable markers for

  3. Regional Atmospheric Transport Code for Hanford Emission Tracking, Version 2 (RATCHET2)

    International Nuclear Information System (INIS)

    Ramsdell, James V.; Rishel, Jeremy P.

    2006-01-01

    This manual describes the atmospheric model and computer code for the Atmospheric Transport Module within SAC. The Atmospheric Transport Module, called RATCHET2, calculates the time-integrated air concentration and surface deposition of airborne contaminants to the soil. The RATCHET2 code is an adaptation of the Regional Atmospheric Transport Code for Hanford Emissions Tracking (RATCHET). The original RATCHET code was developed to perform the atmospheric transport for the Hanford Environmental Dose Reconstruction Project. Fundamentally, the two sets of codes are identical; no capabilities have been deleted from the original version of RATCHET. Most modifications are generally limited to revision of the run-specification file to streamline the simulation process for SAC.

  4. Regional Atmospheric Transport Code for Hanford Emission Tracking, Version 2(RATCHET2)

    Energy Technology Data Exchange (ETDEWEB)

    Ramsdell, James V.; Rishel, Jeremy P.

    2006-07-01

    This manual describes the atmospheric model and computer code for the Atmospheric Transport Module within SAC. The Atmospheric Transport Module, called RATCHET2, calculates the time-integrated air concentration and surface deposition of airborne contaminants to the soil. The RATCHET2 code is an adaptation of the Regional Atmospheric Transport Code for Hanford Emissions Tracking (RATCHET). The original RATCHET code was developed to perform the atmospheric transport for the Hanford Environmental Dose Reconstruction Project. Fundamentally, the two sets of codes are identical; no capabilities have been deleted from the original version of RATCHET. Most modifications are generally limited to revision of the run-specification file to streamline the simulation process for SAC.

  5. Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes

    DEFF Research Database (Denmark)

    Lin, Michael F; Kheradpour, Pouya; Washietl, Stefan

    2011-01-01

    conservation compared to typical protein-coding genes—especially at synonymous sites. In this study, we use genome alignments of 29 placental mammals to systematically locate short regions within human ORFs that show conspicuously low estimated rates of synonymous substitution across these species. The 29......-species alignment provides statistical power to locate more than 10,000 such regions with resolution down to nine-codon windows, which are found within more than a quarter of all human protein-coding genes and contain ~2% of their synonymous sites. We collect numerous lines of evidence that the observed...... synonymous constraint in these regions reflects selection on overlapping functional elements including splicing regulatory elements, dual-coding genes, RNA secondary structures, microRNA target sites, and developmental enhancers. Our results show that overlapping functional elements are common in mammalian...

  6. Evaluation of the efficacy of twelve mitochondrial protein-coding genes as barcodes for mollusk DNA barcoding.

    Science.gov (United States)

    Yu, Hong; Kong, Lingfeng; Li, Qi

    2016-01-01

    In this study, we evaluated the efficacy of 12 mitochondrial protein-coding genes from 238 mitochondrial genomes of 140 molluscan species as potential DNA barcodes for mollusks. Three barcoding methods (distance, monophyly and character-based methods) were used in species identification. The species recovery rates based on genetic distances for the 12 genes ranged from 70.83 to 83.33%. There were no significant differences in intra- or interspecific variability among the 12 genes. The monophyly and character-based methods provided higher resolution than the distance-based method in species delimitation. Especially in closely related taxa, the character-based method showed some advantages. The results suggested that besides the standard COI barcode, other 11 mitochondrial protein-coding genes could also be potentially used as a molecular diagnostic for molluscan species discrimination. Our results also showed that the combination of mitochondrial genes did not enhance the efficacy for species identification and a single mitochondrial gene would be fully competent.

  7. Compositional gradients in Gramineae genes

    DEFF Research Database (Denmark)

    Wong, Gane Ka-Shu; Wang, Jun; Tao, Lin

    2002-01-01

    In this study, we describe a property of Gramineae genes, and perhaps all monocot genes, that is not observed in eudicot genes. Along the direction of transcription, beginning at the junction of the 5'-UTR and the coding region, there are gradients in GC content, codon usage, and amino-acid usage...

  8. Genome-wide analysis of regions similar to promoters of histone genes

    KAUST Repository

    Chowdhary, Rajesh

    2010-05-28

    Background: The purpose of this study is to: i) develop a computational model of promoters of human histone-encoding genes (shortly histone genes), an important class of genes that participate in various critical cellular processes, ii) use the model so developed to identify regions across the human genome that have similar structure as promoters of histone genes; such regions could represent potential genomic regulatory regions, e.g. promoters, of genes that may be coregulated with histone genes, and iii/ identify in this way genes that have high likelihood of being coregulated with the histone genes.Results: We successfully developed a histone promoter model using a comprehensive collection of histone genes. Based on leave-one-out cross-validation test, the model produced good prediction accuracy (94.1% sensitivity, 92.6% specificity, and 92.8% positive predictive value). We used this model to predict across the genome a number of genes that shared similar promoter structures with the histone gene promoters. We thus hypothesize that these predicted genes could be coregulated with histone genes. This hypothesis matches well with the available gene expression, gene ontology, and pathways data. Jointly with promoters of the above-mentioned genes, we found a large number of intergenic regions with similar structure as histone promoters.Conclusions: This study represents one of the most comprehensive computational analyses conducted thus far on a genome-wide scale of promoters of human histone genes. Our analysis suggests a number of other human genes that share a high similarity of promoter structure with the histone genes and thus are highly likely to be coregulated, and consequently coexpressed, with the histone genes. We also found that there are a large number of intergenic regions across the genome with their structures similar to promoters of histone genes. These regions may be promoters of yet unidentified genes, or may represent remote control regions that

  9. FunGene: the functional gene pipeline and repository.

    Science.gov (United States)

    Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

    2013-01-01

    Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  10. FunGene: the Functional Gene Pipeline and Repository

    Directory of Open Access Journals (Sweden)

    Jordan A. Fish

    2013-10-01

    Full Text Available Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer.While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/ offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.

  11. Benchmarking of gene prediction programs for metagenomic data.

    Science.gov (United States)

    Yok, Non; Rosen, Gail

    2010-01-01

    This manuscript presents the most rigorous benchmarking of gene annotation algorithms for metagenomic datasets to date. We compare three different programs: GeneMark, MetaGeneAnnotator (MGA) and Orphelia. The comparisons are based on their performances over simulated fragments from one hundred species of diverse lineages. We defined four different types of fragments; two types come from the inter- and intra-coding regions and the other types are from the gene edges. Hoff et al. used only 12 species in their comparison; therefore, their sample is too small to represent an environmental sample. Also, no predecessors has separately examined fragments that contain gene edges as opposed to intra-coding regions. General observations in our results are that performances of all these programs improve as we increase the length of the fragment. On the other hand, intra-coding fragments of our data show low annotation error in all of the programs if compared to the gene edge fragments. Overall, we found an upper-bound performance by combining all the methods.

  12. Mechanisms of radiation-induced gene responses

    International Nuclear Information System (INIS)

    Woloschak, G.E.; Paunesku, T.

    1996-01-01

    In the process of identifying genes differentially expressed in cells exposed ultraviolet radiation, we have identified a transcript having a 26-bp region that is highly conserved in a variety of species including Bacillus circulans, yeast, pumpkin, Drosophila, mouse, and man. When the 5' region (flanking region or UTR) of a gene, the sequence is predominantly in +/+ orientation with respect to the coding DNA strand; while in the coding region and the 3' region (UTR), the sequence is most frequently in the +/-orientation with respect to the coding DNA strand. In two genes, the element is split into two parts; however, in most cases, it is found only once but with a minimum of 11 consecutive nucleotides precisely depicting the original sequence. The element is found in a large number of different genes with diverse functions (from human ras p21 to B. circulans chitonase). Gel shift assays demonstrated the presence of a protein in HeLa cell extracts that binds to the sense and antisense single-stranded consensus oligomers, as well as to the double- stranded oligonucleotide. When double-stranded oligomer was used, the size shift demonstrated as additional protein-oligomer complex larger than the one bound to either sense or antisense single-stranded consensus oligomers alone. It is speculated either that this element binds to protein(s) important in maintaining DNA is a single-stranded orientation for transcription or, alternatively that this element is important in the transcription-coupled DNA repair process

  13. Natural selection in avian protein-coding genes expressed in brain.

    Science.gov (United States)

    Axelsson, Erik; Hultin-Rosenberg, Lina; Brandström, Mikael; Zwahlén, Martin; Clayton, David F; Ellegren, Hans

    2008-06-01

    The evolution of birds from theropod dinosaurs took place approximately 150 million years ago, and was associated with a number of specific adaptations that are still evident among extant birds, including feathers, song and extravagant secondary sexual characteristics. Knowledge about the molecular evolutionary background to such adaptations is lacking. Here, we analyse the evolution of > 5000 protein-coding gene sequences expressed in zebra finch brain by comparison to orthologous sequences in chicken. Mean d(N)/d(S) is 0.085 and genes with their maximal expression in the eye and central nervous system have the lowest mean d(N)/d(S) value, while those expressed in digestive and reproductive tissues exhibit the highest. We find that fast-evolving genes (those which have higher than expected rate of nonsynonymous substitution, indicative of adaptive evolution) are enriched for biological functions such as fertilization, muscle contraction, defence response, response to stress, wounding and endogenous stimulus, and cell death. After alignment to mammalian orthologues, we identify a catalogue of 228 genes that show a significantly higher rate of protein evolution in the two bird lineages than in mammals. These accelerated bird genes, representing candidates for avian-specific adaptations, include genes implicated in vocal learning and other cognitive processes. Moreover, colouration genes evolve faster in birds than in mammals, which may have been driven by sexual selection for extravagant plumage characteristics.

  14. Orion: Detecting regions of the human non-coding genome that are intolerant to variation using population genetics.

    Science.gov (United States)

    Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B

    2017-01-01

    There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.

  15. Paralogous Genes as a Tool to Study the Regulation of Gene Expression

    DEFF Research Database (Denmark)

    Hoffmann, Robert D

    The genomes of plants are marked by reoccurring events of whole-genome duplication. These events are major contributors to speciation and provide the genetic material for organisms to evolve ever greater complexity. Duplicated genes, referred to as paralogs, may be retained because they acquired...... regions. These results suggest that a concurrent purifying selection acts on coding and non-coding sequences of paralogous genes in A. thaliana. Mutational analyses of the promoters from a paralogous gene pair were performed in transgenic A. thaliana plants. The results revealed a 170-bp long DNA sequence...... that forms a bifunctional cis-regulatory module; it represses gene expression in the sporophyte while activating it in pollen. This finding is important for many aspects of gene regulation and the transcriptional changes underlying gametophyte development. In conclusion, the presented thesis suggests that...

  16. Self-complementary circular codes in coding theory.

    Science.gov (United States)

    Fimmel, Elena; Michel, Christian J; Starman, Martin; Strüngmann, Lutz

    2018-04-01

    Self-complementary circular codes are involved in pairing genetic processes. A maximal [Formula: see text] self-complementary circular code X of trinucleotides was identified in genes of bacteria, archaea, eukaryotes, plasmids and viruses (Michel in Life 7(20):1-16 2017, J Theor Biol 380:156-177, 2015; Arquès and Michel in J Theor Biol 182:45-58 1996). In this paper, self-complementary circular codes are investigated using the graph theory approach recently formulated in Fimmel et al. (Philos Trans R Soc A 374:20150058, 2016). A directed graph [Formula: see text] associated with any code X mirrors the properties of the code. In the present paper, we demonstrate a necessary condition for the self-complementarity of an arbitrary code X in terms of the graph theory. The same condition has been proven to be sufficient for codes which are circular and of large size [Formula: see text] trinucleotides, in particular for maximal circular codes ([Formula: see text] trinucleotides). For codes of small-size [Formula: see text] trinucleotides, some very rare counterexamples have been constructed. Furthermore, the length and the structure of the longest paths in the graphs associated with the self-complementary circular codes are investigated. It has been proven that the longest paths in such graphs determine the reading frame for the self-complementary circular codes. By applying this result, the reading frame in any arbitrary sequence of trinucleotides is retrieved after at most 15 nucleotides, i.e., 5 consecutive trinucleotides, from the circular code X identified in genes. Thus, an X motif of a length of at least 15 nucleotides in an arbitrary sequence of trinucleotides (not necessarily all of them belonging to X) uniquely defines the reading (correct) frame, an important criterion for analyzing the X motifs in genes in the future.

  17. Analysis of Canis mitochondrial DNA demonstrates high concordance between the control region and ATPase genes

    Directory of Open Access Journals (Sweden)

    White Bradley N

    2010-07-01

    Full Text Available Abstract Background Phylogenetic studies of wild Canis species have relied heavily on the mitochondrial DNA control region (mtDNA CR to infer species relationships and evolutionary lineages. Previous analyses of the CR provided evidence for a North American evolved eastern wolf (C. lycaon, that is more closely related to red wolves (C. rufus and coyotes (C. latrans than grey wolves (C. lupus. Eastern wolf origins, however, continue to be questioned. Therefore, we analyzed mtDNA from 89 wolves and coyotes across North America and Eurasia at 347 base pairs (bp of the CR and 1067 bp that included the ATPase6 and ATPase8 genes. Phylogenies and divergence estimates were used to clarify the evolutionary history of eastern wolves, and regional comparisons of nonsynonomous to synonomous substitutions (dN/dS at the ATPase6 and ATPase8 genes were used to elucidate the potential role of selection in shaping mtDNA geographic distribution. Results We found high concordance across analyses between the mtDNA regions studied. Both had a high percentage of variable sites (CR = 14.6%; ATP = 9.7% and both phylogenies clustered eastern wolf haplotypes monophyletically within a North American evolved lineage apart from coyotes. Divergence estimates suggest the putative red wolf sequence is more closely related to coyotes (DxyCR = 0.01982 ± 0.00494 SD; DxyATP = 0.00332 ± 0.00097 SD than the eastern wolf sequences (DxyCR = 0.03047 ± 0.00664 SD; DxyATP = 0.00931 ± 0.00205 SD. Neutrality tests on both genes were indicative of the population expansion of coyotes across eastern North America, and dN/dS ratios suggest a possible role for purifying selection in the evolution of North American lineages. dN/dS ratios were higher in European evolved lineages from northern climates compared to North American evolved lineages from temperate regions, but these differences were not statistically significant. Conclusions These results demonstrate high concordance between coding

  18. Regional Atmospheric Transport Code for Hanford Emission Tracking (RATCHET)

    International Nuclear Information System (INIS)

    Ramsdell, J.V. Jr.; Simonen, C.A.; Burk, K.W.

    1994-02-01

    The purpose of the Hanford Environmental Dose Reconstruction (HEDR) Project is to estimate radiation doses that individuals may have received from operations at the Hanford Site since 1944. This report deals specifically with the atmospheric transport model, Regional Atmospheric Transport Code for Hanford Emission Tracking (RATCHET). RATCHET is a major rework of the MESOILT2 model used in the first phase of the HEDR Project; only the bookkeeping framework escaped major changes. Changes to the code include (1) significant changes in the representation of atmospheric processes and (2) incorporation of Monte Carlo methods for representing uncertainty in input data, model parameters, and coefficients. To a large extent, the revisions to the model are based on recommendations of a peer working group that met in March 1991. Technical bases for other portions of the atmospheric transport model are addressed in two other documents. This report has three major sections: a description of the model, a user's guide, and a programmer's guide. These sections discuss RATCHET from three different perspectives. The first provides a technical description of the code with emphasis on details such as the representation of the model domain, the data required by the model, and the equations used to make the model calculations. The technical description is followed by a user's guide to the model with emphasis on running the code. The user's guide contains information about the model input and output. The third section is a programmer's guide to the code. It discusses the hardware and software required to run the code. The programmer's guide also discusses program structure and each of the program elements

  19. Functional mitochondrial ATP synthase proteolipid gene produced by recombination of parental genes in a petunia somatic hybrid

    International Nuclear Information System (INIS)

    Rothenberg, M.; Hanson, M.R.

    1988-01-01

    A novel ATP synthase subunit 9 gene (atp9) was identified in the mitochondrial genome of a Petunia somatic hybrid line (13-133) which was produced from a fusion between Petunia lines 3688 and 3704. The novel gene was generated by intergenomic recombination between atp9 genes from the two parental plant lines. The entire atp9 coding region is represented on the recombinant gene. Comparison of gene sequences using electrophoresis and autoradiography, indicate that the 5' transcribed region is contributed by an atp9 gene from 3704 and the 3' transcribed region is contributed by an atp9 gene from 3688. The recombinant atp9 gene is transcriptionally active. The location of the 5' and 3' transcript termini are conserved with respect to the parental genes, resulting in the production of hybrid transcripts

  20. Complete re-sequencing of a 2Mb topological domain encompassing the FTO/IRXB genes identifies a novel obesity-associated region upstream of IRX5

    DEFF Research Database (Denmark)

    Hunt, Lilian E; Noyvert, Boris; Bhaw-Rosun, Leena

    2015-01-01

    BACKGROUND: Association studies have identified a number of loci that contribute to an increased body mass index (BMI), the strongest of which is in the first intron of the FTO gene on human chromosome 16q12.2. However, this region is both non-coding and under strong linkage disequilibrium, making...... it recalcitrant to functional interpretation. Furthermore, the FTO gene is located within a complex cis-regulatory landscape defined by a topologically associated domain that includes the IRXB gene cluster, a trio of developmental regulators. Consequently, at least three genes in this interval have been...... implicated in the aetiology of obesity. METHODS: Here, we sequence a 2 Mb region encompassing the FTO, RPGRIP1L and IRXB cluster genes in 284 individuals from a well-characterised study group of Danish men containing extremely overweight young adults and controls. We further replicate our findings both...

  1. RGmatch: matching genomic regions to proximal genes in omics data integration

    Directory of Open Access Journals (Sweden)

    Pedro Furió-Tarí

    2016-11-01

    Full Text Available Abstract Background The integrative analysis of multiple genomics data often requires that genome coordinates-based signals have to be associated with proximal genes. The relative location of a genomic region with respect to the gene (gene area is important for functional data interpretation; hence algorithms that match regions to genes should be able to deliver insight into this information. Results In this work we review the tools that are publicly available for making region-to-gene associations. We also present a novel method, RGmatch, a flexible and easy-to-use Python tool that computes associations either at the gene, transcript, or exon level, applying a set of rules to annotate each region-gene association with the region location within the gene. RGmatch can be applied to any organism as long as genome annotation is available. Furthermore, we qualitatively and quantitatively compare RGmatch to other tools. Conclusions RGmatch simplifies the association of a genomic region with its closest gene. At the same time, it is a powerful tool because the rules used to annotate these associations are very easy to modify according to the researcher’s specific interests. Some important differences between RGmatch and other similar tools already in existence are RGmatch’s flexibility, its wide range of user options, compatibility with any annotatable organism, and its comprehensive and user-friendly output.

  2. Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

    Directory of Open Access Journals (Sweden)

    Inês C Conceição

    Full Text Available BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes. CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1 the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2 the high

  3. Cis-regulatory somatic mutations and gene-expression alteration in B-cell lymphomas.

    Science.gov (United States)

    Mathelier, Anthony; Lefebvre, Calvin; Zhang, Allen W; Arenillas, David J; Ding, Jiarui; Wasserman, Wyeth W; Shah, Sohrab P

    2015-04-23

    With the rapid increase of whole-genome sequencing of human cancers, an important opportunity to analyze and characterize somatic mutations lying within cis-regulatory regions has emerged. A focus on protein-coding regions to identify nonsense or missense mutations disruptive to protein structure and/or function has led to important insights; however, the impact on gene expression of mutations lying within cis-regulatory regions remains under-explored. We analyzed somatic mutations from 84 matched tumor-normal whole genomes from B-cell lymphomas with accompanying gene expression measurements to elucidate the extent to which these cancers are disrupted by cis-regulatory mutations. We characterize mutations overlapping a high quality set of well-annotated transcription factor binding sites (TFBSs), covering a similar portion of the genome as protein-coding exons. Our results indicate that cis-regulatory mutations overlapping predicted TFBSs are enriched in promoter regions of genes involved in apoptosis or growth/proliferation. By integrating gene expression data with mutation data, our computational approach culminates with identification of cis-regulatory mutations most likely to participate in dysregulation of the gene expression program. The impact can be measured along with protein-coding mutations to highlight key mutations disrupting gene expression and pathways in cancer. Our study yields specific genes with disrupted expression triggered by genomic mutations in either the coding or the regulatory space. It implies that mutated regulatory components of the genome contribute substantially to cancer pathways. Our analyses demonstrate that identifying genomically altered cis-regulatory elements coupled with analysis of gene expression data will augment biological interpretation of mutational landscapes of cancers.

  4. Genes encoding two lipoproteins in the leuS-dacA region of the Escherichia coli chromosome

    International Nuclear Information System (INIS)

    Takase, I.; Ishino, F.; Wachi, M.; Kamata, H.; Doi, M.; Asoh, S.; Matsuzawa, H.; Ohta, T.; Matsuhashi, M.

    1987-01-01

    The coding of two rare lipoproteins by two genes, rlpA and rlpB, located in the leuS-dacA region (15 min) on the Escherichia coli chromosome was demonstrated by expression of subcloned genes in a maxicell system. The formation of these two proteins was inhibited by globomycin, which is an inhibitor of the signal peptidase for the known lipoproteins of E. coli. In each case, this inhibition was accompanied by formation of a new protein, which showed a slightly lower mobility on sodium dodecyl sulfate-polyacrylamide gel electrophoresis and which we suppose to be a prolipoprotein with an N-terminal signal peptide sequence similar to those of the bacterial major lipoproteins and lysis proteins of some bacteriocins. The incorporation of 3 H-labeled palmitate and glycerol into the two lipoproteins was also observed. Sequencing of DNA showed that the two lipoprotein genes contained sequences that could code for signal peptide sequences of 17 amino acids (rlpA lipoprotein) and 18 amino acids (rlpB lipoprotein). The deduced sequences of the mature peptides consisted of 345 amino acids (M/sub r/ 35,615, rlpA lipoprotein) and 175 amino acids (M/sub r/ 19,445, rlpB lipoprotein), with an N-terminal cysteine to which thioglyceride and N-fatty acyl residues may be attached. These two lioproteins may be important in duplication of the cells

  5. Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region

    International Nuclear Information System (INIS)

    Riley, L.K.; Morrow, J.K.; Danton, M.J.; Coleman, M.S.

    1988-01-01

    Human terminal deoxyribonucleotidyltransferase cDNA contains an open reading frame of 1530 base pairs (bp) corresponding to a protein containing 510 amino acids. The encoded protein is a template-independent DNA polymerase found only in a restricted population of normal and malignant prelymphocytes. To begin to investigate the genetic elements responsible for the tissue-specific expression of terminal deoxyribonucleotidyltransferase, genomic clones, containing the entire human gene were isolated and characterized. Initially, cDNA clones were isolated from a library generated from the human lymphoblastoid cell line, MOLT-4R. A cDNA clone containing the entire coding region of the protein was used to isolate a series of overlapping clones from two human genomic libraries. The gene comprises 11 exons and 10 introns and spans 49.4 kilobases. The 5' flanking region (709 bp) including exon 1 was sequenced. Several putative transcription initiation sites were mapped. Within 500 nucleotides of the translation start site, a series of promoter elements was detected. TATA and CAAT sequences, respectively, were found to start at nucleotides -185 and -204, -328 and -370, and -465 and -505. Start sites were found for a cyclic AMP-dependent promoter analog at nucleotide -121, an eight-base sequence corresponding to the IgG promoter enhancer (cd) at nucleotide -455, and an analog of the IgG promoter (pd) at nucleotide -159. These findings suggest that transcripts coding for terminal deoxyribonucleotidyltransferase may be variable in length and that transcription may be influenced by a variety of genetic elements

  6. Nucleotide sequences of the Erwinia chrysanthemi ogl and pelE genes negatively regulated by the kdgR gene product.

    Science.gov (United States)

    Reverchon, S; Huang, Y; Bourson, C; Robert-Baudouy, J

    1989-12-21

    The nucleotide sequences of the coding and regulatory regions of the genes encoding oligoglacturonate lyase (OGL) and pectate lyase e isoenzyme (PLe) from Erwinia chrysanthemi 3937 were determined. The ogl sequence contains an open reading frame (ORF) of 1164 bp coding for a 388-amino acid (aa) polypeptide with a predicted Mr of 44,124. A possible transcriptional start signal showing homology with the Escherichia coli promoter consensus sequence was detected. In addition, a sequence 3' to the coding region was found to be able to form a secondary structure which may function as an Rho-independent transcriptional termination signal. For the pelE sequence, a long ORF of 1212 bp coding for a 404-aa polypeptide was detected. PLe is secreted into the external medium by E. chrysanthemi, and a potential signal peptide sequence was identified in the pelE gene. In the 5' upstream pelE coding region, a putative promoter resembling E. coli promoter consensus sequences was detected. Furthermore, the region immediately 3' to the pelE translational stop codon may function as an Rho-independent translational termination signal. In strain 3937, the synthesis of OGL and PLe, as well as the other enzymes involved in the pectin-degradative pathway (particularly the kdgT product), are known to be regulated by the KdgR repressor, which mediates galacturonate and polygalacturonate induction. Synthesis of these enzymes is also regulated by the CRP-cAMP complex which mediates catabolite repression. Analysis of the regulatory regions of ogl and pelE allowed us to identify possible CRP-binding sites for these two genes.(ABSTRACT TRUNCATED AT 250 WORDS)

  7. Combining gene prediction methods to improve metagenomic gene annotation

    Directory of Open Access Journals (Sweden)

    Rosen Gail L

    2011-01-01

    Full Text Available Abstract Background Traditional gene annotation methods rely on characteristics that may not be available in short reads generated from next generation technology, resulting in suboptimal performance for metagenomic (environmental samples. Therefore, in recent years, new programs have been developed that optimize performance on short reads. In this work, we benchmark three metagenomic gene prediction programs and combine their predictions to improve metagenomic read gene annotation. Results We not only analyze the programs' performance at different read-lengths like similar studies, but also separate different types of reads, including intra- and intergenic regions, for analysis. The main deficiencies are in the algorithms' ability to predict non-coding regions and gene edges, resulting in more false-positives and false-negatives than desired. In fact, the specificities of the algorithms are notably worse than the sensitivities. By combining the programs' predictions, we show significant improvement in specificity at minimal cost to sensitivity, resulting in 4% improvement in accuracy for 100 bp reads with ~1% improvement in accuracy for 200 bp reads and above. To correctly annotate the start and stop of the genes, we find that a consensus of all the predictors performs best for shorter read lengths while a unanimous agreement is better for longer read lengths, boosting annotation accuracy by 1-8%. We also demonstrate use of the classifier combinations on a real dataset. Conclusions To optimize the performance for both prediction and annotation accuracies, we conclude that the consensus of all methods (or a majority vote is the best for reads 400 bp and shorter, while using the intersection of GeneMark and Orphelia predictions is the best for reads 500 bp and longer. We demonstrate that most methods predict over 80% coding (including partially coding reads on a real human gut sample sequenced by Illumina technology.

  8. Identification and characterization of a novel serine-threonine kinase gene from the Xp22 region.

    Science.gov (United States)

    Montini, E; Andolfi, G; Caruso, A; Buchner, G; Walpole, S M; Mariani, M; Consalez, G; Trump, D; Ballabio, A; Franco, B

    1998-08-01

    Eukaryotic protein kinases are part of a large and expanding family of proteins. Through our transcriptional mapping effort in the Xp22 region, we have isolated and sequenced the full-length transcript of STK9, a novel cDNA highly homologous to serine-threonine kinases. A number of human genetic disorders have been mapped to the region where STK9 has been localized including Nance-Horan (NH) syndrome, oral-facial-digital syndrome type 1 (OFD1), and a novel locus for nonsyndromic sensorineural deafness (DFN6). To evaluate the possible involvement of STK9 in any of the above-mentioned disorders, a 2416-bp full-length cDNA was assembled. The entire genomic structure of the gene, which is composed of 20 coding exons, was determined. Northern analysis revealed a transcript larger than 9.5 kb in several tissues including brain, lung, and kidney. The mouse homologue (Stk9) was identified and mapped in the mouse in the region syntenic to human Xp. This location is compatible with the location of the Xcat mutant, which shows congenital cataracts very similar to those observed in NH patients. Sequence homologies, expression pattern, and mapping information in both human and mouse make STK9 a candidate gene for the above-mentioned disorders. Copyright 1998 Academic Press.

  9. The Asian Rice Gall Midge (Orseolia oryzae Mitogenome Has Evolved Novel Gene Boundaries and Tandem Repeats That Distinguish Its Biotypes.

    Directory of Open Access Journals (Sweden)

    Isha Atray

    Full Text Available The complete mitochondrial genome of the Asian rice gall midge, Orseolia oryzae (Diptera; Cecidomyiidae was sequenced, annotated and analysed in the present study. The circular genome is 15,286 bp with 13 protein-coding genes, 22 tRNAs and 2 ribosomal RNA genes, and a 578 bp non-coding control region. All protein coding genes used conventional start codons and terminated with a complete stop codon. The genome presented many unusual features: (1 rearrangement in the order of tRNAs as well as protein coding genes; (2 truncation and unusual secondary structures of tRNAs; (3 presence of two different repeat elements in separate non-coding regions; (4 presence of one pseudo-tRNA gene; (5 inversion of the rRNA genes; (6 higher percentage of non-coding regions when compared with other insect mitogenomes. Rearrangements of the tRNAs and protein coding genes are explained on the basis of tandem duplication and random loss model and why intramitochondrial recombination is a better model for explaining rearrangements in the O. oryzae mitochondrial genome is discussed. Furthermore, we evaluated the number of iterations of the tandem repeat elements found in the mitogenome. This led to the identification of genetic markers capable of differentiating rice gall midge biotypes and the two Orseolia species investigated.

  10. Gene prediction using the Self-Organizing Map: automatic generation of multiple gene models.

    Science.gov (United States)

    Mahony, Shaun; McInerney, James O; Smith, Terry J; Golden, Aaron

    2004-03-05

    Many current gene prediction methods use only one model to represent protein-coding regions in a genome, and so are less likely to predict the location of genes that have an atypical sequence composition. It is likely that future improvements in gene finding will involve the development of methods that can adequately deal with intra-genomic compositional variation. This work explores a new approach to gene-prediction, based on the Self-Organizing Map, which has the ability to automatically identify multiple gene models within a genome. The current implementation, named RescueNet, uses relative synonymous codon usage as the indicator of protein-coding potential. While its raw accuracy rate can be less than other methods, RescueNet consistently identifies some genes that other methods do not, and should therefore be of interest to gene-prediction software developers and genome annotation teams alike. RescueNet is recommended for use in conjunction with, or as a complement to, other gene prediction methods.

  11. Physical map location of the multicopy genes coding for ammonia monooxygenase and hydroxylamine oxidoreductase in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11.

    Science.gov (United States)

    Hirota, R; Yamagata, A; Kato, J; Kuroda, A; Ikeda, T; Takiguchi, N; Ohtake, H

    2000-02-01

    Pulsed-field gel electrophoresis of PmeI digests of the Nitrosomonas sp. strain ENI-11 chromosome produced four bands ranging from 1,200 to 480 kb in size. Southern hybridizations suggested that a 487-kb PmeI fragment contained two copies of the amoCAB genes, coding for ammonia monooxygenase (designated amoCAB(1) and amoCAB(2)), and three copies of the hao gene, coding for hydroxylamine oxidoreductase (hao(1), hao(2), and hao(3)). In this DNA fragment, amoCAB(1) and amoCAB(2) were about 390 kb apart, while hao(1), hao(2), and hao(3) were separated by at least about 100 kb from each other. Interestingly, hao(1) and hao(2) were located relatively close to amoCAB(1) and amoCAB(2), respectively. DNA sequence analysis revealed that hao(1) and hao(2) shared 160 identical nucleotides immediately upstream of each translation initiation codon. However, hao(3) showed only 30% nucleotide identity in the 160-bp corresponding region.

  12. Assessment of genetic mutations in the XRCC2 coding region by high resolution melting curve analysis and the risk of differentiated thyroid carcinoma in Iran

    Directory of Open Access Journals (Sweden)

    Shima Fayaz

    2012-01-01

    Full Text Available Homologous recombination (HR is the major pathway for repairing double strand breaks (DSBs in eukaryotes and XRCC2 is an essential component of the HR repair machinery. To evaluate the potential role of mutations in gene repair by HR in individuals susceptible to differentiated thyroid carcinoma (DTC we used high resolution melting (HRM analysis, a recently introduced method for detecting mutations, to examine the entire XRCC2 coding region in an Iranian population. HRM analysis was used to screen for mutations in three XRCC2 coding regions in 50 patients and 50 controls. There was no variation in the HRM curves obtained from the analysis of exons 1 and 2 in the case and control groups. In exon 3, an Arg188His polymorphism (rs3218536 was detected as a new melting curve group (OR: 1.46; 95%CI: 0.432-4.969; p = 0.38 compared with the normal melting curve. We also found a new Ser150Arg polymorphism in exon 3 of the control group. These findings suggest that genetic variations in the XRCC2 coding region have no potential effects on susceptibility to DTC. However, further studies with larger populations are required to confirm this conclusion.

  13. Cloning and expression of the coding regions of the heat shock proteins HSP10 and HSP16 from Piscirickettsia salmonis

    Directory of Open Access Journals (Sweden)

    VIVIAN WILHELM

    2003-01-01

    Full Text Available The genes encoding the heat shock proteins HSP10 and HSP16 of the salmon pathogen Piscirickettsia salmonis have been isolated and sequenced. The HSP10 coding sequence is located in an open reading frame of 291 base pairs encoding 96 aminoacids. The HSP16 coding region was isolated as a 471 base pair fragment encoding a protein of 156 aminoacids. The deduced aminoacid sequences of both proteins show a significant homology to the respective protein from other prokaryotic organisms. Both proteins were expressed in E. coli as fusion proteins with thioredoxin and purified by chromatography on Ni-column. A rabbit serum against P. salmonis total proteins reacts with the recombinant HSP10 and HSP16 proteins. Similar reactivity was determined by ELISA using serum from salmon infected with P. salmonis. The possibility of formulating a vaccine containing these two proteins is discussed

  14. Gene cluster statistics with gene families.

    Science.gov (United States)

    Raghupathy, Narayanan; Durand, Dannie

    2009-05-01

    Identifying genomic regions that descended from a common ancestor is important for understanding the function and evolution of genomes. In distantly related genomes, clusters of homologous gene pairs are evidence of candidate homologous regions. Demonstrating the statistical significance of such "gene clusters" is an essential component of comparative genomic analyses. However, currently there are no practical statistical tests for gene clusters that model the influence of the number of homologs in each gene family on cluster significance. In this work, we demonstrate empirically that failure to incorporate gene family size in gene cluster statistics results in overestimation of significance, leading to incorrect conclusions. We further present novel analytical methods for estimating gene cluster significance that take gene family size into account. Our methods do not require complete genome data and are suitable for testing individual clusters found in local regions, such as contigs in an unfinished assembly. We consider pairs of regions drawn from the same genome (paralogous clusters), as well as regions drawn from two different genomes (orthologous clusters). Determining cluster significance under general models of gene family size is computationally intractable. By assuming that all gene families are of equal size, we obtain analytical expressions that allow fast approximation of cluster probabilities. We evaluate the accuracy of this approximation by comparing the resulting gene cluster probabilities with cluster probabilities obtained by simulating a realistic, power-law distributed model of gene family size, with parameters inferred from genomic data. Surprisingly, despite the simplicity of the underlying assumption, our method accurately approximates the true cluster probabilities. It slightly overestimates these probabilities, yielding a conservative test. We present additional simulation results indicating the best choice of parameter values for data

  15. Natural selection on protein-coding genes in the human genome

    DEFF Research Database (Denmark)

    Bustamente, Carlos D.; Fledel-Alon, Adi; Williamson, Scott

    2005-01-01

    , showing an excess of deleterious variation within local populations 9, 10 . Here we contrast patterns of coding sequence polymorphism identified by direct sequencing of 39 humans for over 11,000 genes to divergence between humans and chimpanzees, and find strong evidence that natural selection has shaped......Comparisons of DNA polymorphism within species to divergence between species enables the discovery of molecular adaptation in evolutionarily constrained genes as well as the differentiation of weak from strong purifying selection 1, 2, 3, 4 . The extent to which weak negative and positive darwinian...... selection have driven the molecular evolution of different species varies greatly 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 , with some species, such as Drosophila melanogaster, showing strong evidence of pervasive positive selection 6, 7, 8, 9 , and others, such as the selfing weed Arabidopsis thaliana...

  16. ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data.

    Science.gov (United States)

    Zhou, Ke-Ren; Liu, Shun; Sun, Wen-Ju; Zheng, Ling-Ling; Zhou, Hui; Yang, Jian-Hua; Qu, Liang-Hu

    2017-01-04

    The abnormal transcriptional regulation of non-coding RNAs (ncRNAs) and protein-coding genes (PCGs) is contributed to various biological processes and linked with human diseases, but the underlying mechanisms remain elusive. In this study, we developed ChIPBase v2.0 (http://rna.sysu.edu.cn/chipbase/) to explore the transcriptional regulatory networks of ncRNAs and PCGs. ChIPBase v2.0 has been expanded with ∼10 200 curated ChIP-seq datasets, which represent about 20 times expansion when comparing to the previous released version. We identified thousands of binding motif matrices and their binding sites from ChIP-seq data of DNA-binding proteins and predicted millions of transcriptional regulatory relationships between transcription factors (TFs) and genes. We constructed 'Regulator' module to predict hundreds of TFs and histone modifications that were involved in or affected transcription of ncRNAs and PCGs. Moreover, we built a web-based tool, Co-Expression, to explore the co-expression patterns between DNA-binding proteins and various types of genes by integrating the gene expression profiles of ∼10 000 tumor samples and ∼9100 normal tissues and cell lines. ChIPBase also provides a ChIP-Function tool and a genome browser to predict functions of diverse genes and visualize various ChIP-seq data. This study will greatly expand our understanding of the transcriptional regulations of ncRNAs and PCGs. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Mechanosensitive promoter region in the human HB-GAM gene

    DEFF Research Database (Denmark)

    Liedert, Astrid; Kassem, Moustapha; Claes, Lutz

    2009-01-01

    Mechanical loading is essential for maintaining bone mass in the adult skeleton. However, the underlying process of the transfer of the physical stimulus into a biochemical response, which is termed mechanotransduction is poorly understood. Mechanotransduction results in the modulation of gene...... cells. Analysis of the human HB-GAM gene upstream regulatory region with luciferase reporter gene assays revealed that the upregulation of HB-GAM expression occurred at the transcriptional level and was mainly dependent on the HB-GAM promoter region most upstream containing three potential AP-1 binding...

  18. Evolutionary modeling and prediction of non-coding RNAs in Drosophila.

    Directory of Open Access Journals (Sweden)

    Robert K Bradley

    2009-08-01

    Full Text Available We performed benchmarks of phylogenetic grammar-based ncRNA gene prediction, experimenting with eight different models of structural evolution and two different programs for genome alignment. We evaluated our models using alignments of twelve Drosophila genomes. We find that ncRNA prediction performance can vary greatly between different gene predictors and subfamilies of ncRNA gene. Our estimates for false positive rates are based on simulations which preserve local islands of conservation; using these simulations, we predict a higher rate of false positives than previous computational ncRNA screens have reported. Using one of the tested prediction grammars, we provide an updated set of ncRNA predictions for D. melanogaster and compare them to previously-published predictions and experimental data. Many of our predictions show correlations with protein-coding genes. We found significant depletion of intergenic predictions near the 3' end of coding regions and furthermore depletion of predictions in the first intron of protein-coding genes. Some of our predictions are colocated with larger putative unannotated genes: for example, 17 of our predictions showing homology to the RFAM family snoR28 appear in a tandem array on the X chromosome; the 4.5 Kbp spanned by the predicted tandem array is contained within a FlyBase-annotated cDNA.

  19. RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

    Science.gov (United States)

    Grassi, Luigi; Leoni, Guido; Tramontano, Anna

    2015-07-14

    When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that produce or not a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.

  20. Drosophila polytene chromosome bands formed by gene introns.

    Science.gov (United States)

    Zhimulev, I F; Boldyreva, L V; Demakova, O V; Poholkova, G V; Khoroshko, V A; Zykova, T Yu; Lavrov, S A; Belyaeva, E S

    2016-01-01

    Genetic organization of bands and interbands in polytene chromosomes has long remained a puzzle for geneticists. It has been recently demonstrated that interbands typically correspond to the 5'-ends of house-keeping genes, whereas adjacent loose bands tend to be composed of coding sequences of the genes. In the present work, we made one important step further and mapped two large introns of ubiquitously active genes on the polytene chromosome map. We show that alternative promoter regions of these genes map to interbands, whereas introns and coding sequences found between those promoters correspond to loose grey bands. Thus, a gene having its long intron "sandwiched" between to alternative promoters and a common coding sequence may occupy two interbands and one band in the context of polytene chromosomes. Loose, partially decompacted bands appear to host large introns.

  1. Development of TIGER code for radionuclide transport in a geochemically evolving region

    International Nuclear Information System (INIS)

    Mihara, Morihiro; Ooi, Takao

    2004-01-01

    In a transuranic (TRU) waste geological disposal facility, using cementitious materials is being considered. Cementitious materials will gradually dissolve in groundwater over the long-term. In the performance assessment report of a TRU waste repository in Japan already published, the most conservative radionuclide migration parameter set was selected considering the evolving cementitious material. Therefore, a tool to perform the calculation of radionuclide transport considering long-term geochemically evolving cementitious materials, named the TIGER code, Transport In Geochemically Evolving Region was developed to calculate a more realistic performance assessment. It can calculate radionuclide transport in engineered and natural barrier systems. In this report, mathematical equations of this code are described and validated with analytical solutions and results of other codes for radionuclide transport. The more realistic calculation of radionuclide transport for a TRU waste geological disposal system using the TIGER code could be performed. (author)

  2. Regional and temporal variations in coding of hospital diagnoses referring to upper gastrointestinal and oesophageal bleeding in Germany

    Directory of Open Access Journals (Sweden)

    Garbe Edeltraut

    2011-08-01

    Full Text Available Abstract Background Health insurance claims data are increasingly used for health services research in Germany. Hospital diagnoses in these data are coded according to the International Classification of Diseases, German modification (ICD-10-GM. Due to the historical division into West and East Germany, different coding practices might persist in both former parts. Additionally, the introduction of Diagnosis Related Groups (DRGs in Germany in 2003/2004 might have changed the coding. The aim of this study was to investigate regional and temporal variations in coding of hospitalisation diagnoses in Germany. Methods We analysed hospitalisation diagnoses for oesophageal bleeding (OB and upper gastrointestinal bleeding (UGIB from the official German Hospital Statistics provided by the Federal Statistical Office. Bleeding diagnoses were classified as "specific" (origin of bleeding provided or "unspecific" (origin of bleeding not provided coding. We studied regional (former East versus West Germany differences in incidence of hospitalisations with specific or unspecific coding for OB and UGIB and temporal variations between 2000 and 2005. For each year, incidence ratios of hospitalisations for former East versus West Germany were estimated with log-linear regression models adjusting for age, gender and population density. Results Significant differences in specific and unspecific coding between East and West Germany and over time were found for both, OB and UGIB hospitalisation diagnoses, respectively. For example in 2002, incidence ratios of hospitalisations for East versus West Germany were 1.24 (95% CI 1.16-1.32 for specific and 0.67 (95% CI 0.60-0.74 for unspecific OB diagnoses and 1.43 (95% CI 1.36-1.51 for specific and 0.83 (95% CI 0.80-0.87 for unspecific UGIB. Regional differences nearly disappeared and time trends were less marked when using combined specific and unspecific diagnoses of OB or UGIB, respectively. Conclusions During the study

  3. Karyopherin-mediated nuclear import of the homing endonuclease VMA1-derived endonuclease is required for self-propagation of the coding region.

    Science.gov (United States)

    Nagai, Yuri; Nogami, Satoru; Kumagai-Sano, Fumi; Ohya, Yoshikazu

    2003-03-01

    VMA1-derived endonuclease (VDE), a site-specific endonuclease in Saccharomyces cerevisiae, enters the nucleus to generate a double-strand break in the VDE-negative allelic locus, mediating the self-propagating gene conversion called homing. Although VDE is excluded from the nucleus in mitotic cells, it relocalizes at premeiosis, becoming localized in both the nucleus and the cytoplasm in meiosis. The nuclear localization of VDE is induced by inactivation of TOR kinases, which constitute central regulators of cell differentiation in S. cerevisiae, and by nutrient depletion. A functional genomic approach revealed that at least two karyopherins, Srp1p and Kap142p, are required for the nuclear localization pattern. Genetic and physical interactions between Srp1p and VDE imply direct involvement of karyopherin-mediated nuclear transport in this process. Inactivation of TOR signaling or acquisition of an extra nuclear localization signal in the VDE coding region leads to artificial nuclear localization of VDE and thereby induces homing even during mitosis. These results serve as evidence that VDE utilizes the host systems of nutrient signal transduction and nucleocytoplasmic transport to ensure the propagation of its coding region.

  4. Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

    International Nuclear Information System (INIS)

    Yu Jia-Feng; Sui Tian-Xiang; Wang Ji-Hua; Wang Hong-Mei; Wang Chun-Ling; Jing Li

    2015-01-01

    Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. (special topic)

  5. A novel polymorphism in the coding region of the vasopressin type 2 receptor gene

    Directory of Open Access Journals (Sweden)

    J.L. Rocha

    1997-04-01

    Full Text Available Nephrogenic diabetes insipidus (NDI is a rare disease characterized by renal inability to respond properly to arginine vasopressin due to mutations in the vasopressin type 2 receptor (V2(R gene in affected kindreds. In most kindreds thus far reported, the mode of inheritance follows an X chromosome-linked recessive pattern although autosomal-dominant and autosomal-recessive modes of inheritance have also been described. Studies demonstrating mutations in the V2(R gene in affected kindreds that modify the receptor structure, resulting in a dys- or nonfunctional receptor have been described, but phenotypically indistinguishable NDI patients with a structurally normal V2(R gene have also been reported. In the present study, we analyzed exon 3 of the V2(R gene in 20 unrelated individuals by direct sequencing. A C®T alteration in the third position of codon 331 (AGC®AGT, which did not alter the encoded amino acid, was found in nine individuals, including two unrelated patients with NDI. Taken together, these observations emphasize the molecular heterogeneity of a phenotypically homogeneous syndrome

  6. Investigation of QTL regions on Chromosome 17 for genes associated with meat color in the pig.

    Science.gov (United States)

    Fan, B; Glenn, K L; Geiger, B; Mileham, A; Rothschild, M F

    2008-08-01

    Previous studies have uncovered several significant quantitative trait loci (QTL) relevant to meat colour traits mapped at the end of SSC17 in the pig. Furthermore, results released from the porcine genome sequencing project have identified genes underlying the entire QTL regions and can further contribute to mining the region for likely causative genes. Ten protein coding genes or novel transcripts located within the QTL regions were screened for single nucleotide polymorphisms (SNPs). Linkage mapping and association studies were carried out in the ISU Berkshire x Yorkshire (B x Y) pig resource family. The total length of the new SSC17 linkage map was 126.6 cM and additional markers including endothelin 3 (EDN3) and phosphatase and actin regulator 3 (PHACTR3) genes were assigned at positions 119.4 cM and 122.9 cM, respectively. A new QTL peak was noted at approximately 120 cM, close to the EDN3 gene, and for some colour traits QTL exceeded the 5% chromosome-wise significance threshold. The association analyses in the B x Y family showed that the EDN3 BslI and PHACTR3 PstI polymorphisms were strongly associated with the subjective colour score and objective colour reflectance measures in the loin, as well as average drip loss percentage and pH value. The RNPC1 DpnII and CTCFL HpyCH4III polymorphisms were associated with some meat colour traits. No significant association between CBLN4, TFAP2C, and four novel transcripts and meat colour traits were detected. The association analyses conducted in one commercial pig line found that both EDN3 BslI and PHACTR3 PstI polymorphisms were associated with meat colour reflectance traits such as centre loin hue angle and Minolta Lightness score. The present findings suggested that the EDN3 and PHACTR3 genes might have potential effects on meat colour in pigs, and molecular mechanisms of their functions are worth exploring.

  7. MARG1D: One dimensional outer region matching data code

    International Nuclear Information System (INIS)

    Tokuda, Shinji; Watanabe, Tomoko.

    1995-08-01

    A code MARG1D has been developed which computes outer region matching data of the one dimensional Newcomb equation. Matching data play an important role in the resistive (and non ideal) Magneto-hydrodynamic (MHD) stability analysis in a tokamak plasma. The MARG1D code computes matching data by using the boundary value method or by the eigenvalue method. Variational principles are derived for the problems to be solved and a finite element method is applied. Except for the case of marginal stability, the eigenvalue method is equivalent to the boundary value method. However, the eigenvalue method has the several advantages: it is a new method of ideal MHD stability analysis for which the marginally stable state can be identified, and it guarantees numerical stability in computing matching data close to marginal stability. We perform detailed numerical experiments for a model equation with analytical solutions and for the Newcomb equation in the m=1 mode theory. Numerical experiments show that MARG1D code gives the matching data with numerical stability and high accuracy. (author)

  8. Novel methods for the molecular discrimination of Fasciola spp. on the basis of nuclear protein-coding genes.

    Science.gov (United States)

    Shoriki, Takuya; Ichikawa-Seki, Madoka; Suganuma, Keisuke; Naito, Ikunori; Hayashi, Kei; Nakao, Minoru; Aita, Junya; Mohanta, Uday Kumar; Inoue, Noboru; Murakami, Kenji; Itagaki, Tadashi

    2016-06-01

    Fasciolosis is an economically important disease of livestock caused by Fasciola hepatica, Fasciola gigantica, and aspermic Fasciola flukes. The aspermic Fasciola flukes have been discriminated morphologically from the two other species by the absence of sperm in their seminal vesicles. To date, the molecular discrimination of F. hepatica and F. gigantica has relied on the nucleotide sequences of the internal transcribed spacer 1 (ITS1) region. However, ITS1 genotypes of aspermic Fasciola flukes cannot be clearly differentiated from those of F. hepatica and F. gigantica. Therefore, more precise and robust methods are required to discriminate Fasciola spp. In this study, we developed PCR restriction fragment length polymorphism and multiplex PCR methods to discriminate F. hepatica, F. gigantica, and aspermic Fasciola flukes on the basis of the nuclear protein-coding genes, phosphoenolpyruvate carboxykinase and DNA polymerase delta, which are single locus genes in most eukaryotes. All aspermic Fasciola flukes used in this study had mixed fragment pattern of F. hepatica and F. gigantica for both of these genes, suggesting that the flukes are descended through hybridization between the two species. These molecular methods will facilitate the identification of F. hepatica, F. gigantica, and aspermic Fasciola flukes, and will also prove useful in etiological studies of fasciolosis. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  9. Arabidopsis RNASE THREE LIKE2 Modulates the Expression of Protein-Coding Genes via 24-Nucleotide Small Interfering RNA-Directed DNA Methylation.

    Science.gov (United States)

    Elvira-Matelot, Emilie; Hachet, Mélanie; Shamandi, Nahid; Comella, Pascale; Sáez-Vásquez, Julio; Zytnicki, Matthias; Vaucheret, Hervé

    2016-02-01

    RNaseIII enzymes catalyze the cleavage of double-stranded RNA (dsRNA) and have diverse functions in RNA maturation. Arabidopsis thaliana RNASE THREE LIKE2 (RTL2), which carries one RNaseIII and two dsRNA binding (DRB) domains, is a unique Arabidopsis RNaseIII enzyme resembling the budding yeast small interfering RNA (siRNA)-producing Dcr1 enzyme. Here, we show that RTL2 modulates the production of a subset of small RNAs and that this activity depends on both its RNaseIII and DRB domains. However, the mode of action of RTL2 differs from that of Dcr1. Whereas Dcr1 directly cleaves dsRNAs into 23-nucleotide siRNAs, RTL2 likely cleaves dsRNAs into longer molecules, which are subsequently processed into small RNAs by the DICER-LIKE enzymes. Depending on the dsRNA considered, RTL2-mediated maturation either improves (RTL2-dependent loci) or reduces (RTL2-sensitive loci) the production of small RNAs. Because the vast majority of RTL2-regulated loci correspond to transposons and intergenic regions producing 24-nucleotide siRNAs that guide DNA methylation, RTL2 depletion modifies DNA methylation in these regions. Nevertheless, 13% of RTL2-regulated loci correspond to protein-coding genes. We show that changes in 24-nucleotide siRNA levels also affect DNA methylation levels at such loci and inversely correlate with mRNA steady state levels, thus implicating RTL2 in the regulation of protein-coding gene expression. © 2016 American Society of Plant Biologists. All rights reserved.

  10. MICROX-2: an improved two-region flux spectrum code for the efficient calculation of group cross sections

    International Nuclear Information System (INIS)

    Mathews, D.; Koch, P.

    1979-12-01

    The MICROX-2 code is an improved version of the MICROX code. The improvements allow MICROX-2 to be used for the efficient and rigorous preparation of broad group neutron cross sections for poorly moderated systems such as fast breeder reactors in addition to the well moderated thermal reactors for which MICROX was designed. MICROX-2 is an integral transport theory code which solves the neutron slowing down and thermalization equations on a detailed energy grid for two-region lattice cells. The fluxes in the two regions are coupled by transport corrected collision probabilities. The inner region may include two different types of grains (particles). Neutron leakage effects are treated by performing B 1 slowing down and P 0 plus DB 2 thermalization calculations in each region. Cell averaged diffusion coefficients are prepared with the Benoist cell homogenization prescription

  11. A HYDROCHEMICAL HYBRID CODE FOR ASTROPHYSICAL PROBLEMS. I. CODE VERIFICATION AND BENCHMARKS FOR A PHOTON-DOMINATED REGION (PDR)

    International Nuclear Information System (INIS)

    Motoyama, Kazutaka; Morata, Oscar; Hasegawa, Tatsuhiko; Shang, Hsien; Krasnopolsky, Ruben

    2015-01-01

    A two-dimensional hydrochemical hybrid code, KM2, is constructed to deal with astrophysical problems that would require coupled hydrodynamical and chemical evolution. The code assumes axisymmetry in a cylindrical coordinate system and consists of two modules: a hydrodynamics module and a chemistry module. The hydrodynamics module solves hydrodynamics using a Godunov-type finite volume scheme and treats included chemical species as passively advected scalars. The chemistry module implicitly solves nonequilibrium chemistry and change of energy due to thermal processes with transfer of external ultraviolet radiation. Self-shielding effects on photodissociation of CO and H 2 are included. In this introductory paper, the adopted numerical method is presented, along with code verifications using the hydrodynamics module and a benchmark on the chemistry module with reactions specific to a photon-dominated region (PDR). Finally, as an example of the expected capability, the hydrochemical evolution of a PDR is presented based on the PDR benchmark

  12. A Common histone modification code on C4 genes in maize and its conservation in Sorghum and Setaria italica.

    Science.gov (United States)

    Heimann, Louisa; Horst, Ina; Perduns, Renke; Dreesen, Björn; Offermann, Sascha; Peterhansel, Christoph

    2013-05-01

    C4 photosynthesis evolved more than 60 times independently in different plant lineages. Each time, multiple genes were recruited into C4 metabolism. The corresponding promoters acquired new regulatory features such as high expression, light induction, or cell type-specific expression in mesophyll or bundle sheath cells. We have previously shown that histone modifications contribute to the regulation of the model C4 phosphoenolpyruvate carboxylase (C4-Pepc) promoter in maize (Zea mays). We here tested the light- and cell type-specific responses of three selected histone acetylations and two histone methylations on five additional C4 genes (C4-Ca, C4-Ppdk, C4-Me, C4-Pepck, and C4-RbcS2) in maize. Histone acetylation and nucleosome occupancy assays indicated extended promoter regions with regulatory upstream regions more than 1,000 bp from the transcription initiation site for most of these genes. Despite any detectable homology of the promoters on the primary sequence level, histone modification patterns were highly coregulated. Specifically, H3K9ac was regulated by illumination, whereas H3K4me3 was regulated in a cell type-specific manner. We further compared histone modifications on the C4-Pepc and C4-Me genes from maize and the homologous genes from sorghum (Sorghum bicolor) and Setaria italica. Whereas sorghum and maize share a common C4 origin, C4 metabolism evolved independently in S. italica. The distribution of histone modifications over the promoters differed between the species, but differential regulation of light-induced histone acetylation and cell type-specific histone methylation were evident in all three species. We propose that a preexisting histone code was recruited into C4 promoter control during the evolution of C4 metabolism.

  13. Bioinformatics analysis identify novel OB fold protein coding genes in C. elegans.

    Directory of Open Access Journals (Sweden)

    Daryanaz Dargahi

    Full Text Available BACKGROUND: The C. elegans genome has been extensively annotated by the WormBase consortium that uses state of the art bioinformatics pipelines, functional genomics and manual curation approaches. As a result, the identification of novel genes in silico in this model organism is becoming more challenging requiring new approaches. The Oligonucleotide-oligosaccharide binding (OB fold is a highly divergent protein family, in which protein sequences, in spite of having the same fold, share very little sequence identity (5-25%. Therefore, evidence from sequence-based annotation may not be sufficient to identify all the members of this family. In C. elegans, the number of OB-fold proteins reported is remarkably low (n=46 compared to other evolutionary-related eukaryotes, such as yeast S. cerevisiae (n=344 or fruit fly D. melanogaster (n=84. Gene loss during evolution or differences in the level of annotation for this protein family, may explain these discrepancies. METHODOLOGY/PRINCIPAL FINDINGS: This study examines the possibility that novel OB-fold coding genes exist in the worm. We developed a bioinformatics approach that uses the most sensitive sequence-sequence, sequence-profile and profile-profile similarity search methods followed by 3D-structure prediction as a filtering step to eliminate false positive candidate sequences. We have predicted 18 coding genes containing the OB-fold that have remarkably partially been characterized in C. elegans. CONCLUSIONS/SIGNIFICANCE: This study raises the possibility that the annotation of highly divergent protein fold families can be improved in C. elegans. Similar strategies could be implemented for large scale analysis by the WormBase consortium when novel versions of the genome sequence of C. elegans, or other evolutionary related species are being released. This approach is of general interest to the scientific community since it can be used to annotate any genome.

  14. Regional differences in gene expression and promoter usage in aged human brains

    KAUST Repository

    Pardo, Luba M.

    2013-02-19

    To characterize the promoterome of caudate and putamen regions (striatum), frontal and temporal cortices, and hippocampi from aged human brains, we used high-throughput cap analysis of gene expression to profile the transcription start sites and to quantify the differences in gene expression across the 5 brain regions. We also analyzed the extent to which methylation influenced the observed expression profiles. We sequenced more than 71 million cap analysis of gene expression tags corresponding to 70,202 promoter regions and 16,888 genes. More than 7000 transcripts were differentially expressed, mainly because of differential alternative promoter usage. Unexpectedly, 7% of differentially expressed genes were neurodevelopmental transcription factors. Functional pathway analysis on the differentially expressed genes revealed an overrepresentation of several signaling pathways (e.g., fibroblast growth factor and wnt signaling) in hippocampus and striatum. We also found that although 73% of methylation signals mapped within genes, the influence of methylation on the expression profile was small. Our study underscores alternative promoter usage as an important mechanism for determining the regional differences in gene expression at old age.

  15. Using the NCBI Genome Databases to Compare the Genes for Human & Chimpanzee Beta Hemoglobin

    Science.gov (United States)

    Offner, Susan

    2010-01-01

    The beta hemoglobin protein is identical in humans and chimpanzees. In this tutorial, students see that even though the proteins are identical, the genes that code for them are not. There are many more differences in the introns than in the exons, which indicates that coding regions of DNA are more highly conserved than non-coding regions.

  16. Have we found an optimal insertion site in a Newcastle disease virus vector to express a foreign gene for vaccine and gene therapy purposes?

    Science.gov (United States)

    Using reverse genetics technology, many strains of Newcastle disease virus (NDV) have been developed as vectors to express foreign genes for vaccine and gene therapy purposes. The foreign gene is usually inserted into a non-coding region of the NDV genome as an independent transcription unit. Eval...

  17. Mapping the transcription termination region of the mouse immunoglobulin kappa gene

    International Nuclear Information System (INIS)

    Xu, M.; Garrard, W.T.

    1986-01-01

    To define the transcription termination region of the mouse immunoglobulin kappa gene, they have subcloned single copy DNA sequences corresponding to both the template and the non-template strands of this locus. In vitro nuclear transcription with isolated MPC-11 nuclei was performed and the resulting 32 P-labeled RNA was hybridized to slot-blotted, single-stranded M13 probes covering regions within and flanking the kappa gene. The hybridization pattern for the template-strand reveals that transcription terminates within the region between 1.1 to 2.3 kb downstream from the poly(A) site. Ten different short sequences (8-13 bp) reside within 460 bp of this region that exhibit homology with sequences found in the termination regions of mouse β-globin and chicken ovalbumin genes. Transcription of the non-template strand occurs on either side of this termination region. They note that no transcription is detectable on the non-template strand downstream of the enhancer, indicating that if RNA polymerase II enters at this site, it does not initiate transcription during transit to the promoter region. They conclude that transcription of the kappa gene passes the poly(A) addition site and terminates within 2.3 Kb downstream

  18. Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

    Science.gov (United States)

    Roux, Julien; Liu, Jialin; Robinson-Rechavi, Marc

    2017-11-01

    The evolutionary history of vertebrates is marked by three ancient whole-genome duplications: two successive rounds in the ancestor of vertebrates, and a third one specific to teleost fishes. Biased loss of most duplicates enriched the genome for specific genes, such as slow evolving genes, but this selective retention process is not well understood. To understand what drives the long-term preservation of duplicate genes, we characterized duplicated genes in terms of their expression patterns. We used a new method of expression enrichment analysis, TopAnat, applied to in situ hybridization data from thousands of genes from zebrafish and mouse. We showed that the presence of expression in the nervous system is a good predictor of a higher rate of retention of duplicate genes after whole-genome duplication. Further analyses suggest that purifying selection against the toxic effects of misfolded or misinteracting proteins, which is particularly strong in nonrenewing neural tissues, likely constrains the evolution of coding sequences of nervous system genes, leading indirectly to the preservation of duplicate genes after whole-genome duplication. Whole-genome duplications thus greatly contributed to the expansion of the toolkit of genes available for the evolution of profound novelties of the nervous system at the base of the vertebrate radiation. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Targeted deep resequencing identifies coding variants in the PEAR1 gene that play a role in platelet aggregation.

    Directory of Open Access Journals (Sweden)

    Yoonhee Kim

    Full Text Available Platelet aggregation is heritable, and genome-wide association studies have detected strong associations with a common intronic variant of the platelet endothelial aggregation receptor1 (PEAR1 gene both in African American and European American individuals. In this study, we used a sequencing approach to identify additional exonic variants in PEAR1 that may also determine variability in platelet aggregation in the GeneSTAR Study. A 0.3 Mb targeted region on chromosome 1q23.1 including the entire PEAR1 gene was Sanger sequenced in 104 subjects (45% male, 49% African American, age = 52±13 selected on the basis of hyper- and hypo- aggregation across three different agonists (collagen, epinephrine, and adenosine diphosphate. Single-variant and multi-variant burden tests for association were performed. Of the 235 variants identified through sequencing, 61 were novel, and three of these were missense variants. More rare variants (MAF<5% were noted in African Americans compared to European Americans (108 vs. 45. The common intronic GWAS-identified variant (rs12041331 demonstrated the most significant association signal in African Americans (p = 4.020×10(-4; no association was seen for additional exonic variants in this group. In contrast, multi-variant burden tests indicated that exonic variants play a more significant role in European Americans (p = 0.0099 for the collective coding variants compared to p = 0.0565 for intronic variant rs12041331. Imputation of the individual exonic variants in the rest of the GeneSTAR European American cohort (N = 1,965 supports the results noted in the sequenced discovery sample: p = 3.56×10(-4, 2.27×10(-7, 5.20×10(-5 for coding synonymous variant rs56260937 and collagen, epinephrine and adenosine diphosphate induced platelet aggregation, respectively. Sequencing approaches confirm that a common intronic variant has the strongest association with platelet aggregation in African Americans

  20. Integrating Diverse Types of Genomic Data to Identify Genes that Underlie Adverse Pregnancy Phenotypes.

    Directory of Open Access Journals (Sweden)

    Jibril Hirbo

    Full Text Available Progress in understanding complex genetic diseases has been bolstered by synthetic approaches that overlay diverse data types and analyses to identify functionally important genes. Pre-term birth (PTB, a major complication of pregnancy, is a leading cause of infant mortality worldwide. A major obstacle in addressing PTB is that the mechanisms controlling parturition and birth timing remain poorly understood. Integrative approaches that overlay datasets derived from comparative genomics with function-derived ones have potential to advance our understanding of the genetics of birth timing, and thus provide insights into the genes that may contribute to PTB. We intersected data from fast evolving coding and non-coding gene regions in the human and primate lineage with data from genes expressed in the placenta, from genes that show enriched expression only in the placenta, as well as from genes that are differentially expressed in four distinct PTB clinical subtypes. A large fraction of genes that are expressed in placenta, and differentially expressed in PTB clinical subtypes (23-34% are fast evolving, and are associated with functions that include adhesion neurodevelopmental and immune processes. Functional categories of genes that express fast evolution in coding regions differ from those linked to fast evolution in non-coding regions. Finally, there is a surprising lack of overlap between fast evolving genes that are differentially expressed in four PTB clinical subtypes. Integrative approaches, especially those that incorporate evolutionary perspectives, can be successful in identifying potential genetic contributions to complex genetic diseases, such as PTB.

  1. Computational prediction of over-annotated protein-coding genes in the genome of Agrobacterium tumefaciens strain C58

    Science.gov (United States)

    Yu, Jia-Feng; Sui, Tian-Xiang; Wang, Hong-Mei; Wang, Chun-Ling; Jing, Li; Wang, Ji-Hua

    2015-12-01

    Agrobacterium tumefaciens strain C58 is a type of pathogen that can cause tumors in some dicotyledonous plants. Ever since the genome of A. tumefaciens strain C58 was sequenced, the quality of annotation of its protein-coding genes has been queried continually, because the annotation varies greatly among different databases. In this paper, the questionable hypothetical genes were re-predicted by integrating the TN curve and Z curve methods. As a result, 30 genes originally annotated as “hypothetical” were discriminated as being non-coding sequences. By testing the re-prediction program 10 times on data sets composed of the function-known genes, the mean accuracy of 99.99% and mean Matthews correlation coefficient value of 0.9999 were obtained. Further sequence analysis and COG analysis showed that the re-annotation results were very reliable. This work can provide an efficient tool and data resources for future studies of A. tumefaciens strain C58. Project supported by the National Natural Science Foundation of China (Grant Nos. 61302186 and 61271378) and the Funding from the State Key Laboratory of Bioelectronics of Southeast University.

  2. The structure of the human interferon alpha/beta receptor gene.

    Science.gov (United States)

    Lutfalla, G; Gardiner, K; Proudhon, D; Vielh, E; Uzé, G

    1992-02-05

    Using the cDNA coding for the human interferon alpha/beta receptor (IFNAR), the IFNAR gene has been physically mapped relative to the other loci of the chromosome 21q22.1 region. 32,906 base pairs covering the IFNAR gene have been cloned and sequenced. Primer extension and solution hybridization-ribonuclease protection have been used to determine that the transcription of the gene is initiated in a broad region of 20 base pairs. Some aspects of the polymorphism of the gene, including noncoding sequences, have been analyzed; some are allelic differences in the coding sequence that induce amino acid variations in the resulting protein. The exon structure of the IFNAR gene and of that of the available genes for the receptors of the cytokine/growth hormone/prolactin/interferon receptor family have been compared with the predictions for the secondary structure of those receptors. From this analysis, we postulate a common origin and propose an hypothesis for the divergence from the immunoglobulin superfamily.

  3. Transcriptomic Analysis of Long Non-Coding RNAs and Coding Genes Uncovers a Complex Regulatory Network That Is Involved in Maize Seed Development

    Directory of Open Access Journals (Sweden)

    Ming Zhu

    2017-10-01

    Full Text Available Long non-coding RNAs (lncRNAs have been reported to be involved in the development of maize plant. However, few focused on seed development of maize. Here, we identified 753 lncRNA candidates in maize genome from six seed samples. Similar to the mRNAs, lncRNAs showed tissue developmental stage specific and differential expression, indicating their putative role in seed development. Increasing evidence shows that crosstalk among RNAs mediated by shared microRNAs (miRNAs represents a novel layer of gene regulation, which plays important roles in plant development. Functional roles and regulatory mechanisms of lncRNAs as competing endogenous RNAs (ceRNA in plants, particularly in maize seed development, are unclear. We combined analyses of consistently altered 17 lncRNAs, 840 mRNAs and known miRNA to genome-wide investigate potential lncRNA-mediated ceRNA based on “ceRNA hypothesis”. The results uncovered seven novel lncRNAs as potential functional ceRNAs. Functional analyses based on their competitive coding-gene partners by Gene Ontology (GO and KEGG biological pathway demonstrated that combined effects of multiple ceRNAs can have major impacts on general developmental and metabolic processes in maize seed. These findings provided a useful platform for uncovering novel mechanisms of maize seed development and may provide opportunities for the functional characterization of individual lncRNA in future studies.

  4. Characterization of human cardiac myosin heavy chain genes

    International Nuclear Information System (INIS)

    Yamauchi-Takihara, K.; Sole, M.J.; Liew, J.; Ing, D.; Liew, C.C.

    1989-01-01

    The authors have isolated and analyzed the structure of the genes coding for the α and β forms of the human cardiac myosin heavy chain (MYHC). Detailed analysis of four overlapping MYHC genomic clones shows that the α-MYHC and β-MYHC genes constitute a total length of 51 kilobases and are tandemly linked. The β-MYHC-encoding gene, predominantly expressed in the normal human ventricle and also in slow-twitch skeletal muscle, is located 4.5 kilobases upstream of the α-MYHC-encoding gene, which is predominantly expressed in normal human atrium. The authors have determined the nucleotide sequences of the β form of the MYHC gene, which is 100% homologous to the cardiac MYHC cDNA clone (pHMC3). It is unlikely that the divergence of a few nucleotide sequences from the cardiac β-MYHC cDNA clone (pHMC3) reported in a MYHC cDNA clone (PSMHCZ) from skeletal muscle is due to a splicing mechanism. This finding suggests that the same β form of the cardiac MYHC gene is expressed in both ventricular and slow-twitch skeletal muscle. The promoter regions of both α- and β-MYHC genes, as well as the first four coding regions in the respective genes, have also been sequenced. The sequences in the 5'-flanking region of the α- and β-MYHC-encoding genes diverge extensively from one another, suggesting that expression of the α- and β-MYHC genes is independently regulated

  5. CAR gene cluster and transcript levels of carotenogenic genes in Rhodotorula mucilaginosa.

    Science.gov (United States)

    Landolfo, Sara; Ianiri, Giuseppe; Camiolo, Salvatore; Porceddu, Andrea; Mulas, Giuliana; Chessa, Rossella; Zara, Giacomo; Mannazzu, Ilaria

    2018-01-01

    A molecular approach was applied to the study of the carotenoid biosynthetic pathway of Rhodotorula mucilaginosa. At first, functional annotation of the genome of R. mucilaginosa C2.5t1 was carried out and gene ontology categories were assigned to 4033 predicted proteins. Then, a set of genes involved in different steps of carotenogenesis was identified and those coding for phytoene desaturase, phytoene synthase/lycopene cyclase and carotenoid dioxygenase (CAR genes) proved to be clustered within a region of ~10 kb. Quantitative PCR of the genes involved in carotenoid biosynthesis showed that genes coding for 3-hydroxy-3-methylglutharyl-CoA reductase and mevalonate kinase are induced during exponential phase while no clear trend of induction was observed for phytoene synthase/lycopene cyclase and phytoene dehydrogenase encoding genes. Thus, in R. mucilaginosa the induction of genes involved in the early steps of carotenoid biosynthesis is transient and accompanies the onset of carotenoid production, while that of CAR genes does not correlate with the amount of carotenoids produced. The transcript levels of genes coding for carotenoid dioxygenase, superoxide dismutase and catalase A increased during the accumulation of carotenoids, thus suggesting the activation of a mechanism aimed at the protection of cell structures from oxidative stress during carotenoid biosynthesis. The data presented herein, besides being suitable for the elucidation of the mechanisms that underlie carotenoid biosynthesis, will contribute to boosting the biotechnological potential of this yeast by improving the outcome of further research efforts aimed at also exploring other features of interest.

  6. The Drosophila gene CG9918 codes for a pyrokinin-1 receptor

    DEFF Research Database (Denmark)

    Cazzamali, Giuseppe; Torp, Malene; Hauser, Frank

    2005-01-01

    The database from the Drosophila Genome Project contains a gene, CG9918, annotated to code for a G protein-coupled receptor. We cloned the cDNA of this gene and functionally expressed it in Chinese hamster ovary cells. We tested a library of about 25 Drosophila and other insect neuropeptides......, and seven insect biogenic amines on the expressed receptor and found that it was activated by low concentrations of the Drosophila neuropeptide, pyrokinin-1 (TGPSASSGLWFGPRLamide; EC50, 5 x 10(-8) M). The receptor was also activated by other Drosophila neuropeptides, terminating with the sequence PRLamide...... (Hug-gamma, ecdysis-triggering-hormone-1, pyrokinin-2), but in these cases about six to eight times higher concentrations were needed. The receptor was not activated by Drosophila neuropeptides, containing a C-terminal PRIamide sequence (such as ecdysis-triggering-hormone-2), or PRVamide (such as capa...

  7. nocoRNAc: Characterization of non-coding RNAs in prokaryotes

    Directory of Open Access Journals (Sweden)

    Nieselt Kay

    2011-01-01

    Full Text Available Abstract Background The interest in non-coding RNAs (ncRNAs constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not. Results We present NOCORNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. NOCORNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and NOCORNAc to the genome of Streptomyces coelicolor and detected more than 800 putative ncRNA transcripts most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are similarly complex as those of the protein-coding genes, in particular many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions We have developed NOCORNAc, a framework that facilitates the automated characterization of functional ncRNAs. NOCORNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. NOCORNAc is not restricted to

  8. Comprehensive analysis of coding-lncRNA gene co-expression network uncovers conserved functional lncRNAs in zebrafish.

    Science.gov (United States)

    Chen, Wen; Zhang, Xuan; Li, Jing; Huang, Shulan; Xiang, Shuanglin; Hu, Xiang; Liu, Changning

    2018-05-09

    Zebrafish is a full-developed model system for studying development processes and human disease. Recent studies of deep sequencing had discovered a large number of long non-coding RNAs (lncRNAs) in zebrafish. However, only few of them had been functionally characterized. Therefore, how to take advantage of the mature zebrafish system to deeply investigate the lncRNAs' function and conservation is really intriguing. We systematically collected and analyzed a series of zebrafish RNA-seq data, then combined them with resources from known database and literatures. As a result, we obtained by far the most complete dataset of zebrafish lncRNAs, containing 13,604 lncRNA genes (21,128 transcripts) in total. Based on that, a co-expression network upon zebrafish coding and lncRNA genes was constructed and analyzed, and used to predict the Gene Ontology (GO) and the KEGG annotation of lncRNA. Meanwhile, we made a conservation analysis on zebrafish lncRNA, identifying 1828 conserved zebrafish lncRNA genes (1890 transcripts) that have their putative mammalian orthologs. We also found that zebrafish lncRNAs play important roles in regulation of the development and function of nervous system; these conserved lncRNAs present a significant sequential and functional conservation, with their mammalian counterparts. By integrative data analysis and construction of coding-lncRNA gene co-expression network, we gained the most comprehensive dataset of zebrafish lncRNAs up to present, as well as their systematic annotations and comprehensive analyses on function and conservation. Our study provides a reliable zebrafish-based platform to deeply explore lncRNA function and mechanism, as well as the lncRNA commonality between zebrafish and human.

  9. Genome-wide occupancy profile of mediator and the Srb8-11 module reveals interactions with coding regions

    DEFF Research Database (Denmark)

    Zhu, Xuefeng; Wirén, Marianna; Sinha, Indranil

    2006-01-01

    Mediator exists in a free form containing the Med12, Med13, CDK8, and CycC subunits (the Srb8-11 module) and a smaller form, which lacks these four subunits and associates with RNA polymerase II (Pol II), forming a holoenzyme. We use chromatin immunoprecipitation (ChIP) and DNA microarrays...... to investigate genome-wide localization of Mediator and the Srb8-11 module in fission yeast. Mediator and the Srb8-11 module display similar binding patterns, and interactions with promoters and upstream activating sequences correlate with increased transcription activity. Unexpectedly, Mediator also interacts...... with the downstream coding region of many genes. These interactions display a negative bias for positions closer to the 5' ends of open reading frames (ORFs) and appear functionally important, because downregulation of transcription in a temperature-sensitive med17 mutant strain correlates with increased Mediator...

  10. The water-borne protein signals (pheromones) of the Antarctic ciliated protozoan Euplotes nobilii: structure of the gene coding for the En-6 pheromone.

    Science.gov (United States)

    La Terza, Antonietta; Dobri, Nicoleta; Alimenti, Claudio; Vallesi, Adriana; Luporini, Pierangelo

    2009-01-01

    The marine Antarctic ciliate, Euplotes nobilii, secretes a family of water-borne signal proteins, denoted as pheromones, which control vegetative proliferation and mating in the cell. Based on the knowledge of the amino acid sequences of a set of these pheromones isolated from the culture supernatant of wild-type strains, we designed probes to identify their encoding genes in the cell somatic nucleus (macronucleus). The full-length gene of the pheromone En-6 was determined and found to contain an open-reading frame specific for the synthesis of the En-6 cytoplasmic precursor (pre-pro-En-6), which requires 2 proteolytic cleavages to remove the signal peptide (pre) and the prosegment before secretion of the mature protein. In contrast to the sequence variability that distinguishes the secreted pheromones, the pre- and pro-sequences appear to be tightly conserved and useful for the construction of probes to clone every other E. nobilii pheromone gene. Potential intron sequences in the coding region of the En-6 gene imply the synthesis of more En-6 isoforms.

  11. Transmissible familial Creutzfeldt-Jakob disease associated with five, seven, and eight extra octapeptide coding repeats in the PRNP gene

    Energy Technology Data Exchange (ETDEWEB)

    Goldfarb, L.G.; Brown, P.; McCombie, W.R.; Gibbs, C.J. Jr.; Gajdusek, D.C. (National Inst. of Health, Bethesda, MD (United States)); Goldgaber, D. (State Univ. of New York, Stony Brook (United States)); Swergold, G.D. (National Inst. of Health, Bethesda, MD (United States)); Wills, P.R. (Univ. of Auckland (New Zealand)); Cervenakova, L. (Inst. of Preventive and Clinical Medicine, Bratislava (Czechoslovakia)); Baron, H. (Searle Pharmaceuticals, Paris (France))

    1991-12-01

    The PRNP gene, encoding the amyloid precursor protein that is centrally involved in Creutzfeldt-Jakob disease (CJD), has an unstable region of five variant tandem octapeptide coding repeats between codons 51 and 91. The authors screened a total of 535 individuals for the presence of extra repeats in this region, including patients with sporadic and familial forms of spongiform encephalopathy, members of their families, other neurological and non-neurological patients, and normal controls. They identified three CJD families (in each of which the proband's disease was neuropathologically confirmed and experimentally transmitted to primates) that were heterozygous for alleles with 10, 12, or 13 repeats, some of which had wobble nucleotide substitutions. They also found one individual with 9 repeats and no nucleotide substitutions who had no evidence of neurological disease. These observations, together with data on published British patients with 11 and 14 repeats, strongly suggest that the occurrence of 10 or more octapeptide repeats in the encoded amyloid precursor protein predisposes to CJD.

  12. Do prion protein gene polymorphisms induce apoptosis in non ...

    Indian Academy of Sciences (India)

    2016-08-26

    Aug 26, 2016 ... Genetic variations such as single nucleotide polymorphisms (SNPs) in prion protein coding gene, Prnp, greatly affect susceptibility to prion diseases in mammals. Here, the coding region of Prnp was screened for polymorphisms in redeared turtle, Trachemys scripta. Four polymorphisms, L203V, N205I, ...

  13. Dual CRISPR-Cas9 Cleavage Mediated Gene Excision and Targeted Integration in Yarrowia lipolytica.

    Science.gov (United States)

    Gao, Difeng; Smith, Spencer; Spagnuolo, Michael; Rodriguez, Gabriel; Blenner, Mark

    2018-05-29

    CRISPR-Cas9 technology has been successfully applied in Yarrowia lipolytica for targeted genomic editing including gene disruption and integration; however, disruptions by existing methods typically result from small frameshift mutations caused by indels within the coding region, which usually resulted in unnatural protein. In this study, a dual cleavage strategy directed by paired sgRNAs is developed for gene knockout. This method allows fast and robust gene excision, demonstrated on six genes of interest. The targeted regions for excision vary in length from 0.3 kb up to 3.5 kb and contain both non-coding and coding regions. The majority of the gene excisions are repaired by perfect nonhomologous end-joining without indel. Based on this dual cleavage system, two targeted markerless integration methods are developed by providing repair templates. While both strategies are effective, homology mediated end joining (HMEJ) based method are twice as efficient as homology recombination (HR) based method. In both cases, dual cleavage leads to similar or improved gene integration efficiencies compared to gene excision without integration. This dual cleavage strategy will be useful for not only generating more predictable and robust gene knockout, but also for efficient targeted markerless integration, and simultaneous knockout and integration in Y. lipolytica. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Structure and expression of MHC class Ib genes of the central M region in rat and mouse: M4, M5, and M6.

    Science.gov (United States)

    Lambracht-Washington, Doris; Moore, Yuki F; Wonigeit, Kurt; Lindahl, Kirsten Fischer

    2008-04-01

    The M region at the telomeric end of the murine major histocompatibility complex (MHC) contains class I genes that are highly conserved in rat and mouse. We have sequenced a cosmid clone of the LEW rat strain (RT1 haplotype) containing three class I genes, RT1.M6-1, RT1.M4, and RT1.M5. The sequences of allelic genes of the BN strain (RT1n haplotype) were obtained either from cDNAs or genomic clones. For the coding parts of the genes few differences were found between the two RT1 haplotypes. In LEW, however, only RT1.M5 and RT1.M6 have open reading frames; whereas in BN all three genes were intact. In line with the findings in BN, transcription was found for all three rat genes in several tissues from strain Sprague Dawley. Protein expression in transfectants could be demonstrated for RT1.M6-1 using the monoclonal antibody OX18. By sequencing of transcripts obtained by RT-PCR, a second, transcribed M6 gene, RT1.M6-2, was discovered, which maps next to RT1.M6-1 outside of the region covered by the cosmid. In addition, alternatively spliced forms for RT1.M5 and RT1.M6 were detected. Of the orthologous mouse genes, H2-M4, H2-M5, and H2-M6, only H2-M5 has an open reading frame. Other important differences between the corresponding parts of the M region of the two species are insertion of long LINE repeats, duplication of RT1.M6, and the inversion of RT1.M5 in the rat. This demonstrates substantial evolutionary dynamics in this region despite conservation of the class I gene sequences themselves.

  15. Characterization and cloning of TMV resistance gene N homologues ...

    African Journals Online (AJOL)

    Tobacco cultivars Nicotiana tabacum cv. Samsun NN plants carrying the N gene contain a multitude of N-related genes. We cloned a few N homologues and isolated two full-length cDNAs of NL-C26 and NL-B69 genes from N. tabacum cv. Samsun NN. Nucleotide sequence analysis showed that the coding regions of ...

  16. Changes in the Coding and Non-coding Transcriptome and DNA Methylome that Define the Schwann Cell Repair Phenotype after Nerve Injury.

    Science.gov (United States)

    Arthur-Farraj, Peter J; Morgan, Claire C; Adamowicz, Martyna; Gomez-Sanchez, Jose A; Fazal, Shaline V; Beucher, Anthony; Razzaghi, Bonnie; Mirsky, Rhona; Jessen, Kristjan R; Aitman, Timothy J

    2017-09-12

    Repair Schwann cells play a critical role in orchestrating nerve repair after injury, but the cellular and molecular processes that generate them are poorly understood. Here, we perform a combined whole-genome, coding and non-coding RNA and CpG methylation study following nerve injury. We show that genes involved in the epithelial-mesenchymal transition are enriched in repair cells, and we identify several long non-coding RNAs in Schwann cells. We demonstrate that the AP-1 transcription factor C-JUN regulates the expression of certain micro RNAs in repair Schwann cells, in particular miR-21 and miR-34. Surprisingly, unlike during development, changes in CpG methylation are limited in injury, restricted to specific locations, such as enhancer regions of Schwann cell-specific genes (e.g., Nedd4l), and close to local enrichment of AP-1 motifs. These genetic and epigenomic changes broaden our mechanistic understanding of the formation of repair Schwann cell during peripheral nervous system tissue repair. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  17. Evaluation of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy.

    Science.gov (United States)

    Stabej, Polona; Leegwater, Peter A; Stokhof, Arnold A; Domanjko-Petric, Aleksandra; van Oost, Bernard A

    2005-03-01

    To evaluate the role of the phospholamban gene in purebred large-breed dogs with dilated cardiomyopathy (DCM). 6 dogs with DCM, including 2 Doberman Pinschers, 2 Newfoundlands, and 2 Great Danes. All dogs had clinical signs of congestive heart failure, and a diagnosis of DCM was made on the basis of echocardiographic findings. Blood samples were collected from each dog, and genomic DNA was isolated by a salt extraction method. Specific oligonucleotides were designed to amplify the promoter, exon 1, the 5'-part of exon 2 including the complete coding region, and part of intron 1 of the canine phospholamban gene via polymerase chain reaction procedures. These regions were screened for mutations in DNA obtained from the 6 dogs with DCM. No mutations were identified in the promoter, 5' untranslated region, part of intron 1, part of the 3' untranslated region, and the complete coding region of the phospholamban gene in dogs with DCM. Results indicate that mutations in the phospholamban gene are not a frequent cause of DCM in Doberman Pinschers, Newfoundlands, and Great Danes.

  18. Functional Analysis of an ATP-Binding Cassette Transporter Gene in Botrytis cinerea by Gene Disruption

    OpenAIRE

    Masami, NAKAJIMA; Junko, SUZUKI; Takehiko, HOSAKA; Tadaaki, HIBI; Katsumi, AKUTSU; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; School of Agriculture, Ibaraki University; Department of Agriculture and Environmental Biology, The University of Tokyo; School of Agriculture, Ibaraki University

    2001-01-01

    The BMR1 gene encoding an ABC transporter was cloned from Botrytis cinerea. To examine the function of BMR1 in B.cinerea, we isolated BMR1-deficient mutants after gene disruption. Disruption vector pBcDF4 was constructed by replacing the BMR1-coding region with a hygromycin B phosphotransferase gene(hph)cassette. The BMR1 disruptants had an increased sensitivity to polyoxin and iprobenfos. Polyoxin and iprobenfos, structurally unrelated compounds, may therefore be substrates of BMR1.

  19. The CUP2 gene product regulates the expression of the CUP1 gene, coding for yeast metallothionein.

    OpenAIRE

    Welch, J; Fogel, S; Buchman, C; Karin, M

    1989-01-01

    The yeast CUP1 gene codes for a copper-binding protein similar to metallothionein. Copper sensitive cup1s strains contain a single copy of the CUP1 locus. Resistant strains (CUP1r) carry 12 or more multiple tandem copies. We isolated 12 ethyl methane sulfonate-induced copper sensitive mutants in a wild-type CUP1r parental strain, X2180-1A. Most mutants reduce the copper resistance phenotype only slightly. However, the mutant cup2 lowers resistance by nearly two orders of magnitude. We cloned ...

  20. Evaluation of 10 genes encoding cardiac proteins in Doberman Pinschers with dilated cardiomyopathy.

    Science.gov (United States)

    O'Sullivan, M Lynne; O'Grady, Michael R; Pyle, W Glen; Dawson, John F

    2011-07-01

    To identify a causative mutation for dilated cardiomyopathy (DCM) in Doberman Pinschers by sequencing the coding regions of 10 cardiac genes known to be associated with familial DCM in humans. 5 Doberman Pinschers with DCM and congestive heart failure and 5 control mixed-breed dogs that were euthanized or died. RNA was extracted from frozen ventricular myocardial samples from each dog, and first-strand cDNA was synthesized via reverse transcription, followed by PCR amplification with gene-specific primers. Ten cardiac genes were analyzed: cardiac actin, α-actinin, α-tropomyosin, β-myosin heavy chain, metavinculin, muscle LIM protein, myosinbinding protein C, tafazzin, titin-cap (telethonin), and troponin T. Sequences for DCM-affected and control dogs and the published canine genome were compared. None of the coding sequences yielded a common causative mutation among all Doberman Pinscher samples. However, 3 variants were identified in the α-actinin gene in the DCM-affected Doberman Pinschers. One of these variants, identified in 2 of the 5 Doberman Pinschers, resulted in an amino acid change in the rod-forming triple coiled-coil domain. Mutations in the coding regions of several genes associated with DCM in humans did not appear to consistently account for DCM in Doberman Pinschers. However, an α-actinin variant was detected in some Doberman Pinschers that may contribute to the development of DCM given its potential effect on the structure of this protein. Investigation of additional candidate gene coding and noncoding regions and further evaluation of the role of α-actinin in development of DCM in Doberman Pinschers are warranted.

  1. Amino acid codes in mitochondria as possible clues to primitive codes

    Science.gov (United States)

    Jukes, T. H.

    1981-01-01

    Differences between mitochondrial codes and the universal code indicate that an evolutionary simplification has taken place, rather than a return to a more primitive code. However, these differences make it evident that the universal code is not the only code possible, and therefore earlier codes may have differed markedly from the previous code. The present universal code is probably a 'frozen accident.' The change in CUN codons from leucine to threonine (Neurospora vs. yeast mitochondria) indicates that neutral or near-neutral changes occurred in the corresponding proteins when this code change took place, caused presumably by a mutation in a tRNA gene.

  2. Cloning and characterization of the 5'-flanking region of the Ehox gene

    International Nuclear Information System (INIS)

    Lee, Woon Kyu; Kim, Yong-Man; Malik, Nasir; Ma Chang; Westphal, Heiner

    2006-01-01

    The paired-like homeobox-containing gene Ehox plays a role in embryonic stem cell differentiation and is highly expressed in the developing placenta and thymus. To understand the mechanisms of regulation of Ehox gene expression, the 5'-flanking region of the Ehox gene was isolated from a mouse BAC library. 5'-RACE analysis revealed a single transcriptional start site 130 nucleotides upstream of the translation initiation codon. Transient transfection with a luciferase reporter gene under the control of serially deleted 5'-flanking sequences revealed that the nt -84 to -68 region contained a positive cis-acting element for efficient expression of the Ehox gene. Mutational analysis of this region and oligonucleotide competition in the electrophoretic mobility shift assay revealed the presence of a CCAAT box, which is a target for transcription nuclear factor Y (NFY). NFY is essential for positive gene regulation. No tissue-specific enhancer was identified in the 1.9-kb 5'-flanking region of the Ehox gene. Ehox is expressed during the early stages of embryo development, specifically in Brain at 9.5 dpc, as well as during the late stages of embryo development. These results suggest that NFY is an essential regulatory factor for Ehox transcriptional activity, which is important for the post-implantation stage of the developing embryo

  3. Common and rare variants in the exons and regulatory regions of osteoporosis-related genes improve osteoporotic fracture risk prediction.

    Science.gov (United States)

    Lee, Seung Hun; Kang, Moo Il; Ahn, Seong Hee; Lim, Kyeong-Hye; Lee, Gun Eui; Shin, Eun-Soon; Lee, Jong-Eun; Kim, Beom-Jun; Cho, Eun-Hee; Kim, Sang-Wook; Kim, Tae-Ho; Kim, Hyun-Ju; Yoon, Kun-Ho; Lee, Won Chul; Kim, Ghi Su; Koh, Jung-Min; Kim, Shin-Yoon

    2014-11-01

    Osteoporotic fracture risk is highly heritable, but genome-wide association studies have explained only a small proportion of the heritability to date. Genetic data may improve prediction of fracture risk in osteopenic subjects and assist early intervention and management. To detect common and rare variants in coding and regulatory regions related to osteoporosis-related traits, and to investigate whether genetic profiling improves the prediction of fracture risk. This cross-sectional study was conducted in three clinical units in Korea. Postmenopausal women with extreme phenotypes (n = 982) were used for the discovery set, and 3895 participants were used for the replication set. We performed targeted resequencing of 198 genes. Genetic risk scores from common variants (GRS-C) and from common and rare variants (GRS-T) were calculated. Nineteen common variants in 17 genes (of the discovered 34 functional variants in 26 genes) and 31 rare variants in five genes (of the discovered 87 functional variants in 15 genes) were associated with one or more osteoporosis-related traits. Accuracy of fracture risk classification was improved in the osteopenic patients by adding GRS-C to fracture risk assessment models (6.8%; P risk in an osteopenic individual.

  4. New progress in snake mitochondrial gene rearrangement.

    Science.gov (United States)

    Chen, Nian; Zhao, Shujin

    2009-08-01

    To further understand the evolution of snake mitochondrial genomes, the complete mitochondrial DNA (mtDNA) sequences were determined for representative species from two snake families: the Many-banded krait, the Banded krait, the Chinese cobra, the King cobra, the Hundred-pace viper, the Short-tailed mamushi, and the Chain viper. Thirteen protein-coding genes, 22-23 tRNA genes, 2 rRNA genes, and 2 control regions were identified in these mtDNAs. Duplication of the control region and translocation of the tRNAPro gene were two notable features of the snake mtDNAs. These results from the gene rearrangement comparisons confirm the correctness of traditional classification schemes and validate the utility of comparing complete mtDNA sequences for snake phylogeny reconstruction.

  5. Cross-verification of the GENE and XGC codes in preparation for their coupling

    Science.gov (United States)

    Jenko, Frank; Merlo, Gabriele; Bhattacharjee, Amitava; Chang, Cs; Dominski, Julien; Ku, Seunghoe; Parker, Scott; Lanti, Emmanuel

    2017-10-01

    A high-fidelity Whole Device Model (WDM) of a magnetically confined plasma is a crucial tool for planning and optimizing the design of future fusion reactors, including ITER. Aiming at building such a tool, in the framework of the Exascale Computing Project (ECP) the two existing gyrokinetic codes GENE (Eulerian delta-f) and XGC (PIC full-f) will be coupled, thus enabling to carry out first principle kinetic WDM simulations. In preparation for this ultimate goal, a benchmark between the two codes is carried out looking at ITG modes in the adiabatic electron limit. This verification exercise is also joined by the global Lagrangian PIC code ORB5. Linear and nonlinear comparisons have been carried out, neglecting for simplicity collisions and sources. A very good agreement is recovered on frequency, growth rate and mode structure of linear modes. A similarly excellent agreement is also observed comparing the evolution of the heat flux and of the background temperature profile during nonlinear simulations. Work supported by the US DOE under the Exascale Computing Project (17-SC-20-SC).

  6. PanCoreGen - Profiling, detecting, annotating protein-coding genes in microbial genomes.

    Science.gov (United States)

    Paul, Sandip; Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V; Chattopadhyay, Sujay

    2015-12-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing the pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen - a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for a species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars - Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Molecular mechanisms of extensive mitochondrial gene rearrangementin plethodontid salamanders

    Energy Technology Data Exchange (ETDEWEB)

    Mueller, Rachel Lockridge; Boore, Jeffrey L.

    2005-06-01

    Extensive gene rearrangement is reported in the mitochondrial genomes of lungless salamanders (Plethodontidae). In each genome with a novel gene order, there is evidence that the rearrangement was mediated by duplication of part of the mitochondrial genome, including the presence of both pseudogenes and additional, presumably functional, copies of duplicated genes. All rearrangement-mediating duplications include either the origin of light strand replication and the nearby tRNA genes or the regions flanking the origin of heavy strand replication. The latter regions comprise nad6, trnE, cob, trnT, an intergenic spacer between trnT and trnP and, in some genomes, trnP, the control region, trnF, rrnS, trnV, rrnL, trnL1, and nad1. In some cases, two copies of duplicated genes, presumptive regulatory regions, and/or sequences with no assignable function have been retained in the genome following the initial duplication; in other genomes, only one of the duplicated copies has been retained. Both tandem and non-tandem duplications are present in these genomes, suggesting different duplication mechanisms. In some of these mtDNAs, up to 25 percent of the total length is composed of tandem duplications of non-coding sequence that includes putative regulatory regions and/or pseudogenes of tRNAs and protein-coding genes along with otherwise unassignable sequences. These data indicate that imprecise initiation and termination of replication, slipped-strand mispairing, and intra-molecular recombination may all have played a role in generating repeats during the evolutionary history of plethodontid mitochondrial genomes.

  8. Cloning of a human insulin-stimulated protein kinase (ISPK-1) gene and analysis of coding regions and mRNA levels of the ISPK-1 and the protein phosphatase-1 genes in muscle from NIDDM patients

    DEFF Research Database (Denmark)

    Bjørbaek, C; Vik, T A; Echwald, S M

    1995-01-01

    with non-insulin-dependent diabetes mellitus (NIDDM). The human ISPK-1 cDNA was cloned from T-cell leukemia and placental cDNA libraries and mapped to the short arm of the human X chromosome. Single-strand conformation polymorphism (SSCP) analysis identified a total of six variations in the coding regions...

  9. Numerical code to determine the particle trapping region in the LISA machine

    International Nuclear Information System (INIS)

    Azevedo, M.T. de; Raposo, C.C. de; Tomimura, A.

    1984-01-01

    A numerical code is constructed to determine the trapping region in machine like LISA. The variable magnetic field is two deimensional and is coupled to the Runge-Kutta through the Tchebichev polynomial. Various particle orbits including particle interactions were analysed. Beside this, a strong electric field is introduced to see the possible effects happening inside the plasma. (Author) [pt

  10. In silico comparison of genomic regions containing genes coding for enzymes and transcription factors for the phenylpropanoid pathway in Phaseolus vulgaris L. and Glycine max L. Merr

    Directory of Open Access Journals (Sweden)

    Yarmilla eReinprecht

    2013-09-01

    Full Text Available Legumes contain a variety of phytochemicals derived from the phenylpropanoid pathway that have important effects on human health as well as seed coat color, plant disease resistance and nodulation. However, the information about the genes involved in this important pathway is fragmentary in common bean (Phaseolus vulgaris L.. The objectives of this research were to isolate genes that function in and control the phenylpropanoid pathway in common bean, determine their genomic locations in silico in common bean and soybean, and analyze sequences of the 4CL gene family in two common bean genotypes. Sequences of phenylpropanoid pathway genes available for common bean or other plant species were aligned, and the conserved regions were used to design sequence-specific primers. The PCR products were cloned and sequenced and the gene sequences along with common bean gene-based (g markers were BLASTed against the Glycine max v.1.0 genome and the P. vulgaris v.1.0 (Andean early release genome. In addition, gene sequences were BLASTed against the OAC Rex (Mesoamerican genome sequence assembly. In total, fragments of 46 structural and regulatory phenylpropanoid pathway genes were characterized in this way and placed in silico on common bean and soybean sequence maps. The maps contain over 250 common bean g and SSR (simple sequence repeat markers and identify the positions of more than 60 additional phenylpropanoid pathway gene sequences, plus the putative locations of seed coat color genes. The majority of cloned phenylpropanoid pathway gene sequences were mapped to one location in the common bean genome but had two positions in soybean. The comparison of the genomic maps confirmed previous studies, which show that common bean and soybean share genomic regions, including those containing phenylpropanoid pathway gene sequences, with conserved synteny. Indels identified in the comparison of Andean and Mesoamerican common bean sequences might be used to develop

  11. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS; this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  12. Insights into inner ear-specific gene regulation: epigenetics and non-coding RNAs in inner ear development and regeneration

    Science.gov (United States)

    Avraham, Karen B.

    2016-01-01

    The vertebrate inner ear houses highly specialized sensory organs, tuned to detect and encode sound, head motion and gravity. Gene expression programs under the control of transcription factors orchestrate the formation and specialization of the non-sensory inner ear labyrinth and its sensory constituents. More recently, epigenetic factors and non-coding RNAs emerged as an additional layer of gene regulation, both in inner ear development and disease. In this review, we provide an overview on how epigenetic modifications and non-coding RNAs, in particular microRNAs (miRNAs), influence gene expression and summarize recent discoveries that highlight their critical role in the proper formation of the inner ear labyrinth and its sensory organs. In contrast to non-mammalian vertebrates, adult mammals lack the ability to regenerate inner ear mechano-sensory hair cells. Finally, we discuss recent insights into how epigenetic factors and miRNAs may facilitate, or in the case of mammals, restrict sensory hair cell regeneration. PMID:27836639

  13. Chronic ethanol exposure produces time- and brain region-dependent changes in gene coexpression networks.

    Directory of Open Access Journals (Sweden)

    Elizabeth A Osterndorff-Kahanek

    Full Text Available Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY, nucleus accumbens (NAC, prefrontal cortex (PFC, and liver after four weekly cycles of chronic intermittent ethanol (CIE vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000 at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600. Within each region, there was little gene overlap across time (~20%. All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global 'rewiring' of coexpression systems involving glial and immune signaling as well as neuronal genes.

  14. Anterior-posterior regionalized gene expression in the Ciona notochord.

    Science.gov (United States)

    Reeves, Wendy; Thayer, Rachel; Veeman, Michael

    2014-04-01

    In the simple ascidian chordate Ciona, the signaling pathways and gene regulatory networks giving rise to initial notochord induction are largely understood and the mechanisms of notochord morphogenesis are being systematically elucidated. The notochord has generally been thought of as a non-compartmentalized or regionalized organ that is not finely patterned at the level of gene expression. Quantitative imaging methods have recently shown, however, that notochord cell size, shape, and behavior vary consistently along the anterior-posterior (AP) axis. Here we screen candidate genes by whole mount in situ hybridization for potential AP asymmetry. We identify 4 genes that show non-uniform expression in the notochord. Ezrin/radixin/moesin (ERM) is expressed more strongly in the secondary notochord lineage than the primary. CTGF is expressed stochastically in a subset of notochord cells. A novel calmodulin-like gene (BCamL) is expressed more strongly at both the anterior and posterior tips of the notochord. A TGF-β ortholog is expressed in a gradient from posterior to anterior. The asymmetries in ERM, BCamL, and TGF-β expression are evident even before the notochord cells have intercalated into a single-file column. We conclude that the Ciona notochord is not a homogeneous tissue but instead shows distinct patterns of regionalized gene expression. Copyright © 2013 Wiley Periodicals, Inc.

  15. Identification of a cis-regulatory region of a gene in Arabidopsis thaliana whose induction by dehydration is mediated by abscisic acid and requires protein synthesis.

    Science.gov (United States)

    Iwasaki, T; Yamaguchi-Shinozaki, K; Shinozaki, K

    1995-05-20

    In Arabidopsis thaliana, the induction of a dehydration-responsive gene, rd22, is mediated by abscisic acid (ABA) but the gene does not include any sequence corresponding to the consensus ABA-responsive element (ABRE), RYACGTGGYR, in its promoter region. The cis-regulatory region of the rd22 promoter was identified by monitoring the expression of beta-glucuronidase (GUS) activity in leaves of transgenic tobacco plants transformed with chimeric gene fusions constructed between 5'-deleted promoters of rd22 and the coding region of the GUS reporter gene. A 67-bp nucleotide fragment corresponding to positions -207 to -141 of the rd22 promoter conferred responsiveness to dehydration and ABA on a non-responsive promoter. The 67-bp fragment contains the sequences of the recognition sites for some transcription factors, such as MYC, MYB, and GT-1. The fact that accumulation of rd22 mRNA requires protein synthesis raises the possibility that the expression of rd22 might be regulated by one of these trans-acting protein factors whose de novo synthesis is induced by dehydration or ABA. Although the structure of the RD22 protein is very similar to that of a non-storage seed protein, USP, of Vicia faba, the expression of the GUS gene driven by the rd22 promoter in non-stressed transgenic Arabidopsis plants was found mainly in flowers and bolted stems rather than in seeds.

  16. Characterization of a human X-linked gene from the DXS732E locus in the candidate region for the anhidrotic ectodermal dysplasia (EDA) gene (Xq13.1)

    Energy Technology Data Exchange (ETDEWEB)

    Gault, J.; Zonana, J. [Oregon Health Sciences Univ., Portland, OR (United States); Zeltinger, J. [Univ. of Washington, Seattle, WA (United States)] [and others

    1994-09-01

    A conserved mouse genomic clone was used to identify a homologous human genomic clone (the DXS732E locus), which was subsequently employed to isolate cDNAs from a human fetal brain library. Nine unique overlapping cDNAs were isolated, and sequences analysis of 3.9 kb identified a putative 1 kb ORF. GRAIL analysis of the sequence supported the hypothesis that the putative ORF was coding sequence, and Prosite analysis of the putative ORF identified potential glycosylation and phosphorylation sites. The 5{prime} end of the gene maps within a CpG island, and comparison of cDNA sequences indicate the gene is alternatively spliced at its 3{prime} end. Northern analysis and RT-PCR indicate that two different sized messages appear to be expressed with the gene expressed in human fetal kidney, intestine, brain, and muscle. The gene is expressed in 77 day human skin, a time when hair follicle formation occurs. Anhidrotic ectodermal dysplasia (EDA) results in the abnormal morphogenesis of hair, teeth and eccrine sweat glands. A positional cloning strategy towards cloning the EDA gene had been used, and deletion and X-autosome translocation patients have been useful in further delimiting the EDA region. The present gene at the DXS732E locus is partially deleted in one EDA patient who does not have other apparent abnormalities. No rearrangements of the gene have been detected in two female X-autosome translocation EDA patients, nor in four additional male patients with submicroscopic molecular deletions.

  17. Cloning of human genes encoding novel G protein-coupled receptors

    Energy Technology Data Exchange (ETDEWEB)

    Marchese, A.; Docherty, J.M.; Heiber, M. [Univ. of Toronto, (Canada)] [and others

    1994-10-01

    We report the isolation and characterization of several novel human genes encoding G protein-coupled receptors. Each of the receptors contained the familiar seven transmembrane topography and most closely resembled peptide binding receptors. Gene GPR1 encoded a receptor protein that is intronless in the coding region and that shared identity (43% in the transmembrane regions) with the opioid receptors. Northern blot analysis revealed that GPR1 transcripts were expressed in the human hippocampus, and the gene was localized to chromosome 15q21.6. Gene GPR2 encoded a protein that most closely resembled an interleukin-8 receptor (51% in the transmembrane regions), and this gene, not expressed in the six brain regions examined, was localized to chromosome 17q2.1-q21.3. A third gene, GPR3, showed identity (56% in the transmembrane regions) with a previously characterized cDNA clone from rat and was localized to chromosome 1p35-p36.1. 31 refs., 5 figs., 1 tab.

  18. Comparative analysis of chromatin landscape in regulatory regions of human housekeeping and tissue specific genes

    Directory of Open Access Journals (Sweden)

    Dasgupta Dipayan

    2005-05-01

    Full Text Available Abstract Background Global regulatory mechanisms involving chromatin assembly and remodelling in the promoter regions of genes is implicated in eukaryotic transcription control especially for genes subjected to spatial and temporal regulation. The potential to utilise global regulatory mechanisms for controlling gene expression might depend upon the architecture of the chromatin in and around the gene. In-silico analysis can yield important insights into this aspect, facilitating comparison of two or more classes of genes comprising of a large number of genes within each group. Results In the present study, we carried out a comparative analysis of chromatin characteristics in terms of the scaffold/matrix attachment regions, nucleosome formation potential and the occurrence of repetitive sequences, in the upstream regulatory regions of housekeeping and tissue specific genes. Our data show that putative scaffold/matrix attachment regions are more abundant and nucleosome formation potential is higher in the 5' regions of tissue specific genes as compared to the housekeeping genes. Conclusion The differences in the chromatin features between the two groups of genes indicate the involvement of chromatin organisation in the control of gene expression. The presence of global regulatory mechanisms mediated through chromatin organisation can decrease the burden of invoking gene specific regulators for maintenance of the active/silenced state of gene expression. This could partially explain the lower number of genes estimated in the human genome.

  19. Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion.

    Science.gov (United States)

    Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An

    2017-09-11

    The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.

  20. Human serum amyloid genes--molecular characterization

    International Nuclear Information System (INIS)

    Sack, G.H.; Lease, J.J.

    1986-01-01

    Three clones containing human genes for serum amyloid A protein (SAA) have been isolated and characterized. Each of two clones, GSAA 1 and 2 (of 12.8 and 15.9 kilobases, respectively), contains two exons, accouting for amino acids 12-58 and 58-103 of mature SAA; the extreme 5' termini and 5' untranslated regions have not yet been defined but are anticipated to be close based on studies of murine SAA genes. Initial amino acid sequence comparisons show 78/89 identical residues. At 4 of the 11 discrepant residues, the amino acid specified by the codon is the same as the corresponding residue in murine SAA. Identification of regions containing coding regions has permitted use of selected subclones for blot hybridization studies of larger human SAA chromosomal gene organization. The third clone, GSAA 3 also contains SAA coding information by DNA sequence analysis but has a different organization which has not yet been fully described. We have reported the isolation of clones of human DNA hybridizing with pRS48 - a plasmid containing a complementary DNA (cDNA) clone for murine serum amyloid A (SAA; 1, 2). We now present more detailed data confirming the identity and defining some of the organizational features of these clones

  1. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-05-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  2. Identifying Regulatory Patterns at the 3'end Regions of Over-expressed and Under-expressed Genes

    KAUST Repository

    Othoum, Ghofran K

    2013-01-01

    Promoters, neighboring regulatory regions and those extending further upstream of the 5’end of genes, are considered one of the main components affecting the expression status of genes in a specific phenotype. More recently research by Chen et al. (2006, 2012) and Mapendano et al. (2010) demonstrated that the 3’end regulatory regions of genes also influence gene expression. However, the association between the regulatory regions surrounding 3’end of genes and their over- or under-expression status in a particular phenotype has not been systematically studied. The aim of this study is to ascertain if regulatory regions surrounding the 3’end of genes contain sufficient regulatory information to correlate genes with their expression status in a particular phenotype. Over- and under-expressed ovarian cancer (OC) genes were used as a model. Exploratory analysis of the 3’end regions were performed by transforming the annotated regions using principal component analysis (PCA), followed by clustering the transformed data thereby achieving a clear separation of genes with different expression status. Additionally, several classification algorithms such as Naïve Bayes, Random Forest and Support Vector Machine (SVM) were tested with different parameter settings to analyze the discriminatory capacity of the 3’end regions of genes related to their gene expression status. The best performance was achieved using the SVM classification model with 10-fold cross-validation that yielded an accuracy of 98.4%, sensitivity of 99.5% and specificity of 92.5%. For gene expression status for newly available instances, based on information derived from the 3’end regions, an SVM predictive model was developed with 10-fold cross-validation that yielded an accuracy of 67.0%, sensitivity of 73.2% and specificity of 61.0%. Moreover, building an SVM with polynomial kernel model to PCA transformed data yielded an accuracy of 83.1%, sensitivity of 92.5% and specificity of 74.8% using

  3. Evolutionary acquisition of promoter-associated non-coding RNA (pancRNA) repertoires diversifies species-dependent gene activation mechanisms in mammals

    OpenAIRE

    Uesaka, Masahiro; Agata, Kiyokazu; Oishi, Takao; Nakashima, Kinichi; Imamura, Takuya

    2017-01-01

    Background Recent transcriptome analyses have shown that long non-coding RNAs (ncRNAs) play extensive roles in transcriptional regulation. In particular, we have reported that promoter-associated ncRNAs (pancRNAs) activate the partner gene expression via local epigenetic changes. Results Here, we identify thousands of genes under pancRNA-mediated transcriptional activation in five mammalian species in common. In the mouse, 1) pancRNA-partnered genes confined their expression pattern to certai...

  4. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Science.gov (United States)

    Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

    2012-01-01

    Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.

  5. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Directory of Open Access Journals (Sweden)

    Ai-bing Zhang

    Full Text Available Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish and two representing non-coding ITS barcodes (rust fungi and brown algae. Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ and Maximum likelihood (ML methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40% for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37% for 1094 brown algae queries, both using ITS barcodes.

  6. Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

    LENUS (Irish Health Repository)

    Ivanov, Ivaylo P

    2011-05-01

    In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5\\' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5\\' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.

  7. Natural variation of rice blast resistance gene Pi-d2

    Science.gov (United States)

    Studying natural variation of rice resistance (R) genes in cultivated and wild rice relatives can predict resistance stability to rice blast fungus. In the present study, the protein coding regions of rice R gene Pi-d2 in 35 rice accessions of subgroups, aus (AUS), indica (IND), temperate japonica (...

  8. A Common Histone Modification Code on C4 Genes in Maize and Its Conservation in Sorghum and Setaria italica1[W][OA

    Science.gov (United States)

    Heimann, Louisa; Horst, Ina; Perduns, Renke; Dreesen, Björn; Offermann, Sascha; Peterhansel, Christoph

    2013-01-01

    C4 photosynthesis evolved more than 60 times independently in different plant lineages. Each time, multiple genes were recruited into C4 metabolism. The corresponding promoters acquired new regulatory features such as high expression, light induction, or cell type-specific expression in mesophyll or bundle sheath cells. We have previously shown that histone modifications contribute to the regulation of the model C4 phosphoenolpyruvate carboxylase (C4-Pepc) promoter in maize (Zea mays). We here tested the light- and cell type-specific responses of three selected histone acetylations and two histone methylations on five additional C4 genes (C4-Ca, C4-Ppdk, C4-Me, C4-Pepck, and C4-RbcS2) in maize. Histone acetylation and nucleosome occupancy assays indicated extended promoter regions with regulatory upstream regions more than 1,000 bp from the transcription initiation site for most of these genes. Despite any detectable homology of the promoters on the primary sequence level, histone modification patterns were highly coregulated. Specifically, H3K9ac was regulated by illumination, whereas H3K4me3 was regulated in a cell type-specific manner. We further compared histone modifications on the C4-Pepc and C4-Me genes from maize and the homologous genes from sorghum (Sorghum bicolor) and Setaria italica. Whereas sorghum and maize share a common C4 origin, C4 metabolism evolved independently in S. italica. The distribution of histone modifications over the promoters differed between the species, but differential regulation of light-induced histone acetylation and cell type-specific histone methylation were evident in all three species. We propose that a preexisting histone code was recruited into C4 promoter control during the evolution of C4 metabolism. PMID:23564230

  9. Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

    Directory of Open Access Journals (Sweden)

    Tadashi Imanishi

    2004-06-01

    Full Text Available The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/. It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs, identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA

  10. Non-Coding RNAs in Arabidopsis

    DEFF Research Database (Denmark)

    van Wonterghem, Miranda

    This work evolves around elucidating the mechanisms of micro RNAs (miRNAs) in Arabidopsis thaliana. I identified a new class of nuclear non-coding RNAs derived from protein coding genes. The genes are miRNA targets with extensive gene body methylation. The RNA species are nuclear localized and de...

  11. Analysis of antisense expression by whole genome tiling microarrays and siRNAs suggests mis-annotation of Arabidopsis orphan protein-coding genes.

    Directory of Open Access Journals (Sweden)

    Casey R Richardson

    2010-05-01

    Full Text Available MicroRNAs (miRNAs and trans-acting small-interfering RNAs (tasi-RNAs are small (20-22 nt long RNAs (smRNAs generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery.We explored rice (Oryza sativa sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis 'orphan' hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the "ancient" (deeply conserved class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for "new" rapidly-evolving MIRNA genes.Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other

  12. Cloning of a postreplication repair gene in Drosophila

    International Nuclear Information System (INIS)

    Banga, S.S.; Yamamoto, A.H.; Mason, J.M.; Boyd, J.B.

    1987-01-01

    Mutants at the mei-41 locus in Drosophila are strongly hypersensitive to each of eight tested mutagens. Mutant flies exhibit reduced meiotic recombination and elevated levels of chromosomal aberrations. In analogy with the defect in xeroderma pigmentosum variant cells, mei-41 cells are strongly defective in postreplication repair following UV radiation. In preparation for cloning that gene they have performed complementation studies between chromosomal aberrations and mei-41 mutants. That study has localized the mei-41 gene to polytene chromosome bands 14C4-6. A chromosomal walk conducted in that region has recovered about 65 kb of contiguous DNA sequence. The position of the mei-41 gene within that region has been established with the aid of a mutation in that gene which was generated by the insertion of a transposable element. Transcription mapping is being employed to define the complete coding region of the gene in preparation for investigations of gene function

  13. Nucleotide sequence of the triosephosphate isomerase gene from Macaca mulatta

    Energy Technology Data Exchange (ETDEWEB)

    Old, S.E.; Mohrenweiser, H.W. (Univ. of Michigan, Ann Arbor (USA))

    1988-09-26

    The triosephosphate isomerase gene from a rhesus monkey, Macaca mulatta, charon 34 library was sequenced. The human and chimpanzee enzymes differ from the rhesus enzyme at ASN 20 and GLU 198. The nucleotide sequence identity between rhesus and human is 97% in the coding region and >94% in the flanking regions. Comparison of the rhesus and chimp genes, including the intron and flanking sequences, does not suggest a mechanism for generating the two TPI peptides of proliferating cells from hominoids and a single peptide from the rhesus gene.

  14. Isolation and characterization of an auxin-inducible glutathione S-transferase gene of Arabidopsis thaliana

    NARCIS (Netherlands)

    Kop, D.A.M. van der; Schuyer, M.; Scheres, B.J.G.; Zaal, B.J. van der; Hooykaas, P.J.J.

    1996-01-01

    Genes homologous to the auxin-inducible Nt103 glutathione S-transferase (GST) gene of tobacco, were isolated from a genomic library of Arabidopsis thaliana. We isolated a λ clone containing an auxin-inducible gene, At103-1a, and part of a constitutively expressed gene, At103-1b. The coding regions

  15. The spatial distribution of fixed mutations within genes coding for proteins

    Science.gov (United States)

    Holmquist, R.; Goodman, M.; Conroy, T.; Czelusniak, J.

    1983-01-01

    An examination has been conducted of the extensive amino acid sequence data now available for five protein families - the alpha crystallin A chain, myoglobin, alpha and beta hemoglobin, and the cytochromes c - with the goal of estimating the true spatial distribution of base substitutions within genes that code for proteins. In every case the commonly used Poisson density failed to even approximate the experimental pattern of base substitution. For the 87 species of beta hemoglobin examined, for example, the probability that the observed results were from a Poisson process was the minuscule 10 to the -44th. Analogous results were obtained for the other functional families. All the data were reasonably, but not perfectly, described by the negative binomial density. In particular, most of the data were described by one of the very simple limiting forms of this density, the geometric density. The implications of this for evolutionary inference are discussed. It is evident that most estimates of total base substitutions between genes are badly in need of revision.

  16. Evolution of the snake body form reveals homoplasy in amniote Hox gene function.

    Science.gov (United States)

    Head, Jason J; Polly, P David

    2015-04-02

    Hox genes regulate regionalization of the axial skeleton in vertebrates, and changes in their expression have been proposed to be a fundamental mechanism driving the evolution of new body forms. The origin of the snake-like body form, with its deregionalized pre-cloacal axial skeleton, has been explained as either homogenization of Hox gene expression domains, or retention of standard vertebrate Hox domains with alteration of downstream expression that suppresses development of distinct regions. Both models assume a highly regionalized ancestor, but the extent of deregionalization of the primaxial domain (vertebrae, dorsal ribs) of the skeleton in snake-like body forms has never been analysed. Here we combine geometric morphometrics and maximum-likelihood analysis to show that the pre-cloacal primaxial domain of elongate, limb-reduced lizards and snakes is not deregionalized compared with limbed taxa, and that the phylogenetic structure of primaxial morphology in reptiles does not support a loss of regionalization in the evolution of snakes. We demonstrate that morphometric regional boundaries correspond to mapped gene expression domains in snakes, suggesting that their primaxial domain is patterned by a normally functional Hox code. Comparison of primaxial osteology in fossil and modern amniotes with Hox gene distributions within Amniota indicates that a functional, sequentially expressed Hox code patterned a subtle morphological gradient along the anterior-posterior axis in stem members of amniote clades and extant lizards, including snakes. The highly regionalized skeletons of extant archosaurs and mammals result from independent evolution in the Hox code and do not represent ancestral conditions for clades with snake-like body forms. The developmental origin of snakes is best explained by decoupling of the primaxial and abaxial domains and by increases in somite number, not by changes in the function of primaxial Hox genes.

  17. Divergence of recently duplicated M{gamma}-type MADS-box genes in Petunia.

    Science.gov (United States)

    Bemer, Marian; Gordon, Jonathan; Weterings, Koen; Angenent, Gerco C

    2010-02-01

    The MADS-box transcription factor family has expanded considerably in plants via gene and genome duplications and can be subdivided into type I and MIKC-type genes. The two gene classes show a different evolutionary history. Whereas the MIKC-type genes originated during ancient genome duplications, as well as during more recent events, the type I loci appear to experience high turnover with many recent duplications. This different mode of origin also suggests a different fate for the type I duplicates, which are thought to have a higher chance to become silenced or lost from the genome. To get more insight into the evolution of the type I MADS-box genes, we isolated nine type I genes from Petunia, which belong to the Mgamma subclass, and investigated the divergence of their coding and regulatory regions. The isolated genes could be subdivided into two categories: two genes were highly similar to Arabidopsis Mgamma-type genes, whereas the other seven genes showed less similarity to Arabidopsis genes and originated more recently. Two of the recently duplicated genes were found to contain deleterious mutations in their coding regions, and expression analysis revealed that a third paralog was silenced by mutations in its regulatory region. However, in addition to the three genes that were subjected to nonfunctionalization, we also found evidence for neofunctionalization of one of the Petunia Mgamma-type genes. Our study shows a rapid divergence of recently duplicated Mgamma-type MADS-box genes and suggests that redundancy among type I paralogs may be less common than expected.

  18. IN-MACA-MCC: Integrated Multiple Attractor Cellular Automata with Modified Clonal Classifier for Human Protein Coding and Promoter Prediction

    Directory of Open Access Journals (Sweden)

    Kiran Sree Pokkuluri

    2014-01-01

    Full Text Available Protein coding and promoter region predictions are very important challenges of bioinformatics (Attwood and Teresa, 2000. The identification of these regions plays a crucial role in understanding the genes. Many novel computational and mathematical methods are introduced as well as existing methods that are getting refined for predicting both of the regions separately; still there is a scope for improvement. We propose a classifier that is built with MACA (multiple attractor cellular automata and MCC (modified clonal classifier to predict both regions with a single classifier. The proposed classifier is trained and tested with Fickett and Tung (1992 datasets for protein coding region prediction for DNA sequences of lengths 54, 108, and 162. This classifier is trained and tested with MMCRI datasets for protein coding region prediction for DNA sequences of lengths 252 and 354. The proposed classifier is trained and tested with promoter sequences from DBTSS (Yamashita et al., 2006 dataset and nonpromoters from EID (Saxonov et al., 2000 and UTRdb (Pesole et al., 2002 datasets. The proposed model can predict both regions with an average accuracy of 90.5% for promoter and 89.6% for protein coding region predictions. The specificity and sensitivity values of promoter and protein coding region predictions are 0.89 and 0.92, respectively.

  19. Highly conserved non-coding sequences are associated with vertebrate development.

    Directory of Open Access Journals (Sweden)

    Adam Woolfe

    2005-01-01

    Full Text Available In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH, in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development

  20. Regional and temporal differences in gene expression of LH(BETA)T(AG) retinoblastoma tumors.

    Science.gov (United States)

    Houston, Samuel K; Pina, Yolanda; Clarke, Jennifer; Koru-Sengul, Tulay; Scott, William K; Nathanson, Lubov; Schefler, Amy C; Murray, Timothy G

    2011-07-23

    The purpose of this study was to evaluate by microarray the hypothesis that LH(BETA)T(AG) retinoblastoma tumors exhibit regional and temporal variations in gene expression. LH(BETA)T(AG) mice aged 12, 16, and 20 weeks were euthanatized (n = 9). Specimens were taken from five tumor areas (apex, anterior lateral, center, base, and posterior lateral). Samples were hybridized to gene microarrays. The data were preprocessed and analyzed, and genes with a P 2.5 were considered to be differentially expressed. Differentially expressed genes were analyzed for overlap with known networks by using pathway analysis tools. There were significant temporal (P regional differences in gene expression for LH(BETA)T(AG) retinoblastoma tumors. At P 2.5, there were significant changes in gene expression of 190 genes apically, 84 genes anterolaterally, 126 genes posteriorly, 56 genes centrally, and 134 genes at the base. Differentially expressed genes overlapped with known networks, with significant involvement in regulation of cellular proliferation and growth, response to oxygen levels and hypoxia, regulation of cellular processes, cellular signaling cascades, and angiogenesis. There are significant temporal and regional variations in the LH(BETA)T(AG) retinoblastoma model. Differentially expressed genes overlap with key pathways that may play pivotal roles in murine retinoblastoma development. These findings suggest the mechanisms involved in tumor growth and progression in murine retinoblastoma tumors and identify pathways for analysis at a functional level, to determine significance in human retinoblastoma. Microarray analysis of LH(BETA)T(AG) retinal tumors showed significant regional and temporal variations in gene expression, including dysregulation of genes involved in hypoxic responses and angiogenesis.

  1. Transfection of Chinese hamster ovary DHFR/sup -/ cells with the gene coding for heat shock protein 70 from drosophila melanogaster

    International Nuclear Information System (INIS)

    Duffy, J.J.; Carper, S.W.; Gerner, E.W.

    1987-01-01

    Chinese hamster ovary DHFR/sup -/ cells (CHO-DHFR/sup -/) were transfected with the plasmid pSV2-dhfr expressing the mouse gene coding for dhfr or with the same plasmid containing the gene coding for the Drosophila melanogaster heat shock protein 70 (hsp70), pSVd-hsp70. Three subcloned cell lines selected for expression of the dhfr gene were shown to contain either the vector sequence (G cells) or varying copies of pSVd-hsp70 (H cells). One line of H cells was shown to contain > 30 copies of the D. melanogaster hsp70 gene and to express the hsp70 RNA at significant levels. No difference between G and H cells was observed in the rate of growth, in the development of thermotolerance, or in the sensitivity of actin microfilament bundles to heat shock. However, H cells containing the transfected hsp70 gene had an altered morphology when compared to the G cells and the parental CHO-DHFR/sup -/ cells being more fibroblastic. The adhesion properties of the H cells was also decreased when compared to the G cells. These results show that insertion of the D. melanogaster gene into CHO cells does not effect growth rates or heat shock responses but may alter cell morphology and adhesion

  2. PanCoreGen – profiling, detecting, annotating protein-coding genes in microbial genomes

    Science.gov (United States)

    Bhardwaj, Archana; Bag, Sumit K; Sokurenko, Evgeni V.

    2015-01-01

    A large amount of genomic data, especially from multiple isolates of a single species, has opened new vistas for microbial genomics analysis. Analyzing pan-genome (i.e. the sum of genetic repertoire) of microbial species is crucial in understanding the dynamics of molecular evolution, where virulence evolution is of major interest. Here we present PanCoreGen – a standalone application for pan- and core-genomic profiling of microbial protein-coding genes. PanCoreGen overcomes key limitations of the existing pan-genomic analysis tools, and develops an integrated annotation-structure for species-specific pan-genomic profile. It provides important new features for annotating draft genomes/contigs and detecting unidentified genes in annotated genomes. It also generates user-defined group-specific datasets within the pan-genome. Interestingly, analyzing an example-set of Salmonella genomes, we detect potential footprints of adaptive convergence of horizontally transferred genes in two human-restricted pathogenic serovars – Typhi and Paratyphi A. Overall, PanCoreGen represents a state-of-the-art tool for microbial phylogenomics and pathogenomics study. PMID:26456591

  3. A Pectate Lyase-Coding Gene Abundantly Expressed during Early Stages of Infection Is Required for Full Virulence in Alternaria brassicicola.

    Directory of Open Access Journals (Sweden)

    Yangrae Cho

    Full Text Available Alternaria brassicicola causes black spot disease of Brassica species. The functional importance of pectin digestion enzymes and unidentified phytotoxins in fungal pathogenesis has been suspected but not verified in A. brassicicola. The fungal transcription factor AbPf2 is essential for pathogenicity and induces 106 genes during early pathogenesis, including the pectate lyase-coding gene, PL1332. The aim of this study was to test the importance and roles of PL1332 in pathogenesis. We generated deletion strains of the PL1332 gene, produced heterologous PL1332 proteins, and evaluated their association with virulence. Deletion strains of the PL1332 gene were approximately 30% less virulent than wild-type A. brassicicola, without showing differences in colony expansion on solid media and mycelial growth in nutrient-rich liquid media or minimal media with pectins as a major carbon source. Heterologous PL1332 expressed as fusion proteins digested polygalacturons in vitro. When the fusion proteins were injected into the apoplast between leaf veins of host plants the tissues turned dark brown and soft, resembling necrotic leaf tissue. The PL1332 gene was the first example identified as a general toxin-coding gene and virulence factor among the 106 genes regulated by the transcription factor, AbPf2. It was also the first gene to have its functions investigated among the 19 pectate lyase genes and several hundred putative cell-wall degrading enzymes in A. brassicicola. These results further support the importance of the AbPf2 gene as a key pathogenesis regulator and possible target for agrochemical development.

  4. Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation

    International Nuclear Information System (INIS)

    O'Hara, P.J.; Grant, F.J.; Haldeman, B.A.; Gray, C.L.; Insley, M.Y.; Hagen, F.S.; Murray, M.J.

    1987-01-01

    Activated factor VII (factor VIIa) is a vitamin K-dependent plasma serine protease that participates in a cascade of reactions leading to the coagulation of blood. Two overlapping genomic clones containing sequences encoding human factor VII were isolated and characterized. The complete sequence of the gene was determined and found to span about 12.8 kilobases. The mRNA for factor VII as demonstrated by cDNA cloning is polyadenylylated at multiple sites but contains only one AAUAAA poly(A) signal sequence. The mRNA can undergo alternative splicing, forming one transcript containing eight segments as exons and another with an additional exon that encodes a larger prepro leader sequence. The latter transcript has no known counterpart in the other vitamin K-dependent proteins. The positions of the introns with respect to the amino acid sequence encoded by the eight essential exons of factor VII are the same as those present in factor IX, factor X, protein C, and the first three exons of prothrombin. These exons code for domains generally conserved among members of this gene family. The comparable introns in these genes, however, are dissimilar with respect to size and sequence, with the exception of intron C in factor VII and protein C. The gene for factor VII also contains five regions made up of tandem repeats of oligonucleotide monomer elements. More than a quarter of the intron sequences and more than a third of the 3' untranslated portion of the mRNA transcript consist of these minisatellite tandem repeats

  5. Cloning and analysis of the promoter region of the human fibronectin gene

    International Nuclear Information System (INIS)

    Dean, D.C.; Bowlus, C.L.; Bourgeois, S.

    1987-01-01

    Human fibronectin (FN) genomic clones were isolated by screening a human genomic library with a 75-base oligonucleotide. The sequence of the oligonucleotide corresponds to a region near the 5' end of the human FN cDNA clone pFH6 that contains the amino-terminal coding sequences but does not extend to the 5' end of the mRNA. The 5' end of the FN gene is found on a 3.7-kilobase-pair EcoRI fragment that contains about 2.7 kilobase pairs of flanking sequence. The first exon is 414 base pairs long, with a 5' untranslated region of 267 base pairs. As deduced on the basis of the position of the initiation codon, FN is synthesized with a 31-residue amino acid extension on the amion terminus that is not present in the mature polypeptide. This amino-terminal extension appears to contain both a signal peptide and a propeptide. The first 200 base pairs of 5'-flanking sequence is very G+C rich. Upstream of this the sequence becomes relatively A+T rich. The sequence ATATAA is found at -25 and the sequence CAAT is present at -150. The sequence GGGGCGGGGC at -102 exhibits homology to the binding site for the transcription factor SP1, and the sequence TGACGTCA at -173 exhibits homology to 5'-flanking sequences important for induction by cAMP

  6. Genetic coding and gene expression - new Quadruplet genetic coding model

    Science.gov (United States)

    Shankar Singh, Rama

    2012-07-01

    Successful demonstration of human genome project has opened the door not only for developing personalized medicine and cure for genetic diseases, but it may also answer the complex and difficult question of the origin of life. It may lead to making 21st century, a century of Biological Sciences as well. Based on the central dogma of Biology, genetic codons in conjunction with tRNA play a key role in translating the RNA bases forming sequence of amino acids leading to a synthesized protein. This is the most critical step in synthesizing the right protein needed for personalized medicine and curing genetic diseases. So far, only triplet codons involving three bases of RNA, transcribed from DNA bases, have been used. Since this approach has several inconsistencies and limitations, even the promise of personalized medicine has not been realized. The new Quadruplet genetic coding model proposed and developed here involves all four RNA bases which in conjunction with tRNA will synthesize the right protein. The transcription and translation process used will be the same, but the Quadruplet codons will help overcome most of the inconsistencies and limitations of the triplet codes. Details of this new Quadruplet genetic coding model and its subsequent potential applications including relevance to the origin of life will be presented.

  7. Regional TEC model under quiet geomagnetic conditions and low-to-moderate solar activity based on CODE GIMs

    Science.gov (United States)

    Feng, Jiandi; Jiang, Weiping; Wang, Zhengtao; Zhao, Zhenzhen; Nie, Linjuan

    2017-08-01

    Global empirical total electron content (TEC) models based on TEC maps effectively describe the average behavior of the ionosphere. However, the accuracy of these global models for a certain region may not be ideal. Due to the number and distribution of the International GNSS Service (IGS) stations, the accuracy of TEC maps is geographically different. The modeling database derived from the global TEC maps with different accuracy is likely one of the main reasons that limits the accuracy of the new models. Moreover, many anomalies in the ionosphere are geographic or geomagnetic dependent, and as such the accuracy of global models can deteriorate if these anomalies are not fully incorporated into the modeling approach. For regional models built in small areas, these influences on modeling are immensely weakened. Thus, the regional TEC models may better reflect the temporal and spatial variations of TEC. In our previous work (Feng et al., 2016), a regional TEC model TECM-NEC is proposed for northeast China. However, this model is only directed against the typical region of Mid-latitude Summer Nighttime Anomaly (MSNA) occurrence, which is meaningless in other regions without MSNA. Following the technique of TECM-NEC model, this study proposes another regional empirical TEC model for other regions in mid-latitudes. Taking a small area BeiJing-TianJin-Tangshan (JJT) region (37.5°-42.5° N, 115°-120° E) in China as an example, a regional empirical TEC model (TECM-JJT) is proposed using the TEC grid data from January 1, 1999 to June 30, 2015 provided by the Center for Orbit Determination in Europe (CODE) under quiet geomagnetic conditions. The TECM-JJT model fits the input CODE TEC data with a bias of 0.11TECU and a root mean square error of 3.26TECU. Result shows that the regional model TECM-JJT is consistent with CODE TEC data and GPS-TEC data.

  8. Identification of a locus control region for quadruplicated green-sensitive opsin genes in zebrafish

    Science.gov (United States)

    Tsujimura, Taro; Chinen, Akito; Kawamura, Shoji

    2007-01-01

    Duplication of opsin genes has a crucial role in the evolution of visual system. Zebrafish have four green-sensitive (RH2) opsin genes (RH2–1, RH2–2, RH2–3, and RH2–4) arrayed in tandem. They are expressed in the short member of the double cones (SDC) but differ in expression areas in the retina and absorption spectra of their encoding photopigments. The shortest and the second shortest wavelength subtypes, RH2–1 and RH2–2, are expressed in the central-to-dorsal retina. The longer wavelength subtype, RH2–3, is expressed circumscribing the RH2–1/RH2–2 area, and the longest subtype, RH2–4, is expressed further circumscribing the RH2–3 area and mainly occupying the ventral retina. The present report shows that a 0.5-kb region located 15 kb upstream of the RH2 gene array is an essential regulator for their expression. When the 0.5-kb region was deleted from a P1-artificial chromosome (PAC) clone encompassing the four RH2 genes and when one of these genes was replaced with a reporter GFP gene, the GFP expression in SDCs was abolished in the zebrafish to which a series of the modified PAC clones were introduced. Transgenic studies also showed that the 0.5-kb region conferred the SDC-specific expression for promoters of a non-SDC (UV opsin) and a nonretinal (keratin 8) gene. Changing the location of the 0.5-kb region in the PAC clone conferred the highest expression for its proximal gene. The 0.5-kb region was thus designated as RH2-LCR analogous to the locus control region of the L-M opsin genes of primates. PMID:17646658

  9. Sequencing analysis reveals a unique gene organization in the gyrB region of Mycoplasma hominis

    DEFF Research Database (Denmark)

    Ladefoged, Søren; Christiansen, Gunna

    1994-01-01

    of which showed similarity to that which encodes the LicA protein of Haemophilus influenzae. The organization of the genes in the region showed no resemblance to that in the corresponding regions of other bacteria sequenced so far. The gyrA gene was mapped 35 kb downstream from the gyrB gene.......The homolog of the gyrB gene, which has been reported to be present in the vicinity of the initiation site of replication in bacteria, was mapped on the Mycoplasma hominis genome, and the region was subsequently sequenced. Five open reading frames were identified flanking the gyrB gene, one...

  10. Non-Protein Coding RNAs

    CERN Document Server

    Walter, Nils G; Batey, Robert T

    2009-01-01

    This book assembles chapters from experts in the Biophysics of RNA to provide a broadly accessible snapshot of the current status of this rapidly expanding field. The 2006 Nobel Prize in Physiology or Medicine was awarded to the discoverers of RNA interference, highlighting just one example of a large number of non-protein coding RNAs. Because non-protein coding RNAs outnumber protein coding genes in mammals and other higher eukaryotes, it is now thought that the complexity of organisms is correlated with the fraction of their genome that encodes non-protein coding RNAs. Essential biological processes as diverse as cell differentiation, suppression of infecting viruses and parasitic transposons, higher-level organization of eukaryotic chromosomes, and gene expression itself are found to largely be directed by non-protein coding RNAs. The biophysical study of these RNAs employs X-ray crystallography, NMR, ensemble and single molecule fluorescence spectroscopy, optical tweezers, cryo-electron microscopy, and ot...

  11. Cloning and characterization of the major histone H2A genes completes the cloning and sequencing of known histone genes of Tetrahymena thermophila.

    Science.gov (United States)

    Liu, X; Gorovsky, M A

    1996-01-01

    A truncated cDNA clone encoding Tetrahymena thermophila histone H2A2 was isolated using synthetic degenerate oligonucleotide probes derived from H2A protein sequences of Tetrahymena pyriformis. The cDNA clone was used as a homologous probe to isolate a truncated genomic clone encoding H2A1. The remaining regions of the genes for H2A1 (HTA1) and H2A2 (HTA2) were then isolated using inverse PCR on circularized genomic DNA fragments. These partial clones were assembled into intact HTA1 and HTA2 clones. Nucleotide sequences of the two genes were highly homologous within the coding region but not in the noncoding regions. Comparison of the deduced amino acid sequences with protein sequences of T. pyriformis H2As showed only two and three differences respectively, in a total of 137 amino acids for H2A1, and 132 amino acids for H2A2, indicating the two genes arose before the divergence of these two species. The HTA2 gene contains a TAA triplet within the coding region, encoding a glutamine residue. In contrast with the T. thermophila HHO and HTA3 genes, no introns were identified within the two genes. The 5'- and 3'-ends of the histone H2A mRNAs; were determined by RNase protection and by PCR mapping using RACE and RLM-RACE methods. Both genes encode polyadenylated mRNAs and are highly expressed in vegetatively growing cells but only weakly expressed in starved cultures. With the inclusion of these two genes, T. thermophila is the first organism whose entire complement of known core and linker histones, including replication-dependent and basal variants, has been cloned and sequenced. PMID:8760889

  12. Identification of a set of genes showing regionally enriched expression in the mouse brain

    Directory of Open Access Journals (Sweden)

    Marra Marco A

    2008-07-01

    Full Text Available Abstract Background The Pleiades Promoter Project aims to improve gene therapy by designing human mini-promoters ( Results We have utilized LongSAGE to identify regionally enriched transcripts in the adult mouse brain. As supplemental strategies, we also performed a meta-analysis of published literature and inspected the Allen Brain Atlas in situ hybridization data. From a set of approximately 30,000 mouse genes, 237 were identified as showing specific or enriched expression in 30 target regions of the mouse brain. GO term over-representation among these genes revealed co-involvement in various aspects of central nervous system development and physiology. Conclusion Using a multi-faceted expression validation approach, we have identified mouse genes whose human orthologs are good candidates for design of mini-promoters. These mouse genes represent molecular markers in several discrete brain regions/cell-types, which could potentially provide a mechanistic explanation of unique functions performed by each region. This set of markers may also serve as a resource for further studies of gene regulatory elements influencing brain expression.

  13. GUMAP: A GUPIXWIN-compatible code for extracting regional spectra from nuclear microbeam list mode files

    Science.gov (United States)

    Russell, John L.; Campbell, John L.; Boyd, Nicholas I.; Dias, Johnny F.

    2018-02-01

    The newly developed GUMAP software creates element maps from OMDAQ list mode files, displays these maps individually or collectively, and facilitates on-screen definitions of specified regions from which a PIXE spectrum can be built. These include a free-hand region defined by moving the cursor. The regional charge is entered automatically into the spectrum file in a new GUPIXWIN-compatible format, enabling a GUPIXWIN analysis of the spectrum. The code defaults to the OMDAQ dead time treatment but also facilitates two other methods for dead time correction in sample regions with count rates different from the average.

  14. Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

    Science.gov (United States)

    Beauparlant, Marc A; Drouin, Guy

    2014-02-01

    Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.

  15. A novel TaqI polymorphism in the coding region of the ovine TNXB gene in the MHC class III region: morphostructural and physiological influences.

    Science.gov (United States)

    Ajayi, Oyeyemi O; Adefenwa, Mufliat A; Agaviezor, Brilliant O; Ikeobi, Christian O N; Wheto, Matthew; Okpeku, Moses; Amusan, Samuel A; Yakubu, Abdulmojeed; De Donato, Marcos; Peters, Sunday O; Imumorin, Ikhide G

    2014-02-01

    The tenascin-XB (TNXB) gene has antiadhesive effects, functions in matrix maturation in connective tissues, and localizes to the major histocompatibility complex class III region. We hypothesized that it may influence adaptive physiological response through an effect on blood vessel function. We identified a novel g.1324 A→G polymorphism at a TaqI recognition site in a 454 bp fragment of ovine TNXB and genotyped it in 150 Nigerian sheep using PCR-RFLP. The missense mutation changes glutamic acid (GAA) to glycine (GGA). Among SNP genotypes, significant differences (P bone length. Interaction effects of breed, SNP genotype, and geographic location had a significant effect (P < 0.05) on chest girth. The SNP genotype was significantly (P < 0.05) associated with physiological traits of pulse rate and skin temperature. The observed effect of this novel polymorphism may be mediated through its role in connective tissue biology, requiring further association and functional studies.

  16. Structure of genes for dermaseptins B, antimicrobial peptides from frog skin. Exon 1-encoded prepropeptide is conserved in genes for peptides of highly different structures and activities.

    Science.gov (United States)

    Vouille, V; Amiche, M; Nicolas, P

    1997-09-01

    We cloned the genes of two members of the dermaseptin family, broad-spectrum antimicrobial peptides isolated from the skin of the arboreal frog Phyllomedusa bicolor. The dermaseptin gene Drg2 has a 2-exon coding structure interrupted by a small 137-bp intron, wherein exon 1 encoded a 22-residue hydrophobic signal peptide and the first three amino acids of the acidic propiece; exon 2 contained the 18 additional acidic residues of the propiece plus a typical prohormone processing signal Lys-Arg and a 32-residue dermaseptin progenitor sequence. The dermaseptin genes Drg2 and Drg1g2 have conserved sequences at both untranslated ends and in the first and second coding exons. In contrast, Drg1g2 comprises a third coding exon for a short version of the acidic propiece and a second dermaseptin progenitor sequence. Structural conservation between the two genes suggests that Drg1g2 arose recently from an ancestral Drg2-like gene through amplification of part of the second coding exon and 3'-untranslated region. Analysis of the cDNAs coding precursors for several frog skin peptides of highly different structures and activities demonstrates that the signal peptides and part of the acidic propieces are encoded by conserved nucleotides encompassed by the first coding exon of the dermaseptin genes. The organization of the genes that belong to this family, with the signal peptide and the progenitor sequence on separate exons, permits strikingly different peptides to be directed into the secretory pathway. The recruitment of such a homologous 'secretory' exon by otherwise non-homologous genes may have been an early event in the evolution of amphibian.

  17. Deep developmental transcriptome sequencing uncovers numerous new genes and enhances gene annotation in the sponge Amphimedon queenslandica.

    Science.gov (United States)

    Fernandez-Valverde, Selene L; Calcino, Andrew D; Degnan, Bernard M

    2015-05-15

    The demosponge Amphimedon queenslandica is amongst the few early-branching metazoans with an assembled and annotated draft genome, making it an important species in the study of the origin and early evolution of animals. Current gene models in this species are largely based on in silico predictions and low coverage expressed sequence tag (EST) evidence. Amphimedon queenslandica protein-coding gene models are improved using deep RNA-Seq data from four developmental stages and CEL-Seq data from 82 developmental samples. Over 86% of previously predicted genes are retained in the new gene models, although 24% have additional exons; there is also a marked increase in the total number of annotated 3' and 5' untranslated regions (UTRs). Importantly, these new developmental transcriptome data reveal numerous previously unannotated protein-coding genes in the Amphimedon genome, increasing the total gene number by 25%, from 30,060 to 40,122. In general, Amphimedon genes have introns that are markedly smaller than those in other animals and most of the alternatively spliced genes in Amphimedon undergo intron-retention; exon-skipping is the least common mode of alternative splicing. Finally, in addition to canonical polyadenylation signal sequences, Amphimedon genes are enriched in a number of unique AT-rich motifs in their 3' UTRs. The inclusion of developmental transcriptome data has substantially improved the structure and composition of protein-coding gene models in Amphimedon queenslandica, providing a more accurate and comprehensive set of genes for functional and comparative studies. These improvements reveal the Amphimedon genome is comprised of a remarkably high number of tightly packed genes. These genes have small introns and there is pervasive intron retention amongst alternatively spliced transcripts. These aspects of the sponge genome are more similar unicellular opisthokont genomes than to other animal genomes.

  18. JJ1017 committee report: image examination order codes--standardized codes for imaging modality, region, and direction with local expansion: an extension of DICOM.

    Science.gov (United States)

    Kimura, Michio; Kuranishi, Makoto; Sukenobu, Yoshiharu; Watanabe, Hiroki; Tani, Shigeki; Sakusabe, Takaya; Nakajima, Takashi; Morimura, Shinya; Kabata, Shun

    2002-06-01

    The digital imaging and communications in medicine (DICOM) standard includes parts regarding nonimage data information, such as image study ordering data and performed procedure data, and is used for sharing information between HIS/RIS and modality systems, which is essential for IHE. To bring such parts of the DICOM standard into force in Japan, a joint committee of JIRA and JAHIS established the JJ1017 management guideline, specifying, for example, which items are legally required in Japan, while remaining optional in the DICOM standard. In Japan, the contents of orders from referring physicians for radiographic examinations include details of the examination. Such details are not used typically by referring physicians requesting radiographic examinations in the United States, because radiologists in the United States often determine the examination protocol. The DICOM standard has code tables for examination type, region, and direction for image examination orders. However, this investigation found that it does not include items that are detailed sufficiently for use in Japan, because of the above-mentioned reason. To overcome these drawbacks, we have generated the JJ1017 code for these 3 codes for use based on the JJ1017 guidelines. This report introduces the JJ1017 code. These codes (the study type codes in particular) must be expandable to keep up with technical advances in equipment. Expansion has 2 directions: width for covering more categories and depth for specifying the information in more detail (finer categories). The JJ1017 code takes these requirements into consideration and clearly distinguishes between the stem part as the common term and the expansion. The stem part of the JJ1017 code partially utilizes the DICOM codes to remain in line with the DICOM standard. This work is an example of how local requirements can be met by using the DICOM standard and extending it.

  19. A large-scale study of the random variability of a coding sequence: a study on the CFTR gene.

    Science.gov (United States)

    Modiano, Guido; Bombieri, Cristina; Ciminelli, Bianca Maria; Belpinati, Francesca; Giorgi, Silvia; Georges, Marie des; Scotet, Virginie; Pompei, Fiorenza; Ciccacci, Cinzia; Guittard, Caroline; Audrézet, Marie Pierre; Begnini, Angela; Toepfer, Michael; Macek, Milan; Ferec, Claude; Claustres, Mireille; Pignatti, Pier Franco

    2005-02-01

    Coding single nucleotide substitutions (cSNSs) have been studied on hundreds of genes using small samples (n(g) approximately 100-150 genes). In the present investigation, a large random European population sample (average n(g) approximately 1500) was studied for a single gene, the CFTR (Cystic Fibrosis Transmembrane conductance Regulator). The nonsynonymous (NS) substitutions exhibited, in accordance with previous reports, a mean probability of being polymorphic (q > 0.005), much lower than that of the synonymous (S) substitutions, but they showed a similar rate of subpolymorphic (q < 0.005) variability. This indicates that, in autosomal genes that may have harmful recessive alleles (nonduplicated genes with important functions), genetic drift overwhelms selection in the subpolymorphic range of variability, making disadvantageous alleles behave as neutral. These results imply that the majority of the subpolymorphic nonsynonymous alleles of these genes are selectively negative or even pathogenic.

  20. Two-stage sparse coding of region covariance via Log-Euclidean kernels to detect saliency.

    Science.gov (United States)

    Zhang, Ying-Ying; Yang, Cai; Zhang, Ping

    2017-05-01

    In this paper, we present a novel bottom-up saliency detection algorithm from the perspective of covariance matrices on a Riemannian manifold. Each superpixel is described by a region covariance matrix on Riemannian Manifolds. We carry out a two-stage sparse coding scheme via Log-Euclidean kernels to extract salient objects efficiently. In the first stage, given background dictionary on image borders, sparse coding of each region covariance via Log-Euclidean kernels is performed. The reconstruction error on the background dictionary is regarded as the initial saliency of each superpixel. In the second stage, an improvement of the initial result is achieved by calculating reconstruction errors of the superpixels on foreground dictionary, which is extracted from the first stage saliency map. The sparse coding in the second stage is similar to the first stage, but is able to effectively highlight the salient objects uniformly from the background. Finally, three post-processing methods-highlight-inhibition function, context-based saliency weighting, and the graph cut-are adopted to further refine the saliency map. Experiments on four public benchmark datasets show that the proposed algorithm outperforms the state-of-the-art methods in terms of precision, recall and mean absolute error, and demonstrate the robustness and efficiency of the proposed method. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

    Directory of Open Access Journals (Sweden)

    Pupko Tal

    2008-10-01

    Full Text Available Abstract Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.

  2. Methylation of the chicken vitellogenin gene: influence of estradiol administration.

    Science.gov (United States)

    Meijlink, F C; Philipsen, J N; Gruber, M; Ab, G

    1983-01-01

    The degree of methylation of the chicken vitellogenin gene has been investigated. Upon induction by administration of estradiol to a rooster, methyl groups at specific sites near the 5'-end of the gene are eliminated. The process of demethylation is slower than the activation of the gene. Demethylation is therefore probably not a prerequisite to gene transcription. At least two other sites in the coding region of the gene are methylated in the liver of estrogenized roosters, but not in the liver of a laying hen, where the gene is naturally active. Images PMID:6298743

  3. Genetic diversity of the HLA-G coding region in Amerindian populations from the Brazilian Amazon: a possible role of natural selection.

    Science.gov (United States)

    Mendes-Junior, C T; Castelli, E C; Meyer, D; Simões, A L; Donadi, E A

    2013-12-01

    HLA-G has an important role in the modulation of the maternal immune system during pregnancy, and evidence that balancing selection acts in the promoter and 3'UTR regions has been previously reported. To determine whether selection acts on the HLA-G coding region in the Amazon Rainforest, exons 2, 3 and 4 were analyzed in a sample of 142 Amerindians from nine villages of five isolated tribes that inhabit the Central Amazon. Six previously described single-nucleotide polymorphisms (SNPs) were identified and the Expectation-Maximization (EM) and PHASE algorithms were used to computationally reconstruct SNP haplotypes (HLA-G alleles). A new HLA-G allele, which originated in Amerindian populations by a crossing-over event between two widespread HLA-G alleles, was identified in 18 individuals. Neutrality tests evidenced that natural selection has a complex part in the HLA-G coding region. Although balancing selection is the type of selection that shapes variability at a local level (Native American populations), we have also shown that purifying selection may occur on a worldwide scale. Moreover, the balancing selection does not seem to act on the coding region as strongly as it acts on the flanking regulatory regions, and such coding signature may actually reflect a hitchhiking effect.

  4. Gene expression meta-analysis identifies chromosomal regions involved in ovarian cancer survival

    DEFF Research Database (Denmark)

    Thomassen, Mads; Jochumsen, Kirsten M; Mogensen, Ole

    2009-01-01

    the relation of gene expression and chromosomal position to identify chromosomal regions of importance for early recurrence of ovarian cancer. By use of *Gene Set Enrichment Analysis*, we have ranked chromosomal regions according to their association to survival. Over-representation analysis including 1...... using death (P = 0.015) and recurrence (P = 0.002) as outcome. The combined mutation score is strongly associated to upregulation of several growth factor pathways....

  5. Partial characterization of nif genes from the bacterium Azospirillum amazonense

    Directory of Open Access Journals (Sweden)

    D.P. Potrich

    2001-09-01

    Full Text Available Azospirillum amazonense revealed genomic organization patterns of the nitrogen fixation genes similar to those of the distantly related species A. brasilense. Our work suggests that A. brasilense nifHDK, nifENX, fixABC operons and nifA and glnB genes may be structurally homologous to the counterpart genes of A. amazonense. This is the first analysis revealing homology between A. brasilense nif genes and the A. amazonense genome. Sequence analysis of PCR amplification products revealed similarities between the amino acid sequences of the highly conserved nifD and glnB genes of A. amazonense and related genes of A. brasilense and other bacteria. However, the A. amazonense non-coding regions (the upstream activator sequence region and the region between the nifH and nifD genes differed from related regions of A. brasilense even in nitrogenase structural genes which are highly conserved among diazotrophic bacteria. The feasibility of the 16S ribosomal RNA gene-based PCR system for specific detection of A. amazonense was shown. Our results indicate that the PCR primers for 16S rDNA defined in this article are highly specific to A. amazonense and can distinguish this species from A. brasilense.

  6. Influence of the Leader protein coding region of foot-and-mouth disease virus on virus replication

    DEFF Research Database (Denmark)

    Belsham, Graham

    2013-01-01

    The foot-and-mouth disease virus (FMDV) Leader (L) protein is produced in two forms, Lab and Lb, differing only at their amino-termini, due to the use of separate initiation codons, usually 84 nt apart. It has been shown previously, and confirmed here, that precise deletion of the Lab coding......, in the context of the virus lacking the Lb coding region, was also tolerated by the virus within BHK cells. However, precise loss of the Lb coding sequence alone blocked FMDV replication in primary bovine thyroid cells. Thus, the requirement for the Leader protein coding sequences is highly dependent...... on the nature and extent of the residual Leader protein sequences and on the host cell system used. FMDVs precisely lacking Lb and with the Lab initiation codon modified may represent safer seed viruses for vaccine production....

  7. An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

    Science.gov (United States)

    Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

    2017-10-06

    Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.

  8. SCREENING OF ANTIMICROBIAL ACTIVITY AND GENES CODING POLYKETIDE SYNTHETASE AND NONRIBOSOMAL PEPTIDE SYNTHETASE OF ACTINOMYCETE ISOLATES

    Directory of Open Access Journals (Sweden)

    Silvia Kovácsová

    2013-12-01

    Full Text Available The aim of this study was to observe antimicrobial activity using agar plate diffusion method and screening genes coding polyketide synthetase (PKS-I and nonribosomal peptide synthetase (NRPS from actinomycetes. A total of 105 actinomycete strains were isolated from arable soil. Antimicrobial activity was demonstrated at 54 strains against at least 1 of total 12 indicator organisms. Antifungal properties were recorded more often than antibacterial properties. The presence of PKS-I and NRPS genes were founded at 61 of total 105 strains. The number of strains with mentioned biosynthetic enzyme gene fragments matching the anticipated length were 19 (18% and 50 (47% respectively. Overall, five actinomycete strains carried all the biosynthetical genes, yet no antimicrobial activity was found against any of tested pathogens. On the other hand, twenty-one strains showed antimicrobial activity even though we were not able to amplify any of the PKS or NRPS genes from them. Combination of the two methods showed broad-spectrum antimicrobial activity of actinomycetes isolated from arable soil, which indicate that actinomycetes are valuable reservoirs of novel bioactive compounds.

  9. Alternative polyadenylation of tumor suppressor genes in small intestinal neuroendocrine tumors

    DEFF Research Database (Denmark)

    Rehfeld, Anders Aagaard; Plass, Mireya; Døssing, Kristina

    2014-01-01

    The tumorigenesis of small intestinal neuroendocrine tumors (SI-NETs) is poorly understood. Recent studies have associated alternative polyadenylation (APA) with proliferation, cell transformation, and cancer. Polyadenylation is the process in which the pre-messenger RNA is cleaved at a polyA site...... and a polyA tail is added. Genes with two or more polyA sites can undergo APA. This produces two or more distinct mRNA isoforms with different 3' untranslated regions. Additionally, APA can also produce mRNAs containing different 3'-terminal coding regions. Therefore, APA alters both the repertoire...... and the expression level of proteins. Here, we used high-throughput sequencing data to map polyA sites and characterize polyadenylation genome-wide in three SI-NETs and a reference sample. In the tumors, 16 genes showed significant changes of APA pattern, which lead to either the 3' truncation of mRNA coding regions...

  10. Stereoscopic Visual Attention-Based Regional Bit Allocation Optimization for Multiview Video Coding

    Directory of Open Access Journals (Sweden)

    Dai Qionghai

    2010-01-01

    Full Text Available We propose a Stereoscopic Visual Attention- (SVA- based regional bit allocation optimization for Multiview Video Coding (MVC by the exploiting visual redundancies from human perceptions. We propose a novel SVA model, where multiple perceptual stimuli including depth, motion, intensity, color, and orientation contrast are utilized, to simulate the visual attention mechanisms of human visual system with stereoscopic perception. Then, a semantic region-of-interest (ROI is extracted based on the saliency maps of SVA. Both objective and subjective evaluations of extracted ROIs indicated that the proposed SVA model based on ROI extraction scheme outperforms the schemes only using spatial or/and temporal visual attention clues. Finally, by using the extracted SVA-based ROIs, a regional bit allocation optimization scheme is presented to allocate more bits on SVA-based ROIs for high image quality and fewer bits on background regions for efficient compression purpose. Experimental results on MVC show that the proposed regional bit allocation algorithm can achieve over % bit-rate saving while maintaining the subjective image quality. Meanwhile, the image quality of ROIs is improved by  dB at the cost of insensitive image quality degradation of the background image.

  11. Functional and crystallographic characterization of Salmonella typhimurium Cu,Zn superoxide dismutase coded by the sodCI virulence gene

    NARCIS (Netherlands)

    Pesce, A; Battistoni, A; Stroppolo, ME; Polizio, F; Nardini, M; Kroll, JS; Langford, PR; O'Neill, P; Sette, M; Desideri, A; Bolognesi, M

    2000-01-01

    The functional and three-dimensional structural features of Cu,Zn superoxide dismutase coded by the Salmonella typhimurium sodCI gene, have been characterized. Measurements of the catalytic rate indicate that this enzyme is the most efficient superoxide dismutase analyzed so far, a feature that may

  12. Alternative polyadenylation of tumor suppressor genes in small intestinal neuroendocrine tumors.

    Science.gov (United States)

    Rehfeld, Anders; Plass, Mireya; Døssing, Kristina; Knigge, Ulrich; Kjær, Andreas; Krogh, Anders; Friis-Hansen, Lennart

    2014-01-01

    The tumorigenesis of small intestinal neuroendocrine tumors (SI-NETs) is poorly understood. Recent studies have associated alternative polyadenylation (APA) with proliferation, cell transformation, and cancer. Polyadenylation is the process in which the pre-messenger RNA is cleaved at a polyA site and a polyA tail is added. Genes with two or more polyA sites can undergo APA. This produces two or more distinct mRNA isoforms with different 3' untranslated regions. Additionally, APA can also produce mRNAs containing different 3'-terminal coding regions. Therefore, APA alters both the repertoire and the expression level of proteins. Here, we used high-throughput sequencing data to map polyA sites and characterize polyadenylation genome-wide in three SI-NETs and a reference sample. In the tumors, 16 genes showed significant changes of APA pattern, which lead to either the 3' truncation of mRNA coding regions or 3' untranslated regions. Among these, 11 genes had been previously associated with cancer, with 4 genes being known tumor suppressors: DCC, PDZD2, MAGI1, and DACT2. We validated the APA in three out of three cases with quantitative real-time-PCR. Our findings suggest that changes of APA pattern in these 16 genes could be involved in the tumorigenesis of SI-NETs. Furthermore, they also point to APA as a new target for both diagnostic and treatment of SI-NETs. The identified genes with APA specific to the SI-NETs could be further tested as diagnostic markers and drug targets for disease prevention and treatment.

  13. Information-processing genes

    International Nuclear Information System (INIS)

    Tahir Shah, K.

    1995-01-01

    There are an estimated 100,000 genes in the human genome of which 97% is non-coding. On the other hand, bacteria have little or no non-coding DNA. Non-coding region includes introns, ALU sequences, satellite DNA, and other segments not expressed as proteins. Why it exists? Why nature has kept non-coding during the long evolutionary period if it has no role in the development of complex life forms? Does complexity of a species somehow correlated to the existence of apparently useless sequences? What kind of capability is encoded within such nucleotide sequences that is a necessary, but not a sufficient condition for the evolution of complex life forms, keeping in mind the C-value paradox and the omnipresence of non-coding segments in higher eurkaryotes and also in many archea and prokaryotes. The physico-chemical description of biological processes is hardware oriented and does not highlight algorithmic or information processing aspect. However, an algorithm without its hardware implementation is useless as much as hardware without its capability to run an algorithm. The nature and type of computation an information-processing hardware can perform depends only on its algorithm and the architecture that reflects the algorithm. Given that enormously difficult tasks such as high fidelity replication, transcription, editing and regulation are all achieved within a long linear sequence, it is natural to think that some parts of a genome are involved is these tasks. If some complex algorithms are encoded with these parts, then it is natural to think that non-coding regions contain processing-information algorithms. A comparison between well-known automatic sequences and sequences constructed out of motifs is found in all species proves the point: noncoding regions are a sort of ''hardwired'' programs, i.e., they are linear representations of information-processing machines. Thus in our model, a noncoding region, e.g., an intron contains a program (or equivalently, it is

  14. Characterization of the bovine pregnancy-associated glycoprotein gene family – analysis of gene sequences, regulatory regions within the promoter and expression of selected genes

    Directory of Open Access Journals (Sweden)

    Walker Angela M

    2009-04-01

    Full Text Available Abstract Background The Pregnancy-associated glycoproteins (PAGs belong to a large family of aspartic peptidases expressed exclusively in the placenta of species in the Artiodactyla order. In cattle, the PAG gene family is comprised of at least 22 transcribed genes, as well as some variants. Phylogenetic analyses have shown that the PAG family segregates into 'ancient' and 'modern' groupings. Along with sequence differences between family members, there are clear distinctions in their spatio-temporal distribution and in their relative level of expression. In this report, 1 we performed an in silico analysis of the bovine genome to further characterize the PAG gene family, 2 we scrutinized proximal promoter sequences of the PAG genes to evaluate the evolution pressures operating on them and to identify putative regulatory regions, 3 we determined relative transcript abundance of selected PAGs during pregnancy and, 4 we performed preliminary characterization of the putative regulatory elements for one of the candidate PAGs, bovine (bo PAG-2. Results From our analysis of the bovine genome, we identified 18 distinct PAG genes and 14 pseudogenes. We observed that the first 500 base pairs upstream of the translational start site contained multiple regions that are conserved among all boPAGs. However, a preponderance of conserved regions, that harbor recognition sites for putative transcriptional factors (TFs, were found to be unique to the modern boPAG grouping, but not the ancient boPAGs. We gathered evidence by means of Q-PCR and screening of EST databases to show that boPAG-2 is the most abundant of all boPAG transcripts. Finally, we provided preliminary evidence for the role of ETS- and DDVL-related TFs in the regulation of the boPAG-2 gene. Conclusion PAGs represent a relatively large gene family in the bovine genome. The proximal promoter regions of these genes display differences in putative TF binding sites, likely contributing to observed

  15. Origins of De Novo Genes in Human and Chimpanzee.

    Science.gov (United States)

    Ruiz-Orera, Jorge; Hernandez-Rodriguez, Jessica; Chiva, Cristina; Sabidó, Eduard; Kondova, Ivanela; Bontrop, Ronald; Marqués-Bonet, Tomàs; Albà, M Mar

    2015-12-01

    The birth of new genes is an important motor of evolutionary innovation. Whereas many new genes arise by gene duplication, others originate at genomic regions that did not contain any genes or gene copies. Some of these newly expressed genes may acquire coding or non-coding functions and be preserved by natural selection. However, it is yet unclear which is the prevalence and underlying mechanisms of de novo gene emergence. In order to obtain a comprehensive view of this process, we have performed in-depth sequencing of the transcriptomes of four mammalian species--human, chimpanzee, macaque, and mouse--and subsequently compared the assembled transcripts and the corresponding syntenic genomic regions. This has resulted in the identification of over five thousand new multiexonic transcriptional events in human and/or chimpanzee that are not observed in the rest of species. Using comparative genomics, we show that the expression of these transcripts is associated with the gain of regulatory motifs upstream of the transcription start site (TSS) and of U1 snRNP sites downstream of the TSS. In general, these transcripts show little evidence of purifying selection, suggesting that many of them are not functional. However, we find signatures of selection in a subset of de novo genes which have evidence of protein translation. Taken together, the data support a model in which frequently-occurring new transcriptional events in the genome provide the raw material for the evolution of new proteins.

  16. GWAS of DNA Methylation Variation Within Imprinting Control Regions Suggests Parent-of-Origin Association

    NARCIS (Netherlands)

    Renteria, M.E.; Coolen, M.W.; Statham, A.L.; Choi, R.S.; Qu, W.; Campbell, M.J.; Smith, S.; Henders, A.K.; Montgomery, G.W.; Clark, S. J.; Martin, N.G.; Medland, S.E.

    2013-01-01

    Imprinting control regions (ICRs) play a fundamental role in establishing and maintaining the non-random monoallelic expression of certain genes, via common regulatory elements such as non-coding RNAs and differentially methylated regions (DMRs) of DNA. We recently surveyed DNA methylation levels

  17. Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

    Science.gov (United States)

    Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

    2013-12-10

    Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding

  18. The cld mutation: narrowing the critical chromosomal region and selecting candidate genes.

    Science.gov (United States)

    Péterfy, Miklós; Mao, Hui Z; Doolittle, Mark H

    2006-10-01

    Combined lipase deficiency (cld) is a recessive, lethal mutation specific to the tw73 haplotype on mouse Chromosome 17. While the cld mutation results in lipase proteins that are inactive, aggregated, and retained in the endoplasmic reticulum (ER), it maps separately from the lipase structural genes. We have narrowed the gene critical region by about 50% using the tw18 haplotype for deletion mapping and a recombinant chromosome used originally to map cld with respect to the phenotypic marker tf. The region now extends from 22 to 25.6 Mbp on the wild-type chromosome, currently containing 149 genes and 50 expressed sequence tags (ESTs). To identify the affected gene, we have selected candidates based on their known role in associated biological processes, cellular components, and molecular functions that best fit with the predicted function of the cld gene. A secondary approach was based on differences in mRNA levels between mutant (cld/cld) and unaffected (+/cld) cells. Using both approaches, we have identified seven functional candidates with an ER localization and/or an involvement in protein maturation and folding that could explain the lipase deficiency, and six expression candidates that exhibit large differences in mRNA levels between mutant and unaffected cells. Significantly, two genes were found to be candidates with regard to both function and expression, thus emerging as the strongest candidates for cld. We discuss the implications of our mapping results and our selection of candidates with respect to other genes, deletions, and mutations occurring in the cld critical region.

  19. Sequences of the joining region genes for immunoglobulin heavy chains and their role in generation of antibody diversity.

    OpenAIRE

    Gough, N M; Bernard, O

    1981-01-01

    To assess the contribution to immunoglobulin heavy chain diversity made by recombination between variable region (VH) genes and joining region (JH) genes, we have determined the sequence of about 2000 nucleotides spanning the rearranged JH gene cluster associated with the VH gene expressed in plasmacytoma HPC76. The active VH76 gene has recombined with the second germ-line JH gene. The region we have studied contains two other JH genes, designated JH3 and JH4. No other JH gene was found withi...

  20. Gene screening in a Chinese family with Marfan syndrome

    Directory of Open Access Journals (Sweden)

    Wen-Jiao Xia

    2016-05-01

    Full Text Available AIM:To analyze the causative gene mutation for Marfan syndrome(MFSwith autosomal dominant hereditary in a Chinese family in Liaoning Province,China. METHODS: Venous blood was collected and candidate gene was selected to design primers according to the clinical phenotype. With genomic polymerase chain reaction(PCRperformed, the coding exons and their flanking intron in sequences of candidate gene were sequenced,DNA fragments separated by agarose gel electrophoresis and direct sequencing method was used to determine the pathogenic gene.RESULTS:Phenotype of the proband was presented as ectopic lentis. Sequencing of the coding regions of FBN1 gene showed the presence of a heterozygous A→G transversion at nucleotide 640 in the 7 exon of FBN1 and the missense mutation made for Glycine into Serine(G214S. CONCLUSION:A heterozygous mutation of FBN1 c.A640G(p.G214Sis responsible for the Marfan syndrome in the four generation Chinese pedigree.

  1. CRNDE: a long non-coding RNA involved in CanceR, Neurobiology and DEvelopment

    Directory of Open Access Journals (Sweden)

    Blake C. Ellis

    2012-11-01

    Full Text Available CRNDE is the gene symbol for Colorectal Neoplasia Differentially Expressed (non protein-coding, a long non-coding RNA (lncRNA gene that expresses multiple splice variants and displays a very tissue-specific pattern of expression. CRNDE was initially identified as a lncRNA whose expression is highly elevated in colorectal cancer, but it is also upregulated in many other solid tumors and in leukemias. Indeed, CRNDE is the most upregulated lncRNA in gliomas and here, as in other cancers, it is associated with a stemness signature. CRNDE is expressed in specific regions within the human and mouse brain; the mouse ortholog is high in induced pluripotent stem cells and increases further during neuronal differentiation. We suggest that CRNDE is a multifunctional lncRNA whose different splice forms provide specific functional scaffolds for regulatory complexes, such as the polycomb repressive complex 2 (PRC2 and CoREST chromatin-modifying complexes, which CRNDE helps pilot to target genes.

  2. Cloning and expression of gene encoding P23 protein from Cryptosporidium parvum

    Directory of Open Access Journals (Sweden)

    Dinh Thi Bich Lan

    2014-12-01

    Full Text Available We cloned the cp23 gene coding P23 (glycoprotein from Cryptosporidium parvum isolated from Thua Thien Hue province, Vietnam. The coding region of cp23 gene from C. parvum is 99% similar with cp23 gene deposited in NCBI (accession number: U34390. SDS-PAGE and Western blot analysis showed that the cp23 gene in E. coli BL21 StarTM (DE3 produced polypeptides with molecular weights of approximately 37, 40 and 49 kDa. These molecules may be non-glycosylated or glycosylated P23 fusion polypeptides. Recombinant P23 protein purified by GST (glutathione S-transferase affinity chromatography can be used as an antigen for C. parvum antibody production as well as to develop diagnostic kit for C. parvum.

  3. Polymorphisms in Genes Coding for Cytokines, Mannose-Binding Lectin, Collagen Metabolism and Thrombophilia in Women with Cervical Insufficiency

    DEFF Research Database (Denmark)

    Sundtoft, Iben; Uldbjerg, Niels; Steffensen, Rudi

    2015-01-01

    OBJECTIVE: To study the association between cervical insufficiency and single nucleotide polymorphisms in seven genes coding for pro- and anti-inflammatory cytokine-related factors, mannose-binding lectin 2 (MBL2), collagen1α1 (COL1A1), factor II and factor V Leiden genes. METHODS: In a case......-control study, potential maternal biomarkers for cervical insufficiency were investigated in 30 women with a history of second-trimester miscarriage or preterm birth due to cervical insufficiency and in 70 control women. RESULTS: Homozygous carriers of the interleukin 6 (IL6) -174 genotype GG had an odds ratio...... (OR) of 3.1 [95% confidence interval (95% CI) 1.3-7.4, p = 0.01] and MBL2 genotypes coding for low or intermediate levels of plasma MBL had an OR of 3.3 (95% CI 1.2-9.0, p = 0.01) for cervical insufficiency compared with controls. Serum MBL levels were lower in women with cervical insufficiency than...

  4. Porcine SOX9 Gene Expression Is Influenced by an 18 bp Indel in the 5'-Untranslated Region.

    Directory of Open Access Journals (Sweden)

    Bertram Brenig

    Full Text Available Sex determining region Y-box 9 (SOX9 is an important regulator of sex and skeletal development and is expressed in a variety of embryonal and adult tissues. Loss or gain of function resulting from mutations within the coding region or chromosomal aberrations of the SOX9 locus lead to a plethora of detrimental phenotypes in humans and animals. One of these phenotypes is the so-called male-to-female or female-to-male sex-reversal which has been observed in several mammals including pig, dog, cat, goat, horse, and deer. In 38,XX sex-reversal French Large White pigs, a genome-wide association study suggested SOX9 as the causal gene, although no functional mutations were identified in affected animals. However, besides others an 18 bp indel had been detected in the 5'-untranslated region of the SOX9 gene by comparing affected animals and controls. We have identified the same indel (Δ18 between position +247 bp and +266 bp downstream the transcription start site of the porcine SOX9 gene in four other pig breeds; i.e., German Large White, Laiwu Black, Bamei, and Erhualian. These animals have been genotyped in an attempt to identify candidate genes for porcine inguinal and/or scrotal hernia. Because the 18 bp segment in the wild type 5'-UTR harbours a highly conserved cAMP-response element (CRE half-site, we analysed its role in SOX9 expression in vitro. Competition and immunodepletion electromobility shift assays demonstrate that the CRE half-site is specifically recognized by CREB. Both binding of CREB to the wild type as well as the absence of the CRE half-site in Δ18 reduced expression efficiency in HEK293T, PK-15, and ATDC5 cells significantly. Transfection experiments of wild type and Δ18 SOX9 promoter luciferase constructs show a significant reduction of RNA and protein levels depending on the presence or absence of the 18 bp segment. Hence, the data presented here demonstrate that the 18 bp indel in the porcine SOX9 5'-UTR is of functional

  5. Kynurenine 3-Monooxygenase Gene Associated With Nicotine Initiation and Addiction: Analysis of Novel Regulatory Features at 5′ and 3′-Regions

    Directory of Open Access Journals (Sweden)

    Hassan A. Aziz

    2018-06-01

    Full Text Available Tobacco smoking is widespread behavior in Qatar and worldwide and is considered one of the major preventable causes of ill health and death. Nicotine is part of tobacco smoke that causes numerous health risks and is incredibly addictive; it binds to the α7 nicotinic acetylcholine receptor (α7nAChR in the brain. Recent studies showed α7nAChR involvement in the initiation and addiction of smoking. Kynurenic acid (KA, a significant tryptophan metabolite, is an antagonist of α7nAChR. Inhibition of kynurenine 3-monooxygenase enzyme encoded by KMO enhances the KA levels. Modulating KMO gene expression could be a useful tactic for the treatment of tobacco initiation and dependence. Since KMO regulation is still poorly understood, we aimed to investigate the 5′ and 3′-regulatory factors of KMO gene to advance our knowledge to modulate KMO gene expression. In this study, bioinformatics methods were used to identify the regulatory sequences associated with expression of KMO. The displayed differential expression of KMO mRNA in the same tissue and different tissues suggested the specific usage of the KMO multiple alternative promoters. Eleven KMO alternative promoters identified at 5′-regulatory region contain TATA-Box, lack CpG Island (CGI and showed dinucleotide base-stacking energy values specific to transcription factor binding sites (TFBSs. The structural features of regulatory sequences can influence the transcription process and cell type-specific expression. The uncharacterized LOC105373233 locus coding for non-coding RNA (ncRNA located on the reverse strand in a convergent manner at the 3′-side of KMO locus. The two genes likely expressed by a promoter that lacks TATA-Box harbor CGI and two TFBSs linked to the bidirectional transcription, the NRF1, and ZNF14 motifs. We identified two types of microRNA (miR in the uncharacterized LOC105373233 ncRNA, which are like hsa-miR-5096 and hsa-miR-1285-3p and can target the miR recognition

  6. Non-Coding RNAs in Hodgkin Lymphoma

    Directory of Open Access Journals (Sweden)

    Anna Cordeiro

    2017-05-01

    Full Text Available MicroRNAs (miRNAs, small non-coding RNAs that regulate gene expression by binding to the 3’-UTR of their target genes, can act as oncogenes or tumor suppressors. Recently, other types of non-coding RNAs—piwiRNAs and long non-coding RNAs—have also been identified. Hodgkin lymphoma (HL is a B cell origin disease characterized by the presence of only 1% of tumor cells, known as Hodgkin and Reed-Stenberg (HRS cells, which interact with the microenvironment to evade apoptosis. Several studies have reported specific miRNA signatures that can differentiate HL lymph nodes from reactive lymph nodes, identify histologic groups within classical HL, and distinguish HRS cells from germinal center B cells. Moreover, some signatures are associated with survival or response to chemotherapy. Most of the miRNAs in the signatures regulate genes related to apoptosis, cell cycle arrest, or signaling pathways. Here we review findings on miRNAs in HL, as well as on other non-coding RNAs.

  7. Circuit-wide Transcriptional Profiling Reveals Brain Region-Specific Gene Networks Regulating Depression Susceptibility.

    Science.gov (United States)

    Bagot, Rosemary C; Cates, Hannah M; Purushothaman, Immanuel; Lorsch, Zachary S; Walker, Deena M; Wang, Junshi; Huang, Xiaojie; Schlüter, Oliver M; Maze, Ian; Peña, Catherine J; Heller, Elizabeth A; Issler, Orna; Wang, Minghui; Song, Won-Min; Stein, Jason L; Liu, Xiaochuan; Doyle, Marie A; Scobie, Kimberly N; Sun, Hao Sheng; Neve, Rachael L; Geschwind, Daniel; Dong, Yan; Shen, Li; Zhang, Bin; Nestler, Eric J

    2016-06-01

    Depression is a complex, heterogeneous disorder and a leading contributor to the global burden of disease. Most previous research has focused on individual brain regions and genes contributing to depression. However, emerging evidence in humans and animal models suggests that dysregulated circuit function and gene expression across multiple brain regions drive depressive phenotypes. Here, we performed RNA sequencing on four brain regions from control animals and those susceptible or resilient to chronic social defeat stress at multiple time points. We employed an integrative network biology approach to identify transcriptional networks and key driver genes that regulate susceptibility to depressive-like symptoms. Further, we validated in vivo several key drivers and their associated transcriptional networks that regulate depression susceptibility and confirmed their functional significance at the levels of gene transcription, synaptic regulation, and behavior. Our study reveals novel transcriptional networks that control stress susceptibility and offers fundamentally new leads for antidepressant drug discovery. Copyright © 2016 Elsevier Inc. All rights reserved.

  8. Spectrum and Frequency of the GJB2 Gene Pathogenic Variants in a Large Cohort of Patients with Hearing Impairment Living in a Subarctic Region of Russia (the Sakha Republic.

    Directory of Open Access Journals (Sweden)

    Nikolay A Barashkov

    Full Text Available Pathogenic variants in the GJB2 gene, encoding connexin 26, are known to be a major cause of hearing impairment (HI. More than 300 allelic variants have been identified in the GJB2 gene. Spectrum and allelic frequencies of the GJB2 gene vary significantly among different ethnic groups worldwide. Until now, the spectrum and frequency of the pathogenic variants in exon 1, exon 2 and the flanking intronic regions of the GJB2 gene have not been described thoroughly in the Sakha Republic (Yakutia, which is located in a subarctic region in Russia. The complete sequencing of the non-coding and coding regions of the GJB2 gene was performed in 393 patients with HI (Yakuts-296, Russians-51, mixed and other ethnicities-46 and in 187 normal hearing individuals of Yakut (n = 107 and Russian (n = 80 populations. In the total sample (n = 580, we revealed 12 allelic variants of the GJB2 gene, 8 of which were recessive pathogenic variants. Ten genotypes with biallelic recessive pathogenic variants in the GJB2 gene (in a homozygous or a compound heterozygous state were found in 192 out of 393 patients (48.85%. We found that the most frequent GJB2 pathogenic variant in the Yakut patients was c.-23+1G>A (51.82% and that the second most frequent was c.109G>A (2.37%, followed by c.35delG (1.64%. Pathogenic variants с.35delG (22.34%, c.-23+1G>A (5.31%, and c.313_326del14 (2.12% were found to be the most frequent among the Russian patients. The carrier frequencies of the c.-23+1G>A and с.109G>A pathogenic variants in the Yakut control group were 10.20% and 2.80%, respectively. The carrier frequencies of с.35delG and c.101T>C were identical (2.5% in the Russian control group. We found that the contribution of the GJB2 gene pathogenic variants in HI in the population of the Sakha Republic (48.85% was the highest among all of the previously studied regions of Asia. We suggest that extensive accumulation of the c.-23+1G>A pathogenic variant in the indigenous Yakut

  9. Spectrum and Frequency of the GJB2 Gene Pathogenic Variants in a Large Cohort of Patients with Hearing Impairment Living in a Subarctic Region of Russia (the Sakha Republic).

    Science.gov (United States)

    Barashkov, Nikolay A; Pshennikova, Vera G; Posukh, Olga L; Teryutin, Fedor M; Solovyev, Aisen V; Klarov, Leonid A; Romanov, Georgii P; Gotovtsev, Nyurgun N; Kozhevnikov, Andrey A; Kirillina, Elena V; Sidorova, Oksana G; Vasilyevа, Lena M; Fedotova, Elvira E; Morozov, Igor V; Bondar, Alexander A; Solovyevа, Natalya A; Kononova, Sardana K; Rafailov, Adyum M; Sazonov, Nikolay N; Alekseev, Anatoliy N; Tomsky, Mikhail I; Dzhemileva, Lilya U; Khusnutdinova, Elza K; Fedorova, Sardana A

    2016-01-01

    Pathogenic variants in the GJB2 gene, encoding connexin 26, are known to be a major cause of hearing impairment (HI). More than 300 allelic variants have been identified in the GJB2 gene. Spectrum and allelic frequencies of the GJB2 gene vary significantly among different ethnic groups worldwide. Until now, the spectrum and frequency of the pathogenic variants in exon 1, exon 2 and the flanking intronic regions of the GJB2 gene have not been described thoroughly in the Sakha Republic (Yakutia), which is located in a subarctic region in Russia. The complete sequencing of the non-coding and coding regions of the GJB2 gene was performed in 393 patients with HI (Yakuts-296, Russians-51, mixed and other ethnicities-46) and in 187 normal hearing individuals of Yakut (n = 107) and Russian (n = 80) populations. In the total sample (n = 580), we revealed 12 allelic variants of the GJB2 gene, 8 of which were recessive pathogenic variants. Ten genotypes with biallelic recessive pathogenic variants in the GJB2 gene (in a homozygous or a compound heterozygous state) were found in 192 out of 393 patients (48.85%). We found that the most frequent GJB2 pathogenic variant in the Yakut patients was c.-23+1G>A (51.82%) and that the second most frequent was c.109G>A (2.37%), followed by c.35delG (1.64%). Pathogenic variants с.35delG (22.34%), c.-23+1G>A (5.31%), and c.313_326del14 (2.12%) were found to be the most frequent among the Russian patients. The carrier frequencies of the c.-23+1G>A and с.109G>A pathogenic variants in the Yakut control group were 10.20% and 2.80%, respectively. The carrier frequencies of с.35delG and c.101T>C were identical (2.5%) in the Russian control group. We found that the contribution of the GJB2 gene pathogenic variants in HI in the population of the Sakha Republic (48.85%) was the highest among all of the previously studied regions of Asia. We suggest that extensive accumulation of the c.-23+1G>A pathogenic variant in the indigenous Yakut

  10. Partitioning of genetic variation between regulatory and coding gene segments: the predominance of software variation in genes encoding introvert proteins.

    Science.gov (United States)

    Mitchison, A

    1997-01-01

    In considering genetic variation in eukaryotes, a fundamental distinction can be made between variation in regulatory (software) and coding (hardware) gene segments. For quantitative traits the bulk of variation, particularly that near the population mean, appears to reside in regulatory segments. The main exceptions to this rule concern proteins which handle extrinsic substances, here termed extrovert proteins. The immune system includes an unusually large proportion of this exceptional category, but even so its chief source of variation may well be polymorphism in regulatory gene segments. The main evidence for this view emerges from genome scanning for quantitative trait loci (QTL), which in the case of the immune system points to a major contribution of pro-inflammatory cytokine genes. Further support comes from sequencing of major histocompatibility complex (Mhc) class II promoters, where a high level of polymorphism has been detected. These Mhc promoters appear to act, in part at least, by gating the back-signal from T cells into antigen-presenting cells. Both these forms of polymorphism are likely to be sustained by the need for flexibility in the immune response. Future work on promoter polymorphism is likely to benefit from the input from genome informatics.

  11. Tissue specific promoters improve the localization of radiation-inducible gene expression

    International Nuclear Information System (INIS)

    Hallahan, Dennis; Kataoka, Yasushi; Kuchibhotla, Jaya; Virudachalam, Subbu; Weichselbaum, Ralph

    1996-01-01

    Purpose: Site-specific activation of gene expression can be achieved by the use of a promoter that is induced by physical agents such as x-rays. The purpose of the present study was to determine whether site-specific activation of gene therapy can also be achieved within the vascular endothelium by use of radiation-inducible promoters. We studied induction of promoter-reporter gene constructs using previously identified radiation-promoters from c-jun, c-fos, Egr-1, ICAM-1, ELAM-1 after transfection into in the vascular endothelium. Methods: The following radiation-inducible genetic constructs were created: The ELAM-1 promoter fragment was cloned into pOGH to obtain the pE-sel(-587 +35)GH reporter construct. The ICAM-1 promoter fragment (-1162/+1) was cloned upstream of the CAT coding region of the pCAT-plasmid (Promega) after removal of the SV40 promoter by Bgl2/Stu1 digestion to create the pBS-CAT plasmid. The 132 to +170 bp segment of the 5' untranslated region of the c-jun promoter was cloned to the CAT reporter gene to create the -132/+170 cjun-CAT. The Egr-1 promoter fragment (-425/+75) was cloned upstream of the CAT coding region to create the pE425-CAT plasmid. Tandem repeats of the AP-1 binding site were cloned upstream of the CAT coding region (3 xTRE-CAT). Tandem repeats of the Egr binding site (EBS) were cloned upstream of the CAT coding region (EBS-CAT). Human vascular endothelial cells from both large vessel and small vessel origin (HUVEC and HMEC), as well as human tumor cell lines were transfected with plasmids -132/+170 cjun-CAT, pE425-CAT, 3 xTRE-CAT, EBS-CAT, pE-sel-GH and pBS-CAT by use of liposomes. Humor tumor cell lines included SQ20B (squamous), RIT3 (sarcoma), and HL525 (leukemia). Each plasmid was cotransfected with a plasmid containing a CMV promoter linked to the LacZ gene (1 μg). Transfected cells were treated with mock irradiation or x-rays. Cell extracts were assayed for reporter gene expression. Results: Radiation-induced gene

  12. Nonsynonymous substitution rate (Ka is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes

    Directory of Open Access Journals (Sweden)

    Wang Lei

    2011-02-01

    Full Text Available Abstract Background Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics. Results We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka and synonymous (Ks substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks. When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins, whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents. Conclusion Our study suggests that Ka calculation, which is less biased compared to Ks and Ka

  13. A Third Approach to Gene Prediction Suggests Thousands of Additional Human Transcribed Regions

    Science.gov (United States)

    Glusman, Gustavo; Qin, Shizhen; El-Gewely, M. Raafat; Siegel, Andrew F; Roach, Jared C; Hood, Leroy; Smit, Arian F. A

    2006-01-01

    The identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent “genomic deserts.” PMID:16543943

  14. Targeted sequencing of large genomic regions with CATCH-Seq.

    Directory of Open Access Journals (Sweden)

    Kenneth Day

    Full Text Available Current target enrichment systems for large-scale next-generation sequencing typically require synthetic oligonucleotides used as capture reagents to isolate sequences of interest. The majority of target enrichment reagents are focused on gene coding regions or promoters en masse. Here we introduce development of a customizable targeted capture system using biotinylated RNA probe baits transcribed from sheared bacterial artificial chromosome clone templates that enables capture of large, contiguous blocks of the genome for sequencing applications. This clone adapted template capture hybridization sequencing (CATCH-Seq procedure can be used to capture both coding and non-coding regions of a gene, and resolve the boundaries of copy number variations within a genomic target site. Furthermore, libraries constructed with methylated adapters prior to solution hybridization also enable targeted bisulfite sequencing. We applied CATCH-Seq to diverse targets ranging in size from 125 kb to 3.5 Mb. Our approach provides a simple and cost effective alternative to other capture platforms because of template-based, enzymatic probe synthesis and the lack of oligonucleotide design costs. Given its similarity in procedure, CATCH-Seq can also be performed in parallel with commercial systems.

  15. Selection of reference genes in different myocardial regions of an in vivo ischemia/reperfusion rat model for normalization of antioxidant gene expression

    Directory of Open Access Journals (Sweden)

    Vesentini Nicoletta

    2012-02-01

    Full Text Available Abstract Background Changes in cardiac gene expression due to myocardial injury are usually assessed in whole heart tissue. However, as the heart is a heterogeneous system, spatial and temporal heterogeneity is expected in gene expression. Results In an ischemia/reperfusion (I/R rat model we evaluated gene expression of mitochondrial and cytoplasmatic superoxide dismutase (MnSod, Cu-ZnSod and thioredoxin reductase (trxr1 upon short (4 h and long (72 h reperfusion times in the right ventricle (RV, and in the ischemic/reperfused (IRR and the remote region (RR of the left ventricle. Gene expression was assessed by Real-time reverse-transcription quantitative PCR (RT-qPCR. In order to select most stable reference genes suitable for normalization purposes, in each myocardial region we tested nine putative reference genes by geNorm analysis. The genes investigated were: Actin beta (actb, Glyceraldehyde-3-P-dehydrogenase (gapdh, Ribosomal protein L13A (rpl13a, Tyrosine 3-monooxygenase (ywhaz, Beta-glucuronidase (gusb, Hypoxanthine guanine Phosphoribosyltransferase 1 (hprt, TATA binding box protein (tbp, Hydroxymethylbilane synthase (hmbs, Polyadenylate-binding protein 1 (papbn1. According to our findings, most stable reference genes in the RV and RR were hmbs/hprt and hmbs/tbp/hprt respectively. In the IRR, six reference genes were recommended for normalization purposes; however, in view of experimental feasibility limitations, target gene expression could be normalized against the three most stable reference genes (ywhaz/pabp/hmbs without loss of sensitivity. In all cases MnSod and Cu-ZnSod expression decreased upon long reperfusion, the former in all myocardial regions and the latter in IRR alone. trxr1 expression did not vary. Conclusions This study provides a validation of reference genes in the RV and in the anterior and posterior wall of the LV of cardiac ischemia/reperfusion model and shows that gene expression should be assessed separately in

  16. Human polyomavirus JCV late leader peptide region contains important regulatory elements

    International Nuclear Information System (INIS)

    Akan, Ilhan; Sariyer, Ilker Kudret; Biffi, Renato; Palermo, Victoria; Woolridge, Stefanie; White, Martyn K.; Amini, Shohreh; Khalili, Kamel; Safak, Mahmut

    2006-01-01

    Transcription is a complex process that relies on the cooperative interaction between sequence-specific factors and the basal transcription machinery. The strength of a promoter depends on upstream or downstream cis-acting DNA elements, which bind transcription factors. In this study, we investigated whether DNA elements located downstream of the JCV late promoter, encompassing the late leader peptide region, which encodes agnoprotein, play regulatory roles in the JCV lytic cycle. For this purpose, the entire coding region of the leader peptide was deleted and the functional consequences of this deletion were analyzed. We found that viral gene expression and replication were drastically reduced. Gene expression also decreased from a leader peptide point mutant but to a lesser extent. This suggested that the leader peptide region of JCV might contain critical cis-acting DNA elements to which transcription factors bind and regulate viral gene expression and replication. We analyzed the entire coding region of the late leader peptide by a footprinting assay and identified three major regions (region I, II and III) that were protected by nuclear proteins. Further investigation of the first two protected regions by band shift assays revealed a new band that appeared in new infection cycles, suggesting that viral infection induces new factors that interact with the late leader peptide region of JCV. Analysis of the effect of the leader peptide region on the promoter activity of JCV by transfection assays demonstrated that this region has a positive and negative effect on the large T antigen (LT-Ag)-mediated activation of the viral early and late promoters, respectively. Furthermore, a partial deletion analysis of the leader peptide region encompassing the protected regions I and II demonstrated a significant down-regulation of viral gene expression and replication. More importantly, these results were similar to that obtained from a complete deletion of the late leader

  17. Mapping of the serotonin 5-HT{sub 1D{alpha}} autoreceptor gene (HTR1D) on chromosome 1 using a silent polymorphism in the coding region

    Energy Technology Data Exchange (ETDEWEB)

    Ozaki, N.; Lappalainen, J.; Linnoila, M. [National Institute on Alcohol Abuse and Alcoholism, Rockville, MD (United States)] [and others

    1995-04-24

    Serotonin (5-HT){sub ID} receptors are 5-HT release-regulating autoreceptors in the human brain. Abnormalities in brain 5-HT function have been hypothesized in the pathophysiology of various psychiatric disorders, including obsessive-compulsive disorder, autism, mood disorders, eating disorders, impulsive violent behavior, and alcoholism. Thus, mutations occurring in 5-HT autoreceptors may cause or increase the vulnerability to any of these conditions. 5-HT{sub 1D{alpha}} and 5-HT{sub 1D{Beta}} subtypes have been previously localized to chromosomes 1p36.3-p34.3 and 6q13, respectively, using rodent-human hybrids and in situ localization. In this communication, we report the detection of a 5-HT{sub 1D{alpha}} receptor gene polymorphism by single strand conformation polymorphism (SSCP) analysis of the coding sequence. The polymorphism was used for fine scale linkage mapping of 5-HT{sub 1D{alpha}} on chromosome 1. This polymorphism should also be useful for linkage studies in populations and in families. Our analysis also demonstrates that functionally significant coding sequence variants of the 5-HT{sub 1D{alpha}} are probably not abundant either among alcoholics or in the general population. 14 refs., 1 fig., 1 tab.

  18. The Non-Coding Regulatory RNA Revolution in Archaea

    Directory of Open Access Journals (Sweden)

    Diego Rivera Gelsinger

    2018-03-01

    Full Text Available Small non-coding RNAs (sRNAs are ubiquitously found in the three domains of life playing large-scale roles in gene regulation, transposable element silencing and defense against foreign elements. While a substantial body of experimental work has been done to uncover function of sRNAs in Bacteria and Eukarya, the functional roles of sRNAs in Archaea are still poorly understood. Recently, high throughput studies using RNA-sequencing revealed that sRNAs are broadly expressed in the Archaea, comprising thousands of transcripts within the transcriptome during non-challenged and stressed conditions. Antisense sRNAs, which overlap a portion of a gene on the opposite strand (cis-acting, are the most abundantly expressed non-coding RNAs and they can be classified based on their binding patterns to mRNAs (3′ untranslated region (UTR, 5′ UTR, CDS-binding. These antisense sRNAs target many genes and pathways, suggesting extensive roles in gene regulation. Intergenic sRNAs are less abundantly expressed and their targets are difficult to find because of a lack of complete overlap between sRNAs and target mRNAs (trans-acting. While many sRNAs have been validated experimentally, a regulatory role has only been reported for very few of them. Further work is needed to elucidate sRNA-RNA binding mechanisms, the molecular determinants of sRNA-mediated regulation, whether protein components are involved and how sRNAs integrate with complex regulatory networks.

  19. Cloning and characterization of the promoter regions from the parent and paralogous creatine transporter genes.

    Science.gov (United States)

    Ndika, Joseph D T; Lusink, Vera; Beaubrun, Claudine; Kanhai, Warsha; Martinez-Munoz, Cristina; Jakobs, Cornelis; Salomons, Gajja S

    2014-01-10

    Interconversion between phosphocreatine and creatine, catalyzed by creatine kinase is crucial in the supply of ATP to tissues with high energy demand. Creatine's importance has been established by its use as an ergogenic aid in sport, as well as the development of intellectual disability in patients with congenital creatine deficiency. Creatine biosynthesis is complemented by dietary creatine uptake. Intracellular transport of creatine is carried out by a creatine transporter protein (CT1/CRT/CRTR) encoded by the SLC6A8 gene. Most tissues express this gene, with highest levels detected in skeletal muscle and kidney. There are lower levels of the gene detected in colon, brain, heart, testis and prostate. The mechanism(s) by which this regulation occurs is still poorly understood. A duplicated unprocessed pseudogene of SLC6A8-SLC6A10P has been mapped to chromosome 16p11.2 (contains the entire SLC6A8 gene, plus 2293 bp of 5'flanking sequence and its entire 3'UTR). Expression of SLC6A10P has so far only been shown in human testis and brain. It is still unclear as to what is the function of SLC6A10P. In a patient with autism, a chromosomal breakpoint that intersects the 5'flanking region of SLC6A10P was identified; suggesting that SLC6A10P is a non-coding RNA involved in autism. Our aim was to investigate the presence of cis-acting factor(s) that regulate expression of the creatine transporter, as well as to determine if these factors are functionally conserved upstream of the creatine transporter pseudogene. Via gene-specific PCR, cloning and functional luciferase assays we identified a 1104 bp sequence proximal to the mRNA start site of the SLC6A8 gene with promoter activity in five cell types. The corresponding 5'flanking sequence (1050 bp) on the pseudogene also had promoter activity in all 5 cell lines. Surprisingly the pseudogene promoter was stronger than that of its parent gene in 4 of the cell lines tested. To the best of our knowledge, this is the first

  20. The unique genomic properties of sex-biased genes: Insights from avian microarray data

    Directory of Open Access Journals (Sweden)

    Webster Matthew T

    2008-03-01

    Full Text Available Abstract Background In order to develop a framework for the analysis of sex-biased genes, we present a characterization of microarray data comparing male and female gene expression in 18 day chicken embryos for brain, gonad, and heart tissue. Results From the 15982 significantly expressed coding regions that have been assigned to either the autosomes or the Z chromosome (12979 in brain, 13301 in gonad, and 12372 in heart, roughly 18% were significantly sex-biased in any one tissue, though only 4 gene targets were biased in all tissues. The gonad was the most sex-biased tissue, followed by the brain. Sex-biased autosomal genes tended to be expressed at lower levels and in fewer tissues than unbiased gene targets, and autosomal somatic sex-biased genes had more expression noise than similar unbiased genes. Sex-biased genes linked to the Z-chromosome showed reduced expression in females, but not in males, when compared to unbiased Z-linked genes, and sex-biased Z-linked genes were also expressed in fewer tissues than unbiased Z coding regions. Third position GC content, and codon usage bias showed some sex-biased effects, primarily for autosomal genes expressed in the gonad. Finally, there were several over-represented Gene Ontology terms in the sex-biased gene sets. Conclusion On the whole, this analysis suggests that sex-biased genes have unique genomic and organismal properties that delineate them from genes that are expressed equally in males and females.

  1. The PIES2012 Code for Calculating 3D Equilibria with Islands and Stochastic Regions

    Science.gov (United States)

    Monticello, Donald; Reiman, Allan; Raburn, Daniel

    2013-10-01

    We have made major modifications to the PIES 3D equilibrium code to produce a new version, PIES2012. The new version uses an adaptive radial grid for calculating equilibrium currents. A subset of the flux surfaces conform closely to island separatrices, providing an accurate treatment of the effects driving the neoclassical tearing mode. There is now a set of grid surfaces that conform to the flux surfaces in the interiors of the islands, allowing the proper treatment of the current profiles in the islands, which play an important role in tearing phenomena. We have verified that we can introduce appropriate current profiles in the islands to suppress their growth, allowing us to simulate situations where islands are allowed to grow at some rational surfaces but not others. Placement of grid surfaces between islands is guided by the locations of high order fixed points, allowing us to avoid spectral polution and providing a more robust, and smoother convergence of the code. The code now has an option for turning on a vertical magnetic field to fix the position of the magnetic axis, which models the horizontal feedback positioning of a tokamak plasma. The code has a new option for using a Jacobian-Free Newton Krylov scheme for convergence. The code now also contains a model that properly handles stochastic regions with nonzero pressure gradients. Work supported by DOE contract DE-AC02-09CH11466.

  2. Sequence organization and control of transcription in the bacteriophage T4 tRNA region.

    Science.gov (United States)

    Broida, J; Abelson, J

    1985-10-05

    Bacteriophage T4 contains genes for eight transfer RNAs and two stable RNAs of unknown function. These are found in two clusters at 70 X 10(3) base-pairs on the T4 genetic map. To understand the control of transcription in this region we have completed the sequencing of 5000 base-pairs in this region. The sequence contains a part of gene 3, gene 1, gene 57, internal protein I, the tRNA genes and five open reading frames which most likely code for heretofore unidentified proteins. We have used subclones of the region to investigate the kinetics of transcription in vivo. The results show that transcription in this region consists of overlapping early, middle and late transcripts. Transcription is directed from two early promoters, one or two middle promoters and perhaps two late promoters. This region contains all of the features that are seen in T4 transcription and as such is a good place to study the phenomenon in more detail.

  3. Revised genomic structure of the human ghrelin gene and identification of novel exons, alternative splice variants and natural antisense transcripts

    Directory of Open Access Journals (Sweden)

    Herington Adrian C

    2007-08-01

    Full Text Available Abstract Background Ghrelin is a multifunctional peptide hormone expressed in a range of normal tissues and pathologies. It has been reported that the human ghrelin gene consists of five exons which span 5 kb of genomic DNA on chromosome 3 and includes a 20 bp non-coding first exon (20 bp exon 0. The availability of bioinformatic tools enabling comparative analysis and the finalisation of the human genome prompted us to re-examine the genomic structure of the ghrelin locus. Results We have demonstrated the presence of an additional novel exon (exon -1 and 5' extensions to exon 0 and 1 using comparative in silico analysis and have demonstrated their existence experimentally using RT-PCR and 5' RACE. A revised exon-intron structure demonstrates that the human ghrelin gene spans 7.2 kb and consists of six rather than five exons. Several ghrelin gene-derived splice forms were detected in a range of human tissues and cell lines. We have demonstrated ghrelin gene-derived mRNA transcripts that do not code for ghrelin, but instead may encode the C-terminal region of full-length preproghrelin (C-ghrelin, which contains the coding region for obestatin and a transcript encoding obestatin-only. Splice variants that differed in their 5' untranslated regions were also found, suggesting a role of these regions in the post-transcriptional regulation of preproghrelin translation. Finally, several natural antisense transcripts, termed ghrelinOS (ghrelin opposite strand transcripts, were demonstrated via orientation-specific RT-PCR, 5' RACE and in silico analysis of ESTs and cloned amplicons. Conclusion The sense and antisense alternative transcripts demonstrated in this study may function as non-coding regulatory RNA, or code for novel protein isoforms. This is the first demonstration of putative obestatin and C-ghrelin specific transcripts and these findings suggest that these ghrelin gene-derived peptides may also be produced independently of preproghrelin

  4. Origin of sphinx, a young chimeric RNA gene in Drosophila melanogaster

    Science.gov (United States)

    Wang, Wen; Brunet, Frédéric G.; Nevo, Eviatar; Long, Manyuan

    2002-01-01

    Non-protein-coding RNA genes play an important role in various biological processes. How new RNA genes originated and whether this process is controlled by similar evolutionary mechanisms for the origin of protein-coding genes remains unclear. A young chimeric RNA gene that we term sphinx (spx) provides the first insight into the early stage of evolution of RNA genes. spx originated as an insertion of a retroposed sequence of the ATP synthase chain F gene at the cytological region 60DB since the divergence of Drosophila melanogaster from its sibling species 2–3 million years ago. This retrosequence, which is located at 102F on the fourth chromosome, recruited a nearby exon and intron, thereby evolving a chimeric gene structure. This molecular process suggests that the mechanism of exon shuffling, which can generate protein-coding genes, also plays a role in the origin of RNA genes. The subsequent evolutionary process of spx has been associated with a high nucleotide substitution rate, possibly driven by a continuous positive Darwinian selection for a novel function, as is shown in its sex- and development-specific alternative splicing. To test whether spx has adapted to different environments, we investigated its population genetic structure in the unique “Evolution Canyon” in Israel, revealing a similar haplotype structure in spx, and thus similar evolutionary forces operating on spx between environments. PMID:11904380

  5. Investigation of Gamma-aminobutyric acid (GABA A receptors genes and migraine susceptibility

    Directory of Open Access Journals (Sweden)

    Ciccodicola Alfredo

    2008-12-01

    Full Text Available Abstract Background Migraine is a neurological disorder characterized by recurrent attacks of severe headache, affecting around 12% of Caucasian populations. It is well known that migraine has a strong genetic component, although the number and type of genes involved is still unclear. Prior linkage studies have reported mapping of a migraine gene to chromosome Xq 24–28, a region containing a cluster of genes for GABA A receptors (GABRE, GABRA3, GABRQ, which are potential candidate genes for migraine. The GABA neurotransmitter has been implicated in migraine pathophysiology previously; however its exact role has not yet been established, although GABA receptors agonists have been the target of therapeutic developments. The aim of the present research is to investigate the role of the potential candidate genes reported on chromosome Xq 24–28 region in migraine susceptibility. In this study, we have focused on the subunit GABA A receptors type ε (GABRE and type θ (GABRQ genes and their involvement in migraine. Methods We have performed an association analysis in a large population of case-controls (275 unrelated Caucasian migraineurs versus 275 controls examining a set of 3 single nucleotide polymorphisms (SNPs in the coding region (exons 3, 5 and 9 of the GABRE gene and also the I478F coding variant of the GABRQ gene. Results Our study did not show any association between the examined SNPs in our test population (P > 0.05. Conclusion Although these particular GABA receptor genes did not show positive association, further studies are necessary to consider the role of other GABA receptor genes in migraine susceptibility.

  6. The chicken beta 2-microglobulin gene is located on a non-major histocompatibility complex microchromosome: a small, G+C-rich gene with X and Y boxes in the promoter

    DEFF Research Database (Denmark)

    Riegert, P; Andersen, R; Bumstead, N

    1996-01-01

    a similar genomic organization but smaller introns and higher G+C content than mammalian beta 2-microglobulin genes. The promoter region is particularly G+C-rich and contains, in addition to interferon regulatory elements, potential S/W, X, and Y boxes that were originally described for mammalian class II...... but not class I alpha or beta 2-microglobulin genes. There is a single chicken beta 2-microglobulin gene that has little polymorphism in the coding region. Restriction fragment length polymorphisms from Mhc homozygous lines, Mhc congenic lines, and backcross families, as well as in situ hybridization, show...

  7. The small RNA content of human sperm reveals pseudogene-derived piRNAs complementary to protein-coding genes

    DEFF Research Database (Denmark)

    Pantano, Lorena; Jodar, Meritxell; Bak, Mads

    2015-01-01

    -specific genes. The most abundant class of small noncoding RNAs in sperm are PIWI-interacting RNAs (piRNAs). Surprisingly, we found that human sperm cells contain piRNAs processed from pseudogenes. Clusters of piRNAs from human testes contain pseudogenes transcribed in the antisense strand and processed...... into small RNAs. Several human protein-coding genes contain antisense predicted targets of pseudogene-derived piRNAs in the male germline and these piRNAs are still found in mature sperm. Our study provides the most extensive data set and annotation of human sperm small RNAs to date and is a resource...... for further functional studies on the roles of sperm small RNAs. In addition, we propose that some of the pseudogene-derived human piRNAs may regulate expression of their parent gene in the male germline....

  8. Potential efficacy of mitochondrial genes for animal DNA barcoding: a case study using eutherian mammals.

    Science.gov (United States)

    Luo, Arong; Zhang, Aibing; Ho, Simon Yw; Xu, Weijun; Zhang, Yanzhou; Shi, Weifeng; Cameron, Stephen L; Zhu, Chaodong

    2011-01-28

    A well-informed choice of genetic locus is central to the efficacy of DNA barcoding. Current DNA barcoding in animals involves the use of the 5' half of the mitochondrial cytochrome oxidase 1 gene (CO1) to diagnose and delimit species. However, there is no compelling a priori reason for the exclusive focus on this region, and it has been shown that it performs poorly for certain animal groups. To explore alternative mitochondrial barcoding regions, we compared the efficacy of the universal CO1 barcoding region with the other mitochondrial protein-coding genes in eutherian mammals. Four criteria were used for this comparison: the number of recovered species, sequence variability within and between species, resolution to taxonomic levels above that of species, and the degree of mutational saturation. Based on 1,179 mitochondrial genomes of eutherians, we found that the universal CO1 barcoding region is a good representative of mitochondrial genes as a whole because the high species-recovery rate (> 90%) was similar to that of other mitochondrial genes, and there were no significant differences in intra- or interspecific variability among genes. However, an overlap between intra- and interspecific variability was still problematic for all mitochondrial genes. Our results also demonstrated that any choice of mitochondrial gene for DNA barcoding failed to offer significant resolution at higher taxonomic levels. We suggest that the CO1 barcoding region, the universal DNA barcode, is preferred among the mitochondrial protein-coding genes as a molecular diagnostic at least for eutherian species identification. Nevertheless, DNA barcoding with this marker may still be problematic for certain eutherian taxa and our approach can be used to test potential barcoding loci for such groups.

  9. Homology-dependent Gene Silencing in Paramecium

    Science.gov (United States)

    Ruiz, Françoise; Vayssié, Laurence; Klotz, Catherine; Sperling, Linda; Madeddu, Luisa

    1998-01-01

    Microinjection at high copy number of plasmids containing only the coding region of a gene into the Paramecium somatic macronucleus led to a marked reduction in the expression of the corresponding endogenous gene(s). The silencing effect, which is stably maintained throughout vegetative growth, has been observed for all Paramecium genes examined so far: a single-copy gene (ND7), as well as members of multigene families (centrin genes and trichocyst matrix protein genes) in which all closely related paralogous genes appeared to be affected. This phenomenon may be related to posttranscriptional gene silencing in transgenic plants and quelling in Neurospora and allows the efficient creation of specific mutant phenotypes thus providing a potentially powerful tool to study gene function in Paramecium. For the two multigene families that encode proteins that coassemble to build up complex subcellular structures the analysis presented herein provides the first experimental evidence that the members of these gene families are not functionally redundant. PMID:9529389

  10. Characterization of the hemA-prs region of the Escherichia coli and Salmonella typhimurium chromosomes

    DEFF Research Database (Denmark)

    Post, David A.; Hove-Jensen, Bjarne; Switzer, Robert L.

    1993-01-01

    The prs gene, encoding phosphoribosylpyrophosphate synthetase, is preceded by a leader, which is 302 bp long in Escherichia coli and 417 bp in Salmonella typhimurium. A potential open reading frame (ORF) extends across the prs promoter and into the leader. The region between the prs coding region...... two promoters, the first promoter (P1) originating upstream of ORF 1, and expressing the prs gene in a tricistronic operon and a second promoter (P2), located within the ORF 2 coding frame, which transcribes the prs gene only. The transcripts encoding prs only were 20 times as abundant...... in the amount of message originating from the promoter P2....

  11. The neurovirulence and neuroinvasiveness of chimeric tick-borne encephalitis/dengue virus can be attenuated by introducing defined mutations into the envelope and NS5 protein genes and the 3' non-coding region of the genome

    International Nuclear Information System (INIS)

    Engel, Amber R.; Rumyantsev, Alexander A.; Maximova, Olga A.; Speicher, James M.; Heiss, Brian; Murphy, Brian R.; Pletnev, Alexander G.

    2010-01-01

    Tick-borne encephalitis (TBE) is a severe disease affecting thousands of people throughout Eurasia. Despite the use of formalin-inactivated vaccines in endemic areas, an increasing incidence of TBE emphasizes the need for an alternative vaccine that will induce a more durable immunity against TBE virus (TBEV). The chimeric attenuated virus vaccine candidate containing the structural protein genes of TBEV on a dengue virus genetic background (TBEV/DEN4) retains a high level of neurovirulence in both mice and monkeys. Therefore, attenuating mutations were introduced into the envelope (E 315 ) and NS5 (NS5 654,655 ) proteins, and into the 3' non-coding region (Δ30) of TBEV/DEN4. The variant that contained all three mutations (vΔ30/E 315 /NS5 654,655 ) was significantly attenuated for neuroinvasiveness and neurovirulence and displayed a reduced level of replication and virus-induced histopathology in the brains of mice. The high level of safety in the central nervous system indicates that vΔ30/E 315 /NS5 654,655 should be further evaluated as a TBEV vaccine.

  12. Epigenetic codes programming class switch recombination

    Directory of Open Access Journals (Sweden)

    Bharat eVaidyanathan

    2015-09-01

    Full Text Available Class switch recombination imparts B cells with a fitness-associated adaptive advantage during a humoral immune response by using a precision-tailored DNA excision and ligation process to swap the default constant region gene of the antibody with a new one that has unique effector functions. This secondary diversification of the antibody repertoire is a hallmark of the adaptability of B cells when confronted with environmental and pathogenic challenges. Given that the nucleotide sequence of genes during class switching remains unchanged (genetic constraints, it is logical and necessary therefore, to integrate the adaptability of B cells to an epigenetic state, which is dynamic and can be heritably modulated before, after or even during an antibody-dependent immune response. Epigenetic regulation encompasses heritable changes that affect function (phenotype without altering the sequence information embedded in a gene, and include histone, DNA and RNA modifications. Here, we review current literature on how B cells use an epigenetic code language as a means to ensure antibody plasticity in light of pathogenic insults.

  13. Molecular Evolution of the non-coding Eosinophil Granule Ontogeny Transcript EGOT

    Directory of Open Access Journals (Sweden)

    Dominic eRose

    2011-10-01

    Full Text Available Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs. The evolutionary history of mlncRNAs is still largely uncharted territory.In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT, an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs. EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyse patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrat here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved and thermodynamic stable secondary structures.Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element.

  14. Frequent gene conversion events between the X and Y homologous chromosomal regions in primates

    Directory of Open Access Journals (Sweden)

    Hirai Hirohisa

    2010-07-01

    Full Text Available Abstract Background Mammalian sex-chromosomes originated from a pair of autosomes. A step-wise cessation of recombination is necessary for the proper maintenance of sex-determination and, consequently, generates a four strata structure on the X chromosome. Each stratum shows a specific per-site nucleotide sequence difference (p-distance between the X and Y chromosomes, depending on the time of recombination arrest. Stratum 4 covers the distal half of the human X chromosome short arm and the p-distance of the stratum is ~10%, on average. However, a 100-kb region, which includes KALX and VCX, in the middle of stratum 4 shows a significantly lower p-distance (1-5%, suggesting frequent sequence exchanges or gene conversions between the X and Y chromosomes in humans. To examine the evolutionary mechanism for this low p-distance region, sequences of a corresponding region including KALX/Y from seven species of non-human primates were analyzed. Results Phylogenetic analysis of this low p-distance region in humans and non-human primate species revealed that gene conversion like events have taken place at least ten times after the divergence of New World monkeys and Catarrhini (i.e., Old World monkeys and hominoids. A KALY-converted KALX allele in white-handed gibbons also suggests a possible recent gene conversion between the X and Y chromosomes. In these primate sequences, the proximal boundary of this low p-distance region is located in a LINE element shared between the X and Y chromosomes, suggesting the involvement of this element in frequent gene conversions. Together with a palindrome on the Y chromosome, a segmental palindrome structure on the X chromosome at the distal boundary near VCX, in humans and chimpanzees, may mediate frequent sequence exchanges between X and Y chromosomes. Conclusion Gene conversion events between the X and Y homologous regions have been suggested, mainly in humans. Here, we found frequent gene conversions in the

  15. The clinical impact of hypoxia-regulated gene expression in loco-regional gastroesophageal cancer

    DEFF Research Database (Denmark)

    Winther, M.; Alsner, J.; Tramm, T.

    2015-01-01

    Purpose/Objective: In a former study (1), the hypoxia gene expression classifier, developed in head and neck squamous cell carcinomas, was applied in 89 patients with loco-regional gastroesophageal cancer (GC). Analysis of the 15 genes was indicative of hypoxia being more profound in esophagus...... and display greater heterogeneity compared to AC. However, previous indications that the hypoxia classifier might hold prognostic significance in ESCC patients could not be confirmed. Ongoing work includes in vitro studies of esophageal cancer cell lines in order to identify alternative hypoxia induced genes...... and to further explore the prognostic value of hypoxia in patients with loco-regional gastroesophageal cancer. (Figure Presented)....

  16. Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

    Directory of Open Access Journals (Sweden)

    Guo Xiang

    2008-12-01

    Full Text Available Abstract Background Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know

  17. CpG + CpNpG Analysis of Protein-Coding Sequences from Tomato

    DEFF Research Database (Denmark)

    Hobolth, Asger; Nielsen, Rasmus; Wang, Ying

    2006-01-01

    We develop codon-based models for simultaneously inferring the mutational effects of CpG and CpNpG methylation in coding regions. In a data set of 369 tomato genes, we show that there is very little effect of CpNpG methylation but a strong effect of CpG methylation affecting almost all genes. We...... further show that the CpNpG and CpG effects are largely uncorrelated. Our results suggest different roles of CpG and CpNpG methylation, with CpNpG methylation possibly playing a specialized role in defense against transposons and RNA viruses....

  18. Microsatellites in the Eukaryotic DNA Mismatch Repair Genes as Modulators of Evolutionary Mutation Rate

    Science.gov (United States)

    Chang, Dong Kyung; Metzgar, David; Wills, Christopher; Boland, C. Richard

    2003-01-01

    All "minor" components of the human DNA mismatch repair (MMR) system-MSH3, MSH6, PMS2, and the recently discovered MLH3-contain mononucleotide microsatellites in their coding sequences. This intriguing finding contrasts with the situation found in the major components of the DNA MMR system-MSH2 and MLH1-and, in fact, most human genes. Although eukaryotic genomes are rich in microsatellites, non-triplet microsatellites are rare in coding regions. The recurring presence of exonal mononucleotide repeat sequences within a single family of human genes would therefore be considered exceptional.

  19. Genome-wide identification of long non-coding RNA genes and their association with insecticide resistance and metamorphosis in diamondback moth, Plutella xylostella.

    Science.gov (United States)

    Liu, Feiling; Guo, Dianhao; Yuan, Zhuting; Chen, Chen; Xiao, Huamei

    2017-11-20

    Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.

  20. Taurine‑upregulated gene 1: A vital long non‑coding RNA associated with cancer in humans (Review).

    Science.gov (United States)

    Wang, Wen-Yu; Wang, Yan-Fen; Ma, Pei; Xu, Tong-Peng; Shu, Yong-Qian

    2017-11-01

    It is widely reported that long non‑coding RNAs (lncRNAs) are involved in regulating cell differentiation, proliferation, apoptosis and other biological processes. Certain lncRNAs have been found to be crucial in various types of tumor. Taurine‑upregulated gene 1 (TUG1) has been shown to be expressed in a tissue‑specific pattern and exert oncogenic or tumor suppressive functions in different types of cancer in humans. According to previous studies, TUG1 is predominantly located in the nucleus and may regulate gene expression at the transcriptional level. It mediates chromosomal remodeling and coordinates with polycomb repressive complex 2 (PRC2) to regulate gene expression. Although the mechanisms of how TUG1 affects the tumor genesis process remain to be fully elucidated, increasing studies have suggested that TUG1 offers potential as a diagnostic and prognostic biomarker, and as a therapeutic target in certain types of tumor. This review aims to summarize current evidence concerning the characteristics, mechanisms and associations with cancer of TUG1.

  1. Domestication of transposable elements into MicroRNA genes in plants.

    Directory of Open Access Journals (Sweden)

    Yang Li

    Full Text Available Transposable elements (TE usually take up a substantial portion of eukaryotic genome. Activities of TEs can cause genome instability or gene mutations that are harmful or even disastrous to the host. TEs also contribute to gene and genome evolution at many aspects. Part of miRNA genes in mammals have been found to derive from transposons while convincing evidences are absent for plants. We found that a considerable number of previously annotated plant miRNAs are identical or homologous to transposons (TE-MIR, which include a small number of bona fide miRNA genes that conform to generally accepted plant miRNA annotation rules, and hairpin derived siRNAs likely to be pre-evolved miRNAs. Analysis of these TE-MIRs indicate that transitions from the medium to high copy TEs into miRNA genes may undergo steps such as inverted repeat formation, sequence speciation and adaptation to miRNA biogenesis. We also identified initial target genes of the TE-MIRs, which contain homologous sequences in their CDS as consequence of cognate TE insertions. About one-third of the initial target mRNAs are supported by publicly available degradome sequencing data for TE-MIR sRNA induced cleavages. Targets of the TE-MIRs are biased to non-TE related genes indicating their penchant to acquire cellular functions during evolution. Interestingly, most of these TE insertions span boundaries between coding and non-coding sequences indicating their incorporation into CDS through alteration of splicing or translation start or stop signals. Taken together, our findings suggest that TEs in gene rich regions can form foldbacks in non-coding part of transcripts that may eventually evolve into miRNA genes or be integrated into protein coding sequences to form potential targets in a "temperate" manner. Thus, transposons may supply as resources for the evolution of miRNA-target interactions in plants.

  2. Analysis of gene expression profile microarray data in complex regional pain syndrome.

    Science.gov (United States)

    Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing

    2017-09-01

    The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.

  3. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    Science.gov (United States)

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A

  4. Somatic frameshift mutations in the Bloom syndrome BLM gene are frequent in sporadic gastric carcinomas with microsatellite mutator phenotype

    Directory of Open Access Journals (Sweden)

    Matei Irina

    2001-08-01

    Full Text Available Abstract Background Genomic instability has been reported at microsatellite tracts in few coding sequences. We have shown that the Bloom syndrome BLM gene may be a target of microsatelliteinstability (MSI in a short poly-adenine repeat located in its coding region. To further characterize the involvement of BLM in tumorigenesis, we have investigated mutations in nine genes containing coding microsatellites in microsatellite mutator phenotype (MMP positive and negative gastric carcinomas (GCs. Methods We analyzed 50 gastric carcinomas (GCs for mutations in the BLM poly(A tract aswell as in the coding microsatellites of the TGFβ1-RII, IGFIIR, hMSH3, hMSH6, BAX, WRN, RECQL and CBL genes. Results BLM mutations were found in 27% of MMP+ GCs (4/15 cases but not in any of the MMP negative GCs (0/35 cases. The frequency of mutations in the other eight coding regions microsatellite was the following: TGFβ1-RII (60 %, BAX (27%, hMSH6 (20%,hMSH3 (13%, CBL (13%, IGFIIR (7%, RECQL (0% and WRN (0%. Mutations in BLM appear to be more frequently associated with frameshifts in BAX and in hMSH6and/or hMSH3. Tumors with BLM alterations present a higher frequency of unstable mono- and trinucleotide repeats located in coding regions as compared with mutator phenotype tumors without BLM frameshifts. Conclusions BLM frameshifts are frequent alterations in GCs specifically associated with MMP+tumors. We suggest that BLM loss of function by MSI may increase the genetic instability of a pre-existent unstable genotype in gastric tumors.

  5. Somatic frameshift mutations in the Bloom syndrome BLM gene are frequent in sporadic gastric carcinomas with microsatellite mutator phenotype

    Science.gov (United States)

    Calin, George; Ranzani, Guglielmina N; Amadori, Dino; Herlea, Vlad; Matei, Irina; Barbanti-Brodano, Giuseppe; Negrini, Massimo

    2001-01-01

    Background Genomic instability has been reported at microsatellite tracts in few coding sequences. We have shown that the Bloom syndrome BLM gene may be a target of microsatelliteinstability (MSI) in a short poly-adenine repeat located in its coding region. To further characterize the involvement of BLM in tumorigenesis, we have investigated mutations in nine genes containing coding microsatellites in microsatellite mutator phenotype (MMP) positive and negative gastric carcinomas (GCs). Methods We analyzed 50 gastric carcinomas (GCs) for mutations in the BLM poly(A) tract aswell as in the coding microsatellites of the TGFβ1-RII, IGFIIR, hMSH3, hMSH6, BAX, WRN, RECQL and CBL genes. Results BLM mutations were found in 27% of MMP+ GCs (4/15 cases) but not in any of the MMP negative GCs (0/35 cases). The frequency of mutations in the other eight coding regions microsatellite was the following: TGFβ1-RII (60 %), BAX (27%), hMSH6 (20%),hMSH3 (13%), CBL (13%), IGFIIR (7%), RECQL (0%) and WRN (0%). Mutations in BLM appear to be more frequently associated with frameshifts in BAX and in hMSH6and/or hMSH3. Tumors with BLM alterations present a higher frequency of unstable mono- and trinucleotide repeats located in coding regions as compared with mutator phenotype tumors without BLM frameshifts. Conclusions BLM frameshifts are frequent alterations in GCs specifically associated with MMP+tumors. We suggest that BLM loss of function by MSI may increase the genetic instability of a pre-existent unstable genotype in gastric tumors. PMID:11532193

  6. The complete mitochondrial genome of the common sea slater, Ligia oceanica (Crustacea, Isopoda bears a novel gene order and unusual control region features

    Directory of Open Access Journals (Sweden)

    Podsiadlowski Lars

    2006-09-01

    Full Text Available Abstract Background Sequence data and other characters from mitochondrial genomes (gene translocations, secondary structure of RNA molecules are useful in phylogenetic studies among metazoan animals from population to phylum level. Moreover, the comparison of complete mitochondrial sequences gives valuable information about the evolution of small genomes, e.g. about different mechanisms of gene translocation, gene duplication and gene loss, or concerning nucleotide frequency biases. The Peracarida (gammarids, isopods, etc. comprise about 21,000 species of crustaceans, living in many environments from deep sea floor to arid terrestrial habitats. Ligia oceanica is a terrestrial isopod living at rocky seashores of the european North Sea and Atlantic coastlines. Results The study reveals the first complete mitochondrial DNA sequence from a peracarid crustacean. The mitochondrial genome of Ligia oceanica is a circular double-stranded DNA molecule, with a size of 15,289 bp. It shows several changes in mitochondrial gene order compared to other crustacean species. An overview about mitochondrial gene order of all crustacean taxa yet sequenced is also presented. The largest non-coding part (the putative mitochondrial control region of the mitochondrial genome of Ligia oceanica is unexpectedly not AT-rich compared to the remainder of the genome. It bears two repeat regions (4× 10 bp and 3× 64 bp, and a GC-rich hairpin-like secondary structure. Some of the transfer RNAs show secondary structures which derive from the usual cloverleaf pattern. While some tRNA genes are putative targets for RNA editing, trnR could not be localized at all. Conclusion Gene order is not conserved among Peracarida, not even among isopods. The two isopod species Ligia oceanica and Idotea baltica show a similarly derived gene order, compared to the arthropod ground pattern and to the amphipod Parhyale hawaiiensis, suggesting that most of the translocation events were already

  7. The Norrie disease gene maps to a 150 kb region on chromosome Xp11.3.

    Science.gov (United States)

    Sims, K B; Lebo, R V; Benson, G; Shalish, C; Schuback, D; Chen, Z Y; Bruns, G; Craig, I W; Golbus, M S; Breakefield, X O

    1992-05-01

    Norrie disease is a human X-linked recessive disorder of unknown etiology characterized by congenital blindness, sensory neural deafness and mental retardation. This disease gene was previously linked to the DXS7 (L1.28) locus and the MAO genes in band Xp11.3. We report here fine physical mapping of the obligate region containing the Norrie disease gene (NDP) defined by a recombination and by the smallest submicroscopic chromosomal deletion associated with Norrie disease identified to date. Analysis, using in addition two overlapping YAC clones from this region, allowed orientation of the MAOA and MAOB genes in a 5'-3'-3'-5' configuration. A recombination event between a (GT)n polymorphism in intron 2 of the MAOB gene and the NDP locus, in a family previously reported to have a recombination between DXS7 and NDP, delineates a flanking marker telomeric to this disease gene. An anonymous DNA probe, dc12, present in one of the YACs and in a patient with a submicroscopic deletion which includes MAOA and MAOB but not L1.28, serves as a flanking marker centromeric to the disease gene. An Alu-PCR fragment from the right arm of the MAO YAC (YMAO.AluR) is not deleted in this patient and also delineates the centromeric extent of the obligate disease region. The apparent order of these loci is telomere ... DXS7-MAOA-MAOB-NDP-dc12-YMAO.AluR ... centromere. Together these data define the obligate region containing the NDP gene to a chromosomal segment less than 150 kb.

  8. Genetic recombination is targeted towards gene promoter regions in dogs.

    Science.gov (United States)

    Auton, Adam; Rui Li, Ying; Kidd, Jeffrey; Oliveira, Kyle; Nadel, Julie; Holloway, J Kim; Hayward, Jessica J; Cohen, Paula E; Greally, John M; Wang, Jun; Bustamante, Carlos D; Boyko, Adam R

    2013-01-01

    The identification of the H3K4 trimethylase, PRDM9, as the gene responsible for recombination hotspot localization has provided considerable insight into the mechanisms by which recombination is initiated in mammals. However, uniquely amongst mammals, canids appear to lack a functional version of PRDM9 and may therefore provide a model for understanding recombination that occurs in the absence of PRDM9, and thus how PRDM9 functions to shape the recombination landscape. We have constructed a fine-scale genetic map from patterns of linkage disequilibrium assessed using high-throughput sequence data from 51 free-ranging dogs, Canis lupus familiaris. While broad-scale properties of recombination appear similar to other mammalian species, our fine-scale estimates indicate that canine highly elevated recombination rates are observed in the vicinity of CpG rich regions including gene promoter regions, but show little association with H3K4 trimethylation marks identified in spermatocytes. By comparison to genomic data from the Andean fox, Lycalopex culpaeus, we show that biased gene conversion is a plausible mechanism by which the high CpG content of the dog genome could have occurred.

  9. Complete mitochondrial genome of endangered Yellow-shouldered Amazon (Amazona barbadensis): two control region copies in parrot species of the Amazona genus.

    Science.gov (United States)

    Urantowka, Adam Dawid; Hajduk, Kacper; Kosowska, Barbara

    2013-08-01

    Amazona barbadensis is an endangered species of parrot living in northern coastal Venezuela and in several Caribbean islands. In this study, we sequenced full mitochondrial genome of the considered species. The total length of the mitogenome was 18,983 bp and contained 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, duplicated control region, and degenerate copies of ND6 and tRNA (Glu) genes. High degree of identity between two copies of control region suggests their coincident evolution and functionality. Comparative analysis of both the control region sequences from four Amazona species revealed their 89.1% identity over a region of 1300 bp and indicates the presence of distinctive parts of two control region copies.

  10. Enhancer-driven chromatin interactions during development promote escape from silencing by a long non-coding RNA

    Directory of Open Access Journals (Sweden)

    Korostowski Lisa

    2011-11-01

    Full Text Available Abstract Background Gene regulation in eukaryotes is a complex process entailing the establishment of transcriptionally silent chromatin domains interspersed with regions of active transcription. Imprinted domains consist of clusters of genes, some of which exhibit parent-of-origin dependent monoallelic expression, while others are biallelic. The Kcnq1 imprinted domain illustrates the complexities of long-range regulation that coexists with local exceptions. A paternally expressed repressive non-coding RNA, Kcnq1ot1, regulates a domain of up to 750 kb, encompassing 14 genes. We study how the Kcnq1 gene, initially silenced by Kcnq1ot1, undergoes tissue-specific escape from imprinting during development. Specifically, we uncover the role of chromosome conformation during these events. Results We show that Kcnq1 transitions from monoallelic to biallelic expression during mid gestation in the developing heart. This transition is not associated with the loss of methylation on the Kcnq1 promoter. However, by exploiting chromosome conformation capture (3C technology, we find tissue-specific and stage-specific chromatin loops between the Kcnq1 promoter and newly identified DNA regulatory elements. These regulatory elements showed in vitro activity in a luciferase assay and in vivo activity in transgenic embryos. Conclusions By exploring the spatial organization of the Kcnq1 locus, our results reveal a novel mechanism by which local activation of genes can override the regional silencing effects of non-coding RNAs.

  11. Analysis of Copy Number Variation in the Abp Gene Regions of Two House Mouse Subspecies Suggests Divergence during the Gene Family Expansions.

    Science.gov (United States)

    Pezer, Željka; Chung, Amanda G; Karn, Robert C; Laukaitis, Christina M

    2017-06-01

    The Androgen-binding protein ( Abp ) gene region of the mouse genome contains 64 genes, some encoding pheromones that influence assortative mating between mice from different subspecies. Using CNVnator and quantitative PCR, we explored copy number variation in this gene family in natural populations of Mus musculus domesticus ( Mmd ) and Mus musculus musculus ( Mmm ), two subspecies of house mice that form a narrow hybrid zone in Central Europe. We found that copy number variation in the center of the Abp gene region is very common in wild Mmd , primarily representing the presence/absence of the final duplications described for the mouse genome. Clustering of Mmd individuals based on this variation did not reflect their geographical origin, suggesting no population divergence in the Abp gene cluster. However, copy number variation patterns differ substantially between Mmd and other mouse taxa. Large blocks of Abp genes are absent in Mmm , Mus musculus castaneus and an outgroup, Mus spretus , although with differences in variation and breakpoint locations. Our analysis calls into question the reliance on a reference genome for interpreting the detailed organization of genes in taxa more distant from the Mmd reference genome. The polymorphic nature of the gene family expansion in all four taxa suggests that the number of Abp genes, especially in the central gene region, is not critical to the survival and reproduction of the mouse. However, Abp haplotypes of variable length may serve as a source of raw genetic material for new signals influencing reproductive communication and thus speciation of mice. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  12. Altered phenotypic expression of immunoglobulin heavy-chain variable-region (VH) genes in Alicia rabbits probably reflects a small deletion in the VH genes closest to the joining region.

    Science.gov (United States)

    Allegrucci, M; Newman, B A; Young-Cooper, G O; Alexander, C B; Meier, D; Kelus, A S; Mage, R G

    1990-07-01

    Rabbits of the Alicia strain have a mutation (ali) that segregates with the immunoglobulin heavy-chain (lgh) locus and has a cis effect upon the expression of heavy-chain variable-region (VH) genes encoding the a2 allotype. In heterozygous a1/ali or a3/ali rabbits, serum immunoglobulins are almost entirely the products of the normal a1 or a3 allele and only traces of a2 immunoglobulin are detectable. Adult homozygous ali/ali rabbits likewise have normal immunoglobulin levels resulting from increased production of a-negative immunoglobulins and some residual ability to produce the a2 allotype. By contrast, the majority of the immunoglobulins of wild-type a2 rabbits are a2-positive and only a small percentage are a-negative. Genomic DNAs from homozygous mutant and wild-type animals were indistinguishable by Southern analyses using a variety of restriction enzyme digests and lgh probes. However, when digests with infrequently cutting enzymes were analyzed by transverse alternating-field electrophoresis, the ali DNA fragments were 10-15 kilobases smaller than the wild type. These fragments hybridized to probes both for VH and for a region of DNA a few kilobases downstream of the VH genes nearest the joining region. We suggest that this relatively small deletion affects a segment containing 3' VH genes with important regulatory functions, the loss of which leads to the ali phenotype. These results, and the fact that the 3' VH genes rearrange early in B-cell development, indicate that the 3' end of the VH locus probably plays a key role in regulation of VH gene expression.

  13. Reprint of "Two-stage sparse coding of region covariance via Log-Euclidean kernels to detect saliency".

    Science.gov (United States)

    Zhang, Ying-Ying; Yang, Cai; Zhang, Ping

    2017-08-01

    In this paper, we present a novel bottom-up saliency detection algorithm from the perspective of covariance matrices on a Riemannian manifold. Each superpixel is described by a region covariance matrix on Riemannian Manifolds. We carry out a two-stage sparse coding scheme via Log-Euclidean kernels to extract salient objects efficiently. In the first stage, given background dictionary on image borders, sparse coding of each region covariance via Log-Euclidean kernels is performed. The reconstruction error on the background dictionary is regarded as the initial saliency of each superpixel. In the second stage, an improvement of the initial result is achieved by calculating reconstruction errors of the superpixels on foreground dictionary, which is extracted from the first stage saliency map. The sparse coding in the second stage is similar to the first stage, but is able to effectively highlight the salient objects uniformly from the background. Finally, three post-processing methods-highlight-inhibition function, context-based saliency weighting, and the graph cut-are adopted to further refine the saliency map. Experiments on four public benchmark datasets show that the proposed algorithm outperforms the state-of-the-art methods in terms of precision, recall and mean absolute error, and demonstrate the robustness and efficiency of the proposed method. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. Characterization of the porcine TOR1A gene: The first step towards generation of a pig model for dystonia

    DEFF Research Database (Denmark)

    Henriksen, Carina; Madsen, Lone Bruhn; Bendixen, Christian

    2009-01-01

    . The TOR1A gene was demonstrated to be localized on porcine chromosome 1. Single nucleotide polymorphism (SNP) analysis revealed several SNPs in the porcine TOR1A gene, both in the coding region and also in the 3′ UTR region. Overexpression of mutant (Δ∆E303-304) porcine TorsinA in neuroblastoma cells...

  15. Whole-Exome Sequencing of 2,000 Danish Individuals and the Role of Rare Coding Variants in Type 2 Diabetes

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E.; Sparsø, Thomas; Li, Qibin

    2013-01-01

    number of genes. We applied a series of gene-based tests to detect such susceptibility genes. However, no gene showed a significant association with disease risk after we corrected for the number of genes analyzed. Thus, we could reject a model for the genetic architecture of type 2 diabetes where rare......It has been hypothesized that, in aggregate, rare variants in coding regions of genes explain a substantial fraction of the heritability of common diseases. We sequenced the exomes of 1,000 Danish cases with common forms of type 2 diabetes (including body mass index > 27.5 kg/m2 and hypertension...

  16. Integrating Ontological Knowledge and Textual Evidence in Estimating Gene and Gene Product Similarity

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Posse, Christian; Gopalan, Banu; Tratz, Stephen C.; Gregory, Michelle L.

    2006-06-08

    With the rising influence of the Gene On-tology, new approaches have emerged where the similarity between genes or gene products is obtained by comparing Gene Ontology code annotations associ-ated with them. So far, these approaches have solely relied on the knowledge en-coded in the Gene Ontology and the gene annotations associated with the Gene On-tology database. The goal of this paper is to demonstrate that improvements to these approaches can be obtained by integrating textual evidence extracted from relevant biomedical literature.

  17. Cloning of cDNAs coding for the heavy chain region and connecting region of human factor V, a blood coagulation factor with four types of internal repeats

    International Nuclear Information System (INIS)

    Kane, W.H.; Ichinose, A.; Hagen, F.S.; Davie, E.W.

    1987-01-01

    Human factor V is a high molecular weight plasma glycoprotein that participates as a cofactor in the conversion of prothrombin to thrombin by factor X/sub a/. Prior to its participation in the coagulation cascade, factor V is converted to factor V/sub a/ by thrombin generating a heavy chain and a light chain, and these two chains are held together by calcium ions. A connecting region originally located between the heavy and light chains is liberated during the activation reaction. In a previous study, a cDNA of 2970 nucleotides that codes for the carboxyl-terminal 938 amino acids of factor V was isolated and characterized from a Hep G2 cDNA library. This cDNA has been used to obtain additional clones from Hep G2 and human liver cDNA libraries. Furthermore, a Hep G2 cDNA library prepared with an oligonucleotide from the 5' end of these cDNAs was screened to obtain overlapping cDNA clones that code for the amino-terminal region of the molecule. The composite sequence of these clones spans 6911 nucleotides and is consistent with the size of the factor V message present in Hep G2 cells (approximately 7 kilobases). The cDNA codes for a leader sequence of 28 amino acids and a mature protein of 2196 amino acids. The amino acid sequence predicted from the cDNA was in complete agreement with 139 amino acid residues that were identified by Edman degradation of cyanogen bromide peptides isolated from the heavy chain region and connecting region of plasma factor V. The domain structure of human factor V is similar to that previously reported for human coagulation factor VIII. Two types of tandem repeats (17 and 9 amino acids) have also been identified in the connecting region of factor V. The present data indicate that the amino acid sequence in the heavy and light chain regions of factor V is ∼ 40% identical with the corresponding regions of factor VIII

  18. Porcine lung surfactant protein B gene (SFTPB)

    DEFF Research Database (Denmark)

    Cirera Salicio, Susanna; Fredholm, Merete

    2008-01-01

    The porcine surfactant protein B (SFTPB) is a single copy gene on chromosome 3. Three different cDNAs for the SFTPB have been isolated and sequenced. Nucleotide sequence comparison revealed six nonsynonymous single nucleotide polymorphisms (SNPs), four synonymous SNPs and an in-frame deletion of 69...... bp in the region coding for the active protein. Northern analysis showed lung-specific expression of three different isoforms of the SFTPB transcript. The expression level for the SFTPB gene is low in 50 days-old fetus and it increases during lung development. Quantitative real-time polymerase chain...

  19. Population genetic implications from sequence variation in four Y chromosome genes.

    Science.gov (United States)

    Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

    2000-06-20

    Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 x 10(-5), 5. 07 x 10(-5), and 8.54 x 10(-5) for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9. 16 x 10(-5) to 14.2 x 10(-5) for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.

  20. Scarless and sequential gene modification in Pseudomonas using PCR product flanked by short homology regions

    Directory of Open Access Journals (Sweden)

    Liang Rubing

    2010-08-01

    Full Text Available Abstract Background The lambda Red recombination system has been used to inactivate chromosomal genes in various bacteria and fungi. The procedure consists of electroporating a polymerase chain reaction (PCR fragment containing antibiotic cassette flanked by homology regions to the target locus into a strain that can express the lambda Red proteins (Gam, Bet, Exo. Results Here a scarless gene modification strategy based on the Red recombination system has been developed to modify Pseudomonas genome DNA via sequential deletion of multiple targets. This process was mediated by plasmid pRKaraRed encoding the Red proteins regulated by PBAD promoter, which was functional in P. aeruginosa as well as in other bacteria. First the target gene was substituted for the sacB-bla cassette flanked by short homology regions (50 bp, and then this marker gene cassette could be replaced by the PCR fragment flanking itself, generating target-deleted genome without any remnants and no change happened to the surrounding region. Twenty genes involved in the synthesis and regulation pathways of the phenazine derivate, pyocyanin, were modified, including one single-point mutation and deletion of two large operons. The recombination efficiencies ranged from 88% to 98%. Multiple-gene modification was also achieved, generating a triple-gene deletion strain PCA (PAO1, ΔphzHΔphzMΔphzS, which could produce another phenazine derivate, phenazine-1-carboxylic acid (PCA, efficiently and exclusively. Conclusions This lambda Red-based technique can be used to generate scarless and sequential gene modification mutants of P. aeruginosa efficiently, using one-step PCR product flanked by short homology regions. Single-point mutation, scarless deletion of genes can be achieved easily in less than three days. This method may give a new way to construct genetically modified P. aeruginosa strains more efficiently and advance the regulatory network study of this organism.

  1. Kinetics and regional specificity of irinotecan-induced gene expression in the gastrointestinal tract

    International Nuclear Information System (INIS)

    Bowen, Joanne M.; Tsykin, Anna; Stringer, Andrea M.; Logan, Richard M.; Gibson, Rachel J.; Keefe, Dorothy M.K.

    2010-01-01

    Gastrointestinal toxicity remains a significant and dose-limiting complication of cancer treatment. While the pathophysiology is becoming clearer, considerable gaps in the knowledge remain surrounding the timing and site-specific gene changes which occur in response to insult. As such, this study aimed to assess gene expression profiles in a number of regions along the gastrointestinal tract following treatment with the chemotherapy agent, irinotecan, and correlate them with markers of cell death and tissue damage. Data analysis of microarray results found that genes involved in apoptosis, mitogen activated kinase (MAPK) signalling and inflammation were upregulated within 6 h, while genes involved in cell proliferation, wound healing and blood vessel formation were upregulated at later time points up to 72 h. Cell death was significantly increased at 6 and 24 h, and the stomach showed the lowest severity of overt tissue damage. Real time PCR of MAPK signalling pathway genes found that the jejunum and colon had significantly increased expression in a number of genes at 72 h, where as the stomach was unchanged. These results indicate that overall severity of tissue damage may be determined by precisely timed target gene responses specific to each region. Therapeutic targeting of key gene responses at the appropriate time point may prove to be effective for prevention of chemotherapy-induced gastrointestinal damage.

  2. Label-free detection of sex determining region Y (SRY) via capacitive biosensor

    KAUST Repository

    Sivashankar, Shilpa; Sapsanis, Christos; Agambayev, Sumeyra; Buttner, Ulrich; Salama, Khaled N.

    2016-01-01

    In this work, we present for the first time, the use of a simple fractal capacitive biosensor for the quantification and detection of sex-determining region Y (SRY) genes. This section of genetic code, which is found on the Y chromosome, finds

  3. Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans.

    Science.gov (United States)

    Gottlieb, Assaf; Daneshjou, Roxana; DeGorter, Marianne; Bourgeois, Stephane; Svensson, Peter J; Wadelius, Mia; Deloukas, Panos; Montgomery, Stephen B; Altman, Russ B

    2017-11-24

    Genome-wide association studies are useful for discovering genotype-phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into "gene level" effects. Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression-on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort.

  4. Effects of nickel treatment on H3K4 trimethylation and gene expression.

    Directory of Open Access Journals (Sweden)

    Kam-Meng Tchou-Wong

    Full Text Available Occupational exposure to nickel compounds has been associated with lung and nasal cancers. We have previously shown that exposure of the human lung adenocarcinoma A549 cells to NiCl(2 for 24 hr significantly increased global levels of trimethylated H3K4 (H3K4me3, a transcriptional activating mark that maps to the promoters of transcribed genes. To further understand the potential epigenetic mechanism(s underlying nickel carcinogenesis, we performed genome-wide mapping of H3K4me3 by chromatin immunoprecipitation and direct genome sequencing (ChIP-seq and correlated with transcriptome genome-wide mapping of RNA transcripts by massive parallel sequencing of cDNA (RNA-seq. The effect of NiCl(2 treatment on H3K4me3 peaks within 5,000 bp of transcription start sites (TSSs on a set of genes highly induced by nickel in both A549 cells and human peripheral blood mononuclear cells were analyzed. Nickel exposure increased the level of H3K4 trimethylation in both the promoters and coding regions of several genes including CA9 and NDRG1 that were increased in expression in A549 cells. We have also compared the extent of the H3K4 trimethylation in the absence and presence of formaldehyde crosslinking and observed that crosslinking of chromatin was required to observe H3K4 trimethylation in the coding regions immediately downstream of TSSs of some nickel-induced genes including ADM and IGFBP3. This is the first genome-wide mapping of trimethylated H3K4 in the promoter and coding regions of genes induced after exposure to NiCl(2. This study may provide insights into the epigenetic mechanism(s underlying the carcinogenicity of nickel compounds.

  5. First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

    Science.gov (United States)

    Chen, Zhi-Teng; Du, Yu-Zhou

    2017-05-05

    The complete mitochondrial genome (mitogenome) of Nemoura nankinensis (Plecoptera: Nemouridae) was sequenced as the first reported mitogenome from the family Nemouridae. The N. nankinensis mitogenome was the longest (16,602 bp) among reported plecopteran mitogenomes, and it contains 37 genes including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. Most PCGs used standard ATN as start codons, and TAN as termination codons. All tRNA genes of N. nankinensis could fold into the cloverleaf secondary structures except for trnSer ( AGN ), whose dihydrouridine (DHU) arm was reduced to a small loop. There was also a large non-coding region (control region, CR) in the N. nankinensis mitogenome. The 1751 bp CR was the longest and had the highest A+T content (81.8%) among stoneflies. A large tandem repeat region, five potential stem-loop (SL) structures, four tRNA-like structures and four conserved sequence blocks (CSBs) were detected in the elongated CR. The presence of these tRNA-like structures in the CR has never been reported in other plecopteran mitogenomes. These novel features of the elongated CR in N. nankinensis may have functions associated with the process of replication and transcription. Finally, phylogenetic reconstruction suggested that Nemouridae was the sister-group of Capniidae.

  6. Characterization of a gene from the EDM1-PSACH region of human chromosome 19p

    Energy Technology Data Exchange (ETDEWEB)

    Lennon, G.G.; Giorgi, D.; Martin, J.R. [Lawrence Livermore National Lab., CA (United States)] [and others

    1994-09-01

    Genetic linkage mapping has indicated that both multiple epiphyseal dysplasia (EDM1), a dominantly inherited chondrodysplasia, and pseudoachondroplasia (PSACH), a skeletal disorder associated with dwarfism, map to a 2-3 Mb region of human chromosome 19p. We have isolated a partial cDNA from this region using hybrid selection, and report on progress towards the characterization of the genomic structure and transcription of the corresponding gene. Sequence analysis of the cDNA to date indicates that this gene is likely to be expressed within extracellular matrix tissues. Defects in this gene or neighboring gene family members may therefore lead to EDM1, PSACH, or other connective tissue and skeletal disorders.

  7. An operon from Lactobacillus helveticus composed of a proline iminopeptidase gene (pepI) and two genes coding for putative members of the ABC transporter family of proteins.

    Science.gov (United States)

    Varmanen, P; Rantanen, T; Palva, A

    1996-12-01

    A proline iminopeptidase gene (pepI) of an industrial Lactobacillus helveticus strain was cloned and found to be organized in an operon-like structure of three open reading frames (ORF1, ORF2 and ORF3). ORF1 was preceded by a typical prokaryotic promoter region, and a putative transcription terminator was found downstream of ORF3, identified as the pepI gene. Using primer-extension analyses, only one transcription start site, upstream of ORF1, was identifiable in the predicted operon. Although the size of mRNA could not be judged by Northern analysis either with ORF1-, ORF2- or pepI-specific probes, reverse transcription-PCR analyses further supported the operon structure of the three genes. ORF1, ORF2 and ORF3 had coding capacities for 50.7, 24.5 and 33.8 kDa proteins, respectively. The ORF3-encoded PepI protein showed 65% identity with the PepI proteins from Lactobacillus delbrueckii subsp. bulgaricus and Lactobacillus delbrueckii subsp. lactis. The ORF1-encoded protein had significant homology with several members of the ABC transporter family but, with two distinct putative ATP-binding sites, it would represent an unusual type among the bacterial ABC transporters. ORF2 encoded a putative integral membrane protein also characteristic of the ABC transporter family. The pepI gene was overexpressed in Escherichia coli. Purified PepI hydrolysed only di and tripeptides with proline in the first position. Optimum PepI activity was observed at pH 7.5 and 40 degrees C. A gel filtration analysis indicated that PepI is a dimer of M(r) 53,000. PepI was shown to be a metal-independent serine peptidase having thiol groups at or near the active site. Kinetic studies with proline-p-nitroanilide as substrate revealed Km and Vmax values of 0.8 mM and 350 mmol min-1 mg-1, respectively, and a very high turnover number of 135,000 s-1.

  8. Characterisation of silent and active genes for a variable large protein of Borrelia recurrentis

    Directory of Open Access Journals (Sweden)

    Scragg Ian G

    2002-10-01

    Full Text Available Abstract Background We report the characterisation of the variable large protein (vlp gene expressed by clinical isolate A1 of Borrelia recurrentis; the agent of the life-threatening disease louse-borne relapsing fever. Methods The major vlp protein of this isolate was characterised and a DNA probe created. Use of this together with standard molecular methods was used to determine the location of the vlp1B. recurrentis A1 gene in both this and other isolates. Results This isolate was found to carry silent and expressed copies of the vlp1B. recurrentis A1 gene on plasmids of 54 kbp and 24 kbp respectively, whereas a different isolate, A17, had only the silent vlp1B. recurrentis A17 on a 54 kbp plasmid. Silent and expressed vlp1 have identical mature protein coding regions but have different 5' regions, both containing different potential lipoprotein leader sequences. Only one form of vlp1 is transcribed in the A1 isolate of B. recurrentis, yet both 5' upstream sequences of this vlp1 gene possess features of bacterial promoters. Conclusion Taken together these results suggest that antigenic variation in B. recurrentis may result from recombination of variable large and small protein genes at the junction between lipoprotein leader sequence and mature protein coding region. However, this hypothetical model needs to be validated by further identification of expressed and silent variant protein genes in other B. recurrentis isolates.

  9. On fuzzy semantic similarity measure for DNA coding.

    Science.gov (United States)

    Ahmad, Muneer; Jung, Low Tang; Bhuiyan, Md Al-Amin

    2016-02-01

    A coding measure scheme numerically translates the DNA sequence to a time domain signal for protein coding regions identification. A number of coding measure schemes based on numerology, geometry, fixed mapping, statistical characteristics and chemical attributes of nucleotides have been proposed in recent decades. Such coding measure schemes lack the biologically meaningful aspects of nucleotide data and hence do not significantly discriminate coding regions from non-coding regions. This paper presents a novel fuzzy semantic similarity measure (FSSM) coding scheme centering on FSSM codons׳ clustering and genetic code context of nucleotides. Certain natural characteristics of nucleotides i.e. appearance as a unique combination of triplets, preserving special structure and occurrence, and ability to own and share density distributions in codons have been exploited in FSSM. The nucleotides׳ fuzzy behaviors, semantic similarities and defuzzification based on the center of gravity of nucleotides revealed a strong correlation between nucleotides in codons. The proposed FSSM coding scheme attains a significant enhancement in coding regions identification i.e. 36-133% as compared to other existing coding measure schemes tested over more than 250 benchmarked and randomly taken DNA datasets of different organisms. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Aberrant DNA methylation in 5'regions of DNA methyltransferase genes in aborted bovine clones

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    High rate of abortion and developmental abnormalities is thought to be closely associated with inefficient epigenetic reprogramming of the transplanted nuclei during bovine cloning.It is known that one of the important mechanisms for epigenetic reprogramming is DNA methylation.DNA methylation is established and maintained by DNA methyltransferases(DNMTs),therefore,it is postulated that the inefficient epigenetic reprogramming of transplanted nuclei may be due to abnormal expression of DNMTs.Since DNA methylation can strongly inhibit gene expression,aberrant DNA methylation of DNMT genes may disturb gene expression.But presently,it is not clear whether the methylation abnormality of DNMT genes is related to developmental failure of somatic cell nuclear transfer embryos.In our study,we analyzed methylation patterns of the 5' regions of four DNMT genes including Dnmt3a,Dnmt3b,Dnmtl and Dnmt2 in four aborted bovine clones.Using bisulfite sequencing method,we found that 3 out of 4 aborted bovine clones(AF1,AF2 and AF3)showed either hypermethylation or hypomethylation in the 5' regions of Dnmt3a and Dnmt3b.indicating that Dnmt3a and Dnmt3b genes are not properly reprogrammed.However,the individual AF4 exhibited similar methylation level and pattern to age-matched in vitro fertilized (IVF)fetuses.Besides,we found that tle 5'regions of Dnmtl and Dnmt2 were nearly completely unmethylated in all normal adults.IVF fetuses,sperm and aborted clones.Together,our results suggest that the aberrant methylation of Dnmt3a and Dnmt3b 5' regions is probably associated with the high abortion of bovine clones.

  11. Sequencing the GRHL3 Coding Region Reveals Rare Truncating Mutations and a Common Susceptibility Variant for Nonsyndromic Cleft Palate

    Science.gov (United States)

    Mangold, Elisabeth; Böhmer, Anne C.; Ishorst, Nina; Hoebel, Ann-Kathrin; Gültepe, Pinar; Schuenke, Hannah; Klamt, Johanna; Hofmann, Andrea; Gölz, Lina; Raff, Ruth; Tessmann, Peter; Nowak, Stefanie; Reutter, Heiko; Hemprich, Alexander; Kreusch, Thomas; Kramer, Franz-Josef; Braumann, Bert; Reich, Rudolf; Schmidt, Gül; Jäger, Andreas; Reiter, Rudolf; Brosch, Sibylle; Stavusis, Janis; Ishida, Miho; Seselgyte, Rimante; Moore, Gudrun E.; Nöthen, Markus M.; Borck, Guntram; Aldhorae, Khalid A.; Lace, Baiba; Stanier, Philip; Knapp, Michael; Ludwig, Kerstin U.

    2016-01-01

    Nonsyndromic cleft lip with/without cleft palate (nsCL/P) and nonsyndromic cleft palate only (nsCPO) are the most frequent subphenotypes of orofacial clefts. A common syndromic form of orofacial clefting is Van der Woude syndrome (VWS) where individuals have CL/P or CPO, often but not always associated with lower lip pits. Recently, ∼5% of VWS-affected individuals were identified with mutations in the grainy head-like 3 gene (GRHL3). To investigate GRHL3 in nonsyndromic clefting, we sequenced its coding region in 576 Europeans with nsCL/P and 96 with nsCPO. Most strikingly, nsCPO-affected individuals had a higher minor allele frequency for rs41268753 (0.099) than control subjects (0.049; p = 1.24 × 10−2). This association was replicated in nsCPO/control cohorts from Latvia, Yemen, and the UK (pcombined = 2.63 × 10−5; ORallelic = 2.46 [95% CI 1.6–3.7]) and reached genome-wide significance in combination with imputed data from a GWAS in nsCPO triads (p = 2.73 × 10−9). Notably, rs41268753 is not associated with nsCL/P (p = 0.45). rs41268753 encodes the highly conserved p.Thr454Met (c.1361C>T) (GERP = 5.3), which prediction programs denote as deleterious, has a CADD score of 29.6, and increases protein binding capacity in silico. Sequencing also revealed four novel truncating GRHL3 mutations including two that were de novo in four families, where all nine individuals harboring mutations had nsCPO. This is important for genetic counseling: given that VWS is rare compared to nsCPO, our data suggest that dominant GRHL3 mutations are more likely to cause nonsyndromic than syndromic CPO. Thus, with rare dominant mutations and a common risk variant in the coding region, we have identified an important contribution for GRHL3 in nsCPO. PMID:27018475

  12. Signalign: An Ontology of DNA as Signal for Comparative Gene Structure Prediction Using Information-Coding-and-Processing Techniques.

    Science.gov (United States)

    Yu, Ning; Guo, Xuan; Gu, Feng; Pan, Yi

    2016-03-01

    Conventional character-analysis-based techniques in genome analysis manifest three main shortcomings-inefficiency, inflexibility, and incompatibility. In our previous research, a general framework, called DNA As X was proposed for character-analysis-free techniques to overcome these shortcomings, where X is the intermediates, such as digit, code, signal, vector, tree, graph network, and so on. In this paper, we further implement an ontology of DNA As Signal, by designing a tool named Signalign for comparative gene structure analysis, in which DNA sequences are converted into signal series, processed by modified method of dynamic time warping and measured by signal-to-noise ratio (SNR). The ontology of DNA As Signal integrates the principles and concepts of other disciplines including information coding theory and signal processing into sequence analysis and processing. Comparing with conventional character-analysis-based methods, Signalign can not only have the equivalent or superior performance, but also enrich the tools and the knowledge library of computational biology by extending the domain from character/string to diverse areas. The evaluation results validate the success of the character-analysis-free technique for improved performances in comparative gene structure prediction.

  13. Genetic analysis and gene mapping of a low stigma exposed mutant gene by high-throughput sequencing.

    Directory of Open Access Journals (Sweden)

    Xiao Ma

    Full Text Available Rice is one of the main food crops and several studies have examined the molecular mechanism of the exposure of the rice plant stigma. The improvement in the exposure of the stigma in female parent hybrid combinations can enhance the efficiency of hybrid breeding. In the present study, a mutant plant with low exposed stigma (lesr was discovered among the descendants of the indica thermo-sensitive sterile line 115S. The ES% rate of the mutant decreased by 70.64% compared with the wild type variety. The F2 population was established by genetic analysis considering the mutant as the female parent and the restorer line 93S as the male parent. The results indicated a normal F1 population, while a clear division was noted for the high and low exposed stigma groups, respectively. This process was possible only by a ES of 25% in the F2 population. This was in agreement with the ratio of 3:1, which indicated that the mutant was controlled by a recessive main-effect QTL locus, temporarily named as LESR. Genome-wide comparison of the SNP profiles between the early, high and low production bulks were constructed from F2 plants using bulked segregant analysis in combination with high-throughput sequencing technology. The results demonstrated that the candidate loci was located on the chromosome 10 of the rice. Following screening of the recombinant rice plants with newly developed molecular markers, the genetic region was narrowed down to 0.25 Mb. This region was flanked by InDel-2 and InDel-2 at the physical location from 13.69 to 13.94 Mb. Within this region, 7 genes indicated base differences between parents. A total of 2 genes exhibited differences at the coding region and upstream of the coding region, respectively. The present study aimed to further clone the LESR gene, verify its function and identify the stigma variation.

  14. Efficient CRISPR/Cas9-Mediated Versatile, Predictable, and Donor-Free Gene Knockout in Human Pluripotent Stem Cells.

    Science.gov (United States)

    Liu, Zhongliang; Hui, Yi; Shi, Lei; Chen, Zhenyu; Xu, Xiangjie; Chi, Liankai; Fan, Beibei; Fang, Yujiang; Liu, Yang; Ma, Lin; Wang, Yiran; Xiao, Lei; Zhang, Quanbin; Jin, Guohua; Liu, Ling; Zhang, Xiaoqing

    2016-09-13

    Loss-of-function studies in human pluripotent stem cells (hPSCs) require efficient methodologies for lesion of genes of interest. Here, we introduce a donor-free paired gRNA-guided CRISPR/Cas9 knockout strategy (paired-KO) for efficient and rapid gene ablation in hPSCs. Through paired-KO, we succeeded in targeting all genes of interest with high biallelic targeting efficiencies. More importantly, during paired-KO, the cleaved DNA was repaired mostly through direct end joining without insertions/deletions (precise ligation), and thus makes the lesion product predictable. The paired-KO remained highly efficient for one-step targeting of multiple genes and was also efficient for targeting of microRNA, while for long non-coding RNA over 8 kb, cleavage of a short fragment of the core promoter region was sufficient to eradicate downstream gene transcription. This work suggests that the paired-KO strategy is a simple and robust system for loss-of-function studies for both coding and non-coding genes in hPSCs. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

  15. Mutations in the S gene region of hepatitis B virus genotype D in ...

    Indian Academy of Sciences (India)

    The gene region of the hepatitis B virus (HBV) is responsible for the expression of surface antigens and includes the 'a'-determinant region. Thus, mutation(s) in this region would afford HBV variants a distinct survival advantage, permitting the mutant virus to escape from the immune system. The aim of this study was to ...

  16. Physical linkage of a human immunoglobulin heavy chain variable region gene segment to diversity and joining region elements

    International Nuclear Information System (INIS)

    Schroeder, H.W. Jr.; Walter, M.A.; Hofker, M.H.; Ebens, A.; Van Dijk, K.W.; Liao, L.C.; Cox, D.W.; Milner, E.C.B.; Perlmutter, R.M.

    1988-01-01

    Antibody genes are assembled from a series of germ-line gene segments that are juxtaposed during the maturation of B lymphocytes. Although diversification of the adult antibody repertoire results in large part from the combinatorial joining of these gene segments, a restricted set of antibody heavy chain variable (V H ), diversity (D H ), and joining (J H ) region gene segments appears preferentially in the human fetal repertoire. The authors report here that one of these early-expressed V H elements (termed V H 6) is the most 3' V H gene segment, positioned 77 kilobases on the 5' side of the J H locus and immediately adjacent to a set of previously described D H sequences. In addition to providing a physical map linking human V H , D H , and J H elements, these results support the view that the programmed development of the antibody V H repertoire is determined in part by the chromosomal position of these gene segments

  17. Regional brain metabolite abnormalities in inherited prion disease and asymptomatic gene carriers demonstrated in vivo by quantitative proton magnetic resonance spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Waldman, A.D.; Cordery, R.J.; Godbolt, A.; Rossor, M.N. [University College London, Dementia Research Group, Department of Neurodegenerative Disease, Institute of Neurology, London (United Kingdom); Imperial College of Science, Technology and Medicine, Division of Neuroscience and Psychological Medicine, Faculty of Medicine, London (United Kingdom); MacManus, D.G. [University College London, NMR Research Unit, Department of Clinical Neurology, Institute of Neurology, London (United Kingdom); Collinge, J. [University College London, MRC Prion Unit, Department of Neurodegenerative Disease, Institute of Neurology, London (United Kingdom)

    2006-06-15

    Inherited prion diseases are caused by mutations in the gene which codes for prion protein (PrP), leading to proliferation of abnormal PrP isomers in the brain and neurodegeneration; they include Gerstmann-Straeussler-Scheinker disease (GSS), fatal familial insomnia (FFI) and familial Creutzfeldt-Jakob disease (fCJD). We studied two patients with symptomatic inherited prion disease (P102L) and two pre-symptomatic P102L gene carriers using quantitative magnetic resonance spectroscopy (MRS). Short echo time spectra were acquired from the thalamus, caudate region and frontal white matter, metabolite levels and ratios were measured and z-scores calculated for individual patients relative to age-matched normal controls. MRS data were compared with structural magnetic resonance imaging. One fCJD case had generalised atrophy and showed increased levels of myo-inositol (MI) in the thalamus (z=3.7). The other had decreased levels of N-acetylaspartate (z=4) and diffuse signal abnormality in the frontal white matter. Both asymptomatic gene carriers had normal imaging, but increased frontal white matter MI (z=4.3, 4.1), and one also had increased MI in the caudate (z=5.3). Isolated MI abnormalities in asymptomatic gene carriers are a novel finding and may reflect early glial proliferation, prior to significant neuronal damage. MRS provides potential non-invasive surrogate markers of early disease and progression in inherited prion disease. (orig.)

  18. Capture Hi-C identifies a novel causal gene, IL20RA, in the pan-autoimmune genetic susceptibility region 6q23.

    Science.gov (United States)

    McGovern, Amanda; Schoenfelder, Stefan; Martin, Paul; Massey, Jonathan; Duffus, Kate; Plant, Darren; Yarwood, Annie; Pratt, Arthur G; Anderson, Amy E; Isaacs, John D; Diboll, Julie; Thalayasingam, Nishanthi; Ospelt, Caroline; Barton, Anne; Worthington, Jane; Fraser, Peter; Eyre, Stephen; Orozco, Gisela

    2016-11-01

    The identification of causal genes from genome-wide association studies (GWAS) is the next important step for the translation of genetic findings into biologically meaningful mechanisms of disease and potential therapeutic targets. Using novel chromatin interaction detection techniques and allele specific assays in T and B cell lines, we provide compelling evidence that redefines causal genes at the 6q23 locus, one of the most important loci that confers autoimmunity risk. Although the function of disease-associated non-coding single nucleotide polymorphisms (SNPs) at 6q23 is unknown, the association is generally assigned to TNFAIP3, the closest gene. However, the DNA fragment containing the associated SNPs interacts through chromatin looping not only with TNFAIP3, but also with IL20RA, located 680 kb upstream. The risk allele of the most likely causal SNP, rs6927172, is correlated with both a higher frequency of interactions and increased expression of IL20RA, along with a stronger binding of both the NFκB transcription factor and chromatin marks characteristic of active enhancers in T-cells. Our results highlight the importance of gene assignment for translating GWAS findings into biologically meaningful mechanisms of disease and potential therapeutic targets; indeed, monoclonal antibody therapy targeting IL-20 is effective in the treatment of rheumatoid arthritis and psoriasis, both with strong GWAS associations to this region.

  19. Regional brain metabolite abnormalities in inherited prion disease and asymptomatic gene carriers demonstrated in vivo by quantitative proton magnetic resonance spectroscopy

    International Nuclear Information System (INIS)

    Waldman, A.D.; Cordery, R.J.; Godbolt, A.; Rossor, M.N.; MacManus, D.G.; Collinge, J.

    2006-01-01

    Inherited prion diseases are caused by mutations in the gene which codes for prion protein (PrP), leading to proliferation of abnormal PrP isomers in the brain and neurodegeneration; they include Gerstmann-Straeussler-Scheinker disease (GSS), fatal familial insomnia (FFI) and familial Creutzfeldt-Jakob disease (fCJD). We studied two patients with symptomatic inherited prion disease (P102L) and two pre-symptomatic P102L gene carriers using quantitative magnetic resonance spectroscopy (MRS). Short echo time spectra were acquired from the thalamus, caudate region and frontal white matter, metabolite levels and ratios were measured and z-scores calculated for individual patients relative to age-matched normal controls. MRS data were compared with structural magnetic resonance imaging. One fCJD case had generalised atrophy and showed increased levels of myo-inositol (MI) in the thalamus (z=3.7). The other had decreased levels of N-acetylaspartate (z=4) and diffuse signal abnormality in the frontal white matter. Both asymptomatic gene carriers had normal imaging, but increased frontal white matter MI (z=4.3, 4.1), and one also had increased MI in the caudate (z=5.3). Isolated MI abnormalities in asymptomatic gene carriers are a novel finding and may reflect early glial proliferation, prior to significant neuronal damage. MRS provides potential non-invasive surrogate markers of early disease and progression in inherited prion disease. (orig.)

  20. [Identification of Clonorchis sinensis metacercariae based on PCR targeting ribosomal DNA ITS regions and COX1 gene].

    Science.gov (United States)

    Yang, Qing-Li; Shen, Ji-Qing; Jiang, Zhi-Hua; Yang, Yi-Chao; Li, Hong-Mei; Chen, Ying-Dan; Zhou, Xiao-Nong

    2014-06-01

    To identify Clonorchis sinensis metacercariae using PCR targeting ribosomal DNA ITS region and COX1 gene. Pseudorasbora parva were collected from Hengxian County of Guangxi at the end of May 2013. Single metacercaria of C. sinensis and other trematodes were separated from muscle tissue of P. parva by digestion method. Primers targeting ribosomal DNA ITS region and COX1 gene of C. sinensis were designed for PCR and the universal primers were used as control. The sensitivity and specificity of the PCR detection were analyzed. C. sinensis metacercariae at different stages were identified by PCR. DNA from single C. sinensis metacercaria was detected by PCR targeting ribosomal DNA ITS region and COX1 gene. The specific amplicans have sizes of 437/549, 156/249 and 195/166 bp, respectively. The ratio of the two positive numbers in PCR with universal primers and specific primers targeting C. sinensis ribosomal DNA ITS1 and ITS2 regions was 0.905 and 0.952, respectively. The target gene fragments were amplified by PCR using COX1 gene-specific primers. The PCR with specific primers did not show any non-specific amplification. However, the PCR with universal primers targeting ribosomal DNA ITS regions performed serious non-specific amplification. C. sinensis metacercariae at different stages are identified by morphological observation and PCR method. Species-specific primers targeting ribosomal DNA ITS region show higher sensitivity and specificity than the universal primers. PCR targeting COX1 gene shows similar sensitivity and specificity to PCR with specific primers targeting ribosomal DNA ITS regions.

  1. Gene expression and adaptive noncoding changes during human evolution.

    Science.gov (United States)

    Babbitt, Courtney C; Haygood, Ralph; Nielsen, William J; Wray, Gregory A

    2017-06-05

    Despite evidence for adaptive changes in both gene expression and non-protein-coding, putatively regulatory regions of the genome during human evolution, the relationship between gene expression and adaptive changes in cis-regulatory regions remains unclear. Here we present new measurements of gene expression in five tissues of humans and chimpanzees, and use them to assess this relationship. We then compare our results with previous studies of adaptive noncoding changes, analyzing correlations at the level of gene ontology groups, in order to gain statistical power to detect correlations. Consistent with previous studies, we find little correlation between gene expression and adaptive noncoding changes at the level of individual genes; however, we do find significant correlations at the level of biological function ontology groups. The types of function include processes regulated by specific transcription factors, responses to genetic or chemical perturbations, and differentiation of cell types within the immune system. Among functional categories co-enriched with both differential expression and noncoding adaptation, prominent themes include cancer, particularly epithelial cancers, and neural development and function.

  2. Changes is genes coding for laccases 1 and 2 may contribute to deformation and reduction of wings in apollo butterfly (Parnassius apollo, Lepidoptera: Papilionidae) from the isolated population in Pieniny National Park (Poland).

    Science.gov (United States)

    Łukasiewicz, Kinga; Węgrzyn, Grzegorz

    2016-01-01

    An isolated population of apollo butterfly (Parnassius apollo, Lepidoptera: Papilionidae) occurs in Pieniny National Park (Poland). Deformations and reductions of wings in a relatively large number of individuals from this population is found, yet the reasons for these defects are unknown. During studies devoted to identify cause(s) of this phenomenon, we found that specific regions of genes coding of enzymes laccases 1 and 2 could not be amplified from DNA samples isolated from large fractions of malformed insects while expected PCR products were detected in almost all (with one exception) normal butterflies. Laccases (p-diphenol:dioxygen oxidoreductases) are oxidases containing several copper atoms. They catalyse single-electron oxidations of phenolic or other compounds with concomitant reduction of oxygen to water. In insects, their enzymatic activities were found previously in epidermis, midgut, Malpighian tubules, salivary glands, and reproductive tissues. Therefore, we suggest that defects in genes coding for laccases might contribute to deformation and reduction of wings in apollo butterflies, though it seems obvious that deficiency in these enzymes could not be the sole cause of these developmental improperties in P. apollo from Pieniny National Park.

  3. The CAZyome of Phytophthora spp.: A comprehensive analysis of the gene complement coding for carbohydrate-active enzymes in species of the genus Phytophthora

    Directory of Open Access Journals (Sweden)

    Laird Emma W

    2010-09-01

    Full Text Available Abstract Background Enzymes involved in carbohydrate metabolism include Carbohydrate esterases (CE, Glycoside hydrolases (GH, Glycosyl transferases (GT, and Polysaccharide lyases (PL, commonly referred to as carbohydrate-active enzymes (CAZymes. The CE, GH, and PL superfamilies are also known as cell wall degrading enzymes (CWDE due to their role in the disintegration of the plant cell wall by bacterial and fungal pathogens. In Phytophthora infestans, penetration of the plant cells occurs through a specialized hyphal structure called appressorium; however, it is likely that members of the genus Phytophthora also use CWDE for invasive growth because hyphal forces are below the level of tensile strength exhibited by the plant cell wall. Because information regarding the frequency and distribution of CAZyme coding genes in Phytophthora is currently unknown, we have scanned the genomes of P. infestans, P. sojae, and P. ramorum for the presence of CAZyme-coding genes using a homology-based approach and compared the gene collinearity in the three genomes. In addition, we have tested the expression of several genes coding for CE in cultures grown in vitro. Results We have found that P. infestans, P. sojae and P. ramorum contain a total of 435, 379, and 310 CAZy homologs; in each genome, most homologs belong to the GH superfamily. Most GH and PL homologs code for enzymes that hydrolyze substances present in the pectin layer forming the middle lamella of the plant cells. In addition, a significant number of CE homologs catalyzing the deacetylation of compounds characteristic of the plant cell cuticle were found. In general, a high degree of gene location conservation was observed, as indicated by the presence of sequential orthologous pairs in the three genomes. Such collinearity was frequently observed among members of the GH superfamily. On the other hand, the CE and PL superfamilies showed less collinearity for some of their putative members

  4. Single-nucleotide variations in the genes encoding the mitochondrial Hsp60/Hsp10 chaperone system and their disease-causing potential

    DEFF Research Database (Denmark)

    Bross, Peter; Li, Zhijie; Hansen, Jakob

    2007-01-01

    for variations in the HSPD1 and HSPE1 genes encoding the mitochondrial Hsp60/Hsp10 chaperone complex: two patients with multiple mitochondrial enzyme deficiency, 61 sudden infant death syndrome cases (MIM: #272120), and 60 patients presenting with ethylmalonic aciduria carrying non-synonymous susceptibility...... variations in the ACADS gene (MIM: *606885 and #201470). Besides previously reported variations we detected six novel variations: two in the bidirectional promoter region, and one synonymous and three non-synonymous variations in the HSPD1 coding region. One of the non-synonymous variations was polymorphic...... in patient and control samples, and the rare variations were each only found in single patients and absent in 100 control chromosomes. Functional investigation of the effects of the variations in the promoter region and the non-synonymous variations in the coding region indicated that none of them had...

  5. Comprehensive search for intra- and inter-specific sequence polymorphisms among coding envelope genes of retroviral origin found in the human genome: genes and pseudogenes

    Directory of Open Access Journals (Sweden)

    Vasilescu Alexandre

    2005-09-01

    Full Text Available Abstract Background The human genome carries a high load of proviral-like sequences, called Human Endogenous Retroviruses (HERVs, which are the genomic traces of ancient infections by active retroviruses. These elements are in most cases defective, but open reading frames can still be found for the retroviral envelope gene, with sixteen such genes identified so far. Several of them are conserved during primate evolution, having possibly been co-opted by their host for a physiological role. Results To characterize further their status, we presently sequenced 12 of these genes from a panel of 91 Caucasian individuals. Genomic analyses reveal strong sequence conservation (only two non synonymous Single Nucleotide Polymorphisms [SNPs] for the two HERV-W and HERV-FRD envelope genes, i.e. for the two genes specifically expressed in the placenta and possibly involved in syncytiotrophoblast formation. We further show – using an ex vivo fusion assay for each allelic form – that none of these SNPs impairs the fusogenic function. The other envelope proteins disclose variable polymorphisms, with the occurrence of a stop codon and/or frameshift for most – but not all – of them. Moreover, the sequence conservation analysis of the orthologous genes that can be found in primates shows that three env genes have been maintained in a fully coding state throughout evolution including envW and envFRD. Conclusion Altogether, the present study strongly suggests that some but not all envelope encoding sequences are bona fide genes. It also provides new tools to elucidate the possible role of endogenous envelope proteins as susceptibility factors in a number of pathologies where HERVs have been suspected to be involved.

  6. The SHOX region and its mutations.

    Science.gov (United States)

    Capone, L; Iughetti, L; Sabatini, S; Bacciaglia, A; Forabosco, A

    2010-06-01

    The short stature homeobox-containing (SHOX) gene lies in the pseudoautosomal region 1 (PAR1) that comprises 2.6 Mb of the short-arm tips of both the X and Y chromosomes. It is known that its heterozygous mutations cause Leri-Weill dyschondrosteosis (LWD) (OMIM #127300), while its homozygous mutations cause a severe form of dwarfism known as Langer mesomelic dysplasia (LMD) (OMIM #249700). The analysis of 238 LWD patients between 1998 and 2007 by multiple authors shows a prevalence of deletions (46.4%) compared to point mutations (21.2%). On the whole, deletions and point mutations account for about 67% of LWD patients. SHOX is located within a 1000 kb desert region without genes. The comparative genomic analysis of this region between genomes of different vertebrates has led to the identification of evolutionarily conserved non-coding DNA elements (CNE). Further functional studies have shown that one of these CNE downstream of the SHOX gene is necessary for the expression of SHOX; this is considered to be typical "enhancer" activity. Including the enhancer, the overall mutation of the SHOX region in LWD patients does not hold in 100% of cases. Various authors have demonstrated the existence of other CNE both downstream and upstream of SHOX regions. The resulting conclusion is that it is necessary to reanalyze all LWD/LMD patients without SHOX mutations for the presence of mutations in the 5'- and 3'-flanking SHOX regions.

  7. RNA-Seq analysis of D. radiodurans find non coding RNAs expressed in response to radiation stress

    International Nuclear Information System (INIS)

    Gadewal, Nikhil; Mukhopadhyaya, Rita

    2015-01-01

    In bacteria discovery of functional RNA molecules that are not translated into protein, noncoding RNAs, became possible with advent of Next Generation Sequencing technology. Bacterial non coding RNAs are typically 50-300 nucleotides long and work as internal signals controlling various levels of gene expression. Deep sequencing of total cellular RNA captures all coding and noncoding transcripts with their differential levels of expression in the transcriptome. It provides a powerful approach to study bacterial gene expression and mechanisms of gene regulation. We subjected the 3 h transcriptome of Deinococcus radiodurans R1 cells post exposure to 6 KGy gamma radiation to 100 x 2 cycles of deep sequencing on the Illumina HiSeq 2000 to look for ncRNA transcripts. Bioinformatics pipeline for analysis and interpretation of RNA Seq data was done in house using Softwares available in public domains. Our sequence data aligned with 21 putative ncRNAs expressed in the intergenic regions of annotated genome of D radiodurans. Verification of 2 ncRNA candidates and 3 transcription factor genes by Real Time PCR confirmed presence of these transcripts in the 3 h transcriptome sequenced by us. Any relationship between ncRNAs and control of radiation induced gene expression in D radiodurans can be proved only after specific gene knock outs in future. (author)

  8. Mutational analysis of the PITX2 coding region revealed no common cause for transposition of the great arteries (dTGA

    Directory of Open Access Journals (Sweden)

    Goldmuntz Elizabeth

    2005-05-01

    Full Text Available Abstract Background PITX2 is a bicoid-related homeodomain transcription factor that plays an important role in asymmetric cardiogenesis. Loss of function experiments in mice cause severe heart malformations, including transposition of the great arteries (TGA. TGA accounts for 5–7% of all congenital heart diseases affecting 0.2 per 1000 live births, thereby representing the most frequent cyanotic heart defect diagnosed in the neonatal period. Methods To address whether altered PITX2 function could also contribute to the formation of dTGA in humans, we screened 96 patients with dTGA by means of dHPLC and direct sequencing for mutations within the PITX2 gene. Results Several SNPs could be detected, but no stop or frame shift mutation. In particular, we found seven intronic and UTR variants, two silent mutations and two polymorphisms within the coding region. Conclusion As most sequence variants were also found in controls we conclude that mutations in PITX2 are not a common cause of dTGA.

  9. Alternative-splicing in the exon-10 region of GABA(A receptor beta(2 subunit gene: relationships between novel isoforms and psychotic disorders.

    Directory of Open Access Journals (Sweden)

    Cunyou Zhao

    Full Text Available BACKGROUND: Non-coding single nucleotide polymorphisms (SNPs in GABRB2, the gene for beta(2-subunit of gamma-aminobutyric acid type A (GABA(A receptor, have been associated with schizophrenia (SCZ and quantitatively correlated to mRNA expression and alternative splicing. METHODS AND FINDINGS: Expression of the Exon 10 region of GABRB2 from minigene constructs revealed this region to be an "alternative splicing hotspot" that readily gave rise to differently spliced isoforms depending on intron sequences. This led to a search in human brain cDNA libraries, and the discovery of two novel isoforms, beta(2S1 and beta(2S2, bearing variations in the neighborhood of Exon-10. Quantitative real-time PCR analysis of postmortem brain samples showed increased beta(2S1 expression and decreased beta(2S2 expression in both SCZ and bipolar disorder (BPD compared to controls. Disease-control differences were significantly correlated with SNP rs187269 in BPD males for both beta(2S1 and beta(2S2 expressions, and significantly correlated with SNPs rs2546620 and rs187269 in SCZ males for beta(2S2 expression. Moreover, site-directed mutagenesis indicated that Thr(365, a potential phosphorylation site in Exon-10, played a key role in determining the time profile of the ATP-dependent electrophysiological current run-down. CONCLUSION: This study therefore provided experimental evidence for the importance of non-coding sequences in the Exon-10 region in GABRB2 with respect to beta(2-subunit splicing diversity and the etiologies of SCZ and BPD.

  10. Organization and transient expression of the gene for human U11 snRNA

    Science.gov (United States)

    Clemens, Suter-Crazzolara; Walter, Keller

    1991-01-01

    The nucleotide sequence of U11 small nuclear RNA, a minor U RNA from HeLa cells, was determined. Computer analysis of the sequence (135 residues) predicts two strong hairpin loops which are separated by seventeen nucleotides containing an Sm binding site (AAUUUUUUGG). A synthetic gene was constructed in which the coding region of U11 RNA is under the control of a T7 promoter. This vector can be used to produce U11 RNA in vitro. Southern hybridization and PCR analysis of HeLa genomic DNA suggest that U11 RNA is encoded by a single copy gene, and that at least three genomic regions could be U11 RNA pseudogenes. A HeLa genomic copy of a U11 gene was isolated by inverted PCR. This gene contains the U11 RNA coding sequence and several sequence elements unique for the U RNA genes. These include a Distal Sequence Element (DSE, ATTTGCATA) present between positions −215 and −223 relative to the start of transcription; a Proximal Sequence Element (PSE, TTCACCTTTACCAAAAATG) located between positions −43 and −63 ; and a 3′box (GTTAGGCGAAATATTA) between positions +150 and +166. Transfection of HeLa cells with this gene revealed that it is functioning in vivo and can produce U11 RNA. PMID:1820214

  11. Sequence analysis of Epstein-Barr virus EBNA-2 gene coding amino acid 148-487 in nasopharyngeal and gastric carcinomas

    Directory of Open Access Journals (Sweden)

    Wang Xinying

    2012-02-01

    Full Text Available Abstract Background The Epstein-Barr virus (EBV nuclear antigen 2 (EBNA-2 plays a key role in the B-cell growth transformation by initiating and maintaining the proliferation of infected B-cell upon EBV infection in vitro. Most studies about EBNA-2 have focused on its functions yet little is known for its intertypic polymorphisms. Results Coding region for amino acid (aa 148-487 of the EBNA-2 gene was sequenced in 25 EBV-associated gastric carcinomas (EBVaGCs, 56 nasopharyngeal carcinomas (NPCs and 32 throat washings (TWs from healthy donors in Northern China. Three variations (g48991t, c48998a, t49613a were detected in all of the samples (113/113, 100%. EBNA-2 could be classified into four distinct subtypes: E2-A, E2-B, E2-C and E2-D based on the deletion status of three aa (294Q, 357K and 358G. Subtypes E2-A and E2-C were detected in 56/113 (49.6%, 38/113 (33.6% samples, respectively. E2-A was observed more in EBVaGCs samples and subtype E2-D was only detected in the NPC samples. Variation analysis in EBNA-2 functional domains: the TAD residue (I438L and the NLS residues (E476G, P484H and I486T were only detected in NPC samples which located in the carboxyl terminus of EBNA-2 gene. Conclusions The subtypes E2-A and E2-C were the dominant genotypes of the EBNA-2 gene in Northern China. The subtype E2-D may be associated with the tumorigenesis of NPC. The NPC isolates were prone harbor to more mutations than the other two groups in the functional domains.

  12. Growth and gene expression are predominantly controlled by distinct regions of the human IL-4 receptor.

    Science.gov (United States)

    Ryan, J J; McReynolds, L J; Keegan, A; Wang, L H; Garfein, E; Rothman, P; Nelms, K; Paul, W E

    1996-02-01

    IL-4 causes hematopoietic cells to proliferate and express a series of genes, including CD23. We examined whether IL-4-mediated growth, as measured by 4PS phosphorylation, and gene induction were similarly controlled. Studies of M12.4.1 cells expressing human IL-4R truncation mutants indicated that the region between amino acids 557-657 is necessary for full gene expression, which correlated with Stat6 DNA binding activity. This region was not required for 4PS phosphorylation. Tyrosine-to-phenylalanine mutations in the interval between amino acids 557-657 revealed that as long as one tyrosine remained unmutated, CD23 was fully induced. When all three tyrosines were mutated, the receptor was unable to induce CD23. The results indicate that growth regulation and gene expression are principally controlled by distinct regions of IL-4R.

  13. Different expression patterns of genes from the exo-xis region of bacteriophage λ and Shiga toxin-converting bacteriophage Ф24B following infection or prophage induction in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Sylwia Bloch

    Full Text Available Lambdoid bacteriophages serve as useful models in microbiological and molecular studies on basic biological process. Moreover, this family of viruses plays an important role in pathogenesis of enterohemorrhagic Escherichia coli (EHEC strains, as they are carriers of genes coding for Shiga toxins. Efficient expression of these genes requires lambdoid prophage induction and multiplication of the phage genome. Therefore, understanding the mechanisms regulating these processes appears essential for both basic knowledge and potential anti-EHEC applications. The exo-xis region, present in genomes of lambdoid bacteriophages, contains highly conserved genes of largely unknown functions. Recent report indicated that the Ea8.5 protein, encoded in this region, contains a newly discovered fused homeodomain/zinc-finger fold, suggesting its plausible regulatory role. Moreover, subsequent studies demonstrated that overexpression of the exo-xis region from a multicopy plasmid resulted in impaired lysogenization of E. coli and more effective induction of λ and Ф24B prophages. In this report, we demonstrate that after prophage induction, the increase in phage DNA content in the host cells is more efficient in E. coli bearing additional copies of the exo-xis region, while survival rate of such bacteria is lower, which corroborated previous observations. Importantly, by using quantitative real-time reverse transcription PCR, we have determined patterns of expressions of particular genes from this region. Unexpectedly, in both phages λ and Ф24B, these patterns were significantly different not only between conditions of the host cells infection by bacteriophages and prophage induction, but also between induction of prophages with various agents (mitomycin C and hydrogen peroxide. This may shed a new light on our understanding of regulation of lambdoid phage development, depending on the mode of lytic cycle initiation.

  14. Computational Approaches Reveal New Insights into Regulation and Function of Non; coding RNAs and their Targets

    KAUST Repository

    Alam, Tanvir

    2016-01-01

    Regulation and function of protein-coding genes are increasingly well-understood, but no comparable evidence exists for non-coding RNA (ncRNA) genes, which appear to be more numerous than protein-coding genes. We developed a novel machine

  15. Distinct gene number-genome size relationships for eukaryotes and non-eukaryotes: gene content estimation for dinoflagellate genomes.

    Directory of Open Access Journals (Sweden)

    Yubo Hou

    Full Text Available The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log(10-transformed protein-coding gene number (Y' versus log(10-transformed genome size (X', genome size in kbp were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y' = ln(-46.200+22.678X', whereas non-eukaryotes a linear model, Y' = 0.045+0.977X', both with high significance (p0.91. Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%-1% compared to higher and relatively stable percentages in prokaryotes and viruses (97%-47%. The eukaryotic regression models project that the smallest dinoflagellate genome (3x10(6 kbp contains 38,188 protein-coding (40,086 total genes and the largest (245x10(6 kbp 87,688 protein-coding (92,013 total genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species.

  16. Development of Coolant Radioactivity Interpretation Code

    International Nuclear Information System (INIS)

    Kim, Kiyoung; Jung, Youngsuk; Kim, Kyounghyun; Kim, Jangwook

    2013-01-01

    In Korea, the coolant radioactivity analysis has been performed by using the computer codes of foreign companies such as CADE (Westinghouse), IODYNE and CESIUM (ABB-CE). However, these computer codes are too conservative and have involved considerable errors. Furthermore, since these codes are DOS-based program, their easy operability is not satisfactory. Therefore it is required development of an enhanced analysis algorithm applying an analytical method reflecting the change of operational environments of domestic nuclear power plants and a fuel failure evaluation software considering user' conveniences. We have developed a nuclear fuel failure evaluation code able to estimate the number of failed fuel rods and the burn-up of failed fuels during nuclear power plant operation cycle. A Coolant Radio-activity Interpretation Code (CRIC) for LWR has been developed as the output of the project 'Development of Fuel Reliability Enhanced Technique' organized by Korea Institute of Energy Technology Evaluation and Planning (KETEP). The CRIC is Windows based-software able to evaluate the number of failed fuel rods and the burn-up of failed fuel region by analyzing coolant radioactivity of LWR in operation. The CRIC is based on the model of fission products release commonly known as 'three region model' (pellet region, gap region, and coolant region), and we are verifying the CRIC results based on the cases of domestic fuel failures. CRIC users are able to estimate the number of failed fuel rods, burn-up and regions of failed fuel considered enrichment and power distribution of fuel region by using operational cycle data, coolant activity data, fuel loading pattern, Cs-134/Cs-137 ratio according to burn-up and U-235 enrichment provided in the code. Due to development of the CRIC, it is secured own unique fuel failure evaluation code. And, it is expected to have the following significant meaning. This is that the code reflecting a proprietary technique for quantitatively

  17. Gene Map of the HLA Region, Graves' Disease and Hashimoto Thyroiditis, and Hematopoietic Stem Cell Transplantation.

    Science.gov (United States)

    Sasazuki, Takehiko; Inoko, Hidetoshi; Morishima, Satoko; Morishima, Yasuo

    2016-01-01

    The human leukocyte antigen (HLA) genomic region spanning about 4 Mb is the most gene dense and the polymorphic stretches in the human genome. A total of the 269 loci were identified, including 145 protein coding genes mostly important for immunity and 50 noncoding RNAs (ncRNAs). Biological function of these ncRNAs remains unknown, becoming hot spot in the studies of HLA-associated diseases. The genomic diversity analysis in the HLA region facilitated by next-generation sequencing will pave the way to molecular understanding of linkage disequilibrium structure, population diversity, histocompatibility in transplantation, and associations with autoimmune diseases. The 4-digit DNA genotyping of HLA for six HLA loci, HLA-A through DP, in the patients with Graves' disease (GD) and Hashimoto thyroiditis (HT) identified six susceptible and three resistant HLA alleles. Their epistatic interactions in controlling the development of these diseases are shown. Four susceptible and one resistant HLA alleles are shared by GD and HT. Two HLA alleles associated with GD or HT control the titers of autoantibodies to thyroid antigens. All these observations led us to propose a new model for the development of GD and HT. Hematopoietic stem cell transplantation from unrelated donor (UR-HSCT) provides a natural experiment to elucidate the role of allogenic HLA molecules in immune response. Large cohort studies using HLA allele and clinical outcome data have elucidated that (1) HLA locus, allele, and haplotype mismatches between donor and patient, (2) specific amino acid substitution at specific positions of HLA molecules, and (3) ethnic background are all responsible for the immunological events related to UR-HSCT including acute graft-versus-host disease (GVHD), chronic GVHD, graft-versus-leukemia (GvL) effect, and graft failure. © 2016 Elsevier Inc. All rights reserved.

  18. Targeted sequencing reveals low-frequency variants in EPHA genes as markers of paclitaxel-induced peripheral neuropathy.

    OpenAIRE

    Apellániz-Ruiz, Maria; Tejero, Héctor; Inglada-Pérez, Lucía; Sánchez-Barroso, Lara; Gutiérrez-Gutiérrez, Gerardo; Calvo, Isabel; Castelo, Beatriz; Redondo, Andrés; García-Donás, Jesus; Romero-Laorden, Nuria; Sereno, Maria; Merino, María; Currás-Freixes, Maria; Montero-Conde, Cristina; Mancikova, Veronika

    2017-01-01

    PURPOSE: Neuropathy is the dose limiting toxicity of paclitaxel and a major cause for decreased quality of life. Genetic factors have been shown to contribute to paclitaxel neuropathy susceptibility; however, the major causes for inter-individual differences remain unexplained. In this study we identified genetic markers associated with paclitaxel-induced neuropathy through massive sequencing of candidate genes. EXPERIMENTAL DESIGN: We sequenced the coding region of 4 EPHA genes, 5 genes invo...

  19. MHC class I–associated peptides derive from selective regions of the human genome

    Science.gov (United States)

    Pearson, Hillary; Granados, Diana Paola; Durette, Chantal; Bonneil, Eric; Courcelles, Mathieu; Rodenbrock, Anja; Laverdure, Jean-Philippe; Côté, Caroline; Thibault, Pierre

    2016-01-01

    MHC class I–associated peptides (MAPs) define the immune self for CD8+ T lymphocytes and are key targets of cancer immunosurveillance. Here, the goals of our work were to determine whether the entire set of protein-coding genes could generate MAPs and whether specific features influence the ability of discrete genes to generate MAPs. Using proteogenomics, we have identified 25,270 MAPs isolated from the B lymphocytes of 18 individuals who collectively expressed 27 high-frequency HLA-A,B allotypes. The entire MAP repertoire presented by these 27 allotypes covered only 10% of the exomic sequences expressed in B lymphocytes. Indeed, 41% of expressed protein-coding genes generated no MAPs, while 59% of genes generated up to 64 MAPs, often derived from adjacent regions and presented by different allotypes. We next identified several features of transcripts and proteins associated with efficient MAP production. From these data, we built a logistic regression model that predicts with good accuracy whether a gene generates MAPs. Our results show preferential selection of MAPs from a limited repertoire of proteins with distinctive features. The notion that the MHC class I immunopeptidome presents only a small fraction of the protein-coding genome for monitoring by the immune system has profound implications in autoimmunity and cancer immunology. PMID:27841757

  20. Characterization and expression of the maize β-carbonic anhydrase gene repeat regions.

    Science.gov (United States)

    Tems, Ursula; Burnell, James N

    2010-12-01

    In maize, carbonic anhydrase (CA; EC 4.2.1.1) catalyzes the first reaction of the C(4) photosynthetic pathway; it catalyzes the hydration of CO(2) to bicarbonate and provides an inorganic carbon source for the primary carboxylation reaction catalyzed by phosphoenolpyruvate (PEP) carboxylase. The β-CA isozymes from maize, as well as other agronomically important NADP-malic enzyme (NADP-ME) type C(4) crops, have remained relatively uncharacterized but differ significantly from the β-CAs of other C(4) monocot species primarily due to transcript length and the presence of repeat sequences. This research confirmed earlier findings of repeat sequences in maize CA transcripts, and demonstrated that the gene encoding these transcripts is also composed of repeat sequences. One of the maize CA genes was sequenced and found to encode two domains, with distinct groups of exons corresponding to the repeat regions of the transcript. We have also shown that expression of a single repeat region of the CA transcript produced active enzyme that associated as a dimer and was composed primarily of α-helices, consistent with that observed for other plant CAs. As the presence of repeat regions in the CA gene is unique to NADP-ME type C(4) monocot species, the implications of these findings in the context of the evolution of the location and function of this C(4) pathway enzyme are strongly suggestive of CA gene duplication resulting in an evolutionary advantage and a higher photosynthetic efficiency. Copyright © 2010 Elsevier Masson SAS. All rights reserved.

  1. Cohort-specific imputation of gene expression improves prediction of warfarin dose for African Americans

    Directory of Open Access Journals (Sweden)

    Assaf Gottlieb

    2017-11-01

    Full Text Available Abstract Background Genome-wide association studies are useful for discovering genotype–phenotype associations but are limited because they require large cohorts to identify a signal, which can be population-specific. Mapping genetic variation to genes improves power and allows the effects of both protein-coding variation as well as variation in expression to be combined into “gene level” effects. Methods Previous work has shown that warfarin dose can be predicted using information from genetic variation that affects protein-coding regions. Here, we introduce a method that improves dose prediction by integrating tissue-specific gene expression. In particular, we use drug pathways and expression quantitative trait loci knowledge to impute gene expression—on the assumption that differential expression of key pathway genes may impact dose requirement. We focus on 116 genes from the pharmacokinetic and pharmacodynamic pathways of warfarin within training and validation sets comprising both European and African-descent individuals. Results We build gene-tissue signatures associated with warfarin dose in a cohort-specific manner and identify a signature of 11 gene-tissue pairs that significantly augments the International Warfarin Pharmacogenetics Consortium dosage-prediction algorithm in both populations. Conclusions Our results demonstrate that imputed expression can improve dose prediction and bridge population-specific compositions. MATLAB code is available at https://github.com/assafgo/warfarin-cohort

  2. Transcription mapping and expression patterns of genes in the major immediate-early region of Kaposi's sarcoma-associated herpesvirus.

    Science.gov (United States)

    Saveliev, Alexei; Zhu, Fan; Yuan, Yan

    2002-08-01

    Viral immediate-early (IE) genes are the first class of viral genes expressed during primary infection or reactivation from latency. They usually encode regulatory proteins that play crucial roles in viral life cycle. In a previous study, four regions in the KSHV genome were found to be actively transcribed in the immediate-early stage of viral reactivation in primary effusion lymphoma cells. Three immediate-early transcripts were characterized in these regions, as follows: mRNAs for ORF50 (KIE-1), ORF-45 (KIE-2), and ORF K4.2 (KIE-3) (F. X. Zhu, T. Cusano, and Y. Yuan, 1999, J. Virol. 73, 5556-5567). In the present study, we further analyzed the expression of genes in these IE regions in BC-1 and BCBL-1 cells. One of the immediate-early regions (KIE-1) that encompasses ORF50 and other genes was intensively studied to establish a detailed transcription map and expression patterns of genes in this region. This study led to identification of several novel IE transcripts in this region. They include a 2.6-kb mRNA which encodes ORF48/ORF29b, a family of transcripts that are complementary to ORF50 mRNA and a novel K8 IE mRNA of 1.5 kb. Together with the IE mRNA for ORF50 which was identified previously, four immediate-early genes have been mapped to KIE-1 region. Therefore, we would designate KIE-1 the major immediate-early region of KSHV. In addition, we showed that transcription of K8 gene is controlled by two promoters, yielding two transcripts, an immediate-early mRNA of 1.5 kb and a delayed-early mRNA of 1.3 kb.

  3. Nucleotide sequence of the melA gene, coding for alpha-galactosidase in Escherichia coli K-12.

    OpenAIRE

    Liljeström, P L; Liljeström, P

    1987-01-01

    Melibiose uptake and hydrolysis in E.coli is performed by the MelB and MelA proteins, respectively. We report the cloning and sequencing of the melA gene. The nucleotide sequence data showed that melA codes for a 450 amino acid long protein with a molecular weight of 50.6 kd. The sequence data also supported the assumption that the mel locus forms an operon with melA in proximal position. A comparison of MelA with alpha-galactosidase proteins from yeast and human origin showed that these prot...

  4. Screening and association testing of common coding variation in steroid hormone receptor co-activator and co-repressor genes in relation to breast cancer risk: the Multiethnic Cohort

    Directory of Open Access Journals (Sweden)

    Stallcup Michael R

    2009-01-01

    Full Text Available Abstract Background Only a limited number of studies have performed comprehensive investigations of coding variation in relation to breast cancer risk. Given the established role of estrogens in breast cancer, we hypothesized that coding variation in steroid receptor coactivator and corepressor genes may alter inter-individual response to estrogen and serve as markers of breast cancer risk. Methods We sequenced the coding exons of 17 genes (EP300, CCND1, NME1, NCOA1, NCOA2, NCOA3, SMARCA4, SMARCA2, CARM1, FOXA1, MPG, NCOR1, NCOR2, CALCOCO1, PRMT1, PPARBP and CREBBP suggested to influence transcriptional activation by steroid hormone receptors in a multiethnic panel of women with advanced breast cancer (n = 95: African Americans, Latinos, Japanese, Native Hawaiians and European Americans. Association testing of validated coding variants was conducted in a breast cancer case-control study (1,612 invasive cases and 1,961 controls nested in the Multiethnic Cohort. We used logistic regression to estimate odds ratios for allelic effects in ethnic-pooled analyses as well as in subgroups defined by disease stage and steroid hormone receptor status. We also investigated effect modification by established breast cancer risk factors that are associated with steroid hormone exposure. Results We identified 45 coding variants with frequencies ≥ 1% in any one ethnic group (43 non-synonymous variants. We observed nominally significant positive associations with two coding variants in ethnic-pooled analyses (NCOR2: His52Arg, OR = 1.79; 95% CI, 1.05–3.05; CALCOCO1: Arg12His, OR = 2.29; 95% CI, 1.00–5.26. A small number of variants were associated with risk in disease subgroup analyses and we observed no strong evidence of effect modification by breast cancer risk factors. Based on the large number of statistical tests conducted in this study, the nominally significant associations that we observed may be due to chance, and will need to be confirmed in other

  5. Screening and association testing of common coding variation in steroid hormone receptor co-activator and co-repressor genes in relation to breast cancer risk: the Multiethnic Cohort

    International Nuclear Information System (INIS)

    Haiman, Christopher A; Stallcup, Michael R; Greene, Geoffrey L; Press, Michael F; Garcia, Rachel R; Hsu, Chris; Xia, Lucy; Ha, Helen; Sheng, Xin; Le Marchand, Loic; Kolonel, Laurence N; Henderson, Brian E

    2009-01-01

    Only a limited number of studies have performed comprehensive investigations of coding variation in relation to breast cancer risk. Given the established role of estrogens in breast cancer, we hypothesized that coding variation in steroid receptor coactivator and corepressor genes may alter inter-individual response to estrogen and serve as markers of breast cancer risk. We sequenced the coding exons of 17 genes (EP300, CCND1, NME1, NCOA1, NCOA2, NCOA3, SMARCA4, SMARCA2, CARM1, FOXA1, MPG, NCOR1, NCOR2, CALCOCO1, PRMT1, PPARBP and CREBBP) suggested to influence transcriptional activation by steroid hormone receptors in a multiethnic panel of women with advanced breast cancer (n = 95): African Americans, Latinos, Japanese, Native Hawaiians and European Americans. Association testing of validated coding variants was conducted in a breast cancer case-control study (1,612 invasive cases and 1,961 controls) nested in the Multiethnic Cohort. We used logistic regression to estimate odds ratios for allelic effects in ethnic-pooled analyses as well as in subgroups defined by disease stage and steroid hormone receptor status. We also investigated effect modification by established breast cancer risk factors that are associated with steroid hormone exposure. We identified 45 coding variants with frequencies ≥ 1% in any one ethnic group (43 non-synonymous variants). We observed nominally significant positive associations with two coding variants in ethnic-pooled analyses (NCOR2: His52Arg, OR = 1.79; 95% CI, 1.05–3.05; CALCOCO1: Arg12His, OR = 2.29; 95% CI, 1.00–5.26). A small number of variants were associated with risk in disease subgroup analyses and we observed no strong evidence of effect modification by breast cancer risk factors. Based on the large number of statistical tests conducted in this study, the nominally significant associations that we observed may be due to chance, and will need to be confirmed in other studies. Our findings suggest that common coding

  6. Chronic intermittent hypoxia exerts CNS region-specific effects on rat microglial inflammatory and TLR4 gene expression.

    Directory of Open Access Journals (Sweden)

    Stephanie M C Smith

    Full Text Available Intermittent hypoxia (IH during sleep is a hallmark of sleep apnea, causing significant neuronal apoptosis, and cognitive and behavioral deficits in CNS regions underlying memory processing and executive functions. IH-induced neuroinflammation is thought to contribute to cognitive deficits after IH. In the present studies, we tested the hypothesis that IH would differentially induce inflammatory factor gene expression in microglia in a CNS region-dependent manner, and that the effects of IH would differ temporally. To test this hypothesis, adult rats were exposed to intermittent hypoxia (2 min intervals of 10.5% O2 for 8 hours/day during their respective sleep cycles for 1, 3 or 14 days. Cortex, medulla and spinal cord tissues were dissected, microglia were immunomagnetically isolated and mRNA levels of the inflammatory genes iNOS, COX-2, TNFα, IL-1β and IL-6 and the innate immune receptor TLR4 were compared to levels in normoxia. Inflammatory gene expression was also assessed in tissue homogenates (containing all CNS cells. We found that microglia from different CNS regions responded to IH differently. Cortical microglia had longer lasting inflammatory gene expression whereas spinal microglial gene expression was rapid and transient. We also observed that inflammatory gene expression in microglia frequently differed from that in tissue homogenates from the same region, indicating that cells other than microglia also contribute to IH-induced neuroinflammation. Lastly, microglial TLR4 mRNA levels were strongly upregulated by IH in a region- and time-dependent manner, and the increase in TLR4 expression appeared to coincide with timing of peak inflammatory gene expression, suggesting that TLR4 may play a role in IH-induced neuroinflammation. Together, these data indicate that microglial-specific neuroinflammation may play distinct roles in the effects of intermittent hypoxia in different CNS regions.

  7. Transcriptomic profiling of interacting nasal staphylococci species reveals global changes in gene and non-coding RNA expression

    DEFF Research Database (Denmark)

    Hermansen, Grith Miriam Maigaard; Sazinas, Pavelas; Kofod, Ditte

    2018-01-01

    Interspecies interactions between bacterial pathogens and the commensal microbiota can influence disease outcome. In the nasal cavities, Staphylococcus epidermidis has been shown to be a determining factor for Staphylococcus aureus colonization and biofilm formation. However, the interaction...... between S. epidermidis and S. aureus has mainly been described by phenotypic analysis, and little is known about how this interaction modulates gene expression.This study aimed to determine the interactome of nasal S. aureus and S. epidermidis isolates to understand the molecular effect of interaction...... also identified putative non-coding RNAs (ncRNAs) and, interestingly, detected a putative ncRNA transcribed antisense to esp, the serine protease of S. epidermidis, that has previously been shown to inhibit nasal colonization of S. aureus. In our study, the gene encoding Esp and the antisense nc...

  8. Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates

    Science.gov (United States)

    Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas

    2015-01-01

    Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834

  9. Immediate-early gene region of human cytomegalovirus trans-activates the promoter of human immunodeficiency virus

    International Nuclear Information System (INIS)

    Davis, M.G.; Kenney, S.C.; Kamine, J.; Pagano, J.S.; Huang, E.S.

    1987-01-01

    Almost all homosexual patients with acquired immunodeficiency syndrome are also actively infected with human cytomegalovirus (HCMV). The authors have hypothesized that an interaction between HCMV and human immunodeficiency virus (HIV), the agent that causes acquired immunodeficiency syndrome, may exist at a molecular level and contribute to the manifestations of HIV infection. In this report, they demonstrate that the immediate-early gene region of HCMV, in particular immediate-early region 2, trans-activates the expression of the bacterial gene chloramphenicol acetyltransferase that is fused to the HIV long terminal repeat and carried by plasmid pHIV-CAT. The HCMV immediate-early trans-activator increases the level of mRNA from the plamid pHIV-CAT. The sequences of HIV that are responsive to trans-activation by the HDMV immediate-early region are distinct from HIV sequences that are required for response to the HIV tat. The stimulation of HIV gene expression by HDMV gene functions could enhance the consequences of HIV infection in persons with previous or concurrent HCMV infection

  10. Enhancer elements upstream of the SHOX gene are active in the developing limb.

    Science.gov (United States)

    Durand, Claudia; Bangs, Fiona; Signolet, Jason; Decker, Eva; Tickle, Cheryll; Rappold, Gudrun

    2010-05-01

    Léri-Weill Dyschondrosteosis (LWD) is a dominant skeletal disorder characterized by short stature and distinct bone anomalies. SHOX gene mutations and deletions of regulatory elements downstream of SHOX resulting in haploinsufficiency have been found in patients with LWD. SHOX encodes a homeodomain transcription factor and is known to be expressed in the developing limb. We have now analyzed the regulatory significance of the region upstream of the SHOX gene. By comparative genomic analyses, we identified several conserved non-coding elements, which subsequently were tested in an in ovo enhancer assay in both chicken limb bud and cornea, where SHOX is also expressed. In this assay, we found three enhancers to be active in the developing chicken limb, but none were functional in the developing cornea. A screening of 60 LWD patients with an intact SHOX coding and downstream region did not yield any deletion of the upstream enhancer region. Thus, we speculate that SHOX upstream deletions occur at a lower frequency because of the structural organization of this genomic region and/or that SHOX upstream deletions may cause a phenotype that differs from the one observed in LWD.

  11. Molecular analysis of MECP2 gene in Egyptian patients with Rett ...

    African Journals Online (AJOL)

    Molecular analysis of MECP2 gene in Egyptian patients with Rett syndrome. ... Egyptian Journal of Medical Human Genetics ... This study represents one of the limited MECP2 molecular analyses done on Egyptian patients with RTT, in which direct sequencing of MECP2 coding region in 10 female Egyptian patients ...

  12. Transport code and nuclear data in intermediate energy region

    Energy Technology Data Exchange (ETDEWEB)

    Hasegawa, Akira; Odama, Naomitsu [Japan Atomic Energy Research Inst., Tokai, Ibaraki (Japan). Tokai Research Establishment; Maekawa, F.; Ueki, K.; Kosaka, K.; Oyama, Y.

    1998-11-01

    We briefly reviewed the problems of intermediate energy nuclear data file and transport codes in connection with processing of the data. This is a summary of our group in the task force on JENDL High Energy File Integral Evaluation (JHEFIE). In this article we stress the necessity of the production of intermediate evaluated nuclear data file up to 3 GeV for the application of accelerator driven transmutation (ADT) system. And also we state the necessity of having our own transport code system to calculate the radiation fields using these evaluated files from the strategic points of view to keep our development of the ADT technology completely free from other conditions outside of our own such as imported codes and data with poor maintenance or unknown accuracy. (author)

  13. Transport code and nuclear data in intermediate energy region

    International Nuclear Information System (INIS)

    Hasegawa, Akira; Odama, Naomitsu; Maekawa, F.; Ueki, K.; Kosaka, K.; Oyama, Y.

    1998-01-01

    We briefly reviewed the problems of intermediate energy nuclear data file and transport codes in connection with processing of the data. This is a summary of our group in the task force on JENDL High Energy File Integral Evaluation (JHEFIE). In this article we stress the necessity of the production of intermediate evaluated nuclear data file up to 3 GeV for the application of accelerator driven transmutation (ADT) system. And also we state the necessity of having our own transport code system to calculate the radiation fields using these evaluated files from the strategic points of view to keep our development of the ADT technology completely free from other conditions outside of our own such as imported codes and data with poor maintenance or unknown accuracy. (author)

  14. Acetylcholinesterase (AChE) gene modification in transgenic animals: functional consequences of selected exon and regulatory region deletion.

    Science.gov (United States)

    Camp, Shelley; Zhang, Limin; Marquez, Michael; de la Torre, Brian; Long, Jeffery M; Bucht, Goran; Taylor, Palmer

    2005-12-15

    AChE is an alternatively spliced gene. Exons 2, 3 and 4 are invariantly spliced, and this sequence is responsible for catalytic function. The 3' alternatively spliced exons, 5 and 6, are responsible for AChE disposition in tissue [J. Massoulie, The origin of the molecular diversity and functional anchoring of cholinesterases. Neurosignals 11 (3) (2002) 130-143; Y. Li, S. Camp, P. Taylor, Tissue-specific expression and alternative mRNA processing of the mammalian acetylcholinesterase gene. J. Biol. Chem. 268 (8) (1993) 5790-5797]. The splice to exon 5 produces the GPI anchored form of AChE found in the hematopoietic system, whereas the splice to exon 6 produces a sequence that binds to the structural subunits PRiMA and ColQ, producing AChE expression in brain and muscle. A third alternative RNA species is present that is not spliced at the 3' end; the intron 3' of exon 4 is used as coding sequence and produces the read-through, unanchored form of AChE. In order to further understand the role of alternative splicing in the expression of the AChE gene, we have used homologous recombination in stem cells to produce gene specific deletions in mice. Alternatively and together exon 5 and exon 6 were deleted. A cassette containing the neomycin gene flanked by loxP sites was used to replace the exon(s) of interest. Tissue analysis of mice with exon 5 deleted and the neomycin cassette retained showed very low levels of AChE expression, far less than would have been anticipated. Only the read-through species of the enzyme was produced; clearly the inclusion of the selection cassette disrupted splicing of exon 4 to exon 6. The selection cassette was then deleted in exon 5, exon 6 and exons 5 + 6 deleted mice by breeding to Ella-cre transgenic mice. AChE expression in serum, brain and muscle has been analyzed. Another AChE gene targeted mouse strain involving a region in the first intron, found to be critical for AChE expression in muscle cells [S. Camp, L. Zhang, M. Marquez, B

  15. Comparative analysis of vertebrate EIF2AK2 (PKR genes and assignment of the equine gene to ECA15q24–q25 and the bovine gene to BTA11q12–q15

    Directory of Open Access Journals (Sweden)

    Zharkikh Andrey A

    2006-09-01

    Full Text Available Abstract The structures of the canine, rabbit, bovine and equine EIF2AK2 genes were determined. Each of these genes has a 5' non-coding exon as well as 15 coding exons. All of the canine, bovine and equine EIF2AK2 introns have consensus donor and acceptor splice sites. In the equine EIF2AK2 gene, a unique single nucleotide polymorphism that encoded a Tyr329Cys substitution was detected. Regulatory elements predicted in the promoter region were conserved in ungulates, primates, rodents, Afrotheria (elephant and Insectifora (shrew. Western clawed frog and fugu EIF2AK2 gene sequences were detected in the USCS Genome Browser and compared to those of other vertebrate EIF2AK2 genes. A comparison of EIF2AK2 protein domains in vertebrates indicates that the kinase catalytic domains were evolutionarily more conserved than the nucleic acid-binding motifs. Nucleotide substitution rates were uniform among the vertebrate sequences with the exception of the zebrafish and goldfish EIF2AK2 genes, which showed substitution rates about 20% higher than those of other vertebrates. FISH was used to physically assign the horse and cattle genes to chromosome locations, ECA15q24–q25 and BTA11q12–15, respectively. Comparative mapping data confirmed conservation of synteny between ungulates, humans and rodents.

  16. Investigation of the N-terminal coding region of MUC7 alterations in dentistry students with and without caries

    Directory of Open Access Journals (Sweden)

    Koç Öztürk L

    2016-06-01

    Full Text Available Human low-molecular weight salivary mucin (MUC7 is a small, secreted glycoprotein coded by MUC7. In the oral cavity, they inhibit the colonization of oral bacteria, including cariogenic ones, by masking their surface adhesions, thus helping saliva to avoid dental caries. The N-terminal domain is important for low-molecular weight (MG2 mucins to contact with oral microorganisms. In this study, we aimed to identify the N-terminal coding region of the MUC7 gene between individuals with and without caries. Forty-four healthy dental students were enrolled in this study; 24 of them were classified to have caries [decayed, missing, filled-teeth (DMFT = 5.6] according to the World Health Organization (WHO criteria, and 20 of them were caries-free (DMFT = 0. Simplified oral hygiene index (OHI-S and gingival index (GI were used to determine the oral hygiene and gingival conditions. Total protein levels and salivary total protein levels and salivary buffer capacity (SBC were determined by Lowry and Ericsson methods. DNA was extracted from peripheral blood cells of all the participants and genotyping was carried out by a polymerase chain reaction (PCR-sequencing method. No statistical differences were found between two groups in the terms of salivary parameters, oral hygiene and gingival conditions. We detected one common single nucleotide polymorphism (SNP that leads to a change of asparagine to lysine at codon 80. This substitution was found in 29.0 and 40.0%, respectively, of the groups with and without caries. No other sequence variations were detected. The SNP found in this study may be a specific polymorphism affecting the Turkish population. Further studies with extended numbers are necessary in order to clarify this finding.

  17. Divergent evolution and purifying selection of the H (FUT1 gene in New World monkeys (Primates, Platyrrhini

    Directory of Open Access Journals (Sweden)

    Bárbara do Nascimento Borges

    2004-01-01

    Full Text Available In the present study, the coding region of the H gene was sequenced and analyzed in fourteen genera of New World primates (Alouatta, Aotus, Ateles, Brachyteles, Cacajao, Callicebus, Callithrix, Cebus, Chiropotes, Lagothrix, Leontopithecus, Pithecia, Saguinus, and Saimiri, in order to investigate the evolution of the gene. The analyses revealed that this coding region contains 1,101 nucleotides, with the exception of Brachyteles, the callitrichines (Callithrix, Leontopithecus, and Saguinus and one species of Callicebus (moloch, in which one codon was deleted. In the primates studied, the high GC content (63%, the nonrandom distribution of codons and the low evolution rate of the gene (0.513 substitutions/site/MA in the order Primates suggest the action of a purifying type of selective pressure, confirmed by the Z-test. Our analyses did not identify mutations equivalent to those responsible for the H-deficient phenotypes found in humans, nor any other alteration that might explain the lack of expression of the gene in the erythrocytes of Neotropical monkeys. The phylogenetic trees obtained for the H gene and the distance matrix data suggest the occurrence of divergent evolution in the primates.

  18. Long non-coding RNAs and mRNAs profiling during spleen development in pig.

    Science.gov (United States)

    Che, Tiandong; Li, Diyan; Jin, Long; Fu, Yuhua; Liu, Yingkai; Liu, Pengliang; Wang, Yixin; Tang, Qianzi; Ma, Jideng; Wang, Xun; Jiang, Anan; Li, Xuewei; Li, Mingzhou

    2018-01-01

    Genome-wide transcriptomic studies in humans and mice have become extensive and mature. However, a comprehensive and systematic understanding of protein-coding genes and long non-coding RNAs (lncRNAs) expressed during pig spleen development has not been achieved. LncRNAs are known to participate in regulatory networks for an array of biological processes. Here, we constructed 18 RNA libraries from developing fetal pig spleen (55 days before birth), postnatal pig spleens (0, 30, 180 days and 2 years after birth), and the samples from the 2-year-old Wild Boar. A total of 15,040 lncRNA transcripts were identified among these samples. We found that the temporal expression pattern of lncRNAs was more restricted than observed for protein-coding genes. Time-series analysis showed two large modules for protein-coding genes and lncRNAs. The up-regulated module was enriched for genes related to immune and inflammatory function, while the down-regulated module was enriched for cell proliferation processes such as cell division and DNA replication. Co-expression networks indicated the functional relatedness between protein-coding genes and lncRNAs, which were enriched for similar functions over the series of time points examined. We identified numerous differentially expressed protein-coding genes and lncRNAs in all five developmental stages. Notably, ceruloplasmin precursor (CP), a protein-coding gene participating in antioxidant and iron transport processes, was differentially expressed in all stages. This study provides the first catalog of the developing pig spleen, and contributes to a fuller understanding of the molecular mechanisms underpinning mammalian spleen development.

  19. CHIR99021 promotes self-renewal of mouse embryonic stem cells by modulation of protein-encoding gene and long intergenic non-coding RNA expression

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Yongyan [College of Veterinary Medicine, Northwest A and F University, Yangling 712100, Shaanxi (China); Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China); Ai, Zhiying [Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China); College of Life Sciences, Northwest A and F University, Yangling 712100, Shaanxi (China); Yao, Kezhen [College of Veterinary Medicine, Northwest A and F University, Yangling 712100, Shaanxi (China); Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China); Cao, Lixia; Du, Juan; Shi, Xiaoyan [Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China); College of Life Sciences, Northwest A and F University, Yangling 712100, Shaanxi (China); Guo, Zekun, E-mail: gzk@nwsuaf.edu.cn [College of Veterinary Medicine, Northwest A and F University, Yangling 712100, Shaanxi (China); Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China); Zhang, Yong, E-mail: zhylab@hotmail.com [College of Veterinary Medicine, Northwest A and F University, Yangling 712100, Shaanxi (China); Key Laboratory of Animal Biotechnology, Ministry of Agriculture, Northwest A and F University, Yangling 712100, Shaanxi (China)

    2013-10-15

    Embryonic stem cells (ESCs) can proliferate indefinitely in vitro and differentiate into cells of all three germ layers. These unique properties make them exceptionally valuable for drug discovery and regenerative medicine. However, the practical application of ESCs is limited because it is difficult to derive and culture ESCs. It has been demonstrated that CHIR99021 (CHIR) promotes self-renewal and enhances the derivation efficiency of mouse (m)ESCs. However, the downstream targets of CHIR are not fully understood. In this study, we identified CHIR-regulated genes in mESCs using microarray analysis. Our microarray data demonstrated that CHIR not only influenced the Wnt/β-catenin pathway by stabilizing β-catenin, but also modulated several other pluripotency-related signaling pathways such as TGF-β, Notch and MAPK signaling pathways. More detailed analysis demonstrated that CHIR inhibited Nodal signaling, while activating bone morphogenetic protein signaling in mESCs. In addition, we found that pluripotency-maintaining transcription factors were up-regulated by CHIR, while several developmental-related genes were down-regulated. Furthermore, we found that CHIR altered the expression of epigenetic regulatory genes and long intergenic non-coding RNAs. Quantitative real-time PCR results were consistent with microarray data, suggesting that CHIR alters the expression pattern of protein-encoding genes (especially transcription factors), epigenetic regulatory genes and non-coding RNAs to establish a relatively stable pluripotency-maintaining network. - Highlights: • Combined use of CHIR with LIF promotes self-renewal of J1 mESCs. • CHIR-regulated genes are involved in multiple pathways. • CHIR inhibits Nodal signaling and promotes Bmp4 expression to activate BMP signaling. • Expression of epigenetic regulatory genes and lincRNAs is altered by CHIR.

  20. CHIR99021 promotes self-renewal of mouse embryonic stem cells by modulation of protein-encoding gene and long intergenic non-coding RNA expression

    International Nuclear Information System (INIS)

    Wu, Yongyan; Ai, Zhiying; Yao, Kezhen; Cao, Lixia; Du, Juan; Shi, Xiaoyan; Guo, Zekun; Zhang, Yong

    2013-01-01

    Embryonic stem cells (ESCs) can proliferate indefinitely in vitro and differentiate into cells of all three germ layers. These unique properties make them exceptionally valuable for drug discovery and regenerative medicine. However, the practical application of ESCs is limited because it is difficult to derive and culture ESCs. It has been demonstrated that CHIR99021 (CHIR) promotes self-renewal and enhances the derivation efficiency of mouse (m)ESCs. However, the downstream targets of CHIR are not fully understood. In this study, we identified CHIR-regulated genes in mESCs using microarray analysis. Our microarray data demonstrated that CHIR not only influenced the Wnt/β-catenin pathway by stabilizing β-catenin, but also modulated several other pluripotency-related signaling pathways such as TGF-β, Notch and MAPK signaling pathways. More detailed analysis demonstrated that CHIR inhibited Nodal signaling, while activating bone morphogenetic protein signaling in mESCs. In addition, we found that pluripotency-maintaining transcription factors were up-regulated by CHIR, while several developmental-related genes were down-regulated. Furthermore, we found that CHIR altered the expression of epigenetic regulatory genes and long intergenic non-coding RNAs. Quantitative real-time PCR results were consistent with microarray data, suggesting that CHIR alters the expression pattern of protein-encoding genes (especially transcription factors), epigenetic regulatory genes and non-coding RNAs to establish a relatively stable pluripotency-maintaining network. - Highlights: • Combined use of CHIR with LIF promotes self-renewal of J1 mESCs. • CHIR-regulated genes are involved in multiple pathways. • CHIR inhibits Nodal signaling and promotes Bmp4 expression to activate BMP signaling. • Expression of epigenetic regulatory genes and lincRNAs is altered by CHIR

  1. Study on the binding sites of radiosensitivity associated transcription factor in the promoter region of Ier5 gene

    International Nuclear Information System (INIS)

    Cui Wei; Yin Lingling; Dong Lingyue

    2012-01-01

    Objective: To clarify the mechanism of immediate early response gene 5 (Ier5) transcription induced by radiation. Methods: Deletant construction, site-specific mutagenesis,electrophoretic mobility shift assay (EMSA) and chromatin immunoprecipitation (ChIP) were used to forecast the promoter region, binding sites and transcription factors of Ier5 gene in HeLa cells. Results: The promoter region of Ier5 gene might be in the region of Ier5 -8 deletant (-408 - -238 bp). The Ier5 gene had two transcription factors of GCF and NFI, and GCF had two binding sites located in the region of -388 - -382 bp and -274 - -270 bp of Ier5 promoter. The binding site of NFI was located in -362 - -357 bp of Ier5 promoter. GCF could inhibit the expression of Ier5 gene and this inhibition was diminished when the radiation dose increased. In contrast, NFI increased the expression of Ier5. Conclusions: The most possible region of Ier5 promoter is from -408 to -238 bp which has two binding sites for the radiosensitivity transcription factors of GCF and NFI that could negatively and positively regulate the expression of Ier5 respectively. (authors)

  2. Isolation of Genes from Chromosome Region Ip31 Involved in the Development of Breast Cancer

    National Research Council Canada - National Science Library

    Cowell, John

    2000-01-01

    .... Using gene analysis tools, we have been able to demonstrate that few full-length genes are located in this region and that the ESTs from the databases are clustered to a proximal position of the contig...

  3. Mutations in the NDP gene: contribution to Norrie disease, familial exudative vitreoretinopathy and retinopathy of prematurity.

    Science.gov (United States)

    Dickinson, Joanne L; Sale, Michèle M; Passmore, Abraham; FitzGerald, Liesel M; Wheatley, Catherine M; Burdon, Kathryn P; Craig, Jamie E; Tengtrisorn, Supaporn; Carden, Susan M; Maclean, Hector; Mackey, David A

    2006-01-01

    To examine the contribution of mutations within the Norrie disease (NDP) gene to the clinically similar retinal diseases Norrie disease, X-linked familial exudative vitreoretinopathy (FEVR), Coat's disease and retinopathy of prematurity (ROP). A dataset comprising 13 Norrie-FEVR, one Coat's disease, 31 ROP patients and 90 ex-premature babies of Norrie disease patients. Furthermore, a previously described 14-bp deletion located in the 5' unstranslated region of the NDP gene was detected in three cases of regressed ROP. A second heterozygotic 14-bp deletion was detected in an unaffected ex-premature girl. Only two of the 13 Norrie-FEVR index cases had the full features of Norrie disease with deafness and mental retardation. Two novel mutations within the coding region of the NDP gene were found, one associated with a severe disease phenotypes of Norrie disease and the other with FEVR. A deletion within the non-coding region was associated with only mild-regressed ROP, despite the presence of low birthweight, prematurity and exposure to oxygen. In full-term children with retinal detachment only 15% appear to have the full features of Norrie disease and this is important for counselling parents on the possible long-term outcome.

  4. Gene expression levels of elastin and fibulin-5 according to differences between carotid plaque regions.

    Science.gov (United States)

    Sivrikoz, Emre; Timirci-Kahraman, Özlem; Ergen, Arzu; Zeybek, Ümit; Aksoy, Murat; Yanar, Fatih; İsbir, Turgay; Kurtoğlu, Mehmet

    2015-01-01

    The purpose of this study was to investigate the gene expression levels of elastin and fibulin-5 according to differences between carotid plaque regions and to correlate it with clinical features of plaque destabilization. The study included 44 endarterectomy specimens available from operated symptomatic carotid artery stenoses. The specimens were separated according to anatomic location: internal carotid artery (ICA), external carotid artery (ECA) and common carotid artery (CCA), and then stored in liquid nitrogen. The amounts of cDNA for elastin and fibulin-5 were determined by Quantitative real-time PCR (Q-RT-PCR). Target gene copy numbers were normalized using hypoxanthine-guanine phosphoribosyltransferase (HPRT1) gene. The delta-delta CT method was applied for relative quantification. Q-RT-PCR data showed that relative fibulin-5 gene expression was increased in ICA plaque regions when compared to CCA regions but not reaching significance (p=0.061). At the same time, no differences were observed in elastin mRNA level between different anatomic plaque regions (p>0.05). Moreover, elastin and fibulin-5 mRNA expression and clinical parameters were compared in ICA plaques versus CCA and ECA regions, respectively. Up-regulation of elastin and fibulin-5 mRNA levels in ICA were strongly correlated with family history of cardiovascular disease when compared to CCA (p<0.05). Up-regulation of fibulin-5 in ICA was significantly associated with diabetes, and elevated triglycerides and very low density lipoprotein (VLDL) when compared to ECA (p<0.05). The clinical significance is the differences between the proximal and distal regions of the lesion, associated with the ICA, CCA and ECA respectively, with increased fibulin-5 in the ICA region. Copyright © 2015 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  5. Simulations of the broad line region of NGC 5548 with CLOUDY code: Temperature determination

    Directory of Open Access Journals (Sweden)

    Ilić D.

    2007-01-01

    Full Text Available In this paper an analysis of the physical properties of the Broad Line Region (BLR of the active galaxy NGC 5548 is presented. Using the photoionization code CLOUDY and the measurements of Peterson et al. (2002, the physical conditions of the BLR are simulated and the BLR temperature is obtained. This temperature was compared to the temperature estimated with the Boltzmann-Plot (BP method (Popović et al. 2007. It was shown that the measured variability in the BLR temperature could be due to the change in the hydrogen density.

  6. Gene function prediction based on Gene Ontology Hierarchy Preserving Hashing.

    Science.gov (United States)

    Zhao, Yingwen; Fu, Guangyuan; Wang, Jun; Guo, Maozu; Yu, Guoxian

    2018-02-23

    Gene Ontology (GO) uses structured vocabularies (or terms) to describe the molecular functions, biological roles, and cellular locations of gene products in a hierarchical ontology. GO annotations associate genes with GO terms and indicate the given gene products carrying out the biological functions described by the relevant terms. However, predicting correct GO annotations for genes from a massive set of GO terms as defined by GO is a difficult challenge. To combat with this challenge, we introduce a Gene Ontology Hierarchy Preserving Hashing (HPHash) based semantic method for gene function prediction. HPHash firstly measures the taxonomic similarity between GO terms. It then uses a hierarchy preserving hashing technique to keep the hierarchical order between GO terms, and to optimize a series of hashing functions to encode massive GO terms via compact binary codes. After that, HPHash utilizes these hashing functions to project the gene-term association matrix into a low-dimensional one and performs semantic similarity based gene function prediction in the low-dimensional space. Experimental results on three model species (Homo sapiens, Mus musculus and Rattus norvegicus) for interspecies gene function prediction show that HPHash performs better than other related approaches and it is robust to the number of hash functions. In addition, we also take HPHash as a plugin for BLAST based gene function prediction. From the experimental results, HPHash again significantly improves the prediction performance. The codes of HPHash are available at: http://mlda.swu.edu.cn/codes.php?name=HPHash. Copyright © 2018 Elsevier Inc. All rights reserved.

  7. Absence of mutation at the 5'-upstream promoter region of the TPM4 gene from cardiac mutant axolotl (Ambystoma mexicanum).

    Science.gov (United States)

    Denz, Christopher R; Zhang, Chi; Jia, Pingping; Du, Jianfeng; Huang, Xupei; Dube, Syamalima; Thomas, Anish; Poiesz, Bernard J; Dube, Dipak K

    2011-09-01

    Tropomyosins are a family of actin-binding proteins that show cell-specific diversity by a combination of multiple genes and alternative RNA splicing. Of the 4 different tropomyosin genes, TPM4 plays a pivotal role in myofibrillogenesis as well as cardiac contractility in amphibians. In this study, we amplified and sequenced the upstream regulatory region of the TPM4 gene from both normal and mutant axolotl hearts. To identify the cis-elements that are essential for the expression of the TPM4, we created various deletion mutants of the TPM4 promoter DNA, inserted the deleted segments into PGL3 vector, and performed promoter-reporter assay using luciferase as the reporter gene. Comparison of sequences of the promoter region of the TPM4 gene from normal and mutant axolotl revealed no mutations in the promoter sequence of the mutant TPM4 gene. CArG box elements that are generally involved in controlling the expression of several other muscle-specific gene promoters were not found in the upstream regulatory region of the TPM4 gene. In deletion experiments, loss of activity of the reporter gene was noted upon deletion which was then restored upon further deletion suggesting the presence of both positive and negative cis-elements in the upstream regulatory region of the TPM4 gene. We believe that this is the first axolotl promoter that has ever been cloned and studied with clear evidence that it functions in mammalian cell lines. Although striated muscle-specific cis-acting elements are absent from the promoter region of TPM4 gene, our results suggest the presence of positive and negative cis-elements in the promoter region, which in conjunction with positive and negative trans-elements may be involved in regulating the expression of TPM4 gene in a tissue-specific manner.

  8. Functional characterisation of an Arabidopsis gene strongly induced by ionising radiation: the gene coding the poly(ADP-ribose)polymerase-1 (AthPARP-1)

    International Nuclear Information System (INIS)

    Doucet-Chabeaud, G.

    2000-01-01

    Arabidopsis thaliana, the model-system in plant genetics, has been used to study the responses to DNA damage, experimentally introduced by γ-irradiation. We have characterised a radiation-induced gene coding a 111 kDa protein, AthPARP-1, homologous to the human poly(ADP-ribose)polymerase-1 (hPARP-1). As hPARP-1 is composed by three functional domain with characteristic motifs, AthPARP-1 binds to DNA bearing single-strand breaks and shows DNA damage-dependent poly(ADP-ribosyl)ation. The preferential expression of AthPARP-1 in mitotically active tissues is in agreement with a potential role in the maintenance of genome integrity during DNA replication, as proposed for its human counterpart. Transcriptional gene activation by ionising radiation of AthPARP-1 and AthPARP-2 genes is to date plant specific activation. Our expression analyses after exposure to various stress indicate that 1) AthPARP-1 and AthPARP-2 play an important role in the response to DNA lesions, particularly they are activated by genotoxic agents implicating the BER DNA repair pathway 2) AthPARP-2 gene seems to play an additional role in the signal transduction induced by oxidative stress 3) the observed expression profile of AthPARP-1 is in favour of the regulation of AthPARP-1 gene expression at the level of transcription and translation. This mode of regulation of AthPARP-1 protein biosynthesis, clearly distinct from that observed in animals, needs the implication of a so far unidentified transcription factor that is activated by the presence of DNA lesions. The major outcome of this work resides in the isolation and characterisation of such new transcription factor, which will provide new insight on the regulation of plant gene expression by genotoxic stress. (author) [fr

  9. The Genomic Code: Genome Evolution and Potential Applications

    KAUST Repository

    Bernardi, Giorgio

    2016-01-25

    The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.

  10. CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC

    International Nuclear Information System (INIS)

    Congrains, Ada; Kamide, Kei; Katsuya, Tomohiro; Yasuda, Osamu; Oguro, Ryousuke; Yamamoto, Koichi; Ohishi, Mitsuru; Rakugi, Hiromi

    2012-01-01

    Highlights: ► ANRIL maps in the strongest susceptibility locus for cardiovascular disease. ► Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. ► The effects of ANRIL on gene expression are splicing variant specific. ► ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. -- Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have been linking polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of this non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the implication of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knock down. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling, by modulating the expression of genes involved in cell proliferation, apoptosis, extra-cellular matrix remodeling and inflammatory response to finally impact in the risk of cardiovascular disease and other pathologies.

  11. Vertebrate gene predictions and the problem of large genes

    DEFF Research Database (Denmark)

    Wang, Jun; Li, ShengTing; Zhang, Yong

    2003-01-01

    To find unknown protein-coding genes, annotation pipelines use a combination of ab initio gene prediction and similarity to experimentally confirmed genes or proteins. Here, we show that although the ab initio predictions have an intrinsically high false-positive rate, they also have a consistent...

  12. Identification of genes for small non-coding RNAs that belong to the regulon of the two-component regulatory system CiaRH in Streptococcus

    Directory of Open Access Journals (Sweden)

    Hakenbeck Regine

    2010-11-01

    Full Text Available Abstract Background Post-transcriptional regulation by small RNAs (sRNAs in bacteria is now recognized as a wide-spread regulatory mechanism modulating a variety of physiological responses including virulence. In Streptococcus pneumoniae, an important human pathogen, the first sRNAs to be described were found in the regulon of the CiaRH two-component regulatory system. Five of these sRNAs were detected and designated csRNAs for cia-dependent small RNAs. CiaRH pleiotropically affects β-lactam resistance, autolysis, virulence, and competence development by yet to be defined molecular mechanisms. Since CiaRH is highly conserved among streptococci, it is of interest to determine if csRNAs are also included in the CiaRH regulon in this group of organisms consisting of commensal as well as pathogenic species. Knowledge on the participation of csRNAs in CiaRH-dependent regulatory events will be the key to define the physiological role of this important control system. Results Genes for csRNAs were predicted in streptococcal genomes and data base entries other than S. pneumoniae by searching for CiaR-activated promoters located in intergenic regions that are followed by a transcriptional terminator. 61 different candidate genes were obtained specifying csRNAs ranging in size from 51 to 202 nt. Comparing these genes among each other revealed 40 different csRNA types. All streptococcal genomes harbored csRNA genes, their numbers varying between two and six. To validate these predictions, S. mitis, S. oralis, and S. sanguinis were subjected to csRNA-specific northern blot analysis. In addition, a csRNA gene from S. thermophilus plasmid pST0 introduced into S. pneumoniae was also tested. Each of the csRNAs was detected on these blots and showed the anticipated sizes. Thus, the method applied here is able to predict csRNAs with high precision. Conclusions The results of this study strongly suggest that genes for small non-coding RNAs, csRNAs, are part of

  13. Novel overlapping coding sequences in Chlamydia trachomatis

    DEFF Research Database (Denmark)

    Jensen, Klaus Thorleif; Petersen, Lise; Falk, Søren

    2006-01-01

    that are in agreement with the primary annotation. Forty two genes from the primary annotation are not predicted by EasyGene. The majority of these genes are listed as hypothetical in the primary annotation. The 15 novel predicted genes all overlap with genes on the complementary strand. We find homologues of several...... of the novel genes in C. trachomatis Serovar A and Chlamydia muridarum. Several of the genes have typical gene-like and protein-like features. Furthermore, we confirm transcriptional activity from 10 of the putative genes. The combined evidence suggests that at least seven of the 15 are protein coding genes...

  14. Regional differences in gene expression and promoter usage in aged human brains

    KAUST Repository

    Pardo, Luba M.; Rizzu, Patrizia; Francescatto, Margherita; Vitezic, Morana; Leday, Gwenaë l G.R.; Sanchez, Javier Simon; Khamis, Abdullah M.; Takahashi, Hazuki; van de Berg, Wilma D.J.; Medvedeva, Yulia A.; van de Wiel, Mark A.; Daub, Carsten O.; Carninci, Piero; Heutink, Peter

    2013-01-01

    To characterize the promoterome of caudate and putamen regions (striatum), frontal and temporal cortices, and hippocampi from aged human brains, we used high-throughput cap analysis of gene expression to profile the transcription start sites

  15. Evolution of trappin genes in mammals

    Directory of Open Access Journals (Sweden)

    Furutani Yutaka

    2010-01-01

    Full Text Available Abstract Background Trappin is a multifunctional host-defense peptide that has antiproteolytic, antiinflammatory, and antimicrobial activities. The numbers and compositions of trappin paralogs vary among mammalian species: human and sheep have a single trappin-2 gene; mouse and rat have no trappin gene; pig and cow have multiple trappin genes; and guinea pig has a trappin gene and two other derivativegenes. Independent duplications of trappin genes in pig and cow were observed recently after the species were separated. To determine whether these trappin gene duplications are restricted only to certain mammalian lineages, we analyzed recently-developed genome databases for the presence of duplicate trappin genes. Results The database analyses revealed that: 1 duplicated trappin multigenes were found recently in the nine-banded armadillo; 2 duplicated two trappin genes had been found in the Afrotherian species (elephant, tenrec, and hyrax since ancient days; 3 a single trappin-2 gene was found in various eutherians species; and 4 no typical trappin gene has been found in chicken, zebra finch, and opossum. Bayesian analysis estimated the date of the duplication of trappin genes in the Afrotheria, guinea pig, armadillo, cow, and pig to be 244, 35, 11, 13, and 3 million-years ago, respectively. The coding regions of trappin multigenes of almadillo, bovine, and pig evolved much faster than the noncoding exons, introns, and the flanking regions, showing that these genes have undergone accelerated evolution, and positive Darwinian selection was observed in pig-specific trappin paralogs. Conclusion These results suggest that trappin is an eutherian-specific molecule and eutherian genomes have the potential to form trappin multigenes.

  16. Imaging reporter gene for monitoring gene therapy

    International Nuclear Information System (INIS)

    Beco, V. de; Baillet, G.; Tamgac, F.; Tofighi, M.; Weinmann, P.; Vergote, J.; Moretti, J.L.; Tamgac, G.

    2002-01-01

    Scintigraphic images can be obtained to document gene function at cellular level. This approach is presented here and the use of a reporter gene to monitor gene therapy is described. Two main ways are presented: either the use of a reporter gene coding for an enzyme the action of which will be monitored by radiolabeled pro-drug, or a cellular receptor gene, the action of which is documented by a radio labeled cognate receptor ligand. (author)

  17. dPORE-miRNA: Polymorphic regulation of microRNA genes

    KAUST Repository

    Schmeier, Sebastian; Schaefer, Ulf; MacPherson, Cameron R.; Bajic, Vladimir B.

    2011-01-01

    Background: MicroRNAs (miRNAs) are short non-coding RNA molecules that act as post-transcriptional regulators and affect the regulation of protein-coding genes. Mostly transcribed by PolII, miRNA genes are regulated at the transcriptional level similarly to protein-coding genes. In this study we focus on human miRNAs. These miRNAs are involved in a variety of pathways and can affect many diseases. Our interest is on possible deregulation of the transcription initiation of the miRNA encoding genes, which is facilitated by variations in the genomic sequence of transcriptional control regions (promoters). Methodology: Our aim is to provide an online resource to facilitate the investigation of the potential effects of single nucleotide polymorphisms (SNPs) on miRNA gene regulation. We analyzed SNPs overlapped with predicted transcription factor binding sites (TFBSs) in promoters of miRNA genes. We also accounted for the creation of novel TFBSs due to polymorphisms not present in the reference genome. The resulting changes in the original TFBSs and potential creation of new TFBSs were incorporated into the Dragon Database of Polymorphic Regulation of miRNA genes (dPORE-miRNA). Conclusions: The dPORE-miRNA database enables researchers to explore potential effects of SNPs on the regulation of miRNAs. dPORE-miRNA can be interrogated with regards to: a/miRNAs (their targets, or involvement in diseases, or biological pathways), b/SNPs, or c/transcription factors. dPORE-miRNA can be accessed at http://cbrc.kaust.edu.sa/dpore and http://apps.sanbi.ac.za/dpore/. Its use is free for academic and non-profit users. © 2011 Schmeier et al.

  18. dPORE-miRNA: Polymorphic regulation of microRNA genes

    KAUST Repository

    Schmeier, Sebastian

    2011-02-04

    Background: MicroRNAs (miRNAs) are short non-coding RNA molecules that act as post-transcriptional regulators and affect the regulation of protein-coding genes. Mostly transcribed by PolII, miRNA genes are regulated at the transcriptional level similarly to protein-coding genes. In this study we focus on human miRNAs. These miRNAs are involved in a variety of pathways and can affect many diseases. Our interest is on possible deregulation of the transcription initiation of the miRNA encoding genes, which is facilitated by variations in the genomic sequence of transcriptional control regions (promoters). Methodology: Our aim is to provide an online resource to facilitate the investigation of the potential effects of single nucleotide polymorphisms (SNPs) on miRNA gene regulation. We analyzed SNPs overlapped with predicted transcription factor binding sites (TFBSs) in promoters of miRNA genes. We also accounted for the creation of novel TFBSs due to polymorphisms not present in the reference genome. The resulting changes in the original TFBSs and potential creation of new TFBSs were incorporated into the Dragon Database of Polymorphic Regulation of miRNA genes (dPORE-miRNA). Conclusions: The dPORE-miRNA database enables researchers to explore potential effects of SNPs on the regulation of miRNAs. dPORE-miRNA can be interrogated with regards to: a/miRNAs (their targets, or involvement in diseases, or biological pathways), b/SNPs, or c/transcription factors. dPORE-miRNA can be accessed at http://cbrc.kaust.edu.sa/dpore and http://apps.sanbi.ac.za/dpore/. Its use is free for academic and non-profit users. © 2011 Schmeier et al.

  19. Screening for Genes Coding for Putative Antitumor Compounds, Antimicrobial and Enzymatic Activities from Haloalkalitolerant and Haloalkaliphilic Bacteria Strains of Algerian Sahara Soils

    Directory of Open Access Journals (Sweden)

    Okba Selama

    2014-01-01

    Full Text Available Extreme environments may often contain unusual bacterial groups whose physiology is distinct from those of normal environments. To satisfy the need for new bioactive pharmaceuticals compounds and enzymes, we report here the isolation of novel bacteria from an extreme environment. Thirteen selected haloalkalitolerant and haloalkaliphilic bacteria were isolated from Algerian Sahara Desert soils. These isolates were screened for the presence of genes coding for putative antitumor compounds using PCR based methods. Enzymatic, antibacterial, and antifungal activities were determined by using cultural dependant methods. Several of these isolates are typical of desert and alkaline saline soils, but, in addition, we report for the first time the presence of a potential new member of the genus Nocardia with particular activity against the yeast Saccharomyces cerevisiae. In addition to their haloalkali character, the presence of genes coding for putative antitumor compounds, combined with the antimicrobial activity against a broad range of indicator strains and their enzymatic potential, makes them suitable for biotechnology applications.

  20. Association analysis of PRNP gene region with chronic wasting disease in Rocky Mountain elk

    Directory of Open Access Journals (Sweden)

    Spraker Terry R

    2010-11-01

    Full Text Available Abstract Background Chronic wasting disease (CWD is a transmissible spongiform encephalopathy (TSE of cervids including white-tailed (Odocoileus virginianus and mule deer (Odocoileus hemionus, Rocky Mountain elk (Cervus elaphus nelsoni, and moose (Alces alces. A leucine variant at position 132 (132L in prion protein of Rocky Mountain elk confers a long incubation time with CWD, but not complete resistance. However, variants in regulatory regions outside the open reading frame of PRNP have been associated with varying degrees of susceptibility to prion disease in other species, and some variants have been observed in similar regions of Rocky Mountain elk PRNP. Thus, additional genetic variants might provide increased protection, either alone or in combination with 132L. Findings This study provided genomic sequence of all exons for PRNP of Rocky Mountain elk. Many functional sites in and around the PRNP gene region were sequenced, and this report approximately doubled (to 75 the number of known variants in this region. A haplotype-tagging approach was used to reduce the number of genetic variants required to survey this variation in the PRNP gene region of 559 Rocky Mountain elk. Eight haplotypes were observed with frequencies over 1.0%, and one haplotype was present at 71.2% frequency, reflecting limited genetic diversity in the PRNP gene region. Conclusions The presence of 132L cut odds of CWD by more than half (Odds Ratio = 0.43; P = 0.0031, which was similar to a previous report. However after accounting for 132L, no association with CWD was found for any additional variants in the PRNP region (P > 0.05.

  1. The interplay of long non-coding RNAs and MYC in cancer

    Directory of Open Access Journals (Sweden)

    Michael J. Hamilton

    2015-12-01

    Full Text Available Long non-coding RNAs (lncRNAs are a class of RNA molecules that are changing how researchers view eukaryotic gene regulation. Once considered to be non-functional products of low-level aberrant transcription from non-coding regions of the genome, lncRNAs are now viewed as important epigenetic regulators and several lncRNAs have now been demonstrated to be critical players in the development and/or maintenance of cancer. Similarly, the emerging variety of interactions between lncRNAs and MYC, a well-known oncogenic transcription factor linked to most types of cancer, have caught the attention of many biomedical researchers. Investigations exploring the dynamic interactions between lncRNAs and MYC, referred to as the lncRNA-MYC network, have proven to be especially complex. Genome-wide studies have shown that MYC transcriptionally regulates many lncRNA genes. Conversely, recent reports identified lncRNAs that regulate MYC expression both at the transcriptional and post-transcriptional levels. These findings are of particular interest because they suggest roles of lncRNAs as regulators of MYC oncogenic functions and the possibility that targeting lncRNAs could represent a novel avenue to cancer treatment. Here, we briefly review the current understanding of how lncRNAs regulate chromatin structure and gene transcription, and then focus on the new developments in the emerging field exploring the lncRNA-MYC network in cancer.

  2. Construction of a yeast artifical chromosome contig spanning the spinal muscular atrophy disease gene region

    Energy Technology Data Exchange (ETDEWEB)

    Kleyn, P.W.; Wang, C.H.; Vitale, E.; Pan, J.; Ross, B.M.; Grunn, A.; Palmer, D.A.; Warburton, D.; Brzustowicz, L.M.; Gilliam, T.G. (New York State Psychiatric Institute, NY (United States)); Lien, L.L.; Kunkel, L.M. (Howard Hughes Medical Institute, Boston, MA (United States))

    1993-07-15

    The childhood spinal muscular atrophies (SMAs) are the most common, serious neuromuscular disorders of childhood second to Duchenne muscular dystrophy. A single locus for these disorders has been mapped by recombination events to a region of 0.7 centimorgan (range, 0.1-2.1 centimorgans) between loci D5S435 and MAP1B on chromosome 5q11.2-13.3. By using PCR amplification to screen yeast artificial chromosome (YAC) DNA pools and the PCR-vectorette method to amplify YAC ends, a YAC contig was constructed across the disease gene region. Nine walk steps identified 32 YACs, including a minimum of seven overlapping YAC clones (average size, 460 kb) that span the SMA region. The contig is characterized by a collection of 30 YAC-end sequence tag sites together with seven genetic markers. The entire YAC contig spans a minimum of 3.2 Mb; the SMA locus is confined to roughly half of this region. Microsatellite markers generated along the YAC contig segregate with the SMA locus in all families where the flanking markers (D5S435 and MAP1B) recombine. Construction of a YAC contig across the disease gene region is an essential step in isolation of the SMA-encoding gene. 26 refs., 3 figs., 1 tab.

  3. A nine-nucleotide deletion and splice variation in the coding region of the interferon induced ISG12 gene

    DEFF Research Database (Denmark)

    Smidt, Kamille; Hansen, Lise Lotte; Søgaard, T Max M

    2003-01-01

    distributed between ISG12 and ISG12-S in breast carcinoma cells, in cancer cell lines and in cervical cytobrush material with neoplastic lesions. In addition, we have found a nine-nucleotide deletion situated in exon 4 of the ISG12 gene. This deletion leads to a three-amino-acid deletion (AMA) in the putative...... ISG12 gene products, ISG12Δ and ISG12-SΔ. We have determined the prevalence of the deletion ISG12Δ in normal and neoplastic cells. Homozygosity ISG12(0/0) and ISG12(Δ/Δ), and heterozygosity ISG12(0/Δ) were found, although the ISG12(Δ/Δ) genotype was rare. In heterozygous cells from cytobrush material...

  4. Coupling a Basin Modeling and a Seismic Code using MOAB

    KAUST Repository

    Yan, Mi; Jordan, Kirk; Kaushik, Dinesh; Perrone, Michael; Sachdeva, Vipin; Tautges, Timothy J.; Magerlein, John

    2012-01-01

    We report on a demonstration of loose multiphysics coupling between a basin modeling code and a seismic code running on a large parallel machine. Multiphysics coupling, which is one critical capability for a high performance computing (HPC) framework, was implemented using the MOAB open-source mesh and field database. MOAB provides for code coupling by storing mesh data and input and output field data for the coupled analysis codes and interpolating the field values between different meshes used by the coupled codes. We found it straightforward to use MOAB to couple the PBSM basin modeling code and the FWI3D seismic code on an IBM Blue Gene/P system. We describe how the coupling was implemented and present benchmarking results for up to 8 racks of Blue Gene/P with 8192 nodes and MPI processes. The coupling code is fast compared to the analysis codes and it scales well up to at least 8192 nodes, indicating that a mesh and field database is an efficient way to implement loose multiphysics coupling for large parallel machines.

  5. Coupling a Basin Modeling and a Seismic Code using MOAB

    KAUST Repository

    Yan, Mi

    2012-06-02

    We report on a demonstration of loose multiphysics coupling between a basin modeling code and a seismic code running on a large parallel machine. Multiphysics coupling, which is one critical capability for a high performance computing (HPC) framework, was implemented using the MOAB open-source mesh and field database. MOAB provides for code coupling by storing mesh data and input and output field data for the coupled analysis codes and interpolating the field values between different meshes used by the coupled codes. We found it straightforward to use MOAB to couple the PBSM basin modeling code and the FWI3D seismic code on an IBM Blue Gene/P system. We describe how the coupling was implemented and present benchmarking results for up to 8 racks of Blue Gene/P with 8192 nodes and MPI processes. The coupling code is fast compared to the analysis codes and it scales well up to at least 8192 nodes, indicating that a mesh and field database is an efficient way to implement loose multiphysics coupling for large parallel machines.

  6. Transcription Factors Bind Thousands of Active and InactiveRegions in the Drosophila Blastoderm

    Energy Technology Data Exchange (ETDEWEB)

    Li, Xiao-Yong; MacArthur, Stewart; Bourgon, Richard; Nix, David; Pollard, Daniel A.; Iyer, Venky N.; Hechmer, Aaron; Simirenko, Lisa; Stapleton, Mark; Luengo Hendriks, Cris L.; Chu, Hou Cheng; Ogawa, Nobuo; Inwood, William; Sementchenko, Victor; Beaton, Amy; Weiszmann, Richard; Celniker, Susan E.; Knowles, David W.; Gingeras, Tom; Speed, Terence P.; Eisen, Michael B.; Biggin, Mark D.

    2008-01-10

    Identifying the genomic regions bound by sequence-specific regulatory factors is central both to deciphering the complex DNA cis-regulatory code that controls transcription in metazoans and to determining the range of genes that shape animal morphogenesis. Here, we use whole-genome tiling arrays to map sequences bound in Drosophila melanogaster embryos by the six maternal and gap transcription factors that initiate anterior-posterior patterning. We find that these sequence-specific DNA binding proteins bind with quantitatively different specificities to highly overlapping sets of several thousand genomic regions in blastoderm embryos. Specific high- and moderate-affinity in vitro recognition sequences for each factor are enriched in bound regions. This enrichment, however, is not sufficient to explain the pattern of binding in vivo and varies in a context-dependent manner, demonstrating that higher-order rules must govern targeting of transcription factors. The more highly bound regions include all of the over forty well-characterized enhancers known to respond to these factors as well as several hundred putative new cis-regulatory modules clustered near developmental regulators and other genes with patterned expression at this stage of embryogenesis. The new targets include most of the microRNAs (miRNAs) transcribed in the blastoderm, as well as all major zygotically transcribed dorsal-ventral patterning genes, whose expression we show to be quantitatively modulated by anterior-posterior factors. In addition to these highly bound regions, there are several thousand regions that are reproducibly bound at lower levels. However, these poorly bound regions are, collectively, far more distant from genes transcribed in the blastoderm than highly bound regions; are preferentially found in protein-coding sequences; and are less conserved than highly bound regions. Together these observations suggest that many of these poorly-bound regions are not involved in early

  7. Pseudotyped Lentiviral Vectors for Retrograde Gene Delivery into Target Brain Regions

    Directory of Open Access Journals (Sweden)

    Kenta Kobayashi

    2017-08-01

    Full Text Available Gene transfer through retrograde axonal transport of viral vectors offers a substantial advantage for analyzing roles of specific neuronal pathways or cell types forming complex neural networks. This genetic approach may also be useful in gene therapy trials by enabling delivery of transgenes into a target brain region distant from the injection site of the vectors. Pseudotyping of a lentiviral vector based on human immunodeficiency virus type 1 (HIV-1 with various fusion envelope glycoproteins composed of different combinations of rabies virus glycoprotein (RV-G and vesicular stomatitis virus glycoprotein (VSV-G enhances the efficiency of retrograde gene transfer in both rodent and nonhuman primate brains. The most recently developed lentiviral vector is a pseudotype with fusion glycoprotein type E (FuG-E, which demonstrates highly efficient retrograde gene transfer in the brain. The FuG-E–pseudotyped vector permits powerful experimental strategies for more precisely investigating the mechanisms underlying various brain functions. It also contributes to the development of new gene therapy approaches for neurodegenerative disorders, such as Parkinson’s disease, by delivering genes required for survival and protection into specific neuronal populations. In this review article, we report the properties of the FuG-E–pseudotyped vector, and we describe the application of the vector to neural circuit analysis and the potential use of the FuG-E vector in gene therapy for Parkinson’s disease.

  8. ELFN1-AS1: A Novel Primate Gene with Possible MicroRNA Function Expressed Predominantly in Human Tumors

    Directory of Open Access Journals (Sweden)

    Dmitrii E. Polev

    2014-01-01

    Full Text Available Human gene LOC100505644 uncharacterized LOC100505644 [Homo sapiens] (Entrez Gene ID 100505644 is abundantly expressed in tumors but weakly expressed in few normal tissues. Till now the function of this gene remains unknown. Here we identified the chromosomal borders of the transcribed region and the major splice form of the LOC100505644-specific transcript. We characterised the major regulatory motifs of the gene and its splice sites. Analysis of the secondary structure of the major transcript variant revealed a hairpin-like structure characteristic for precursor microRNAs. Comparative genomic analysis of the locus showed that it originated in primates de novo. Taken together, our data indicate that human gene LOC100505644 encodes some non-protein coding RNA, likely a microRNA. It was assigned a gene symbol ELFN1-AS1 (ELFN1 antisense RNA 1 (non-protein coding. This gene combines features of evolutionary novelty and predominant expression in tumors.

  9. HNF1 alpha gene coding regions mutations screening, in a Caucasian population clinically characterized as MODY from Argentina.

    Science.gov (United States)

    Lopez, Ariel Pablo; Foscaldi, Sabrina Andrea; Perez, Maria Silvia; Rodriguez, Martín; Traversa, Mercedes; Puchulu, Félix Miguel; Bergada, Ignacio; Frechtel, Gustavo Daniel

    2011-02-01

    There are at least six subtypes of Maturity Onset Diabetes of the Young (MODY) with distinctive genetic causes. MODY 3 is caused by mutations in HNF1A gene, an insulin transcription factor, so mutations in this gene are associated with impaired insulin secretion. MODY 3 prevalence differs according to the population analyzed, but it is one of the most frequent subtypes. Therefore, our aims in this work were to find mutations present in the HNF1A gene and provide information on their prevalence. Mutations screening was done in a group of 80 unrelated patients (average age 17.1 years) selected by clinical characterization of MODY, by SSCP electrophoresis followed by sequenciation. We found eight mutations, of which six were novel and four sequence variants, which were all novel. Therefore the prevalence of MODY 3 in this group was 10%. Compared clinical data between the non-MODY 3 patients and the MODY 3 diagnosed patients did not show any significant difference. Eight patients were diagnosed as MODY 3 and new data about the prevalence of that subtype is provided. Our results contribute to reveal novel mutations, providing new data about the prevalence of that subtype. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  10. The Arabidopsis TOR Kinase Specifically Regulates the Expression of Nuclear Genes Coding for Plastidic Ribosomal Proteins and the Phosphorylation of the Cytosolic Ribosomal Protein S6.

    Science.gov (United States)

    Dobrenel, Thomas; Mancera-Martínez, Eder; Forzani, Céline; Azzopardi, Marianne; Davanture, Marlène; Moreau, Manon; Schepetilnikov, Mikhail; Chicher, Johana; Langella, Olivier; Zivy, Michel; Robaglia, Christophe; Ryabova, Lyubov A; Hanson, Johannes; Meyer, Christian

    2016-01-01

    Protein translation is an energy consuming process that has to be fine-tuned at both the cell and organism levels to match the availability of resources. The target of rapamycin kinase (TOR) is a key regulator of a large range of biological processes in response to environmental cues. In this study, we have investigated the effects of TOR inactivation on the expression and regulation of Arabidopsis ribosomal proteins at different levels of analysis, namely from transcriptomic to phosphoproteomic. TOR inactivation resulted in a coordinated down-regulation of the transcription and translation of nuclear-encoded mRNAs coding for plastidic ribosomal proteins, which could explain the chlorotic phenotype of the TOR silenced plants. We have identified in the 5' untranslated regions (UTRs) of this set of genes a conserved sequence related to the 5' terminal oligopyrimidine motif, which is known to confer translational regulation by the TOR kinase in other eukaryotes. Furthermore, the phosphoproteomic analysis of the ribosomal fraction following TOR inactivation revealed a lower phosphorylation of the conserved Ser240 residue in the C-terminal region of the 40S ribosomal protein S6 (RPS6). These results were confirmed by Western blot analysis using an antibody that specifically recognizes phosphorylated Ser240 in RPS6. Finally, this antibody was used to follow TOR activity in plants. Our results thus uncover a multi-level regulation of plant ribosomal genes and proteins by the TOR kinase.

  11. Recombinant lactoferrin (Lf) of Vechur cow, the critical breed of Bos indicus and the Lf gene variants.

    Science.gov (United States)

    Anisha, Shashidharan; Bhasker, Salini; Mohankumar, Chinnamma

    2012-03-01

    Vechur cow, categorized as a critically maintained breed by the FAO, is a unique breed of Bos indicus due to its extremely small size, less fodder intake, adaptability, easy domestication and traditional medicinal property of the milk. Lactoferrin (Lf) is an iron-binding glycoprotein that is found predominantly in the milk of mammals. The full coding region of Lf gene of Vechur cow was cloned, sequenced and expressed in a prokaryotic system. Antibacterial activity of the recombinant Lf showed suppression of bacterial growth. To the best of our knowledge this is the first time that the full coding region of Lf gene of B. indicus Vechur breed is sequenced, successfully expressed in a prokaryotic system and characterized. Comparative analysis of Lf gene sequence of five Vechur cows with B. taurus revealed 15 SNPs in the exon region associated with 11 amino acid substitutions. The amino acid arginine was noticed as a pronounced substitution and the tertiary structure analysis of the BLfV protein confirmed the positions of arginine in the β sheet region, random coil and helix region 1. Based on the recent reports on the nutritional therapies of arginine supplementation for wound healing and for cardiovascular diseases, the higher level of arginine in the lactoferrin protein of Vechur cow milk provides enormous scope for further therapeutic studies. Copyright © 2011 Elsevier B.V. All rights reserved.

  12. Anthropogenic antibiotic resistance genes mobilization to the polar regions.

    Science.gov (United States)

    Hernández, Jorge; González-Acuña, Daniel

    2016-01-01

    Anthropogenic influences in the southern polar region have been rare, but lately microorganisms associated with humans have reached Antarctica, possibly from military bases, fishing boats, scientific expeditions, and/or ship-borne tourism. Studies of seawater in areas of human intervention and proximal to fresh penguin feces revealed the presence of Escherichia coli strains least resistant to antibiotics in penguins, whereas E. coli from seawater elsewhere showed resistance to one or more of the following antibiotics: ampicillin, tetracycline, streptomycin, and trim-sulfa. In seawater samples, bacteria were found carrying extended-spectrum β-lactamase (ESBL)-type CTX-M genes in which multilocus sequencing typing (MLST) showed different sequence types (STs), previously reported in humans. In the Arctic, on the contrary, people have been present for a long time, and the presence of antibiotic resistance genes (ARGs) appears to be much more wide-spread than was previously reported. Studies of E coli from Arctic birds (Bering Strait) revealed reduced susceptibility to antibiotics, but one globally spreading clone of E. coli genotype O25b-ST131, carrying genes of ESBL-type CTX-M, was identified. In the few years between sample collections in the same area, differences in resistance pattern were observed, with E. coli from birds showing resistance to a maximum of five different antibiotics. Presence of resistance-type ESBLs (TEM, SHV, and CTX-M) in E. coli and Klebsiella pneumoniae was also confirmed by specified PCR methods. MLST revealed that those bacteria carried STs that connect them to previously described strains in humans. In conclusion, bacteria previously related to humans could be found in relatively pristine environments, and presently human-associated, antibiotic-resistant bacteria have reached a high global level of distribution that they are now found even in the polar regions.

  13. A study of the frequency of methylation of gene promoter regions in ...

    Indian Academy of Sciences (India)

    2013-04-02

    Apr 2, 2013 ... colorectal cancer in the Taiwanese population. CHANG-CHIEH WU1 ... hypermethylation of promoter-region CpG islands is an important ... mismatch repair gene MLH1 plays an important role in dele- ..... Asia Pac. J. Clin.

  14. Isolation and expression of the genes coding for the membrane bound transglycosylase B (MltB and the transferrin binding protein B (TbpB of the salmon pathogen Piscirickettsia salmonis

    Directory of Open Access Journals (Sweden)

    VIVIAN WILHELM

    2004-01-01

    Full Text Available We have isolated and sequenced the genes encoding the membrane bound transglycosylase B (MltB and the transferring binding protein B (TbpB of the salmon pathogen Piscirickettsia salmonis. The results of the sequence revealed two open reading frames that encode proteins with calculated molecular weights of 38,830 and 85,140. The deduced aminoacid sequences of both proteins show a significant homology to the respective protein from phylogenetically related microorganisms. Partial sequences coding the amino and carboxyl regions of MltB and a sequence of 761 base pairs encoding the amino region of TbpB have been expressed in E. coli. The strong humoral response elicited by these proteins in mouse confirmed the immunogenic properties of the recombinant proteins. A similar response was elicited by both proteins when injected intraperitoneally in Atlantic salmon. The present data indicates that these proteins are good candidates to be used in formulations to study the protective immunity of salmon to infection by P. salmonis.

  15. Parallel Evolution of Genes and Languages in the Caucasus Region

    Science.gov (United States)

    Balanovsky, Oleg; Dibirova, Khadizhat; Dybo, Anna; Mudrak, Oleg; Frolova, Svetlana; Pocheshkhova, Elvira; Haber, Marc; Platt, Daniel; Schurr, Theodore; Haak, Wolfgang; Kuznetsova, Marina; Radzhabov, Magomed; Balaganskaya, Olga; Romanov, Alexey; Zakharova, Tatiana; Soria Hernanz, David F.; Zalloua, Pierre; Koshel, Sergey; Ruhlen, Merritt; Renfrew, Colin; Wells, R. Spencer; Tyler-Smith, Chris; Balanovska, Elena

    2012-01-01

    We analyzed 40 SNP and 19 STR Y-chromosomal markers in a large sample of 1,525 indigenous individuals from 14 populations in the Caucasus and 254 additional individuals representing potential source populations. We also employed a lexicostatistical approach to reconstruct the history of the languages of the North Caucasian family spoken by the Caucasus populations. We found a different major haplogroup to be prevalent in each of four sets of populations that occupy distinct geographic regions and belong to different linguistic branches. The haplogroup frequencies correlated with geography and, even more strongly, with language. Within haplogroups, a number of haplotype clusters were shown to be specific to individual populations and languages. The data suggested a direct origin of Caucasus male lineages from the Near East, followed by high levels of isolation, differentiation and genetic drift in situ. Comparison of genetic and linguistic reconstructions covering the last few millennia showed striking correspondences between the topology and dates of the respective gene and language trees, and with documented historical events. Overall, in the Caucasus region, unmatched levels of gene-language co-evolution occurred within geographically isolated populations, probably due to its mountainous terrain. PMID:21571925

  16. A Region-Based GeneSIS Segmentation Algorithm for the Classification of Remotely Sensed Images

    Directory of Open Access Journals (Sweden)

    Stelios K. Mylonas

    2015-03-01

    Full Text Available This paper proposes an object-based segmentation/classification scheme for remotely sensed images, based on a novel variant of the recently proposed Genetic Sequential Image Segmentation (GeneSIS algorithm. GeneSIS segments the image in an iterative manner, whereby at each iteration a single object is extracted via a genetic-based object extraction algorithm. Contrary to the previous pixel-based GeneSIS where the candidate objects to be extracted were evaluated through the fuzzy content of their included pixels, in the newly developed region-based GeneSIS algorithm, a watershed-driven fine segmentation map is initially obtained from the original image, which serves as the basis for the forthcoming GeneSIS segmentation. Furthermore, in order to enhance the spatial search capabilities, we introduce a more descriptive encoding scheme in the object extraction algorithm, where the structural search modules are represented by polygonal shapes. Our objectives in the new framework are posed as follows: enhance the flexibility of the algorithm in extracting more flexible object shapes, assure high level classification accuracies, and reduce the execution time of the segmentation, while at the same time preserving all the inherent attributes of the GeneSIS approach. Finally, exploiting the inherent attribute of GeneSIS to produce multiple segmentations, we also propose two segmentation fusion schemes that operate on the ensemble of segmentations generated by GeneSIS. Our approaches are tested on an urban and two agricultural images. The results show that region-based GeneSIS has considerably lower computational demands compared to the pixel-based one. Furthermore, the suggested methods achieve higher classification accuracies and good segmentation maps compared to a series of existing algorithms.

  17. Mutational analysis of the multicopy hao gene coding for hydroxylamine oxidoreductase in Nitrosomonas sp. strain ENI-11.

    Science.gov (United States)

    Yamagata, A; Hirota, R; Kato, J; Kuroda, A; Ikeda, T; Takiguchi, N; Ohtake, H

    2000-08-01

    The ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11 contains three copies of the hao gene (hao1, hao2, and hao3) coding for hydroxylamine oxidoreductase (HAO). Three single mutants (hao1::kan, hao2::kan, or hao3::kan) had 68 to 75% of the wild-type growth rate and 58 to 89% of the wild-type HAO activity when grown under the same conditions. A double mutant (hao1::kan and hao3::amp) also had 68% of the wild-type growth and 37% of the wild-type HAO activity.

  18. cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity

    International Nuclear Information System (INIS)

    Taira, M.; Yoshida, T.; Miyagawa, K.; Sakamoto, H.; Terada, M.; Sugimura, T.

    1987-01-01

    The hst gene was originally identified as a transforming gene in DNAs from human stomach cancers and from a noncancerous portion of stomach mucosa by DNA-mediated transfection assay using NIH3T3 cells. cDNA clones of hst were isolated from the cDNA library constructed from poly(A) + RNA of a secondary transformant induced by the DNA from a stomach cancer. The sequence analysis of the hst cDNA revealed the presence of two open reading frames. When this cDNA was inserted into an expression vector containing the simian virus 40 promoter, it efficiently induced the transformation of NIH3T3 cells upon transfection. It was found that one of the reading frames, which coded for 206 amino acids, was responsible for the transforming activity

  19. A genetic polymorphism in the coding region of the gastric intrinsic factor gene (GIF) is associated with congenital intrinsic factor deficiency.

    Science.gov (United States)

    Gordon, Marilyn M; Brada, Nancy; Remacha, Angel; Badell, Isabel; del Río, Elisabeth; Baiget, Montserrat; Santer, René; Quadros, Edward V; Rothenberg, Sheldon P; Alpers, David H

    2004-01-01

    Congenital intrinsic factor (IF) deficiency is a disorder characterized by megaloblastic anemia due to the absence of gastric IF (GIF, GenBank NM_005142) and GIF antibodies, with probable autosomal recessive inheritance. Most of the reported patients are isolated cases without genetic studies of the parents or siblings. Complete exonic sequences were determined from the PCR products generated from genomic DNA of five affected individuals. All probands had the identical variant (g.68A>G) in the second position of the fifth codon in the coding sequence of the gene that introduces a restriction enzyme site for Msp I and predicts a change in the mature protein from glutamine(5) (CAG) to arginine(5) (CGG). Three subjects were homozygous for this base exchange and two subjects were heterozygous, one of which was apparently a compound heterozygote at positions 1 and 2 of the fifth codon ([g.67C>G] + [g.68A>G]). The other patient, heterozygous for position 2, had one heterozygous unaffected parent. Most parents were heterozygous for this base exchange, confirming the pattern of autosomal recessive inheritance for congenital IF deficiency. cDNA encoding GIF was mutated at base pair g.68 (A>G) and expressed in COS-7 cells. The apparent size, secretion rate, and sensitivity to pepsin hydrolysis of the expressed IF were similar to native IF. The allelic frequency of g.68A>G was 0.067 and 0.038 in two control populations. This sequence aberration is not the cause of the phenotype, but is associated with the genotype of congenital IF deficiency and could serve as a marker for inheritance of this disorder. Copyright 2003 Wiley-Liss, Inc.

  20. Expression of the Long Intergenic Non-Protein Coding RNA 665 (LINC00665) Gene and the Cell Cycle in Hepatocellular Carcinoma Using The Cancer Genome Atlas, the Gene Expression Omnibus, and Quantitative Real-Time Polymerase Chain Reaction.

    Science.gov (United States)

    Wen, Dong-Yue; Lin, Peng; Pang, Yu-Yan; Chen, Gang; He, Yun; Dang, Yi-Wu; Yang, Hong

    2018-05-05

    BACKGROUND Long non-coding RNAs (lncRNAs) have a role in physiological and pathological processes, including cancer. The aim of this study was to investigate the expression of the long intergenic non-protein coding RNA 665 (LINC00665) gene and the cell cycle in hepatocellular carcinoma (HCC) using database analysis including The Cancer Genome Atlas (TCGA), the Gene Expression Omnibus (GEO), and quantitative real-time polymerase chain reaction (qPCR). MATERIAL AND METHODS Expression levels of LINC00665 were compared between human tissue samples of HCC and adjacent normal liver, clinicopathological correlations were made using TCGA and the GEO, and qPCR was performed to validate the findings. Other public databases were searched for other genes associated with LINC00665 expression, including The Atlas of Noncoding RNAs in Cancer (TANRIC), the Multi Experiment Matrix (MEM), Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and protein-protein interaction (PPI) networks. RESULTS Overexpression of LINC00665 in patients with HCC was significantly associated with gender, tumor grade, stage, and tumor cell type. Overexpression of LINC00665 in patients with HCC was significantly associated with overall survival (OS) (HR=1.47795%; CI: 1.046-2.086). Bioinformatics analysis identified 469 related genes and further analysis supported a hypothesis that LINC00665 regulates pathways in the cell cycle to facilitate the development and progression of HCC through ten identified core genes: CDK1, BUB1B, BUB1, PLK1, CCNB2, CCNB1, CDC20, ESPL1, MAD2L1, and CCNA2. CONCLUSIONS Overexpression of the lncRNA, LINC00665 may be involved in the regulation of cell cycle pathways in HCC through ten identified hub genes.

  1. A Novel Phytophthora sojae Resistance Rps12 Gene Mapped to a Genomic Region That Contains Several Rps Genes.

    Science.gov (United States)

    Sahoo, Dipak K; Abeysekara, Nilwala S; Cianzio, Silvia R; Robertson, Alison E; Bhattacharyya, Madan K

    2017-01-01

    Phytophthora sojae Kaufmann and Gerdemann, which causes Phytophthora root rot, is a widespread pathogen that limits soybean production worldwide. Development of Phytophthora resistant cultivars carrying Phytophthora resistance Rps genes is a cost-effective approach in controlling this disease. For this mapping study of a novel Rps gene, 290 recombinant inbred lines (RILs) (F7 families) were developed by crossing the P. sojae resistant cultivar PI399036 with the P. sojae susceptible AR2 line, and were phenotyped for responses to a mixture of three P. sojae isolates that overcome most of the known Rps genes. Of these 290 RILs, 130 were homozygous resistant, 12 heterzygous and segregating for Phytophthora resistance, and 148 were recessive homozygous and susceptible. From this population, 59 RILs homozygous for Phytophthora sojae resistance and 61 susceptible to a mixture of P. sojae isolates R17 and Val12-11 or P7074 that overcome resistance encoded by known Rps genes mapped to Chromosome 18 were selected for mapping novel Rps gene. A single gene accounted for the 1:1 segregation of resistance and susceptibility among the RILs. The gene encoding the Phytophthora resistance mapped to a 5.8 cM interval between the SSR markers BARCSOYSSR_18_1840 and Sat_064 located in the lower arm of Chromosome 18. The gene is mapped 2.2 cM proximal to the NBSRps4/6-like sequence that was reported to co-segregate with the Phytophthora resistance genes Rps4 and Rps6. The gene is mapped to a highly recombinogenic, gene-rich genomic region carrying several nucleotide binding site-leucine rich repeat (NBS-LRR)-like genes. We named this novel gene as Rps12, which is expected to be an invaluable resource in breeding soybeans for Phytophthora resistance.

  2. A Novel Phytophthora sojae Resistance Rps12 Gene Mapped to a Genomic Region That Contains Several Rps Genes.

    Directory of Open Access Journals (Sweden)

    Dipak K Sahoo

    Full Text Available Phytophthora sojae Kaufmann and Gerdemann, which causes Phytophthora root rot, is a widespread pathogen that limits soybean production worldwide. Development of Phytophthora resistant cultivars carrying Phytophthora resistance Rps genes is a cost-effective approach in controlling this disease. For this mapping study of a novel Rps gene, 290 recombinant inbred lines (RILs (F7 families were developed by crossing the P. sojae resistant cultivar PI399036 with the P. sojae susceptible AR2 line, and were phenotyped for responses to a mixture of three P. sojae isolates that overcome most of the known Rps genes. Of these 290 RILs, 130 were homozygous resistant, 12 heterzygous and segregating for Phytophthora resistance, and 148 were recessive homozygous and susceptible. From this population, 59 RILs homozygous for Phytophthora sojae resistance and 61 susceptible to a mixture of P. sojae isolates R17 and Val12-11 or P7074 that overcome resistance encoded by known Rps genes mapped to Chromosome 18 were selected for mapping novel Rps gene. A single gene accounted for the 1:1 segregation of resistance and susceptibility among the RILs. The gene encoding the Phytophthora resistance mapped to a 5.8 cM interval between the SSR markers BARCSOYSSR_18_1840 and Sat_064 located in the lower arm of Chromosome 18. The gene is mapped 2.2 cM proximal to the NBSRps4/6-like sequence that was reported to co-segregate with the Phytophthora resistance genes Rps4 and Rps6. The gene is mapped to a highly recombinogenic, gene-rich genomic region carrying several nucleotide binding site-leucine rich repeat (NBS-LRR-like genes. We named this novel gene as Rps12, which is expected to be an invaluable resource in breeding soybeans for Phytophthora resistance.

  3. A murC gene in Porphyromonas gingivalis 381.

    Science.gov (United States)

    Ansai, T; Yamashita, Y; Awano, S; Shibata, Y; Wachi, M; Nagai, K; Takehara, T

    1995-09-01

    The gene encoding a 51 kDa polypeptide of Porphyromonas gingivalis 381 was isolated by immunoblotting using an antiserum raised against P. gingivalis alkaline phosphatase. DNA sequence analysis of a 2.5 kb DNA fragment containing a gene encoding the 51 kDa protein revealed one complete and two incomplete ORFs. Database searches using the FASTA program revealed significant homology between the P. gingivalis 51 kDa protein and the MurC protein of Escherichia coli, which functions in peptidoglycan synthesis. The cloned 51 kDa protein encoded a functional product that complemented an E. coli murC mutant. Moreover, the ORF just upstream of murC coded for a protein that was 31% homologous with the E. coli MurG protein. The ORF just downstream of murC coded for a protein that was 17% homologous with the Streptococcus pneumoniae penicillin-binding protein 2B (PBP2B), which functions in peptidoglycan synthesis and is responsible for antibiotic resistance. These results suggest that P. gingivalis contains a homologue of the E. coli peptidoglycan synthesis gene murC and indicate the possibility of a cluster of genes responsible for cell division and cell growth, as in the E. coli mra region.

  4. upstream region of the myostatin gene in four chicken breeds and its

    African Journals Online (AJOL)

    user

    2012-05-17

    May 17, 2012 ... processing site and a carboxy-terminal region containing nine cysteines ... cultivated meat breed (minitype) and the Youxi chicken is a local breed raised for ..... Allele R was the additive gene on growth traits. Bian chickens ...

  5. Interleaved Product LDPC Codes

    OpenAIRE

    Baldi, Marco; Cancellieri, Giovanni; Chiaraluce, Franco

    2011-01-01

    Product LDPC codes take advantage of LDPC decoding algorithms and the high minimum distance of product codes. We propose to add suitable interleavers to improve the waterfall performance of LDPC decoding. Interleaving also reduces the number of low weight codewords, that gives a further advantage in the error floor region.

  6. Gene-Based Analysis of Regionally Enriched Cortical Genes in GWAS Data Sets of Cognitive Traits and Psychiatric Disorders

    DEFF Research Database (Denmark)

    Ersland, Kari M; Christoforou, Andrea; Stansberg, Christine

    2012-01-01

    the regionally enriched cortical genes to mine a genome-wide association study (GWAS) of the Norwegian Cognitive NeuroGenetics (NCNG) sample of healthy adults for association to nine psychometric tests measures. In addition, we explored GWAS data sets for the serious psychiatric disorders schizophrenia (SCZ) (n...

  7. Regulatory Architecture of Gene Expression Variation in the Threespine Stickleback Gasterosteus aculeatus

    Directory of Open Access Journals (Sweden)

    Victoria L. Pritchard

    2017-01-01

    Full Text Available Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus, an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats.

  8. Identification of distal regulatory regions in the human alpha IIb gene locus necessary for consistent, high-level megakaryocyte expression.

    Science.gov (United States)

    Thornton, Michael A; Zhang, Chunyan; Kowalska, Maria A; Poncz, Mortimer

    2002-11-15

    The alphaIIb/beta3-integrin receptor is present at high levels only in megakaryocytes and platelets. Its presence on platelets is critical for hemostasis. The tissue-specific nature of this receptor's expression is secondary to the restricted expression of alphaIIb, and studies of the alphaIIb proximal promoter have served as a model of a megakaryocyte-specific promoter. We have examined the alphaIIb gene locus for distal regulatory elements. Sequence comparison between the human (h) and murine (m) alphaIIb loci revealed high levels of conservation at intergenic regions both 5' and 3' to the alphaIIb gene. Additionally, deoxyribonuclease (DNase) I sensitivity mapping defined tissue-specific hypersensitive (HS) sites that coincide, in part, with these conserved regions. Transgenic mice containing various lengths of the h(alpha)IIb gene locus, which included or excluded the various conserved/HS regions, demonstrated that the proximal promoter was sufficient for tissue specificity, but that a region 2.5 to 7.1 kb upstream of the h(alpha)IIb gene was necessary for consistent expression. Another region 2.2 to 7.4 kb downstream of the gene enhanced expression 1000-fold and led to levels of h(alpha)IIb mRNA that were about 30% of the native m(alpha)IIb mRNA level. These constructs also resulted in detectable h(alpha)IIb/m(beta)3 on the platelet surface. This work not only confirms the importance of the proximal promoter of the alphaIIb gene for tissue specificity, but also characterizes the distal organization of the alphaIIb gene locus and provides an initial localization of 2 important regulatory regions needed for the expression of the alphaIIb gene at high levels during megakaryopoiesis.

  9. Recurrent vomiting and ethylmalonic aciduria associated with rare mutations in the short-chain acyl-CoA dehydrogenase (SCAD) gene

    DEFF Research Database (Denmark)

    Seidel, J.; Streck, S.; Bellstedt, K.

    2003-01-01

    blood spots. Neither of the frequent SCAD gene variants 625G>A and 511C>T was present, but direct sequencing of the promoter and coding regions of the SCAD gene revealed that the patient had mutations on both alleles: 417G>C (Trpl15Cys) and 1095G>T (Gln341His). Neither mutation has been described before...

  10. Analysis of viral protein-2 encoding gene of avian encephalomyelitis virus from field specimens in Central Java region, Indonesia

    Directory of Open Access Journals (Sweden)

    Aris Haryanto

    2016-01-01

    Full Text Available Aim: Avian encephalomyelitis (AE is a viral disease which can infect various types of poultry, especially chicken. In Indonesia, the incidence of AE infection in chicken has been reported since 2009, the AE incidence tends to increase from year to year. The objective of this study was to analyze viral protein 2 (VP-2 encoding gene of AE virus (AEV from various species of birds in field specimen by reverse transcription polymerase chain reaction (RT-PCR amplification using specific nucleotides primer for confirmation of AE diagnosis. Materials and Methods: A total of 13 AEV samples are isolated from various species of poultry which are serologically diagnosed infected by AEV from some areas in central Java, Indonesia. Research stage consists of virus samples collection from field specimens, extraction of AEV RNA, amplification of VP-2 protein encoding gene by RT-PCR, separation of RT-PCR product by agarose gel electrophoresis, DNA sequencing and data analysis. Results: Amplification products of the VP-2 encoding gene of AEV by RT-PCR methods of various types of poultry from field specimens showed a positive results on sample code 499/4/12 which generated DNA fragment in the size of 619 bp. Sensitivity test of RT-PCR amplification showed that the minimum concentration of RNA template is 127.75 ng/μl. The multiple alignments of DNA sequencing product indicated that positive sample with code 499/4/12 has 92% nucleotide homology compared with AEV with accession number AV1775/07 and 85% nucleotide homology with accession number ZCHP2/0912695 from Genbank database. Analysis of VP-2 gene sequence showed that it found 46 nucleotides difference between isolate 499/4/12 compared with accession number AV1775/07 and 93 nucleotides different with accession number ZCHP2/0912695. Conclusions: Analyses of the VP-2 encoding gene of AEV with RT-PCR method from 13 samples from field specimen generated the DNA fragment in the size of 619 bp from one sample with

  11. Natural type 3/type 2 intertypic vaccine-related poliovirus recombinants with the first crossover sites within the VP1 capsid coding region.

    Science.gov (United States)

    Zhang, Yong; Zhu, Shuangli; Yan, Dongmei; Liu, Guiyan; Bai, Ruyin; Wang, Dongyan; Chen, Li; Zhu, Hui; An, Hongqiu; Kew, Olen; Xu, Wenbo

    2010-12-21

    Ten uncommon natural type 3/type 2 intertypic poliovirus recombinants were isolated from stool specimens from nine acute flaccid paralysis case patients and one healthy vaccinee in China from 2001 to 2008. Complete genomic sequences revealed their vaccine-related genomic features and showed that their first crossover sites were randomly distributed in the 3' end of the VP1 coding region. The length of donor Sabin 2 sequences ranged from 55 to 136 nucleotides, which is the longest donor sequence reported in the literature for this type of poliovirus recombination. The recombination resulted in the introduction of Sabin 2 neutralizing antigenic site 3a (NAg3a) into a Sabin 3 genomic background in the VP1 coding region, which may have been altered by some of the type 3-specific antigenic properties, but had not acquired any type 2-specific characterizations. NAg3a of the Sabin 3 strain seems atypical; other wild-type poliovirus isolates that have circulated in recent years have sequences of NAg3a more like the Sabin 2 strain. 10 natural type 3/type 2 intertypic VP1 capsid-recombinant polioviruses, in which the first crossover sites were found to be in the VP1 coding region, were isolated and characterized. In spite of the complete replacement of NAg3a by type 2-specific amino acids, the serotypes of the recombinants were not altered, and they were totally neutralized by polyclonal type 3 antisera but not at all by type 2 antisera. It is possible that recent type 3 wild poliovirus isolates may be a recombinant having NAg3a sequences derived from another strain during between 1967 and 1980, and the type 3/type 2 recombination events in the 3' end of the VP1 coding region may result in a higher fitness.

  12. Natural type 3/type 2 intertypic vaccine-related poliovirus recombinants with the first crossover sites within the VP1 capsid coding region.

    Directory of Open Access Journals (Sweden)

    Yong Zhang

    Full Text Available BACKGROUND: Ten uncommon natural type 3/type 2 intertypic poliovirus recombinants were isolated from stool specimens from nine acute flaccid paralysis case patients and one healthy vaccinee in China from 2001 to 2008. PRINCIPAL FINDINGS: Complete genomic sequences revealed their vaccine-related genomic features and showed that their first crossover sites were randomly distributed in the 3' end of the VP1 coding region. The length of donor Sabin 2 sequences ranged from 55 to 136 nucleotides, which is the longest donor sequence reported in the literature for this type of poliovirus recombination. The recombination resulted in the introduction of Sabin 2 neutralizing antigenic site 3a (NAg3a into a Sabin 3 genomic background in the VP1 coding region, which may have been altered by some of the type 3-specific antigenic properties, but had not acquired any type 2-specific characterizations. NAg3a of the Sabin 3 strain seems atypical; other wild-type poliovirus isolates that have circulated in recent years have sequences of NAg3a more like the Sabin 2 strain. CONCLUSIONS: 10 natural type 3/type 2 intertypic VP1 capsid-recombinant polioviruses, in which the first crossover sites were found to be in the VP1 coding region, were isolated and characterized. In spite of the complete replacement of NAg3a by type 2-specific amino acids, the serotypes of the recombinants were not altered, and they were totally neutralized by polyclonal type 3 antisera but not at all by type 2 antisera. It is possible that recent type 3 wild poliovirus isolates may be a recombinant having NAg3a sequences derived from another strain during between 1967 and 1980, and the type 3/type 2 recombination events in the 3' end of the VP1 coding region may result in a higher fitness.

  13. Expression of an Aspergillus niger Phytase Gene (phyA) in Saccharomyces cerevisiae

    OpenAIRE

    Han, Yanming; Wilson, David B.; Lei, Xin gen

    1999-01-01

    Phytase improves the bioavailability of phytate phosphorus in plant foods to humans and animals and reduces phosphorus pollution of animal waste. Our objectives were to express an Aspergillus niger phytase gene (phyA) in Saccharomyces cerevisiae and to determine the effects of glycosylation on the phytase’s activity and thermostability. A 1.4-kb DNA fragment containing the coding region of the phyA gene was inserted into the expression vector pYES2 and was expressed in S. cerevisiae as an act...

  14. Gene Expression Data from the Moon Jelly, Aurelia, Provide Insights into the Evolution of the Combinatorial Code Controlling Animal Sense Organ Development.

    Directory of Open Access Journals (Sweden)

    Nagayasu Nakanishi

    Full Text Available In Bilateria, Pax6, Six, Eya and Dach families of transcription factors underlie the development and evolution of morphologically and phyletically distinct eyes, including the compound eyes in Drosophila and the camera-type eyes in vertebrates, indicating that bilaterian eyes evolved under the strong influence of ancestral developmental gene regulation. However the conservation in eye developmental genetics deeper in the Eumetazoa, and the origin of the conserved gene regulatory apparatus controlling eye development remain unclear due to limited comparative developmental data from Cnidaria. Here we show in the eye-bearing scyphozoan cnidarian Aurelia that the ectodermal photosensory domain of the developing medusa sensory structure known as the rhopalium expresses sine oculis (so/six1/2 and eyes absent/eya, but not optix/six3/6 or pax (A&B. In addition, the so and eya co-expression domain encompasses the region of active cell proliferation, neurogenesis, and mechanoreceptor development in rhopalia. Consistent with the role of so and eya in rhopalial development, developmental transcriptome data across Aurelia life cycle stages show upregulation of so and eya, but not optix or pax (A&B, during medusa formation. Moreover, pax6 and dach are absent in the Aurelia genome, and thus are not required for eye development in Aurelia. Our data are consistent with so and eya, but not optix, pax or dach, having conserved functions in sensory structure specification across Eumetazoa. The lability of developmental components including Pax genes relative to so-eya is consistent with a model of sense organ development and evolution that involved the lineage specific modification of a combinatorial code that specifies animal sense organs.

  15. Haplotypes and Sequence Variation in the Ovine Adiponectin Gene (ADIPOQ

    Directory of Open Access Journals (Sweden)

    Qing-Ming An

    2015-11-01

    Full Text Available The adiponectin gene (ADIPOQ plays an important role in energy homeostasis. In this study five separate regions (regions 1 to 5 of ovine ADIPOQ were analysed using PCR-SSCP. Four different PCR-SSCP patterns (A1-D1, A2-D2 were detected in region-1 and region-2, respectively, with seven and six SNPs being revealed. In region-3, three different patterns (A3-C3 and three SNPs were observed. Two patterns (A4-B4, A5-B5 and two and one SNPs were observed in region-4 and region-5, respectively. In total, nineteen SNPs were detected, with five of them in the coding region and two (c.46T/C and c.515G/A putatively resulting in amino acid changes (p.Tyr16His and p.Lys172Arg. In region-1, -2 and -3 of 316 sheep from eight New Zealand breeds, variants A1, A2 and A3 were the most common, although variant frequencies differed in the eight breeds. Across region-1 and region-3, nine haplotypes were identified and haplotypes A1-A3, A1-C3, B1-A3 and B1-C3 were most common. These results indicate that the ADIPOQ gene is polymorphic and suggest that further analysis is required to see if the variation in the gene is associated with animal production traits.

  16. Utilization of genetic tests: analysis of gene-specific billing in Medicare claims data.

    Science.gov (United States)

    Lynch, Julie A; Berse, Brygida; Dotson, W David; Khoury, Muin J; Coomer, Nicole; Kautter, John

    2017-08-01

    We examined the utilization of precision medicine tests among Medicare beneficiaries through analysis of gene-specific tier 1 and 2 billing codes developed by the American Medical Association in 2012. We conducted a retrospective cross-sectional study. The primary source of data was 2013 Medicare 100% fee-for-service claims. We identified claims billed for each laboratory test, the number of patients tested, expenditures, and the diagnostic codes indicated for testing. We analyzed variations in testing by patient demographics and region of the country. Pharmacogenetic tests were billed most frequently, accounting for 48% of the expenditures for new codes. The most common indications for testing were breast cancer, long-term use of medications, and disorders of lipid metabolism. There was underutilization of guideline-recommended tumor mutation tests (e.g., epidermal growth factor receptor) and substantial overutilization of a test discouraged by guidelines (methylenetetrahydrofolate reductase). Methodology-based tier 2 codes represented 15% of all claims billed with the new codes. The highest rate of testing per beneficiary was in Mississippi and the lowest rate was in Alaska. Gene-specific billing codes significantly improved our ability to conduct population-level research of precision medicine. Analysis of these data in conjunction with clinical records should be conducted to validate findings.Genet Med advance online publication 26 January 2017.

  17. Genetic variation of the Borrelia burgdorferi gene vlsE involves cassette-specific, segmental gene conversion.

    Science.gov (United States)

    Zhang, J R; Norris, S J

    1998-08-01

    The Lyme disease spirochete Borrelia burgdorferi possesses 15 silent vls cassettes and a vls expression site (vlsE) encoding a surface-exposed lipoprotein. Segments of the silent vls cassettes have been shown to recombine with the vlsE cassette region in the mammalian host, resulting in combinatorial antigenic variation. Despite promiscuous recombination within the vlsE cassette region, the 5' and 3' coding sequences of vlsE that flank the cassette region are not subject to sequence variation during these recombination events. The segments of the silent vls cassettes recombine in the vlsE cassette region through a unidirectional process such that the sequence and organization of the silent vls loci are not affected. As a result of recombination, the previously expressed segments are replaced by incoming segments and apparently degraded. These results provide evidence for a gene conversion mechanism in VlsE antigenic variation.

  18. Structure of the gene for human β2-adrenergic receptor: expression and promoter characterization

    International Nuclear Information System (INIS)

    Emorine, L.J.; Marullo, S.; Delavier-Klutchko, C.; Kaveri, S.V.; Durieu-Trautmann, O.; Strosberg, A.D.

    1987-01-01

    The genomic gene coding for the human β 2 -adrenergic receptor (β 2 AR) from A431 epidermoid cells has been isolated. Transfection of the gene into eukaryotic cells restores a fully active receptor/GTP-binding protein/adenylate cyclase complex with β 2 AR properties. Southern blot analyses with β 2 AR-specific probes show that a single β 2 AR gene is common to various human tissues and that its flanking sequences are highly conserved among humans and between man and rabbit, mouse, and hamster. Functional significance of these regions is supported by the presence of a promoter region (including mRNA cap sites, two TATA boxes, a CAAT box, and three G + C-rich regions that resemble binding sites for transcription factor Sp1) 200-300 base pairs 5' to the translation initiation codon. In the 3' flanking region, sequences homologous to glucocorticoid-response elements might be responsible for the increased expression of the β 2 AR gene observed after treatment of the transfected cells with hydrocortisone. In addition, 5' to the promoter region, an open reading frame encodes a 251-residue polypeptide that displays striking homologies with protein kinases and other nucleotide-binding proteins

  19. Hypothyroidism coordinately and transiently affects myelin protein gene expression in most rat brain regions during postnatal development.

    Science.gov (United States)

    Ibarrola, N; Rodríguez-Peña, A

    1997-03-28

    To assess the role of thyroid hormone on myelin gene expression, we have studied the effect of hypothyroidism on the mRNA steady state levels for the major myelin protein genes: myelin basic protein (MBP), proteolipid protein (PLP), myelin-associated glycoprotein (MAG) and 2':3'-cyclic nucleotide 3'-phosphodiesterase (CNP) in different rat brain regions, during the first postnatal month. We found that hypothyroidism reduces the levels of every myelin protein transcript, with striking differences between the different brain regions. Thus, in the more caudal regions, the effect of hypothyroidism was extremely modest, being only evident at the earlier stages of myelination. In contrast, in the striatum and the cerebral cortex the important decrease in the myelin protein transcripts is maintained beyond the first postnatal month. Therefore, thyroid hormone modulates in a synchronous fashion the expression of the myelin genes and the length of its effect depends on the brain region. On the other hand, hyperthyroidism leads to an increase of the major myelin protein transcripts above control values. Finally, lack of thyroid hormone does not change the expression of the oligodendrocyte progenitor-specific gene, the platelet derived growth factor receptor alpha.

  20. The complete mitogenome of the whale shark parasitic copepod Pandarus rhincodonicus norman, Newbound & Knott (Crustacea; Siphonostomatoida; Pandaridae)--a new gene order for the copepoda.

    Science.gov (United States)

    Austin, Christopher M; Tan, Mun Hua; Lee, Yin Peng; Croft, Laurence J; Meekan, Mark G; Pierce, Simon J; Gan, Han Ming

    2016-01-01

    The complete mitochondrial genome of the parasitic copepod Pandarus rhincodonicus was obtained from a partial genome scan using the HiSeq sequencing system. The Pandarus rhincodonicus mitogenome has 14,480 base pairs (62% A+T content) made up of 12 protein-coding genes, 2 ribosomal subunit genes, 22 transfer RNAs, and a putative 384 bp non-coding AT-rich region. This Pandarus mitogenome sequence is the first for the family Pandaridae, the second for the order Siphonostomatoida and the sixth for the Copepoda.

  1. Limitations of mitochondrial gene barcoding in Octocorallia.

    Science.gov (United States)

    McFadden, Catherine S; Benayahu, Yehuda; Pante, Eric; Thoma, Jana N; Nevarez, P Andrew; France, Scott C

    2011-01-01

    The widespread assumption that COI and other mitochondrial genes will be ineffective DNA barcodes for anthozoan cnidarians has not been well tested for most anthozoans other than scleractinian corals. Here we examine the limitations of mitochondrial gene barcoding in the sub-class Octocorallia, a large, diverse, and ecologically important group of anthozoans. Pairwise genetic distance values (uncorrected p) were compared for three candidate barcoding regions: the Folmer region of COI; a fragment of the octocoral-specific mitochondrial protein-coding gene, msh1; and an extended barcode of msh1 plus COI with a short, adjacent intergenic region (igr1). Intraspecific variation was barcodes, and there was no discernible barcoding gap between intra- and interspecific p values. In a case study to assess regional octocoral biodiversity, COI and msh1 barcodes each identified 70% of morphospecies. In a second case study, a nucleotide character-based analysis correctly identified 70% of species in the temperate genus Alcyonium. Although interspecific genetic distances were 2× greater for msh1 than COI, each marker identified similar numbers of species in the two case studies, and the extended COI + igr1 + msh1 barcode more effectively discriminated sister taxa in Alcyonium. Although far from perfect for species identification, a COI + igr1 + msh1 barcode nonetheless represents a valuable addition to the depauperate set of characters available for octocoral taxonomy. © 2010 Blackwell Publishing Ltd.

  2. Estradiol-Induced Transcriptional Regulation of Long Non-Coding RNA, HOTAIR.

    Science.gov (United States)

    Bhan, Arunoday; Mandal, Subhrangsu S

    2016-01-01

    HOTAIR (HOX antisense intergenic RNA) is a 2.2 kb long non-coding RNA (lncRNA), transcribed from the antisense strand of homeobox C (HOXC) gene locus in chromosome 12. HOTAIR acts as a scaffolding lncRNA. It interacts and guides various chromatin-modifying complexes such as PRC2 (polycomb-repressive complex 2) and LSD1 (lysine-specific demethylase 1) to the target gene promoters leading to their gene silencing. Various studies have demonstrated that HOTAIR overexpression is associated with breast cancer. Recent studies from our laboratory demonstrate that HOTAIR is required for viability of breast cancer cells and is transcriptionally regulated by estradiol (E2) in vitro and in vivo. This chapter describes protocols for analysis of the HOTAIR promoter, cloning, transfection and dual luciferase assays, knockdown of protein synthesis by antisense oligonucleotides, and chromatin immunoprecipitation (ChIP) assay. These protocols are useful for studying the estrogen-mediated transcriptional regulation of lncRNA HOTAIR, as well as other protein coding genes and non-coding RNAs.

  3. CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC

    Energy Technology Data Exchange (ETDEWEB)

    Congrains, Ada; Kamide, Kei [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Katsuya, Tomohiro [Clinical Gene Therapy, Osaka University Graduate School of Medicine (Japan); Yasuda, Osamu [Department of Cardiovascular Clinical and Translational Research, Kumamoto University Hospital (Japan); Oguro, Ryousuke; Yamamoto, Koichi [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Ohishi, Mitsuru, E-mail: ohishi@geriat.med.osaka-u.ac.jp [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Rakugi, Hiromi [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan)

    2012-03-23

    Highlights: Black-Right-Pointing-Pointer ANRIL maps in the strongest susceptibility locus for cardiovascular disease. Black-Right-Pointing-Pointer Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. Black-Right-Pointing-Pointer The effects of ANRIL on gene expression are splicing variant specific. Black-Right-Pointing-Pointer ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. -- Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have been linking polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of this non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the implication of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knock down. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling, by modulating the expression of genes involved in cell proliferation, apoptosis, extra-cellular matrix remodeling and inflammatory response to finally impact in the risk of cardiovascular disease and other pathologies.

  4. Genetic organization of the unc-22 IV gene and the adjacent region in Caenorhabditis elegans.

    Science.gov (United States)

    Rogalski, T M; Baillie, D L

    1985-01-01

    The genetic organization of the region immediately adjacent to the unc-22 IV gene in Caenorhabditis elegans has been studied. We have identified twenty essential genes in this interval of approximately 1.5-map units on Linkage Group IV. The mutations that define these genes were positioned by recombination mapping and complementation with several deficiencies. With few exceptions, the positions obtained by these two methods agreed. Eight of the twenty essential genes identified are represented by more than one allele. Three possible internal deletions of the unc-22 gene have been located by intra-genic mapping. In addition, the right end point of a deficiency or an inversion affecting the adjacent genes let-56 and unc-22 has been positioned inside the unc-22 gene.

  5. Translational regulation of gene expression by an anaerobically induced small non-coding RNA in Escherichia coli

    DEFF Research Database (Denmark)

    Boysen, Anders; Møller-Jensen, Jakob; Kallipolitis, Birgitte H.

    2010-01-01

    Small non-coding RNAs (sRNA) have emerged as important elements of gene regulatory circuits. In enterobacteria such as Escherichia coli and Salmonella many of these sRNAs interact with the Hfq protein, an RNA chaperone similar to mammalian Sm-like proteins and act in the post...... that adaptation to anaerobic growth involves the action of a small regulatory RNA....... of at least one sRNA regulator. Here, we extend this view by the identification and characterization of a highly conserved, anaerobically induced small sRNA in E. coli, whose expression is strictly dependent on the anaerobic transcriptional fumarate and nitrate reductase regulator (FNR). The sRNA, named Fnr...

  6. Functional Analysis of Promoter Region from Eel Cytochrome P450 1A1 Gene in Transgenic Medaka.

    Science.gov (United States)

    Ogino; Itakura; Kato; Aoki; Sato

    1999-07-01

    : Transcription of the CYP1A1 genes in mammals and fish is stimulated by polyaromatic hydrocarbons. DNA sequencing analysis revealed that CYP1A1 gene in eel (Anguilla japonica) contains two kinds of putative cis-acting regulatory elements, XRE (xenobiotic-responsive element) and ERE (estrogen-responsive element). XRE is known as the enhancer that is responsible for the inducibility of the genes of CYP1A1 and some other drug-metabolizing enzymes. In the eel CYP1A1 gene, XRE motifs are distributed as follows: five times in the region from -2136 to -1125 bp, XRE(-6) to (-2); once in the proximal basal promoter region, XRE(-1); and once in the first intron, XRE(+1). The region between XRE(-2) and XRE(-1) contains three ERE motifs. To investigate the function of the cis-acting regulatory elements in the eel CYP1A1 gene, recombinant plasmids prepared with its 5' upstream sequence and the structural gene for luciferase were microinjected into fertilized eggs of medaka at the one-cell stage. Hatched fry were treated with 3-methylcholanthrene, and the transcription efficiency was assayed using competitive polymerase chain reaction analysis. Deletion of the region containing the five XREs, XRE(-6) to XRE(-2), and the point mutation of XRE(-1) reduced the inducible expressions by 75% and 56%, respectively, showing apparent dependency of the drug induction on the XREs. Constitutive expression, however, was not significantly affected by deletion or disruption of the XREs. When the region between XRE(-2) and XRE(-1) containing no XREs but three ERE motifs was internally deleted, the inducible expression and the constitutive expression were reduced by 88% and 75%, respectively. Replacement of this region with a partial fragment of eel CYP1A1 complementary DNA, with slight alteration of the distance between the five XREs and XRE(-1), reduced the inducible expression and the constitutive expression by 91% and 60%, respectively. These results strongly suggest that not only XRE but

  7. Lightweight Object Tracking in Compressed Video Streams Demonstrated in Region-of-Interest Coding

    Directory of Open Access Journals (Sweden)

    Lerouge Sam

    2007-01-01

    Full Text Available Video scalability is a recent video coding technology that allows content providers to offer multiple quality versions from a single encoded video file in order to target different kinds of end-user devices and networks. One form of scalability utilizes the region-of-interest concept, that is, the possibility to mark objects or zones within the video as more important than the surrounding area. The scalable video coder ensures that these regions-of-interest are received by an end-user device before the surrounding area and preferably in higher quality. In this paper, novel algorithms are presented making it possible to automatically track the marked objects in the regions of interest. Our methods detect the overall motion of a designated object by retrieving the motion vectors calculated during the motion estimation step of the video encoder. Using this knowledge, the region-of-interest is translated, thus following the objects within. Furthermore, the proposed algorithms allow adequate resizing of the region-of-interest. By using the available information from the video encoder, object tracking can be done in the compressed domain and is suitable for real-time and streaming applications. A time-complexity analysis is given for the algorithms proving the low complexity thereof and the usability for real-time applications. The proposed object tracking methods are generic and can be applied to any codec that calculates the motion vector field. In this paper, the algorithms are implemented within MPEG-4 fine-granularity scalability codec. Different tests on different video sequences are performed to evaluate the accuracy of the methods. Our novel algorithms achieve a precision up to 96.4 .

  8. Lightweight Object Tracking in Compressed Video Streams Demonstrated in Region-of-Interest Coding

    Directory of Open Access Journals (Sweden)

    Rik Van de Walle

    2007-01-01

    Full Text Available Video scalability is a recent video coding technology that allows content providers to offer multiple quality versions from a single encoded video file in order to target different kinds of end-user devices and networks. One form of scalability utilizes the region-of-interest concept, that is, the possibility to mark objects or zones within the video as more important than the surrounding area. The scalable video coder ensures that these regions-of-interest are received by an end-user device before the surrounding area and preferably in higher quality. In this paper, novel algorithms are presented making it possible to automatically track the marked objects in the regions of interest. Our methods detect the overall motion of a designated object by retrieving the motion vectors calculated during the motion estimation step of the video encoder. Using this knowledge, the region-of-interest is translated, thus following the objects within. Furthermore, the proposed algorithms allow adequate resizing of the region-of-interest. By using the available information from the video encoder, object tracking can be done in the compressed domain and is suitable for real-time and streaming applications. A time-complexity analysis is given for the algorithms proving the low complexity thereof and the usability for real-time applications. The proposed object tracking methods are generic and can be applied to any codec that calculates the motion vector field. In this paper, the algorithms are implemented within MPEG-4 fine-granularity scalability codec. Different tests on different video sequences are performed to evaluate the accuracy of the methods. Our novel algorithms achieve a precision up to 96.4%.

  9. Characterization of class II alpha genes and DLA-D region allelic associations in the dog.

    Science.gov (United States)

    Sarmiento, U M; Storb, R F

    1988-10-01

    Human major histocompatibility complex (HLA) cDNA probes were used to analyze the restriction fragment length polymorphism (RFLP) of the alpha genes of the DLA-D region in dogs. Genomic DNA from peripheral blood leucocytes of 23 unrelated DLA-D homozygous dogs representing nine DLA-D types (defined by mixed leucocyte reaction) was digested with restriction enzymes (BamHI, EcoRI, Hind III, Pvu II, Taq I, Rsa I, Msp I, Pst I and Bgl II), separated by agarose gel electrophoresis and transferred onto Biotrace membrane. The Southern blots were successively hybridized with radiolabelled HLA cDNA probes corresponding to DQ, DP, DZ and DR alpha genes. Clear evidence was obtained for the canine homologues of DQ and DR alpha genes with simple bi- or tri-allelic polymorphism respectively. Evidence for a single, nonpolymorphic DP alpha gene was also obtained. However, the presence of a DZ alpha gene could not be clearly demonstrated in canine genomic DNA. This report extends our previous RFLP analysis documenting polymorphism of DLA class II beta genes in the same panel of homozygous typing cell dogs, and provides the basis for DLA-D genotyping at a population level. This study also characterizes the RFLP-defined preferential allelic associations across the DLA-D region in nine different homozygous typing cell specificities.

  10. Zebrafish homologs of genes within 16p11.2, a genomic region associated with brain disorders, are active during brain development, and include two deletion dosage sensor genes

    Directory of Open Access Journals (Sweden)

    Alicia Blaker-Lee

    2012-11-01

    Deletion or duplication of one copy of the human 16p11.2 interval is tightly associated with impaired brain function, including autism spectrum disorders (ASDs, intellectual disability disorder (IDD and other phenotypes, indicating the importance of gene dosage in this copy number variant region (CNV. The core of this CNV includes 25 genes; however, the number of genes that contribute to these phenotypes is not known. Furthermore, genes whose functional levels change with deletion or duplication (termed ‘dosage sensors’, which can associate the CNV with pathologies, have not been identified in this region. Using the zebrafish as a tool, a set of 16p11.2 homologs was identified, primarily on chromosomes 3 and 12. Use of 11 phenotypic assays, spanning the first 5 days of development, demonstrated that this set of genes is highly active, such that 21 out of the 22 homologs tested showed loss-of-function phenotypes. Most genes in this region were required for nervous system development – impacting brain morphology, eye development, axonal density or organization, and motor response. In general, human genes were able to substitute for the fish homolog, demonstrating orthology and suggesting conserved molecular pathways. In a screen for 16p11.2 genes whose function is sensitive to hemizygosity, the aldolase a (aldoaa and kinesin family member 22 (kif22 genes were identified as giving clear phenotypes when RNA levels were reduced by ∼50%, suggesting that these genes are deletion dosage sensors. This study leads to two major findings. The first is that the 16p11.2 region comprises a highly active set of genes, which could present a large genetic target and might explain why multiple brain function, and other, phenotypes are associated with this interval. The second major finding is that there are (at least two genes with deletion dosage sensor properties among the 16p11.2 set, and these could link this CNV to brain disorders such as ASD and IDD.

  11. Sequence and transcription analysis of the human cytomegalovirus DNA polymerase gene

    International Nuclear Information System (INIS)

    Kouzarides, T.; Bankier, A.T.; Satchwell, S.C.; Weston, K.; Tomlinson, P.; Barrell, B.G.

    1987-01-01

    DNA sequence analysis has revealed that the gene coding for the human cytomegalovirus (HCMV) DNA polymerase is present within the long unique region of the virus genome. Identification is based on extensive amino acid homology between the predicted HCMV open reading frame HFLF2 and the DNA polymerase of herpes simplex virus type 1. The authors present here a 5280 base-pair DNA sequence containing the HCMV pol gene, along with the analysis of transcripts encoded within this region. Since HCMV pol also shows homology to the predicted Epstein-Barr virus pol, they were able to analyze the extent of homology between the DNA polymerases of three distantly related herpes viruses, HCMV, Epstein-Barr virus, and herpes simplex virus. The comparison shows that these DNA polymerases exhibit considerable amino acid homology and highlights a number of highly conserved regions; two such regions show homology to sequences within the adenovirus type 2 DNA polymerase. The HCMV pol gene is flanked by open reading frames with homology to those of other herpes viruses; upstream, there is a reading frame homologous to the glycoprotein B gene of herpes simplex virus type I and Epstein-Barr virus, and downstream there is a reading frame homologous to BFLF2 of Epstein-Barr virus

  12. Prediction and analysis of three gene families related to leaf rust (Puccinia triticina) resistance in wheat (Triticum aestivum L.).

    Science.gov (United States)

    Peng, Fred Y; Yang, Rong-Cai

    2017-06-20

    The resistance to leaf rust (Lr) caused by Puccinia triticina in wheat (Triticum aestivum L.) has been well studied over the past decades with over 70 Lr genes being mapped on different chromosomes and numerous QTLs (quantitative trait loci) being detected or mapped using DNA markers. Such resistance is often divided into race-specific and race-nonspecific resistance. The race-nonspecific resistance can be further divided into resistance to most or all races of the same pathogen and resistance to multiple pathogens. At the molecular level, these three types of resistance may cover across the whole spectrum of pathogen specificities that are controlled by genes encoding different protein families in wheat. The objective of this study is to predict and analyze genes in three such families: NBS-LRR (nucleotide-binding sites and leucine-rich repeats or NLR), START (Steroidogenic Acute Regulatory protein [STaR] related lipid-transfer) and ABC (ATP-Binding Cassette) transporter. The focus of the analysis is on the patterns of relationships between these protein-coding genes within the gene families and QTLs detected for leaf rust resistance. We predicted 526 ABC, 1117 NLR and 144 START genes in the hexaploid wheat genome through a domain analysis of wheat proteome. Of the 1809 SNPs from leaf rust resistance QTLs in seedling and adult stages of wheat, 126 SNPs were found within coding regions of these genes or their neighborhood (5 Kb upstream from transcription start site [TSS] or downstream from transcription termination site [TTS] of the genes). Forty-three of these SNPs for adult resistance and 18 SNPs for seedling resistance reside within coding or neighboring regions of the ABC genes whereas 14 SNPs for adult resistance and 29 SNPs for seedling resistance reside within coding or neighboring regions of the NLR gene. Moreover, we found 17 nonsynonymous SNPs for adult resistance and five SNPs for seedling resistance in the ABC genes, and five nonsynonymous SNPs for

  13. Nucleotide sequence of a human tRNA gene heterocluster

    International Nuclear Information System (INIS)

    Chang, Y.N.; Pirtle, I.L.; Pirtle, R.M.

    1986-01-01

    Leucine tRNA from bovine liver was used as a hybridization probe to screen a human gene library harbored in Charon-4A of bacteriophage lambda. The human DNA inserts from plaque-pure clones were characterized by restriction endonuclease mapping and Southern hybridization techniques, using both [3'- 32 P]-labeled bovine liver leucine tRNA and total tRNA as hybridization probes. An 8-kb Hind III fragment of one of these γ-clones was subcloned into the Hind III site of pBR322. Subsequent fine restriction mapping and DNA sequence analysis of this plasmid DNA indicated the presence of four tRNA genes within the 8-kb DNA fragment. A leucine tRNA gene with an anticodon of AAG and a proline tRNA gene with an anticodon of AGG are in a 1.6-kb subfragment. A threonine tRNA gene with an anticodon of UGU and an as yet unidentified tRNA gene are located in a 1.1-kb subfragment. These two different subfragments are separated by 2.8 kb. The coding regions of the three sequenced genes contain characteristic internal split promoter sequences and do not have intervening sequences. The 3'-flanking region of these three genes have typical RNA polymerase III termination sites of at least four consecutive T residues

  14. Cloning and molecular evolution of the aldehyde dehydrogenase 2 gene (Aldh2) in bats (Chiroptera).

    Science.gov (United States)

    Chen, Yao; Shen, Bin; Zhang, Junpeng; Jones, Gareth; He, Guimei

    2013-02-01

    Old World fruit bats (Pteropodidae) and New World fruit bats (Phyllostomidae) ingest significant quantities of ethanol while foraging. Mitochondrial aldehyde dehydrogenase (ALDH2, encoded by the Aldh2 gene) plays an important role in ethanol metabolism. To test whether the Aldh2 gene has undergone adaptive evolution in frugivorous and nectarivorous bats in relation to ethanol elimination, we sequenced part of the coding region of the gene (1,143 bp, ~73 % coverage) in 14 bat species, including three Old World fruit bats and two New World fruit bats. Our results showed that the Aldh2 coding sequences are highly conserved across all bat species we examined, and no evidence of positive selection was detected in the ancestral branches leading to Old World fruit bats and New World fruit bats. Further research is needed to determine whether other genes involved in ethanol metabolism have been the targets of positive selection in frugivorous and nectarivorous bats.

  15. Regional mapping of the phenylalanine hydroxylase gene and the phenylketonuria locus in the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Lidsky, A.S.; Law, M.L.; Morse, H.G.; Kao, F.T.; Rabin, M.; Ruddle, F.H.; Woo, S.L.C.

    1985-09-01

    Phenylketonuria (PKU) is an autosomal recessive disorder of amino acid metabolism caused by a deficiency of the hepatic enzyme phenylalanine hydroxylase. To define the regional map position of the disease locus and the PAH gene on human chromosome 12, DNA was isolated from human-hamster somatic cell hybrids with various deletions of human chromosome 12 and was analyzed by Southern blot analysis using the human cDNA PAH clone as a hybridization probe. From these results, together with detailed biochemical and cytogenetic characterization of the hybrid cells, the region on chromosome 12 containing the human PAH gene has been defined as 12q14.3..-->..qter. The PAH map position on chromosome 12 was further localized by in situ hybridization of /sup 125/I-labeled human PAH cDNA to chromosomes prepared from a human lymphoblastoid cell line. Results of these experiments demonstrated that the region on chromosome 12 containing the PAH gene and the PKU locus in man is 12q22..-->..12q24.1. These results not only provide a regionalized map position for a major human disease locus but also can serve as a reference point for linkage analysis with other DNA markers on human chromosome 12.

  16. Regional mapping of the phenylalanine hydroxylase gene and the phenylketonuria locus in the human genome

    International Nuclear Information System (INIS)

    Lidsky, A.S.; Law, M.L.; Morse, H.G.; Kao, F.T.; Rabin, M.; Ruddle, F.H.; Woo, S.L.C.

    1985-01-01

    Phenylketonuria (PKU) is an autosomal recessive disorder of amino acid metabolism caused by a deficiency of the hepatic enzyme phenylalanine hydroxylase. To define the regional map position of the disease locus and the PAH gene on human chromosome 12, DNA was isolated from human-hamster somatic cell hybrids with various deletions of human chromosome 12 and was analyzed by Southern blot analysis using the human cDNA PAH clone as a hybridization probe. From these results, together with detailed biochemical and cytogenetic characterization of the hybrid cells, the region on chromosome 12 containing the human PAH gene has been defined as 12q14.3→qter. The PAH map position on chromosome 12 was further localized by in situ hybridization of 125 I-labeled human PAH cDNA to chromosomes prepared from a human lymphoblastoid cell line. Results of these experiments demonstrated that the region on chromosome 12 containing the PAH gene and the PKU locus in man is 12q22→12q24.1. These results not only provide a regionalized map position for a major human disease locus but also can serve as a reference point for linkage analysis with other DNA markers on human chromosome 12

  17. Gene divergence of homeologous regions associated with a major seed protein content QTL in soybean

    Directory of Open Access Journals (Sweden)

    Puji eLestari

    2013-06-01

    Full Text Available Understanding several modes of duplication contributing on the present genome structure is getting an attention because it could be related to numerous agronomically important traits. Since soybean serves as a rich protein source for animal feeds and human consumption, breeding efforts in soybean have been directed toward enhancing seed protein content. The publicly available soybean sequences and its genomically featured elements facilitate comprehending of quantitative trait loci (QTL for seed protein content in concordance with homeologous regions in soybean genome. Although parts of chromosome (Chr 20 and Chr 10 showed synteny, QTLs for seed protein content present only on Chr 20. Using comparative analysis of gene contents in recently duplicated genomic regions harboring QTL for protein/oil content on Chrs 20 and 10, a total of 27 genes are present in duplicated regions of both chromosomes. Notably, 4 tandem duplicates of the putative homeobox protein 22 (HB22 are present only on Chr 20 and this Medicago truncatula homolog expressed in endosperm at seed filling stage. These tandem duplicates could contribute on the protein/oil QTL of Chr 20. Our study suggests that non-shared gene contents within the duplicated genomic regions might lead to absence/presence of QTL related to protein/oil content.

  18. Short-lived long non-coding RNAs as surrogate indicators for chemical exposure and LINC00152 and MALAT1 modulate their neighboring genes.

    Directory of Open Access Journals (Sweden)

    Hidenori Tani

    Full Text Available Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs. Although accumulating evidence demonstrates that lncRNAs play important roles in regulating gene expression, the detailed mechanisms of action of most lncRNAs remain unclear. We previously reported that a novel class of lncRNAs with a short half-life (t1/2 < 4 h in HeLa cells, termed short-lived non-coding transcripts (SLiTs, are closely associated with physiological and pathological functions. In this study, we focused on 26 SLiTs and nuclear-enriched abundant lncRNA, MALAT1(t1/2 of 7.6 h in HeLa cells in neural stem cells (NSCs derived from human induced pluripotent stem cells, and identified four SLiTs (TUG1, GAS5, FAM222-AS1, and SNHG15 that were affected by the following typical chemical stresses (oxidative stress, heavy metal stress and protein synthesis stress. We also found the expression levels of LINC00152 (t1/2 of 2.1 h in NSCs, MALAT1 (t1/2 of 1.8 h in NSCs, and their neighboring genes were elevated proportionally to the chemical doses. Moreover, we confirmed that the overexpression of LINC00152 or MALAT1 upregulated the expressions of their neighboring genes even in the absence of chemical stress. These results reveal that LINC00152 and MALAT1 modulate their neighboring genes, and thus provide a deeper understanding of the functions of lncRNAs.

  19. Mutation in the B chain coding region is associated with impaired proinsulin conversion in a family with hyperproinsulinemia

    International Nuclear Information System (INIS)

    Chan, S.J.; Seino, S.; Gruppuso, P.A.; Schwartz, R.; Steiner, D.F.

    1987-01-01

    A family has recently been described in which hyperproinsulinemia is inherited in an autosomal dominant pattern, suggesting a structural abnormality in the proinsulin molecule as the basis for this disorder. However, unlike two previous kindreds with a similar syndrome, the serum proinsulin-like material in this family did not appear to be an intermediate conversion product but instead behaved like normal human proinsulin by several criteria. To further characterize this disorder the authors isolated and sequenced the insulin gene of the propositus. Leukocyte DNA was cloned in λgt-WES and recombinants containing the two insulin alleles, λMD41 and λMD51, were isolated by plaque hybridization. DNA sequencing of λMD51 showed that it contained the normal coding sequence for human preproinsulin. Sequence analysis of λMD41, however, revealed a single nucleotide substitution in the codon for residue 10 of proinsulin (CAC → GAC) that predicts the exchange of aspartic acid for histidine in the insulin B chain region. This mutation was also found in an insulin allele cloned from a second affected family member (propositus's father). These results strongly implicate this mutation as the cause of the hyperproinsulinemia in this family. Inhibition of the conversion of proinsulin to insulin may be related to altered folding and/or self-association properties of the [Asp 10 ]proinsulin

  20. Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing.

    Science.gov (United States)

    Zhang, Rui; Deng, Patricia; Jacobson, Dionna; Li, Jin Billy

    2017-02-01

    Adenosine-to-inosine RNA editing diversifies the transcriptome and promotes functional diversity, particularly in the brain. A plethora of editing sites has been recently identified; however, how they are selected and regulated and which are functionally important are largely unknown. Here we show the cis-regulation and stepwise selection of RNA editing during Drosophila evolution and pinpoint a large number of functional editing sites. We found that the establishment of editing and variation in editing levels across Drosophila species are largely explained and predicted by cis-regulatory elements. Furthermore, editing events that arose early in the species tree tend to be more highly edited in clusters and enriched in slowly-evolved neuronal genes, thus suggesting that the main role of RNA editing is for fine-tuning neurological functions. While nonsynonymous editing events have been long recognized as playing a functional role, in addition to nonsynonymous editing sites, a large fraction of 3'UTR editing sites is evolutionarily constrained, highly edited, and thus likely functional. We find that these 3'UTR editing events can alter mRNA stability and affect miRNA binding and thus highlight the functional roles of noncoding RNA editing. Our work, through evolutionary analyses of RNA editing in Drosophila, uncovers novel insights of RNA editing regulation as well as its functions in both coding and non-coding regions.

  1. Function and Application Areas in Medicine of Non-Coding RNA

    Directory of Open Access Journals (Sweden)

    Figen Guzelgul

    2009-06-01

    Full Text Available RNA is the genetic material converting the genetic code that it gets from DNA into protein. While less than 2 % of RNA is converted into protein , more than 98 % of it can not be converted into protein and named as non-coding RNAs. 70 % of noncoding RNAs consists of introns , however, the rest part of them consists of exons. Non-coding RNAs are examined in two classes according to their size and functions. Whereas they are classified as long non-coding and small non-coding RNAs according to their size , they are grouped as housekeeping non-coding RNAs and regulating non-coding RNAs according to their function. For long years ,these non-coding RNAs have been considered as non-functional. However, today, it has been proved that these non-coding RNAs play role in regulating genes and in structural, functional and catalitic roles of RNAs converted into protein. Due to its taking a role in gene silencing mechanism, particularly in medical world , non-coding RNAs have led to significant developments. RNAi technolgy , which is used in designing drugs to be used in treatment of various diseases , is a ray of hope for medical world. [Archives Medical Review Journal 2009; 18(3.000: 141-155

  2. Methylation of miRNA genes and oncogenesis.

    Science.gov (United States)

    Loginov, V I; Rykov, S V; Fridman, M V; Braga, E A

    2015-02-01

    Interaction between microRNA (miRNA) and messenger RNA of target genes at the posttranscriptional level provides fine-tuned dynamic regulation of cell signaling pathways. Each miRNA can be involved in regulating hundreds of protein-coding genes, and, conversely, a number of different miRNAs usually target a structural gene. Epigenetic gene inactivation associated with methylation of promoter CpG-islands is common to both protein-coding genes and miRNA genes. Here, data on functions of miRNAs in development of tumor-cell phenotype are reviewed. Genomic organization of promoter CpG-islands of the miRNA genes located in inter- and intragenic areas is discussed. The literature and our own results on frequency of CpG-island methylation in miRNA genes from tumors are summarized, and data regarding a link between such modification and changed activity of miRNA genes and, consequently, protein-coding target genes are presented. Moreover, the impact of miRNA gene methylation on key oncogenetic processes as well as affected signaling pathways is discussed.

  3. Deciphering the genetic regulatory code using an inverse error control coding framework.

    Energy Technology Data Exchange (ETDEWEB)

    Rintoul, Mark Daniel; May, Elebeoba Eni; Brown, William Michael; Johnston, Anna Marie; Watson, Jean-Paul

    2005-03-01

    We have found that developing a computational framework for reconstructing error control codes for engineered data and ultimately for deciphering genetic regulatory coding sequences is a challenging and uncharted area that will require advances in computational technology for exact solutions. Although exact solutions are desired, computational approaches that yield plausible solutions would be considered sufficient as a proof of concept to the feasibility of reverse engineering error control codes and the possibility of developing a quantitative model for understanding and engineering genetic regulation. Such evidence would help move the idea of reconstructing error control codes for engineered and biological systems from the high risk high payoff realm into the highly probable high payoff domain. Additionally this work will impact biological sensor development and the ability to model and ultimately develop defense mechanisms against bioagents that can be engineered to cause catastrophic damage. Understanding how biological organisms are able to communicate their genetic message efficiently in the presence of noise can improve our current communication protocols, a continuing research interest. Towards this end, project goals include: (1) Develop parameter estimation methods for n for block codes and for n, k, and m for convolutional codes. Use methods to determine error control (EC) code parameters for gene regulatory sequence. (2) Develop an evolutionary computing computational framework for near-optimal solutions to the algebraic code reconstruction problem. Method will be tested on engineered and biological sequences.

  4. Transcriptome-Derived Tetranucleotide Microsatellites and Their Associated Genes from the Giant Panda (Ailuropoda melanoleuca).

    Science.gov (United States)

    Song, Xuhao; Shen, Fujun; Huang, Jie; Huang, Yan; Du, Lianming; Wang, Chengdong; Fan, Zhenxin; Hou, Rong; Yue, Bisong; Zhang, Xiuyue

    2016-09-01

    Recently, an increasing number of microsatellites or simple sequence repeats (SSRs) have been found and characterized from transcriptomes. Such SSRs can be employed as putative functional markers to easily tag corresponding genes, which play an important role in biomedical studies and genetic analysis. However, the transcriptome-derived SSRs for giant panda (Ailuropoda melanoleuca) are not yet available. In this work, we identified and characterized 20 tetranucleotide microsatellite loci from a transcript database generated from the blood of giant panda. Furthermore, we assigned their predicted transcriptome locations: 16 loci were assigned to untranslated regions (UTRs) and 4 loci were assigned to coding regions (CDSs). Gene identities of 14 transcripts contained corresponding microsatellites were determined, which provide useful information to study the potential contribution of SSRs to gene regulation in giant panda. The polymorphic information content (PIC) values ranged from 0.293 to 0.789 with an average of 0.603 for the 16 UTRs-derived SSRs. Interestingly, 4 CDS-derived microsatellites developed in our study were also polymorphic, and the instability of these 4 CDS-derived SSRs was further validated by re-genotyping and sequencing. The genes containing these 4 CDS-derived SSRs were embedded with various types of repeat motifs. The interaction of all the length-changing SSRs might provide a way against coding region frameshift caused by microsatellite instability. We hope these newly gene-associated biomarkers will pave the way for genetic and biomedical studies for giant panda in the future. In sum, this set of transcriptome-derived markers complements the genetic resources available for giant panda. © The American Genetic Association. 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  5. Investigation of mutations in the SRY, SOX9, and DAX1 genes in sex reversal patients from the Sichuan region of China.

    Science.gov (United States)

    Chen, L; Ding, X P; Wei, X; Li, L X

    2014-03-12

    We investigated the molecular genetic mechanism of sex reversal by exploring the relationship between mutations in the sex-determining genes SRY, SOX9, and DAX1 with genetic sex reversal disease. Mutations in the three key genes were detected by polymerase chain reaction (PCR) and sequencing after karyotype analysis. The mutations detected were then aligned with a random sample of 100 normal sequences and the NCBI sequence database in order to confirm any new mutations. Furthermore, the copy number of SOX9 was measured by fluorescence quantitative PCR. Seven of the 10 male sex reversal patients (46, XX) contained an excess copy of the SRY gene, while one of the eight female sex reversal patients (46, XY) was lacking the SRY gene. Additionally, a new mutation (T-A, Asp24Lys) was detected in one female sex reversal patient (46, XY). No other mutation was detected in the analysis of SOX9 and DAX1, with the exception of an insertion mutation (c.35377791insG) found in the testicular-specific enhancer (TESCO) sequences in an SRY-positive female sex reversal patient (46, XY). Eight of the 18 sex reversal cases (44.4%) showed obvious connections with SRY gene translocations, mutations, or deletions, which was significantly higher than that reported previously (33.3%), indicating a need to further expand the range of sample collection. Overall, these results indicated that the main mechanism of sex reversal are not associated with mutations in the coding regions of SOX9 and DAX1 or copy number variations of SOX9, which is consistent with results of previous studies.

  6. Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.

    Science.gov (United States)

    Li, Yifeng; Shi, Wenqiang; Wasserman, Wyeth W

    2018-05-31

    In the human genome, 98% of DNA sequences are non-protein-coding regions that were previously disregarded as junk DNA. In fact, non-coding regions host a variety of cis-regulatory regions which precisely control the expression of genes. Thus, Identifying active cis-regulatory regions in the human genome is critical for understanding gene regulation and assessing the impact of genetic variation on phenotype. The developments of high-throughput sequencing and machine learning technologies make it possible to predict cis-regulatory regions genome wide. Based on rich data resources such as the Encyclopedia of DNA Elements (ENCODE) and the Functional Annotation of the Mammalian Genome (FANTOM) projects, we introduce DECRES based on supervised deep learning approaches for the identification of enhancer and promoter regions in the human genome. Due to their ability to discover patterns in large and complex data, the introduction of deep learning methods enables a significant advance in our knowledge of the genomic locations of cis-regulatory regions. Using models for well-characterized cell lines, we identify key experimental features that contribute to the predictive performance. Applying DECRES, we delineate locations of 300,000 candidate enhancers genome wide (6.8% of the genome, of which 40,000 are supported by bidirectional transcription data), and 26,000 candidate promoters (0.6% of the genome). The predicted annotations of cis-regulatory regions will provide broad utility for genome interpretation from functional genomics to clinical applications. The DECRES model demonstrates potentials of deep learning technologies when combined with high-throughput sequencing data, and inspires the development of other advanced neural network models for further improvement of genome annotations.

  7. Decoding the non-coding genome: elucidating genetic risk outside the coding genome.

    Science.gov (United States)

    Barr, C L; Misener, V L

    2016-01-01

    Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.

  8. An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

    Directory of Open Access Journals (Sweden)

    Ru Huang

    Full Text Available Imprinted macro non-protein-coding (nc RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues: differentiated ES cells and fetal head using an optimized RNA-Seq strategy. The data produced is highly reproducible in different sequencing locations and is able to detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose length ranges between 80-118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.

  9. Influenza NA and PB1 Gene Segments Interact during the Formation of Viral Progeny: Localization of the Binding Region within the PB1 Gene

    Directory of Open Access Journals (Sweden)

    Brad Gilbertson

    2016-08-01

    Full Text Available The influenza A virus genome comprises eight negative-sense viral RNAs (vRNAs that form individual ribonucleoprotein (RNP complexes. In order to incorporate a complete set of each of these vRNAs, the virus uses a selective packaging mechanism that facilitates co-packaging of specific gene segments but whose molecular basis is still not fully understood. Recently, we used a competitive transfection model where plasmids encoding the A/Puerto Rico/8/34 (PR8 and A/Udorn/307/72 (Udorn PB1 gene segments were competed to show that the Udorn PB1 gene segment is preferentially co-packaged into progeny virions with the Udorn NA gene segment. Here we created chimeric PB1 genes combining both Udorn and PR8 PB1 sequences to further define the location within the Udorn PB1 gene that drives co-segregation of these genes and show that nucleotides 1776–2070 of the PB1 gene are crucial for preferential selection. In vitro assays examining specific interactions between Udorn NA vRNA and purified vRNAs transcribed from chimeric PB1 genes also supported the importance of this region in the PB1-NA interaction. Hence, this work identifies an association between viral genes that are co-selected during packaging. It also reveals a region potentially important in the RNP-RNP interactions within the supramolecular complex that is predicted to form prior to budding to allow one of each segment to be packaged in the viral progeny. Our study lays the foundation to understand the co-selection of specific genes, which may be critical to the emergence of new viruses with pandemic potential.

  10. PSP: rapid identification of orthologous coding genes under positive selection across multiple closely related prokaryotic genomes.

    Science.gov (United States)

    Su, Fei; Ou, Hong-Yu; Tao, Fei; Tang, Hongzhi; Xu, Ping

    2013-12-27

    With genomic sequences of many closely related bacterial strains made available by deep sequencing, it is now possible to investigate trends in prokaryotic microevolution. Positive selection is a sub-process of microevolution, in which a particular mutation is favored, causing the allele frequency to continuously shift in one direction. Wide scanning of prokaryotic genomes has shown that positive selection at the molecular level is much more frequent than expected. Genes with significant positive selection may play key roles in bacterial adaption to different environmental pressures. However, selection pressure analyses are computationally intensive and awkward to configure. Here we describe an open access web server, which is designated as PSP (Positive Selection analysis for Prokaryotic genomes) for performing evolutionary analysis on orthologous coding genes, specially designed for rapid comparison of dozens of closely related prokaryotic genomes. Remarkably, PSP facilitates functional exploration at the multiple levels by assignments and enrichments of KO, GO or COG terms. To illustrate this user-friendly tool, we analyzed Escherichia coli and Bacillus cereus genomes and found that several genes, which play key roles in human infection and antibiotic resistance, show significant evidence of positive selection. PSP is freely available to all users without any login requirement at: http://db-mml.sjtu.edu.cn/PSP/. PSP ultimately allows researchers to do genome-scale analysis for evolutionary selection across multiple prokaryotic genomes rapidly and easily, and identify the genes undergoing positive selection, which may play key roles in the interactions of host-pathogen and/or environmental adaptation.

  11. Nuclear scaffold attachment sites within ENCODE regions associate with actively transcribed genes.

    Directory of Open Access Journals (Sweden)

    Mignon A Keaton

    2011-03-01

    Full Text Available The human genome must be packaged and organized in a functional manner for the regulation of DNA replication and transcription. The nuclear scaffold/matrix, consisting of structural and functional nuclear proteins, remains after extraction of nuclei and anchors loops of DNA. In the search for cis-elements functioning as chromatin domain boundaries, we identified 453 nuclear scaffold attachment sites purified by lithium-3,5-iodosalicylate extraction of HeLa nuclei across 30 Mb of the human genome studied by the ENCODE pilot project. The scaffold attachment sites mapped predominately near expressed genes and localized near transcription start sites and the ends of genes but not to boundary elements. In addition, these regions were enriched for RNA polymerase II and transcription factor binding sites and were located in early replicating regions of the genome. We believe these sites correspond to genome-interactions mediated by transcription factors and transcriptional machinery immobilized on a nuclear substructure.

  12. Structural and functional characterization of the exonuclease I (sbcB) gene and gene product from Escherichia coli and a Markov chain analysis of DNA sequences

    International Nuclear Information System (INIS)

    Phillips, G.J.

    1987-01-01

    The nucleotide sequence for the structural gene for exonuclease I (sbcB) from Escherichia coli was determined. Two putative promotes for this gene were identified and were predicted to have weak transcription initiation activity. In addition, the sbcB coding region contains many non-optimal codons. These observations are consistent with the suggestions that sbcB is a poorly expressed gene. Several mutant exonuclease I genes were cloned onto pBR322 plasmids. These genes represented both sbcB and xonA mutation. One of the xonA mutation (xonA6) was associated with a 1.2-kb insertion of an IS-30 related mobile genetic element in the 3'-region of the gene. Two of the mutations (xonA2 and xonA6) encode unstable polypeptides. Determination of exonucleolytic activity on single-stranded DNA from cell extracts containing each of the cloned mutant genes revealed no correlation between residual exonucleolytic activity and the pheno-types of sbcB and xonA mutants. A proposal that the exonuclease I protein contains an additional activity besides its ability to degrade single-stranded DNA is presented. Characterization of E. coli strains which overproduce exonuclease I showed increased sensitivity to UV irradiation

  13. Pathway Detection from Protein Interaction Networks and Gene Expression Data Using Color-Coding Methods and A* Search Algorithms

    Directory of Open Access Journals (Sweden)

    Cheng-Yu Yeh

    2012-01-01

    Full Text Available With the large availability of protein interaction networks and microarray data supported, to identify the linear paths that have biological significance in search of a potential pathway is a challenge issue. We proposed a color-coding method based on the characteristics of biological network topology and applied heuristic search to speed up color-coding method. In the experiments, we tested our methods by applying to two datasets: yeast and human prostate cancer networks and gene expression data set. The comparisons of our method with other existing methods on known yeast MAPK pathways in terms of precision and recall show that we can find maximum number of the proteins and perform comparably well. On the other hand, our method is more efficient than previous ones and detects the paths of length 10 within 40 seconds using CPU Intel 1.73GHz and 1GB main memory running under windows operating system.

  14. Adaptive Evolution Coupled with Retrotransposon Exaptation Allowed for the Generation of a Human-Protein-Specific Coding Gene That Promotes Cancer Cell Proliferation and Metastasis in Both Haematological Malignancies and Solid Tumours: The Extraordinary Case of MYEOV Gene

    Directory of Open Access Journals (Sweden)

    Spyros I. Papamichos

    2015-01-01

    Full Text Available The incidence of cancer in human is high as compared to chimpanzee. However previous analysis has documented that numerous human cancer-related genes are highly conserved in chimpanzee. Till date whether human genome includes species-specific cancer-related genes that could potentially contribute to a higher cancer susceptibility remains obscure. This study focuses on MYEOV, an oncogene encoding for two protein isoforms, reported as causally involved in promoting cancer cell proliferation and metastasis in both haematological malignancies and solid tumours. First we document, via stringent in silico analysis, that MYEOV arose de novo in Catarrhini. We show that MYEOV short-isoform start codon was evolutionarily acquired after Catarrhini/Platyrrhini divergence. Throughout the course of Catarrhini evolution MYEOV acquired a gradually elongated translatable open reading frame (ORF, a gradually shortened translation-regulatory upstream ORF, and alternatively spliced mRNA variants. A point mutation introduced in human allowed for the acquisition of MYEOV long-isoform start codon. Second, we demonstrate the precious impact of exonized transposable elements on the creation of MYEOV gene structure. Third, we highlight that the initial part of MYEOV long-isoform coding DNA sequence was under positive selection pressure during Catarrhini evolution. MYEOV represents a Primate Orphan Gene that acquired, via ORF expansion, a human-protein-specific coding potential.

  15. EXPANDA-75: one-dimensional diffusion code for multi-region plate lattice heterogeneous system

    International Nuclear Information System (INIS)

    Kikuchi, Yasuyuki; Katsuragi, Satoru; Suzuki, Tomoo; Ogitsu, Makoto.

    1975-08-01

    An advanced treatment has been developed for analyzing a multi-region plate lattice heterogeneous system using the coarse group constants set provided for a homogeneous system. The essential points of this treatment are modification of effective admixture cross sections and improvement of effective elastic removal cross sections. By this treatment the heterogeneity effects for flux distributions and effective cross sections in the unit cell can be reproduced accurately in comparison with the ultra fine group treatment which consumes huge amounts of computing time. Based on the present treatment and using the JAERI-Fast set, a one-dimensional diffusion code, EXPANDA-75, was developed for extensive use for analyses of fast critical experiments. The user's guide is also presented in this report. (auth.)

  16. Nucleotide sequence of the human N-myc gene

    International Nuclear Information System (INIS)

    Stanton, L.W.; Schwab, M.; Bishop, J.M.

    1986-01-01

    Human neuroblastomas frequently display amplification and augmented expression of a gene known as N-myc because of its similarity to the protooncogene c-myc. It has therefore been proposed that N-myc is itself a protooncogene, and subsequent tests have shown that N-myc and c-myc have similar biological activities in cell culture. The authors have now detailed the kinship between N-myc and c-myc by determining the nucleotide sequence of human N-myc and deducing the amino acid sequence of the protein encoded by the gene. The topography of N-myc is strikingly similar to that of c-myc: both genes contain three exons of similar lengths; the coding elements of both genes are located in the second and third exons; and both genes have unusually long 5' untranslated regions in their mRNAs, with features that raise the possibility that expression of the genes may be subject to similar controls of translation. The resemblance between the proteins encoded by N-myc and c-myc sustains previous suspicions that the genes encode related functions

  17. Technical advances in trigger-induced RNA interference gene silencing in the parasite Entamoeba histolytica.

    Science.gov (United States)

    Khalil, Mohamed I; Foda, Bardees M; Suresh, Susmitha; Singh, Upinder

    2016-03-01

    Entamoeba histolytica has a robust endogenous RNA interference (RNAi) pathway. There are abundant 27 nucleotide (nt) anti-sense small RNAs (AS sRNAs) that target genes for silencing and the genome encodes many genes involved in the RNAi pathway such as Argonaute proteins. Importantly, an E. histolytica gene with numerous AS sRNAs can function as a "trigger" to induce silencing of a gene that is fused to the trigger. Thus, the amebic RNAi pathway regulates gene expression relevant to amebic biology and has additionally been harnessed as a tool for genetic manipulation. In this study we have further improved the trigger-induced gene silencing method. We demonstrate that rather than using the full-length gene, a short portion of the coding region fused to a trigger is sufficient to induce silencing; the first 537 bp of the E. histolytica rhomboid gene (EhROM1) fused in-frame to the trigger was sufficient to silence EhROM1. We also demonstrated that the trigger method could silence two amebic genes concomitantly; fusion of the coding regions of EhROM1 and transcription factor, EhMyb, in-frame to a trigger gene resulted in both genes being silenced. Alternatively, two genes can be silenced sequentially: EhROM1-silenced parasites with no drug selection plasmid were transfected with trigger-EhMyb, resulting in parasites with both EhROM1 and EhMyb silenced. With all approaches tested, the trigger-mediated silencing was substantive and silencing was maintained despite loss of the G418 selectable marker. All gene silencing was associated with generation of AS sRNAs to the silenced gene. We tested the reversibility of the trigger system using inhibitors of histone modifications but found that the silencing was highly stable. This work represents a technical advance in the trigger gene silencing method in E. histolytica. Approaches that readily silence multiple genes add significantly to the genetic toolkit available to the ameba research community. Copyright © 2016

  18. Origination of an X-linked testes chimeric gene by illegitimate recombination in Drosophila.

    Directory of Open Access Journals (Sweden)

    J Roman Arguello

    2006-05-01

    Full Text Available The formation of chimeric gene structures provides important routes by which novel proteins and functions are introduced into genomes. Signatures of these events have been identified in organisms from wide phylogenic distributions. However, the ability to characterize the early phases of these evolutionary processes has been difficult due to the ancient age of the genes or to the limitations of strictly computational approaches. While examples involving retrotransposition exist, our understanding of chimeric genes originating via illegitimate recombination is limited to speculations based on ancient genes or transfection experiments. Here we report a case of a young chimeric gene that has originated by illegitimate recombination in Drosophila. This gene was created within the last 2-3 million years, prior to the speciation of Drosophila simulans, Drosophila sechellia, and Drosophila mauritiana. The duplication, which involved the Bällchen gene on Chromosome 3R, was partial, removing substantial 3' coding sequence. Subsequent to the duplication onto the X chromosome, intergenic sequence was recruited into the protein-coding region creating a chimeric peptide with approximately 33 new amino acid residues. In addition, a novel intron-containing 5' UTR and novel 3' UTR evolved. We further found that this new X-linked gene has evolved testes-specific expression. Following speciation of the D. simulans complex, this novel gene evolved lineage-specifically with evidence for positive selection acting along the D. simulans branch.

  19. Non-coding RNAs and epigenome: de novo DNA methylation, allelic exclusion and X-inactivation

    Directory of Open Access Journals (Sweden)

    V. A. Halytskiy

    2013-12-01

    Full Text Available Non-coding RNAs are widespread class of cell RNAs. They participate in many important processes in cells – signaling, posttranscriptional silencing, protein biosynthesis, splicing, maintenance of genome stability, telomere lengthening, X-inactivation. Nevertheless, activity of these RNAs is not restricted to posttranscriptional sphere, but cover also processes that change or maintain the epigenetic information. Non-coding RNAs can directly bind to the DNA targets and cause their repression through recruitment of DNA methyltransferases as well as chromatin modifying enzymes. Such events constitute molecular mechanism of the RNA-dependent DNA methylation. It is possible, that the RNA-DNA interaction is universal mechanism triggering DNA methylation de novo. Allelic exclusion can be also based on described mechanism. This phenomenon takes place, when non-coding RNA, which precursor is transcribed from one allele, triggers DNA methylation in all other alleles present in the cell. Note, that miRNA-mediated transcriptional silencing resembles allelic exclusion, because both miRNA gene and genes, which can be targeted by this miRNA, contain elements with the same sequences. It can be assumed that RNA-dependent DNA methylation and allelic exclusion originated with the purpose of counteracting the activity of mobile genetic elements. Probably, thinning and deregulation of the cellular non-coding RNA pattern allows reactivation of silent mobile genetic elements resulting in genome instability that leads to ageing and carcinogenesis. In the course of X-inactivation, DNA methylation and subsequent hete­rochromatinization of X chromosome can be triggered by direct hybridization of 5′-end of large non-coding RNA Xist with DNA targets in remote regions of the X chromosome.

  20. Burnup code for fuel assembly by Monte Carlo code. MKENO-BURN

    International Nuclear Information System (INIS)

    Naito, Yoshitaka; Suyama, Kenya; Masukawa, Fumihiro; Matsumoto, Kiyoshi; Kurosawa, Masayoshi; Kaneko, Toshiyuki.

    1996-12-01

    The evaluation of neutron spectrum is so important for burnup calculation of the heterogeneous geometry like recent BWR fuel assembly. MKENO-BURN is a multi dimensional burnup code that based on the three dimensional monte carlo neutron transport code 'MULTI-KENO' and the routine for the burnup calculation of the one dimensional burnup code 'UNITBURN'. MKENO-BURN analyzes the burnup problem of arbitrary regions after evaluating the neutron spectrum and making one group cross section in three dimensional geometry with MULTI-KENO. It enables us to do three dimensional burnup calculation. This report consists of general description of MKENO-BURN and the input data. (author)