WorldWideScience

Sample records for repeat protein gene

  1. DNA triplet repeats mediate heterochromatin-protein-1-sensitive variegated gene silencing.

    Science.gov (United States)

    Saveliev, Alexander; Everett, Christopher; Sharpe, Tammy; Webster, Zoë; Festenstein, Richard

    2003-04-24

    Gene repression is crucial to the maintenance of differentiated cell types in multicellular organisms, whereas aberrant silencing can lead to disease. The organization of DNA into chromatin and heterochromatin is implicated in gene silencing. In chromatin, DNA wraps around histones, creating nucleosomes. Further condensation of chromatin, associated with large blocks of repetitive DNA sequences, is known as heterochromatin. Position effect variegation (PEV) occurs when a gene is located abnormally close to heterochromatin, silencing the affected gene in a proportion of cells. Here we show that the relatively short triplet-repeat expansions found in myotonic dystrophy and Friedreich's ataxia confer variegation of expression on a linked transgene in mice. Silencing was correlated with a decrease in promoter accessibility and was enhanced by the classical PEV modifier heterochromatin protein 1 (HP1). Notably, triplet-repeat-associated variegation was not restricted to classical heterochromatic regions but occurred irrespective of chromosomal location. Because the phenomenon described here shares important features with PEV, the mechanisms underlying heterochromatin-mediated silencing might have a role in gene regulation at many sites throughout the mammalian genome and modulate the extent of gene silencing and hence severity in several triplet-repeat diseases.

  2. Expansion of protein domain repeats.

    Directory of Open Access Journals (Sweden)

    Asa K Björklund

    2006-08-01

    Full Text Available Many proteins, especially in eukaryotes, contain tandem repeats of several domains from the same family. These repeats have a variety of binding properties and are involved in protein-protein interactions as well as binding to other ligands such as DNA and RNA. The rapid expansion of protein domain repeats is assumed to have evolved through internal tandem duplications. However, the exact mechanisms behind these tandem duplications are not well-understood. Here, we have studied the evolution, function, protein structure, gene structure, and phylogenetic distribution of domain repeats. For this purpose we have assigned Pfam-A domain families to 24 proteomes with more sensitive domain assignments in the repeat regions. These assignments confirmed previous findings that eukaryotes, and in particular vertebrates, contain a much higher fraction of proteins with repeats compared with prokaryotes. The internal sequence similarity in each protein revealed that the domain repeats are often expanded through duplications of several domains at a time, while the duplication of one domain is less common. Many of the repeats appear to have been duplicated in the middle of the repeat region. This is in strong contrast to the evolution of other proteins that mainly works through additions of single domains at either terminus. Further, we found that some domain families show distinct duplication patterns, e.g., nebulin domains have mainly been expanded with a unit of seven domains at a time, while duplications of other domain families involve varying numbers of domains. Finally, no common mechanism for the expansion of all repeats could be detected. We found that the duplication patterns show no dependence on the size of the domains. Further, repeat expansion in some families can possibly be explained by shuffling of exons. However, exon shuffling could not have created all repeats.

  3. Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins

    Science.gov (United States)

    Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.

    2016-01-01

    Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422

  4. The diversity and evolution of Wolbachia ankyrin repeat domain genes.

    Directory of Open Access Journals (Sweden)

    Stefanos Siozios

    Full Text Available Ankyrin repeat domain-encoding genes are common in the eukaryotic and viral domains of life, but they are rare in bacteria, the exception being a few obligate or facultative intracellular Proteobacteria species. Despite having a reduced genome, the arthropod strains of the alphaproteobacterium Wolbachia contain an unusually high number of ankyrin repeat domain-encoding genes ranging from 23 in wMel to 60 in wPip strain. This group of genes has attracted considerable attention for their astonishing large number as well as for the fact that ankyrin proteins are known to participate in protein-protein interactions, suggesting that they play a critical role in the molecular mechanism that determines host-Wolbachia symbiotic interactions. We present a comparative evolutionary analysis of the wMel-related ankyrin repeat domain-encoding genes present in different Drosophila-Wolbachia associations. Our results show that the ankyrin repeat domain-encoding genes change in size by expansion and contraction mediated by short directly repeated sequences. We provide examples of intra-genic recombination events and show that these genes are likely to be horizontally transferred between strains with the aid of bacteriophages. These results confirm previous findings that the Wolbachia genomes are evolutionary mosaics and illustrate the potential that these bacteria have to generate diversity in proteins potentially involved in the symbiotic interactions.

  5. Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants

    OpenAIRE

    Liu, Ping-Li; Du, Liang; Huang, Yuan; Gao, Shu-Min; Yu, Meng

    2017-01-01

    Background Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases in plants and play crucial roles in development and stress responses. The evolutionary relationships among LRR-RLK genes have been investigated in flowering plants; however, no comprehensive studies have been performed for these genes in more ancestral groups. The subfamily classification of LRR-RLK genes in plants, the evolutionary history and driving force for the evolution...

  6. Superfamily of ankyrin repeat proteins in tomato.

    Science.gov (United States)

    Yuan, Xiaowei; Zhang, Shizhong; Qing, Xiaohe; Sun, Meihong; Liu, Shiyang; Su, Hongyan; Shu, Huairui; Li, Xinzheng

    2013-07-10

    The ankyrin repeat (ANK) protein family plays a crucial role in plant growth and development and in response to biotic and abiotic stresses. However, no detailed information concerning this family is available for tomato (Solanum lycopersicum) due to the limited information on whole genome sequences. In this study, we identified a total of 130 ANK genes in tomato genome (SlANK), and these genes were distributed across all 12 chromosomes at various densities. And chromosomal localizations of SlANK genes indicated 25 SlANK genes were involved in tandem duplications. Based on their domain composition, all of the SlANK proteins were grouped into 13 subgroups. A combined phylogenetic tree was constructed with the aligned SlANK protein sequences. This tree revealed that the SlANK proteins comprise five major groups. An analysis of the expression profiles of SlANK genes in tomato in different tissues and in response to stresses showed that the SlANK proteins play roles in plant growth, development and stress responses. To our knowledge, this is the first report of a genome-wide analysis of the tomato ANK gene family. This study provides valuable information regarding the classification and putative functions of SlANK genes in tomato. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.

  7. Repeat-associated plasticity in the Helicobacter pylori RD gene family.

    Science.gov (United States)

    Shak, Joshua R; Dick, Jonathan J; Meinersmann, Richard J; Perez-Perez, Guillermo I; Blaser, Martin J

    2009-11-01

    The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3' region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5' region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host.

  8. Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

    Energy Technology Data Exchange (ETDEWEB)

    Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others

    1994-09-01

    Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3{prime} UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated n DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3{prime}UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3{prime}UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.

  9. Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family▿ †

    Science.gov (United States)

    Shak, Joshua R.; Dick, Jonathan J.; Meinersmann, Richard J.; Perez-Perez, Guillermo I.; Blaser, Martin J.

    2009-01-01

    The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3′ region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5′ region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host. PMID:19749042

  10. Bovine proteins containing poly-glutamine repeats are often polymorphic and enriched for components of transcriptional regulatory complexes

    LENUS (Irish Health Repository)

    Whan, Vicki

    2010-11-23

    Abstract Background About forty human diseases are caused by repeat instability mutations. A distinct subset of these diseases is the result of extreme expansions of polymorphic trinucleotide repeats; typically CAG repeats encoding poly-glutamine (poly-Q) tracts in proteins. Polymorphic repeat length variation is also apparent in human poly-Q encoding genes from normal individuals. As these coding sequence repeats are subject to selection in mammals, it has been suggested that normal variations in some of these typically highly conserved genes are implicated in morphological differences between species and phenotypic variations within species. At present, poly-Q encoding genes in non-human mammalian species are poorly documented, as are their functions and propensities for polymorphic variation. Results The current investigation identified 178 bovine poly-Q encoding genes (Q ≥ 5) and within this group, 26 genes with orthologs in both human and mouse that did not contain poly-Q repeats. The bovine poly-Q encoding genes typically had ubiquitous expression patterns although there was bias towards expression in epithelia, brain and testes. They were also characterised by unusually large sizes. Analysis of gene ontology terms revealed that the encoded proteins were strongly enriched for functions associated with transcriptional regulation and many contributed to physical interaction networks in the nucleus where they presumably act cooperatively in transcriptional regulatory complexes. In addition, the coding sequence CAG repeats in some bovine genes impacted mRNA splicing thereby generating unusual transcriptional diversity, which in at least one instance was tissue-specific. The poly-Q encoding genes were prioritised using multiple criteria for their likelihood of being polymorphic and then the highest ranking group was experimentally tested for polymorphic variation within a cattle diversity panel. Extensive and meiotically stable variation was identified

  11. Topological characteristics of helical repeat proteins

    NARCIS (Netherlands)

    Groves, M R; Barford, D

    The recent elucidation of protein structures based upon repeating amino acid motifs, including the armadillo motif, the HEAT motif and tetratricopeptide repeats, reveals that they belong to the class of helical repeat proteins. These proteins share the common property of being assembled from tandem

  12. Identification of genes containing expanded purine repeats in the human genome and their apparent protective role against cancer.

    Science.gov (United States)

    Singh, Himanshu Narayan; Rajeswari, Moganty R

    2016-01-01

    Purine repeat sequences present in a gene are unique as they have high propensity to form unusual DNA-triple helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA-triplexes formed by purine repeats. The purpose of this study was to recognize the expanded purine repeats (EPRs) in human genome and find their correlation with cancer pathogenesis. We developed "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized for searching repeat lengths, n ≥ 200. A total of 1158 EPRs were identified in the genome which followed Wakeby distribution. Two hundred and ninety-six EPRs were found in geneic regions of 282 genes (EPR-genes). Gene clustering of EPR-genes was done based on their cellular function and a large number of EPR-genes were found to be enzymes/enzyme modulators. Meta-analysis of 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly in breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that identification of some EPR-gene products in vital cell signaling-mediated pathways, together suggest the crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins throws a new dimension in the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of present results using techniques like NGS is required to establish the role of the EPR genes in cancer pathology.

  13. The Pentapeptide Repeat Proteins

    OpenAIRE

    Vetting, Matthew W.; Hegde, Subray S.; Fajardo, J. Eduardo; Fiser, Andras; Roderick, Steven L.; Takiff, Howard E.; Blanchard, John S.

    2006-01-01

    The Pentapeptide Repeat Protein (PRP) family has over 500 members in the prokaryotic and eukaryotic kingdoms. These proteins are composed of, or contain domains composed of, tandemly repeated amino acid sequences with a consensus sequence of [S,T,A,V][D,N][L,F]-[S,T,R][G]. The biochemical function of the vast majority of PRP family members is unknown. The three-dimensional structure of the first member of the PRP family was determined for the fluoroquinolone resistance protein (MfpA) from Myc...

  14. WD-repeat instability and diversification of the Podospora anserina hnwd non-self recognition gene family.

    Science.gov (United States)

    Chevanne, Damien; Saupe, Sven J; Clavé, Corinne; Paoletti, Mathieu

    2010-05-06

    Genes involved in non-self recognition and host defence are typically capable of rapid diversification and exploit specialized genetic mechanism to that end. Fungi display a non-self recognition phenomenon termed heterokaryon incompatibility that operates when cells of unlike genotype fuse and leads to the cell death of the fusion cell. In the fungus Podospora anserina, three genes controlling this allorecognition process het-d, het-e and het-r are paralogs belonging to the same hnwd gene family. HNWD proteins are STAND proteins (signal transduction NTPase with multiple domains) that display a WD-repeat domain controlling recognition specificity. Based on genomic sequence analysis of different P. anserina isolates, it was established that repeat regions of all members of the gene family are extremely polymorphic and undergoing concerted evolution arguing for frequent recombination within and between family members. Herein, we directly analyzed the genetic instability and diversification of this allorecognition gene family. We have constituted a collection of 143 spontaneous mutants of the het-R (HNWD2) and het-E (hnwd5) genes with altered recognition specificities. The vast majority of the mutants present rearrangements in the repeat arrays with deletions, duplications and other modifications as well as creation of novel repeat unit variants. We investigate the extreme genetic instability of these genes and provide a direct illustration of the diversification strategy of this eukaryotic allorecognition gene family.

  15. Identification of Pentatricopeptide Repeat Proteins in the Model Organism Dictyostelium discoideum

    Directory of Open Access Journals (Sweden)

    Sam Manna

    2013-01-01

    Full Text Available Pentatricopeptide repeat (PPR proteins are RNA binding proteins with functions in organelle RNA metabolism. They are found in all eukaryotes but have been most extensively studied in plants. We report on the identification of 12 PPR-encoding genes in the genome of the protist Dictyostelium discoideum, with potential homologs in other members of the same lineage and some predicted novel functions for the encoded gene products in protists. For one of the gene products, we show that it localizes to the mitochondria, and we also demonstrate that antisense inhibition of its expression leads to slower growth, a phenotype associated with mitochondrial dysfunction.

  16. Rhoptry-associated protein (rap-1) genes in the sheep pathogen Babesia sp. Xinjiang: Multiple transcribed copies differing by 3' end repeated sequences.

    Science.gov (United States)

    Niu, Qingli; Marchand, Jordan; Yang, Congshan; Bonsergent, Claire; Guan, Guiquan; Yin, Hong; Malandrin, Laurence

    2015-07-30

    Sheep babesiosis occurs mainly in tropical and subtropical areas. The sheep parasite Babesia sp. Xinjiang is widespread in China, and our goal is to characterize rap-1 (rhoptry-associated protein 1) gene diversity and expression as a first step of a long term goal aiming at developing a recombinant subunit vaccine. Seven different rap-1a genes were amplified in Babesia sp. Xinjiang, using degenerate primers designed from conserved motifs. Rap-1b and rap-1c gene types could not be identified. In all seven rap-1a genes, the 5' regions exhibited identical sequences over 936 nt, and the 3' regions differed at 28 positions over 147 nt, defining two types of genes designated α and β. The remaining 3' part varied from 72 to 360 nt in length, depending on the gene. This region consists of a succession of two to ten 36 nt repeats, which explains the size differences. Even if the nucleotide sequences varied, 6 repeats encoded the same stretch of amino acids. Transcription of at least four α and two β genes was demonstrated by standard RT-PCR. Copyright © 2015 Elsevier B.V. All rights reserved.

  17. Selfish DNA in protein-coding genes of Rickettsia.

    Science.gov (United States)

    Ogata, H; Audic, S; Barbe, V; Artiguenave, F; Fournier, P E; Raoult, D; Claverie, J M

    2000-10-13

    Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.

  18. The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses

    Science.gov (United States)

    Shukla, Avi; Chatterjee, Anirvan

    2018-01-01

    Abstract Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near–linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment including RDCPs that might have helped in host adaption. PMID:29308275

  19. Imperfect DNA mirror repeats in the gag gene of HIV-1 (HXB2 identify key functional domains and coincide with protein structural elements in each of the mature proteins

    Directory of Open Access Journals (Sweden)

    Lang Dorothy M

    2007-10-01

    Full Text Available Abstract Background A DNA mirror repeat is a sequence segment delimited on the basis of its containing a center of symmetry on a single strand, e.g. 5'-GCATGGTACG-3'. It is most frequently described in association with a functionally significant site in a genomic sequence, and its occurrence is regarded as noteworthy, if not unusual. However, imperfect mirror repeats (IMRs having ≥ 50% symmetry are common in the protein coding DNA of monomeric proteins and their distribution has been found to coincide with protein structural elements – helices, β sheets and turns. In this study, the distribution of IMRs is evaluated in a polyprotein – to determine whether IMRs may be related to the position or order of protein cleavage or other hierarchal aspects of protein function. The gag gene of HIV-1 [GenBank:K03455] was selected for the study because its protein motifs and structural components are well documented. Results There is a highly specific relationship between IMRs and structural and functional aspects of the Gag polyprotein. The five longest IMRs in the polyprotein translate a key functional segment in each of the five cleavage products. Throughout the protein, IMRs coincide with functionally significant segments of the protein. A detailed annotation of the protein, which combines structural, functional and IMR data illustrates these associations. There is a significant statistical correlation between the ends of IMRs and the ends of PSEs in each of the mature proteins. Weakly symmetric IMRs (≥ 33% are related to cleavage positions and processes. Conclusion The frequency and distribution of IMRs in HIV-1 Gag indicates that DNA symmetry is a fundamental property of protein coding DNA and that different levels of symmetry are associated with different functional aspects of the gene and its protein. The interaction between IMRs and protein structure and function is precise and interwoven over the entire length of the polyprotein. The

  20. Genome-wide identification, sequence characterization, and protein-protein interaction properties of DDB1 (damaged DNA binding protein-1)-binding WD40-repeat family members in Solanum lycopersicum.

    Science.gov (United States)

    Zhu, Yunye; Huang, Shengxiong; Miao, Min; Tang, Xiaofeng; Yue, Junyang; Wang, Wenjie; Liu, Yongsheng

    2015-06-01

    One hundred DDB1 (damaged DNA binding protein-1)-binding WD40-repeat domain (DWD) family genes were identified in the S. lycopersicum genome. The DWD genes encode proteins presumably functioning as the substrate recognition subunits of the cullin4-ring ubiquitin E3 ligase complex. These findings provide candidate genes and a research platform for further gene functionality and molecular breeding study. A subclass of DDB1 (damaged DNA binding protein-1)-binding WD40-repeat domain (DWD) family proteins has been demonstrated to function as the substrate recognition subunits of the cullin4-ring ubiquitin E3 ligase complex. However, little information is available about the cognate subfamily genes in tomato (S. lycopersicum). In this study, based on the recently released tomato genome sequences, 100 tomato genes encoding DWD proteins that potentially interact with DDB1 were identified and characterized, including analyses of the detailed annotations, chromosome locations and compositions of conserved amino acid domains. In addition, a phylogenetic tree, which comprises of three main groups, of the subfamily genes was constructed. The physical interaction between tomato DDB1 and 14 representative DWD proteins was determined by yeast two-hybrid and co-immunoprecipitation assays. The subcellular localization of these 14 representative DWD proteins was determined. Six of them were localized in both nucleus and cytoplasm, seven proteins exclusively in cytoplasm, and one protein either in nucleus and cytoplasm, or exclusively in cytoplasm. Comparative genomic analysis demonstrated that the expansion of these subfamily members in tomato predominantly resulted from two whole-genome triplication events in the evolution history.

  1. Immediate-early gene response to repeated immobilization: Fos protein and arc mRNA levels appear to be less sensitive than c-fos mRNA to adaptation.

    Science.gov (United States)

    Ons, Sheila; Rotllant, David; Marín-Blasco, Ignacio J; Armario, Antonio

    2010-06-01

    Stress exposure resulted in brain induction of immediate-early genes (IEGs), considered as markers of neuronal activation. Upon repeated exposure to the same stressor, reduction of IEG response (adaptation) has been often observed, but there are important discrepancies in literature that may be in part related to the particular IEG and methodology used. We studied the differential pattern of adaptation of the IEGs c-fos and arc (activity-regulated cytoskeleton-associated protein) after repeated exposure to a severe stressor: immobilization on wooden boards (IMO). Rats repeatedly exposed to IMO showed reduced c-fos mRNA levels in response to acute IMO in most brain areas studied: the medial prefrontal cortex (mPFC), lateral septum (LS), medial amygdala (MeA), paraventricular nucleus of the hypothalamus (PVN) and locus coeruleus. In contrast, the number of neurons showing Fos-like immunoreactivity was only reduced in the MeA and the various subregions of the PVN. IMO-induced increases in arc gene expression were restricted to telencephalic regions and reduced by repeated IMO only in the mPFC. Double-labelling in the LS of IMO-exposed rats revealed that arc was expressed in only one-third of Fos+ neurons, suggesting two populations of Fos+ neurons. These data suggest that c-fos mRNA levels are more affected by repeated IMO than corresponding protein, and that arc gene expression does not reflect adaptation in most brain regions, which may be related to its constitutive expression. Therefore, the choice of a particular IEG and the method of measurement are important for proper interpretation of the impact of chronic repeated stress on brain activation.

  2. Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Yuan Tong

    2010-01-01

    Full Text Available Abstract Background Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 Leucine-rich repeat receptor-like kinases (LRR-RLKs, representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated. Results As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs of 194 LRR-RLKs were cloned into the GatewayR donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into GatewayR destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC at Ohio State University for full accessibility by the Arabidopsis research community. Conclusions Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.

  3. Repeat-mediated epigenetic dysregulation of the FMR1 gene in the fragile X-related disorders.

    Science.gov (United States)

    Usdin, Karen; Kumari, Daman

    2015-01-01

    The fragile X-related disorders are members of the Repeat Expansion Diseases, a group of genetic conditions resulting from an expansion in the size of a tandem repeat tract at a specific genetic locus. The repeat responsible for disease pathology in the fragile X-related disorders is CGG/CCG and the repeat tract is located in the 5' UTR of the FMR1 gene, whose protein product FMRP, is important for the proper translation of dendritic mRNAs in response to synaptic activation. There are two different pathological FMR1 allele classes that are distinguished only by the number of repeats. Premutation alleles have 55-200 repeats and confer risk of fragile X-associated tremor/ataxia syndrome and fragile X-associated primary ovarian insufficiency. Full mutation alleles on the other hand have >200 repeats and result in fragile X syndrome, a disorder that affects learning and behavior. Different symptoms are seen in carriers of premutation and full mutation alleles because the repeat number has paradoxical effects on gene expression: Epigenetic changes increase transcription from premutation alleles and decrease transcription from full mutation alleles. This review will cover what is currently known about the mechanisms responsible for these changes in FMR1 expression and how they may relate to other Repeat Expansion Diseases that also show repeat-mediated changes in gene expression.

  4. A dual origin of the Xist gene from a protein-coding gene and a set of transposable elements.

    Directory of Open Access Journals (Sweden)

    Eugeny A Elisaphenko

    2008-06-01

    Full Text Available X-chromosome inactivation, which occurs in female eutherian mammals is controlled by a complex X-linked locus termed the X-inactivation center (XIC. Previously it was proposed that genes of the XIC evolved, at least in part, as a result of pseudogenization of protein-coding genes. In this study we show that the key XIC gene Xist, which displays fragmentary homology to a protein-coding gene Lnx3, emerged de novo in early eutherians by integration of mobile elements which gave rise to simple tandem repeats. The Xist gene promoter region and four out of ten exons found in eutherians retain homology to exons of the Lnx3 gene. The remaining six Xist exons including those with simple tandem repeats detectable in their structure have similarity to different transposable elements. Integration of mobile elements into Xist accompanies the overall evolution of the gene and presumably continues in contemporary eutherian species. Additionally we showed that the combination of remnants of protein-coding sequences and mobile elements is not unique to the Xist gene and is found in other XIC genes producing non-coding nuclear RNA.

  5. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures

    Directory of Open Access Journals (Sweden)

    Hayes Michael L

    2012-05-01

    Full Text Available Abstract Background Pentatricopeptide repeat (PPR proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82 and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. Results All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3’ UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. Conclusion PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative

  6. Molecular evolution of pentatricopeptide repeat genes reveals truncation in species lacking an editing target and structural domains under distinct selective pressures.

    Science.gov (United States)

    Hayes, Michael L; Giang, Karolyn; Mulligan, R Michael

    2012-05-14

    Pentatricopeptide repeat (PPR) proteins are required for numerous RNA processing events in plant organelles including C-to-U editing, splicing, stabilization, and cleavage. Fifteen PPR proteins are known to be required for RNA editing at 21 sites in Arabidopsis chloroplasts, and belong to the PLS class of PPR proteins. In this study, we investigate the co-evolution of four PPR genes (CRR4, CRR21, CLB19, and OTP82) and their six editing targets in Brassicaceae species. PPR genes are composed of approximately 10 to 20 tandem repeats and each repeat has two α-helical regions, helix A and helix B, that are separated by short coil regions. Each repeat and structural feature was examined to determine the selective pressures on these regions. All of the PPR genes examined are under strong negative selection. Multiple independent losses of editing site targets are observed for both CRR21 and OTP82. In several species lacking the known editing target for CRR21, PPR genes are truncated near the 17th PPR repeat. The coding sequences of the truncated CRR21 genes are maintained under strong negative selection; however, the 3' UTR sequences beyond the truncation site have substantially diverged. Phylogenetic analyses of four PPR genes show that sequences corresponding to helix A are high compared to helix B sequences. Differential evolutionary selection of helix A versus helix B is observed in both plant and mammalian PPR genes. PPR genes and their cognate editing sites are mutually constrained in evolution. Editing sites are frequently lost by replacement of an edited C with a genomic T. After the loss of an editing site, the PPR genes are observed with three outcomes: first, few changes are detected in some cases; second, the PPR gene is present as a pseudogene; and third, the PPR gene is present but truncated in the C-terminal region. The retention of truncated forms of CRR21 that are maintained under strong negative selection even in the absence of an editing site target

  7. Arabidopsis leucine-rich repeat extensin (LRX) proteins modify cell wall composition and influence plant growth.

    Science.gov (United States)

    Draeger, Christian; Ndinyanka Fabrice, Tohnyui; Gineau, Emilie; Mouille, Grégory; Kuhn, Benjamin M; Moller, Isabel; Abdou, Marie-Therese; Frey, Beat; Pauly, Markus; Bacic, Antony; Ringli, Christoph

    2015-06-24

    Leucine-rich repeat extensins (LRXs) are extracellular proteins consisting of an N-terminal leucine-rich repeat (LRR) domain and a C-terminal extensin domain containing the typical features of this class of structural hydroxyproline-rich glycoproteins (HRGPs). The LRR domain is likely to bind an interaction partner, whereas the extensin domain has an anchoring function to insolubilize the protein in the cell wall. Based on the analysis of the root hair-expressed LRX1 and LRX2 of Arabidopsis thaliana, LRX proteins are important for cell wall development. The importance of LRX proteins in non-root hair cells and on the structural changes induced by mutations in LRX genes remains elusive. The LRX gene family of Arabidopsis consists of eleven members, of which LRX3, LRX4, and LRX5 are expressed in aerial organs, such as leaves and stem. The importance of these LRX genes for plant development and particularly cell wall formation was investigated. Synergistic effects of mutations with gradually more severe growth retardation phenotypes in double and triple mutants suggest a similar function of the three genes. Analysis of cell wall composition revealed a number of changes to cell wall polysaccharides in the mutants. LRX3, LRX4, and LRX5, and most likely LRX proteins in general, are important for cell wall development. Due to the complexity of changes in cell wall structures in the lrx mutants, the exact function of LRX proteins remains to be determined. The increasingly strong growth-defect phenotypes in double and triple mutants suggests that the LRX proteins have similar functions and that they are important for proper plant development.

  8. A conserved gene family encodes transmembrane proteins with fibronectin, immunoglobulin and leucine-rich repeat domains (FIGLER

    Directory of Open Access Journals (Sweden)

    Haga Christopher L

    2007-09-01

    Full Text Available Abstract Background In mouse the cytokine interleukin-7 (IL-7 is required for generation of B lymphocytes, but human IL-7 does not appear to have this function. A bioinformatics approach was therefore used to identify IL-7 receptor related genes in the hope of identifying the elusive human cytokine. Results Our database search identified a family of nine gene candidates, which we have provisionally named fibronectin immunoglobulin leucine-rich repeat (FIGLER. The FIGLER 1–9 genes are predicted to encode type I transmembrane glycoproteins with 6–12 leucine-rich repeats (LRR, a C2 type Ig domain, a fibronectin type III domain, a hydrophobic transmembrane domain, and a cytoplasmic domain containing one to four tyrosine residues. Members of this multichromosomal gene family possess 20–47% overall amino acid identity and are differentially expressed in cell lines and primary hematopoietic lineage cells. Genes for FIGLER homologs were identified in macaque, orangutan, chimpanzee, mouse, rat, dog, chicken, toad, and puffer fish databases. The non-human FIGLER homologs share 38–99% overall amino acid identity with their human counterpart. Conclusion The extracellular domain structure and absence of recognizable cytoplasmic signaling motifs in members of the highly conserved FIGLER gene family suggest a trophic or cell adhesion function for these molecules.

  9. Not so bad after all: retroviruses and long terminal repeat retrotransposons as a source of new genes in vertebrates.

    Science.gov (United States)

    Naville, M; Warren, I A; Haftek-Terreau, Z; Chalopin, D; Brunet, F; Levin, P; Galiana, D; Volff, J-N

    2016-04-01

    Viruses and transposable elements, once considered as purely junk and selfish sequences, have repeatedly been used as a source of novel protein-coding genes during the evolution of most eukaryotic lineages, a phenomenon called 'molecular domestication'. This is exemplified perfectly in mammals and other vertebrates, where many genes derived from long terminal repeat (LTR) retroelements (retroviruses and LTR retrotransposons) have been identified through comparative genomics and functional analyses. In particular, genes derived from gag structural protein and envelope (env) genes, as well as from the integrase-coding and protease-coding sequences, have been identified in humans and other vertebrates. Retroelement-derived genes are involved in many important biological processes including placenta formation, cognitive functions in the brain and immunity against retroelements, as well as in cell proliferation, apoptosis and cancer. These observations support an important role of retroelement-derived genes in the evolution and diversification of the vertebrate lineage. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  10. StaRProtein, A Web Server for Prediction of the Stability of Repeat Proteins

    Science.gov (United States)

    Xu, Yongtao; Zhou, Xu; Huang, Meilan

    2015-01-01

    Repeat proteins have become increasingly important due to their capability to bind to almost any proteins and the potential as alternative therapy to monoclonal antibodies. In the past decade repeat proteins have been designed to mediate specific protein-protein interactions. The tetratricopeptide and ankyrin repeat proteins are two classes of helical repeat proteins that form different binding pockets to accommodate various partners. It is important to understand the factors that define folding and stability of repeat proteins in order to prioritize the most stable designed repeat proteins to further explore their potential binding affinities. Here we developed distance-dependant statistical potentials using two classes of alpha-helical repeat proteins, tetratricopeptide and ankyrin repeat proteins respectively, and evaluated their efficiency in predicting the stability of repeat proteins. We demonstrated that the repeat-specific statistical potentials based on these two classes of repeat proteins showed paramount accuracy compared with non-specific statistical potentials in: 1) discriminate correct vs. incorrect models 2) rank the stability of designed repeat proteins. In particular, the statistical scores correlate closely with the equilibrium unfolding free energies of repeat proteins and therefore would serve as a novel tool in quickly prioritizing the designed repeat proteins with high stability. StaRProtein web server was developed for predicting the stability of repeat proteins. PMID:25807112

  11. WD40-repeat proteins in plant cell wall formation: current evidence and research prospects

    Directory of Open Access Journals (Sweden)

    Gea eGuerriero

    2015-12-01

    Full Text Available The metabolic complexity of living organisms relies on supramolecular protein structures which ensure vital processes, such as signal transduction, transcription, translation and cell wall synthesis. In eukaryotes WD40-repeat (WDR proteins often function as molecular hubs mediating supramolecular interactions. WDR proteins may display a variety of interacting partners and participate in the assembly of complexes involved in distinct cellular functions. In plants, the formation of lignocellulosic biomass involves extensive synthesis of cell wall polysaccharides, a process that requires the assembly of large transmembrane enzyme complexes, intensive vesicle trafficking, interactions with the cytoskeleton, and coordinated gene expression. Because of their function as supramolecular hubs, WDR proteins could participate in each or any of these steps, although to date only few WDR proteins have been linked to the cell wall by experimental evidence. Nevertheless, several potential cell wall-related WDR proteins were recently identified using in silico aproaches, such as analyses of co-expression, interactome and conserved gene neighbourhood. Notably, some WDR genes are frequently genomic neighbours of genes coding for GT2-family polysaccharide synthases in eukaryotes, and this WDR-GT2 collinear microsynteny is detected in diverse taxa. In angiosperms, two WDR genes are collinear to cellulose synthase genes, CESAs, whereas in ascomycetous fungi several WDR genes are adjacent to chitin synthase genes, chs. In this Perspective we summarize and discuss experimental and in silico studies on the possible involvement of WDR proteins in plant cell wall formation. The prospects of biotechnological engineering for enhanced biomass production are discussed.

  12. The Asian Rice Gall Midge (Orseolia oryzae Mitogenome Has Evolved Novel Gene Boundaries and Tandem Repeats That Distinguish Its Biotypes.

    Directory of Open Access Journals (Sweden)

    Isha Atray

    Full Text Available The complete mitochondrial genome of the Asian rice gall midge, Orseolia oryzae (Diptera; Cecidomyiidae was sequenced, annotated and analysed in the present study. The circular genome is 15,286 bp with 13 protein-coding genes, 22 tRNAs and 2 ribosomal RNA genes, and a 578 bp non-coding control region. All protein coding genes used conventional start codons and terminated with a complete stop codon. The genome presented many unusual features: (1 rearrangement in the order of tRNAs as well as protein coding genes; (2 truncation and unusual secondary structures of tRNAs; (3 presence of two different repeat elements in separate non-coding regions; (4 presence of one pseudo-tRNA gene; (5 inversion of the rRNA genes; (6 higher percentage of non-coding regions when compared with other insect mitogenomes. Rearrangements of the tRNAs and protein coding genes are explained on the basis of tandem duplication and random loss model and why intramitochondrial recombination is a better model for explaining rearrangements in the O. oryzae mitochondrial genome is discussed. Furthermore, we evaluated the number of iterations of the tandem repeat elements found in the mitogenome. This led to the identification of genetic markers capable of differentiating rice gall midge biotypes and the two Orseolia species investigated.

  13. Gene conversion homogenizes the CMT1A paralogous repeats

    Directory of Open Access Journals (Sweden)

    Hurles Matthew E

    2001-12-01

    Full Text Available Abstract Background Non-allelic homologous recombination between paralogous repeats is increasingly being recognized as a major mechanism causing both pathogenic microdeletions and duplications, and structural polymorphism in the human genome. It has recently been shown empirically that gene conversion can homogenize such repeats, resulting in longer stretches of absolute identity that may increase the rate of non-allelic homologous recombination. Results Here, a statistical test to detect gene conversion between pairs of non-coding sequences is presented. It is shown that the 24 kb Charcot-Marie-Tooth type 1A paralogous repeats (CMT1A-REPs exhibit the imprint of gene conversion processes whilst control orthologous sequences do not. In addition, Monte Carlo simulations of the evolutionary divergence of the CMT1A-REPs, incorporating two alternative models for gene conversion, generate repeats that are statistically indistinguishable from the observed repeats. Bounds are placed on the rate of these conversion processes, with central values of 1.3 × 10-4 and 5.1 × 10-5 per generation for the alternative models. Conclusions This evidence presented here suggests that gene conversion may have played an important role in the evolution of the CMT1A-REP paralogous repeats. The rates of these processes are such that it is probable that homogenized CMT1A-REPs are polymorphic within modern populations. Gene conversion processes are similarly likely to play an important role in the evolution of other segmental duplications and may influence the rate of non-allelic homologous recombination between them.

  14. Characterization of tetratricopeptide repeat-containing proteins critical for cilia formation and function.

    Directory of Open Access Journals (Sweden)

    Yanan Xu

    Full Text Available Cilia formation and function require a special set of trafficking machinery termed intraflagellar transport (IFT, consisting mainly of protein complexes IFT-A, IFT-B, BBSome, and microtubule-dependent molecular motors. Tetratricopeptide repeat-containing (TTC proteins are widely involved in protein complex formation. Nine of them are known to serve as components of the IFT or BBSome complexes. How many TTC proteins are cilia-related and how they function, however, remain unclear. Here we show that twenty TTC genes were upregulated by at least 2-fold during the differentiation of cultured mouse tracheal epithelial cells (MTECs into multiciliated cells. Our systematic screen in zebrafish identified four novel TTC genes, ttc4, -9c, -36, and -39c, that are critical for cilia formation and motility. Accordingly, their zebrafish morphants displayed typical ciliopathy-related phenotypes, including curved body, abnormal otolith, hydrocephalus, and defective left-right patterning. The morphants of ttc4 and ttc25, a known cilia-related gene, additionally showed pronephric cyst formation. Immunoprecipitation indicated associations of TTC4, -9c, -25, -36, and -39c with components or entire complexes of IFT-A, IFT-B, or BBSome, implying their participations in IFT or IFT-related activities. Our results provide a global view for the relationship between TTC proteins and cilia.

  15. Genome-wide characterization, evolution, and expression analysis of the leucine-rich repeat receptor-like protein kinase (LRR-RLK) gene family in Rosaceae genomes.

    Science.gov (United States)

    Sun, Jiangmei; Li, Leiting; Wang, Peng; Zhang, Shaoling; Wu, Juyou

    2017-10-10

    Leucine-rich repeat receptor-like protein kinase (LRR-RLK) is the largest gene family of receptor-like protein kinases (RLKs) and actively participates in regulating the growth, development, signal transduction, immunity, and stress responses of plants. However, the patterns of LRR-RLK gene family evolution in the five main Rosaceae species for which genome sequences are available have not yet been reported. In this study, we performed a comprehensive analysis of LRR-RLK genes for five Rosaceae species: Fragaria vesca (strawberry), Malus domestica (apple), Pyrus bretschneideri (Chinese white pear), Prunus mume (mei), and Prunus persica (peach), which contained 201, 244, 427, 267, and 258 LRR-RLK genes, respectively. All LRR-RLK genes were further grouped into 23 subfamilies based on the hidden Markov models approach. RLK-Pelle_LRR-XII-1, RLK-Pelle_LRR-XI-1, and RLK-Pelle_LRR-III were the three largest subfamilies. Synteny analysis indicated that there were 236 tandem duplicated genes in the five Rosaceae species, among which subfamilies XII-1 (82 genes) and XI-1 (80 genes) comprised 68.6%. Our results indicate that tandem duplication made a large contribution to the expansion of the subfamilies. The gene expression, tissue-specific expression, and subcellular localization data revealed that LRR-RLK genes were differentially expressed in various organs and tissues, and the largest subfamily XI-1 was highly expressed in all five Rosaceae species, suggesting that LRR-RLKs play important roles in each stage of plant growth and development. Taken together, our results provide an overview of the LRR-RLK family in Rosaceae genomes and the basis for further functional studies.

  16. Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion.

    Science.gov (United States)

    Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An

    2017-09-11

    The chloroplast genome (CPG) of Pinus massoniana belonging to the genus Pinus (Pinaceae), which is a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, ndh genes loss, and the contraction and expansion of short inverted repeats (IRs). P. massoniana CPG has a typical quadripartite structure that includes large single copy (LSC) (65,563 bp), small single copy (SSC) (53,230 bp) and two IRs (IRa and IRb, 485 bp). The 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in CPG were mononucleotides motifs of A/T types and located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that the whole CPG sequences could be used as a powerful tool in phylogenetic analyses.

  17. Domain movement within a gene: a novel evolutionary mechanism for protein diversification.

    Directory of Open Access Journals (Sweden)

    Yoshikazu Furuta

    Full Text Available A protein function is carried out by a specific domain localized at a specific position. In the present study, we report that, within a gene, a specific amino acid sequence can move between a certain position and another position. This was discovered when the sequences of restriction-modification systems within the bacterial species Helicobacter pylori were compared. In the specificity subunit of Type I restriction-modification systems, DNA sequence recognition is mediated by target recognition domain 1 (TRD1 and TRD2. To our surprise, several sequences are shared by TRD1 and TRD2 of genes (alleles at the same locus (chromosomal location; these domains appear to have moved between the two positions. The gene/protein organization can be represented as x-(TRD1-y-x-(TRD2-y, where x and y represent repeat sequences. Movement probably occurs by recombination at these flanking DNA repeats. In accordance with this hypothesis, recombination at these repeats also appears to decrease two TRDs into one TRD or increase these two TRDs to three TRDs (TRD1-TRD2-TRD2 and to allow TRD movement between genes even at different loci. Similar movement of domains between TRD1 and TRD2 was observed for the specificity subunit of a Type IIG restriction enzyme. Similar movement of domain between TRD1 and TRD2 was observed for Type I restriction-modification enzyme specificity genes in two more eubacterial species, Streptococcus pyogenes and Mycoplasma agalactiae. Lateral domain movements within a protein, which we have designated DOMO (domain movement, represent novel routes for the diversification of proteins.

  18. TMPyP4 porphyrin distorts RNA G-quadruplex structures of the disease-associated r(GGGGCC)n repeat of the C9orf72 gene and blocks interaction of RNA-binding proteins.

    Science.gov (United States)

    Zamiri, Bita; Reddy, Kaalak; Macgregor, Robert B; Pearson, Christopher E

    2014-02-21

    Certain DNA and RNA sequences can form G-quadruplexes, which can affect genetic instability, promoter activity, RNA splicing, RNA stability, and neurite mRNA localization. Amyotrophic lateral sclerosis and frontotemporal dementia can be caused by expansion of a (GGGGCC)n repeat in the C9orf72 gene. Mutant r(GGGGCC)n- and r(GGCCCC)n-containing transcripts aggregate in nuclear foci, possibly sequestering repeat-binding proteins such as ASF/SF2 and hnRNPA1, suggesting a toxic RNA pathogenesis, as occurs in myotonic dystrophy. Furthermore, the C9orf72 repeat RNA was recently demonstrated to undergo the noncanonical repeat-associated non-AUG translation (RAN translation) into pathologic dipeptide repeats in patient brains, a process that is thought to depend upon RNA structure. We previously demonstrated that the r(GGGGCC)n RNA forms repeat tract length-dependent G-quadruplex structures that bind the ASF/SF2 protein. Here we show that the cationic porphyrin (5,10,15,20-tetra(N-methyl-4-pyridyl) porphyrin (TMPyP4)), which can bind some G-quadruplex-forming sequences, can bind and distort the G-quadruplex formed by r(GGGGCC)8, and this ablates the interaction of either hnRNPA1 or ASF/SF2 with the repeat. These findings provide proof of concept that nucleic acid binding small molecules, such as TMPyP4, can distort the secondary structure of the C9orf72 repeat, which may beneficially disrupt protein interactions, which may ablate either protein sequestration and/or RAN translation into potentially toxic dipeptides. Disruption of secondary structure formation of the C9orf72 RNA repeats may be a viable therapeutic avenue, as well as a means to test the role of RNA structure upon RAN translation.

  19. Characterization and evolutionary analysis of tributyltin-binding protein and pufferfish saxitoxin and tetrodotoxin-binding protein genes in toxic and nontoxic pufferfishes.

    Science.gov (United States)

    Hashiguchi, Y; Lee, J M; Shiraishi, M; Komatsu, S; Miki, S; Shimasaki, Y; Mochioka, N; Kusakabe, T; Oshima, Y

    2015-05-01

    Understanding the evolutionary mechanisms of toxin accumulation in pufferfishes has been long-standing problem in toxicology and evolutionary biology. Pufferfish saxitoxin and tetrodotoxin-binding protein (PSTBP) is involved in the transport and accumulation of tetrodotoxin and is one of the most intriguing proteins related to the toxicity of pufferfishes. PSTBPs are fusion proteins consisting of two tandem repeated tributyltin-binding protein type 2 (TBT-bp2) domains. In this study, we examined the evolutionary dynamics of TBT-bp2 and PSTBP genes to understand the evolution of toxin accumulation in pufferfishes. Database searches and/or PCR-based cDNA cloning in nine pufferfish species (6 toxic and 3 nontoxic) revealed that all species possessed one or more TBT-bp2 genes, but PSTBP genes were found only in 5 toxic species belonging to genus Takifugu. These toxic Takifugu species possessed two or three copies of PSTBP genes. Phylogenetic analysis of TBT-bp2 and PSTBP genes suggested that PSTBPs evolved in the common ancestor of Takifugu species by repeated duplications and fusions of TBT-bp2 genes. In addition, a detailed comparison of Takifugu TBT-bp2 and PSTBP gene sequences detected a signature of positive selection under the pressure of gene conversion. The complicated evolutionary dynamics of TBT-bp2 and PSTBP genes may reflect the diversity of toxicity in pufferfishes. © 2015 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2015 European Society For Evolutionary Biology.

  20. Transmissible familial Creutzfeldt-Jakob disease associated with five, seven, and eight extra octapeptide coding repeats in the PRNP gene

    Energy Technology Data Exchange (ETDEWEB)

    Goldfarb, L.G.; Brown, P.; McCombie, W.R.; Gibbs, C.J. Jr.; Gajdusek, D.C. (National Inst. of Health, Bethesda, MD (United States)); Goldgaber, D. (State Univ. of New York, Stony Brook (United States)); Swergold, G.D. (National Inst. of Health, Bethesda, MD (United States)); Wills, P.R. (Univ. of Auckland (New Zealand)); Cervenakova, L. (Inst. of Preventive and Clinical Medicine, Bratislava (Czechoslovakia)); Baron, H. (Searle Pharmaceuticals, Paris (France))

    1991-12-01

    The PRNP gene, encoding the amyloid precursor protein that is centrally involved in Creutzfeldt-Jakob disease (CJD), has an unstable region of five variant tandem octapeptide coding repeats between codons 51 and 91. The authors screened a total of 535 individuals for the presence of extra repeats in this region, including patients with sporadic and familial forms of spongiform encephalopathy, members of their families, other neurological and non-neurological patients, and normal controls. They identified three CJD families (in each of which the proband's disease was neuropathologically confirmed and experimentally transmitted to primates) that were heterozygous for alleles with 10, 12, or 13 repeats, some of which had wobble nucleotide substitutions. They also found one individual with 9 repeats and no nucleotide substitutions who had no evidence of neurological disease. These observations, together with data on published British patients with 11 and 14 repeats, strongly suggest that the occurrence of 10 or more octapeptide repeats in the encoded amyloid precursor protein predisposes to CJD.

  1. Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes

    DEFF Research Database (Denmark)

    Peng, Xu; Brügger, Kim; Shen, Biao

    2003-01-01

    terminally modified and corresponds to SSO454, an open reading frame of previously unassigned function. It binds specifically to DNA fragments carrying double and single repeat sequences, binding on one side of the repeat structure, and producing an opening of the opposite side of the DNA structure. It also...... recognizes both main families of repeat sequences in S. solfataricus. The recombinant protein, expressed in Escherichia coli, showed the same binding properties to the SRSR repeat as the native one. The SSO454 protein exhibits a tripartite internal repeat structure which yields a good sequence match...... with a helix-turn-helix DNA-binding motif. Although this putative motif is shared by other archaeal proteins, orthologs of SSO454 were only detected in species within the Sulfolobus genus and in the closely related Acidianus genus. We infer that the genus-specific protein induces an opening of the structure...

  2. Repeat Sequence Proteins as Matrices for Nanocomposites

    Energy Technology Data Exchange (ETDEWEB)

    Drummy, L.; Koerner, H; Phillips, D; McAuliffe, J; Kumar, M; Farmer, B; Vaia, R; Naik, R

    2009-01-01

    Recombinant protein-inorganic nanocomposites comprised of exfoliated Na+ montmorillonite (MMT) in a recombinant protein matrix based on silk-like and elastin-like amino acid motifs (silk elastin-like protein (SELP)) were formed via a solution blending process. Charged residues along the protein backbone are shown to dominate long-range interactions, whereas the SELP repeat sequence leads to local protein/MMT compatibility. Up to a 50% increase in room temperature modulus and a comparable decrease in high temperature coefficient of thermal expansion occur for cast films containing 2-10 wt.% MMT.

  3. Identification and function of leucine-rich repeat flightless-I-interacting protein 2 (LRRFIP2 in Litopenaeus vannamei.

    Directory of Open Access Journals (Sweden)

    Shuang Zhang

    Full Text Available Leucine-rich repeat flightless-I-interacting protein 2 (LRRFIP2 is a myeloid differentiation factor 88-interacting protein with a positive regulatory function in toll-like receptor signaling. In this study, seven LRRFIP2 protein variants (LvLRRFIP2A-G were identified in Litopenaeus vannamei. All the seven LvLRRFIP2 protein variants encode proteins with a DUF2051 domain. LvLRRFIP2s were upregulated in hemocytes after challenged with lipopolysaccharide, poly I:C, CpG-ODN2006, Vibrio parahaemolyticus, Staphylococcus aureus, and white spot syndrome virus (WSSV. Dual-luciferase reporter assays in Drosophila Schneider 2 cells revealed that LvLRRFIP2 activates the promoters of Drosophila and shrimp AMP genes. The knockdown of LvLRRFIP2 by RNA interference resulted in higher cumulative mortality of L. vannamei upon V. parahaemolyticus but not S. aureus and WSSV infections. The expression of L. vannamei AMP genes were reduced by dsLvLRRFIP2 interference. These results indicate that LvLRRFIP2 has an important function in antibacterials via the regulation of AMP gene expression.

  4. Dynamic changes of tumor gene expression during repeated pressurized intraperitoneal aerosol chemotherapy (PIPAC) in women with peritoneal cancer

    International Nuclear Information System (INIS)

    Rezniczek, Günther A.; Jüngst, Friederike; Jütte, Hendrik; Tannapfel, Andrea; Hilal, Ziad; Hefler, Lukas A.; Reymond, Marc-André; Tempfer, Clemens B.

    2016-01-01

    Intraperitoneal chemotherapy is used to treat peritoneal cancer. The pattern of gene expression changes of peritoneal cancer during intraperitoneal chemotherapy has not been studied before. Pressurized intraperitoneal aerosol chemotherapy is a new form of intraperitoneal chemotherapy using repeated applications and allowing repeated tumor sampling during chemotherapy. Here, we present the analysis of gene expression changes during pressurized intraperitoneal aerosol chemotherapy with doxorubicin and cisplatin using a 22-gene panel. Total RNA was extracted from 152 PC samples obtained from 63 patients in up to six cycles of intraperitoneal chemotherapy. Quantitative real-time PCR was used to determine the gene expression levels. For select genes, immunohistochemistry was used to verify gene expression changes observed on the transcript level on the protein level. Observed (changes in) expression levels were correlated with clinical outcomes. Gene expression profiles differed significantly between peritoneal cancer and non- peritoneal cancer samples and between ascites-producing and non ascites-producing peritoneal cancers. Changes of gene expression patterns during repeated intraperitoneal chemotherapy cycles were prognostic of overall survival, suggesting a molecular tumor response of peritoneal cancer. Specifically, downregulation of the whole gene panel during intraperitoneal chemotherapy was associated with better treatment response and survival. In summary, molecular changes of peritoneal cancer during pressurized intraperitoneal aerosol chemotherapy can be documented and may be used to refine individual treatment and prognostic estimations. The online version of this article (doi:10.1186/s12885-016-2668-4) contains supplementary material, which is available to authorized users

  5. Repeat-containing protein effectors of plant-associated organisms

    Directory of Open Access Journals (Sweden)

    Carl H. Mesarich

    2015-10-01

    Full Text Available Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.

  6. ST proteins, a new family of plant tandem repeat proteins with a DUF2775 domain mainly found in Fabaceae and Asteraceae.

    Science.gov (United States)

    Albornos, Lucía; Martín, Ignacio; Iglesias, Rebeca; Jiménez, Teresa; Labrador, Emilia; Dopico, Berta

    2012-11-07

    Many proteins with tandem repeats in their sequence have been described and classified according to the length of the repeats: I) Repeats of short oligopeptides (from 2 to 20 amino acids), including structural cell wall proteins and arabinogalactan proteins. II) Repeats that range in length from 20 to 40 residues, including proteins with a well-established three-dimensional structure often involved in mediating protein-protein interactions. (III) Longer repeats in the order of 100 amino acids that constitute structurally and functionally independent units. Here we analyse ShooT specific (ST) proteins, a family of proteins with tandem repeats of unknown function that were first found in Leguminosae, and their possible similarities to other proteins with tandem repeats. ST protein sequences were only found in dicotyledonous plants, limited to several plant families, mainly the Fabaceae and the Asteraceae. ST mRNAs accumulate mainly in the roots and under biotic interactions. Most ST proteins have one or several Domain(s) of Unknown Function 2775 (DUF2775). All deduced ST proteins have a signal peptide, indicating that these proteins enter the secretory pathway, and the mature proteins have tandem repeat oligopeptides that share a hexapeptide (E/D)FEPRP followed by 4 partially conserved amino acids, which could determine a putative N-glycosylation signal, and a fully conserved tyrosine. In a phylogenetic tree, the sequences clade according to taxonomic group. A possible involvement in symbiosis and abiotic stress as well as in plant cell elongation is suggested, although different STs could play different roles in plant development. We describe a new family of proteins called ST whose presence is limited to the plant kingdom, specifically to a few families of dicotyledonous plants. They present 20 to 40 amino acid tandem repeat sequences with different characteristics (signal peptide, DUF2775 domain, conservative repeat regions) from the described group of 20 to 40

  7. Plasmodium cysteine repeat modular proteins 1-4: complex proteins with roles throughout the malaria parasite life cycle.

    Science.gov (United States)

    Thompson, Joanne; Fernandez-Reyes, Delmiro; Sharling, Lisa; Moore, Sally G; Eling, Wijnand M; Kyes, Sue A; Newbold, Christopher I; Kafatos, Fotis C; Janse, Chris J; Waters, Andrew P

    2007-06-01

    The Cysteine Repeat Modular Proteins (PCRMP1-4) of Plasmodium, are encoded by a small gene family that is conserved in malaria and other Apicomplexan parasites. They are very large, predicted surface proteins with multipass transmembrane domains containing motifs that are conserved within families of cysteine-rich, predicted surface proteins in a range of unicellular eukaryotes, and a unique combination of protein-binding motifs, including a >100 kDa cysteine-rich modular region, an epidermal growth factor-like domain and a Kringle domain. PCRMP1 and 2 are expressed in life cycle stages in both the mosquito and vertebrate. They colocalize with PfEMP1 (P. falciparum Erythrocyte Membrane Antigen-1) during its export from P. falciparum blood-stage parasites and are exposed on the surface of haemolymph- and salivary gland-sporozoites in the mosquito, consistent with a role in host tissue targeting and invasion. Gene disruption of pcrmp1 and 2 in the rodent malaria model, P. berghei, demonstrated that both are essential for transmission of the parasite from the mosquito to the mouse and has established their discrete and important roles in sporozoite targeting to the mosquito salivary gland. The unprecedented expression pattern and structural features of the PCRMPs thus suggest a variety of roles mediating host-parasite interactions throughout the parasite life cycle.

  8. Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

    Science.gov (United States)

    Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

    1999-04-16

    A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.

  9. Modulation of CRISPR locus transcription by the repeat-binding protein Cbp1 in Sulfolobus

    DEFF Research Database (Denmark)

    Deng, Ling; Kenchappa, Chandra Shekar; Peng, Xu

    2012-01-01

    CRISPR loci are essential components of the adaptive immune system of archaea and bacteria. They consist of long arrays of repeats separated by DNA spacers encoding guide RNAs (crRNA), which target foreign genetic elements. Cbp1 (CRISPR DNA repeat binding protein) binds specifically to the multiple...... direct repeats of CRISPR loci of members of the acidothermophilic, crenarchaeal order Sulfolobales. cbp1 gene deletion from Sulfolobus islandicus REY15A produced a strong reduction in pre-crRNA yields from CRISPR loci but did not inhibit the foreign DNA targeting capacity of the CRISPR/Cas system....... Conversely, overexpression of Cbp1 in S. islandicus generated an increase in pre-crRNA yields while the level of reverse strand transcripts from CRISPR loci remained unchanged. It is proposed that Cbp1 modulates production of longer pre-crRNA transcripts from CRISPR loci. A possible mechanism...

  10. The LuWD40-1 gene encoding WD repeat protein regulates growth and pollen viability in flax (Linum Usitatissimum L.).

    Science.gov (United States)

    Kumar, Santosh; Jordan, Mark C; Datla, Raju; Cloutier, Sylvie

    2013-01-01

    As a crop, flax holds significant commercial value for its omega-3 rich oilseeds and stem fibres. Canada is the largest producer of linseed but there exists scope for significant yield improvements. Implementation of mechanisms such as male sterility can permit the development of hybrids to assist in achieving this goal. Temperature sensitive male sterility has been reported in flax but the leakiness of this system in field conditions limits the production of quality hybrid seeds. Here, we characterized a 2,588 bp transcript differentially expressed in male sterile lines of flax. The twelve intron gene predicted to encode a 368 amino acid protein has five WD40 repeats which, in silico, form a propeller structure with putative nucleic acid and histone binding capabilities. The LuWD40-1 protein localized to the nucleus and its expression increased during the transition and continued through the vegetative stages (seed, etiolated seedling, stem) while the transcript levels declined during reproductive development (ovary, anthers) and embryonic morphogenesis of male fertile plants. Knockout lines for LuWD40-1 in flax failed to develop shoots while overexpression lines showed delayed growth phenotype and were male sterile. The non-viable flowers failed to open and the pollen grains from these flowers were empty. Three independent transgenic lines overexpressing the LuWD40-1 gene had ∼80% non-viable pollen, reduced branching, delayed flowering and maturity compared to male fertile genotypes. The present study provides new insights into a male sterility mechanism present in flax.

  11. The LuWD40-1 gene encoding WD repeat protein regulates growth and pollen viability in flax (Linum Usitatissimum L..

    Directory of Open Access Journals (Sweden)

    Santosh Kumar

    Full Text Available As a crop, flax holds significant commercial value for its omega-3 rich oilseeds and stem fibres. Canada is the largest producer of linseed but there exists scope for significant yield improvements. Implementation of mechanisms such as male sterility can permit the development of hybrids to assist in achieving this goal. Temperature sensitive male sterility has been reported in flax but the leakiness of this system in field conditions limits the production of quality hybrid seeds. Here, we characterized a 2,588 bp transcript differentially expressed in male sterile lines of flax. The twelve intron gene predicted to encode a 368 amino acid protein has five WD40 repeats which, in silico, form a propeller structure with putative nucleic acid and histone binding capabilities. The LuWD40-1 protein localized to the nucleus and its expression increased during the transition and continued through the vegetative stages (seed, etiolated seedling, stem while the transcript levels declined during reproductive development (ovary, anthers and embryonic morphogenesis of male fertile plants. Knockout lines for LuWD40-1 in flax failed to develop shoots while overexpression lines showed delayed growth phenotype and were male sterile. The non-viable flowers failed to open and the pollen grains from these flowers were empty. Three independent transgenic lines overexpressing the LuWD40-1 gene had ∼80% non-viable pollen, reduced branching, delayed flowering and maturity compared to male fertile genotypes. The present study provides new insights into a male sterility mechanism present in flax.

  12. A TALE-inspired computational screen for proteins that contain approximate tandem repeats.

    Science.gov (United States)

    Perycz, Malgorzata; Krwawicz, Joanna; Bochtler, Matthias

    2017-01-01

    TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen.

  13. Poly-dipeptides encoded by the C9ORF72 repeats block global protein translation.

    Science.gov (United States)

    Kanekura, Kohsuke; Yagi, Takuya; Cammack, Alexander J; Mahadevan, Jana; Kuroda, Masahiko; Harms, Matthew B; Miller, Timothy M; Urano, Fumihiko

    2016-05-01

    The expansion of the GGGGCC hexanucleotide repeat in the non-coding region of the Chromosome 9 open-reading frame 72 (C9orf72) gene is the most common genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS). This genetic alteration leads to the accumulation of five types of poly-dipeptides translated from the GGGGCC hexanucleotide repeat. Among these, poly-proline-arginine (poly-PR) and poly-glycine-arginine (poly-GR) peptides are known to be neurotoxic. However, the mechanisms of neurotoxicity associated with these poly-dipeptides are not clear. A proteomics approach identified a number of interacting proteins with poly-PR peptide, including mRNA-binding proteins, ribosomal proteins, translation initiation factors and translation elongation factors. Immunostaining of brain sections from patients with C9orf72 ALS showed that poly-GR was colocalized with a mRNA-binding protein, hnRNPA1. In vitro translation assays showed that poly-PR and poly-GR peptides made insoluble complexes with mRNA, restrained the access of translation factors to mRNA, and blocked protein translation. Our results demonstrate that impaired protein translation mediated by poly-PR and poly-GR peptides plays a role in neurotoxicity and reveal that the pathways altered by the poly-dipeptides-mRNA complexes are potential therapeutic targets for treatment of C9orf72 FTD/ALS. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  14. Mining disease genes using integrated protein-protein interaction and gene-gene co-regulation information.

    Science.gov (United States)

    Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming

    2015-01-01

    In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.

  15. Huntingtin gene repeat size variations affect risk of lifetime depression

    DEFF Research Database (Denmark)

    Gardiner, Sarah L.; van Belzen, Martine J.; Boogaard, Merel W.

    2017-01-01

    Huntington disease (HD) is a severe neuropsychiatric disorder caused by a cytosine-adenine-guanine (CAG) repeat expansion in the HTT gene. Although HD is frequently complicated by depression, it is still unknown to what extent common HTT CAG repeat size variations in the normal range could affect...

  16. Translation of dipeptide repeat proteins from the C9ORF72 expanded repeat is associated with cellular stress.

    Science.gov (United States)

    Sonobe, Yoshifumi; Ghadge, Ghanashyam; Masaki, Katsuhisa; Sendoel, Ataman; Fuchs, Elaine; Roos, Raymond P

    2018-08-01

    Expansion of a hexanucleotide repeat (HRE), GGGGCC, in the C9ORF72 gene is recognized as the most common cause of familial amyotrophic lateral sclerosis (FALS), frontotemporal dementia (FTD) and ALS-FTD, as well as 5-10% of sporadic ALS. Despite the location of the HRE in the non-coding region (with respect to the main C9ORF72 gene product), dipeptide repeat proteins (DPRs) that are thought to be toxic are translated from the HRE in all three reading frames from both the sense and antisense transcript. Here, we identified a CUG that has a good Kozak consensus sequence as the translation initiation codon. Mutation of this CTG significantly suppressed polyglycine-alanine (GA) translation. GA was translated when the G 4 C 2 construct was placed as the second cistron in a bicistronic construct. CRISPR/Cas9-induced knockout of a non-canonical translation initiation factor, eIF2A, impaired GA translation. Transfection of G 4 C 2 constructs induced an integrated stress response (ISR), while triggering the ISR led to a continuation of translation of GA with a decline in conventional cap-dependent translation. These in vitro observations were confirmed in chick embryo neural cells. The findings suggest that DPRs translated from an HRE in C9ORF72 aggregate and lead to an ISR that then leads to continuing DPR production and aggregation, thereby creating a continuing pathogenic cycle. Copyright © 2018 Elsevier Inc. All rights reserved.

  17. Constructs for the expression of repeating triple-helical protein domains

    International Nuclear Information System (INIS)

    Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M

    2009-01-01

    The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.

  18. Constructs for the expression of repeating triple-helical protein domains

    Energy Technology Data Exchange (ETDEWEB)

    Peng, Yong Y; Werkmeister, Jerome A; Vaughan, Paul R; Ramshaw, John A M, E-mail: jerome.werkmeister@csiro.a [CSIRO Molecular and Health Technologies, Bag 10, Clayton South, VIC 3169 (Australia)

    2009-02-15

    The development of novel scaffolds will be an important aspect in future success of tissue engineering. Scaffolds will preferably contain information that directs the cellular content of constructs so that the new tissue that is formed is closely aligned in structure, composition and function to the target natural tissue. One way of approaching this will be the development of novel protein-based constructs that contain one or more repeats of functional elements derived from various proteins. In the present case, we describe a strategy to make synthetic, recombinant triple-helical constructs that contain repeat segments of biologically relevant domains. Copies of a DNA fragment prepared by PCR from human type III collagen have been inserted in a co-linear contiguous fashion into the yeast expression vector YEpFlag-1, using sequential addition between selected restriction sites. Constructs containing 1, 2 and 3 repeats were designed to maintain the (Gly-X-Y) repeat, which is essential for the formation of an extended triple helix. All constructs gave expressed protein, with the best being the 3-repeat construct which was readily secreted. This material had the expected composition and N-terminal sequence. Incubation of the product at low temperature led to triple-helix formation, shown by reaction with a conformation dependent monoclonal antibody.

  19. Quantitative analysis and prediction of curvature in leucine-rich repeat proteins.

    Science.gov (United States)

    Hindle, K Lauren; Bella, Jordi; Lovell, Simon C

    2009-11-01

    Leucine-rich repeat (LRR) proteins form a large and diverse family. They have a wide range of functions most of which involve the formation of protein-protein interactions. All known LRR structures form curved solenoids, although there is large variation in their curvature. It is this curvature that determines the shape and dimensions of the inner space available for ligand binding. Unfortunately, large-scale parameters such as the overall curvature of a protein domain are extremely difficult to predict. Here, we present a quantitative analysis of determinants of curvature of this family. Individual repeats typically range in length between 20 and 30 residues and have a variety of secondary structures on their convex side. The observed curvature of the LRR domains correlates poorly with the lengths of their individual repeats. We have, therefore, developed a scoring function based on the secondary structure of the convex side of the protein that allows prediction of the overall curvature with a high degree of accuracy. We also demonstrate the effectiveness of this method in selecting a suitable template for comparative modeling. We have developed an automated, quantitative protocol that can be used to predict accurately the curvature of leucine-rich repeat proteins of unknown structure from sequence alone. This protocol is available as an online resource at http://www.bioinf.manchester.ac.uk/curlrr/.

  20. Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

    Science.gov (United States)

    Amirhaeri, S; Wohlrab, F; Wells, R D

    1995-02-17

    The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.

  1. Generating markers based on biotic stress of protein system in and tandem repeats sequence for Aquilaria sp

    International Nuclear Information System (INIS)

    Azhar Mohamad; Muhammad Hanif Azhari N; Siti Norhayati Ismail

    2014-01-01

    Aquilaria sp. belongs to the Thymelaeaceae family and is well distributed in Asia region. The species has multipurpose use from root to shoot and is an economically important crop, which generates wide interest in understanding genetic diversity of the species. Knowledge on DNA-based markers has become a prerequisite for more effective application of molecular marker techniques in breeding and mapping programs. In this work, both targeted genes and tandem repeat sequences were used for DNA fingerprinting in Aquilaria sp. A total of 100 ISSR (inter simple sequence repeat) primers and 50 combination pairs of specific primers derived from conserved region of a specific protein known as system in were optimized. 38 ISSR primers were found affirmative for polymorphism evaluation study and were generated from both specific and degenerate ISSR primers. And one utmost combination of system in primers showed significant results in distinguishing the Aquilaria sp. In conclusion, polymorphism derived from ISSR profiling and targeted stress genes of protein system in proved as a powerful approach for identification and molecular classification of Aquilaria sp. which will be useful for diversification in identifying any mutant lines derived from nature. (author)

  2. Repeated pulses of serotonin required for long-term facilitation activate mitogen-activated protein kinase in sensory neurons of Aplysia

    Science.gov (United States)

    Michael, Dan; Martin, Kelsey C.; Seger, Rony; Ning, Ming-Ming; Baston, Rene; Kandel, Eric R.

    1998-01-01

    Long-term facilitation of the connections between the sensory and motor neurons of the gill-withdrawal reflex in Aplysia requires five repeated pulses of serotonin (5-HT). The repeated pulses of 5-HT initiate a cascade of gene activation that leads ultimately to the growth of new synaptic connections. Several genes in this process have been identified, including the transcriptional regulators apCREB-1, apCREB-2, apC/EBP, and the cell adhesion molecule apCAM, which is thought to be involved in the formation of new synaptic connections. Here we report that the transcriptional regulators apCREB-2 and apC/EBP, as well as a peptide derived from the cytoplasmic domain of apCAM, are phosphorylated in vitro by Aplysia mitogen-activated protein kinase (apMAPK). We have cloned the cDNA encoding apMAPK and show that apMAPK activity is increased in sensory neurons treated with repeated pulses of 5-HT and by the cAMP pathway. These results suggest that apMAPK may participate with cAMP-dependent protein kinase during long-term facilitation in sensory cells by modifying some of the key elements involved in the consolidation of short- to long-lasting changes in synaptic strength. PMID:9465108

  3. TAIL1: an isthmin-like gene, containing type 1 thrombospondin-repeat and AMOP domain, mapped to ARVD1 critical region.

    Science.gov (United States)

    Rossi, Valeria; Beffagna, Giorgia; Rampazzo, Alessandra; Bauce, Barbara; Danieli, Gian Antonio

    2004-06-23

    Isthmins represent a novel family of vertebrate secreted proteins containing one copy of the thrombospondin type 1 repeat (TSR), which in mammals is shared by several proteins with diverse biological functions, including cell adhesion, angiogenesis, and patterning of developing nervous system. We have determined the genomic organization of human TAIL1 (thrombospondin and AMOP containing isthmin-like 1), a novel isthmin-like gene encoding a protein that contains a TSR and a C-terminal AMOP domain (adhesion-associated domain in MUC4 and other proteins), characteristic of extracellular proteins involved in adhesion processes. TAIL1 gene encompasses more than 24.4 kb. Analysis of the DNA sequence surrounding the putative transcriptional start region revealed a TATA-less promoter located in a CpG island. Several consensus binding sites for the transcription factors Sp1 and MZF-1 were identified in this promoter region. In humans, TAIL1 gene is located on chromosome 14q24.3 within ARVD1 (arrhythmogenic right ventricular dysplasia/cardiomyopathy, type 1) critical region; preliminary evidence suggests that it is expressed in several tissues, showing multiple alternative splicing.

  4. Characterization and expression of the maize β-carbonic anhydrase gene repeat regions.

    Science.gov (United States)

    Tems, Ursula; Burnell, James N

    2010-12-01

    In maize, carbonic anhydrase (CA; EC 4.2.1.1) catalyzes the first reaction of the C(4) photosynthetic pathway; it catalyzes the hydration of CO(2) to bicarbonate and provides an inorganic carbon source for the primary carboxylation reaction catalyzed by phosphoenolpyruvate (PEP) carboxylase. The β-CA isozymes from maize, as well as other agronomically important NADP-malic enzyme (NADP-ME) type C(4) crops, have remained relatively uncharacterized but differ significantly from the β-CAs of other C(4) monocot species primarily due to transcript length and the presence of repeat sequences. This research confirmed earlier findings of repeat sequences in maize CA transcripts, and demonstrated that the gene encoding these transcripts is also composed of repeat sequences. One of the maize CA genes was sequenced and found to encode two domains, with distinct groups of exons corresponding to the repeat regions of the transcript. We have also shown that expression of a single repeat region of the CA transcript produced active enzyme that associated as a dimer and was composed primarily of α-helices, consistent with that observed for other plant CAs. As the presence of repeat regions in the CA gene is unique to NADP-ME type C(4) monocot species, the implications of these findings in the context of the evolution of the location and function of this C(4) pathway enzyme are strongly suggestive of CA gene duplication resulting in an evolutionary advantage and a higher photosynthetic efficiency. Copyright © 2010 Elsevier Masson SAS. All rights reserved.

  5. Mononucleotide repeats are asymmetrically distributed in fungal genes

    NARCIS (Netherlands)

    Passel, van M.W.J.; Graaff, de L.H.

    2008-01-01

    ABSTRACT: BACKGROUND: Systematic analyses of sequence features have resulted in a better characterisation of the organisation of the genome. A previous study in prokaryotes on the distribution of sequence repeats, which are notoriously variable and can disrupt the reading frame in genes, showed that

  6. Repeated exposures to roadside particulate matter extracts suppresses pulmonary defense mechanisms, resulting in lipid and protein oxidative damage

    International Nuclear Information System (INIS)

    Pardo, Michal; Porat, Ziv; Rudich, Assaf; Schauer, James J.; Rudich, Yinon

    2016-01-01

    Exposure to particulate matter (PM) pollution in cities and urban canyons can be harmful to the exposed population. However, the underlying mechanisms that lead to health effects are not yet elucidated. It is postulated that exposure to repeated, small, environmentally relevant concentrations can affect lung homeostasis. This study examines the impact of repeated exposures to urban PM on mouse lungs with focus on inflammatory and oxidative stress parameters. Aqueous extracts from collected urban PM were administered to mice by 5 repeated intra-tracheal instillations (IT). Multiple exposures, led to an increase in cytokine levels in both bronchoalveolar lavage fluid and in the blood serum, indicating a systemic reaction. Lung mRNA levels of antioxidant/phase II detoxifying enzymes decreased by exposure to the PM extract, but not when metals were removed by chelation. Finally, disruption of lung tissue oxidant-inflammatory/defense balance was evidenced by increased levels of lipid and protein oxidation. Unlike response to a single IT exposure to the same dose and source of extract, multiple exposures result in lung oxidative damage and a systemic inflammatory reaction. These could be attributed to compromised capacity to activate the protective Nrf2 tissue defense system. It is suggested that water-soluble metals present in urban PM, potentially from break and tire wear, may constitute major drivers of the pulmonary and systemic responses to multiple exposure to urban PM. - Highlights: • Repeated exposure to urban PM cause systemic inflammation and oxidative damage to lung tissue lipids and proteins. • Repeated exposure to these PM extracts decreased transcription of Nrf2 protective genes. • Single as opposed to repeated exposure, induced confined lung response accompanied by activated defense mechanisms. • Metals, potentially from break and tire wear, drive the pulmonary response with exposure to urban PM. - Repeated exposures to urban PM water extracts

  7. Characterization of a novel gene encoding ankyrin repeat domain from Cotesia vestalis polydnavirus (CvBV)

    International Nuclear Information System (INIS)

    Shi Min; Chen Yafeng; Huang Fang; Liu Pengcheng; Zhou Xueping; Chen Xuexin

    2008-01-01

    Cotesia vestalis (Haliday) is an endoparasitoid of Plutella xylostella (L.) larvae and injects a polydnavirus (CvBV) into its host during oviposition. In this report we describe the characterization of a gene (CvBV805) and its products. CvBV805 is located on the segment S8 of CvBV genome; it has a size of 909 bp and encodes a predicted protein of 125 amino acids. This protein contains an ankyrin repeat domain with a high degree of similarity with IκB-like genes. Gene transcripts were detected in extracts of the host as early as 2 h post-parasitization (p.p.) and continued to be detected through 24 h. Tissue-specific expression patterns showed that CvBV805 might be involved in early host immunosuppression. CvBV805 was detected in parasitized hosts at 12 h p.p. and in rBac-eGFP-CvBV805-infected Tn-5B1-4 cells at 72 h.p.i. by using western blots analysis. The size of the protein expressed in the host hemocytes and infected Tn-5B1-4 cells was 17 kDa and 56 kDa (including eGFP), respectively, which nearly corresponded with the predicted molecular weight (14.31 kDa) of CvBV805, suggesting that the protein did not undergo extensive post-translational modification. The protein was confirmed to be present within the nuclear region in hemocytes of the parasitized P. xylostella larvae at 48 h p.p. using confocal laser scanning microscopy

  8. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-11-16

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Identification of Interferon-Stimulated Gene Proteins That Inhibit Human Parainfluenza Virus Type 3.

    Science.gov (United States)

    Rabbani, M A G; Ribaudo, Michael; Guo, Ju-Tao; Barik, Sailen

    2016-12-15

    A major arm of cellular innate immunity is type I interferon (IFN), represented by IFN-α and IFN-β. Type I IFN transcriptionally induces a large number of cellular genes, collectively known as IFN-stimulated gene (ISG) proteins, which act as antivirals. The IFIT (interferon-induced proteins with tetratricopeptide repeats) family proteins constitute a major subclass of ISG proteins and are characterized by multiple tetratricopeptide repeats (TPRs). In this study, we have interrogated IFIT proteins for the ability to inhibit the growth of human parainfluenza virus type 3 (PIV3), a nonsegmented negative-strand RNA virus of the Paramyxoviridae family and a major cause of respiratory disease in children. We found that IFIT1 significantly inhibited PIV3, whereas IFIT2, IFIT3, and IFIT5 were less effective or not at all. In further screening a set of ISG proteins we discovered that several other such proteins also inhibited PIV3, including IFITM1, IDO (indoleamine 2,3-dioxygenase), PKR (protein kinase, RNA activated), and viperin (virus inhibitory protein, endoplasmic reticulum associated, interferon inducible)/Cig5. The antiviral effect of IDO, the enzyme that catalyzes the first step of tryptophan degradation, could be counteracted by tryptophan. These results advance our knowledge of diverse ISG proteins functioning as antivirals and may provide novel approaches against PIV3. The innate immunity of the host, typified by interferon (IFN), is a major antiviral defense. IFN inhibits virus growth by inducing a large number of IFN-stimulated gene (ISG) proteins, several of which have been shown to have specific antiviral functions. Parainfluenza virus type 3 (PIV3) is major pathogen of children, and no reliable vaccine or specific antiviral against it currently exists. In this article, we report several ISG proteins that strongly inhibit PIV3 growth, the use of which may allow a better antiviral regimen targeting PIV3. Copyright © 2016, American Society for Microbiology

  10. Meta-analysis of the effect of overexpression of C-repeat/dehydration-responsive element binding family genes on temperature stress tolerance and related responses

    Science.gov (United States)

    C-repeat/dehydration-responsive element binding proteins are transcription factors that play a critical role in plant response to temperature stress. Over-expression of CBF/DREB genes has been demonstrated to enhance temperature stress tolerance. A series of physiological and biochemical modificat...

  11. RRW: repeated random walks on genome-scale protein networks for local cluster discovery

    Directory of Open Access Journals (Sweden)

    Can Tolga

    2009-09-01

    Full Text Available Abstract Background We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. Results We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL, and find a significant improvement in the RRW clusters' precision and accuracy values. Conclusion RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters.

  12. A human-specific de novo protein-coding gene associated with human brain functions.

    Directory of Open Access Journals (Sweden)

    Chuan-Yun Li

    2010-03-01

    Full Text Available To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203. Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer's brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

  13. The 1.7 Å resolution structure of At2g44920, a pentapeptide-repeat protein in the thylakoid lumen of Arabidopsis thaliana

    International Nuclear Information System (INIS)

    Ni, Shuisong; McGookey, Michael E.; Tinch, Stuart L.; Jones, Alisha N.; Jayaraman, Seetharaman; Tong, Liang; Kennedy, Michael A.

    2011-01-01

    The crystal structure of At2g44920, a pentapeptide repeat protein (PRP) from Arabidopsis thaliana, has been determined at 1.7 Å resolution. The structure represents the first PRP protein whose subcellular localization has been experimentally confirmed to be the thylakoid lumen of a plant species. At2g44920 belongs to a diverse family (Pfam PF00805) of pentapeptide-repeat proteins (PRPs) that are present in all known organisms except yeast. PRPs contain at least eight tandem-repeating sequences of five amino acids with an approximate consensus sequence (STAV)(D/N)(L/F)(S/T/R)(X). Recent crystal structures show that PRPs adopt a highly regular four-sided right-handed β-helical structure consisting mainly of type II and type IV β-turns, sometimes referred to as a repeated five-residue (or Rfr) fold. Among sequenced genomes, PRP genes are most abundant in cyanobacteria, leading to speculation that PRPs play an important role in the unique lifestyle of photosynthetic cyanobacteria. Despite the recent structural characterization of several cyanobacterial PRPs, most of their functions remain unknown. Plants, whose chloroplasts are of cyanobacterial origin, have only four PRP genes in their genomes. At2g44920 is one of three PRPs located in the thylakoid lumen. Here, the crystal structure of a double methionine mutant of residues 81–224 of At2g44920, the naturally processed fragment of one of its full-length isoforms, is reported at 1.7 Å resolution. The structure of At2g44920 consists of the characteristic Rfr fold with five uninterrupted coils made up of 25 pentapeptide repeats and α-helical elements capping both termini. A disulfide bridge links the two α-helices with a conserved loop between the helical elements at its C-terminus. This structure represents the first structure of a PRP protein whose subcellular location has been experimentally confirmed to be the thylakoid lumen in a plant species

  14. Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

    Directory of Open Access Journals (Sweden)

    Janet M. Doolittle-Hall

    2015-11-01

    Full Text Available Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV, hepatitis B virus (HBV or Merkel cell polyomavirus (MCPyV. These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures.

  15. Effects of acute versus repeated cocaine exposure on the expression of endocannabinoid signaling-related proteins in the mouse cerebellum

    Directory of Open Access Journals (Sweden)

    Ana ePalomino

    2014-03-01

    Full Text Available Growing awareness of cerebellar involvement in addiction is based on the cerebellum’s intermediary position between motor and reward, potentially acting as an interface between motivational and cognitive functions. Here, we examined the impact of acute and repeated cocaine exposure on the two main signaling systems in the mouse cerebellum: the endocannabinoid (eCB and glutamate systems. To this end, we investigated whether eCB signaling-related gene and protein expression (CB1 receptors and enzymes that produce (DAGLα/β and NAPE-PLD and degrade (MAGL and FAAH eCB were altered. In addition, we analyzed the gene expression of relevant components of the glutamate signaling system (glutamate synthesizing enzymes LGA and KGA, mGluR3/5 metabotropic receptors, and NR1/2A/2B/2C-NMDA and GluR1/2/3/4-AMPA ionotropic receptor subunits and the gene expression of tyrosine hydroxylase (TH, the rate-limiting enzyme in catecholamine biosynthesis, because noradrenergic terminals innervate the cerebellar cortex. Results indicated that acute cocaine exposure decreased DAGLα expression, suggesting a down-regulation of 2-AG production, as well as gene expression of TH, KGA, mGluR3 and all ionotropic receptor subunits analyzed in the cerebellum. The acquisition of conditioned locomotion and sensitization after repeated cocaine exposure were associated with an increased NAPE-PLD/FAAH ratio, suggesting enhanced anandamide production, and a decreased DAGLβ/MAGL ratio, suggesting decreased 2-AG generation. Repeated cocaine also increased LGA gene expression but had no effect on glutamate receptors. These findings indicate that acute cocaine modulates the expression of the eCB and glutamate systems. Repeated cocaine results in normalization of glutamate receptor expression, although sustained changes in eCB is observed. We suggest that cocaine-induced alterations to cerebellar eCB should be considered when analyzing the adaptations imposed by psychostimulants that

  16. Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using Gene Composer

    Directory of Open Access Journals (Sweden)

    Walchli John

    2009-04-01

    Full Text Available Abstract Background With the goal of improving yield and success rates of heterologous protein production for structural studies we have developed the database and algorithm software package Gene Composer. This freely available electronic tool facilitates the information-rich design of protein constructs and their engineered synthetic gene sequences, as detailed in the accompanying manuscript. Results In this report, we compare heterologous protein expression levels from native sequences to that of codon engineered synthetic gene constructs designed by Gene Composer. A test set of proteins including a human kinase (P38α, viral polymerase (HCV NS5B, and bacterial structural protein (FtsZ were expressed in both E. coli and a cell-free wheat germ translation system. We also compare the protein expression levels in E. coli for a set of 11 different proteins with greatly varied G:C content and codon bias. Conclusion The results consistently demonstrate that protein yields from codon engineered Gene Composer designs are as good as or better than those achieved from the synonymous native genes. Moreover, structure guided N- and C-terminal deletion constructs designed with the aid of Gene Composer can lead to greater success in gene to structure work as exemplified by the X-ray crystallographic structure determination of FtsZ from Bacillus subtilis. These results validate the Gene Composer algorithms, and suggest that using a combination of synthetic gene and protein construct engineering tools can improve the economics of gene to structure research.

  17. Study of the repeatability of histone genes in the ploidy series of wheat and Aegilops

    International Nuclear Information System (INIS)

    Vakhitov, V.A.; Kulikov, A.M.

    1986-01-01

    The hDNA content and number of histone genes in the genomes of different wheat and Aegilops species have been determined by molecular hybridization of DNA with 125 I-histone DNA of Drosophila (L-repeat) on nitrocellulose filters. It has been demonstrated that the proportion of hDNA in the total DNA of diploid and polyploid wheat species is (1.3-7.7) x 10 -3 % (57-850 genes), and in the ploidy series of Aegilops species (2.0-8.0) x 10 -3 % (89-780 genes). The repeatability of the histone genes generally increases at each ploidy level in the species with higher DNA content. At the same time, it has been demonstrated that the DNA content is not the only factor determining repeatability of the histone genes, as some diploid and allopolyploid species have similar number of these genes. It has been concluded that genetic mechanisms are involved in the regulation of the number of histone genes

  18. Genome-wide analyses and functional classification of proline repeat-rich proteins: potential role of eIF5A in eukaryotic evolution.

    Directory of Open Access Journals (Sweden)

    Ajeet Mandal

    Full Text Available The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro and/or PPG (Pro-Pro-Gly-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids, only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.

  19. Structure of thrombospondin type 3 repeats in bacterial outer membrane protein A reveals its intra-repeat disulfide bond-dependent calcium-binding capability

    Energy Technology Data Exchange (ETDEWEB)

    Dai, Shuyan; Sun, Cancan; Tan, Kemin; Ye, Sheng; Zhang, Rongguang

    2017-09-01

    Eukaryotic thrombospondin type 3 repeat (TT3R) is an efficient calcium ion (Ca2+) binding motif only found in mammalian thrombospondin family. TT3R has also been found in prokaryotic cellulase Cel5G, which was thought to forfeit the Ca2+-binding capability due to the formation of intra-repeat disulfide bonds, instead of the inter-repeat ones possessed by eukaryotic TT3Rs. In this study, we have identified an enormous number of prokaryotic TT3R-containing proteins belonging to several different protein families, including outer membrane protein A (OmpA), an important structural protein connecting the outer membrane and the periplasmic peptidoglycan layer in gram-negative bacteria. Here, we report the crystal structure of the periplasmic region of OmpA from Capnocytophaga gingivalis, which contains a linker region comprising five consecutive TT3Rs. The structure of OmpA-TT3R exhibits a well-ordered architecture organized around two tightly-coordinated Ca2+ and confirms the presence of abnormal intra-repeat disulfide bonds. Further mutagenesis studies showed that the Ca2+-binding capability of OmpA-TT3R is indeed dependent on the proper formation of intra-repeat disulfide bonds, which help to fix a conserved glycine residue at its proper position for Ca2+ coordination. Additionally, despite lacking inter repeat disulfide bonds, the interfaces between adjacent OmpA-TT3Rs are enhanced by both hydrophobic and conserved aromatic-proline interactions.

  20. Alanine repeats influence protein localization in splicing speckles and paraspeckles.

    Science.gov (United States)

    Chang, Shuo-Hsiu; Chang, Wei-Lun; Lu, Chia-Chen; Tarn, Woan-Yuh

    2014-12-16

    Mammalian splicing regulatory protein RNA-binding motif protein 4 (RBM4) has an alanine repeat-containing C-terminal domain (CAD) that confers both nuclear- and splicing speckle-targeting activities. Alanine-repeat expansion has pathological potential. Here we show that the alanine-repeat tracts influence the subnuclear targeting properties of the RBM4 CAD in cultured human cells. Notably, truncation of the alanine tracts redistributed a portion of RBM4 to paraspeckles. The alanine-deficient CAD was sufficient for paraspeckle targeting. On the other hand, alanine-repeat expansion reduced the mobility of RBM4 and impaired its splicing activity. We further took advantage of the putative coactivator activator (CoAA)-RBM4 conjoined splicing factor, CoAZ, to investigate the function of the CAD in subnuclear targeting. Transiently expressed CoAZ formed discrete nuclear foci that emerged and subsequently separated-fully or partially-from paraspeckles. Alanine-repeat expansion appeared to prevent CoAZ separation from paraspeckles, resulting in their complete colocalization. CoAZ foci were dynamic but, unlike paraspeckles, were resistant to RNase treatment. Our results indicate that the alanine-rich CAD, in conjunction with its conjoined RNA-binding domain(s), differentially influences the subnuclear localization and biogenesis of RBM4 and CoAZ. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

    Science.gov (United States)

    Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

    2017-04-01

    There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

  2. Solution properties of the archaeal CRISPR DNA repeat-binding homeodomain protein Cbp2

    DEFF Research Database (Denmark)

    Kenchappa, Chandra; Heiðarsson, Pétur Orri; Kragelund, Birthe

    2013-01-01

    Clustered regularly interspaced short palindromic repeats (CRISPR) form the basis of diverse adaptive immune systems directed primarily against invading genetic elements of archaea and bacteria. Cbp1 of the crenarchaeal thermoacidophilic order Sulfolobales, carrying three imperfect repeats, binds...... specifically to CRISPR DNA repeats and has been implicated in facilitating production of long transcripts from CRISPR loci. Here, a second related class of CRISPR DNA repeat-binding protein, denoted Cbp2, is characterized that contains two imperfect repeats and is found amongst members of the crenarchaeal...... in facilitating high affinity DNA binding of Cbp2 by tethering the two domains. Structural studies on mutant proteins provide support for Cys(7) and Cys(28) enhancing high thermal stability of Cbp2(Hb) through disulphide bridge formation. Consistent with their proposed CRISPR transcriptional regulatory role, Cbp2...

  3. Inferring repeat-protein energetics from evolutionary information.

    Directory of Open Access Journals (Sweden)

    Rocío Espada

    2017-06-01

    Full Text Available Natural protein sequences contain a record of their history. A common constraint in a given protein family is the ability to fold to specific structures, and it has been shown possible to infer the main native ensemble by analyzing covariations in extant sequences. Still, many natural proteins that fold into the same structural topology show different stabilization energies, and these are often related to their physiological behavior. We propose a description for the energetic variation given by sequence modifications in repeat proteins, systems for which the overall problem is simplified by their inherent symmetry. We explicitly account for single amino acid and pair-wise interactions and treat higher order correlations with a single term. We show that the resulting evolutionary field can be interpreted with structural detail. We trace the variations in the energetic scores of natural proteins and relate them to their experimental characterization. The resulting energetic evolutionary field allows the prediction of the folding free energy change for several mutants, and can be used to generate synthetic sequences that are statistically indistinguishable from the natural counterparts.

  4. The mate recognition protein gene mediates reproductive isolation and speciation in the Brachionus plicatilis cryptic species complex.

    Science.gov (United States)

    Gribble, Kristin E; Mark Welch, David B

    2012-08-01

    Chemically mediated prezygotic barriers to reproduction likely play an important role in speciation. In facultatively sexual monogonont rotifers from the Brachionus plicatilis cryptic species complex, mate recognition of females by males is mediated by the Mate Recognition Protein (MRP), a globular glycoprotein on the surface of females, encoded by the mmr-b gene family. In this study, we sequenced mmr-b copies from 27 isolates representing 11 phylotypes of the B. plicatilis species complex, examined the mode of evolution and selection of mmr-b, and determined the relationship between mmr-b genetic distance and mate recognition among isolates. Isolates of the B. plicatilis species complex have 1-4 copies of mmr-b, each composed of 2-9 nearly identical tandem repeats. The repeats within a gene copy are generally more similar than are gene copies among phylotypes, suggesting concerted evolution. Compared to housekeeping genes from the same isolates, mmr-b has accumulated only half as many synonymous differences but twice as many non-synonymous differences. Most of the amino acid differences between repeats appear to occur on the outer face of the protein, and these often result in changes in predicted patterns of phosphorylation. However, we found no evidence of positive selection driving these differences. Isolates with the most divergent copies were unable to mate with other isolates and rarely self-crossed. Overall the degree of mate recognition was significantly correlated with the genetic distance of mmr-b. Discrimination of compatible mates in the B. plicatilis species complex is determined by proteins encoded by closely related copies of a single gene, mmr-b. While concerted evolution of the tandem repeats in mmr-b may function to maintain identity, it can also lead to the rapid spread of a mutation through all copies in the genome and thus to reproductive isolation. The mmr-b gene is evolving rapidly, and novel alleles may be maintained and increase in

  5. The mate recognition protein gene mediates reproductive isolation and speciation in the Brachionus plicatilis cryptic species complex

    Directory of Open Access Journals (Sweden)

    Gribble Kristin E

    2012-08-01

    Full Text Available Abstract Background Chemically mediated prezygotic barriers to reproduction likely play an important role in speciation. In facultatively sexual monogonont rotifers from the Brachionus plicatilis cryptic species complex, mate recognition of females by males is mediated by the Mate Recognition Protein (MRP, a globular glycoprotein on the surface of females, encoded by the mmr-b gene family. In this study, we sequenced mmr-b copies from 27 isolates representing 11 phylotypes of the B. plicatilis species complex, examined the mode of evolution and selection of mmr-b, and determined the relationship between mmr-b genetic distance and mate recognition among isolates. Results Isolates of the B. plicatilis species complex have 1–4 copies of mmr-b, each composed of 2–9 nearly identical tandem repeats. The repeats within a gene copy are generally more similar than are gene copies among phylotypes, suggesting concerted evolution. Compared to housekeeping genes from the same isolates, mmr-b has accumulated only half as many synonymous differences but twice as many non-synonymous differences. Most of the amino acid differences between repeats appear to occur on the outer face of the protein, and these often result in changes in predicted patterns of phosphorylation. However, we found no evidence of positive selection driving these differences. Isolates with the most divergent copies were unable to mate with other isolates and rarely self-crossed. Overall the degree of mate recognition was significantly correlated with the genetic distance of mmr-b. Conclusions Discrimination of compatible mates in the B. plicatilis species complex is determined by proteins encoded by closely related copies of a single gene, mmr-b. While concerted evolution of the tandem repeats in mmr-b may function to maintain identity, it can also lead to the rapid spread of a mutation through all copies in the genome and thus to reproductive isolation. The mmr-b gene is evolving

  6. Short tandem repeats in CdLS-causing genes: distribution and ...

    Indian Academy of Sciences (India)

    and SMC3, as all STRs for these genes fall in noncoding region only. ... This indicates that more repeated STRs are at the risk of replication ... patients versus controls. ... ing from a balance between slippage events and point mutations. Proc.

  7. Differential gene expression profiling of endometrium during the mid-luteal phase of the estrous cycle between a repeat breeder (RB) and non-RB cows.

    Science.gov (United States)

    Hayashi, Ken-Go; Hosoe, Misa; Kizaki, Keiichiro; Fujii, Shiori; Kanahara, Hiroko; Takahashi, Toru; Sakumoto, Ryosuke

    2017-03-23

    Repeat breeding directly affects reproductive efficiency in cattle due to an increase in services per conception and calving interval. This study aimed to investigate whether changes in endometrial gene expression profile are involved in repeat breeding in cows. Differential gene expression profiles of the endometrium were investigated during the mid-luteal phase of the estrous cycle between repeat breeder (RB) and non-RB cows using microarray analysis. The caruncular (CAR) and intercaruncular (ICAR) endometrium of both ipsilateral and contralateral uterine horns to the corpus luteum were collected from RB (inseminated at least three times but not pregnant) and non-RB cows on Day 15 of the estrous cycle (4 cows/group). Global gene expression profiles of these endometrial samples were analyzed with a 15 K custom-made oligo-microarray for cattle. Immunohistochemistry was performed to investigate the cellular localization of proteins of three identified transcripts in the endometrium. Microarray analysis revealed that 405 and 397 genes were differentially expressed in the CAR and ICAR of the ipsilateral uterine horn of RB, respectively when compared with non-RB cows. In the contralateral uterine horn, 443 and 257 differentially expressed genes were identified in the CAR and ICAR of RB, respectively when compared with non-RB cows. Gene ontology analysis revealed that genes involved in development and morphogenesis were mainly up-regulated in the CAR of RB cows. In the ICAR of both the ipsilateral and contralateral uterine horns, genes related to the metabolic process were predominantly enriched in the RB cows when compared with non-RB cows. In the analysis of the whole uterus (combining the data above four endometrial compartments), RB cows showed up-regulation of 37 genes including PRSS2, GSTA3 and PIPOX and down-regulation of 39 genes including CHGA, KRT35 and THBS4 when compared with non-RB cows. Immunohistochemistry revealed that CHGA, GSTA3 and PRSS2 proteins

  8. Constraints and consequences of the emergence of amino acid repeats in eukaryotic proteins.

    Science.gov (United States)

    Chavali, Sreenivas; Chavali, Pavithra L; Chalancon, Guilhem; de Groot, Natalia Sanchez; Gemayel, Rita; Latysheva, Natasha S; Ing-Simmons, Elizabeth; Verstrepen, Kevin J; Balaji, Santhanam; Babu, M Madan

    2017-09-01

    Proteins with amino acid homorepeats have the potential to be detrimental to cells and are often associated with human diseases. Why, then, are homorepeats prevalent in eukaryotic proteomes? In yeast, homorepeats are enriched in proteins that are essential and pleiotropic and that buffer environmental insults. The presence of homorepeats increases the functional versatility of proteins by mediating protein interactions and facilitating spatial organization in a repeat-dependent manner. During evolution, homorepeats are preferentially retained in proteins with stringent proteostasis, which might minimize repeat-associated detrimental effects such as unregulated phase separation and protein aggregation. Their presence facilitates rapid protein divergence through accumulation of amino acid substitutions, which often affect linear motifs and post-translational-modification sites. These substitutions may result in rewiring protein interaction and signaling networks. Thus, homorepeats are distinct modules that are often retained in stringently regulated proteins. Their presence facilitates rapid exploration of the genotype-phenotype landscape of a population, thereby contributing to adaptation and fitness.

  9. Lack of variation of ATTCT pentanucleotide repeats at ATXN10 gene ...

    Indian Academy of Sciences (India)

    ized incoordination of gait, speech and limb movements. (Tang et ... ataxia (SCA); ATXN10 gene; nucleotide repeat; mutation detection; capillary electrophoresis; human genetics. ... Southern blot analysis using EcoRI was performed to distin-.

  10. Promoter Engineering Reveals the Importance of Heptameric Direct Repeats for DNA Binding by Streptomyces Antibiotic Regulatory Protein-Large ATP-Binding Regulator of the LuxR Family (SARP-LAL) Regulators in Streptomyces natalensis.

    Science.gov (United States)

    Barreales, Eva G; Vicente, Cláudia M; de Pedro, Antonio; Santos-Aberturas, Javier; Aparicio, Jesús F

    2018-05-15

    The biosynthesis of small-size polyene macrolides is ultimately controlled by a couple of transcriptional regulators that act in a hierarchical way. A Streptomyces antibiotic regulatory protein-large ATP-binding regulator of the LuxR family (SARP-LAL) regulator binds the promoter of a PAS-LuxR regulator-encoding gene and activates its transcription, and in turn, the gene product of the latter activates transcription from various promoters of the polyene gene cluster directly. The primary operator of PimR, the archetype of SARP-LAL regulators, contains three heptameric direct repeats separated by four-nucleotide spacers, but the regulator can also bind a secondary operator with only two direct repeats separated by a 3-nucleotide spacer, both located in the promoter region of its unique target gene, pimM A similar arrangement of operators has been identified for PimR counterparts encoded by gene clusters for different antifungal secondary metabolites, including not only polyene macrolides but peptidyl nucleosides, phoslactomycins, or cycloheximide. Here, we used promoter engineering and quantitative transcriptional analyses to determine the contributions of the different heptameric repeats to transcriptional activation and final polyene production. Optimized promoters have thus been developed. Deletion studies and electrophoretic mobility assays were used for the definition of DNA-binding boxes formed by 22-nucleotide sequences comprising two conserved heptameric direct repeats separated by four-nucleotide less conserved spacers. The cooperative binding of PimR SARP appears to be the mechanism involved in the binding of regulator monomers to operators, and at least two protein monomers are required for efficient binding. IMPORTANCE Here, we have shown that a modulation of the production of the antifungal pimaricin in Streptomyces natalensis can be accomplished via promoter engineering of the PAS-LuxR transcriptional activator pimM The expression of this gene is

  11. Comparison of serum leptin, glucose, total cholesterol and total protein levels in fertile and repeat breeder cows

    Directory of Open Access Journals (Sweden)

    Saime Guzel

    2014-12-01

    Full Text Available In the present study we measured serum glucose, leptin, total cholesterol and total protein concentrations in repeat breeder cows and compared them with fertile cows. For this aim, 20 repeat breeder cows and 20 fertile cows were used as material. Repeat breeder cows were found to have lower levels of leptin and glucose as compared with fertile ones. No significant differences in total cholesterol and total protein levels were observed between the two groups. No significant correlation of leptin with glucose, total cholesterol and total protein was observed in fertile and repeat breeder cows. Low concentrations of glucose and leptin can have some effects on reproductive problems as repeat breeder and help to understand potential mechanisms impairing fertility in repeat breeder cows.

  12. A diverse host thrombospondin-type-1 repeat protein repertoire promotes symbiont colonization during establishment of cnidarian-dinoflagellate symbiosis.

    Science.gov (United States)

    Neubauer, Emilie-Fleur; Poole, Angela Z; Neubauer, Philipp; Detournay, Olivier; Tan, Kenneth; Davy, Simon K; Weis, Virginia M

    2017-05-08

    The mutualistic endosymbiosis between cnidarians and dinoflagellates is mediated by complex inter-partner signaling events, where the host cnidarian innate immune system plays a crucial role in recognition and regulation of symbionts. To date, little is known about the diversity of thrombospondin-type-1 repeat (TSR) domain proteins in basal metazoans or their potential role in regulation of cnidarian-dinoflagellate mutualisms. We reveal a large and diverse repertoire of TSR proteins in seven anthozoan species, and show that in the model sea anemone Aiptasia pallida the TSR domain promotes colonization of the host by the symbiotic dinoflagellate Symbiodinium minutum . Blocking TSR domains led to decreased colonization success, while adding exogenous TSRs resulted in a 'super colonization'. Furthermore, gene expression of TSR proteins was highest at early time-points during symbiosis establishment. Our work characterizes the diversity of cnidarian TSR proteins and provides evidence that these proteins play an important role in the establishment of cnidarian-dinoflagellate symbiosis.

  13. A large complement of the predicted Arabidopsis ARM repeat proteins are members of the U-box E3 ubiquitin ligase family.

    Science.gov (United States)

    Mudgil, Yashwanti; Shiu, Shin-Han; Stone, Sophia L; Salt, Jennifer N; Goring, Daphne R

    2004-01-01

    The Arabidopsis genome was searched to identify predicted proteins containing armadillo (ARM) repeats, a motif known to mediate protein-protein interactions in a number of different animal proteins. Using domain database predictions and models generated in this study, 108 Arabidopsis proteins were identified that contained a minimum of two ARM repeats with the majority of proteins containing four to eight ARM repeats. Clustering analysis showed that the 108 predicted Arabidopsis ARM repeat proteins could be divided into multiple groups with wide differences in their domain compositions and organizations. Interestingly, 41 of the 108 Arabidopsis ARM repeat proteins contained a U-box, a motif present in a family of E3 ligases, and these proteins represented the largest class of Arabidopsis ARM repeat proteins. In 14 of these U-box/ARM repeat proteins, there was also a novel conserved domain identified in the N-terminal region. Based on the phylogenetic tree, representative U-box/ARM repeat proteins were selected for further study. RNA-blot analyses revealed that these U-box/ARM proteins are expressed in a variety of tissues in Arabidopsis. In addition, the selected U-box/ARM proteins were found to be functional E3 ubiquitin ligases. Thus, these U-box/ARM proteins represent a new family of E3 ligases in Arabidopsis.

  14. MNS16A tandem repeat minisatellite of human telomerase gene: functional studies in colorectal, lung and prostate cancer.

    Science.gov (United States)

    Hofer, Philipp; Zöchmeister, Cornelia; Behm, Christian; Brezina, Stefanie; Baierl, Andreas; Doriguzzi, Angelina; Vanas, Vanita; Holzmann, Klaus; Sutterlüty-Fall, Hedwig; Gsur, Andrea

    2017-04-25

    MNS16A, a functional polymorphic tandem repeat minisatellite, is located in the promoter region of an antisense transcript of the human telomerase reverse transcriptase gene. MNS16A promoter activity depends on the variable number of tandem repeats (VNTR) presenting varying numbers of transcription factor binding sites for GATA binding protein 1. Although MNS16A has been investigated in multiple cancer epidemiology studies with incongruent findings, functional data of only two VNTRs (VNTR-243 and VNTR-302) were available thus far, linking the shorter VNTR to higher promoter activity.For the first time, we investigated promoter activity of all six VNTRs of MNS16A in cell lines of colorectal, lung and prostate cancer using Luciferase reporter assay. In all investigated cell lines shorter VNTRs showed higher promoter activity. While this anticipated indirect linear relationship was affirmed for colorectal cancer SW480 (P = 0.006), a piecewise linear regression model provided significantly better model fit in lung cancer A-427 (P = 6.9 × 10-9) and prostate cancer LNCaP (P = 0.039). In silico search for transcription factor binding sites in MNS16A core repeat element suggested a higher degree of complexity involving X-box binding protein 1, general transcription factor II-I, and glucocorticoid receptor alpha in addition to GATA binding protein 1.Further functional studies in additional cancers are requested to extend our knowledge of MNS16A functionality uncovering potential cancer type-specific differences. Risk alleles may vary in different malignancies and their determination in vitro could be relevant for interpretation of genotype data.

  15. ACCA phosphopeptide recognition by the BRCT repeats of BRCA1.

    Science.gov (United States)

    Ray, Hind; Moreau, Karen; Dizin, Eva; Callebaut, Isabelle; Venezia, Nicole Dalla

    2006-06-16

    The tumour suppressor gene BRCA1 encodes a 220 kDa protein that participates in multiple cellular processes. The BRCA1 protein contains a tandem of two BRCT repeats at its carboxy-terminal region. The majority of disease-associated BRCA1 mutations affect this region and provide to the BRCT repeats a central role in the BRCA1 tumour suppressor function. The BRCT repeats have been shown to mediate phospho-dependant protein-protein interactions. They recognize phosphorylated peptides using a recognition groove that spans both BRCT repeats. We previously identified an interaction between the tandem of BRCA1 BRCT repeats and ACCA, which was disrupted by germ line BRCA1 mutations that affect the BRCT repeats. We recently showed that BRCA1 modulates ACCA activity through its phospho-dependent binding to ACCA. To delineate the region of ACCA that is crucial for the regulation of its activity by BRCA1, we searched for potential phosphorylation sites in the ACCA sequence that might be recognized by the BRCA1 BRCT repeats. Using sequence analysis and structure modelling, we proposed the Ser1263 residue as the most favourable candidate among six residues, for recognition by the BRCA1 BRCT repeats. Using experimental approaches, such as GST pull-down assay with Bosc cells, we clearly showed that phosphorylation of only Ser1263 was essential for the interaction of ACCA with the BRCT repeats. We finally demonstrated by immunoprecipitation of ACCA in cells, that the whole BRCA1 protein interacts with ACCA when phosphorylated on Ser1263.

  16. Analysis of tandem repeat units of the promoter of capsanthin/capsorubin synthase (Ccs) gene in pepper fruit.

    Science.gov (United States)

    Tian, Shi-Lin; Li, Zheng; Li, Li; Shah, S N M; Gong, Zhen-Hui

    2017-07-01

    Capsanthin/capsorubin synthase ( Ccs ) gene is a key gene that regulates the synthesis of capsanthin and the development of red coloration in pepper fruits. There are three tandem repeat units in the promoter region of Ccs , but the potential effects of the number of repetitive units on the transcriptional regulation of Ccs has been unclear. In the present study, expression vectors carrying different numbers of repeat units of the Ccs promoter were constructed, and the transient expression of the β-glucuronidase ( GUS ) gene was used to detect differences in expression levels associated with the promoter fragments. These repeat fragments and the plant expression vector PBI121 containing the 35s CaMV promoter were ligated to form recombinant vectors that were transfected into Agrobacterium tumefaciens GV3101. A fluorescence spectrophotometer was used to analyze the expression associated with the various repeat units. It was concluded that the constructs containing at least one repeat were associated with GUS expression, though they did not differ from one another. This repeating unit likely plays a role in transcription and regulation of Ccs expression.

  17. Tetranucleotide repeat polymorphism at the human prostatic acid phosphatase (ACPP) gene

    Energy Technology Data Exchange (ETDEWEB)

    Polymeropoulos, M H; Xiao, Hong; Rath, D S; Merril, C R [National Inst. of Mental Health Neuroscience Center, Washington, DC (United States)

    1991-09-11

    The polymorphic (AAAT){sub n} repeat begins at base pair 2342 of the human prostatic acid phosphatase gene on chromosome 3q21-qter. The polymorphism can be typed using the polymerase chain reaction (PCR) as described previously. The predicted length of the amplified sequence was 275 bp. Co-dominant segregation was observed in two informative families. The human prostatic acid phosphatase gene has been assigned to chromosome 3q21-qter.

  18. Deletion of Repeats in the Alpha C Protein Enhances the Pathogenicity of Group B Streptococci in Immune Mice

    OpenAIRE

    Gravekamp, C.; Rosner, Bernard; Madoff, L. C.

    1998-01-01

    The alpha C protein is a protective surface-associated antigen of group B streptococci (GBS). The prototype alpha C protein of GBS (strain A909) contains nine identical tandem repeats, each comprising 82 amino acids, flanked by N- and C-terminal domains. Clinical isolates of GBS show variable numbers of repeats with a normal distribution and a median of 9 to 10 repeats. Here, we show that escape mutants of GBS expressing one-repeat alpha C protein were 100-fold more pathogenic than GBS expres...

  19. Gene Expression Responses to FUS, EWS, and TAF15 Reduction and Stress Granule Sequestration Analyses Identifies FET-Protein Non-Redundant Functions

    DEFF Research Database (Denmark)

    Blechingberg, Jenny; Luo, Yonglun; Bolund, Lars

    2012-01-01

    The FET family of proteins is composed of FUS/TLS, EWS/EWSR1, and TAF15 and possesses RNA- and DNA-binding capacities. The FET-proteins are involved in transcriptional regulation and RNA processing, and FET-gene deregulation is associated with development of cancer and protein granule formations...... in amyotrophic lateral sclerosis, frontotemporal lobar degeneration, and trinucleotide repeat expansion diseases. We here describe a comparative characterization of FET-protein localization and gene regulatory functions. We show that FUS and TAF15 locate to cellular stress granules to a larger extend than EWS....... FET-proteins have no major importance for stress granule formation and cellular stress responses, indicating that FET-protein stress granule association most likely is a downstream response to cellular stress. Gene expression analyses showed that the cellular response towards FUS and TAF15 reduction...

  20. CAG repeat length variation in the polymerase gamma (POLG) gene: effect on semen quality

    NARCIS (Netherlands)

    Westerveld, G. H.; Kaaij-Visser, L.; Tanck, M.; van der Veen, F.; Repping, S.

    2008-01-01

    Several case-control studies have investigated the effect of CAG repeat length variation in the POLG gene on male fertility and semen quality. Some described an association between the homozygous not10 CAG-repeat genotype and male subfertility and/or reduced semen quality, whereas others did not.

  1. A novel tetratricopeptide repeat (TPR containing PP5 serine/threonine protein phosphatase in the malaria parasite, Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Adams Brian

    2001-11-01

    Full Text Available Abstract Background The malarial parasite, Plasmodium falciparum (Pf, is responsible for nearly 2 million deaths worldwide. However, the mechanisms of cellular signaling in the parasite remain largely unknown. Recent discovery of a few protein kinases and phosphatases point to a thriving reversible phosphorylation system in the parasite, although their function and regulation need to be determined. Results We provide biochemical and sequence evidence for a protein serine/threonine phosphatase type PP5 in Plasmodium falciparum, and named it PfPP5. The 594-amino acid polypeptide was encoded by a 1785 nucleotide long intronless gene in the parasite. The recombinant protein, expressed in bacteria, was indistinguishable from native PfPP5. Sequencing comparison indicated that the extra-long N-terminus of PfPP5 outside the catalytic core contained four tetratricopeptide repeats (TPRs, compared to three such repeats in other PP5 phosphatases. The PfPP5 N-terminus was required for stimulation of the phosphatase activity by polyunsaturated fatty acids. Co-immunoprecipitation demonstrated an interaction between native PfPP5 and Pf heat shock protein 90 (hsp90. PfPP5 was expressed in all the asexual erythrocytic stages of the parasite, and was moderately sensitive to okadaic acid. Conclusions This is the first example of a TPR-domain protein in the Apicomplexa family of parasites. Since TPR domains play important roles in protein-protein interaction, especially relevant to the regulation of PP5 phosphatases, PfPP5 is destined to have a definitive role in parasitic growth and signaling pathways. This is exemplified by the interaction between PfPP5 and the cognate chaperone hsp90.

  2. Large Polyglutamine Repeats Cause Muscle Degeneration in SCA17 Mice

    Directory of Open Access Journals (Sweden)

    Shanshan Huang

    2015-10-01

    Full Text Available In polyglutamine (polyQ diseases, large polyQ repeats cause juvenile cases with different symptoms than those of adult-onset patients, who carry smaller expanded polyQ repeats. The mechanisms behind the differential pathology mediated by different polyQ repeat lengths remain unknown. By studying knockin mouse models of spinal cerebellar ataxia-17 (SCA17, we found that a large polyQ (105 glutamines in the TATA-box-binding protein (TBP preferentially causes muscle degeneration and reduces the expression of muscle-specific genes. Direct expression of TBP with different polyQ repeats in mouse muscle revealed that muscle degeneration is mediated only by the large polyQ repeats. Different polyQ repeats differentially alter TBP’s interaction with neuronal and muscle-specific transcription factors. As a result, the large polyQ repeat decreases the association of MyoD with TBP and DNA promoters. Our findings suggest that specific alterations in protein interactions by large polyQ repeats may account for the unique pathology in juvenile polyQ diseases.

  3. Large Polyglutamine Repeats Cause Muscle Degeneration in SCA17 Mice

    Science.gov (United States)

    Huang, Shanshan; Yang, Su; Guo, Jifeng; Yan, Sen; Gaertig, Marta A.; Li, Shihua; Li, Xiao-Jiang

    2015-01-01

    SUMMARY In polyglutamine (polyQ) diseases, large polyQ repeats cause juvenile cases with different symptoms than adult-onset patients, who carry smaller expanded polyQ repeats. The mechanisms behind the differential pathology mediated by different polyQ repeat lengths remain unknown. By studying knock-in mouse models of spinal cerebellar ataxia-17 (SCA17), we found that a large polyQ (105 glutamines) in the TATA box-binding protein (TBP) preferentially causes muscle degeneration and reduces the expression of muscle-specific genes. Direct expression of TBP with different polyQ repeats in mouse muscle revealed that muscle degeneration is mediated only by the large polyQ repeats. Different polyQ repeats differentially alter TBP’s interaction with neuronal and muscle-specific transcription factors. As a result, the large polyQ repeat decreases the association of MyoD with TBP and DNA promoters. Our findings suggest that specific alterations in protein interactions by large polyQ repeats may account for the unique pathology in juvenile polyQ diseases. PMID:26387956

  4. Programmable DNA-binding proteins from Burkholderia provide a fresh perspective on the TALE-like repeat domain.

    Science.gov (United States)

    de Lange, Orlando; Wolf, Christina; Dietze, Jörn; Elsaesser, Janett; Morbitzer, Robert; Lahaye, Thomas

    2014-06-01

    The tandem repeats of transcription activator like effectors (TALEs) mediate sequence-specific DNA binding using a simple code. Naturally, TALEs are injected by Xanthomonas bacteria into plant cells to manipulate the host transcriptome. In the laboratory TALE DNA binding domains are reprogrammed and used to target a fused functional domain to a genomic locus of choice. Research into the natural diversity of TALE-like proteins may provide resources for the further improvement of current TALE technology. Here we describe TALE-like proteins from the endosymbiotic bacterium Burkholderia rhizoxinica, termed Bat proteins. Bat repeat domains mediate sequence-specific DNA binding with the same code as TALEs, despite less than 40% sequence identity. We show that Bat proteins can be adapted for use as transcription factors and nucleases and that sequence preferences can be reprogrammed. Unlike TALEs, the core repeats of each Bat protein are highly polymorphic. This feature allowed us to explore alternative strategies for the design of custom Bat repeat arrays, providing novel insights into the functional relevance of non-RVD residues. The Bat proteins offer fertile grounds for research into the creation of improved programmable DNA-binding proteins and comparative insights into TALE-like evolution. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Acute and repeated ECS treatment increases CRF, POMC and PENK gene expression in selected regions of the rat hypothalamus.

    Science.gov (United States)

    Garcia-Garcia, L; Llewellyn-Jones, V; Fernandez Fernandez, I; Fuentes, J A; Manzanares, J

    1998-01-05

    The purpose of this study was to investigate the effects of acute and repeated electroconvulsive shock (ECS) on corticotropin releasing factor (CRF), proopiomelanocortin (POMC) and proenkephalin (PENK) gene expression in selected regions of the brain and pituitary of the rat. Acute ECS increased CRF gene expression in the paraventricular nucleus (PVN) by 20%, an effect that was further enhanced to 38% when rats received repeated ECS treatment. Acute and repeated ECS increased POMC gene expression in the arcuate nucleus (ARC) by 49-59% but failed to alter these mRNA levels in the anterior lobe (AL) of the pituitary gland. PENK gene expression was increased by 35% in the nucleus accumbens (NA) and by 180% the ventromedial nucleus (VMN) after acute or repeated ECS treatment but no significant changes were found in the PVN or striatum (ST). Taken together, these results indicate a differential CRF and opioid gene expression regulation after acute or repeated ECS treatment that may be relevant to their therapeutic or side effects in depression.

  6. Huntingtin gene repeat size variations affect risk of lifetime depression.

    Science.gov (United States)

    Gardiner, Sarah L; van Belzen, Martine J; Boogaard, Merel W; van Roon-Mom, Willeke M C; Rozing, Maarten P; van Hemert, Albert M; Smit, Johannes H; Beekman, Aartjan T F; van Grootheest, Gerard; Schoevers, Robert A; Oude Voshaar, Richard C; Roos, Raymund A C; Comijs, Hannie C; Penninx, Brenda W J H; van der Mast, Roos C; Aziz, N Ahmad

    2017-12-11

    Huntington disease (HD) is a severe neuropsychiatric disorder caused by a cytosine-adenine-guanine (CAG) repeat expansion in the HTT gene. Although HD is frequently complicated by depression, it is still unknown to what extent common HTT CAG repeat size variations in the normal range could affect depression risk in the general population. Using binary logistic regression, we assessed the association between HTT CAG repeat size and depression risk in two well-characterized Dutch cohorts─the Netherlands Study of Depression and Anxiety and the Netherlands Study of Depression in Older Persons─including 2165 depressed and 1058 non-depressed persons. In both cohorts, separately as well as combined, there was a significant non-linear association between the risk of lifetime depression and HTT CAG repeat size in which both relatively short and relatively large alleles were associated with an increased risk of depression (β = -0.292 and β = 0.006 for the linear and the quadratic term, respectively; both P < 0.01 after adjustment for the effects of sex, age, and education level). The odds of lifetime depression were lowest in persons with a HTT CAG repeat size of 21 (odds ratio: 0.71, 95% confidence interval: 0.52 to 0.98) compared to the average odds in the total cohort. In conclusion, lifetime depression risk was higher with both relatively short and relatively large HTT CAG repeat sizes in the normal range. Our study provides important proof-of-principle that repeat polymorphisms can act as hitherto unappreciated but complex genetic modifiers of depression.

  7. Ternary WD40 repeat-containing protein complexes: evolution, composition and roles in plant immunity

    Directory of Open Access Journals (Sweden)

    Jimi C. Miller

    2016-01-01

    Full Text Available Plants, like mammals, rely on their innate immune system to perceive and discriminate among the majority of their microbial pathogens. Unlike mammals, plants respond to this molecular dialogue by unleashing a complex chemical arsenal of defense metabolites to resist or evade pathogen infection. In basal or non-host resistance, plants utilize signal transduction pathways to detect non-self, damaged-self and altered-self-associated molecular patterns and translate these danger signals into largely inducible chemical defenses. The WD40 repeat (WDR-containing proteins Gβ and TTG1 are constituents of two independent ternary protein complexes functioning at opposite ends of a plant immune signaling pathway. Gβ and TTG1 are also encoded by single-copy genes that are ubiquitous in higher plants, implying the limited diversity and functional conservation of their respective complexes. In this review, we summarize what is currently known about the evolutionary history of these WDR-containing ternary complexes, their repertoire and combinatorial interactions, and their downstream effectors and pathways in plant defense.

  8. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases

    Directory of Open Access Journals (Sweden)

    Ma'ayan Avi

    2007-10-01

    Full Text Available Abstract Background In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP, generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or interactions are predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Results Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Conclusion Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.

  9. Genome-wide identification and tissue-specific expression analysis of nucleotide binding site-leucine rich repeat gene family in Cicer arietinum (kabuli chickpea).

    Science.gov (United States)

    Sharma, Ranu; Rawat, Vimal; Suresh, C G

    2017-12-01

    The nucleotide binding site-leucine rich repeat (NBS-LRR) proteins play an important role in the defense mechanisms against pathogens. Using bioinformatics approach, we identified and annotated 104 NBS-LRR genes in chickpea. Phylogenetic analysis points to their diversification into two families namely TIR-NBS-LRR and non-TIR-NBS-LRR. Gene architecture revealed intron gain/loss events in this resistance gene family during their independent evolution into two families. Comparative genomics analysis elucidated its evolutionary relationship with other fabaceae species. Around 50% NBS-LRRs reside in macro-syntenic blocks underlining positional conservation along with sequence conservation of NBS-LRR genes in chickpea. Transcriptome sequencing data provided evidence for their transcription and tissue-specific expression. Four cis -regulatory elements namely WBOX, DRE, CBF, and GCC boxes, that commonly occur in resistance genes, were present in the promoter regions of these genes. Further, the findings will provide a strong background to use candidate disease resistance NBS-encoding genes and identify their specific roles in chickpea.

  10. Tetratricopeptide repeat domain 9A is an interacting protein for tropomyosin Tm5NM-1

    International Nuclear Information System (INIS)

    Cao, Shenglan; Ho, Gay Hui; Lin, Valerie CL

    2008-01-01

    Tetratricopeptide repeat domain 9A (TTC9A) protein is a recently identified protein which contains three tetratricopeptide repeats (TPRs) on its C-terminus. In our previous studies, we have shown that TTC9A was a hormonally-regulated gene in breast cancer cells. In this study, we found that TTC9A was over-expressed in breast cancer tissues compared with the adjacent controls (P < 0.00001), suggesting it might be involved in the breast cancer development process. The aim of the current study was to further elucidate the function of TTC9A. Breast samples from 25 patients including the malignant breast tissues and the adjacent normal tissues were processed for Southern blot analysis. Yeast-two-hybrid assay, GST pull-down assay and co-immunoprecipitation were used to identify and verify the interaction between TTC9A and other proteins. Tropomyosin Tm5NM-1 was identified as one of the TTC9A partner proteins. The interaction between TTC9A and Tm5NM-1 was further confirmed by GST pull-down assay and co-immunoprecipitation in mammalian cells. TTC9A domains required for the interaction were also characterized in this study. The results suggested that the first TPR domain and the linker fragment between the first two TPR domains of TTC9A were important for the interaction with Tm5NM-1 and the second and the third TPR might play an inhibitory role. Since the primary function of tropomyosin is to stabilize actin filament, its interaction with TTC9A may play a role in cell shape and motility. In our previous results, we have found that progesterone-induced TTC9A expression was associated with increased cell motility and cell spreading. We speculate that TTC9A acts as a chaperone protein to facilitate the function of tropomyosins in stabilizing microfilament and it may play a role in cancer cell invasion and metastasis

  11. A Large Complement of the Predicted Arabidopsis ARM Repeat Proteins Are Members of the U-Box E3 Ubiquitin Ligase Family1[w

    Science.gov (United States)

    Mudgil, Yashwanti; Shiu, Shin-Han; Stone, Sophia L.; Salt, Jennifer N.; Goring, Daphne R.

    2004-01-01

    The Arabidopsis genome was searched to identify predicted proteins containing armadillo (ARM) repeats, a motif known to mediate protein-protein interactions in a number of different animal proteins. Using domain database predictions and models generated in this study, 108 Arabidopsis proteins were identified that contained a minimum of two ARM repeats with the majority of proteins containing four to eight ARM repeats. Clustering analysis showed that the 108 predicted Arabidopsis ARM repeat proteins could be divided into multiple groups with wide differences in their domain compositions and organizations. Interestingly, 41 of the 108 Arabidopsis ARM repeat proteins contained a U-box, a motif present in a family of E3 ligases, and these proteins represented the largest class of Arabidopsis ARM repeat proteins. In 14 of these U-box/ARM repeat proteins, there was also a novel conserved domain identified in the N-terminal region. Based on the phylogenetic tree, representative U-box/ARM repeat proteins were selected for further study. RNA-blot analyses revealed that these U-box/ARM proteins are expressed in a variety of tissues in Arabidopsis. In addition, the selected U-box/ARM proteins were found to be functional E3 ubiquitin ligases. Thus, these U-box/ARM proteins represent a new family of E3 ligases in Arabidopsis. PMID:14657406

  12. Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

    International Nuclear Information System (INIS)

    Kudo, Shinichi; Fukuda, Minoru

    1989-01-01

    Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ∼ 1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggest that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication

  13. Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function[OPEN

    Science.gov (United States)

    2018-01-01

    Plants have many, highly variable resistance (R) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize (Zea mays) Hm1, was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. PMID:29382771

  14. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    Science.gov (United States)

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease

  15. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    Directory of Open Access Journals (Sweden)

    Mixon Mark

    2009-04-01

    Full Text Available Abstract Background To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene

  16. Defended to the Nines: 25 Years of Resistance Gene Cloning Identifies Nine Mechanisms for R Protein Function.

    Science.gov (United States)

    Kourelis, Jiorgos; van der Hoorn, Renier A L

    2018-02-01

    Plants have many, highly variable resistance ( R ) gene loci, which provide resistance to a variety of pathogens. The first R gene to be cloned, maize ( Zea mays ) Hm1 , was published over 25 years ago, and since then, many different R genes have been identified and isolated. The encoded proteins have provided clues to the diverse molecular mechanisms underlying immunity. Here, we present a meta-analysis of 314 cloned R genes. The majority of R genes encode cell surface or intracellular receptors, and we distinguish nine molecular mechanisms by which R proteins can elevate or trigger disease resistance: direct (1) or indirect (2) perception of pathogen-derived molecules on the cell surface by receptor-like proteins and receptor-like kinases; direct (3) or indirect (4) intracellular detection of pathogen-derived molecules by nucleotide binding, leucine-rich repeat receptors, or detection through integrated domains (5); perception of transcription activator-like effectors through activation of executor genes (6); and active (7), passive (8), or host reprogramming-mediated (9) loss of susceptibility. Although the molecular mechanisms underlying the functions of R genes are only understood for a small proportion of known R genes, a clearer understanding of mechanisms is emerging and will be crucial for rational engineering and deployment of novel R genes. © 2018 American Society of Plant Biologists. All rights reserved.

  17. Roles of genes and Alu repeats in nonlinear correlations of HUMHBB DNA sequence

    International Nuclear Information System (INIS)

    Xiao Yi; Huang Yanzhao

    2004-01-01

    DNA sequences of different species and different portion of the DNA of the same species may have completely different correlation properties, but the origin of these correlations is still not very clear and is currently being investigated, especially in different particular cases. We report here a study of the DNA sequence of human beta globin region (HUMHBB) which has strong linear and nonlinear correlations. We studied the roles of two of the typical elements of DNA sequence, genes and Alu repeats, in the nonlinear correlations of HUMHBB. We find that there exist strong nonlinear correlations between the exons or introns in different genes and between the Alu repeats. They may be one of the major sources of the nonlinear correlations in HUMBHB

  18. Structure-Function Analysis of Cf-9, a Receptor-Like Protein with Extracytoplasmic Leucine-Rich Repeats

    NARCIS (Netherlands)

    Hoorn, van der R.A.L.; Wulff, B.B.H.; Rivas, S.; Durrant, M.C.; Ploeg, van der A.; Wit, de P.J.G.M.; Jones, J.D.G.

    2005-01-01

    The tomato (Lycopersicon pimpinellifolium) resistance protein Cf-9 belongs to a large class of plant proteins with extracytoplasmic Leu-rich repeats (eLRRs). eLRR proteins play key roles in plant defense and development, mainly as receptor-like proteins or receptor-like kinases, conferring

  19. In silico reversal of repeat-induced point mutation (RIP identifies the origins of repeat families and uncovers obscured duplicated genes

    Directory of Open Access Journals (Sweden)

    Hane James K

    2010-11-01

    Full Text Available Abstract Background Repeat-induced point mutation (RIP is a fungal genome defence mechanism guarding against transposon invasion. RIP mutates the sequence of repeated DNA and over time renders the affected regions unrecognisable by similarity search tools such as BLAST. Results DeRIP is a new software tool developed to predict the original sequence of a RIP-mutated region prior to the occurrence of RIP. In this study, we apply deRIP to the genome of the wheat pathogen Stagonospora nodorum SN15 and predict the origin of several previously uncharacterised classes of repetitive DNA. Conclusions Five new classes of transposon repeats and four classes of endogenous gene repeats were identified after deRIP. The deRIP process is a new tool for fungal genomics that facilitates the identification and understanding of the role and origin of fungal repetitive DNA. DeRIP is open-source and is available as part of the RIPCAL suite at http://www.sourceforge.net/projects/ripcal.

  20. Identification of TTAGGG-binding proteins in Neurospora crassa, a fungus with vertebrate-like telomere repeats.

    Science.gov (United States)

    Casas-Vila, Núria; Scheibe, Marion; Freiwald, Anja; Kappei, Dennis; Butter, Falk

    2015-11-17

    To date, telomere research in fungi has mainly focused on Saccharomyces cerevisiae and Schizosaccharomyces pombe, despite the fact that both yeasts have degenerated telomeric repeats in contrast to the canonical TTAGGG motif found in vertebrates and also several other fungi. Using label-free quantitative proteomics, we here investigate the telosome of Neurospora crassa, a fungus with canonical telomeric repeats. We show that at least six of the candidates detected in our screen are direct TTAGGG-repeat binding proteins. While three of the direct interactors (NCU03416 [ncTbf1], NCU01991 [ncTbf2] and NCU02182 [ncTay1]) feature the known myb/homeobox DNA interaction domain also found in the vertebrate telomeric factors, we additionally show that a zinc-finger protein (NCU07846) and two proteins without any annotated DNA-binding domain (NCU02644 and NCU05718) are also direct double-strand TTAGGG binders. We further find two single-strand binders (NCU02404 [ncGbp2] and NCU07735 [ncTcg1]). By quantitative label-free interactomics we identify TTAGGG-binding proteins in Neurospora crassa, suggesting candidates for telomeric factors that are supported by phylogenomic comparison with yeast species. Intriguingly, homologs in yeast species with degenerated telomeric repeats are also TTAGGG-binding proteins, e.g. in S. cerevisiae Tbf1 recognizes the TTAGGG motif found in its subtelomeres. However, there is also a subset of proteins that is not conserved. While a rudimentary core TTAGGG-recognition machinery may be conserved across yeast species, our data suggests Neurospora as an emerging model organism with unique features.

  1. Study of simple sequence repeat (SSR) polymorphism for biotic ...

    African Journals Online (AJOL)

    home

    2013-10-02

    Oct 2, 2013 ... G. Siva Kumar1, K. Aruna Kumari1*, Ch. V. Durga Rani1, R. M. Sundaram2, S. Vanisree3, Md. ..... review by Jena and Mackill (2008) provided the list of .... repeat protein and is a member of a resistance gene cluster on rice.

  2. Gene expression profiles of the small intestinal mucosa of dogs repeatedly infected with the cestode Echinococcus multilocularis

    Directory of Open Access Journals (Sweden)

    Hirokazu Kouguchi

    2018-04-01

    Full Text Available The data set presented in this article is related to a previous research article entitled “ The timing of worm exclusion in dogs repeatedly infected with the cestode Echinococcus multilocularis” (Kouguchi et al., 2016 [1]. This article describes the genes >2-fold up- or down-regulated in the first- and repeated-infection groups compared to the healthy controls group. The gene expression profiles were generated using the Agilent-021193 Canine (V2 Gene Expression Microarray (GPL15379. The raw and normalized microarray data have been deposited with the Gene Expression Omnibus (GEO database under accession number GSE105098. Keywords: E. multilocularis, Microarray, Dog, Echinococcosis, Vaccine

  3. Combining random gene fission and rational gene fusion to discover near-infrared fluorescent protein fragments that report on protein-protein interactions.

    Science.gov (United States)

    Pandey, Naresh; Nobles, Christopher L; Zechiedrich, Lynn; Maresso, Anthony W; Silberg, Jonathan J

    2015-05-15

    Gene fission can convert monomeric proteins into two-piece catalysts, reporters, and transcription factors for systems and synthetic biology. However, some proteins can be challenging to fragment without disrupting function, such as near-infrared fluorescent protein (IFP). We describe a directed evolution strategy that can overcome this challenge by randomly fragmenting proteins and concomitantly fusing the protein fragments to pairs of proteins or peptides that associate. We used this method to create libraries that express fragmented IFP as fusions to a pair of associating peptides (IAAL-E3 and IAAL-K3) and proteins (CheA and CheY) and screened for fragmented IFP with detectable near-infrared fluorescence. Thirteen novel fragmented IFPs were identified, all of which arose from backbone fission proximal to the interdomain linker. Either the IAAL-E3 and IAAL-K3 peptides or CheA and CheY proteins could assist with IFP fragment complementation, although the IAAL-E3 and IAAL-K3 peptides consistently yielded higher fluorescence. These results demonstrate how random gene fission can be coupled to rational gene fusion to create libraries enriched in fragmented proteins with AND gate logic that is dependent upon a protein-protein interaction, and they suggest that these near-infrared fluorescent protein fragments will be suitable as reporters for pairs of promoters and protein-protein interactions within whole animals.

  4. Crystal structure of clustered regularly interspaced short palindromic repeats (CRISPR)-associated Csn2 protein revealed Ca2+-dependent double-stranded DNA binding activity.

    Science.gov (United States)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong

    2011-09-02

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 Å tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is ∼26 Å wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an α/β domain and an α-helical domain; significant hinge motion was observed between these two domains. Ca(2+) was located at strategic positions in the oligomerization interface. We further showed that removal of Ca(2+) ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca(2+) ions.

  5. The energy landscapes of repeat-containing proteins: topology, cooperativity, and the folding funnels of one-dimensional architectures.

    Directory of Open Access Journals (Sweden)

    Diego U Ferreiro

    2008-05-01

    Full Text Available Repeat-proteins are made up of near repetitions of 20- to 40-amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi-one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete "domain" (the stability and cooperativity of the repeating array can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (DeltaG(water and the cooperativity of denaturation (m-value, which do not ordinarily apply for globular proteins. We show how the parameters for the "coarse-grained" description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are "poised" at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions.

  6. Automatically identifying gene/protein terms in MEDLINE abstracts.

    Science.gov (United States)

    Yu, Hong; Hatzivassiloglou, Vasileios; Rzhetsky, Andrey; Wilbur, W John

    2002-01-01

    Natural language processing (NLP) techniques are used to extract information automatically from computer-readable literature. In biology, the identification of terms corresponding to biological substances (e.g., genes and proteins) is a necessary step that precedes the application of other NLP systems that extract biological information (e.g., protein-protein interactions, gene regulation events, and biochemical pathways). We have developed GPmarkup (for "gene/protein-full name mark up"), a software system that automatically identifies gene/protein terms (i.e., symbols or full names) in MEDLINE abstracts. As a part of marking up process, we also generated automatically a knowledge source of paired gene/protein symbols and full names (e.g., LARD for lymphocyte associated receptor of death) from MEDLINE. We found that many of the pairs in our knowledge source do not appear in the current GenBank database. Therefore our methods may also be used for automatic lexicon generation. GPmarkup has 73% recall and 93% precision in identifying and marking up gene/protein terms in MEDLINE abstracts. A random sample of gene/protein symbols and full names and a sample set of marked up abstracts can be viewed at http://www.cpmc.columbia.edu/homepages/yuh9001/GPmarkup/. Contact. hy52@columbia.edu. Voice: 212-939-7028; fax: 212-666-0140.

  7. Polymorphisms in leucine-rich repeat genes are associated with autism spectrum disorder susceptibility in populations of European ancestry

    Directory of Open Access Journals (Sweden)

    Sousa Inês

    2010-03-01

    Full Text Available Abstract Background Autism spectrum disorders (ASDs are a group of highly heritable neurodevelopmental disorders which are characteristically comprised of impairments in social interaction, communication and restricted interests/behaviours. Several cell adhesion transmembrane leucine-rich repeat (LRR proteins are highly expressed in the nervous system and are thought to be key regulators of its development. Here we present an association study analysing the roles of four promising candidate genes - LRRTM1 (2p, LRRTM3 (10q, LRRN1 (3p and LRRN3 (7q - in order to identify common genetic risk factors underlying ASDs. Methods In order to gain a better understanding of how the genetic variation within these four gene regions may influence susceptibility to ASDs, a family-based association study was undertaken in 661 families of European ancestry selected from four different ASD cohorts. In addition, a case-control study was undertaken across the four LRR genes, using logistic regression in probands with ASD of each population against 295 ECACC controls. Results Significant results were found for LRRN3 and LRRTM3 (P LRRTM3. Conclusions Overall, our findings implicate the neuronal leucine-rich genes LRRN3 and LRRTM3 in ASD susceptibility.

  8. The leucine-rich repeat structure.

    Science.gov (United States)

    Bella, J; Hindle, K L; McEwan, P A; Lovell, S C

    2008-08-01

    The leucine-rich repeat is a widespread structural motif of 20-30 amino acids with a characteristic repetitive sequence pattern rich in leucines. Leucine-rich repeat domains are built from tandems of two or more repeats and form curved solenoid structures that are particularly suitable for protein-protein interactions. Thousands of protein sequences containing leucine-rich repeats have been identified by automatic annotation methods. Three-dimensional structures of leucine-rich repeat domains determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. As the essential structural principles become well established, the leucine-rich repeat architecture is emerging as an attractive framework for structural prediction and protein engineering. This review presents an update of the current understanding of leucine-rich repeat structure at the primary, secondary, tertiary and quaternary levels and discusses specific examples from recently determined three-dimensional structures.

  9. Rapid bursts of androgen-binding protein (Abp) gene duplication occurred independently in diverse mammals.

    Science.gov (United States)

    Laukaitis, Christina M; Heger, Andreas; Blakley, Tyler D; Munclinger, Pavel; Ponting, Chris P; Karn, Robert C

    2008-02-12

    The draft mouse (Mus musculus) genome sequence revealed an unexpected proliferation of gene duplicates encoding a family of secretoglobin proteins including the androgen-binding protein (ABP) alpha, beta and gamma subunits. Further investigation of 14 alpha-like (Abpa) and 13 beta- or gamma-like (Abpbg) undisrupted gene sequences revealed a rich diversity of developmental stage-, sex- and tissue-specific expression. Despite these studies, our understanding of the evolution of this gene family remains incomplete. Questions arise from imperfections in the initial mouse genome assembly and a dearth of information about the gene family structure in other rodents and mammals. Here, we interrogate the latest 'finished' mouse (Mus musculus) genome sequence assembly to show that the Abp gene repertoire is, in fact, twice as large as reported previously, with 30 Abpa and 34 Abpbg genes and pseudogenes. All of these have arisen since the last common ancestor with rat (Rattus norvegicus). We then demonstrate, by sequencing homologs from species within the Mus genus, that this burst of gene duplication occurred very recently, within the past seven million years. Finally, we survey Abp orthologs in genomes from across the mammalian clade and show that bursts of Abp gene duplications are not specific to the murid rodents; they also occurred recently in the lagomorph (rabbit, Oryctolagus cuniculus) and ruminant (cattle, Bos taurus) lineages, although not in other mammalian taxa. We conclude that Abp genes have undergone repeated bursts of gene duplication and adaptive sequence diversification driven by these genes' participation in chemosensation and/or sexual identification.

  10. Pentapeptide-repeat proteins that act as topoisomerase poison resistance factors have a common dimer interface

    International Nuclear Information System (INIS)

    Vetting, Matthew W.; Hegde, Subray S.; Zhang, Yong; Blanchard, John S.

    2011-01-01

    The pentapeptide repeat protein AlbG, provides self-resistance to the nonribosomally encoded hybrid polyketide-peptide termed albicidin. Analysis of the AlbG three-dimensional structure and the sequences of other pentapeptide repeat proteins that confer resistance to topiosomerase poisons suggests they have a similar dimer interface which may be critical to their interaction with topoisomerases. The protein AlbG is a self-resistance factor against albicidin, a nonribosomally encoded hybrid polyketide-peptide with antibiotic and phytotoxic properties produced by Xanthomonas albilineans. Primary-sequence analysis indicates that AlbG is a member of the pentapeptide-repeat family of proteins (PRP). The structure of AlbG from X. albilineans was determined at 2.0 Å resolution by SAD phasing using data collected from a single trimethyllead acetate derivative on a home source. AlbG folds into a right-handed quadrilateral β-helix composed of approximately eight semi-regular coils. The regularity of the β-helix is blemished by a large loop/deviation in the β-helix between coils 4 and 5. The C-terminus of the β-helix is capped by a dimerization module, yielding a dimer with a 110 Å semi-collinear β-helical axis. This method of dimer formation appears to be common to all PRP proteins that confer resistance to topoisomerase poisons and contrasts with most PRP proteins, which are typically monomeric

  11. Dinucleotide repeat polymorphism in Fms-like tyrosine kinase-1 (Flt-1 gene is not associated with preeclampsia

    Directory of Open Access Journals (Sweden)

    Park So-Yeon

    2008-07-01

    Full Text Available Abstract Background Preeclampsia is a major cause of maternal and perinatal mortality and morbidity. The etiology of preeclampsia remains unclear. Recently, it was shown that misregulation of fms-like tyrosine kinase-1 (Flt-1 in the peripheral blood mononuclear cells of pregnant women results in over-expression of the soluble splice variant of Flt-1, sFlt-1, producing an additional (extra-placental source of sFlt-1 that can contribute to the etiology of preeclampsia. The aim of this study was to investigate the relationship between preeclampsia and a dinucleotide (threonine-glycine; TGn repeat polymorphism in the 3' non-coding region of the Flt-1 gene. Methods The number of the d(TGn repeats was analyzed in 170 patients with preeclampsia and in 202 normotensive pregnancies. The region containing the dinucleotide repeat polymorphism of the Flt-1 gene was amplified by polymerase chain reaction (PCR from the DNA samples and was analyzed by direct PCR sequencing. Results We found 10 alleles of the dinucleotide repeat polymorphism and designated these as allele*12 (A1 through allele*23 (A12 according to the number of the TG repeats, from 12 to 23. The frequency of the 14-repeat allele (A3 was most abundant (63.82% in preeclampsia and 69.06% in controls, followed by the 21-repeat allele (A10; 28.53% in preeclampsia and 23.76% in controls. There was no significant difference in the allele frequency between patients with preeclampsia and normal controls. The most common genotype in preeclamptic and normotensive pregnancies was heterozygous (TG14/(TG21 (41.76% and homozygous (TG14/(TG14 (45.05%, respectively. However, the genotype frequencies were not significantly different between preeclamptic patients and controls. Conclusion This is the first study to characterize the dinucleotide repeat polymorphism of the Flt-1 gene in patients with preeclampsia. We found no differences in the allele or genotype frequencies between patients with preeclampsia and

  12. Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation

    International Nuclear Information System (INIS)

    O'Hara, P.J.; Grant, F.J.; Haldeman, B.A.; Gray, C.L.; Insley, M.Y.; Hagen, F.S.; Murray, M.J.

    1987-01-01

    Activated factor VII (factor VIIa) is a vitamin K-dependent plasma serine protease that participates in a cascade of reactions leading to the coagulation of blood. Two overlapping genomic clones containing sequences encoding human factor VII were isolated and characterized. The complete sequence of the gene was determined and found to span about 12.8 kilobases. The mRNA for factor VII as demonstrated by cDNA cloning is polyadenylylated at multiple sites but contains only one AAUAAA poly(A) signal sequence. The mRNA can undergo alternative splicing, forming one transcript containing eight segments as exons and another with an additional exon that encodes a larger prepro leader sequence. The latter transcript has no known counterpart in the other vitamin K-dependent proteins. The positions of the introns with respect to the amino acid sequence encoded by the eight essential exons of factor VII are the same as those present in factor IX, factor X, protein C, and the first three exons of prothrombin. These exons code for domains generally conserved among members of this gene family. The comparable introns in these genes, however, are dissimilar with respect to size and sequence, with the exception of intron C in factor VII and protein C. The gene for factor VII also contains five regions made up of tandem repeats of oligonucleotide monomer elements. More than a quarter of the intron sequences and more than a third of the 3' untranslated portion of the mRNA transcript consist of these minisatellite tandem repeats

  13. TRDistiller: a rapid filter for enrichment of sequence datasets with proteins containing tandem repeats.

    Science.gov (United States)

    Richard, François D; Kajava, Andrey V

    2014-06-01

    The dramatic growth of sequencing data evokes an urgent need to improve bioinformatics tools for large-scale proteome analysis. Over the last two decades, the foremost efforts of computer scientists were devoted to proteins with aperiodic sequences having globular 3D structures. However, a large portion of proteins contain periodic sequences representing arrays of repeats that are directly adjacent to each other (so called tandem repeats or TRs). These proteins frequently fold into elongated fibrous structures carrying different fundamental functions. Algorithms specific to the analysis of these regions are urgently required since the conventional approaches developed for globular domains have had limited success when applied to the TR regions. The protein TRs are frequently not perfect, containing a number of mutations, and some of them cannot be easily identified. To detect such "hidden" repeats several algorithms have been developed. However, the most sensitive among them are time-consuming and, therefore, inappropriate for large scale proteome analysis. To speed up the TR detection we developed a rapid filter that is based on the comparison of composition and order of short strings in the adjacent sequence motifs. Tests show that our filter discards up to 22.5% of proteins which are known to be without TRs while keeping almost all (99.2%) TR-containing sequences. Thus, we are able to decrease the size of the initial sequence dataset enriching it with TR-containing proteins which allows a faster subsequent TR detection by other methods. The program is available upon request. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Exceptionally long 5' UTR short tandem repeats specifically linked to primates.

    Science.gov (United States)

    Namdar-Aligoodarzi, P; Mohammadparast, S; Zaker-Kandjani, B; Talebi Kakroodi, S; Jafari Vesiehsari, M; Ohadi, M

    2015-09-10

    We have previously reported genome-scale short tandem repeats (STRs) in the core promoter interval (i.e. -120 to +1 to the transcription start site) of protein-coding genes that have evolved identically in primates vs. non-primates. Those STRs may function as evolutionary switch codes for primate speciation. In the current study, we used the Ensembl database to analyze the 5' untranslated region (5' UTR) between +1 and +60 of the transcription start site of the entire human protein-coding genes annotated in the GeneCards database, in order to identify "exceptionally long" STRs (≥5-repeats), which may be of selective/adaptive advantage. The importance of this critical interval is its function as core promoter, and its effect on transcription and translation. In order to minimize ascertainment bias, we analyzed the evolutionary status of the human 5' UTR STRs of ≥5-repeats in several species encompassing six major orders and superorders across mammals, including primates, rodents, Scandentia, Laurasiatheria, Afrotheria, and Xenarthra. We introduce primate-specific STRs, and STRs which have expanded from mouse to primates. Identical co-occurrence of the identified STRs of rare average frequency between 0.006 and 0.0001 in primates supports a role for those motifs in processes that diverged primates from other mammals, such as neuronal differentiation (e.g. APOD and FGF4), and craniofacial development (e.g. FILIP1L). A number of the identified STRs of ≥5-repeats may be human-specific (e.g. ZMYM3 and DAZAP1). Future work is warranted to examine the importance of the listed genes in primate/human evolution, development, and disease. Copyright © 2015 Elsevier B.V. All rights reserved.

  15. Discovering disease-associated genes in weighted protein-protein interaction networks

    Science.gov (United States)

    Cui, Ying; Cai, Meng; Stanley, H. Eugene

    2018-04-01

    Although there have been many network-based attempts to discover disease-associated genes, most of them have not taken edge weight - which quantifies their relative strength - into consideration. We use connection weights in a protein-protein interaction (PPI) network to locate disease-related genes. We analyze the topological properties of both weighted and unweighted PPI networks and design an improved random forest classifier to distinguish disease genes from non-disease genes. We use a cross-validation test to confirm that weighted networks are better able to discover disease-associated genes than unweighted networks, which indicates that including link weight in the analysis of network properties provides a better model of complex genotype-phenotype associations.

  16. Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

    Science.gov (United States)

    Raynard, Steven J; Baker, Mark D

    2004-01-01

    In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARS enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.

  17. Repeat polymorphisms in the Homo sapiens heme oxygenase-1 gene in diabetic and idiopathic gastroparesis.

    Directory of Open Access Journals (Sweden)

    Simon J Gibbons

    Full Text Available Idiopathic and diabetic gastroparesis in Homo sapiens cause significant morbidity. Etiology or risk factors have not been clearly identified. Failure to sustain elevated heme oxygenase-1 (HO1 expression is associated with delayed gastric emptying in diabetic mice and polymorphisms in the HO1 gene (HMOX1, NCBI Gene ID:3162 are associated with worse outcomes in other diseases.Our hypothesis was that longer polyGT alleles are more common in the HMOX1 genes of individuals with gastroparesis than in controls without upper gastrointestinal motility disorders.Repeat length was determined in genomic DNA. Controls with diabetes (84 type 1, 84 type 2 and without diabetes (n = 170 were compared to diabetic gastroparetics (99 type 1, 72 type 2 and idiopathic gastroparetics (n = 234. Correlations of repeat lengths with clinical symptom sub-scores on the gastroparesis cardinal symptom index (GCSI were done. Statistical analyses of short (32 repeat alleles and differences in allele length were used to test for associations with gastroparesis.The distribution of allele lengths was different between groups (P = 0.016. Allele lengths were longest in type 2 diabetics with gastroparesis (29.18±0.35, mean ± SEM and longer in gastroparetics compared to non-diabetic controls (28.50±0.14 vs 27.64±0.20 GT repeats/allele, P = 0.0008. Type 2 diabetic controls had longer alleles than non-diabetic controls. In all gastroparetic groups, allele lengths were longer in African Americans compared to other racial groups, differences in the proportion of African Americans in the groups accounted for the differences between gastroparetics and controls. Diabetic gastroparetics with 1 or 2 long alleles had worse GCSI nausea sub-scores (3.30±0.23 as compared to those with 0 long alleles (2.66±0.12, P = 0.022.Longer poly-GT repeats in the HMOX1 gene are more common in African Americans with gastroparesis. Nausea symptoms are worse in subjects with longer alleles.

  18. Repeat polymorphisms in the Homo sapiens heme oxygenase-1 gene in diabetic and idiopathic gastroparesis.

    Science.gov (United States)

    Gibbons, Simon J; Grover, Madhusudan; Choi, Kyoung Moo; Wadhwa, Akhilesh; Zubair, Adeel; Wilson, Laura A; Wu, Yanhong; Abell, Thomas L; Hasler, William L; Koch, Kenneth L; McCallum, Richard W; Nguyen, Linda A B; Parkman, Henry P; Sarosiek, Irene; Snape, William J; Tonascia, James; Hamilton, Frank A; Pasricha, Pankaj J; Farrugia, Gianrico

    2017-01-01

    Idiopathic and diabetic gastroparesis in Homo sapiens cause significant morbidity. Etiology or risk factors have not been clearly identified. Failure to sustain elevated heme oxygenase-1 (HO1) expression is associated with delayed gastric emptying in diabetic mice and polymorphisms in the HO1 gene (HMOX1, NCBI Gene ID:3162) are associated with worse outcomes in other diseases. Our hypothesis was that longer polyGT alleles are more common in the HMOX1 genes of individuals with gastroparesis than in controls without upper gastrointestinal motility disorders. Repeat length was determined in genomic DNA. Controls with diabetes (84 type 1, 84 type 2) and without diabetes (n = 170) were compared to diabetic gastroparetics (99 type 1, 72 type 2) and idiopathic gastroparetics (n = 234). Correlations of repeat lengths with clinical symptom sub-scores on the gastroparesis cardinal symptom index (GCSI) were done. Statistical analyses of short (32) repeat alleles and differences in allele length were used to test for associations with gastroparesis. The distribution of allele lengths was different between groups (P = 0.016). Allele lengths were longest in type 2 diabetics with gastroparesis (29.18±0.35, mean ± SEM) and longer in gastroparetics compared to non-diabetic controls (28.50±0.14 vs 27.64±0.20 GT repeats/allele, P = 0.0008). Type 2 diabetic controls had longer alleles than non-diabetic controls. In all gastroparetic groups, allele lengths were longer in African Americans compared to other racial groups, differences in the proportion of African Americans in the groups accounted for the differences between gastroparetics and controls. Diabetic gastroparetics with 1 or 2 long alleles had worse GCSI nausea sub-scores (3.30±0.23) as compared to those with 0 long alleles (2.66±0.12), P = 0.022. Longer poly-GT repeats in the HMOX1 gene are more common in African Americans with gastroparesis. Nausea symptoms are worse in subjects with longer alleles.

  19. Repeat polymorphisms in the Homo sapiens heme oxygenase-1 gene in diabetic and idiopathic gastroparesis

    Science.gov (United States)

    Gibbons, Simon J.; Grover, Madhusudan; Choi, Kyoung Moo; Wadhwa, Akhilesh; Zubair, Adeel; Wilson, Laura A.; Wu, Yanhong; Abell, Thomas L.; Hasler, William L.; Koch, Kenneth L.; McCallum, Richard W.; Nguyen, Linda A. B.; Parkman, Henry P.; Sarosiek, Irene; Snape, William J.; Tonascia, James; Hamilton, Frank A.; Pasricha, Pankaj J.

    2017-01-01

    Background Idiopathic and diabetic gastroparesis in Homo sapiens cause significant morbidity. Etiology or risk factors have not been clearly identified. Failure to sustain elevated heme oxygenase-1 (HO1) expression is associated with delayed gastric emptying in diabetic mice and polymorphisms in the HO1 gene (HMOX1, NCBI Gene ID:3162) are associated with worse outcomes in other diseases. Aim Our hypothesis was that longer polyGT alleles are more common in the HMOX1 genes of individuals with gastroparesis than in controls without upper gastrointestinal motility disorders. Methods Repeat length was determined in genomic DNA. Controls with diabetes (84 type 1, 84 type 2) and without diabetes (n = 170) were compared to diabetic gastroparetics (99 type 1, 72 type 2) and idiopathic gastroparetics (n = 234). Correlations of repeat lengths with clinical symptom sub-scores on the gastroparesis cardinal symptom index (GCSI) were done. Statistical analyses of short (32) repeat alleles and differences in allele length were used to test for associations with gastroparesis. Results The distribution of allele lengths was different between groups (P = 0.016). Allele lengths were longest in type 2 diabetics with gastroparesis (29.18±0.35, mean ± SEM) and longer in gastroparetics compared to non-diabetic controls (28.50±0.14 vs 27.64±0.20 GT repeats/allele, P = 0.0008). Type 2 diabetic controls had longer alleles than non-diabetic controls. In all gastroparetic groups, allele lengths were longer in African Americans compared to other racial groups, differences in the proportion of African Americans in the groups accounted for the differences between gastroparetics and controls. Diabetic gastroparetics with 1 or 2 long alleles had worse GCSI nausea sub-scores (3.30±0.23) as compared to those with 0 long alleles (2.66±0.12), P = 0.022. Conclusions Longer poly-GT repeats in the HMOX1 gene are more common in African Americans with gastroparesis. Nausea symptoms are worse in

  20. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    Directory of Open Access Journals (Sweden)

    Glass John I

    2010-07-01

    Full Text Available Abstract Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT. Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the

  1. Presence of fibronectin-binding protein gene prtF2 in invasive group A streptococci in tropical Australia is associated with increased internalisation efficiency.

    Science.gov (United States)

    Gorton, Davina; Norton, Robert; Layton, Ramon; Smith, Helen; Ketheesan, Natkunam

    2005-03-01

    The fibronectin-binding proteins (FnBPs) PrtF1 and PrtF2 are considered to be major group A streptococcal virulence factors, mediating adherence to and internalisation of host cells. The present study investigated an association between the presence of prtF1 and prtF2 genes and internalisation efficiency in group A streptococci (GAS) isolated from patients with invasive disease. Of the 80 isolates tested, 58 (73%) had prtF1 and 71 (89%) possessed prtF2. Three isolates (4%) had neither gene, seven (9%) had prtF1 only, 19 (24%) had prtF2 only and 51 isolates (64%) had both prtF1 and prtF2. prtF2-positive isolates internalised up to three times more efficiently than isolates that had prtF1 alone (Pinternalisation efficiency and presence of the prtF1 gene. Analysis of the fibronectin-binding repeat domain (FBRD) of prtF2 revealed that this gene can contain 2, 3, 4 or 5 repeat regions and that five repeat regions conferred very high internalisation efficiency in invasive GAS isolates.

  2. Invited review: Frontotemporal dementia caused by microtubule-associated protein tau gene (MAPT) mutations: a chameleon for neuropathology and neuroimaging.

    Science.gov (United States)

    Ghetti, B; Oblak, A L; Boeve, B F; Johnson, K A; Dickerson, B C; Goedert, M

    2015-02-01

    Hereditary frontotemporal dementia associated with mutations in the microtubule-associated protein tau gene (MAPT) is a protean disorder. Three neuropathologic subtypes can be recognized, based on the presence of inclusions made of tau isoforms with three and four repeats, predominantly three repeats and mostly four repeats. This is relevant for establishing a correlation between structural magnetic resonance imaging and positron emission tomography using tracers specific for aggregated tau. Longitudinal studies will be essential to determine the evolution of anatomical alterations from the asymptomatic stage to the various phases of disease following the onset of symptoms. © 2014 The Authors. Neuropathology and Applied Neurobiology published by John Wiley & Sons Ltd on behalf of British Neuropathological Society.

  3. Molecular cloning, genomic organization, developmental regulation, and a knock-out mutant of a novel leu-rich repeats-containing G protein-coupled receptor (DLGR-2) from Drosophila melanogaster

    DEFF Research Database (Denmark)

    Eriksen, Kathrine Krageskov; Hauser, Frank; Schiøtt, Morten

    2000-01-01

    After screening the Berkeley Drosophila Genome Project database with sequences from a recently characterized Leu-rich repeats-containing G protein-coupled receptor (LGR) fromDrosophila (DLGR-1), we identified a second gene for a different LGR (DLGR-2) and cloned its cDNA. DLGR-2 is 1360 amino aci...... knock-out mutants, where the DLGR-2 gene is interrupted by a P element insertion, die around the time of hatching. This finding, together with the expression data, strongly suggests that DLGR-2 is exclusively involved in development....

  4. Crystal Structure of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated Csn2 Protein Revealed Ca[superscript 2+]-dependent Double-stranded DNA Binding Activity

    Energy Technology Data Exchange (ETDEWEB)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong (Cornell); (NWU)

    2012-05-22

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 {angstrom} tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is {approx}26 {angstrom} wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an {alpha}/{beta} domain and an {alpha}-helical domain; significant hinge motion was observed between these two domains. Ca{sup 2+} was located at strategic positions in the oligomerization interface. We further showed that removal of Ca{sup 2+} ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca{sup 2+} ions.

  5. t2prhd: a tool to study the patterns of repeat evolution

    Directory of Open Access Journals (Sweden)

    Pénzes Zsolt

    2008-01-01

    Full Text Available Abstract Background The models developed to characterize the evolution of multigene families (such as the birth-and-death and the concerted models have also been applied on the level of sequence repeats inside a gene/protein. Phylogenetic reconstruction is the method of choice to study the evolution of gene families and also sequence repeats in the light of these models. The characterization of the gene family evolution in view of the evolutionary models is done by the evaluation of the clustering of the sequences with the originating loci in mind. As the locus represents positional information, it is straightforward that in the case of the repeats the exact position in the sequence should be used, as the simple numbering according to repeat order can be misleading. Results We have developed a novel rapid visual approach to study repeat evolution, that takes into account the exact repeat position in a sequence. The "pairwise repeat homology diagram" visualizes sequence repeats detected by a profile HMM in a pair of sequences and highlights their homology relations inferred by a phylogenetic tree. The method is implemented in a Perl script (t2prhd available for downloading at http://t2prhd.sourceforge.net and is also accessible as an online tool at http://t2prhd.brc.hu. The power of the method is demonstrated on the EGF-like and fibronectin-III-like (Fn-III domain repeats of three selected mammalian Tenascin sequences. Conclusion Although pairwise repeat homology diagrams do not carry all the information provided by the phylogenetic tree, they allow a rapid and intuitive assessment of repeat evolution. We believe, that t2prhd is a helpful tool with which to study the pattern of repeat evolution. This method can be particularly useful in cases of large datasets (such as large gene families, as the command line interface makes it possible to automate the generation of pairwise repeat homology diagrams with the aid of scripts.

  6. Protein Annotation from Protein Interaction Networks and Gene Ontology

    OpenAIRE

    Nguyen, Cao D.; Gardiner, Katheleen J.; Cios, Krzysztof J.

    2011-01-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precis...

  7. Analysis of gene and protein name synonyms in Entrez Gene and UniProtKB resources

    KAUST Repository

    Arkasosy, Basil

    2013-01-01

    be ambiguous, referring in some cases to more than one gene or one protein, or in others, to both genes and proteins at the same time. Public biological databases give a very useful insight about genes and proteins information, including their names

  8. [HMGA proteins and their genes as a potential neoplastic biomarkers].

    Science.gov (United States)

    Balcerczak, Ewa; Balcerczak, Mariusz; Mirowski, Marek

    2005-01-01

    HMGA proteins and their genes are described in this article. HMGA proteins reveal ability to bind DNA in AT-rich regions, which are characteristic for gene promoter sequences. This interaction lead to gene silencing or their overexpression. In normal tissue HMGA proteins level is low or even undetectable. During embriogenesis their level is increasing. High HMGA proteins level is characteristic for tumor phenotype of spontaneous and experimental malignant neoplasms. High HMGA proteins expression correlate with bad prognostic factors and with metastases formation. HMGA genes expression can be used as a marker of tumor progression. Present studies connected with tumor gene therapy based on HMGA proteins sythesis inhibition by the use of viral vectors containing gene encoding these proteins in antisence orientation, as well as a new potential anticancer drugs acting as crosslinkers between DNA and HMGA proteins suggest their usefulness as a targets in cancer therapy.

  9. Analysis of the AHR gene proximal promoter GGGGC-repeat polymorphism in lung, breast, and colon cancer

    International Nuclear Information System (INIS)

    Spink, Barbara C.; Bloom, Michael S.; Wu, Susan; Sell, Stewart; Schneider, Erasmus; Ding, Xinxin; Spink, David C.

    2015-01-01

    The aryl hydrocarbon receptor (AhR) regulates expression of numerous genes, including those of the CYP1 gene family. With the goal of determining factors that control AHR gene expression, our studies are focused on the role of the short tandem repeat polymorphism, (GGGGC) n , located in the proximal promoter of the human AHR gene. When luciferase constructs containing varying GGGGC repeats were transfected into cancer cell lines derived from the lung, colon, and breast, the number of GGGGC repeats affected AHR promoter activity. The number of GGGGC repeats was determined in DNA from 327 humans and from 38 samples representing 5 species of non-human primates. In chimpanzees and 3 species of macaques, only (GGGGC) 2 alleles were observed; however, in western gorilla, (GGGGC) n alleles with n = 2, 4, 5, 6, 7, and 8 were identified. In all human populations examined, the frequency of (GGGGC) n was n = 4 > 5 ≫ 2, 6. When frequencies of the (GGGGC) n alleles in DNA from patients with lung, colon, or breast cancer were evaluated, the occurrence of (GGGGC) 2 was found to be 8-fold more frequent among lung cancer patients in comparison with its incidence in the general population, as represented by New York State neonates. Analysis of matched tumor and non-tumor DNA samples from the same individuals provided no evidence of microsatellite instability. These studies indicate that the (GGGGC) n short tandem repeats are inherited, and that the (GGGGC) 2 allele in the AHR proximal promoter region should be further investigated with regard to its potential association with lung cancer susceptibility. - Highlights: • The AHR proximal promoter contains a polymorphism, (GGGGC) n , where n = 4 > 5 ≫ 2, 6 • Matched tumor and non-tumor DNA did not show (GGGGC) n microsatellite instability • AHR promoter activity of a construct with (GGGGC) 2 was lower than that of (GGGGC) 4 • The frequency of (GGGGC) 2 in lung cancer patients was 8-fold higher than in neonates • The

  10. Topological and organizational properties of the products of house-keeping and tissue-specific genes in protein-protein interaction networks.

    Science.gov (United States)

    Lin, Wen-Hsien; Liu, Wei-Chung; Hwang, Ming-Jing

    2009-03-11

    Human cells of various tissue types differ greatly in morphology despite having the same set of genetic information. Some genes are expressed in all cell types to perform house-keeping functions, while some are selectively expressed to perform tissue-specific functions. In this study, we wished to elucidate how proteins encoded by human house-keeping genes and tissue-specific genes are organized in human protein-protein interaction networks. We constructed protein-protein interaction networks for different tissue types using two gene expression datasets and one protein-protein interaction database. We then calculated three network indices of topological importance, the degree, closeness, and betweenness centralities, to measure the network position of proteins encoded by house-keeping and tissue-specific genes, and quantified their local connectivity structure. Compared to a random selection of proteins, house-keeping gene-encoded proteins tended to have a greater number of directly interacting neighbors and occupy network positions in several shortest paths of interaction between protein pairs, whereas tissue-specific gene-encoded proteins did not. In addition, house-keeping gene-encoded proteins tended to connect with other house-keeping gene-encoded proteins in all tissue types, whereas tissue-specific gene-encoded proteins also tended to connect with other tissue-specific gene-encoded proteins, but only in approximately half of the tissue types examined. Our analysis showed that house-keeping gene-encoded proteins tend to occupy important network positions, while those encoded by tissue-specific genes do not. The biological implications of our findings were discussed and we proposed a hypothesis regarding how cells organize their protein tools in protein-protein interaction networks. Our results led us to speculate that house-keeping gene-encoded proteins might form a core in human protein-protein interaction networks, while clusters of tissue-specific gene

  11. PRELP (proline/arginine-rich end leucine-rich repeat protein) promotes osteoblastic differentiation of preosteoblastic MC3T3-E1 cells by regulating the β-catenin pathway

    Energy Technology Data Exchange (ETDEWEB)

    Li, Haiying; Cui, Yazhou; Luan, Jing [School of Medicine and Life Sciences, University of Jinan-Shandong Academy of Medical Science, Ji' nan, Shandong (China); Key Laboratory for Rare Disease Research of Shandong Province, Key Laboratory for Biotech Drugs of the Ministry of Health, Shandong Medical Biotechnological Center, Shandong Academy of Medical Sciences, Ji' nan, Shandong (China); Zhang, Xiumei [School of Medicine and Life Sciences, University of Jinan-Shandong Academy of Medical Science, Ji' nan, Shandong (China); Li, Chengzhi; Zhou, Xiaoyan; Shi, Liang [School of Medicine and Life Sciences, University of Jinan-Shandong Academy of Medical Science, Ji' nan, Shandong (China); Key Laboratory for Rare Disease Research of Shandong Province, Key Laboratory for Biotech Drugs of the Ministry of Health, Shandong Medical Biotechnological Center, Shandong Academy of Medical Sciences, Ji' nan, Shandong (China); Wang, Huaxin [Shandong University of Traditional Chinese Medicine, Ji' an, Shandong (China); Han, Jinxiang, E-mail: jxhan9888@aliyun.com [School of Medicine and Life Sciences, University of Jinan-Shandong Academy of Medical Science, Ji' nan, Shandong (China); Key Laboratory for Rare Disease Research of Shandong Province, Key Laboratory for Biotech Drugs of the Ministry of Health, Shandong Medical Biotechnological Center, Shandong Academy of Medical Sciences, Ji' nan, Shandong (China)

    2016-02-12

    Proline/arginine-rich end leucine-rich repeat protein (PRELP) is a collagen-binding proteoglycan highly expressed in the developing bones. Recent studies indicated that PRELP could inhibit osteoclastogenesis as a NF-κB inhibitor. However, its role during osteoblast differentiation is still unclear. In this study, we confirmed that the expression of PRELP increased with the osteogenesis induction of preosteoblastic MC3T3-E1 cells. Down-regulation of PRELP expression by shRNA reduced ALP activity, mineralization and expression of osteogenic marker gene Runx2. Our microarray analysis data suggested that β-catenin may act as a hub gene in the PRELP-mediated gene network. We validated furtherly that PRELP knockdown could inhibit the level of connexin43, a key regulator of osteoblast differentiation by affecting β-catenin protein expression, and its nuclear translocation in MC3T3-E1 preosteoblasts. Therefore, this study established a new role of PRELP in modulating β-catenin/connexin43 pathway and osteoblast differentiation.

  12. Overexpression of MIP2, a novel WD-repeat protein, promotes proliferation of H9c2 cells

    International Nuclear Information System (INIS)

    Wei, Xing; Song, Lan; Jiang, Lei; Wang, Guiliang; Luo, Xinjing; Zhang, Bin; Xiao, Xianzhong

    2010-01-01

    WD40 repeat proteins have a wide range of diverse biological functions including signal transduction, cell cycle regulation, RNA splicing, and transcription. Myocardial ischemic preconditioning up-regulated protein 2 (MIP2) is a novel member of the WD40 repeat proteins superfamily that contains five WD40 repeats. Little is known about its biological role, and the purpose of this study was to determine the role of MIP2 in regulating cellular proliferation. Transfection and constitutive expression of MIP2 in the rat cardiomyoblast cell line H9c2 results in enhanced growth of those cells as measured by cell number and is proportional to the amount of MIP2 expressed. Overexpression of MIP2 results in a shorter cell cycle, as measured by flow cytometry. Collectively, these data suggest that MIP2 may participate in the progression of cell proliferation in H9c2 cells.

  13. Sexy gene conversions: locating gene conversions on the X-chromosome.

    Science.gov (United States)

    Lawson, Mark J; Zhang, Liqing

    2009-08-01

    Gene conversion can have a profound impact on both the short- and long-term evolution of genes and genomes. Here, we examined the gene families that are located on the X-chromosomes of human (Homo sapiens), chimpanzee (Pan troglodytes), mouse (Mus musculus) and rat (Rattus norvegicus) for evidence of gene conversion. We identified seven gene families (WD repeat protein family, Ferritin Heavy Chain family, RAS-related Protein RAB-40 family, Diphosphoinositol polyphosphate phosphohydrolase family, Transcription Elongation Factor A family, LDOC1-related family, Zinc Finger Protein ZIC, and GLI family) that show evidence of gene conversion. Through phylogenetic analyses and synteny evidence, we show that gene conversion has played an important role in the evolution of these gene families and that gene conversion has occurred independently in both primates and rodents. Comparing the results with those of two gene conversion prediction programs (GENECONV and Partimatrix), we found that both GENECONV and Partimatrix have very high false negative rates (i.e. failed to predict gene conversions), which leads to many undetected gene conversions. The combination of phylogenetic analyses with physical synteny evidence exhibits high resolution in the detection of gene conversions.

  14. Identification and characterization of a tandem repeat in exon III of the dopamine receptor D4 (DRD4) gene in cetaceans

    DEFF Research Database (Denmark)

    Mogensen, Line; Kinze, Carl Christian; Werge, Thomas

    2006-01-01

    A large number of mammalian species harbor a tandem repeat in exon III of the gene encoding dopamine receptor D4 (DRD4), a receptor associated with cognitive functions. In this study, a DRD4 gene exon III tandem repeat from the order Cetacea was identified and characterized. Included in our study...

  15. Characterization of upstream sequences of the LIM2 gene that bind developmentally regulated and lens-specific proteins

    Institute of Scientific and Technical Information of China (English)

    HSU Heng; Robert L. CHURCH

    2004-01-01

    During lens development, lens epithelial cells differentiate into fiber cells. To date, four major lens fiber cell intrinsic membrane proteins (MIP) ranging in size from 70 kD to 19 kD have been characterized. The second most abundant lens fiber cell intrinsic membrane protein is MP19. This protein probably is involved with lens cell communication and relates with cataractogenesis. The aim of this research is to characterize upstream sequences of the MP19 (also called LIM2) gene that bind developmentally regulated and lens-specific proteins. We have used the gel mobility assays and corresponding competition experiments to identify and characterize cis elements within approximately 500 bases of LIM2 upstream sequences. Our studies locate the positions of some cis elements, including a "CA" repeat, a methylation Hha I island, an FnuD II site, an Ap1 and an Ap2 consensus sequences, and identify some specific cis elements which relate to lens-specific transcription of LIM2. Our experiments also preliminarily identify trans factors which bind to specific cis elements of the LIM2 promoter and/or regulate transcription of LIM2. We conclude that developmental regulation and coordination of the MP 19 gene in ocular lens fiber cells is controlled by the presence of specific cis elements that bind regulatory trans factors that affect LIM2 gene expression. DNA methylation is one mechanism of controlling LIM2 gene expression during lens development.

  16. Hexanucleotide Repeats in ALS/FTD Form Length-Dependent RNA Foci, Sequester RNA Binding Proteins, and Are Neurotoxic

    Directory of Open Access Journals (Sweden)

    Youn-Bok Lee

    2013-12-01

    Full Text Available The GGGGCC (G4C2 intronic repeat expansion within C9ORF72 is the most common genetic cause of amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD. Intranuclear neuronal RNA foci have been observed in ALS and FTD tissues, suggesting that G4C2 RNA may be toxic. Here, we demonstrate that the expression of 38× and 72× G4C2 repeats form intranuclear RNA foci that initiate apoptotic cell death in neuronal cell lines and zebrafish embryos. The foci colocalize with a subset of RNA binding proteins, including SF2, SC35, and hnRNP-H in transfected cells. Only hnRNP-H binds directly to G4C2 repeats following RNA immunoprecipitation, and only hnRNP-H colocalizes with 70% of G4C2 RNA foci detected in C9ORF72 mutant ALS and FTD brain tissues. We show that expanded G4C2 repeats are potently neurotoxic and bind hnRNP-H and other RNA binding proteins. We propose that RNA toxicity and protein sequestration may disrupt RNA processing and contribute to neurodegeneration.

  17. History, rare, and multiple events of mechanical unfolding of repeat proteins

    Science.gov (United States)

    Sumbul, Fidan; Marchesi, Arin; Rico, Felix

    2018-03-01

    Mechanical unfolding of proteins consisting of repeat domains is an excellent tool to obtain large statistics. Force spectroscopy experiments using atomic force microscopy on proteins presenting multiple domains have revealed that unfolding forces depend on the number of folded domains (history) and have reported intermediate states and rare events. However, the common use of unspecific attachment approaches to pull the protein of interest holds important limitations to study unfolding history and may lead to discarding rare and multiple probing events due to the presence of unspecific adhesion and uncertainty on the pulling site. Site-specific methods that have recently emerged minimize this uncertainty and would be excellent tools to probe unfolding history and rare events. However, detailed characterization of these approaches is required to identify their advantages and limitations. Here, we characterize a site-specific binding approach based on the ultrastable complex dockerin/cohesin III revealing its advantages and limitations to assess the unfolding history and to investigate rare and multiple events during the unfolding of repeated domains. We show that this approach is more robust, reproducible, and provides larger statistics than conventional unspecific methods. We show that the method is optimal to reveal the history of unfolding from the very first domain and to detect rare events, while being more limited to assess intermediate states. Finally, we quantify the forces required to unfold two molecules pulled in parallel, difficult when using unspecific approaches. The proposed method represents a step forward toward more reproducible measurements to probe protein unfolding history and opens the door to systematic probing of rare and multiple molecule unfolding mechanisms.

  18. Stealing the spotlight: CUL4-DDB1 ubiquitin ligase docks WD40-repeat proteins to destroy

    Directory of Open Access Journals (Sweden)

    Zhang Hui

    2007-02-01

    Full Text Available Abstract Recent investigation of Cullin 4 (CUL4 has ushered this class of multiprotein ubiquitin E3 ligases to center stage as critical regulators of diverse processes including cell cycle regulation, developmental patterning, DNA replication, DNA damage and repair, and epigenetic control of gene expression. CUL4 associates with DNA Damage Binding protein 1 (DDB1 to assemble an ubiquitin E3 ligase that targets protein substrates for ubiquitin-dependent proteolysis. CUL4 ligase activity is also regulated by the covalent attachment of the ubiquitin-like protein NEDD8 to CUL4, or neddylation, and the COP9 signalosome complex (CSN that removes this important modification. Recently, multiple WD40-repeat proteins (WDR were found to interact with DDB1 and serve as the substrate-recognition subunits of the CUL4-DDB1 ubiquitin ligase. As more than 150–300 WDR proteins exist in the human genome, these findings impact a wide array of biological processes through CUL4 ligase-mediated proteolysis. Here, we review the recent progress in understanding the mechanism of CUL4 ubiquitin E3 ligase and discuss the architecture of CUL4-assembled E3 ubiquitin ligase complexes by comparison to CUL1-based E3s (SCF. Then, we will review several examples to highlight the critical roles of CUL4 ubiquitin ligase in genome stability, cell cycle regulation, and histone lysine methylation. Together, these studies provide insights into the mechanism of this novel ubiquitin ligase in the regulation of important biological processes.

  19. The Impact of Repeated Freeze-Thaw Cycles on the Quality of Biomolecules in Four Different Tissues.

    Science.gov (United States)

    Ji, Xiaoli; Wang, Min; Li, Lingling; Chen, Fang; Zhang, Yanyang; Li, Qian; Zhou, Junmei

    2017-10-01

    High-quality biosamples are valuable resources for biomedical research. However, some tissues are stored without being sectioned into small aliquots and have to undergo repeated freeze-thaw cycles throughout prolonged experimentation. Little is known regarding the effects of repeated freeze-thaw cycles on the quality of biomolecules in tissues. The aim of this study was to evaluate the impact of repeated freeze-thaw (at room temperature or on ice) cycles on biomolecules and gene expression in four different types of tissues. Each fresh tissue was sectioned into seven aliquots and snap-frozen before undergoing repeated freeze-thaw cycles at room temperature or on ice. Biomolecules were extracted and analyzed. Both relative and absolute quantification were used to detect the changes in gene expression. The results indicated that the impact of repeated freeze-thaw cycles on RNA integrity varied by tissue type. Gene expression, including the housekeeping gene, was affected in RNA-degraded samples according to absolute quantification rather than relative quantification. Furthermore, our results suggest that thawing on ice could protect RNA integrity compared with thawing at room temperature. No obvious degradation of protein or DNA was observed with repeated freeze-thaw cycles either at room temperature or on ice. This research provides ample evidence for the necessity of sectioning fresh tissues into small aliquots before snap-freezing, thus avoiding degradation of RNA and alteration of gene expression resulting from repeated freeze-thaw cycles. For frozen tissue samples that were already in storage and had to be used repeatedly during their lifecycle, thawing on ice or sectioned at ultralow temperature is recommended.

  20. Mechanism of Repeat-Associated MicroRNAs in Fragile X Syndrome

    Directory of Open Access Journals (Sweden)

    Karen Kelley

    2012-01-01

    Full Text Available The majority of the human genome is comprised of non-coding DNA, which frequently contains redundant microsatellite-like trinucleotide repeats. Many of these trinucleotide repeats are involved in triplet repeat expansion diseases (TREDs such as fragile X syndrome (FXS. After transcription, the trinucleotide repeats can fold into RNA hairpins and are further processed by Dicer endoribonuclases to form microRNA (miRNA-like molecules that are capable of triggering targeted gene-silencing effects in the TREDs. However, the function of these repeat-associated miRNAs (ramRNAs is unclear. To solve this question, we identified the first native ramRNA in FXS and successfully developed a transgenic zebrafish model for studying its function. Our studies showed that ramRNA-induced DNA methylation of the FMR1 5′-UTR CGG trinucleotide repeat expansion is responsible for both pathological and neurocognitive characteristics linked to the transcriptional FMR1 gene inactivation and the deficiency of its protein product FMRP. FMRP deficiency often causes synapse deformity in the neurons essential for cognition and memory activities, while FMR1 inactivation augments metabotropic glutamate receptor (mGluR-activated long-term depression (LTD, leading to abnormal neuronal responses in FXS. Using this novel animal model, we may further dissect the etiological mechanisms of TREDs, with the hope of providing insights into new means for therapeutic intervention.

  1. MicroRNAs in CAG trinucleotide repeat expansion disorders: an integrated review of the literature.

    Science.gov (United States)

    Dumitrescu, Laura; Popescu, Bogdan O

    2015-01-01

    MicroRNAs are small RNAs involved in gene silencing. They play important roles in transcriptional regulation and are selectively and abundantly expressed in the central nervous system. A considerable amount of the human genome is comprised of tandem repeating nucleotide streams. Several diseases are caused by above-threshold expansion of certain trinucleotide repeats occurring in a protein-coding or non-coding region. Though monogenic, CAG trinucleotide repeat expansion disorders have a complex pathogenesis, various combinations of multiple coexisting pathways resulting in one common final consequence: selective neurodegeneration. Mutant protein and mutant transcript gain of toxic function are considered to be the core pathogenic mechanisms. The profile of microRNAs in CAG trinucleotide repeat disorders is scarcely described, however microRNA dysregulation has been identified in these diseases and microRNA-related intereference with gene expression is considered to be involved in their pathogenesis. Better understanding of microRNAs functions and means of manipulation promises to offer further insights into the pathogenic pathways of CAG repeat expansion disorders, to point out new potential targets for drug intervention and to provide some of the much needed etiopathogenic therapeutic agents. A number of disease-modifying microRNA silencing strategies are under development, but several implementation impediments still have to be resolved. CAG targeting seems feasible and efficient in animal models and is an appealing approach for clinical practice. Preliminary human trials are just beginning.

  2. Inference of gene-phenotype associations via protein-protein interaction and orthology.

    Directory of Open Access Journals (Sweden)

    Panwen Wang

    Full Text Available One of the fundamental goals of genetics is to understand gene functions and their associated phenotypes. To achieve this goal, in this study we developed a computational algorithm that uses orthology and protein-protein interaction information to infer gene-phenotype associations for multiple species. Furthermore, we developed a web server that provides genome-wide phenotype inference for six species: fly, human, mouse, worm, yeast, and zebrafish. We evaluated our inference method by comparing the inferred results with known gene-phenotype associations. The high Area Under the Curve values suggest a significant performance of our method. By applying our method to two human representative diseases, Type 2 Diabetes and Breast Cancer, we demonstrated that our method is able to identify related Gene Ontology terms and Kyoto Encyclopedia of Genes and Genomes pathways. The web server can be used to infer functions and putative phenotypes of a gene along with the candidate genes of a phenotype, and thus aids in disease candidate gene discovery. Our web server is available at http://jjwanglab.org/PhenoPPIOrth.

  3. Using the 2A Protein Coexpression System: Multicistronic 2A Vectors Expressing Gene(s) of Interest and Reporter Proteins.

    Science.gov (United States)

    Luke, Garry A; Ryan, Martin D

    2018-01-01

    To date, a huge range of different proteins-many with cotranslational and posttranslational subcellular localization signals-have been coexpressed together with various reporter proteins in vitro and in vivo using 2A peptides. The pros and cons of 2A co-expression technology are considered below, followed by a simple example of a "how to" protocol to concatenate multiple genes of interest, together with a reporter gene, into a single gene linked via 2As for easy identification or selection of transduced cells.

  4. A novel germline mutation in the aryl hydrocarbon receptor-interacting protein (AIP) gene in an Italian family with gigantism.

    Science.gov (United States)

    Urbani, C; Russo, D; Raggi, F; Lombardi, M; Sardella, C; Scattina, I; Lupi, I; Manetti, L; Tomisti, L; Marcocci, C; Martino, E; Bogazzi, F

    2014-10-01

    Acromegaly usually occurs as a sporadic disease, but it may be a part of familial pituitary tumor syndromes in rare cases. Germline mutations in the aryl hydrocarbon receptor-interacting protein (AIP) gene have been associated with a predisposition to familial isolated pituitary adenoma. The aim of the present study was to evaluate the AIP gene in a patient with gigantism and in her relatives. Direct sequencing of AIP gene was performed in fourteen members of the family, spanning among three generations. The index case was an 18-year-old woman with gigantism due to an invasive GH-secreting pituitary adenoma and a concomitant tall-cell variant of papillary thyroid carcinoma. A novel germline mutation in the AIP gene (c.685C>T, p.Q229X) was identified in the proband and in two members of her family, who did not present clinical features of acromegaly or other pituitary disorders. Eleven subjects had no mutation in the AIP gene. Two members of the family with clinical features of acromegaly refused either the genetic or the biochemical evaluation. The Q229X mutation was predicted to generate a truncated AIP protein, lacking the last two tetratricopeptide repeat domains and the final C-terminal α-7 helix. We identified a new AIP germline mutation predicted to produce a truncated AIP protein, lacking its biological properties due to the disruption of the C-terminus binding sites for both the chaperones and the client proteins of AIP.

  5. Analysis of gene and protein name synonyms in Entrez Gene and UniProtKB resources

    KAUST Repository

    Arkasosy, Basil

    2013-05-11

    Ambiguity in texts is a well-known problem: words can carry several meanings, and hence, can be read and interpreted differently. This is also true in the biological literature; names of biological concepts, such as genes and proteins, might be ambiguous, referring in some cases to more than one gene or one protein, or in others, to both genes and proteins at the same time. Public biological databases give a very useful insight about genes and proteins information, including their names. In this study, we made a thorough analysis of the nomenclatures of genes and proteins in two data sources and for six different species. We developed an automated process that parses, extracts, processes and stores information available in two major biological databases: Entrez Gene and UniProtKB. We analysed gene and protein synonyms, their types, frequencies, and the ambiguities within a species, in between data sources and cross-species. We found that at least 40% of the cross-species ambiguities are caused by names that are already ambiguous within the species. Our study shows that from the six species we analysed (Homo Sapiens, Mus Musculus, Arabidopsis Thaliana, Oryza Sativa, Bacillus Subtilis and Pseudomonas Fluorescens), rice (Oriza Sativa) has the best naming model in Entrez Gene database, with low ambiguities between data sources and cross-species.

  6. Association analysis of a highly polymorphic CAG Repeat in the human potassium channel gene KCNN3 and migraine susceptibility

    Directory of Open Access Journals (Sweden)

    Ovcaric Mick

    2005-09-01

    Full Text Available Abstract Background Migraine is a polygenic multifactorial disease, possessing environmental and genetic causative factors with multiple involved genes. Mutations in various ion channel genes are responsible for a number of neurological disorders. KCNN3 is a neuronal small conductance calcium-activated potassium channel gene that contains two polyglutamine tracts, encoded by polymorphic CAG repeats in the gene. This gene plays a critical role in determining the firing pattern of neurons and acts to regulate intracellular calcium channels. Methods The present association study tested whether length variations in the second (more 3' polymorphic CAG repeat in exon 1 of the KCNN3 gene, are involved in susceptibility to migraine with and without aura (MA and MO. In total 423 DNA samples from unrelated individuals, of which 202 consisted of migraine patients and 221 non-migraine controls, were genotyped and analysed using a fluorescence labelled primer set on an ABI310 Genetic Analyzer. Allele frequencies were calculated from observed genotype counts for the KCNN3 polymorphism. Analysis was performed using standard contingency table analysis, incorporating the chi-squared test of independence and CLUMP analysis. Results Overall, there was no convincing evidence that KCNN3 CAG lengths differ between Caucasian migraineurs and controls, with no significant difference in the allelic length distribution of CAG repeats between the population groups (P = 0.090. Also the MA and MO subtypes did not differ significantly between control allelic distributions (P > 0.05. The prevalence of the long CAG repeat (>19 repeats did not reach statistical significance in migraineurs (P = 0.15, nor was there a significant difference between the MA and MO subgroups observed compared to controls (P = 0.46 and P = 0.09, respectively, or between MA vs MO (P = 0.40. Conclusion This association study provides no evidence that length variations of the second polyglutamine array in

  7. Ubiquitin--conserved protein or selfish gene?

    Science.gov (United States)

    Catic, André; Ploegh, Hidde L

    2005-11-01

    The posttranslational modifier ubiquitin is encoded by a multigene family containing three primary members, which yield the precursor protein polyubiquitin and two ubiquitin moieties, Ub(L40) and Ub(S27), that are fused to the ribosomal proteins L40 and S27, respectively. The gene encoding polyubiquitin is highly conserved and, until now, those encoding Ub(L40) and Ub(S27) have been generally considered to be equally invariant. The evolution of the ribosomal ubiquitin moieties is, however, proving to be more dynamic. It seems that the genes encoding Ub(L40) and Ub(S27) are actively maintained by homologous recombination with the invariant polyubiquitin locus. Failure to recombine leads to deterioration of the sequence of the ribosomal ubiquitin moieties in several phyla, although this deterioration is evidently constrained by the structural requirements of the ubiquitin fold. Only a few amino acids in ubiquitin are vital for its function, and we propose that conservation of all three ubiquitin genes is driven not only by functional properties of the ubiquitin protein, but also by the propensity of the polyubiquitin locus to act as a 'selfish gene'.

  8. PURA, the gene encoding Pur-alpha, member of an ancient nucleic acid-binding protein family with mammalian neurological functions.

    Science.gov (United States)

    Daniel, Dianne C; Johnson, Edward M

    2018-02-15

    The PURA gene encodes Pur-alpha, a 322 amino acid protein with repeated nucleic acid binding domains that are highly conserved from bacteria through humans. PUR genes with a single copy of this domain have been detected so far in spirochetes and bacteroides. Lower eukaryotes possess one copy of the PUR gene, whereas chordates possess 1 to 4 PUR family members. Human PUR genes encode Pur-alpha (Pura), Pur-beta (Purb) and two forms of Pur-gamma (Purg). Pur-alpha is a protein that binds specific DNA and RNA sequence elements. Human PURA, located at chromosome band 5q31, is under complex control of three promoters. The entire protein coding sequence of PURA is contiguous within a single exon. Several studies have found that overexpression or microinjection of Pura inhibits anchorage-independent growth of oncogenically transformed cells and blocks proliferation at either G1-S or G2-M checkpoints. Effects on the cell cycle may be mediated by interaction of Pura with cellular proteins including Cyclin/Cdk complexes and the Rb tumor suppressor protein. PURA knockout mice die shortly after birth with effects on brain and hematopoietic development. In humans environmentally induced heterozygous deletions of PURA have been implicated in forms of myelodysplastic syndrome and progression to acute myelogenous leukemia. Pura plays a role in AIDS through association with the HIV-1 protein, Tat. In the brain Tat and Pura association in glial cells activates transcription and replication of JC polyomavirus, the agent causing the demyelination disease, progressive multifocal leukoencephalopathy. Tat and Pura also act to stimulate replication of the HIV-1 RNA genome. In neurons Pura accompanies mRNA transcripts to sites of translation in dendrites. Microdeletions in the PURA locus have been implicated in several neurological disorders. De novo PURA mutations have been related to a spectrum of phenotypes indicating a potential PURA syndrome. The nucleic acid, G-rich Pura binding

  9. Protein annotation from protein interaction networks and Gene Ontology.

    Science.gov (United States)

    Nguyen, Cao D; Gardiner, Katheleen J; Cios, Krzysztof J

    2011-10-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precision and 60% recall versus 45% and 26% for Majority and 24% and 61% for χ²-statistics, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.

  10. Amino acid sequence analysis of the annexin super-gene family of proteins.

    Science.gov (United States)

    Barton, G J; Newman, R H; Freemont, P S; Crumpton, M J

    1991-06-15

    The annexins are a widespread family of calcium-dependent membrane-binding proteins. No common function has been identified for the family and, until recently, no crystallographic data existed for an annexin. In this paper we draw together 22 available annexin sequences consisting of 88 similar repeat units, and apply the techniques of multiple sequence alignment, pattern matching, secondary structure prediction and conservation analysis to the characterisation of the molecules. The analysis clearly shows that the repeats cluster into four distinct families and that greatest variation occurs within the repeat 3 units. Multiple alignment of the 88 repeats shows amino acids with conserved physicochemical properties at 22 positions, with only Gly at position 23 being absolutely conserved in all repeats. Secondary structure prediction techniques identify five conserved helices in each repeat unit and patterns of conserved hydrophobic amino acids are consistent with one face of a helix packing against the protein core in predicted helices a, c, d, e. Helix b is generally hydrophobic in all repeats, but contains a striking pattern of repeat-specific residue conservation at position 31, with Arg in repeats 4 and Glu in repeats 2, but unconserved amino acids in repeats 1 and 3. This suggests repeats 2 and 4 may interact via a buried saltbridge. The loop between predicted helices a and b of repeat 3 shows features distinct from the equivalent loop in repeats 1, 2 and 4, suggesting an important structural and/or functional role for this region. No compelling evidence emerges from this study for uteroglobin and the annexins sharing similar tertiary structures, or for uteroglobin representing a derivative of a primordial one-repeat structure that underwent duplication to give the present day annexins. The analyses performed in this paper are re-evaluated in the Appendix, in the light of the recently published X-ray structure for human annexin V. The structure confirms most of

  11. Analysis of the AHR gene proximal promoter GGGGC-repeat polymorphism in lung, breast, and colon cancer

    Energy Technology Data Exchange (ETDEWEB)

    Spink, Barbara C. [Wadsworth Center, New York State Department of Health, Albany, NY 12201 (United States); Bloom, Michael S. [Department of Environmental Health Sciences, School of Public Health, University at Albany, State University of New York, Albany, NY 12201 (United States); Wu, Susan [Wadsworth Center, New York State Department of Health, Albany, NY 12201 (United States); Sell, Stewart; Schneider, Erasmus [Wadsworth Center, New York State Department of Health, Albany, NY 12201 (United States); Department of Biomedical Sciences, School of Public Health, University at Albany, State University of New York, Albany, NY 12201 (United States); Ding, Xinxin [Wadsworth Center, New York State Department of Health, Albany, NY 12201 (United States); Department of Environmental Health Sciences, School of Public Health, University at Albany, State University of New York, Albany, NY 12201 (United States); Department of Biomedical Sciences, School of Public Health, University at Albany, State University of New York, Albany, NY 12201 (United States); Spink, David C., E-mail: spink@wadsworth.org [Wadsworth Center, New York State Department of Health, Albany, NY 12201 (United States); Department of Environmental Health Sciences, School of Public Health, University at Albany, State University of New York, Albany, NY 12201 (United States)

    2015-01-01

    The aryl hydrocarbon receptor (AhR) regulates expression of numerous genes, including those of the CYP1 gene family. With the goal of determining factors that control AHR gene expression, our studies are focused on the role of the short tandem repeat polymorphism, (GGGGC){sub n}, located in the proximal promoter of the human AHR gene. When luciferase constructs containing varying GGGGC repeats were transfected into cancer cell lines derived from the lung, colon, and breast, the number of GGGGC repeats affected AHR promoter activity. The number of GGGGC repeats was determined in DNA from 327 humans and from 38 samples representing 5 species of non-human primates. In chimpanzees and 3 species of macaques, only (GGGGC){sub 2} alleles were observed; however, in western gorilla, (GGGGC){sub n} alleles with n = 2, 4, 5, 6, 7, and 8 were identified. In all human populations examined, the frequency of (GGGGC){sub n} was n = 4 > 5 ≫ 2, 6. When frequencies of the (GGGGC){sub n} alleles in DNA from patients with lung, colon, or breast cancer were evaluated, the occurrence of (GGGGC){sub 2} was found to be 8-fold more frequent among lung cancer patients in comparison with its incidence in the general population, as represented by New York State neonates. Analysis of matched tumor and non-tumor DNA samples from the same individuals provided no evidence of microsatellite instability. These studies indicate that the (GGGGC){sub n} short tandem repeats are inherited, and that the (GGGGC){sub 2} allele in the AHR proximal promoter region should be further investigated with regard to its potential association with lung cancer susceptibility. - Highlights: • The AHR proximal promoter contains a polymorphism, (GGGGC){sub n}, where n = 4 > 5 ≫ 2, 6 • Matched tumor and non-tumor DNA did not show (GGGGC){sub n} microsatellite instability • AHR promoter activity of a construct with (GGGGC){sub 2} was lower than that of (GGGGC){sub 4} • The frequency of (GGGGC){sub 2} in lung

  12. Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

    Science.gov (United States)

    Wang, Ying; Ding, Jun; Daniell, Henry; Hu, Haiyan; Li, Xiaoman

    2012-09-01

    Chloroplasts play critical roles in land plant cells. Despite their importance and the availability of at least 200 sequenced chloroplast genomes, the number of known DNA regulatory sequences in chloroplast genomes are limited. In this paper, we designed computational methods to systematically study putative DNA regulatory sequences in intergenic regions near chloroplast genes in seven plant species and in promoter sequences of nuclear genes in Arabidopsis and rice. We found that -35/-10 elements alone cannot explain the transcriptional regulation of chloroplast genes. We also concluded that there are unlikely motifs shared by intergenic sequences of most of chloroplast genes, indicating that these genes are regulated differently. Finally and surprisingly, we found five conserved motifs, each of which occurs in no more than six chloroplast intergenic sequences, are significantly shared by promoters of nuclear-genes encoding chloroplast proteins. By integrating information from gene function annotation, protein subcellular localization analyses, protein-protein interaction data, and gene expression data, we further showed support of the functionality of these conserved motifs. Our study implies the existence of unknown nuclear-encoded transcription factors that regulate both chloroplast genes and nuclear genes encoding chloroplast protein, which sheds light on the understanding of the transcriptional regulation of chloroplast genes.

  13. Bph32, a novel gene encoding an unknown SCR domain-containing protein, confers resistance against the brown planthopper in rice

    Science.gov (United States)

    Ren, Juansheng; Gao, Fangyuan; Wu, Xianting; Lu, Xianjun; Zeng, Lihua; Lv, Jianqun; Su, Xiangwen; Luo, Hong; Ren, Guangjun

    2016-01-01

    An urgent need exists to identify more brown planthopper (Nilaparvata lugens Stål, BPH) resistance genes, which will allow the development of rice varieties with resistance to BPH to counteract the increased incidence of this pest species. Here, using bioinformatics and DNA sequencing approaches, we identified a novel BPH resistance gene, LOC_Os06g03240 (MSU LOCUS ID), from the rice variety Ptb33 in the interval between the markers RM19291 and RM8072 on the short arm of chromosome 6, where a gene for resistance to BPH was mapped by Jirapong Jairin et al. and renamed as “Bph32”. This gene encodes a unique short consensus repeat (SCR) domain protein. Sequence comparison revealed that the Bph32 gene shares 100% sequence identity with its allele in Oryza latifolia. The transgenic introgression of Bph32 into a susceptible rice variety significantly improved resistance to BPH. Expression analysis revealed that Bph32 was highly expressed in the leaf sheaths, where BPH primarily settles and feeds, at 2 and 24 h after BPH infestation, suggesting that Bph32 may inhibit feeding in BPH. Western blotting revealed the presence of Pph (Ptb33) and Tph (TN1) proteins using a Penta-His antibody, and both proteins were insoluble. This study provides information regarding a valuable gene for rice defence against insect pests. PMID:27876888

  14. Bph32, a novel gene encoding an unknown SCR domain-containing protein, confers resistance against the brown planthopper in rice.

    Science.gov (United States)

    Ren, Juansheng; Gao, Fangyuan; Wu, Xianting; Lu, Xianjun; Zeng, Lihua; Lv, Jianqun; Su, Xiangwen; Luo, Hong; Ren, Guangjun

    2016-11-23

    An urgent need exists to identify more brown planthopper (Nilaparvata lugens Stål, BPH) resistance genes, which will allow the development of rice varieties with resistance to BPH to counteract the increased incidence of this pest species. Here, using bioinformatics and DNA sequencing approaches, we identified a novel BPH resistance gene, LOC_Os06g03240 (MSU LOCUS ID), from the rice variety Ptb33 in the interval between the markers RM19291 and RM8072 on the short arm of chromosome 6, where a gene for resistance to BPH was mapped by Jirapong Jairin et al. and renamed as "Bph32". This gene encodes a unique short consensus repeat (SCR) domain protein. Sequence comparison revealed that the Bph32 gene shares 100% sequence identity with its allele in Oryza latifolia. The transgenic introgression of Bph32 into a susceptible rice variety significantly improved resistance to BPH. Expression analysis revealed that Bph32 was highly expressed in the leaf sheaths, where BPH primarily settles and feeds, at 2 and 24 h after BPH infestation, suggesting that Bph32 may inhibit feeding in BPH. Western blotting revealed the presence of Pph (Ptb33) and Tph (TN1) proteins using a Penta-His antibody, and both proteins were insoluble. This study provides information regarding a valuable gene for rice defence against insect pests.

  15. Low-Normal FMR1 CGG Repeat Length: Phenotypic Associations

    Directory of Open Access Journals (Sweden)

    Marsha eMailick

    2014-09-01

    Full Text Available This population-based study investigates genotype-phenotype correlations of low-normal CGG repeats in the fragile X mental retardation 1 (FMR1 gene. FMR1 plays an important role in brain development and function, and encodes FMRP (fragile X mental retardation protein, an RNA-binding protein that regulates protein synthesis impacting activity-dependent synaptic development and plasticity. Most past research has focused on CGG premutation expansions (41 to 200 CGG repeats and on fragile X syndrome (200+ CGG repeats, with considerably less attention on the other end of the spectrum of CGG repeats. Using existing data, older adults with 23 or fewer CGG repeats (2 SDs below the mean were compared with age-peers who have normal numbers of CGGs (24-40 with respect to cognition, mental health, cancer, and having children with disabilities. Men (n = 341 with an allele in the low-normal range and women (n = 46 with two low-normal alleles had significantly more difficulty with their memory and ability to solve day to day problems. Women with both FMR1 alleles in the low-normal category had significantly elevated odds of feeling that they need to drink more to get the same effect as in the past. These women also had two and one-half times the odds of having had breast cancer and four times the odds of uterine cancer. Men and women with low-normal CGGs had higher odds of having a child with a disability, either a developmental disability or a mental health condition. These findings are in line with the hypothesis that there is a need for tight neuronal homeostatic control mechanisms for optimal cognitive and behavioral functioning, and more generally that low numbers as well as high numbers of CGG repeats may be problematic for health.

  16. Modular protein switches derived from antibody mimetic proteins.

    Science.gov (United States)

    Nicholes, N; Date, A; Beaujean, P; Hauk, P; Kanwar, M; Ostermeier, M

    2016-02-01

    Protein switches have potential applications as biosensors and selective protein therapeutics. Protein switches built by fusion of proteins with the prerequisite input and output functions are currently developed using an ad hoc process. A modular switch platform in which existing switches could be readily adapted to respond to any ligand would be advantageous. We investigated the feasibility of a modular protein switch platform based on fusions of the enzyme TEM-1 β-lactamase (BLA) with two different antibody mimetic proteins: designed ankyrin repeat proteins (DARPins) and monobodies. We created libraries of random insertions of the gene encoding BLA into genes encoding a DARPin or a monobody designed to bind maltose-binding protein (MBP). From these libraries, we used a genetic selection system for β-lactamase activity to identify genes that conferred MBP-dependent ampicillin resistance to Escherichia coli. Some of these selected genes encoded switch proteins whose enzymatic activity increased up to 14-fold in the presence of MBP. We next introduced mutations into the antibody mimetic domain of these switches that were known to cause binding to different ligands. To different degrees, introduction of the mutations resulted in switches with the desired specificity, illustrating the potential modularity of these platforms. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. The gene expression and immunohistochemical time-course of diphenylcyclopropenone-induced contact allergy in healthy humans following repeated epicutaneous challenges

    DEFF Research Database (Denmark)

    Mose, Kristian F; Burton, Mark; Thomassen, Mads

    2017-01-01

    DPCP challenges to find the predominant gene expression pattern, (ii) the time-course of cell infiltration following repeated DPCP challenges and (iii) the transcriptome of a repeated CA exposure model. We obtained punch biopsies from control and DPCP-exposed skin from ten DPCP sensitized individuals...

  18. Tumoral Environment Triggers Transcript Anomalies in Established Tumors: Induction of Altered Gene Expression and of Aberrant, Truncated and B2 Repeat-Containing Gene Transcripts

    Directory of Open Access Journals (Sweden)

    Pieter Rottiers

    1999-12-01

    Full Text Available In addition to eugenetic changes, cancerous cells exhibit extensive modifications in the expression levels of a variety of genes. The phenotypic switch observed after inoculation of T lymphoma cells into syngenic mice illustrates the active participation of tumoral environment in the induction of an aberrant gene expression pattern. To further substantiate this contribution, we performed polymerase chain reaction (PCR-based subtraction suppression hybridization (SSH to identify genes that are differentially expressed in tumor-derived EL4/13.3 cells compared to the same cells isolated from cultures. Besides a number of unknown genes, the subtracted library contained several known genes that have been reported to be expressed at increased levels in tumors and/or to contribute to carcinogenesis. Apart from clones representing translated transcripts, the subtracted library also contained a high number of clones representing B2 repeat elements, viz. short interspersed repetitive elements that are transcribed by RNA polymerase III. Northern blotting confirmed the induction of B2 transcripts in tumor tissue and also revealed induction of chimeric, B2 repeat-containing mRNA. The appearance of chimeric transcripts was accompanied by aberrant, shorter-than-full-length transcripts, specifically from upregulated genes. Accordingly, in addition to altered gene expression, tumoral environmental triggers constitute a potent mechanism to create an epigenetic diversity in cancers by inducing extensive transcript anomalies.

  19. Bioinformatics Analysis of NBS-LRR Encoding Resistance Genes in Setaria italica.

    Science.gov (United States)

    Zhao, Yan; Weng, Qiaoyun; Song, Jinhui; Ma, Hailian; Yuan, Jincheng; Dong, Zhiping; Liu, Yinghui

    2016-06-01

    In plants, resistance (R) genes are involved in pathogen recognition and subsequent activation of innate immune responses. The nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes family forms the largest R-gene family among plant genomes and play an important role in plant disease resistance. In this paper, comprehensive analysis of NBS-encoding genes is performed in the whole Setaria italica genome. A total of 96 NBS-LRR genes are identified, and comprehensive overview of the NBS-LRR genes is undertaken, including phylogenetic analysis, chromosome locations, conserved motifs of proteins, and gene expression. Based on the domain, these genes are divided into two groups and distributed in all Setaria italica chromosomes. Most NBS-LRR genes are located at the distal tip of the long arms of the chromosomes. Setaria italica NBS-LRR proteins share at least one nucleotide-biding domain and one leucine-rich repeat domain. Our results also show the duplication of NBS-LRR genes in Setaria italica is related to their gene structure.

  20. The gene expression and immunohistochemical time-course of diphenylcyclopropenone induced contact allergy in healthy humans following repeated epicutaneous challenges

    DEFF Research Database (Denmark)

    Mose, K F; Burton, M; Thomassen, M

    2017-01-01

    challenges with DPCP to find the predominant gene expression pattern, ii) the time-course of cell infiltration following repeated DPCP challenges, and iii) the transcriptome of a repeated CA exposure model. We obtained punch biopsies from control and DPCP exposed skin from ten DPCP sensitized individuals...

  1. A computational model of the LGI1 protein suggests a common binding site for ADAM proteins.

    Directory of Open Access Journals (Sweden)

    Emanuela Leonardi

    Full Text Available Mutations of human leucine-rich glioma inactivated (LGI1 gene encoding the epitempin protein cause autosomal dominant temporal lateral epilepsy (ADTLE, a rare familial partial epileptic syndrome. The LGI1 gene seems to have a role on the transmission of neuronal messages but the exact molecular mechanism remains unclear. In contrast to other genes involved in epileptic disorders, epitempin shows no homology with known ion channel genes but contains two domains, composed of repeated structural units, known to mediate protein-protein interactions.A three dimensional in silico model of the two epitempin domains was built to predict the structure-function relationship and propose a functional model integrating previous experimental findings. Conserved and electrostatic charged regions of the model surface suggest a possible arrangement between the two domains and identifies a possible ADAM protein binding site in the β-propeller domain and another protein binding site in the leucine-rich repeat domain. The functional model indicates that epitempin could mediate the interaction between proteins localized to different synaptic sides in a static way, by forming a dimer, or in a dynamic way, by binding proteins at different times.The model was also used to predict effects of known disease-causing missense mutations. Most of the variants are predicted to alter protein folding while several other map to functional surface regions. In agreement with experimental evidence, this suggests that non-secreted LGI1 mutants could be retained within the cell by quality control mechanisms or by altering interactions required for the secretion process.

  2. Prevalence of Huntington's disease gene CAG trinucleotide repeat alleles in patients with bipolar disorder.

    Science.gov (United States)

    Ramos, Eliana Marisa; Gillis, Tammy; Mysore, Jayalakshmi S; Lee, Jong-Min; Alonso, Isabel; Gusella, James F; Smoller, Jordan W; Sklar, Pamela; MacDonald, Marcy E; Perlis, Roy H

    2015-06-01

    Huntington's disease is a neurodegenerative disorder characterized by motor, cognitive, and psychiatric symptoms that are caused by huntingtin gene (HTT) CAG trinucleotide repeat alleles of 36 or more units. A greater than expected prevalence of incompletely penetrant HTT CAG repeat alleles observed among individuals diagnosed with major depressive disorder raises the possibility that another mood disorder, bipolar disorder, could likewise be associated with Huntington's disease. We assessed the distribution of HTT CAG repeat alleles in a cohort of individuals with bipolar disorder. HTT CAG allele sizes from 2,229 Caucasian individuals diagnosed with DSM-IV bipolar disorder were compared to allele sizes in 1,828 control individuals from multiple cohorts. We found that HTT CAG repeat alleles > 35 units were observed in only one of 4,458 chromosomes from individuals with bipolar disorder, compared to three of 3,656 chromosomes from control subjects. These findings do not support an association between bipolar disorder and Huntington's disease. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  3. Operon Gene Order Is Optimized for Ordered Protein Complex Assembly

    Science.gov (United States)

    Wells, Jonathan N.; Bergendahl, L. Therese; Marsh, Joseph A.

    2016-01-01

    Summary The assembly of heteromeric protein complexes is an inherently stochastic process in which multiple genes are expressed separately into proteins, which must then somehow find each other within the cell. Here, we considered one of the ways by which prokaryotic organisms have attempted to maximize the efficiency of protein complex assembly: the organization of subunit-encoding genes into operons. Using structure-based assembly predictions, we show that operon gene order has been optimized to match the order in which protein subunits assemble. Exceptions to this are almost entirely highly expressed proteins for which assembly is less stochastic and for which precisely ordered translation offers less benefit. Overall, these results show that ordered protein complex assembly pathways are of significant biological importance and represent a major evolutionary constraint on operon gene organization. PMID:26804901

  4. De novo origin of human protein-coding genes.

    Directory of Open Access Journals (Sweden)

    Dong-Dong Wu

    2011-11-01

    Full Text Available The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA-seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes.

  5. De Novo Origin of Human Protein-Coding Genes

    Science.gov (United States)

    Wu, Dong-Dong; Irwin, David M.; Zhang, Ya-Ping

    2011-01-01

    The de novo origin of a new protein-coding gene from non-coding DNA is considered to be a very rare occurrence in genomes. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee. The functionality of these genes is supported by both transcriptional and proteomic evidence. RNA–seq data indicate that these genes have their highest expression levels in the cerebral cortex and testes, which might suggest that these genes contribute to phenotypic traits that are unique to humans, such as improved cognitive ability. Our results are inconsistent with the traditional view that the de novo origin of new genes is very rare, thus there should be greater appreciation of the importance of the de novo origination of genes. PMID:22102831

  6. A mutation in the Arabidopsis HYL1 gene encoding a dsRNA binding protein affects responses to abscisic acid, auxin, and cytokinin

    Science.gov (United States)

    Lu, C.; Fedoroff, N.

    2000-01-01

    Both physiological and genetic evidence indicate interconnections among plant responses to different hormones. We describe a pleiotropic recessive Arabidopsis transposon insertion mutation, designated hyponastic leaves (hyl1), that alters the plant's responses to several hormones. The mutant is characterized by shorter stature, delayed flowering, leaf hyponasty, reduced fertility, decreased rate of root growth, and an altered root gravitropic response. It also exhibits less sensitivity to auxin and cytokinin and hypersensitivity to abscisic acid (ABA). The auxin transport inhibitor 2,3,5-triiodobenzoic acid normalizes the mutant phenotype somewhat, whereas another auxin transport inhibitor, N-(1-naph-thyl)phthalamic acid, exacerbates the phenotype. The gene, designated HYL1, encodes a 419-amino acid protein that contains two double-stranded RNA (dsRNA) binding motifs, a nuclear localization motif, and a C-terminal repeat structure suggestive of a protein-protein interaction domain. We present evidence that the HYL1 gene is ABA-regulated and encodes a nuclear dsRNA binding protein. We hypothesize that the HYL1 protein is a regulatory protein functioning at the transcriptional or post-transcriptional level.

  7. Replication Stalling and Heteroduplex Formation within CAG/CTG Trinucleotide Repeats by Mismatch Repair

    KAUST Repository

    Viterbo, David; Michoud, Gregoire; Mosbach, Valentine; Dujon, Bernard; Richard, Guy-Franck

    2016-01-01

    Trinucleotide repeat expansions are responsible for at least two dozen neurological disorders. Mechanisms leading to these large expansions of repeated DNA are still poorly understood. It was proposed that transient stalling of the replication fork by the repeat tract might trigger slippage of the newly-synthesized strand over its template, leading to expansions or contractions of the triplet repeat. However, such mechanism was never formally proven. Here we show that replication fork pausing and CAG/CTG trinucleotide repeat instability are not linked, stable and unstable repeats exhibiting the same propensity to stall replication forks when integrated in a yeast natural chromosome. We found that replication fork stalling was dependent on the integrity of the mismatch-repair system, especially the Msh2p-Msh6p complex, suggesting that direct interaction of MMR proteins with secondary structures formed by trinucleotide repeats in vivo, triggers replication fork pauses. We also show by chromatin immunoprecipitation that Msh2p is enriched at trinucleotide repeat tracts, in both stable and unstable orientations, this enrichment being dependent on MSH3 and MSH6. Finally, we show that overexpressing MSH2 favors the formation of heteroduplex regions, leading to an increase in contractions and expansions of CAG/CTG repeat tracts during replication, these heteroduplexes being dependent on both MSH3 and MSH6. These heteroduplex regions were not detected when a mutant msh2-E768A gene in which the ATPase domain was mutated was overexpressed. Our results unravel two new roles for mismatch-repair proteins: stabilization of heteroduplex regions and transient blocking of replication forks passing through such repeats. Both roles may involve direct interactions between MMR proteins and secondary structures formed by trinucleotide repeat tracts, although indirect interactions may not be formally excluded.

  8. Replication Stalling and Heteroduplex Formation within CAG/CTG Trinucleotide Repeats by Mismatch Repair

    KAUST Repository

    Viterbo, David

    2016-03-16

    Trinucleotide repeat expansions are responsible for at least two dozen neurological disorders. Mechanisms leading to these large expansions of repeated DNA are still poorly understood. It was proposed that transient stalling of the replication fork by the repeat tract might trigger slippage of the newly-synthesized strand over its template, leading to expansions or contractions of the triplet repeat. However, such mechanism was never formally proven. Here we show that replication fork pausing and CAG/CTG trinucleotide repeat instability are not linked, stable and unstable repeats exhibiting the same propensity to stall replication forks when integrated in a yeast natural chromosome. We found that replication fork stalling was dependent on the integrity of the mismatch-repair system, especially the Msh2p-Msh6p complex, suggesting that direct interaction of MMR proteins with secondary structures formed by trinucleotide repeats in vivo, triggers replication fork pauses. We also show by chromatin immunoprecipitation that Msh2p is enriched at trinucleotide repeat tracts, in both stable and unstable orientations, this enrichment being dependent on MSH3 and MSH6. Finally, we show that overexpressing MSH2 favors the formation of heteroduplex regions, leading to an increase in contractions and expansions of CAG/CTG repeat tracts during replication, these heteroduplexes being dependent on both MSH3 and MSH6. These heteroduplex regions were not detected when a mutant msh2-E768A gene in which the ATPase domain was mutated was overexpressed. Our results unravel two new roles for mismatch-repair proteins: stabilization of heteroduplex regions and transient blocking of replication forks passing through such repeats. Both roles may involve direct interactions between MMR proteins and secondary structures formed by trinucleotide repeat tracts, although indirect interactions may not be formally excluded.

  9. The Basic/Helix-Loop-Helix Protein Family in Gossypium: Reference Genes and Their Evolution during Tetraploidization.

    Directory of Open Access Journals (Sweden)

    Qian Yan

    Full Text Available Basic/helix-loop-helix (bHLH proteins comprise one of the largest transcription factor families and play important roles in diverse cellular and molecular processes. Comprehensive analyses of the composition and evolution of the bHLH family in cotton are essential to elucidate their functions and the molecular basis of cotton development. By searching bHLH homologous genes in sequenced diploid cotton genomes (Gossypium raimondii and G. arboreum, a set of cotton bHLH reference genes containing 289 paralogs were identified and named as GobHLH001-289. Based on their phylogenetic relationships, these cotton bHLH proteins were clustered into 27 subfamilies. Compared to those in Arabidopsis and cacao, cotton bHLH proteins generally increased in number, but unevenly in different subfamilies. To further uncover evolutionary changes of bHLH genes during tetraploidization of cotton, all genes of S5a and S5b subfamilies in upland cotton and its diploid progenitors were cloned and compared, and their transcript profiles were determined in upland cotton. A total of 10 genes of S5a and S5b subfamilies (doubled from A- and D-genome progenitors maintained in tetraploid cottons. The major sequence changes in upland cotton included a 15-bp in-frame deletion in GhbHLH130D and a long terminal repeat retrotransposon inserted in GhbHLH062A, which eliminated GhbHLH062A expression in various tissues. The S5a and S5b bHLH genes of A and D genomes (except GobHLH062 showed similar transcription patterns in various tissues including roots, stems, leaves, petals, ovules, and fibers, while the A- and D-genome genes of GobHLH110 and GobHLH130 displayed clearly different transcript profiles during fiber development. In total, this study represented a genome-wide analysis of cotton bHLH family, and revealed significant changes in sequence and expression of these genes in tetraploid cottons, which paved the way for further functional analyses of bHLH genes in the cotton genus.

  10. Interaction between a plasma membrane-localized ankyrin-repeat protein ITN1 and a nuclear protein RTV1

    Energy Technology Data Exchange (ETDEWEB)

    Sakamoto, Hikaru [Department of Bioproduction, Faculty of Bioindustry, Tokyo University of Agriculture, 196 Yasaka, Abashiri-shi, Hokkaido 093-2422 (Japan); Sakata, Keiko; Kusumi, Kensuke [Department of Biology, Faculty of Sciences, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 812-8581 (Japan); Kojima, Mikiko; Sakakibara, Hitoshi [RIKEN Plant Science Center, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045 (Japan); Iba, Koh, E-mail: koibascb@kyushu-u.org [Department of Biology, Faculty of Sciences, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, Fukuoka 812-8581 (Japan)

    2012-06-29

    Highlights: Black-Right-Pointing-Pointer ITN1, a plasma membrane ankyrin protein, interacts with a nuclear DNA-binding protein RTV1. Black-Right-Pointing-Pointer The nuclear transport of RTV1 is partially inhibited by interaction with ITN1. Black-Right-Pointing-Pointer RTV1 can promote the nuclear localization of ITN1. Black-Right-Pointing-Pointer Both overexpression of RTV1 and the lack of ITN1 increase salicylic acids sensitivity in plants. -- Abstract: The increased tolerance to NaCl 1 (ITN1) protein is a plasma membrane (PM)-localized protein involved in responses to NaCl stress in Arabidopsis. The predicted structure of ITN1 is composed of multiple transmembrane regions and an ankyrin-repeat domain that is known to mediate protein-protein interactions. To elucidate the molecular functions of ITN1, we searched for interacting partners using a yeast two-hybrid assay, and a nuclear-localized DNA-binding protein, RTV1, was identified as a candidate. Bimolecular fluorescence complementation analysis revealed that RTV1 interacted with ITN1 at the PM and nuclei in vivo. RTV1 tagged with red fluorescent protein localized to nuclei and ITN1 tagged with green fluorescent protein localized to PM; however, both proteins localized to both nuclei and the PM when co-expressed. These findings suggest that RTV1 and ITN1 regulate the subcellular localization of each other.

  11. A Drosophila gene encoding a protein resembling the human β-amyloid protein precursor

    International Nuclear Information System (INIS)

    Rosen, D.R.; Martin-Morris, L.; Luo, L.; White, K.

    1989-01-01

    The authors have isolated genomic and cDNA clones for a Drosophila gene resembling the human β-amyloid precursor protein (APP). This gene produces a nervous system-enriched 6.5-kilobase transcript. Sequencing of cDNAs derived from the 6.5-kilobase transcript predicts an 886-amino acid polypeptide. This polypeptide contains a putative transmembrane domain and exhibits strong sequence similarity to cytoplasmic and extracellular regions of the human β-amyloid precursor protein. There is a high probability that this Drosophila gene corresponds to the essential Drosophila locus vnd, a gene required for embryonic nervous system development

  12. The protein network surrounding the human telomere repeat binding factors TRF1, TRF2, and POT1.

    Directory of Open Access Journals (Sweden)

    Richard J Giannone

    2010-08-01

    Full Text Available Telomere integrity (including telomere length and capping is critical in overall genomic stability. Telomere repeat binding factors and their associated proteins play vital roles in telomere length regulation and end protection. In this study, we explore the protein network surrounding telomere repeat binding factors, TRF1, TRF2, and POT1 using dual-tag affinity purification in combination with multidimensional protein identification technology liquid chromatography--tandem mass spectrometry (MudPIT LC-MS/MS. After control subtraction and data filtering, we found that TRF2 and POT1 co-purified all six members of the telomere protein complex, while TRF1 identified five of six components at frequencies that lend evidence towards the currently accepted telomere architecture. Many of the known TRF1 or TRF2 interacting proteins were also identified. Moreover, putative associating partners identified for each of the three core components fell into functional categories such as DNA damage repair, ubiquitination, chromosome cohesion, chromatin modification/remodeling, DNA replication, cell cycle and transcription regulation, nucleotide metabolism, RNA processing, and nuclear transport. These putative protein-protein associations may participate in different biological processes at telomeres or, intriguingly, outside telomeres.

  13. Identification of a novel Leucine-rich repeat protein and candidate PP1 regulatory subunit expressed in developing spermatids

    Directory of Open Access Journals (Sweden)

    Sperry Ann O

    2008-01-01

    Full Text Available Abstract Background Spermatogenesis is comprised of a series of highly regulated developmental changes that transform the precursor germ cell into a highly specialized spermatozoon. The last phase of spermatogenesis, termed spermiogenesis, involves dramatic morphological change including formation of the acrosome, elongation and condensation of the nucleus, formation of the flagella, and disposal of unnecessary cytoplasm. A prominent cytoskeletal component of the developing spermatid is the manchette, a unique microtubular structure that surrounds the nucleus of the developing spermatid and is thought to assist in both the reshaping of the nucleus and redistribution of spermatid cytoplasm. Although the molecular motor KIFC1 has been shown to associate with the manchette, its precise role in function of the manchette and the identity of its testis specific protein partners are unknown. The purpose of this study was to identify proteins in the testis that interact with KIFC1 using a yeast 2 hybrid screen of a testis cDNA library. Results Thirty percent of the interacting clones identified in our screen contain an identical cDNA encoding a 40 kD protein. This interacting protein has 4 leucine-rich repeats in its amino terminal half and is expressed primarily in the testis; therefore we have named this protein testis leucine-rich repeat protein or TLRR. TLRR was also found to associate tightly with the KIFC1 targeting domain using affinity chromatography. In addition to the leucine-rich repeats, TLRR contains a consensus-binding site for protein phosphatase-1 (PP1. Immunocytochemistry using a TLRR specific antibody demonstrates that this protein is found near the manchette of developing spermatids. Conclusion We have identified a previously uncharacterized leucine-rich repeat protein that is expressed abundantly in the testis and associates with the manchette of developing spermatids, possibly through its interaction with the KIFC1 molecular motor

  14. Highly Stable Trypsin-Aggregate Coatings on Polymer Nanofibers for Repeated Protein Digestion

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Byoung Chan; Lopez-Ferrer, Daniel; Lee, Sang-mok; Ahn, Hye-kyung; Nair, Sujith; Kim, Seong H.; Kim, Beom S.; Petritis, Konstantinos; Camp, David G.; Grate, Jay W.; Smith, Richard D.; Koo, Yoon-mo; Gu, Man Bock; Kim, Jungbae

    2009-04-01

    A stable and robust trypsin-based biocatalytic system was developed and demonstrated for proteomic applications. The system utilizes polymer nanofibers coated with trypsin aggregates for immobilized protease digestions. After covalently attaching an initial layer of trypsin to the polymer nanofibers, highly concentrated trypsin molecules are crosslinked to the layered trypsin by way of a glutaraldehyde treatment. This new process produced a 300-fold increase in trypsin activity compared with a conventional method for covalent trypsin immobilization and proved to be robust in that it still maintained a high level of activity after a year of repeated recycling. This highly stable form of immobilized trypsin was also resistant to autolysis, enabling repeated digestions of bovine serum albumin over 40 days and successful peptide identification by LC-MS/MS. Finally, the immobilized trypsin was resistant to proteolysis when exposed to other enzymes (i.e. chymotrypsin), which makes it suitable for use in “real-world” proteomic applications. Overall, the biocatalytic nanofibers with enzyme aggregate coatings proved to be an effective approach for repeated and automated protein digestion in proteomic analyses.

  15. Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of protein-protein interactions and gene expression profiling

    Directory of Open Access Journals (Sweden)

    Mo Min

    2008-05-01

    Full Text Available Abstract Background The polyadenylation of mRNA is one of the critical processing steps during expression of almost all eukaryotic genes. It is tightly integrated with transcription, particularly its termination, as well as other RNA processing events, i.e. capping and splicing. The poly(A tail protects the mRNA from unregulated degradation, and it is required for nuclear export and translation initiation. In recent years, it has been demonstrated that the polyadenylation process is also involved in the regulation of gene expression. The polyadenylation process requires two components, the cis-elements on the mRNA and a group of protein factors that recognize the cis-elements and produce the poly(A tail. Here we report a comprehensive pairwise protein-protein interaction mapping and gene expression profiling of the mRNA polyadenylation protein machinery in Arabidopsis. Results By protein sequence homology search using human and yeast polyadenylation factors, we identified 28 proteins that may be components of Arabidopsis polyadenylation machinery. To elucidate the protein network and their functions, we first tested their protein-protein interaction profiles. Out of 320 pair-wise protein-protein interaction assays done using the yeast two-hybrid system, 56 (~17% showed positive interactions. 15 of these interactions were further tested, and all were confirmed by co-immunoprecipitation and/or in vitro co-purification. These interactions organize into three distinct hubs involving the Arabidopsis polyadenylation factors. These hubs are centered around AtCPSF100, AtCLPS, and AtFIPS. The first two are similar to complexes seen in mammals, while the third one stands out as unique to plants. When comparing the gene expression profiles extracted from publicly available microarray datasets, some of the polyadenylation related genes showed tissue-specific expression, suggestive of potential different polyadenylation complex configurations. Conclusion An

  16. Memory responses of jasmonic acid-associated Arabidopsis genes to a repeated dehydration stress.

    Science.gov (United States)

    Liu, Ning; Staswick, Paul E; Avramova, Zoya

    2016-11-01

    Dehydration stress activates numerous genes co-regulated by diverse signaling pathways. Upon repeated exposures, however, a subset of these genes does not respond maintaining instead transcription at their initial pre-stressed levels ('revised-response' genes). Most of these genes are involved in jasmonic acid (JA) biosynthesis, JA-signaling and JA-mediated stress responses. How these JA-associated genes are regulated to provide different responses to similar dehydration stresses is an enigma. Here, we investigate molecular mechanisms that contribute to this transcriptional behavior. The memory-mechanism is stress-specific: one exposure to dehydration stress or to abscisic acid (ABA) is required to prevent transcription in the second. Both ABA-mediated and JA-mediated pathways are critical for the activation of these genes, but the two signaling pathways interact differently during a single or multiple encounters with dehydration stress. Synthesis of JA during the first (S1) but not the second dehydration stress (S2) accounts for the altered transcriptional responses. We propose a model for these memory responses, wherein lack of MYC2 and of JA synthesis in S2 is responsible for the lack of expression of downstream genes. The similar length of the memory displayed by different memory-type genes suggests biological relevance for transcriptional memory as a gene-regulating mechanism during recurring bouts of drought. © 2016 John Wiley & Sons Ltd.

  17. Characterization of chicken riboflavin carrier protein gene structure ...

    Indian Academy of Sciences (India)

    The chicken riboflavin carrier protein (RCP) is an estrogen induced egg yolk and white protein. Eggs from hens which have a splice mutation in RCP gene fail to hatch, indicating an absolute requirement of RCP for the transport of riboflavin to the oocyte. In order to understand the mechanism of regulation of this gene by ...

  18. Shorter CAG repeat in the AR gene is associated with atypical hyperplasia and breast carcinoma

    DEFF Research Database (Denmark)

    De Abreu, Francine Blumental; Pirolo, Leandro Júnior; Canevari, Renata de Azevedo

    2007-01-01

    BACKGROUND: Previous reports into the role of [CAG]n repeat lengths in the androgen receptor (AR) gene indicate that these may play an important part in the development and progression of breast cancer, however, knowledge regarding benign breast lesions is limited. PATIENTS AND METHODS: PCR-based...

  19. Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

    Directory of Open Access Journals (Sweden)

    Trimpalis Philip

    2011-07-01

    Full Text Available Abstract Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs.

  20. Cloning of gene-encoded stem bromelain on system coming from Pichia pastoris as therapeutic protein candidate

    Science.gov (United States)

    Yusuf, Y.; Hidayati, W.

    2018-01-01

    The process of identifying bacterial recombination using PCR, and restriction, and then sequencing process was done after identifying the bacteria. This research aimed to get a yeast cell of Pichia pastoris which has an encoder gene of stem bromelain enzyme. The production of recombinant stem bromelain enzymes using yeast cells of P. pastoris can produce pure bromelain rod enzymes and have the same conformation with the enzyme’s conformation in pineapple plants. This recombinant stem bromelain enzyme can be used as a therapeutic protein in inflammatory, cancer and degenerative diseases. This study was an early stage of a step series to obtain bromelain rod protein derived from pineapple made with genetic engineering techniques. This research was started by isolating the RNA of pineapple stem which was continued with constructing cDNA using reserve transcriptase-PCR technique (RT-PCR), doing the amplification of bromelain enzyme encoder gene with PCR technique using a specific premiere couple which was designed. The process was continued by cloning into bacterium cells of Escherichia coli. A vector which brought the encoder gene of stem bromelain enzyme was inserted into the yeast cell of P. pastoris and was continued by identifying the yeast cell of P. pastoris which brought the encoder gene of stem bromelain enzyme. The research has not found enzyme gene of stem bromelain in yeast cell of P. pastoris yet. The next step is repeating the process by buying new reagent; RNase inhibitor, and buying liquid nitrogen.

  1. The Immature Fiber Mutant Phenotype of Cotton (Gossypium hirsutum Is Linked to a 22-bp Frame-Shift Deletion in a Mitochondria Targeted Pentatricopeptide Repeat Gene

    Directory of Open Access Journals (Sweden)

    Gregory N. Thyssen

    2016-06-01

    Full Text Available Cotton seed trichomes are the most important source of natural fibers globally. The major fiber thickness properties influence the price of the raw material, and the quality of the finished product. The recessive immature fiber (im gene reduces the degree of fiber cell wall thickening by a process that was previously shown to involve mitochondrial function in allotetraploid Gossypium hirsutum. Here, we present the fine genetic mapping of the im locus, gene expression analysis of annotated proteins near the locus, and association analysis of the linked markers. Mapping-by-sequencing identified a 22-bp deletion in a pentatricopeptide repeat (PPR gene that is completely linked to the immature fiber phenotype in 2837 F2 plants, and is absent from all 163 cultivated varieties tested, although other closely linked marker polymorphisms are prevalent in the diversity panel. This frame-shift mutation results in a transcript with two long open reading frames: one containing the N-terminal transit peptide that targets mitochondria, the other containing only the RNA-binding PPR domains, suggesting that a functional PPR protein cannot be targeted to mitochondria in the im mutant. Taken together, these results suggest that PPR gene Gh_A03G0489 is involved in the cotton fiber wall thickening process, and is a promising candidate gene at the im locus. Our findings expand our understanding of the molecular mechanisms that modulate cotton fiber fineness and maturity, and may facilitate the development of cotton varieties with superior fiber attributes.

  2. Human Immunodeficiency Virus-Type 1 LTR DNA contains an intrinsic gene producing antisense RNA and protein products

    Directory of Open Access Journals (Sweden)

    Hsiao Chiu-Bin

    2006-11-01

    Full Text Available Abstract Background While viruses have long been shown to capitalize on their limited genomic size by utilizing both strands of DNA or complementary DNA/RNA intermediates to code for viral proteins, it has been assumed that human retroviruses have all their major proteins translated only from the plus or sense strand of RNA, despite their requirement for a dsDNA proviral intermediate. Several studies, however, have suggested the presence of antisense transcription for both HIV-1 and HTLV-1. More recently an antisense transcript responsible for the HTLV-1 bZIP factor (HBZ protein has been described. In this study we investigated the possibility of an antisense gene contained within the human immunodeficiency virus type 1 (HIV-1 long terminal repeat (LTR. Results Inspection of published sequences revealed a potential transcription initiator element (INR situated downstream of, and in reverse orientation to, the usual HIV-1 promoter and transcription start site. This antisense initiator (HIVaINR suggested the possibility of an antisense gene responsible for RNA and protein production. We show that antisense transcripts are generated, in vitro and in vivo, originating from the TAR DNA of the HIV-1 LTR. To test the possibility that protein(s could be translated from this novel HIV-1 antisense RNA, recombinant HIV antisense gene-FLAG vectors were designed. Recombinant protein(s were produced and isolated utilizing carboxy-terminal FLAG epitope (DYKDDDDK sequences. In addition, affinity-purified antisera to an internal peptide derived from the HIV antisense protein (HAP sequences identified HAPs from HIV+ human peripheral blood lymphocytes. Conclusion HIV-1 contains an antisense gene in the U3-R regions of the LTR responsible for both an antisense RNA transcript and proteins. This antisense transcript has tremendous potential for intrinsic RNA regulation because of its overlap with the beginning of all HIV-1 sense RNA transcripts by 25 nucleotides. The

  3. Downregulation of ATM Gene and Protein Expression in Canine Mammary Tumors.

    Science.gov (United States)

    Raposo-Ferreira, T M M; Bueno, R C; Terra, E M; Avante, M L; Tinucci-Costa, M; Carvalho, M; Cassali, G D; Linde, S D; Rogatto, S R; Laufer-Amorim, R

    2016-11-01

    The ataxia telangiectasia mutated (ATM) gene encodes a protein associated with DNA damage repair and maintenance of genomic integrity. In women, ATM transcript and protein downregulation have been reported in sporadic breast carcinomas, and the absence of ATM protein expression has been associated with poor prognosis. The aim of this study was to evaluate ATM gene and protein expression in canine mammary tumors and their association with clinical outcome. ATM gene and protein expression was evaluated by reverse transcription-quantitative polymerase chain reaction and immunohistochemistry, respectively, in normal mammary gland samples (n = 10), benign mammary tumors (n = 11), nonmetastatic mammary carcinomas (n = 19), and metastatic mammary carcinomas (n = 11). Lower ATM transcript levels were detected in benign mammary tumors and carcinomas compared with normal mammary glands (P = .011). Similarly, lower ATM protein expression was observed in benign tumors (P = .0003), nonmetastatic mammary carcinomas (P ATM gene or protein levels were detected among benign tumors and nonmetastatic and metastatic mammary carcinomas (P > .05). The levels of ATM gene or protein expression were not significantly associated with clinical and pathological features or with survival. Similar to human breast cancer, the data in this study suggest that ATM gene and protein downregulation is involved in canine mammary gland tumorigenesis. © The Author(s) 2016.

  4. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes

    OpenAIRE

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-01-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix–loop–helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks e...

  5. Alternative Conformations of the Tau Repeat Domain in Complex with an Engineered Binding Protein*

    Science.gov (United States)

    Grüning, Clara S. R.; Mirecka, Ewa A.; Klein, Antonia N.; Mandelkow, Eckhard; Willbold, Dieter; Marino, Stephen F.; Stoldt, Matthias; Hoyer, Wolfgang

    2014-01-01

    The aggregation of Tau into paired helical filaments is involved in the pathogenesis of several neurodegenerative diseases, including Alzheimer disease. The aggregation reaction is characterized by conformational conversion of the repeat domain, which partially adopts a cross-β-structure in the resulting amyloid-like fibrils. Here, we report the selection and characterization of an engineered binding protein, β-wrapin TP4, targeting the Tau repeat domain. TP4 was obtained by phage display using the four-repeat Tau construct K18ΔK280 as a target. TP4 binds K18ΔK280 as well as the longest isoform of human Tau, hTau40, with nanomolar affinity. NMR spectroscopy identified two alternative TP4-binding sites in the four-repeat domain, with each including two hexapeptide motifs with high β-sheet propensity. Both binding sites contain the aggregation-determining PHF6 hexapeptide within repeat 3. In addition, one binding site includes the PHF6* hexapeptide within repeat 2, whereas the other includes the corresponding hexapeptide Tau(337–342) within repeat 4, denoted PHF6**. Comparison of TP4-binding with Tau aggregation reveals that the same regions of Tau are involved in both processes. TP4 inhibits Tau aggregation at substoichiometric concentration, demonstrating that it interferes with aggregation nucleation. This study provides residue-level insight into the interaction of Tau with an aggregation inhibitor and highlights the structural flexibility of Tau. PMID:24966331

  6. Unique CCT repeats mediate transcription of the TWIST1 gene in mesenchymal cell lines

    International Nuclear Information System (INIS)

    Ohkuma, Mizue; Funato, Noriko; Higashihori, Norihisa; Murakami, Masanori; Ohyama, Kimie; Nakamura, Masataka

    2007-01-01

    TWIST1, a basic helix-loop-helix transcription factor, plays critical roles in embryo development, cancer metastasis and mesenchymal progenitor differentiation. Little is known about transcriptional regulation of TWIST1 expression. Here we identified DNA sequences responsible for TWIST1 expression in mesenchymal lineage cell lines. Reporter assays with TWIST1 promoter mutants defined the -102 to -74 sequences that are essential for TWIST1 expression in human and mouse mesenchymal cell lines. Tandem repeats of CCT, but not putative CREB and NF-κB sites in the sequences substantially supported activity of the TWIST1 promoter. Electrophoretic mobility shift assay demonstrated that the DNA sequences with the CCT repeats formed complexes with nuclear factors, containing, at least, Sp1 and Sp3. These results suggest critical implication of the CCT repeats in association with Sp1 and Sp3 factors in sustaining expression of the TWIST1 gene in mesenchymal cells

  7. Rapid and Specific Detection of Acidovorax avenae subsp. citrulli Using SYBR Green-Based Real-Time PCR Amplification of the YD-Repeat Protein Gene.

    Science.gov (United States)

    Cho, Min Seok; Park, Duck Hwan; Ahn, Tae-Young; Park, Dong Suk

    2015-09-01

    The aim of this study was to develop a SYBR Green-based real-time PCR assay for the rapid, specific, and sensitive detection of Acidovorax avenae subsp. citrulli, which causes bacterial fruit blotch (BFB), a serious disease of cucurbit plants. The molecular and serological methods currently available for the detection of this pathogen are insufficiently sensitive and specific. Thus, a novel SYBR Green-based real-time PCR assay targeting the YD-repeat protein gene of A. avenae subsp. citrulli was developed. The specificity of the primer set was evaluated using DNA purified from 6 isolates of A. avenae subsp. citrulli, 7 other Acidovorax species, and 22 of non-targeted strains, including pathogens and non-pathogens. The AC158F/R primer set amplified a single band of the expected size from genomic DNA obtained from the A. avenae subsp. citrulli strains but not from the genomic DNA of other Acidovorax species, including that of other bacterial genera. Using this assay, it was possible to detect at least one genomeequivalents of the cloned amplified target DNA using 5 × 10(0) fg/μl of purified genomic DNA per reaction or using a calibrated cell suspension, with 6.5 colony-forming units per reaction being employed. In addition, this assay is a highly sensitive and reliable method for identifying and quantifying the target pathogen in infected samples that does not require DNA extraction. Therefore, we suggest that this approach is suitable for the rapid and efficient diagnosis of A. avenae subsp. citrulli contaminations of seed lots and plants.

  8. Leucine-rich repeat-containing G-protein-coupled receptors as markers of adult stem cells

    NARCIS (Netherlands)

    Barker, N.; Clevers, H.

    2010-01-01

    Molecular markers are used to characterize and track adult stem cells. Colon cancer research has led to the identification of 2 related receptors, leucine-rich repeat-containing, G-protein-coupled receptors (Lgr)5 and Lgr6, that are expressed by small populations of cells in a variety of adult

  9. [open quotes]Cryptic[close quotes] repeating triplets of purines and pyrimidines (cRRY(i)) are frequent and polymorphic: Analysis of coding cRRY(i) in the proopiomelanocortin (POMC) and TATA-binding protein (TBP) genes

    Energy Technology Data Exchange (ETDEWEB)

    Gostout, B.; Qiang Liu; Sommer, S.S. (Mayo Clinic/Foundation, Rochester, MN (United States))

    1993-06-01

    Triplets of the form of purine, purine, pyrimidine (RRY(i)) are enhanced in frequency in the genomes of primates, rodents, and bacteria. Some RRY(i) are [open quotes]cryptic[close quotes] repeats (cRRY(i)) in which no one tandem run of a trinucleotide predominates. A search of human GenBank sequence revealed that the sequences of cRRY(i) are highly nonrandom. Three randomly chosen human cRRY(i) were sequenced in search of polymorphic alleles. Multiple polymorphic alleles were found in cRRY(i) in the coding regions of the genes for proopiomelanocortin (POMC) and TATA-binding protein (TBP). The highly polymorphic TBP cRRY(i) was characterized in detail. Direct sequencing of 157 unrelated human alleles demonstrated the presence of 20 different alleles which resulted in 29--40 consecutive glutamines in the amino-terminal region of TBP. These alleles are differently distributed among the races. PCR was used to screen 1,846 additional alleles in order to characterize more fully the range of variation in the population. Three additional alleles were discovered, but there was no example of a substantial sequence amplification as is seen in the repeat sequences associated with X-linked spinal and bulbar muscular atrophy, myotonic dystrophy, or the fragile-X syndrome. The structure of the TBP cRRY(i) is conserved in the five monkey species examined. In the chimpanzee, examination of four individuals revealed that the cRRY(i) was highly polymorphic, but the pattern of polymorphism differed from that in humans. The TBP cRRY(i) displays both similarities with and differences from the previously described RRY(i) in the coding sequence of the androgen receptor. The data suggest how simple tandem repeats could evolve from cryptic repeats. 18 refs., 3 figs., 6 tabs.

  10. Amelogenesis Imperfecta; Genes, Proteins, and Pathways

    Directory of Open Access Journals (Sweden)

    Claire E. L. Smith

    2017-06-01

    Full Text Available Amelogenesis imperfecta (AI is the name given to a heterogeneous group of conditions characterized by inherited developmental enamel defects. AI enamel is abnormally thin, soft, fragile, pitted and/or badly discolored, with poor function and aesthetics, causing patients problems such as early tooth loss, severe embarrassment, eating difficulties, and pain. It was first described separately from diseases of dentine nearly 80 years ago, but the underlying genetic and mechanistic basis of the condition is only now coming to light. Mutations in the gene AMELX, encoding an extracellular matrix protein secreted by ameloblasts during enamel formation, were first identified as a cause of AI in 1991. Since then, mutations in at least eighteen genes have been shown to cause AI presenting in isolation of other health problems, with many more implicated in syndromic AI. Some of the encoded proteins have well documented roles in amelogenesis, acting as enamel matrix proteins or the proteases that degrade them, cell adhesion molecules or regulators of calcium homeostasis. However, for others, function is less clear and further research is needed to understand the pathways and processes essential for the development of healthy enamel. Here, we review the genes and mutations underlying AI presenting in isolation of other health problems, the proteins they encode and knowledge of their roles in amelogenesis, combining evidence from human phenotypes, inheritance patterns, mouse models, and in vitro studies. An LOVD resource (http://dna2.leeds.ac.uk/LOVD/ containing all published gene mutations for AI presenting in isolation of other health problems is described. We use this resource to identify trends in the genes and mutations reported to cause AI in the 270 families for which molecular diagnoses have been reported by 23rd May 2017. Finally we discuss the potential value of the translation of AI genetics to clinical care with improved patient pathways and

  11. Amelogenesis Imperfecta; Genes, Proteins, and Pathways.

    Science.gov (United States)

    Smith, Claire E L; Poulter, James A; Antanaviciute, Agne; Kirkham, Jennifer; Brookes, Steven J; Inglehearn, Chris F; Mighell, Alan J

    2017-01-01

    Amelogenesis imperfecta (AI) is the name given to a heterogeneous group of conditions characterized by inherited developmental enamel defects. AI enamel is abnormally thin, soft, fragile, pitted and/or badly discolored, with poor function and aesthetics, causing patients problems such as early tooth loss, severe embarrassment, eating difficulties, and pain. It was first described separately from diseases of dentine nearly 80 years ago, but the underlying genetic and mechanistic basis of the condition is only now coming to light. Mutations in the gene AMELX , encoding an extracellular matrix protein secreted by ameloblasts during enamel formation, were first identified as a cause of AI in 1991. Since then, mutations in at least eighteen genes have been shown to cause AI presenting in isolation of other health problems, with many more implicated in syndromic AI. Some of the encoded proteins have well documented roles in amelogenesis, acting as enamel matrix proteins or the proteases that degrade them, cell adhesion molecules or regulators of calcium homeostasis. However, for others, function is less clear and further research is needed to understand the pathways and processes essential for the development of healthy enamel. Here, we review the genes and mutations underlying AI presenting in isolation of other health problems, the proteins they encode and knowledge of their roles in amelogenesis, combining evidence from human phenotypes, inheritance patterns, mouse models, and in vitro studies. An LOVD resource (http://dna2.leeds.ac.uk/LOVD/) containing all published gene mutations for AI presenting in isolation of other health problems is described. We use this resource to identify trends in the genes and mutations reported to cause AI in the 270 families for which molecular diagnoses have been reported by 23rd May 2017. Finally we discuss the potential value of the translation of AI genetics to clinical care with improved patient pathways and speculate on the

  12. Single Strand Annealing Plays a Major Role in RecA-Independent Recombination between Repeated Sequences in the Radioresistant Deinococcus radiodurans Bacterium.

    Directory of Open Access Journals (Sweden)

    Solenne Ithurbide

    2015-10-01

    Full Text Available The bacterium Deinococcus radiodurans is one of the most radioresistant organisms known. It is able to reconstruct a functional genome from hundreds of radiation-induced chromosomal fragments. Our work aims to highlight the genes involved in recombination between 438 bp direct repeats separated by intervening sequences of various lengths ranging from 1,479 bp to 10,500 bp to restore a functional tetA gene in the presence or absence of radiation-induced DNA double strand breaks. The frequency of spontaneous deletion events between the chromosomal direct repeats were the same in recA+ and in ΔrecA, ΔrecF, and ΔrecO bacteria, whereas recombination between chromosomal and plasmid DNA was shown to be strictly dependent on the RecA and RecF proteins. The presence of mutations in one of the repeated sequence reduced, in a MutS-dependent manner, the frequency of the deletion events. The distance between the repeats did not influence the frequencies of deletion events in recA+ as well in ΔrecA bacteria. The absence of the UvrD protein stimulated the recombination between the direct repeats whereas the absence of the DdrB protein, previously shown to be involved in DNA double strand break repair through a single strand annealing (SSA pathway, strongly reduces the frequency of RecA- (and RecO- independent deletions events. The absence of the DdrB protein also increased the lethal sectoring of cells devoid of RecA or RecO protein. γ-irradiation of recA+ cells increased about 10-fold the frequencies of the deletion events, but at a lesser extend in cells devoid of the DdrB protein. Altogether, our results suggest a major role of single strand annealing in DNA repeat deletion events in bacteria devoid of the RecA protein, and also in recA+ bacteria exposed to ionizing radiation.

  13. Gender, obesity and repeated elevation of C-reactive protein: data from the CARDIA cohort.

    Directory of Open Access Journals (Sweden)

    Shinya Ishii

    Full Text Available C-reactive Protein (CRP measurements above 10 mg/L have been conventionally treated as acute inflammation and excluded from epidemiologic studies of chronic inflammation. However, recent evidence suggest that such CRP elevations can be seen even with chronic inflammation. The authors assessed 3,300 participants in The Coronary Artery Risk Development in Young Adults study, who had two or more CRP measurements between 1992/3 and 2005/6 to a investigate characteristics associated with repeated CRP elevation above 10 mg/L; b identify subgroups at high risk of repeated elevation; and c investigate the effect of different CRP thresholds on the probability of an elevation being one-time rather than repeated. 225 participants (6.8% had one-time and 103 (3.1% had repeated CRP elevation above 10 mg/L. Repeated elevation was associated with obesity, female gender, low income, and sex hormone use. The probability of an elevation above 10 mg/L being one-time rather than repeated was lowest (51% in women with body mass index above 31 kg/m(2, compared to 82% in others. These findings suggest that CRP elevations above 10 mg/L in obese women are likely to be from chronic rather than acute inflammation, and that CRP thresholds above 10 mg/L may be warranted to distinguish acute from chronic inflammation in obese women.

  14. Ranking candidate disease genes from gene expression and protein interaction: a Katz-centrality based approach.

    Directory of Open Access Journals (Sweden)

    Jing Zhao

    Full Text Available Many diseases have complex genetic causes, where a set of alleles can affect the propensity of getting the disease. The identification of such disease genes is important to understand the mechanistic and evolutionary aspects of pathogenesis, improve diagnosis and treatment of the disease, and aid in drug discovery. Current genetic studies typically identify chromosomal regions associated specific diseases. But picking out an unknown disease gene from hundreds of candidates located on the same genomic interval is still challenging. In this study, we propose an approach to prioritize candidate genes by integrating data of gene expression level, protein-protein interaction strength and known disease genes. Our method is based only on two, simple, biologically motivated assumptions--that a gene is a good disease-gene candidate if it is differentially expressed in cases and controls, or that it is close to other disease-gene candidates in its protein interaction network. We tested our method on 40 diseases in 58 gene expression datasets of the NCBI Gene Expression Omnibus database. On these datasets our method is able to predict unknown disease genes as well as identifying pleiotropic genes involved in the physiological cellular processes of many diseases. Our study not only provides an effective algorithm for prioritizing candidate disease genes but is also a way to discover phenotypic interdependency, cooccurrence and shared pathophysiology between different disorders.

  15. The human protein disulfide isomerase gene family

    Directory of Open Access Journals (Sweden)

    Galligan James J

    2012-07-01

    Full Text Available Abstract Enzyme-mediated disulfide bond formation is a highly conserved process affecting over one-third of all eukaryotic proteins. The enzymes primarily responsible for facilitating thiol-disulfide exchange are members of an expanding family of proteins known as protein disulfide isomerases (PDIs. These proteins are part of a larger superfamily of proteins known as the thioredoxin protein family (TRX. As members of the PDI family of proteins, all proteins contain a TRX-like structural domain and are predominantly expressed in the endoplasmic reticulum. Subcellular localization and the presence of a TRX domain, however, comprise the short list of distinguishing features required for gene family classification. To date, the PDI gene family contains 21 members, varying in domain composition, molecular weight, tissue expression, and cellular processing. Given their vital role in protein-folding, loss of PDI activity has been associated with the pathogenesis of numerous disease states, most commonly related to the unfolded protein response (UPR. Over the past decade, UPR has become a very attractive therapeutic target for multiple pathologies including Alzheimer disease, Parkinson disease, alcoholic and non-alcoholic liver disease, and type-2 diabetes. Understanding the mechanisms of protein-folding, specifically thiol-disulfide exchange, may lead to development of a novel class of therapeutics that would help alleviate a wide range of diseases by targeting the UPR.

  16. The repeatability of interleukin-6, tumor necrosis factor-alpha, and C-reactive protein in COPD patients over one year

    DEFF Research Database (Denmark)

    Kolsum, Umme; Roy, Kay; Starkey, Cerys

    2009-01-01

    BACKGROUND: Many of the systemic manifestations of chronic obstructive pulmonary disease (COPD) are mediated through increased systemic levels of inflammatory proteins. We assessed the long term repeatability of interleukin-6 (IL-6), tumor necrosis factor-alpha (TNF-alpha), and C-reactive protein......(i)) and the Bland-Altman method. Pearson correlations were used to determine the relationships between the systemic markers at both visits. RESULTS: There was moderate repeatability with a very high degree of statistical significance (p...... (CRP) over one year and examined the relationships between these systemic markers in COPD. METHODS: Fifty-eight stable COPD patients completed a baseline and one-year visit. Serum IL-6, plasma CRP, and plasma TNF-alpha were measured. Repeatability was expressed by intraclass correlation coefficient (R...

  17. The repeatability of interleukin-6, tumor necrosis factor-alpha, and C-reactive protein in COPD patients over one year

    DEFF Research Database (Denmark)

    Kolsum, Umme; Roy, Kay; Starkey, Cerys

    2009-01-01

    BACKGROUND: Many of the systemic manifestations of chronic obstructive pulmonary disease (COPD) are mediated through increased systemic levels of inflammatory proteins. We assessed the long term repeatability of Interleukin-6 (IL-6), tumor necrosis factor-alpha (TNF-alpha), and C-reactive protein......(i)) and the Bland-Altman method. Pearson correlations were used to determine the relationships between the systemic markers at both visits. RESULTS: There was moderate repeatability with a very high degree of statistical significance (p...... (CRP) over one year and examined the relationships between these systemic markers in COPD. METHODS: Fifty-eight stable COPD patients completed a baseline and one-year visit. Serum IL-6, plasma CRP, and plasma TNF-alpha were measured. Repeatability was expressed by intraclass correlation coefficient (R...

  18. Prediction of the Ebola Virus Infection Related Human Genes Using Protein-Protein Interaction Network.

    Science.gov (United States)

    Cao, HuanHuan; Zhang, YuHang; Zhao, Jia; Zhu, Liucun; Wang, Yi; Li, JiaRui; Feng, Yuan-Ming; Zhang, Ning

    2017-01-01

    Ebola hemorrhagic fever (EHF) is caused by Ebola virus (EBOV). It is reported that human could be infected by EBOV with a high fatality rate. However, association factors between EBOV and host still tend to be ambiguous. According to the "guilt by association" (GBA) principle, proteins interacting with each other are very likely to function similarly or the same. Based on this assumption, we tried to obtain EBOV infection-related human genes in a protein-protein interaction network using Dijkstra algorithm. We hope it could contribute to the discovery of novel effective treatments. Finally, 15 genes were selected as potential EBOV infection-related human genes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  19. Receptor-like proteins involved in plant disease resistance

    NARCIS (Netherlands)

    Kruijt, M.; Kock, de M.J.D.; Wit, de P.J.G.M.

    2005-01-01

    Race-specific resistance in plants against microbial pathogens is governed by several distinct classes of resistance (R) genes. This review focuses on the class that consists of the plasma membrane-bound leucine-rich repeat proteins known as receptor-like proteins (RLPs). The first isolated

  20. Porcine lung surfactant protein B gene (SFTPB)

    DEFF Research Database (Denmark)

    Cirera Salicio, Susanna; Fredholm, Merete

    2008-01-01

    The porcine surfactant protein B (SFTPB) is a single copy gene on chromosome 3. Three different cDNAs for the SFTPB have been isolated and sequenced. Nucleotide sequence comparison revealed six nonsynonymous single nucleotide polymorphisms (SNPs), four synonymous SNPs and an in-frame deletion of 69...... bp in the region coding for the active protein. Northern analysis showed lung-specific expression of three different isoforms of the SFTPB transcript. The expression level for the SFTPB gene is low in 50 days-old fetus and it increases during lung development. Quantitative real-time polymerase chain...

  1. Identification of a point mutation in growth factor repeat C of the low density lipoprotein-receptor gene in a patient with homozygous familial hypercholesterolemia

    International Nuclear Information System (INIS)

    Soutar, A.K.; Knight, B.L.; Patel, D.D.

    1989-01-01

    The coding region of the low density lipoprotein (LDL)-receptor gene from a patient (MM) with homozygous familial hypercholesterolemia (FH) has been sequenced from six overlapping 500-base-pair amplified fragments of the cDNA from cultured skin fibroblasts. Two separate single nucleotide base changes from the normal sequence were detected. The first involved substitution of guanine for adenine in the third position of the codon for amino acid residue Cys-27 and did not affect the protein sequence. The second mutation was substitution of thymine for cytosine in the DNA for the codon for amino acid residue 664, changing the codon from CCG (proline) to CTG (leucine) and introducing a new site for the restriction enzyme PstI. MM is a true homozygote with two identical genes, and the mutation cosegregated with clinically diagnosed FH in his family in which first cousin marriages occurred frequently. LDL receptors in MM's skin fibroblasts bind less LDL than normal and with reduced affinity. Thus this naturally occurring single point mutation affects both intracellular transport of the protein and ligand binding and occurs in growth factor-like repeat C, a region that has not previously been found to influence LDL binding

  2. Genetic diversity of the merozoite surface protein-3 gene in Plasmodium falciparum populations in Thailand.

    Science.gov (United States)

    Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai

    2016-10-21

    An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (F st ) and neutrality tests (Tajima's D, Fu and Li D* and Fu and Li' F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.

  3. MNF, an ankyrin repeat protein of myxoma virus, is part of a native cellular SCF complex during viral infection

    Directory of Open Access Journals (Sweden)

    Gelfi Jacqueline

    2010-03-01

    Full Text Available Abstract Myxoma virus (MYXV, a member of the Poxviridae family, is the agent responsible for myxomatosis, a fatal disease in the European rabbit (Oryctolagus cuniculus. Like all poxviruses, MYXV is known for encoding multiple proteins that regulate cellular signaling pathways. Among them, four proteins share the same ANK/PRANC structure: M148R, M149R, MNF (Myxoma Nuclear factor and M-T5, all of them described as virulence factors. This family of poxvirus proteins, recently identified, has drawn considerable attention for its potential role in modulating the host ubiquitin-proteasome system during viral infection. To date, many members of this novel protein family have been shown to interact with SCF components, in vitro. Here, we focus on MNF gene, which has been shown to express a nuclear protein presenting nine ANK repeats, one of which has been identified as a nuclear localization signal. In transfection, MNF has been shown to colocalise with the transcription factor NF-κB in the nucleus of TNFα-stimulated cells. Functionally, MNF is a critical virulence factor since its deletion generates an almost apathogenic virus. In this study, to pursue the investigation of proteins interacting with MNF and of its mechanism of action, we engineered a recombinant MYXV expressing a GFP-linked MNF under the control of MNF native promoter. Infection of rabbits with MYXV-GFPMNF recombinant virus provided the evidence that the GFP fusion does not disturb the main function of MNF. Hence, cells were infected with MYXV-GFPMNF and immunoprecipitation of the GFPMNF fusion protein was performed to identify MNF's partners. For the first time, endogenous components of SCF (Cullin-1 and Skp1 were co-precipitated with an ANK myxoma virus protein, expressed in an infectious context, and without over-expression of any protein.

  4. MNF, an ankyrin repeat protein of myxoma virus, is part of a native cellular SCF complex during viral infection

    Science.gov (United States)

    2010-01-01

    Myxoma virus (MYXV), a member of the Poxviridae family, is the agent responsible for myxomatosis, a fatal disease in the European rabbit (Oryctolagus cuniculus). Like all poxviruses, MYXV is known for encoding multiple proteins that regulate cellular signaling pathways. Among them, four proteins share the same ANK/PRANC structure: M148R, M149R, MNF (Myxoma Nuclear factor) and M-T5, all of them described as virulence factors. This family of poxvirus proteins, recently identified, has drawn considerable attention for its potential role in modulating the host ubiquitin-proteasome system during viral infection. To date, many members of this novel protein family have been shown to interact with SCF components, in vitro. Here, we focus on MNF gene, which has been shown to express a nuclear protein presenting nine ANK repeats, one of which has been identified as a nuclear localization signal. In transfection, MNF has been shown to colocalise with the transcription factor NF-κB in the nucleus of TNFα-stimulated cells. Functionally, MNF is a critical virulence factor since its deletion generates an almost apathogenic virus. In this study, to pursue the investigation of proteins interacting with MNF and of its mechanism of action, we engineered a recombinant MYXV expressing a GFP-linked MNF under the control of MNF native promoter. Infection of rabbits with MYXV-GFPMNF recombinant virus provided the evidence that the GFP fusion does not disturb the main function of MNF. Hence, cells were infected with MYXV-GFPMNF and immunoprecipitation of the GFPMNF fusion protein was performed to identify MNF's partners. For the first time, endogenous components of SCF (Cullin-1 and Skp1) were co-precipitated with an ANK myxoma virus protein, expressed in an infectious context, and without over-expression of any protein. PMID:20211013

  5. Unraveling the Role of RNA Mediated Toxicity of C9orf72 Repeats in C9-FTD/ALS

    Directory of Open Access Journals (Sweden)

    Vijay Kumar

    2017-12-01

    Full Text Available The most frequent genetic cause of amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD is intronic hexanucleotide (G4C2 repeat expansions (HRE in the C9orf72 gene. The non-exclusive pathogenic mechanisms by which C9orf72 repeat expansions contribute to these neurological disorders include loss of C9orf72 function and gain-of-function determined by toxic RNA molecules and dipeptides repeats protein toxicity. The expanded repeats are transcribed bidirectionally and forms RNA foci in the central nervous system, and sequester key RNA-binding proteins (RBPs leading to impairment in RNA processing events. Many studies report widespread transcriptome changes in ALS carrying a C9orf72 repeat expansion. Here we review the contribution of RNA foci interaction with RBPs as well as transcriptome changes involved in the pathogenesis of C9orf72- associated FTD/ALS. These informations are essential to elucidate the pathology and therapeutic intervention of ALS and/or FTD.

  6. Identifying Novel Candidate Genes Related to Apoptosis from a Protein-Protein Interaction Network

    Directory of Open Access Journals (Sweden)

    Baoman Wang

    2015-01-01

    Full Text Available Apoptosis is the process of programmed cell death (PCD that occurs in multicellular organisms. This process of normal cell death is required to maintain the balance of homeostasis. In addition, some diseases, such as obesity, cancer, and neurodegenerative diseases, can be cured through apoptosis, which produces few side effects. An effective comprehension of the mechanisms underlying apoptosis will be helpful to prevent and treat some diseases. The identification of genes related to apoptosis is essential to uncover its underlying mechanisms. In this study, a computational method was proposed to identify novel candidate genes related to apoptosis. First, protein-protein interaction information was used to construct a weighted graph. Second, a shortest path algorithm was applied to the graph to search for new candidate genes. Finally, the obtained genes were filtered by a permutation test. As a result, 26 genes were obtained, and we discuss their likelihood of being novel apoptosis-related genes by collecting evidence from published literature.

  7. The repeatability of interleukin-6, tumor necrosis factor-α, and C-reactive protein in COPD patients over one year

    Directory of Open Access Journals (Sweden)

    Umme Kolsum

    2009-04-01

    Full Text Available Umme Kolsum, Kay Roy, Cerys Starkey, Zoë Borrill, Nick Truman, Jørgen Vestbo, Dave SinghNorth West Lung Research Centre, University of Manchester, South Manchester University Hospitals Trust, Wythenshawe, Manchester, UKBackground: Many of the systemic manifestations of chronic obstructive pulmonary disease (COPD are mediated through increased systemic levels of inflammatory proteins. We assessed the long term repeatability of Interleukin-6 (IL-6, tumor necrosis factor-α (TNF-α, and C-reactive protein (CRP over one year and examined the relationships between these systemic markers in COPD.Methods: Fifty-eight stable COPD patients completed a baseline and one-year visit. Serum IL-6, plasma CRP, and plasma TNF-α were measured. Repeatability was expressed by intraclass correlation coefficient (Ri and the Bland–Altman method. Pearson correlations were used to determine the relationships between the systemic markers at both visits.Results: There was moderate repeatability with a very high degree of statistical significance (p ≤ 0.001 between the two visits for all the systemic biomarkers (IL-6, CRP, and TNF-α. CRP was significantly associated with IL-6 at both visits (r = 0.55, p = 0.0001, r = 0.51, p = 0.0002, respectively. There were no other significant associations between the systemic markers at either of the visits.Conclusions: Systemic inflammatory biomarkers IL-6, CRP, and TNF-α were moderately repeatable over a twelve month period in COPD patients. We have also shown that a robust and repeatable association between IL-6 and CRP exists.Keywords: interleukin-6, tumor necrosis factor-α, C-reactive protein, repeatability, COPD   

  8. Variable number of tandem repeats of 9 Plasmodium vivax genes among Southeast Asian isolates.

    Science.gov (United States)

    Wang, Bo; Nyunt, Myat Htut; Yun, Seung-Gyu; Lu, Feng; Cheng, Yang; Han, Jin-Hee; Ha, Kwon-Soo; Park, Won Sun; Hong, Seok-Ho; Lim, Chae-Seung; Cao, Jun; Sattabongkot, Jetsumon; Kyaw, Myat Phone; Cui, Liwang; Han, Eun-Taek

    2017-06-01

    The variable number of tandem repeats (VNTRs) provides valuable information about both the functional and evolutionary aspects of genetic diversity. Comparative analysis of 3 Plasmodium falciparum genomes has shown that more than 9% of its open reading frames (ORFs) harbor VNTRs. Although microsatellites and VNTR genes of P. vivax were reported, the VNTR polymorphism of genes has not been examined widely. In this study, 230 P. vivax genes were analyzed for VNTRs by SERV, and 33 kinds of TR deletions or insertions from 29 P. vivax genes (12.6%) were found. Of these, 9 VNTR fragments from 8 P. vivax genes were used for PCR amplification and sequence analysis to examine the genetic diversity among 134 isolates from four Southeast Asian countries (China, Republic of Korea, Thailand, and Myanmar) with different malaria endemicity. We confirmed the existence of extensive polymorphism of VNTR fragments in field isolates. This detection provides several suitable markers for analysis of the molecular epidemiology of P. vivax field isolates. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese.

    Science.gov (United States)

    Ebstein, Richard P; Monakhov, Mikhail V; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-08-22

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal-conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. © 2015 The Author(s).

  10. Association between the dopamine D4 receptor gene exon III variable number of tandem repeats and political attitudes in female Han Chinese

    Science.gov (United States)

    Ebstein, Richard P.; Monakhov, Mikhail V.; Lu, Yunfeng; Jiang, Yushi; Lai, Poh San; Chew, Soo Hong

    2015-01-01

    Twin and family studies suggest that political attitudes are partially determined by an individual's genotype. The dopamine D4 receptor gene (DRD4) exon III repeat region that has been extensively studied in connection with human behaviour, is a plausible candidate to contribute to individual differences in political attitudes. A first United States study provisionally identified this gene with political attitude along a liberal–conservative axis albeit contingent upon number of friends. In a large sample of 1771 Han Chinese university students in Singapore, we observed a significant main effect of association between the DRD4 exon III variable number of tandem repeats and political attitude. Subjects with two copies of the 4-repeat allele (4R/4R) were significantly more conservative. Our results provided evidence for a role of the DRD4 gene variants in contributing to individual differences in political attitude particularly in females and more generally suggested that associations between individual genes, and neurochemical pathways, contributing to traits relevant to the social sciences can be provisionally identified. PMID:26246555

  11. Using the clustered circular layout as an informative method for visualizing protein-protein interaction networks.

    Science.gov (United States)

    Fung, David C Y; Wilkins, Marc R; Hart, David; Hong, Seok-Hee

    2010-07-01

    The force-directed layout is commonly used in computer-generated visualizations of protein-protein interaction networks. While it is good for providing a visual outline of the protein complexes and their interactions, it has two limitations when used as a visual analysis method. The first is poor reproducibility. Repeated running of the algorithm does not necessarily generate the same layout, therefore, demanding cognitive readaptation on the investigator's part. The second limitation is that it does not explicitly display complementary biological information, e.g. Gene Ontology, other than the protein names or gene symbols. Here, we present an alternative layout called the clustered circular layout. Using the human DNA replication protein-protein interaction network as a case study, we compared the two network layouts for their merits and limitations in supporting visual analysis.

  12. Characterization of α-isopropylmalate synthases containing different copy numbers of tandem repeats in Mycobacterium tuberculosis

    Directory of Open Access Journals (Sweden)

    Palittapongarnpim Prasit

    2009-06-01

    Full Text Available Abstract Background Alpha-isopropylmalate synthase (α-IPMS is the key enzyme that catalyzes the first committed step in the leucine biosynthetic pathway. The gene encoding α-IPMS in Mycobacterium tuberculosis, leuA, is polymorphic due to the insertion of 57-bp repeat units referred to as Variable Number of Tandem Repeats (VNTR. The role of the VNTR found within the M. tuberculosis genome is unclear. To investigate the role of the VNTR in leuA, we compared two α-IPMS proteins with different numbers of amino acid repeats, one with two copies and the other with 14 copies. We have cloned leuA with 14 copies of the repeat units into the pET15b expression vector with a His6-tag at the N-terminus, as was previously done for the leuA gene with two copies of the repeat units. Results The recombinant His6-α-IPMS proteins with two and 14 copies (α-IPMS-2CR and α-IPMS-14CR, respectively of the repeat units were purified by immobilized metal ion affinity chromatography and gel filtration. Both enzymes were found to be dimers by gel filtration. Both enzymes work well at pH values of 7–8.5 and temperatures of 37–42°C. However, α-IPMS-14CR tolerates pH values and temperatures outside of this range better than α-IPMS-2CR does. α-IPMS-14CR has higher affinity than α-IPMS-2CR for the two substrates, α-ketoisovalerate and acetyl CoA. Furthermore, α-IPMS-2CR was feedback inhibited by the end product l-leucine, whereas α-IPMS-14CR was not. Conclusion The differences in the kinetic properties and the l-leucine feedback inhibition between the two M. tuberculosis α-IPMS proteins containing low and high numbers of VNTR indicate that a large VNTR insertion affects protein structure and function. Demonstration of l-leucine binding to α-IPMS-14CR would confirm whether or not α-IPMS-14CR responds to end-product feedback inhibition.

  13. Repeated short climatic change affects the epidermal differentiation program and leads to matrix remodeling in a human organotypic skin model.

    Science.gov (United States)

    Boutrand, Laetitia-Barbollat; Thépot, Amélie; Muther, Charlotte; Boher, Aurélie; Robic, Julie; Guéré, Christelle; Vié, Katell; Damour, Odile; Lamartine, Jérôme

    2017-01-01

    Human skin is subject to frequent changes in ambient temperature and humidity and needs to cope with these environmental modifications. To decipher the molecular response of human skin to repeated climatic change, a versatile model of skin equivalent subject to "hot-wet" (40°C, 80% relative humidity [RH]) or "cold-dry" (10°C, 40% RH) climatic stress repeated daily was used. To obtain an exhaustive view of the molecular mechanisms elicited by climatic change, large-scale gene expression DNA microarray analysis was performed and modulated function was determined by bioinformatic annotation. This analysis revealed several functions, including epidermal differentiation and extracellular matrix, impacted by repeated variations in climatic conditions. Some of these molecular changes were confirmed by histological examination and protein expression. Both treatments (hot-wet and cold-dry) reduced the expression of genes encoding collagens, laminin, and proteoglycans, suggesting a profound remodeling of the extracellular matrix. Strong induction of the entire family of late cornified envelope genes after cold-dry exposure, confirmed at protein level, was also observed. These changes correlated with an increase in epidermal differentiation markers such as corneodesmosin and a thickening of the stratum corneum, indicating possible implementation of defense mechanisms against dehydration. This study for the first time reveals the complex pattern of molecular response allowing adaption of human skin to repeated change in its climatic environment.

  14. Convergent evolution of gene networks by single-gene duplications in higher eukaryotes.

    Science.gov (United States)

    Amoutzias, Gregory D; Robertson, David L; Oliver, Stephen G; Bornberg-Bauer, Erich

    2004-03-01

    By combining phylogenetic, proteomic and structural information, we have elucidated the evolutionary driving forces for the gene-regulatory interaction networks of basic helix-loop-helix transcription factors. We infer that recurrent events of single-gene duplication and domain rearrangement repeatedly gave rise to distinct networks with almost identical hub-based topologies, and multiple activators and repressors. We thus provide the first empirical evidence for scale-free protein networks emerging through single-gene duplications, the dominant importance of molecular modularity in the bottom-up construction of complex biological entities, and the convergent evolution of networks.

  15. A single gene directs synthesis of a precursor protein with beta- and alpha-amylase activities in Bacillus polymyxa.

    OpenAIRE

    Uozumi, N; Sakurai, K; Sasaki, T; Takekawa, S; Yamagata, H; Tsukagoshi, N; Udaka, S

    1989-01-01

    The Bacillus polymyxa amylase gene comprises 3,588 nucleotides. The mature amylase comprises 1,161 amino acids with a molecular weight of 127,314. The gene appeared to be divided into two portions by the direct-repeat sequence located at almost the middle of the gene. The 5' region upstream of the direct-repeat sequence was shown to be responsible for the synthesis of beta-amylase. The 3' region downstream of the direct-repeat sequence contained four sequences homologous with those in other a...

  16. Characterization of a Beta vulgaris PGIP defense gene promoter in transgenic plants

    Science.gov (United States)

    Polygalacturonase-inhibiting protein (BvPGIP) genes were cloned from a sugar beet breeding line F1016 with increased tolerance to the sugar beet root maggot. Polygalacturonase-inhibiting proteins are cell wall leucine-rich repeat (LRR) proteins with crucial roles in development, pathogen defense an...

  17. The relationship among gene expression, the evolution of gene dosage, and the rate of protein evolution.

    Directory of Open Access Journals (Sweden)

    Jean-François Gout

    2010-05-01

    Full Text Available The understanding of selective constraints affecting genes is a major issue in biology. It is well established that gene expression level is a major determinant of the rate of protein evolution, but the reasons for this relationship remain highly debated. Here we demonstrate that gene expression is also a major determinant of the evolution of gene dosage: the rate of gene losses after whole genome duplications in the Paramecium lineage is negatively correlated to the level of gene expression, and this relationship is not a byproduct of other factors known to affect the fate of gene duplicates. This indicates that changes in gene dosage are generally more deleterious for highly expressed genes. This rule also holds for other taxa: in yeast, we find a clear relationship between gene expression level and the fitness impact of reduction in gene dosage. To explain these observations, we propose a model based on the fact that the optimal expression level of a gene corresponds to a trade-off between the benefit and cost of its expression. This COSTEX model predicts that selective pressure against mutations changing gene expression level or affecting the encoded protein should on average be stronger in highly expressed genes and hence that both the frequency of gene loss and the rate of protein evolution should correlate negatively with gene expression. Thus, the COSTEX model provides a simple and common explanation for the general relationship observed between the level of gene expression and the different facets of gene evolution.

  18. Drosophila TDP-43 RNA-Binding Protein Facilitates Association of Sister Chromatid Cohesion Proteins with Genes, Enhancers and Polycomb Response Elements.

    Directory of Open Access Journals (Sweden)

    Amanda Swain

    2016-09-01

    Full Text Available The cohesin protein complex mediates sister chromatid cohesion and participates in transcriptional control of genes that regulate growth and development. Substantial reduction of cohesin activity alters transcription of many genes without disrupting chromosome segregation. Drosophila Nipped-B protein loads cohesin onto chromosomes, and together Nipped-B and cohesin occupy essentially all active transcriptional enhancers and a large fraction of active genes. It is unknown why some active genes bind high levels of cohesin and some do not. Here we show that the TBPH and Lark RNA-binding proteins influence association of Nipped-B and cohesin with genes and gene regulatory sequences. In vitro, TBPH and Lark proteins specifically bind RNAs produced by genes occupied by Nipped-B and cohesin. By genomic chromatin immunoprecipitation these RNA-binding proteins also bind to chromosomes at cohesin-binding genes, enhancers, and Polycomb response elements (PREs. RNAi depletion reveals that TBPH facilitates association of Nipped-B and cohesin with genes and regulatory sequences. Lark reduces binding of Nipped-B and cohesin at many promoters and aids their association with several large enhancers. Conversely, Nipped-B facilitates TBPH and Lark association with genes and regulatory sequences, and interacts with TBPH and Lark in affinity chromatography and immunoprecipitation experiments. Blocking transcription does not ablate binding of Nipped-B and the RNA-binding proteins to chromosomes, indicating transcription is not required to maintain binding once established. These findings demonstrate that RNA-binding proteins help govern association of sister chromatid cohesion proteins with genes and enhancers.

  19. Molecular Characterization of Respiratory Syncytial Virus in Children with Repeated Infections with Subgroup B in the Philippines.

    Science.gov (United States)

    Okamoto, Michiko; Dapat, Clyde P; Sandagon, Ann Marie D; Batangan-Nacion, Leilanie P; Lirio, Irene C; Tamaki, Raita; Saito, Mayuko; Saito-Obata, Mariko; Lupisan, Socorro P; Oshitani, Hitoshi

    2018-05-02

    Human respiratory syncytial virus (RSV) is the leading cause of severe acute respiratory infection in infants and young children, which is characterized by repeated infections. However, the role of amino acid substitutions in repeated infections remains unclear. Hence, this study aimed to elucidate the genetic characteristics of RSV in children with repeated infections using molecular analyses of F and G genes. We conducted a cohort study for children younger than 5 years in the Philippines. We collected nasopharyngeal swabs from children with acute respiratory symptoms and compared F and G sequences between prior and subsequent RSV infections. We examined 1,802 children from May 2014 to January 2016 and collected 3,471 samples. Repeated infections were observed in 25 children, including 4 with homologous RSV-B reinfections. Viruses from the 4 pairs of homologous reinfections had amino acid substitutions in the G protein mostly at O-glycosylation sites, whereas changes in the F protein were identified at antigenic sites V (L173S) and θ (Q209K), considered essential epitopes for the prefusion conformation of the F protein. Amino acid substitutions in G and F proteins of RSV-B might have led to antigenic changes, potentially contributing to homologous reinfections observed in this study.

  20. Cloning of human genes encoding novel G protein-coupled receptors

    Energy Technology Data Exchange (ETDEWEB)

    Marchese, A.; Docherty, J.M.; Heiber, M. [Univ. of Toronto, (Canada)] [and others

    1994-10-01

    We report the isolation and characterization of several novel human genes encoding G protein-coupled receptors. Each of the receptors contained the familiar seven transmembrane topography and most closely resembled peptide binding receptors. Gene GPR1 encoded a receptor protein that is intronless in the coding region and that shared identity (43% in the transmembrane regions) with the opioid receptors. Northern blot analysis revealed that GPR1 transcripts were expressed in the human hippocampus, and the gene was localized to chromosome 15q21.6. Gene GPR2 encoded a protein that most closely resembled an interleukin-8 receptor (51% in the transmembrane regions), and this gene, not expressed in the six brain regions examined, was localized to chromosome 17q2.1-q21.3. A third gene, GPR3, showed identity (56% in the transmembrane regions) with a previously characterized cDNA clone from rat and was localized to chromosome 1p35-p36.1. 31 refs., 5 figs., 1 tab.

  1. The proliferation marker pKi-67 becomes masked to MIB-1 staining after expression of its tandem repeats.

    Science.gov (United States)

    Schmidt, Mirko H H; Broll, Rainer; Bruch, Hans-Peter; Duchrow, Michael

    2002-11-01

    The Ki-67 antigen, pKi-67, is one of the most commonly used markers of proliferating cells. The protein can only be detected in dividing cells (G(1)-, S-, G(2)-, and M-phase) but not in quiescent cells (G(0)). The standard antibody to detect pKi-67 is MIB-1, which detects the so-called 'Ki-67 motif' FKELF in 9 of the protein's 16 tandem repeats. To investigate the function of these repeats we expressed three of them in an inducible gene expression system in HeLa cells. Surprisingly, addition of a nuclear localization sequence led to a complete absence of signal in the nuclei of MIB-1-stained cells. At the same time antibodies directed against different epitopes of pKi-67 did not fail to detect the protein. We conclude that the overexpression of the 'Ki-67 motif', which is present in the repeats, can lead to inability of MIB-1 to detect its antigen as demonstrated in adenocarcinoma tissue samples. Thereafter, in order to prevent the underestimation of Ki-67 proliferation indices in MIB-1-labeled preparations, additional antibodies (for example, MIB-21) should be used. Additionally, we could show in a mammalian two-hybrid assay that recombinant pKi-67 repeats are capable of self-associating with endogenous pKi-67. Speculating that the tandem repeats are intimately involved in its protein-protein interactions, this offers new insights in how access to these repeats is regulated by pKi-67 itself.

  2. A complex of Cas proteins 5, 6, and 7 is required for the biogenesis and stability of clustered regularly interspaced short palindromic repeats (crispr)-derived rnas (crrnas) in Haloferax volcanii.

    Science.gov (United States)

    Brendel, Jutta; Stoll, Britta; Lange, Sita J; Sharma, Kundan; Lenz, Christof; Stachler, Aris-Edda; Maier, Lisa-Katharina; Richter, Hagen; Nickel, Lisa; Schmitz, Ruth A; Randau, Lennart; Allers, Thorsten; Urlaub, Henning; Backofen, Rolf; Marchfelder, Anita

    2014-03-07

    The clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR-Cas) system is a prokaryotic defense mechanism against foreign genetic elements. A plethora of CRISPR-Cas versions exist, with more than 40 different Cas protein families and several different molecular approaches to fight the invading DNA. One of the key players in the system is the CRISPR-derived RNA (crRNA), which directs the invader-degrading Cas protein complex to the invader. The CRISPR-Cas types I and III use the Cas6 protein to generate mature crRNAs. Here, we show that the Cas6 protein is necessary for crRNA production but that additional Cas proteins that form a CRISPR-associated complex for antiviral defense (Cascade)-like complex are needed for crRNA stability in the CRISPR-Cas type I-B system in Haloferax volcanii in vivo. Deletion of the cas6 gene results in the loss of mature crRNAs and interference. However, cells that have the complete cas gene cluster (cas1-8b) removed and are transformed with the cas6 gene are not able to produce and stably maintain mature crRNAs. crRNA production and stability is rescued only if cas5, -6, and -7 are present. Mutational analysis of the cas6 gene reveals three amino acids (His-41, Gly-256, and Gly-258) that are essential for pre-crRNA cleavage, whereas the mutation of two amino acids (Ser-115 and Ser-224) leads to an increase of crRNA amounts. This is the first systematic in vivo analysis of Cas6 protein variants. In addition, we show that the H. volcanii I-B system contains a Cascade-like complex with a Cas7, Cas5, and Cas6 core that protects the crRNA.

  3. Think big--giant genes in bacteria.

    Science.gov (United States)

    Reva, Oleg; Tümmler, Burkhard

    2008-03-01

    Long genes should be rare in archaea and eubacteria because of the demanding costs of time and resources for protein production. The search in 580 sequenced prokaryotic genomes, however, revealed 0.2% of all genes to be longer than 5 kb (absolute number: 3732 genes). Eighty giant bacterial genes of more than 20 kb in length were identified in 47 taxa that belong to the phyla Thermotogae (1), Chlorobi (3), Planctomycetes (1), Cyanobacteria (2), Firmicutes (7), Actinobacteria (9), Proteobacteria (23) or Euryarchaeota (1) (number of taxa in brackets). Giant genes are strain-specific, differ in their tetranucleotide usage from the bulk genome and occur preferentially in non-pathogenic environmental bacteria. The two longest bacterial genes known to date were detected in the green sulfur bacterium Chlorobium chlorochromatii CaD3 encoding proteins of 36 806 and 20 647 amino acids, being surpassed in length only by the human titin coding sequence. More than 90% of bacterial giant genes either encode a surface protein or a polyketide/non-ribosomal peptide synthetase. Most surface proteins are acidic, threonine-rich, lack cystein and harbour multiple amino acid repeats. Giant proteins increase bacterial fitness by the production of either weapons towards or shields against animate competitors or hostile environments.

  4. Regulation of human protein S gene (PROS1) transcription

    NARCIS (Netherlands)

    Wolf, Cornelia de

    2006-01-01

    This thesis describes the investigation of the transcriptional regulation of the gene for anticoagulant plasma Protein S, PROS1. Protein S is a cofactor for Protein C in the Protein C anticoagulant pathway. The coagulation cascade is negatively regulated by this pathway through inactivation of

  5. Changes in nucleosome repeat lengths precede replication in the early replicating metallothionein II gene region of cells synchronized in early S phase

    International Nuclear Information System (INIS)

    D'Anna, J.A.; Tobey, R.A.

    1989-01-01

    Previous investigations showed that inhibition of DNA synthesis by hydroxyurea, aphidicolin, or 5-fluorodeoxyuridine produced large changes in the composition and nucleosome repeat lengths of bulk chromatin. There the authors report results of investigations to determine whether the changes in nucleosome repeat lengths might be localized in the initiated replicons, as postulated. In most experiments, Chinese hamster (line CHO) cells were synchronized in G1, or they were synchronized in early S phase by allowing G1 cells to enter S phase in medium containing 1 mM hydroxyurea or 5 μg mL -1 aphidicolin, a procedure believed to produce an accumulation of initiated replicons that arise from normally early replicating DNA. Measurements of nucleosome repeat lengths of bulk chromatin, the early replicating unexpressed metallothionein II (MTII) gene region, and a later replicating repeated sequence indicate that the changes in repeat lengths occur preferentially in the early replicating MTII gene region as G1 cells enter and become synchronized in early S phase. During that time, the MTII gene region is not replicated nor is there any evidence for induction of MTII messenger RNA. Thus, the results are consistent with the hypothesis that changes in chromatin structure occur preferentially in the early replicating (presumably initiated) replicons at initiation or that changes in chromatin structure can precede replication during inhibition of DNA synthesis. The shortened repeat lengths that precede MTII replication are, potentially, reversible, because they become elongated when the synchronized early S-phase cells are released to resume cell cycle progression

  6. The relationship between CA repeat polymorphism of the IGF-1 gene and the structure of motor skills in young athletes.

    Science.gov (United States)

    Karpowicz, Krzysztof; Krych, Katarzyna; Karpowicz, Małgorzata; Nowak, Witold; Gronek, Piotr

    2018-01-01

    The map of candidate genes that can potentially affect physical fitness becomes larger every year, and they are associated with such aspects as respiratory and cardiovascular stability; body build and composition - especially muscle mass and strength; carbohydrate and lipid metabolism; response to training; and exercise intolerance.The aim of this study was to analyze the relationship between the CA repeat polymorphism of the P1 promoter of the IGF1 gene and the structure of motor skills in the two groups of Polish young athletes in 2007-2009. In this study, 350 young sportsmen representing different sports disciplines were examined (age = 15.5 ± 0.5 years), by genotyping the IGF1 gene and determining the structure of motor skills using the International Physical Fitness Test (IPFT) battery. The multiple stepwise regression was used to determine the impact of the investigated motor skills on the indicator of the overall physical fitness, measured by the total score of the International Physical Fitness Test (IPFT). The analysis showed some regularity related to the character of the IGF1 gene polymorphism. It can be concluded that the two groups of young boys athletes practicing various sports disciplines (kinds of physical exercise) displayed similar associations between CA repeat polymorphism of the P1 promoter of the IGF1 gene and the level of motor effects. Our results suggest that this polymorphism may be a genetic marker of the physical performance phenotype. We demonstrated that CA repeat polymorphism of the P1 promoter of the IGF1 gene was associated with strength predispositions in the homozygous and non-carriers groups. In the group who were heterozygous it was speed-strength aptitudes.

  7. Creation and structure determination of an artificial protein with three complete sequence repeats

    Energy Technology Data Exchange (ETDEWEB)

    Adachi, Motoyasu, E-mail: adachi.motoyasu@jaea.go.jp; Shimizu, Rumi; Kuroki, Ryota [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Blaber, Michael [Japan Atomic Energy Agency, Shirakatashirane 2-4, Nakagun Tokaimura, Ibaraki 319-1195 (Japan); Florida State University, Tallahassee, FL 32306-4300 (United States)

    2013-11-01

    An artificial protein with three complete sequence repeats was created and the structure was determined by X-ray crystallography. The structure showed threefold symmetry even though there is an amino- and carboxy-terminal. The artificial protein with threefold symmetry may be useful as a scaffold to capture small materials with C3 symmetry. Symfoil-4P is a de novo protein exhibiting the threefold symmetrical β-trefoil fold designed based on the human acidic fibroblast growth factor. First three asparagine–glycine sequences of Symfoil-4P are replaced with glutamine–glycine (Symfoil-QG) or serine–glycine (Symfoil-SG) sequences protecting from deamidation, and His-Symfoil-II was prepared by introducing a protease digestion site into Symfoil-QG so that Symfoil-II has three complete repeats after removal of the N-terminal histidine tag. The Symfoil-QG and SG and His-Symfoil-II proteins were expressed in Eschericha coli as soluble protein, and purified by nickel affinity chromatography. Symfoil-II was further purified by anion-exchange chromatography after removing the HisTag by proteolysis. Both Symfoil-QG and Symfoil-II were crystallized in 0.1 M Tris-HCl buffer (pH 7.0) containing 1.8 M ammonium sulfate as precipitant at 293 K; several crystal forms were observed for Symfoil-QG and II. The maximum diffraction of Symfoil-QG and II crystals were 1.5 and 1.1 Å resolution, respectively. The Symfoil-II without histidine tag diffracted better than Symfoil-QG with N-terminal histidine tag. Although the crystal packing of Symfoil-II is slightly different from Symfoil-QG and other crystals of Symfoil derivatives having the N-terminal histidine tag, the refined crystal structure of Symfoil-II showed pseudo-threefold symmetry as expected from other Symfoils. Since the removal of the unstructured N-terminal histidine tag did not affect the threefold structure of Symfoil, the improvement of diffraction quality of Symfoil-II may be caused by molecular characteristics of

  8. Ideal crop plant architecture is mediated by tassels replace upper ears1, a BTB/POZ ankyrin repeat gene directly targeted by TEOSINTE BRANCHED1.

    Science.gov (United States)

    Dong, Zhaobin; Li, Wei; Unger-Wallace, Erica; Yang, Jinliang; Vollbrecht, Erik; Chuck, George

    2017-10-10

    Axillary branch suppression is a favorable trait bred into many domesticated crop plants including maize compared with its highly branched wild ancestor teosinte. Branch suppression in maize was achieved through selection of a gain of function allele of the teosinte branched1 (tb1) transcription factor that acts as a repressor of axillary bud growth. Previous work indicated that other loci may function epistatically with tb1 and may be responsible for some of its phenotypic effects. Here, we show that tb1 mediates axillary branch suppression through direct activation of the tassels replace upper ears1 ( tru1 ) gene that encodes an ankyrin repeat domain protein containing a BTB/POZ motif necessary for protein-protein interactions. The expression of TRU1 and TB1 overlap in axillary buds, and TB1 binds to two locations in the tru1 gene as shown by chromatin immunoprecipitation and gel shifts. In addition, nucleotide diversity surveys indicate that tru1 , like tb1 , was a target of selection. In modern maize, TRU1 is highly expressed in the leaf trace vasculature of axillary internodes, while in teosinte, this expression is highly reduced or absent. This increase in TRU1 expression levels in modern maize is supported by comparisons of relative protein levels with teosinte as well as by quantitative measurements of mRNA levels. Hence, a major innovation in creating ideal maize plant architecture originated from ectopic overexpression of tru1 in axillary branches, a critical step in mediating the effects of domestication by tb1.

  9. Identification and expression analysis of the interferon-induced protein with tetratricopeptide repeats 5 (IFIT5 gene in duck (Anas platyrhynchos domesticus.

    Directory of Open Access Journals (Sweden)

    Bin Wang

    Full Text Available The interferon-induced proteins with tetratricopeptide repeats (IFITs protein family mediates antiviral effects by inhibiting translation initiation, cell proliferation, and migration in the interferon (IFN dependent innate immune system. Several members of this family, including IFIT1, IFIT2, IFIT3 and IFIT5, have been heavily studied in mammals. Avian species contain only one family member, IFIT5, and little is known about the role of this protein in birds. In this study, duck IFIT5 (duIFIT5 full-length mRNA was cloned by reverse transcription polymerase chain reaction (RT-PCR and rapid amplification of the cDNA ends (RACE. Based on the sequence obtained, we performed a series of bioinformatics analyses, and found that duIFIT5 was most similar to homologs in other avian species. Also, duIFIT5 contained eight conserved TPR motifs and two conserved multi-domains (TPR_11 and TPR_12. Finally, we used duck hepatitis virus type 1 (DHV-1 and polyriboinosinicpolyribocytidylic acid (poly (I:C as a pathogen or a pathogen-associated molecular pattern induction to infect three-day-old domestic ducklings. The liver and spleen were collected to detect the change in duIFIT5 transcript level upon infection by quantitative real-time PCR (qRT-PCR. DuIFIT5 expression rapidly increased after DHV-1 infection and maintained a high level, while the transcripts of duIFIT5 peaked at 8h after poly (I:C infection and then returned to normal. Taken together, these results provide a greater understanding of avian IFIT5.

  10. Identification and Expression Analysis of the Interferon-Induced Protein with Tetratricopeptide Repeats 5 (IFIT5) Gene in Duck (Anas platyrhynchos domesticus)

    Science.gov (United States)

    Mu, Chunyu; Su, Yanhui; Liu, Ran; Huang, Zhengyang; Li, Yang; Yu, Qingming; Chang, Guobin; Xu, Qi; Chen, Guohong

    2015-01-01

    The interferon-induced proteins with tetratricopeptide repeats (IFITs) protein family mediates antiviral effects by inhibiting translation initiation, cell proliferation, and migration in the interferon (IFN) dependent innate immune system. Several members of this family, including IFIT1, IFIT2, IFIT3 and IFIT5, have been heavily studied in mammals. Avian species contain only one family member, IFIT5, and little is known about the role of this protein in birds. In this study, duck IFIT5 (duIFIT5) full-length mRNA was cloned by reverse transcription polymerase chain reaction (RT-PCR) and rapid amplification of the cDNA ends (RACE). Based on the sequence obtained, we performed a series of bioinformatics analyses, and found that duIFIT5 was most similar to homologs in other avian species. Also, duIFIT5 contained eight conserved TPR motifs and two conserved multi-domains (TPR_11 and TPR_12). Finally, we used duck hepatitis virus type 1 (DHV-1) and polyriboinosinicpolyribocytidylic acid (poly (I:C)) as a pathogen or a pathogen-associated molecular pattern induction to infect three-day-old domestic ducklings. The liver and spleen were collected to detect the change in duIFIT5 transcript level upon infection by quantitative real-time PCR (qRT-PCR). DuIFIT5 expression rapidly increased after DHV-1 infection and maintained a high level, while the transcripts of duIFIT5 peaked at 8h after poly (I:C) infection and then returned to normal. Taken together, these results provide a greater understanding of avian IFIT5. PMID:25816333

  11. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Directory of Open Access Journals (Sweden)

    Graner Andreas

    2008-10-01

    Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular

  12. Cardiac ankyrin repeat protein (CARP) expression in human and murine atherosclerotic lesions - Activin induces carp in smooth muscle cells

    NARCIS (Netherlands)

    de Waard, Vivian; van Achterberg, Tanja A. E.; Beauchamp, Nicholas J.; Pannekoek, Hans; de Vries, Carlie J. M.

    2003-01-01

    Objective-Cardiac ankyrin repeat protein (CARP) is a transcription factor-related protein that has been studied most extensively in the heart. In the present study, we investigated the expression and the potential function of CARP in human and murine atherosclerosis. Methods and Results-CARP

  13. Acyl-CoA-binding protein/diazepam-binding inhibitor gene and pseudogenes

    DEFF Research Database (Denmark)

    Mandrup, S; Hummel, R; Ravn, S

    1992-01-01

    Acyl-CoA-binding protein (ACBP) is a 10 kDa protein isolated from bovine liver by virtue of its ability to bind and induce the synthesis of medium-chain acyl-CoA esters. Surprisingly, it turned out to be identical to a protein named diazepam-binding Inhibitor (DBI) claimed to be an endogenous mod...... have molecularly cloned and characterized the ACBP/DBI gene family in rat. The rat ACBP/DBI gene family comprises one expressed gene and four processed pseudogenes of which one was shown to exist in two allelic forms. The expressed gene is organized into four exons and three introns...

  14. Differential interaction and aggregation of 3-repeat and 4-repeat tau isoforms with 14-3-3ζ protein

    International Nuclear Information System (INIS)

    Sadik, Golam; Tanaka, Toshihisa; Kato, Kiyoko; Yanagi, Kentaro; Kudo, Takashi; Takeda, Masatoshi

    2009-01-01

    Tau isoforms, 3-repeat (3R) and 4-repeat tau (4R), are differentially involved in neuronal development and in several tauopathies. 14-3-3 protein binds to tau and 14-3-3/tau association has been found both in the development and in tauopathies. To understand the role of 14-3-3 in the differential regulation of tau isoforms, we have performed studies on the interaction and aggregation of 3R-tau and 4R-tau, either phosphorylated or unphosphorylated, with 14-3-3ζ. We show by surface plasmon resonance studies that the interaction between unphosphorylated 3R-tau and 14-3-3ζ is ∼3-folds higher than that between unphosphorylated 4R-tau and 14-3-3ζ. Phosphorylation of tau by protein kinase A (PKA) increases the affinity of both 3R- and 4R-tau for 14-3-3ζ to a similar level. An in vitro aggregation assay employing both transmission electron microscopy and fluorescence spectroscopy revealed the aggregation of unphosphorylated 4R-tau to be significantly higher than that of unphosphorylated 3R-tau following the induction of 14-3-3ζ. The filaments formed from 3R- and 4R-tau were almost similar in morphology. In contrast, the aggregation of both 3R- and 4R-tau was reduced to a similar low level after phosphorylation with PKA. Taken together, these results suggest that 14-3-3ζ exhibits a similar role for tau isoforms after PKA-phosphorylation, but a differential role for unphosphorylated tau. The significant aggregation of 4R-tau by 14-3-3ζ suggests that 14-3-3 may act as an inducer in the generation of 4R-tau-predominant neurofibrillary tangles in tauopathies.

  15. Gene ontology based transfer learning for protein subcellular localization

    Directory of Open Access Journals (Sweden)

    Zhou Shuigeng

    2011-02-01

    Full Text Available Abstract Background Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as GO, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the GO terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology. Results In this paper, we propose a Gene Ontology Based Transfer Learning Model (GO-TLM for large-scale protein subcellular localization. The model transfers the signature-based homologous GO terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false GO terms that are resulted from evolutionary divergence. We derive three GO kernels from the three aspects of gene ontology to measure the GO similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for

  16. Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

    Science.gov (United States)

    Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

    2018-01-01

    Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…

  17. A novel seven-octapeptide repeat insertion in the prion protein gene (PRNP) in a Dutch pedigree with Gerstmann-Sträussler-Scheinker disease phenotype: comparison with similar cases from the literature

    NARCIS (Netherlands)

    Jansen, Casper; Voet, Willem; Head, Mark W.; Parchi, Piero; Yull, Helen; Verrips, Aad; Wesseling, Pieter; Meulstee, Jan; Baas, Frank; van Gool, Willem A.; Ironside, James W.; Rozemuller, Annemieke J. M.

    2011-01-01

    Human prion diseases can be sporadic, inherited or acquired by infection and show considerable phenotypic heterogeneity. We describe the clinical, histopathological and pathological prion protein (PrP(Sc)) characteristics of a Dutch family with a novel 7-octapeptide repeat insertion (7-OPRI) in

  18. Disruption of Higher Order DNA Structures in Friedreich's Ataxia (GAA)(n) Repeats by PNA or LNA Targeting

    DEFF Research Database (Denmark)

    Bergquist, Helen; Rocha, Cristina S. J.; Alvarez-Asencio, Ruben

    2016-01-01

    Expansion of (GAA)n repeats in the first intron of the Frataxin gene is associated with reduced mRNA and protein levels and the development of Friedreich’s ataxia. (GAA)n expansions form non-canonical structures, including intramolecular triplex (H-DNA), and R-loops and are associated with epigen...

  19. SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2017-04-01

    Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .

  20. A Complex of Cas Proteins 5, 6, and 7 Is Required for the Biogenesis and Stability of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-derived RNAs (crRNAs) in Haloferax volcanii*

    Science.gov (United States)

    Brendel, Jutta; Stoll, Britta; Lange, Sita J.; Sharma, Kundan; Lenz, Christof; Stachler, Aris-Edda; Maier, Lisa-Katharina; Richter, Hagen; Nickel, Lisa; Schmitz, Ruth A.; Randau, Lennart; Allers, Thorsten; Urlaub, Henning; Backofen, Rolf; Marchfelder, Anita

    2014-01-01

    The clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR-Cas) system is a prokaryotic defense mechanism against foreign genetic elements. A plethora of CRISPR-Cas versions exist, with more than 40 different Cas protein families and several different molecular approaches to fight the invading DNA. One of the key players in the system is the CRISPR-derived RNA (crRNA), which directs the invader-degrading Cas protein complex to the invader. The CRISPR-Cas types I and III use the Cas6 protein to generate mature crRNAs. Here, we show that the Cas6 protein is necessary for crRNA production but that additional Cas proteins that form a CRISPR-associated complex for antiviral defense (Cascade)-like complex are needed for crRNA stability in the CRISPR-Cas type I-B system in Haloferax volcanii in vivo. Deletion of the cas6 gene results in the loss of mature crRNAs and interference. However, cells that have the complete cas gene cluster (cas1–8b) removed and are transformed with the cas6 gene are not able to produce and stably maintain mature crRNAs. crRNA production and stability is rescued only if cas5, -6, and -7 are present. Mutational analysis of the cas6 gene reveals three amino acids (His-41, Gly-256, and Gly-258) that are essential for pre-crRNA cleavage, whereas the mutation of two amino acids (Ser-115 and Ser-224) leads to an increase of crRNA amounts. This is the first systematic in vivo analysis of Cas6 protein variants. In addition, we show that the H. volcanii I-B system contains a Cascade-like complex with a Cas7, Cas5, and Cas6 core that protects the crRNA. PMID:24459147

  1. Haplotypes in the gene encoding protein kinase c-beta (PRKCB1) on chromosome 16 are associated with autism.

    Science.gov (United States)

    Philippi, A; Roschmann, E; Tores, F; Lindenbaum, P; Benajou, A; Germain-Leclerc, L; Marcaillou, C; Fontaine, K; Vanpeene, M; Roy, S; Maillard, S; Decaulne, V; Saraiva, J P; Brooks, P; Rousseau, F; Hager, J

    2005-10-01

    Autism is a developmental disorder characterized by impairments in social interaction and communication associated with repetitive patterns of interest or behavior. Autism is highly influenced by genetic factors. Genome-wide linkage and candidate gene association approaches have been used to try and identify autism genes. A few loci have repeatedly been reported linked to autism. Several groups reported evidence for linkage to a region on chromosome 16p. We have applied a direct physical identity-by-descent (IBD) mapping approach to perform a high-density (0.85 megabases) genome-wide linkage scan in 116 families from the AGRE collection. Our results confirm linkage to a region on chromosome 16p with autism. High-resolution single-nucleotide polymorphism (SNP) genotyping and analysis of this region show that haplotypes in the protein kinase c-beta gene are strongly associated with autism. An independent replication of the association in a second set of 167 trio families with autism confirmed our initial findings. Overall, our data provide evidence that the PRKCB1 gene on chromosome 16p may be involved in the etiology of autism.

  2. Identification of distinct specificity determinants in resistance protein Cf-4 allows construction of a Cf-9 mutant that confers recognition of avirulence protein AVR4

    NARCIS (Netherlands)

    Hoorn, Van der R.A.L.; Roth, R.; Wit, De P.J.G.M.

    2001-01-01

    The tomato resistance genes Cf-4 and Cf-9 confer specific, hypersensitive response-associated recognition of Cladosporium carrying the avirulence genes Avr4 and Avr9, respectively. Cf-4 and Cf-9 encode type I transmembrane proteins with extracellular leucine-rich repeats (LRRs). Compared with Cf-9,

  3. JP-3 gene polymorphism in a healthy population of Serbia and ...

    Indian Academy of Sciences (India)

    Unknown

    sions are the cause of Huntington disease like-2 (HDL-2) phenotype. CTG repeats in ... Analyses of CTG repeat polymorphism of JP-3 gene in various healthy populations could help in estimating ... of a recently described membrane protein family exclu- .... low frequency of larger alleles in German population based on very ...

  4. Gene, protein and network of male sterility in rice

    Directory of Open Access Journals (Sweden)

    Wang eKun

    2013-04-01

    Full Text Available Rice is one of the most important model crop plants whose heterosis has been well exploited in commercial hybrid seed production via a variety of types of male sterile lines. Hybrid rice cultivation area is steadily expanding around the world, especially in Southern Asia. Characterization of genes and proteins related to male sterility aims to understand how and why the male sterility occurs, and which proteins are the key players for microspores abortion. Recently, a series of genes and proteins related to cytoplasmic male sterility, photoperiod sensitive male sterility, self-incompatibility and other types of microspores deterioration have been characterized through genetics or proteomics. Especially the latter, offers us a powerful and high throughput approach to discern the novel proteins involving in male-sterile pathways which may help us to breed artificial male-sterile system. This represents an alternative tool to meet the critical challenge of further development of hybrid rice. In this paper, we reviewed the recent developments in our understanding of male sterility in rice hybrid production across gene, protein and integrated network levels, and also, present a perspective on the engineering of male sterile lines for hybrid rice production.

  5. Repeat-mediated genetic and epigenetic changes at the FMR1 locus in the Fragile X-related disorders

    Science.gov (United States)

    Usdin, Karen; Hayward, Bruce E.; Kumari, Daman; Lokanga, Rachel A.; Sciascia, Nicholas; Zhao, Xiao-Nan

    2014-01-01

    The Fragile X-related disorders are a group of genetic conditions that include the neurodegenerative disorder, Fragile X-associated tremor/ataxia syndrome (FXTAS), the fertility disorder, Fragile X-associated primary ovarian insufficiency (FXPOI) and the intellectual disability, Fragile X syndrome (FXS). The pathology in all these diseases is related to the number of CGG/CCG-repeats in the 5′ UTR of the Fragile X mental retardation 1 (FMR1) gene. The repeats are prone to continuous expansion and the increase in repeat number has paradoxical effects on gene expression increasing transcription on mid-sized alleles and decreasing it on longer ones. In some cases the repeats can simultaneously both increase FMR1 mRNA production and decrease the levels of the FMR1 gene product, Fragile X mental retardation 1 protein (FMRP). Since FXTAS and FXPOI result from the deleterious consequences of the expression of elevated levels of FMR1 mRNA and FXS is caused by an FMRP deficiency, the clinical picture is turning out to be more complex than once appreciated. Added complications result from the fact that increasing repeat numbers make the alleles somatically unstable. Thus many individuals have a complex mixture of different sized alleles in different cells. Furthermore, it has become apparent that the eponymous fragile site, once thought to be no more than a useful diagnostic criterion, may have clinical consequences for females who inherit chromosomes that express this site. This review will cover what is currently known about the mechanisms responsible for repeat instability, for the repeat-mediated epigenetic changes that affect expression of the FMR1 gene, and for chromosome fragility. It will also touch on what current and future options are for ameliorating some of these effects. PMID:25101111

  6. Epstein-Barr virus immediate-early gene product trans-activates gene expression from the human immunodeficiency virus long terminal repeat

    International Nuclear Information System (INIS)

    Kenney, S.; Kamine, J.; Markovitz, D.; Fenrick, R.; Pagano, J.

    1988-01-01

    Acquired immunodeficiency syndrome patients are frequently coinfected with Epstein-Barr virus (EBV). In this report, the authors demonstrate that an EBV immediate-early gene product, BamHI MLF1, stimulates expression of the bacterial chloramphenicol acetyltransferase (CAT) gene linked to the human immunodeficiency virus (HIV) promoter. The HIV promoter sequences necessary for trans-activation by EBV do not include the tat-responsive sequences. In addition, in contrast to the other herpesvirus trans-activators previously studied, the EBV BamHI MLF1 gene product appears to function in part by a posttranscriptional mechanism, since it increases pHIV-CAT protein activity more than it increases HIV-CAT mRNA. This ability of an EBV gene product to activate HIV gene expression may have biologic consequences in persons coinfected with both viruses

  7. Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.

    Science.gov (United States)

    Ostria-Hernández, Martha Lorena; Sánchez-Vallejo, Carlos Javier; Ibarra, J Antonio; Castro-Escarpulli, Graciela

    2015-08-04

    In recent years the emergence of multidrug resistant Klebsiella pneumoniae strains has been an increasingly common event. This opportunistic species is one of the five main bacterial pathogens that cause hospital infections worldwide and multidrug resistance has been associated with the presence of high molecular weight plasmids. Plasmids are generally acquired through horizontal transfer and therefore is possible that systems that prevent the entry of foreign genetic material are inactive or absent. One of these systems is CRISPR/Cas. However, little is known regarding the clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) system in K. pneumoniae. The adaptive immune system CRISPR/Cas has been shown to limit the entry of foreign genetic elements into bacterial organisms and in some bacteria it has been shown to be involved in regulation of virulence genes. Thus in this work we used bioinformatics tools to determine the presence or absence of CRISPR/Cas systems in available K. pneumoniae genomes. The complete CRISPR/Cas system was identified in two out of the eight complete K. pneumoniae genomes sequences and in four out of the 44 available draft genomes sequences. The cas genes in these strains comprises eight cas genes similar to those found in Escherichia coli, suggesting they belong to the type I-E group, although their arrangement is slightly different. As for the CRISPR sequences, the average lengths of the direct repeats and spacers were 29 and 33 bp, respectively. BLAST searches demonstrated that 38 of the 116 spacer sequences (33%) are significantly similar to either plasmid, phage or genome sequences, while the remaining 78 sequences (67%) showed no significant similarity to other sequences. The region where the CRISPR/Cas systems were located is the same in all the Klebsiella genomes containing it, it has a syntenic architecture, and is located among genes encoding for proteins likely involved in

  8. The candidate phylum Poribacteria by single-cell genomics: new insights into phylogeny, cell-compartmentation, eukaryote-like repeat proteins, and other genomic features.

    Directory of Open Access Journals (Sweden)

    Janine Kamke

    Full Text Available The candidate phylum Poribacteria is one of the most dominant and widespread members of the microbial communities residing within marine sponges. Cell compartmentalization had been postulated along with their discovery about a decade ago and their phylogenetic association to the Planctomycetes, Verrucomicrobia, Chlamydiae superphylum was proposed soon thereafter. In the present study we revised these features based on genomic data obtained from six poribacterial single cells. We propose that Poribacteria form a distinct monophyletic phylum contiguous to the PVC superphylum together with other candidate phyla. Our genomic analyses supported the possibility of cell compartmentalization in form of bacterial microcompartments. Further analyses of eukaryote-like protein domains stressed the importance of such proteins with features including tetratricopeptide repeats, leucin rich repeats as well as low density lipoproteins receptor repeats, the latter of which are reported here for the first time from a sponge symbiont. Finally, examining the most abundant protein domain family on poribacterial genomes revealed diverse phyH family proteins, some of which may be related to dissolved organic posphorus uptake.

  9. Mouse Models of C9orf72 Hexanucleotide Repeat Expansion in Amyotrophic Lateral Sclerosis/ Frontotemporal Dementia

    Directory of Open Access Journals (Sweden)

    Ranjan Batra

    2017-07-01

    Full Text Available The presence of hexanucleotide repeat expansion (HRE in the first intron of the human C9orf72 gene is the most common genetic cause underlying both familial amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD. Studies aimed at elucidating the pathogenic mechanisms associated of C9orf72 FTD and ALS (C9FTD/ALS have focused on the hypothesis of RNA and protein toxic gain-of-function models, including formation of nuclear RNA foci containing GGGGCC (G4C2 HRE, inclusions containing dipeptide repeat proteins through a non-canonical repeat associated non-ATG (RAN translation mechanism, and on loss-of-function of the C9orf72 protein. Immense effort to elucidate these mechanisms has been put forth and toxic gain-of-function models have especially gained attention. Various mouse models that recapitulate distinct disease-related pathological, functional, and behavioral phenotypes have been generated and characterized. Although these models express the C9orf72 HRE mutation, there are numerous differences among them, including the transgenesis approach to introduce G4C2-repeat DNA, genomic coverage of C9orf72 features in the transgene, G4C2-repeat length after genomic stabilization, spatiotemporal expression profiles of RNA foci and RAN protein aggregates, neuropathological features, and neurodegeneration-related clinical symptoms. This review aims to (1 provide an overview of the key characteristics; (2 provide insights into potential pathological factors contributing to neurotoxicity and clinical phenotypes through systematic comparison of these models.

  10. Unusually effective microRNA targeting within repeat-rich coding regions of mammalian mRNAs

    Science.gov (United States)

    Schnall-Levin, Michael; Rissland, Olivia S.; Johnston, Wendy K.; Perrimon, Norbert; Bartel, David P.; Berger, Bonnie

    2011-01-01

    MicroRNAs (miRNAs) regulate numerous biological processes by base-pairing with target messenger RNAs (mRNAs), primarily through sites in 3′ untranslated regions (UTRs), to direct the repression of these targets. Although miRNAs have sometimes been observed to target genes through sites in open reading frames (ORFs), large-scale studies have shown such targeting to be generally less effective than 3′ UTR targeting. Here, we show that several miRNAs each target significant groups of genes through multiple sites within their coding regions. This ORF targeting, which mediates both predictable and effective repression, arises from highly repeated sequences containing miRNA target sites. We show that such sequence repeats largely arise through evolutionary duplications and occur particularly frequently within families of paralogous C2H2 zinc-finger genes, suggesting the potential for their coordinated regulation. Examples of ORFs targeted by miR-181 include both the well-known tumor suppressor RB1 and RBAK, encoding a C2H2 zinc-finger protein and transcriptional binding partner of RB1. Our results indicate a function for repeat-rich coding sequences in mediating post-transcriptional regulation and reveal circumstances in which miRNA-mediated repression through ORF sites can be reliably predicted. PMID:21685129

  11. Revisiting the TALE repeat.

    Science.gov (United States)

    Deng, Dong; Yan, Chuangye; Wu, Jianping; Pan, Xiaojing; Yan, Nieng

    2014-04-01

    Transcription activator-like (TAL) effectors specifically bind to double stranded (ds) DNA through a central domain of tandem repeats. Each TAL effector (TALE) repeat comprises 33-35 amino acids and recognizes one specific DNA base through a highly variable residue at a fixed position in the repeat. Structural studies have revealed the molecular basis of DNA recognition by TALE repeats. Examination of the overall structure reveals that the basic building block of TALE protein, namely a helical hairpin, is one-helix shifted from the previously defined TALE motif. Here we wish to suggest a structure-based re-demarcation of the TALE repeat which starts with the residues that bind to the DNA backbone phosphate and concludes with the base-recognition hyper-variable residue. This new numbering system is consistent with the α-solenoid superfamily to which TALE belongs, and reflects the structural integrity of TAL effectors. In addition, it confers integral number of TALE repeats that matches the number of bound DNA bases. We then present fifteen crystal structures of engineered dHax3 variants in complex with target DNA molecules, which elucidate the structural basis for the recognition of bases adenine (A) and guanine (G) by reported or uncharacterized TALE codes. Finally, we analyzed the sequence-structure correlation of the amino acid residues within a TALE repeat. The structural analyses reported here may advance the mechanistic understanding of TALE proteins and facilitate the design of TALEN with improved affinity and specificity.

  12. High-Pressure NMR and SAXS Reveals How Capping Modulates Folding Cooperativity of the pp32 Leucine-rich Repeat Protein.

    Science.gov (United States)

    Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A

    2018-04-27

    Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.

  13. Comparative analysis of the prion protein gene sequences in African lion.

    Science.gov (United States)

    Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming

    2006-10-01

    The prion protein gene of African lion (Panthera Leo) was first cloned and polymorphisms screened. The results suggest that the prion protein gene of eight African lions is highly homogenous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) in the prion protein gene (Prnp) of African lion were found, but no amino acid substitutions. Sequence analysis showed that the higher homology is observed to felis catus AF003087 (96.7%) and to sheep number M31313.1 (96.2%) Genbank accessed. With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross species transmission of prion disease.

  14. Study on Fusion Protein and Its gene in Baculovirus Specificity

    International Nuclear Information System (INIS)

    Nemr, W.A.H.

    2012-01-01

    Baculoviruses are subdivided into two groups depending on the type of budded virus envelop fusion protein; group I utilized gp64 which include the most of nucleopolyhedroviruses (NPVs), group II utilized F protein which include the remnants of NPVs and all Granuloviruses (GVs). Recent studies reported the viral F protein coding gene as a host cellular sourced gene and may evolutionary acquired from the host genome referring to phylogeny analysis of fusion proteins. Thus, it was deduced that F protein coding gene is species- specific nucleotide sequence related to the type of the specific host and if virus could infect an unexpected host, the resulted virus may encode a vary F gene. In this regard, the present study utilized the mentioned properties of F gene in an attempt to produce a model of specific and more economic wider range granulovirus bio- pesticide able to infect both Spodoptera littoralis and Phthorimaea operculella larvae. Multiple sequence alignment and phylogeny analysis were performed on six members of group II baculovirus, novel universal PCR primers were manually designed from the conserved regions in the alignment graph, targeted to amplify species- specific sequence entire F gene open reading frame (ORF) which is useful in molecular identification of baculovirus in unknown samples. So, the PCR product of SpliGV used to prepare a specific probe for the F gene of this type of virus. Results reflected that it is possible to infect S. littoralis larvae by PhopGV if injected into larval haemocoel, the resulted virus of this infection showed by using DNA hybridization technique to be encode to F gene homologous with the F gene of Spli GV, which is revealed that the resulted virus acquired this F gene sequence from the host genome after infection. Consequently, these results may infer that if genetic aberrations occur in the host genome, this may affect in baculoviral infectivity. So, this study aimed to investigate the effect of gamma radiation at

  15. Accumulation of sulfonamide resistance genes in arable soils due to repeated application of manure containing sulfadiazine.

    Science.gov (United States)

    Heuer, Holger; Solehati, Qodiah; Zimmerling, Ute; Kleineidam, Kristina; Schloter, Michael; Müller, Tanja; Focks, Andreas; Thiele-Bruhn, Sören; Smalla, Kornelia

    2011-04-01

    Two soils were amended three times with pig manure. The abundance of sulfonamide resistance genes was determined by quantitative PCR 2 months after each application. In both soils treated with sulfadiazine-containing manure, the numbers of copies of sul1 and sul2 significantly increased compared to numbers after treatments with antibiotic-free manure or a control and accumulated with repeated applications.

  16. Rapid evolution of the sequences and gene repertoires of secreted proteins in bacteria.

    Directory of Open Access Journals (Sweden)

    Teresa Nogueira

    Full Text Available Proteins secreted to the extracellular environment or to the periphery of the cell envelope, the secretome, play essential roles in foraging, antagonistic and mutualistic interactions. We hypothesize that arms races, genetic conflicts and varying selective pressures should lead to the rapid change of sequences and gene repertoires of the secretome. The analysis of 42 bacterial pan-genomes shows that secreted, and especially extracellular proteins, are predominantly encoded in the accessory genome, i.e. among genes not ubiquitous within the clade. Genes encoding outer membrane proteins might engage more frequently in intra-chromosomal gene conversion because they are more often in multi-genic families. The gene sequences encoding the secretome evolve faster than the rest of the genome and in particular at non-synonymous positions. Cell wall proteins in Firmicutes evolve particularly fast when compared with outer membrane proteins of Proteobacteria. Virulence factors are over-represented in the secretome, notably in outer membrane proteins, but cell localization explains more of the variance in substitution rates and gene repertoires than sequence homology to known virulence factors. Accordingly, the repertoires and sequences of the genes encoding the secretome change fast in the clades of obligatory and facultative pathogens and also in the clades of mutualists and free-living bacteria. Our study shows that cell localization shapes genome evolution. In agreement with our hypothesis, the repertoires and the sequences of genes encoding secreted proteins evolve fast. The particularly rapid change of extracellular proteins suggests that these public goods are key players in bacterial adaptation.

  17. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    Science.gov (United States)

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  18. Discovery of rare protein-coding genes in model methylotroph Methylobacterium extorquens AM1.

    Science.gov (United States)

    Kumar, Dhirendra; Mondal, Anupam Kumar; Yadav, Amit Kumar; Dash, Debasis

    2014-12-01

    Proteogenomics involves the use of MS to refine annotation of protein-coding genes and discover genes in a genome. We carried out comprehensive proteogenomic analysis of Methylobacterium extorquens AM1 (ME-AM1) from publicly available proteomics data with a motive to improve annotation for methylotrophs; organisms capable of surviving in reduced carbon compounds such as methanol. Besides identifying 2482(50%) proteins, 29 new genes were discovered and 66 annotated gene models were revised in ME-AM1 genome. One such novel gene is identified with 75 peptides, lacks homolog in other methylobacteria but has glycosyl transferase and lipopolysaccharide biosynthesis protein domains, indicating its potential role in outer membrane synthesis. Many novel genes are present only in ME-AM1 among methylobacteria. Distant homologs of these genes in unrelated taxonomic classes and low GC-content of few genes suggest lateral gene transfer as a potential mode of their origin. Annotations of methylotrophy related genes were also improved by the discovery of a short gene in methylotrophy gene island and redefining a gene important for pyrroquinoline quinone synthesis, essential for methylotrophy. The combined use of proteogenomics and rigorous bioinformatics analysis greatly enhanced the annotation of protein-coding genes in model methylotroph ME-AM1 genome. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. General theory for integrated analysis of growth, gene, and protein expression in biofilms.

    Science.gov (United States)

    Zhang, Tianyu; Pabst, Breana; Klapper, Isaac; Stewart, Philip S

    2013-01-01

    A theory for analysis and prediction of spatial and temporal patterns of gene and protein expression within microbial biofilms is derived. The theory integrates phenomena of solute reaction and diffusion, microbial growth, mRNA or protein synthesis, biomass advection, and gene transcript or protein turnover. Case studies illustrate the capacity of the theory to simulate heterogeneous spatial patterns and predict microbial activities in biofilms that are qualitatively different from those of planktonic cells. Specific scenarios analyzed include an inducible GFP or fluorescent protein reporter, a denitrification gene repressed by oxygen, an acid stress response gene, and a quorum sensing circuit. It is shown that the patterns of activity revealed by inducible stable fluorescent proteins or reporter unstable proteins overestimate the region of activity. This is due to advective spreading and finite protein turnover rates. In the cases of a gene induced by either limitation for a metabolic substrate or accumulation of a metabolic product, maximal expression is predicted in an internal stratum of the biofilm. A quorum sensing system that includes an oxygen-responsive negative regulator exhibits behavior that is distinct from any stage of a batch planktonic culture. Though here the analyses have been limited to simultaneous interactions of up to two substrates and two genes, the framework applies to arbitrarily large networks of genes and metabolites. Extension of reaction-diffusion modeling in biofilms to the analysis of individual genes and gene networks is an important advance that dovetails with the growing toolkit of molecular and genetic experimental techniques.

  20. Genome-Wide Identification and Analysis of Genes Encoding PHD-Finger Protein in Tomato

    International Nuclear Information System (INIS)

    Hayat, S.; Cheng, Z.; Chen, X.

    2016-01-01

    The PHD-finger proteins are conserved in eukaryotic organisms and are involved in a variety of important functions in different biological processes in plants. However, the function of PHD fingers are poorly known in tomato (Solanum lycopersicum L.). In current study, we identified 45 putative genes coding Phd finger protein in tomato distributed on 11 chromosomes except for chromosome 8. Some of the genes encode other conserved key domains besides Phd-finger. Phylogenetic analysis of these 45 proteins resulted in seven clusters. Most Phd finger proteins were predicted to PML body location. These PHD-finger genes displayed differential expression either in various organs, at different development stages and under stresses in tomato. Our study provides the first systematic analysis of PHD-finger genes and proteins in tomato. This preliminary study provides a very useful reference information for Phd-finger proteins in tomato. They will be helpful for cloning and functional study of tomato PHD-finger genes. (author)

  1. Duplication and relocation of the functional DPY19L2 gene within low copy repeats

    Directory of Open Access Journals (Sweden)

    Cheung Joseph

    2006-03-01

    Full Text Available Abstract Background Low copy repeats (LCRs are thought to play an important role in recent gene evolution, especially when they facilitate gene duplications. Duplicate genes are fundamental to adaptive evolution, providing substrates for the development of new or shared gene functions. Moreover, silencing of duplicate genes can have an indirect effect on adaptive evolution by causing genomic relocation of functional genes. These changes are theorized to have been a major factor in speciation. Results Here we present a novel example showing functional gene relocation within a LCR. We characterize the genomic structure and gene content of eight related LCRs on human Chromosomes 7 and 12. Two members of a novel transmembrane gene family, DPY19L, were identified in these regions, along with six transcribed pseudogenes. One of these genes, DPY19L2, is found on Chromosome 12 and is not syntenic with its mouse orthologue. Instead, the human locus syntenic to mouse Dpy19l2 contains a pseudogene, DPY19L2P1. This indicates that the ancestral copy of this gene has been silenced, while the descendant copy has remained active. Thus, the functional copy of this gene has been relocated to a new genomic locus. We then describe the expansion and evolution of the DPY19L gene family from a single gene found in invertebrate animals. Ancient duplications have led to multiple homologues in different lineages, with three in fish, frogs and birds and four in mammals. Conclusion Our results show that the DPY19L family has expanded throughout the vertebrate lineage and has undergone recent primate-specific evolution within LCRs.

  2. Origins of gene, genetic code, protein and life

    Indian Academy of Sciences (India)

    Unknown

    have concluded that newly-born genes are products of nonstop frames (NSF) ... research to determine tertiary structures of proteins such ... the present earth, is favourable for new genes to arise, if ..... NGG) in the universal genetic code table, cannot satisfy ..... which has been proposed to explain the development of life on.

  3. Ethylene-induced senescence-related gene expression requires protein synthesis

    International Nuclear Information System (INIS)

    Lawton, K.A.; Raghothama, K.G.; Woodson, W.R.

    1990-01-01

    We have investigated the effects of inhibiting protein synthesis on the ethylene-induced expression of 3 carnation senescence-related genes, pSR5, pSR8, and pSR12. Treatment of preclimacteric carnation petal discs with 1μg/ml of cycloheximide, a cytoplasmic protein synthesis inhibitor, for 3h inhibited protein synthesis by >80% as quantitated by the incorporation of [35S]methionine into protein. Pre-treatment of petal discs with cycloheximide prevented ethylene-induced SR transcript accumulation. Cycloheximide treatment of petal discs held in air did not result in increased levels of SR mRNA. These results indicate that ethylene does not interact with pre-formed factors but rather that the activation of SR gene expression by ethylene is mediated by labile protein factor(s) synthesized on cytoplasmic ribosomes. Experiments are currently underway to determine if cycloheximide exerts its effect at the transcriptional or post-transcriptional level

  4. False positive reduction in protein-protein interaction predictions using gene ontology annotations

    Directory of Open Access Journals (Sweden)

    Lin Yen-Han

    2007-07-01

    Full Text Available Abstract Background Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Results Gene Ontology (GO annotations were used to reduce false positive protein-protein interactions (PPI pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Conclusion Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially

  5. Functional redundancy and/or ongoing pseudogenization among F-box protein genes expressed in Arabidopsis male gametophyte.

    Science.gov (United States)

    Ikram, Sobia; Durandet, Monique; Vesa, Simona; Pereira, Serge; Guerche, Philippe; Bonhomme, Sandrine

    2014-06-01

    F-box protein genes family is one of the largest gene families in plants, with almost 700 predicted genes in the model plant Arabidopsis. F-box proteins are key components of the ubiquitin proteasome system that allows targeted protein degradation. Transcriptome analyses indicate that half of these F-box protein genes are found expressed in microspore and/or pollen, i.e., during male gametogenesis. To assess the role of F-box protein genes during this crucial developmental step, we selected 34 F-box protein genes recorded as highly and specifically expressed in pollen and isolated corresponding insertion mutants. We checked the expression level of each selected gene by RT-PCR and confirmed pollen expression for 25 genes, but specific expression for only 10 of the 34 F-box protein genes. In addition, we tested the expression level of selected F-box protein genes in 24 mutant lines and showed that 11 of them were null mutants. Transmission analysis of the mutations to the progeny showed that none of the single mutations was gametophytic lethal. These unaffected transmission efficiencies suggested leaky mutations or functional redundancy among F-box protein genes. Cytological observation of the gametophytes in the mutants confirmed these results. Combinations of mutations in F-box protein genes from the same subfamily did not lead to transmission defect either, further highlighting functional redundancy and/or a high proportion of pseudogenes among these F-box protein genes.

  6. Androgen receptor gene CAG repeat polymorphism independently influences recovery of male sexual function after testosterone replacement therapy in postsurgical hypogonadotropic hypogonadism.

    Science.gov (United States)

    Tirabassi, Giacomo; Delli Muti, Nicola; Corona, Giovanni; Maggi, Mario; Balercia, Giancarlo

    2014-05-01

    Few and contradictory studies have evaluated the possible influence of androgen receptor (AR) gene CAG repeat polymorphism on male sexual function. In this study we evaluated the role of AR gene CAG repeat polymorphism in the recovery of sexual function after testosterone replacement therapy (TRT) in men affected by postsurgical hypogonadotropic hypogonadism, a condition which is often associated with hypopituitarism and in which the sexual benefits of TRT must be distinguished from those of pituitary-function replacement therapies. Fifteen men affected by postsurgical hypogonadotropic hypogonadism were retrospectively assessed before and after TRT. Main outcome measures included sexual parameters as assessed by the International Index of Erectile Function questionnaire, levels of pituitary dependent hormones (total testosterone, free T3, free T4, cortisol, insulin-like growth factor-1 [IGF-1], prolactin), and results of genetic analysis (AR gene CAG repeat number). Plasma concentrations of free T3, free T4, cortisol, and prolactin did not vary significantly between the two phases, while testosterone and IGF-1 increased significantly after TRT. A significant improvement in all sexual parameters studied was found. The number of CAG triplets was negatively and significantly correlated with changes in all the sexual parameters, while opposite correlations were found between changes in sexual parameters and changes in testosterone levels; no correlation of change in IGF1 with change in sexual parameters was reported. On multiple linear regression analysis, after correction for changes in testosterone, nearly all the associations between the number of CAG triplets and changes in sexual parameters were confirmed. Shorter length AR gene CAG repeat number is associated with the recovery of sexual function after TRT in postsurgical male hypogonadotropic hypogonadism, independently of the effects of concomitant pituitary-replacement therapies. © 2014 International Society

  7. Molecular characterization of the porcine surfactant, pulmonary-associated protein C gene

    DEFF Research Database (Denmark)

    Cirera, S.; Nygård, A.B.; Jensen, H.E.

    2006-01-01

    The surfactant, pulmonary-associated protein C (SFTPC) is a peptide secreted by the alveolar type II pneumocytes of the lung. We have characterized the porcine SFTPC gene at genomic, transcriptional, and protein levels. The porcine SFTPC is a single-copy gene on pig chromosome 14. Two transcripts...

  8. Gene, protein, and network of male sterility in rice

    OpenAIRE

    Wang, Kun; Peng, Xiaojue; Ji, Yanxiao; Yang, Pingfang; Zhu, Yingguo; Li, Shaoqing

    2013-01-01

    Rice is one of the most important model crop plants whose heterosis has been well-exploited in commercial hybrid seed production via a variety of types of male-sterile lines. Hybrid rice cultivation area is steadily expanding around the world, especially in Southern Asia. Characterization of genes and proteins related to male sterility aims to understand how and why the male sterility occurs, and which proteins are the key players for microspores abortion. Recently, a series of genes and prot...

  9. Shorter CAG repeat length in the AR gene is associated with poor outcome in head and neck cancer

    DEFF Research Database (Denmark)

    Rosa, Fabíola Encinas; dos Santos, Rodrigo Mattos; Poli-Frederico, Regina Célia

    2007-01-01

    microsatellite instability (MSI) and loss of heterozygosity (LOH) in these tumors. DESIGN: Matched samples of blood and head and neck tumors were evaluated using two methodologies, silver-stained gels to perform the analyses of MSI and LOH, and automated analysis to confirm these results and for genotyping...... of the AR [CAG](n) repeat length. Sixty-nine individuals without cancer were used as a control group for both procedures. The Log-rank test was used to compare overall survival and disease-free survival curves. The Cox proportional hazards regression models were performed to determine the [CAG](n) repeats......: These results suggest that short [CAG](n) repeat length (poor prognosis in a subset of male patients with head and neck cancer and that AR gene microsatellite instability is uncommon in these tumors....

  10. Simulating evolution of protein complexes through gene duplication and co-option.

    Science.gov (United States)

    Haarsma, Loren; Nelesen, Serita; VanAndel, Ethan; Lamine, James; VandeHaar, Peter

    2016-06-21

    We present a model of the evolution of protein complexes with novel functions through gene duplication, mutation, and co-option. Under a wide variety of input parameters, digital organisms evolve complexes of 2-5 bound proteins which have novel functions but whose component proteins are not independently functional. Evolution of complexes with novel functions happens more quickly as gene duplication rates increase, point mutation rates increase, protein complex functional probability increases, protein complex functional strength increases, and protein family size decreases. Evolution of complexity is inhibited when the metabolic costs of making proteins exceeds the fitness gain of having functional proteins, or when point mutation rates get so large the functional proteins undergo deleterious mutations faster than new functional complexes can evolve. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Cloning and characterization of an insecticidal crystal protein gene ...

    Indian Academy of Sciences (India)

    Unknown

    The sequence of the cloned crystal protein gene showed almost complete homology with a mosquitocidal toxin gene from Bacillus .... diet or by topical application on food substrates as .... has very high similarity (99.74%) at DNA level with.

  12. Sense-encoded poly-GR dipeptide repeat proteins correlate to neurodegeneration and uniquely co-localize with TDP-43 in dendrites of repeat expanded C9orf72 amyotrophic lateral sclerosis

    Science.gov (United States)

    Saberi, Shahram; Stauffer, Jennifer E.; Jiang, Jie; Garcia, Sandra Diaz; Taylor, Amy E; Schulte, Derek; Ohkubo, Takuya; Schloffman, Cheyenne L.; Maldonado, Marcus; Baughn, Michael; Rodriguez, Maria J; Pizzo, Don; Cleveland, Don; Ravits, John

    2018-01-01

    Hexanucleotide repeat expansions in C9orf72 are the most common genetic cause of amyotrophic lateral sclerosis (C9 ALS). The main hypothesized pathogenic mechanisms are C9orf72 haploinsufficiency and/or toxicity from one or more of bi-directionally transcribed repeat RNAs and their dipeptide repeat proteins (DPRs) poly-GP, poly-GA, poly-GR, poly-PR and poly-PA. Recently, nuclear import and/or export defects especially caused by arginine-containing poly-GR or poly-PR have been proposed as significant contributors to pathogenesis based on disease models. We quantitatively studied and compared DPRs, nuclear pore proteins and C9orf72 protein in clinically-related and clinically-unrelated regions of the central nervous system, and compared them to phosphorylated TDP-43 (pTDP-43), the hallmark protein of ALS. Of the five DPRs, only poly-GR was significantly abundant in clinically-related areas compared to unrelated areas (p<0.001), and formed dendritic-like aggregates in the motor cortex that co-localized with pTDP-43 (p<0.0001). While most poly-GR dendritic inclusions were pTDP-43-positive, only 4% of pTDP-43 dendritic inclusions were poly-GR-positive. Staining for arginine-containing poly-GR and poly-PR in nuclei of neurons produced signals that were not specific to C9 ALS. We could not detect significant differences of nuclear markers RanGap, Lamin B1, and Importin β1 in C9 ALS, although we observed subtle nuclear changes in ALS, both C9 and non-C9, compared to control. The C9orf72 protein itself was diffusely expressed in cytoplasm of large neurons and glia, and nearly 50% reduced, in both clinically-related frontal cortex and unrelated occipital cortex, but not in cerebellum. In summary, sense-encoded poly-GR DPR was unique, and localized to neurites and pTDP43 in motor regions of C9 ALS CNS. This is consistent with new emerging ideas about TDP-43 functions in dendrites. PMID:29196813

  13. Identification of novel type 1 diabetes candidate genes by integrating genome-wide association data, protein-protein interactions, and human pancreatic islet gene expression

    DEFF Research Database (Denmark)

    Bergholdt, Regine; Brorsson, Caroline; Palleja, Albert

    2012-01-01

    Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated with dis......-cells. Our results provide novel insight to the mechanisms behind type 1 diabetes pathogenesis and, thus, may provide the basis for the design of novel treatment strategies.......Genome-wide association studies (GWAS) have heralded a new era in susceptibility locus discovery in complex diseases. For type 1 diabetes, >40 susceptibility loci have been discovered. However, GWAS do not inevitably lead to identification of the gene or genes in a given locus associated...... with disease, and they do not typically inform the broader context in which the disease genes operate. Here, we integrated type 1 diabetes GWAS data with protein-protein interactions to construct biological networks of relevance for disease. A total of 17 networks were identified. To prioritize...

  14. One, Two, Three: Polycomb Proteins Hit All Dimensions of Gene Regulation

    Directory of Open Access Journals (Sweden)

    Stefania del Prete

    2015-07-01

    Full Text Available Polycomb group (PcG proteins contribute to the formation and maintenance of a specific repressive chromatin state that prevents the expression of genes in a particular space and time. Polycomb repressive complexes (PRCs consist of several PcG proteins with specific regulatory or catalytic properties. PRCs are recruited to thousands of target genes, and various recruitment factors, including DNA-binding proteins and non-coding RNAs, are involved in the targeting. PcG proteins contribute to a multitude of biological processes by altering chromatin features at different scales. PcG proteins mediate both biochemical modifications of histone tails and biophysical modifications (e.g., chromatin fiber compaction and three-dimensional (3D chromatin conformation. Here, we review the role of PcG proteins in nuclear architecture, describing their impact on the structure of the chromatin fiber, on chromatin interactions, and on the spatial organization of the genome in nuclei. Although little is known about the role of plant PcG proteins in nuclear organization, much is known in the animal field, and we highlight similarities and differences in the roles of PcG proteins in 3D gene regulation in plants and animals.

  15. The Expression of Genes Encoding Secreted Proteins in Medicago truncatula A17 Inoculated Roots

    Directory of Open Access Journals (Sweden)

    LUCIA KUSUMAWATI

    2013-09-01

    Full Text Available Subtilisin-like serine protease (MtSBT, serine carboxypeptidase (MtSCP, MtN5, non-specific lipid transfer protein (MtnsLTP, early nodulin2-like protein (MtENOD2-like, FAD-binding domain containing protein (MtFAD-BP1, and rhicadhesin receptor protein (MtRHRE1 were among 34 proteins found in the supernatant of M. truncatula 2HA and sickle cell suspension cultures. This study investigated the expression of genes encoding those proteins in roots and developing nodules. Two methods were used: quantitative real time RT-PCR and gene expression analysis (with promoter:GUS fusion in roots. Those proteins are predicted as secreted proteins which is indirectly supported by the findings that promoter:GUS fusions of six of the seven genes encoding secreted proteins were strongly expressed in the vascular bundle of transgenic hairy roots. All six genes have expressed in 14-day old nodule. The expression levels of the selected seven genes were quantified in Sinorhizobium-inoculated and control plants using quantitative real time RT-PCR. In conclusion, among seven genes encoding secreted proteins analyzed, the expression level of only one gene, MtN5, was up-regulated significantly in inoculated root segments compared to controls. The expression of MtSBT1, MtSCP1, MtnsLTP, MtFAD-BP1, MtRHRE1 and MtN5 were higher in root tip than in other tissues examined.

  16. Expression of circadian clock genes and proteins in urothelial cancer is related to cancer-associated genes

    International Nuclear Information System (INIS)

    Litlekalsoy, Jorunn; Rostad, Kari; Kalland, Karl-Henning; Hostmark, Jens G.; Laerum, Ole Didrik

    2016-01-01

    The purpose of this study was to evaluate invasive and metastatic potential of urothelial cancer by investigating differential expression of various clock genes/proteins participating in the 24 h circadian rhythms and to compare these gene expressions with transcription of other cancer-associated genes. Twenty seven paired samples of tumour and benign tissue collected from patients who underwent cystectomy were analysed and compared to 15 samples of normal bladder tissue taken from patients who underwent cystoscopy for benign prostate hyperplasia (unrelated donors). Immunohistochemical analyses were made for clock and clock-related proteins. In addition, the gene-expression levels of 22 genes (clock genes, casein kinases, oncogenes, tumour suppressor genes and cytokeratins) were analysed by real-time quantitative PCR (qPCR). Considerable up- or down-regulation and altered cellular distribution of different clock proteins, a reduction of casein kinase1A1 (CSNK1A1) and increase of casein kinase alpha 1 E (CSNK1E) were found. The pattern was significantly correlated with simultaneous up-regulation of stimulatory tumour markers, and a down-regulation of several suppressor genes. The pattern was mainly seen in aneuploid high-grade cancers. Considerable alterations were also found in the neighbouring bladder mucosa. The close correlation between altered expression of various clock genes and common tumour markers in urothelial cancer indicates that disturbed function in the cellular clock work may be an important additional mechanism contributing to cancer progression and malignant behaviour. The online version of this article (doi:10.1186/s12885-016-2580-y) contains supplementary material, which is available to authorized users

  17. Relationship between drug resistance and the clustered, regularly interspaced, short, palindromic repeat-associated protein genes cas1 and cas2 in Shigella from giant panda dung.

    Science.gov (United States)

    Ren, Lu; Deng, Lin-Hua; Zhang, Ri-Peng; Wang, Cheng-Dong; Li, De-Sheng; Xi, Li-Xin; Chen, Zhen-Rong; Yang, Rui; Huang, Jie; Zeng, Yang-Ru; Wu, Hong-Lin; Cao, San-Jie; Wu, Rui; Huang, Yong; Yan, Qi-Gui

    2017-02-01

    To detect drug resistance in Shigella obtained from the dung of the giant panda, explore the factors leading to drug resistance in Shigella, understand the characteristics of clustered, regularly interspaced, short, palindromic repeats (CRISPR), and assess the relationship between CRISPR and drug resistance. We collected fresh feces from 27 healthy giant pandas in the Giant Panda Conservation base (Wolong, China). We identified the strains of Shigella in the samples by using nucleotide sequence analysis. Further, the Kirby-Bauer paper method was used to determine drug sensitivity of the Shigella strains. CRISPR-associated protein genes cas1 and cas2 in Shigella were detected by polymerase chain reaction (PCR), and the PCR products were sequenced and compared. We isolated and identified 17 strains of Shigella from 27 samples, including 14 strains of Shigella flexneri, 2 strains of Shigella sonnei, and 1 strain of Shigella dysenteriae. Further, drug resistance to cefazolin, imipenem, and amoxicillin-clavulanic acid was identified as a serious problem, as multidrug-resistant strains were detected. Further, cas1 and cas2 showed different degrees of point mutations. The CRISPR system widely exists in Shigella and shares homology with that in Escherichia coli. The cas1 and cas 2 mutations contribute to the different levels of resistance. Point mutations at sites 3176455, 3176590, and 3176465 in cas1 (a); sites 3176989, 3176992, and 3176995 in cas1 (b); sites 3176156 and 3176236 in cas2 may affect the resistance of bacteria, cause emergence of multidrug resistance, and increase the types of drug resistance.

  18. Genes and proteins of Escherichia coli K-12.

    Science.gov (United States)

    Riley, M

    1998-01-01

    GenProtEC is a database of Escherichia coli genes and their gene products, classified by type of function and physiological role and with citations to the literature for each. Also present are data on sequence similarities among E.coli proteins, representing groups of paralogous genes, with PAM values, percent identity of amino acids, length of alignment and percent aligned. GenProtEC can be accessed at the URL http://www.mbl.edu/html/ecoli.html

  19. Radioresistance related genes screened by protein-protein interaction network analysis in nasopharyngeal carcinoma

    International Nuclear Information System (INIS)

    Zhu Xiaodong; Guo Ya; Qu Song; Li Ling; Huang Shiting; Li Danrong; Zhang Wei

    2012-01-01

    Objective: To discover radioresistance associated molecular biomarkers and its mechanism in nasopharyngeal carcinoma by protein-protein interaction network analysis. Methods: Whole genome expression microarray was applied to screen out differentially expressed genes in two cell lines CNE-2R and CNE-2 with different radiosensitivity. Four differentially expressed genes were randomly selected for further verification by the semi-quantitative RT-PCR analysis with self-designed primers. The common differentially expressed genes from two experiments were analyzed with the SNOW online database in order to find out the central node related to the biomarkers of nasopharyngeal carcinoma radioresistance. The expression of STAT1 in CNE-2R and CNE-2 cells was measured by Western blot. Results: Compared with CNE-2 cells, 374 genes in CNE-2R cells were differentially expressed while 197 genes showed significant differences. Four randomly selected differentially expressed genes were verified by RT-PCR and had same change trend in consistent with the results of chip assay. Analysis with the SNOW database demonstrated that those 197 genes could form a complicated interaction network where STAT1 and JUN might be two key nodes. Indeed, the STAT1-α expression in CNE-2R was higher than that in CNE-2 (t=4.96, P<0.05). Conclusions: The key nodes of STAT1 and JUN may be the molecular biomarkers leading to radioresistance in nasopharyngeal carcinoma, and STAT1-α might have close relationship with radioresistance. (authors)

  20. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.

    Science.gov (United States)

    Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M

    2013-05-29

    Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in

  1. Cyclin B1 Destruction Box-Mediated Protein Instability: The Enhanced Sensitivity of Fluorescent-Protein-Based Reporter Gene System

    Directory of Open Access Journals (Sweden)

    Chao-Hsun Yang

    2013-01-01

    Full Text Available The periodic expression and destruction of several cyclins are the most important steps for the exact regulation of cell cycle. Cyclins are degraded by the ubiquitin-proteasome system during cell cycle. Besides, a short sequence near the N-terminal of cyclin B called the destruction box (D-box; CDB is also required. Fluorescent-protein-based reporter gene system is insensitive to analysis because of the overly stable fluorescent proteins. Therefore, in this study, we use human CDB fused with both enhanced green fluorescent protein (EGFP at C-terminus and red fluorescent protein (RFP, DsRed at N-terminus in the transfected human melanoma cells to examine the effects of CDB on different fluorescent proteins. Our results indicated that CDB-fused fluorescent protein can be used to examine the slight gene regulations in the reporter gene system and have the potential to be the system for screening of functional compounds in the future.

  2. Challenges in biotechnology at LLNL: from genes to proteins; TOPICAL

    International Nuclear Information System (INIS)

    Albala, J S

    1999-01-01

    This effort has undertaken the task of developing a link between the genomics, DNA repair and structural biology efforts within the Biology and Biotechnology Research Program at LLNL. Through the advent of the I.M.A.G.E. (Integrated Molecular Analysis of Genomes and their Expression) Consortium, a world-wide effort to catalog the largest public collection of genes, accepted and maintained within BBRP, it is now possible to systematically express the protein complement of these to further elucidate novel gene function and structure. The work has ensued in four phases, outlined as follows: (1) Gene and System selection; (2) Protein expression and purification; (3) Structural analysis; and (4) biological integration. Proteins to be expressed have been those of high programmatic interest. This includes, in particular, proteins involved in the maintenance of genome integrity, particularly those involved in the repair of DNA damage, including ERCC1, ERCC4, XRCC2, XRCC3, XRCC9, HEX1, APN1, p53, RAD51B, RAD51C, and RAD51. Full-length cDNA cognates of selected genes were isolated, and cloned into baculovirus-based expression vectors. The baculoviral expression system for protein over-expression is now well-established in the Albala laboratory. Procedures have been successfully optimized for full-length cDNA clining into expression vectors for protein expression from recombinant constructs. This includes the reagents, cell lines, techniques necessary for expression of recombinant baculoviral constructs in Spodoptera frugiperda (Sf9) cells. The laboratory has also generated a high-throughput baculoviral expression paradigm for large scale expression and purification of human recombinant proteins amenable to automation

  3. Transcriptional profiling of protein expression related genes of Pichia pastoris under simulated microgravity.

    Directory of Open Access Journals (Sweden)

    Feng Qi

    Full Text Available The physiological responses and transcription profiling of Pichia pastoris GS115 to simulated microgravity (SMG were substantially changed compared with normal gravity (NG control. We previously reported that the recombinant P. pastoris grew faster under SMG than NG during methanol induction phase and the efficiencies of recombinant enzyme production and secretion were enhanced under SMG, which was considered as the consequence of changed transcriptional levels of some key genes. In this work, transcriptiome profiling of P. pastoris cultured under SMG and NG conditions at exponential and stationary phases were determined using next-generation sequencing (NGS technologies. Four categories of 141 genes function as methanol utilization, protein chaperone, RNA polymerase and protein transportation or secretion classified according to Gene Ontology (GO were chosen to be analyzed on the basis of NGS results. And 80 significantly changed genes were weighted and estimated by Cluster 3.0. It was found that most genes of methanol metabolism (85% of 20 genes and protein transportation or secretion (82.2% of 45 genes were significantly up-regulated under SMG. Furthermore the quantity and fold change of up-regulated genes in exponential phase of each category were higher than those of stationary phase. The results indicate that the up-regulated genes of methanol metabolism and protein transportation or secretion mainly contribute to enhanced production and secretion of the recombinant protein under SMG.

  4. Genetic variations in insulin-like growth factor binding protein acid labile subunit gene associated with growth traits in beef cattle (Bos taurus) in China.

    Science.gov (United States)

    Liu, Yu; Duan, Xiaoyan; Liu, Xiaolin; Guo, Jiazhong; Wang, Hongliang; Li, Zhixiong; Yang, Jing

    2014-05-01

    The insulin-like growth factor binding protein acid labile subunit (IGFALS) gene encodes a serum protein that binds to IGFs and regulates growth, development, and other physiological processes. We have found that sequencing of the IGFALS gene in Chinese Qinchuan beef cattle (n=300) revealed four SNP loci in exon two of the gene (g1219: T>C, g1893: T>C, g2612: G>A, and g2696: A>G). The SNP g2696: A>G resulted in a change from asparagine to aspartic acid (p. N574D) in the leucine-rich repeat region in the carboxyl-terminal domain of IGFALS. Four SNPs were in low linkage disequilibrium, and 12 different haplotypes were identified in the population. Association analysis suggested that SNP g1219: T>C had a significant association with hip width (PG displayed a significant association with stature (Pgrowth traits of bovine, and may serve as a genetic marker for selection of beef cattle for growth traits, including stature. Copyright © 2014 Elsevier B.V. All rights reserved.

  5. Microsatellite instability at a tetranucleotide repeat in type I endometrial carcinoma

    Directory of Open Access Journals (Sweden)

    Choi Ho

    2008-12-01

    Full Text Available Abstract Background Microsatellite instability (MSI at tri- or tetranucleotide repeat markers (elevated microsatellite alterations at selected tetranucleotide repeat, EMAST has been recently described. But, the underlying genetic mechanism of EMAST is unclear. This study was to investigate the prevalence of EMAST, in type I endometrial carcinoma, and to determine the correlation between the MSI status and mismatch repair genes (MMR or p53. Methods We examined the 3 mono-, 3 di-, and 6 tetranucleotide repeat markers by PCR in 39 cases of type I endometrial carcinoma and performed the immunohistochemistry of hMSH2, hMLH1, and p53 protein. Results More than two MSI at mono- and dinucleotide repeat markers was noted in 8 cases (MSI-H, 20.5%. MSI, at a tetranucleotide repeat, was detected in 15 cases (EMAST, 38.5%. In remaining 16 cases, any MSI was not observed. (MSS, 42.1%, MSI status was not associated with FIGO stage, grade or depth of invasion. The absence of expression of either one of both hMSH2 or hMLH1 was noted in seven (87.5% of eight MSI-H tumors, one (6.3% of 16 MSS tumors, and five (33.3% of 15 EMAST tumors. (p = 0.010 The expression of p53 protein was found in one (12.5% of eight MSI-H tumors, five (31.3% of 16 MSS tumors, and seven of 15 EMAST tumors. (p = 0.247 Conclusion Our results showed that about 38.5% of type I endometrial carcinomas exhibited EMAST, and that EMAST was rarely associated with alteration of hMSH2 or hMLH1.

  6. Expansion of GA Dinucleotide Repeats Increases the Density of CLAMP Binding Sites on the X-Chromosome to Promote Drosophila Dosage Compensation.

    Directory of Open Access Journals (Sweden)

    Guray Kuzu

    2016-07-01

    Full Text Available Dosage compensation is an essential process that equalizes transcript levels of X-linked genes between sexes by forming a domain of coordinated gene expression. Throughout the evolution of Diptera, many different X-chromosomes acquired the ability to be dosage compensated. Once each newly evolved X-chromosome is targeted for dosage compensation in XY males, its active genes are upregulated two-fold to equalize gene expression with XX females. In Drosophila melanogaster, the CLAMP zinc finger protein links the dosage compensation complex to the X-chromosome. However, the mechanism for X-chromosome identification has remained unknown. Here, we combine biochemical, genomic and evolutionary approaches to reveal that expansion of GA-dinucleotide repeats likely accumulated on the X-chromosome over evolutionary time to increase the density of CLAMP binding sites, thereby driving the evolution of dosage compensation. Overall, we present new insight into how subtle changes in genomic architecture, such as expansions of a simple sequence repeat, promote the evolution of coordinated gene expression.

  7. Functional evolution of ADAMTS genes: Evidence from analyses of phylogeny and gene organization

    Directory of Open Access Journals (Sweden)

    Van Meir Erwin G

    2005-02-01

    Full Text Available Abstract Background The ADAMTS (A Disintegrin-like and Metalloprotease with Thrombospondin motifs proteins are a family of metalloproteases with sequence similarity to the ADAM proteases, that contain the thrombospondin type 1 sequence repeat motifs (TSRs common to extracellular matrix proteins. ADAMTS proteins have recently gained attention with the discovery of their role in a variety of diseases, including tissue and blood disorders, cancer, osteoarthritis, Alzheimer's and the genetic syndromes Weill-Marchesani syndrome (ADAMTS10, thrombotic thrombocytopenic purpura (ADAMTS13, and Ehlers-Danlos syndrome type VIIC (ADAMTS2 in humans and belted white-spotting mutation in mice (ADAMTS20. Results Phylogenetic analysis and comparison of the exon/intron organization of vertebrate (Homo, Mus, Fugu, chordate (Ciona and invertebrate (Drosophila and Caenorhabditis ADAMTS homologs has elucidated the evolutionary relationships of this important gene family, which comprises 19 members in humans. Conclusions The evolutionary history of ADAMTS genes in vertebrate genomes has been marked by rampant gene duplication, including a retrotransposition that gave rise to a distinct ADAMTS subfamily (ADAMTS1, -4, -5, -8, -15 that may have distinct aggrecanase and angiogenesis functions.

  8. A pomegranate (Punica granatum L.) WD40-repeat gene is a functional homologue of Arabidopsis TTG1 and is involved in the regulation of anthocyanin biosynthesis during pomegranate fruit development.

    Science.gov (United States)

    Ben-Simhon, Zohar; Judeinstein, Sylvie; Nadler-Hassar, Talia; Trainin, Taly; Bar-Ya'akov, Irit; Borochov-Neori, Hamutal; Holland, Doron

    2011-11-01

    Anthocyanins are the major pigments responsible for the pomegranate (Punica granatum L.) fruit skin color. The high variability in fruit external color in pomegranate cultivars reflects variations in anthocyanin composition. To identify genes involved in the regulation of anthocyanin biosynthesis pathway in the pomegranate fruit skin we have isolated, expressed and characterized the pomegranate homologue of the Arabidopsis thaliana TRANSPARENT TESTA GLABRA1 (TTG1), encoding a WD40-repeat protein. The TTG1 protein is a regulator of anthocyanins and proanthocyanidins (PAs) biosynthesis in Arabidopsis, and acts by the formation of a transcriptional regulatory complex with two other regulatory proteins: bHLH and MYB. Our results reveal that the pomegranate gene, designated PgWD40, recovered the anthocyanin, PAs, trichome and seed coat mucilage phenotype in Arabidopsis ttg1 mutant. PgWD40 expression and anthocyanin composition in the skin were analyzed during pomegranate fruit development, in two accessions that differ in skin color intensity and timing of appearance. The results indicate high positive correlation between the total cyanidin derivatives quantity (red pigments) and the expression level of PgWD40. Furthermore, strong correlation was found between the steady state levels of PgWD40 transcripts and the transcripts of pomegranate homologues of the structural genes PgDFR and PgLDOX. PgWD40, PgDFR and PgLDOX expression also correlated with the expression of pomegranate homologues of the regulatory genes PgAn1 (bHLH) and PgAn2 (MYB). On the basis of our results we propose that PgWD40 is involved in the regulation of anthocyanin biosynthesis during pomegranate fruit development and that expression of PgWD40, PgAn1 and PgAn2 in the pomegranate fruit skin is required to regulate the expression of downstream structural genes involved in the anthocyanin biosynthesis.

  9. Ribosomal protein gene knockdown causes developmental defects in zebrafish.

    Directory of Open Access Journals (Sweden)

    Tamayo Uechi

    Full Text Available The ribosomal proteins (RPs form the majority of cellular proteins and are mandatory for cellular growth. RP genes have been linked, either directly or indirectly, to various diseases in humans. Mutations in RP genes are also associated with tissue-specific phenotypes, suggesting a possible role in organ development during early embryogenesis. However, it is not yet known how mutations in a particular RP gene result in specific cellular changes, or how RP genes might contribute to human diseases. The development of animal models with defects in RP genes will be essential for studying these questions. In this study, we knocked down 21 RP genes in zebrafish by using morpholino antisense oligos to inhibit their translation. Of these 21, knockdown of 19 RPs resulted in the development of morphants with obvious deformities. Although mutations in RP genes, like other housekeeping genes, would be expected to result in nonspecific developmental defects with widespread phenotypes, we found that knockdown of some RP genes resulted in phenotypes specific to each gene, with varying degrees of abnormality in the brain, body trunk, eyes, and ears at about 25 hours post fertilization. We focused further on the organogenesis of the brain. Each knocked-down gene that affected the morphogenesis of the brain produced a different pattern of abnormality. Among the 7 RP genes whose knockdown produced severe brain phenotypes, 3 human orthologs are located within chromosomal regions that have been linked to brain-associated diseases, suggesting a possible involvement of RP genes in brain or neurological diseases. The RP gene knockdown system developed in this study could be a powerful tool for studying the roles of ribosomes in human diseases.

  10. The SpTransformer Gene Family (Formerly Sp185/333) in the Purple Sea Urchin and the Functional Diversity of the Anti-Pathogen rSpTransformer-E1 Protein

    Science.gov (United States)

    Smith, L. Courtney; Lun, Cheng Man

    2017-01-01

    The complex innate immune system of sea urchins is underpinned by several multigene families including the SpTransformer family (SpTrf; formerly Sp185/333) with estimates of ~50 members, although the family size is likely variable among individuals of Strongylocentrotus purpuratus. The genes are small with similar structure, are tightly clustered, and have several types of repeats in the second of two exons and that surround each gene. The density of repeats suggests that the genes are positioned within regions of genomic instability, which may be required to drive sequence diversification. The second exon encodes the mature protein and is composed of blocks of sequence called elements that are present in mosaics of defined element patterns and are the major source of sequence diversity. The SpTrf genes respond swiftly to immune challenge, but only a single gene is expressed per phagocyte. Many of the mRNAs appear to be edited and encode proteins with altered and/or missense sequence that are often truncated, of which some may be functional. The standard SpTrf protein structure is an N-terminal glycine-rich region, a central RGD motif, a histidine-rich region, and a C-terminal region. Function is predicted from a recombinant protein, rSpTransformer-E1 (rSpTrf-E1), which binds to Vibrio and Saccharomyces, but not to Bacillus, and binds tightly to lipopolysaccharide, β-1,3-glucan, and flagellin, but not to peptidoglycan. rSpTrf-E1 is intrinsically disordered but transforms to α helical structure in the presence of binding targets including lipopolysaccharide, which may underpin the characteristics of binding to multiple targets. SpTrf proteins associate with coelomocyte membranes, and rSpTrf-E1 binds specifically to phosphatidic acid (PA). When rSpTrf-E1 is bound to PA in liposome membranes, it induces morphological changes in liposomes that correlate with PA clustering and leakage of luminal contents, and it extracts or removes PA from the bilayer. The

  11. Comparative study of human mitochondrial proteome reveals extensive protein subcellular relocalization after gene duplications

    Directory of Open Access Journals (Sweden)

    Huang Yong

    2009-11-01

    Full Text Available Abstract Background Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families. Results The 456 human mitochondrial proteins selected for this study were clustered into 305 gene families including 92 multigene families. Among the multigene families, 59 (64% consisted of both mitochondrial and cytosolic (non-mitochondrial proteins (mt-cy families while the remaining 33 (36% were composed of mitochondrial proteins (mt-mt families. Phylogenetic analyses of mt-cy families revealed three different scenarios of their neolocalization following gene duplication: 1 relocalization from mitochondria to cytosol, 2 from cytosol to mitochondria and 3 multiple subcellular relocalizations. The neolocalizations were most commonly enabled by the gain or loss of N-terminal mitochondrial targeting signals. The majority of detected subcellular relocalization events occurred early in animal evolution, preceding the evolution of tetrapods. Mt-mt protein families showed a somewhat different pattern, where gene duplication occurred more evenly in time. However, for both types of protein families, most duplication events appear to roughly coincide with two rounds of genome duplications early in vertebrate evolution. Finally, we evaluated the effects of inaccurate and incomplete annotation of mitochondrial proteins and found that our conclusion of the importance of subcellular relocalization after gene duplication on

  12. Nucleotide sequence of soybean chloroplast DNA regions which contain the psb A and trn H genes and cover the ends of the large single copy region and one end of the inverted repeats.

    Science.gov (United States)

    Spielmann, A; Stutz, E

    1983-10-25

    The soybean chloroplast psb A gene (photosystem II thylakoid membrane protein of Mr 32 000, lysine-free) and the trn H gene (tRNAHisGUG), which both map in the large single copy region adjacent to one of the inverted repeat structures (IR1), have been sequenced including flanking regions. The psb A gene shows in its structural part 92% sequence homology with the corresponding genes of spinach and N. debneyi and contains also an open reading frame for 353 aminoacids. The aminoacid sequence of a potential primary translation product (calculated Mr, 38 904, no lysine) diverges from that of spinach and N. debneyi in only two positions in the C-terminal part. The trn H gene has the same polarity as the psb A gene and the coding region is located at the very end of the large single copy region. The deduced sequence of the soybean chloroplast tRNAHisGUG is identical with that of Zea mays chloroplasts. Both ends of the large single copy region were sequenced including a small segment of the adjacent IR1 and IR2.

  13. Leucine-rich repeat-containing G protein-coupled receptor 4 facilitates vesicular stomatitis virus infection by binding vesicular stomatitis virus glycoprotein.

    Science.gov (United States)

    Zhang, Na; Huang, Hongjun; Tan, Binghe; Wei, Yinglei; Xiong, Qingqing; Yan, Yan; Hou, Lili; Wu, Nannan; Siwko, Stefan; Cimarelli, Andrea; Xu, Jianrong; Han, Honghui; Qian, Min; Liu, Mingyao; Du, Bing

    2017-10-06

    Vesicular stomatitis virus (VSV) and rabies and Chandipura viruses belong to the Rhabdovirus family. VSV is a common laboratory virus to study viral evolution and host immune responses to viral infection, and recombinant VSV-based vectors have been widely used for viral oncolysis, vaccination, and gene therapy. Although the tropism of VSV is broad, and its envelope glycoprotein G is often used for pseudotyping other viruses, the host cellular components involved in VSV infection remain unclear. Here, we demonstrate that the host protein leucine-rich repeat-containing G protein-coupled receptor 4 (Lgr4) is essential for VSV and VSV-G pseudotyped lentivirus (VSVG-LV) to infect susceptible cells. Accordingly, Lgr4-deficient mice had dramatically decreased VSV levels in the olfactory bulb. Furthermore, Lgr4 knockdown in RAW 264.7 cells also significantly suppressed VSV infection, and Lgr4 overexpression in RAW 264.7 cells enhanced VSV infection. Interestingly, only VSV infection relied on Lgr4, whereas infections with Newcastle disease virus, influenza A virus (A/WSN/33), and herpes simplex virus were unaffected by Lgr4 status. Of note, assays of virus entry, cell ELISA, immunoprecipitation, and surface plasmon resonance indicated that VSV bound susceptible cells via the Lgr4 extracellular domain. Pretreating cells with an Lgr4 antibody, soluble LGR4 extracellular domain, or R-spondin 1 blocked VSV infection by competitively inhibiting VSV binding to Lgr4. Taken together, the identification of Lgr4 as a VSV-specific host factor provides important insights into understanding VSV entry and its pathogenesis and lays the foundation for VSV-based gene therapy and viral oncolytic therapeutics. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  14. A guild of 45 CRISPR-associated (Cas protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes.

    Directory of Open Access Journals (Sweden)

    Daniel H Haft

    2005-11-01

    Full Text Available Clustered regularly interspaced short palindromic repeats (CRISPRs are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21-37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas protein families, designated Cas1 to Cas4, are strictly associated with CRISPR elements and always occur near a repeat cluster. Some spacers originate from mobile genetic elements and are thought to confer "immunity" against the elements that harbor these sequences. In the present study, we have systematically investigated uncharacterized proteins encoded in the vicinity of these CRISPRs and found many additional protein families that are strictly associated with CRISPR loci across multiple prokaryotic species. Multiple sequence alignments and hidden Markov models have been built for 45 Cas protein families. These models identify family members with high sensitivity and selectivity and classify key regulators of development, DevR and DevS, in Myxococcus xanthus as Cas proteins. These identifications show that CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a repeat cluster or filling the region between two repeat clusters. Distinctive subsets of the collection of Cas proteins recur in phylogenetically distant species and correlate with characteristic repeat periodicity. The analyses presented here support initial proposals of mobility of these units, along with the likelihood that loci of different subtypes interact with one another as well as with host cell defensive, replicative, and regulatory systems. It is evident from this analysis that CRISPR/cas loci are larger, more complex, and more heterogeneous than previously appreciated.

  15. Hypolipidemic effect of dietary pea proteins: Impact on genes regulating hepatic lipid metabolism.

    Science.gov (United States)

    Rigamonti, Elena; Parolini, Cinzia; Marchesi, Marta; Diani, Erika; Brambilla, Stefano; Sirtori, Cesare R; Chiesa, Giulia

    2010-05-01

    Controversial data on the lipid-lowering effect of dietary pea proteins have been provided and the mechanisms behind this effect are not completely understood. The aim of the study was to evaluate a possible hypolipidemic activity of a pea protein isolate and to determine whether pea proteins could affect the hepatic lipid metabolism through regulation of genes involved in cholesterol and fatty acid homeostasis. Rats were fed Nath's hypercholesterolemic diets for 28 days, the protein sources being casein or a pea protein isolate from Pisum sativum. After 14 and 28 days of dietary treatment, rats fed pea proteins had markedly lower plasma cholesterol and triglyceride levels than rats fed casein (pPea protein-fed rats displayed higher hepatic mRNA levels of LDL receptor versus those fed casein (ppea protein-fed rats than in rats fed casein (ppea proteins in rats. Moreover, pea proteins appear to affect cellular lipid homeostasis by upregulating genes involved in hepatic cholesterol uptake and by downregulating fatty acid synthesis genes.

  16. Secretion Trap Tagging of Secreted and Membrane-Spanning Proteins Using Arabidopsis Gene Traps

    Science.gov (United States)

    Andrew T. Groover; Joseph R. Fontana; Juana M. Arroyo; Cristina Yordan; W. Richard McCombie; Robert A. Martienssen

    2003-01-01

    Secreted and membrane-spanning proteins play fundamental roles in plant development but pose challenges for genetic identification and characterization. We describe a "secretion trap" screen for gene trap insertions in genes encoding proteins routed through the secretory pathway. The gene trap transposon encodes a ß-glucuronidase reporter enzyme...

  17. Knock-in of Enhanced Green Fluorescent Protein or/and Human Fibroblast Growth Factor 2 Gene into β-Casein Gene Locus in the Porcine Fibroblasts to Produce Therapeutic Protein.

    Science.gov (United States)

    Lee, Sang Mi; Kim, Ji Woo; Jeong, Young-Hee; Kim, Se Eun; Kim, Yeong Ji; Moon, Seung Ju; Lee, Ji-Hye; Kim, Keun-Jung; Kim, Min-Kyu; Kang, Man-Jong

    2014-11-01

    Transgenic animals have become important tools for the production of therapeutic proteins in the domestic animal. Production efficiencies of transgenic animals by conventional methods as microinjection and retrovirus vector methods are low, and the foreign gene expression levels are also low because of their random integration in the host genome. In this study, we investigated the homologous recombination on the porcine β-casein gene locus using a knock-in vector for the β-casein gene locus. We developed the knock-in vector on the porcine β-casein gene locus and isolated knock-in fibroblast for nuclear transfer. The knock-in vector consisted of the neomycin resistance gene (neo) as a positive selectable marker gene, diphtheria toxin-A gene as negative selection marker, and 5' arm and 3' arm from the porcine β-casein gene. The secretion of enhanced green fluorescent protein (EGFP) was more easily detected in the cell culture media than it was by western blot analysis of cell extract of the HC11 mouse mammary epithelial cells transfected with EGFP knock-in vector. These results indicated that a knock-in system using β-casein gene induced high expression of transgene by the gene regulatory sequence of endogenous β-casein gene. These fibroblasts may be used to produce transgenic pigs for the production of therapeutic proteins via the mammary glands.

  18. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Directory of Open Access Journals (Sweden)

    Surendra K Prajapati

    Full Text Available The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75 from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years and long-term population history (79,235 to 104,008 of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes to that inferred from mitochondrial genome diversity.

  19. Neutral polymorphisms in putative housekeeping genes and tandem repeats unravels the population genetics and evolutionary history of Plasmodium vivax in India.

    Science.gov (United States)

    Prajapati, Surendra K; Joshi, Hema; Carlton, Jane M; Rizvi, M Alam

    2013-01-01

    The evolutionary history and age of Plasmodium vivax has been inferred as both recent and ancient by several studies, mainly using mitochondrial genome diversity. Here we address the age of P. vivax on the Indian subcontinent using selectively neutral housekeeping genes and tandem repeat loci. Analysis of ten housekeeping genes revealed a substantial number of SNPs (n = 75) from 100 P. vivax isolates collected from five geographical regions of India. Neutrality tests showed a majority of the housekeeping genes were selectively neutral, confirming the suitability of housekeeping genes for inferring the evolutionary history of P. vivax. In addition, a genetic differentiation test using housekeeping gene polymorphism data showed a lack of geographical structuring between the five regions of India. The coalescence analysis of the time to the most recent common ancestor estimate yielded an ancient TMRCA (232,228 to 303,030 years) and long-term population history (79,235 to 104,008) of extant P. vivax on the Indian subcontinent. Analysis of 18 tandem repeat loci polymorphisms showed substantial allelic diversity and heterozygosity per locus, and analysis of potential bottlenecks revealed the signature of a stable P. vivax population, further corroborating our ancient age estimates. For the first time we report a comparable evolutionary history of P. vivax inferred by nuclear genetic markers (putative housekeeping genes) to that inferred from mitochondrial genome diversity.

  20. Accumulation of Sulfonamide Resistance Genes in Arable Soils Due to Repeated Application of Manure Containing Sulfadiazine ▿

    OpenAIRE

    Heuer, Holger; Solehati, Qodiah; Zimmerling, Ute; Kleineidam, Kristina; Schloter, Michael; Müller, Tanja; Focks, Andreas; Thiele-Bruhn, Sören; Smalla, Kornelia

    2011-01-01

    Two soils were amended three times with pig manure. The abundance of sulfonamide resistance genes was determined by quantitative PCR 2 months after each application. In both soils treated with sulfadiazine-containing manure, the numbers of copies of sul1 and sul2 significantly increased compared to numbers after treatments with antibiotic-free manure or a control and accumulated with repeated applications.

  1. Differentially expressed genes in iron-induced prion protein conversion

    International Nuclear Information System (INIS)

    Kim, Minsun; Kim, Eun-hee; Choi, Bo-Ran; Woo, Hee-Jong

    2016-01-01

    The conversion of the cellular prion protein (PrP C ) to the protease-resistant isoform is the key event in chronic neurodegenerative diseases, including transmissible spongiform encephalopathies (TSEs). Increased iron in prion-related disease has been observed due to the prion protein-ferritin complex. Additionally, the accumulation and conversion of recombinant PrP (rPrP) is specifically derived from Fe(III) but not Fe(II). Fe(III)-mediated PK-resistant PrP (PrP res ) conversion occurs within a complex cellular environment rather than via direct contact between rPrP and Fe(III). In this study, differentially expressed genes correlated with prion degeneration by Fe(III) were identified using Affymetrix microarrays. Following Fe(III) treatment, 97 genes were differentially expressed, including 85 upregulated genes and 12 downregulated genes (≥1.5-fold change in expression). However, Fe(II) treatment produced moderate alterations in gene expression without inducing dramatic alterations in gene expression profiles. Moreover, functional grouping of identified genes indicated that the differentially regulated genes were highly associated with cell growth, cell maintenance, and intra- and extracellular transport. These findings showed that Fe(III) may influence the expression of genes involved in PrP folding by redox mechanisms. The identification of genes with altered expression patterns in neural cells may provide insights into PrP conversion mechanisms during the development and progression of prion-related diseases. - Highlights: • Differential genes correlated with prion degeneration by Fe(III) were identified. • Genes were identified in cell proliferation and intra- and extracellular transport. • In PrP degeneration, redox related genes were suggested. • Cbr2, Rsad2, Slc40a1, Amph and Mvd were expressed significantly.

  2. Pathogenic Leptospira species express surface-exposed proteins belonging to the bacterial immunoglobulin superfamily

    Science.gov (United States)

    Matsunaga, James; Barocchi, Michele A.; Croda, Julio; Young, Tracy A.; Sanchez, Yolanda; Siqueira, Isadora; Bolin, Carole A.; Reis, Mitermayer G.; Riley, Lee W.; Haake, David A.; Ko, Albert I.

    2005-01-01

    Summary Proteins with bacterial immunoglobulin-like (Big) domains, such as the Yersinia pseudotuberculosis invasin and Escherichia coli intimin, are surface-expressed proteins that mediate host mammalian cell invasion or attachment. Here, we report the identification and characterization of a new family of Big domain proteins, referred to as Lig (leptospiral Ig-like) proteins, in pathogenic Leptospira. Screening of L. interrogans and L. kirschneri expression libraries with sera from leptospirosis patients identified 13 lambda phage clones that encode tandem repeats of the 90 amino acid Big domain. Two lig genes, designated ligA and ligB, and one pseudo-gene, ligC, were identified. The ligA and ligB genes encode amino-terminal lipoprotein signal peptides followed by 10 or 11 Big domain repeats and, in the case of ligB, a unique carboxy-terminal non-repeat domain. The organization of ligC is similar to that of ligB but contains mutations that disrupt the reading frame. The lig sequences are present in pathogenic but not saprophytic Leptospira species. LigA and LigB are expressed by a variety of virulent leptospiral strains. Loss of Lig protein and RNA transcript expression is correlated with the observed loss of virulence during culture attenuation of pathogenic strains. High-pressure freeze substitution followed by immunocytochemical electron microscopy confirmed that the Lig proteins were localized to the bacterial surface. Immunoblot studies with patient sera found that the Lig proteins are a major antigen recognized during the acute host infection. These observations demonstrate that the Lig proteins are a newly identified surface protein of pathogenic Leptospira, which by analogy to other bacterial immunoglobulin superfamily virulence factors, may play a role in host cell attachment and invasion during leptospiral pathogenesis. PMID:12890019

  3. High-temperature protein G is essential for activity of the Escherichia coli clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system.

    Science.gov (United States)

    Yosef, Ido; Goren, Moran G; Kiro, Ruth; Edgar, Rotem; Qimron, Udi

    2011-12-13

    Prokaryotic DNA arrays arranged as clustered regularly interspaced short palindromic repeats (CRISPR), along with their associated proteins, provide prokaryotes with adaptive immunity by RNA-mediated targeting of alien DNA or RNA matching the sequences between the repeats. Here, we present a thorough screening system for the identification of bacterial proteins participating in immunity conferred by the Escherichia coli CRISPR system. We describe the identification of one such protein, high-temperature protein G (HtpG), a homolog of the eukaryotic chaperone heat-shock protein 90. We demonstrate that in the absence of htpG, the E. coli CRISPR system loses its suicidal activity against λ prophage and its ability to provide immunity from lysogenization. Transcomplementation of htpG restores CRISPR activity. We further show that inactivity of the CRISPR system attributable to htpG deficiency can be suppressed by expression of Cas3, a protein that is essential for its activity. Accordingly, we also find that the steady-state level of overexpressed Cas3 is significantly enhanced following HtpG expression. We conclude that HtpG is a newly identified positive modulator of the CRISPR system that is essential for maintaining functional levels of Cas3.

  4. Cocaine-induced behavioral sensitization decreases the expression of endocannabinoid signaling-related proteins in the mouse hippocampus.

    Science.gov (United States)

    Blanco, Eduardo; Galeano, Pablo; Palomino, Ana; Pavón, Francisco J; Rivera, Patricia; Serrano, Antonia; Alen, Francisco; Rubio, Leticia; Vargas, Antonio; Castilla-Ortega, Estela; Decara, Juan; Bilbao, Ainhoa; de Fonseca, Fernando Rodríguez; Suárez, Juan

    2016-03-01

    In the reward mesocorticolimbic circuits, the glutamatergic and endocannabinoid systems are implicated in neurobiological mechanisms underlying cocaine addiction. However, the involvement of both systems in the hippocampus, a critical region to process relational information relevant for encoding drug-associated memories, in cocaine-related behaviors remains unknown. In the present work, we studied whether the hippocampal gene/protein expression of relevant glutamate signaling components, including glutamate-synthesizing enzymes and metabotropic and ionotropic receptors, and the hippocampal gene/protein expression of cannabinoid type 1 (CB1) receptor and endocannabinoid metabolic enzymes were altered following acute and/or repeated cocaine administration resulting in conditioned locomotion and locomotor sensitization. Results showed that acute cocaine administration induced an overall down-regulation of glutamate-related gene expression and, specifically, a low phosphorylation level of GluA1. In contrast, locomotor sensitization to cocaine produced an up-regulation of several glutamate receptor-related genes and, specifically, an increased protein expression of the GluN1 receptor subunit. Regarding the endocannabinoid system, acute and repeated cocaine administration were associated with an increased gene/protein expression of CB1 receptors and a decreased gene/protein expression of the endocannabinoid-synthesis enzymes N-acyl phosphatidylethanolamine D (NAPE-PLD) and diacylglycerol lipase alpha (DAGLα). These changes resulted in an overall decrease in endocannabinoid synthesis/degradation ratios, especially NAPE-PLD/fatty acid amide hydrolase and DAGLα/monoacylglycerol lipase, suggesting a reduced endocannabinoid production associated with a compensatory up-regulation of CB1 receptor. Overall, these findings suggest that repeated cocaine administration resulting in locomotor sensitization induces a down-regulation of the endocannabinoid signaling that could

  5. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes.

    Science.gov (United States)

    Wiersma, Anje C; Leegwater, Peter Aj; van Oost, Bernard A; Ollier, William E; Dukes-McEwan, Joanna

    2007-10-19

    Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. alpha-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan delta, titin cap, alpha-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  6. A new essential protein discovery method based on the integration of protein-protein interaction and gene expression data

    Directory of Open Access Journals (Sweden)

    Li Min

    2012-03-01

    Full Text Available Abstract Background Identification of essential proteins is always a challenging task since it requires experimental approaches that are time-consuming and laborious. With the advances in high throughput technologies, a large number of protein-protein interactions are available, which have produced unprecedented opportunities for detecting proteins' essentialities from the network level. There have been a series of computational approaches proposed for predicting essential proteins based on network topologies. However, the network topology-based centrality measures are very sensitive to the robustness of network. Therefore, a new robust essential protein discovery method would be of great value. Results In this paper, we propose a new centrality measure, named PeC, based on the integration of protein-protein interaction and gene expression data. The performance of PeC is validated based on the protein-protein interaction network of Saccharomyces cerevisiae. The experimental results show that the predicted precision of PeC clearly exceeds that of the other fifteen previously proposed centrality measures: Degree Centrality (DC, Betweenness Centrality (BC, Closeness Centrality (CC, Subgraph Centrality (SC, Eigenvector Centrality (EC, Information Centrality (IC, Bottle Neck (BN, Density of Maximum Neighborhood Component (DMNC, Local Average Connectivity-based method (LAC, Sum of ECC (SoECC, Range-Limited Centrality (RL, L-index (LI, Leader Rank (LR, Normalized α-Centrality (NC, and Moduland-Centrality (MC. Especially, the improvement of PeC over the classic centrality measures (BC, CC, SC, EC, and BN is more than 50% when predicting no more than 500 proteins. Conclusions We demonstrate that the integration of protein-protein interaction network and gene expression data can help improve the precision of predicting essential proteins. The new centrality measure, PeC, is an effective essential protein discovery method.

  7. Sequence diversities of serine-aspartate repeat genes among Staphylococcus aureus isolates from different hosts presumably by horizontal gene transfer.

    Directory of Open Access Journals (Sweden)

    Huping Xue

    Full Text Available BACKGROUND: Horizontal gene transfer (HGT is recognized as one of the major forces for bacterial genome evolution. Many clinically important bacteria may acquire virulence factors and antibiotic resistance through HGT. The comparative genomic analysis has become an important tool for identifying HGT in emerging pathogens. In this study, the Serine-Aspartate Repeat (Sdr family has been compared among different sources of Staphylococcus aureus (S. aureus to discover sequence diversities within their genomes. METHODOLOGY/PRINCIPAL FINDINGS: Four sdr genes were analyzed for 21 different S. aureus strains and 218 mastitis-associated S. aureus isolates from Canada. Comparative genomic analyses revealed that S. aureus strains from bovine mastitis (RF122 and mastitis isolates in this study, ovine mastitis (ED133, pig (ST398, chicken (ED98, and human methicillin-resistant S. aureus (MRSA (TCH130, MRSA252, Mu3, Mu50, N315, 04-02981, JH1 and JH9 were highly associated with one another, presumably due to HGT. In addition, several types of insertion and deletion were found in sdr genes of many isolates. A new insertion sequence was found in mastitis isolates, which was presumably responsible for the HGT of sdrC gene among different strains. Moreover, the sdr genes could be used to type S. aureus. Regional difference of sdr genes distribution was also indicated among the tested S. aureus isolates. Finally, certain associations were found between sdr genes and subclinical or clinical mastitis isolates. CONCLUSIONS: Certain sdr gene sequences were shared in S. aureus strains and isolates from different species presumably due to HGT. Our results also suggest that the distributional assay of virulence factors should detect the full sequences or full functional regions of these factors. The traditional assay using short conserved regions may not be accurate or credible. These findings have important implications with regard to animal husbandry practices that may

  8. The first report of prion-related protein gene (PRNT) polymorphisms in goat.

    Science.gov (United States)

    Kim, Yong-Chan; Jeong, Byung-Hoon

    2017-06-01

    Prion protein is encoded by the prion protein gene (PRNP). Polymorphisms of several members of the prion gene family have shown association with prion diseases in several species. Recent studies on a novel member of the prion gene family in rams have shown that prion-related protein gene (PRNT) has a linkage with codon 26 of prion-like protein (PRND). In a previous study, codon 26 polymorphism of PRND has shown connection with PRNP haplotype which is strongly associated with scrapie vulnerability. In addition, the genotype of a single nucleotide polymorphism (SNP) at codon 26 of PRND is related to fertilisation capacity. These findings necessitate studies on the SNP of PRNT gene which is connected with PRND. In goat, several polymorphism studies have been performed for PRNP, PRND, and shadow of prion protein gene (SPRN). However, polymorphism on PRNT has not been reported. Hence, the objective of this study was to determine the genotype and allelic distribution of SNPs of PRNT in 238 Korean native goats and compare PRNT DNA sequences between Korean native goats and several ruminant species. A total of five SNPs, including PRNT c.-114G > T, PRNT c.-58A > G in the upstream of PRNT gene, PRNT c.71C > T (p.Ala24Val) and PRNT c.102G > A in the open reading frame (ORF) and c.321C > T in the downstream of PRNT gene, were found in this study. All five SNPs of caprine PRNT gene in Korean native goat are in complete linkage disequilibrium (LD) with a D' value of 1.0. Interestingly, comparative sequence analysis of the PRNT gene revealed five mismatches between DNA sequences of Korean native goats and those of goats deposited in the GenBank. Korean native black goats also showed 5 mismatches in PRNT ORF with cattle. To the best of our knowledge, this is the first genetic research of the PRNT gene in goat.

  9. Selection shaped the evolution of mouse androgen-binding protein (ABP) function and promoted the duplication of Abp genes.

    Science.gov (United States)

    Karn, Robert C; Laukaitis, Christina M

    2014-08-01

    In the present article, we summarize two aspects of our work on mouse ABP (androgen-binding protein): (i) the sexual selection function producing incipient reinforcement on the European house mouse hybrid zone, and (ii) the mechanism behind the dramatic expansion of the Abp gene region in the mouse genome. Selection unifies these two components, although the ways in which selection has acted differ. At the functional level, strong positive selection has acted on key sites on the surface of one face of the ABP dimer, possibly to influence binding to a receptor. A different kind of selection has apparently driven the recent and rapid expansion of the gene region, probably by increasing the amount of Abp transcript, in one or both of two ways. We have shown previously that groups of Abp genes behave as LCRs (low-copy repeats), duplicating as relatively large blocks of genes by NAHR (non-allelic homologous recombination). The second type of selection involves the close link between the accumulation of L1 elements and the expansion of the Abp gene family by NAHR. It is probably predicated on an initial selection for increased transcription of existing Abp genes and/or an increase in Abp gene number providing more transcriptional sites. Either or both could increase initial transcript production, a quantitative change similar to increasing the volume of a radio transmission. In closing, we also provide a note on Abp gene nomenclature.

  10. Influence of training intensity on adaptations in acid/base transport proteins, muscle buffer capacity, and repeated-sprint ability in active men.

    Science.gov (United States)

    McGinley, Cian; Bishop, David J

    2016-12-01

    McGinley C, Bishop DJ. Influence of training intensity on adaptations in acid/base transport proteins, muscle buffer capacity, and repeated-sprint ability in active men. J Appl Physiol 121: 1290-1305, 2016. First published October 14, 2016; doi:10.1152/japplphysiol.00630.2016-This study measured the adaptive response to exercise training for each of the acid-base transport protein families, including providing isoform-specific evidence for the monocarboxylate transporter (MCT)1/4 chaperone protein basigin and for the electrogenic sodium-bicarbonate cotransporter (NBCe)1. We investigated whether 4 wk of work-matched, high-intensity interval training (HIIT), performed either just above the lactate threshold (HIITΔ20; n = 8), or close to peak aerobic power (HIITΔ90; n = 8), influenced adaptations in acid-base transport protein abundance, nonbicarbonate muscle buffer capacity (βm in vitro ), and exercise capacity in active men. Training intensity did not discriminate between adaptations for most proteins measured, with abundance of MCT1, sodium/hydrogen exchanger (NHE) 1, NBCe1, carbonic anhydrase (CA) II, and CAXIV increasing after 4 wk, whereas there was little change in CAIII and CAIV abundance. βm in vitro also did not change. However, MCT4 protein content only increased for HIITΔ20 [effect size (ES): 1.06, 90% confidence limits × / ÷ 0.77], whereas basigin protein content only increased for HIITΔ90 (ES: 1.49, × / ÷ 1.42). Repeated-sprint ability (5 × 6-s sprints; 24 s passive rest) improved similarly for both groups. Power at the lactate threshold only improved for HIITΔ20 (ES: 0.49; 90% confidence limits ± 0.38), whereas peak O 2 uptake did not change for either group. Detraining was characterized by the loss of adaptations for all of the proteins measured and for repeated-sprint ability 6 wk after removing the stimulus of HIIT. In conclusion, 4 wk of HIIT induced improvements in each of the acid-base transport protein families, but, remarkably, a 40

  11. Expression of protein-coding genes embedded in ribosomal DNA

    DEFF Research Database (Denmark)

    Johansen, Steinar D; Haugen, Peik; Nielsen, Henrik

    2007-01-01

    Ribosomal DNA (rDNA) is a specialised chromosomal location that is dedicated to high-level transcription of ribosomal RNA genes. Interestingly, rDNAs are frequently interrupted by parasitic elements, some of which carry protein genes. These are non-LTR retrotransposons and group II introns that e...... in the nucleolus....

  12. Dysregulation of gene expression in the striatum of BACHD rats expressing full-length mutant huntingtin and associated abnormalities on molecular and protein levels.

    Science.gov (United States)

    Yu-Taeger, Libo; Bonin, Michael; Stricker-Shaver, Janice; Riess, Olaf; Nguyen, Hoa Huu Phuc

    2017-05-01

    Huntington disease (HD) is an autosomal dominantly inherited neurodegenerative disorder caused by a CAG repeat expansion in the gene coding for the huntingtin protein (HTT). Mutant HTT (mHTT) has been proposed to cause neuronal dysfunction and neuronal loss through multiple mechanisms. Transcriptional changes may be a core pathogenic feature of HD. Utilizing the Affymetrix platform we performed a genome-wide RNA expression analysis in two BACHD transgenic rat lines (TG5 and TG9) at 12 months of age, both of which carry full-length human mHTT but with different expression levels. By defining the threshold of significance at p < 0.01, we found 1608 genes and 871 genes differentially expressed in both TG5 and TG9 rats when compared to the wild type littermates, respectively. We only chose the highly up-/down-regulated genes for further analysis by setting an additional threshold of 1.5 fold change. Comparing gene expression profiles of human HD brains and BACHD rats revealed a high concordance in both functional and IPA (Ingenuity Pathway Analysis) canonical pathways relevant to HD. In addition, we investigated the causes leading to gene expression changes at molecular and protein levels in BACHD rats including the involvement of polyQ-containing transcription factors TATA box-binding protein (TBP), Sp1 and CBP as well as the chromatin structure. We demonstrate that the BACHD rat model recapitulates the gene expression changes of the human disease supporting its role as a preclinical research animal model. We also show for the first time that TFIID complex formation is reduced, while soluble TBP is increased in an HD model. This finding suggests that mHTT is a competitor instead of a recruiter of polyQ-containing transcription factors in the transcription process in HD. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Evolution of the MAGUK protein gene family in premetazoan lineages

    Directory of Open Access Journals (Sweden)

    Ruiz-Trillo Iñaki

    2010-04-01

    Full Text Available Abstract Background Cell-to-cell communication is a key process in multicellular organisms. In multicellular animals, scaffolding proteins belonging to the family of membrane-associated guanylate kinases (MAGUK are involved in the regulation and formation of cell junctions. These MAGUK proteins were believed to be exclusive to Metazoa. However, a MAGUK gene was recently identified in an EST survey of Capsaspora owczarzaki, an unicellular organism that branches off near the metazoan clade. To further investigate the evolutionary history of MAGUK, we have undertook a broader search for this gene family using available genomic sequences of different opisthokont taxa. Results Our survey and phylogenetic analyses show that MAGUK proteins are present not only in Metazoa, but also in the choanoflagellate Monosiga brevicollis and in the protist Capsaspora owczarzaki. However, MAGUKs are absent from fungi, amoebozoans or any other eukaryote. The repertoire of MAGUKs in Placozoa and eumetazoan taxa (Cnidaria + Bilateria is quite similar, except for one class that is missing in Trichoplax, while Porifera have a simpler MAGUK repertoire. However, Vertebrata have undergone several independent duplications and exhibit two exclusive MAGUK classes. Three different MAGUK types are found in both M. brevicollis and C. owczarzaki: DLG, MPP and MAGI. Furthermore, M. brevicollis has suffered a lineage-specific diversification. Conclusions The diversification of the MAGUK protein gene family occurred, most probably, prior to the divergence between Metazoa+choanoflagellates and the Capsaspora+Ministeria clade. A MAGI-like, a DLG-like, and a MPP-like ancestral genes were already present in the unicellular ancestor of Metazoa, and new gene members have been incorporated through metazoan evolution within two major periods, one before the sponge-eumetazoan split and another within the vertebrate lineage. Moreover, choanoflagellates have suffered an independent MAGUK

  14. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori.

    Science.gov (United States)

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-03-25

    Hoxgenes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hoxgenes can also function in terminally differentiated tissue of the lepidopteranBombyx mori In this species,Antennapedia(Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antpcan regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antpin the posterior silk gland induced ectopic expression of major silk protein genes such assericin-3,fhxh4, and fhxh5 These genes are normally expressed specifically in the middle silk gland as is Antp Therefore, the evidence strongly suggests that Antpactivates these silk protein genes in the middle silk gland. The putativesericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antpdirectly activates their expression. We also found that the pattern of gene expression was well conserved between B. moriand the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori We suggest that Hoxgenes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  15. A sight on protein-based nanoparticles as drug/gene delivery systems.

    Science.gov (United States)

    Salatin, Sara; Jelvehgari, Mitra; Maleki-Dizaj, Solmaz; Adibkia, Khosro

    2015-01-01

    Polymeric nanomaterials have extensively been applied for the preparation of targeted and controlled release drug/gene delivery systems. However, problems involved in the formulation of synthetic polymers such as using of the toxic solvents and surfactants have limited their desirable applications. In this regard, natural biomolecules including proteins and polysaccharide are suitable alternatives due to their safety. According to literature, protein-based nanoparticles possess many advantages for drug and gene delivery such as biocompatibility, biodegradability and ability to functionalize with targeting ligands. This review provides a general sight on the application of biodegradable protein-based nanoparticles in drug/gene delivery based on their origins. Their unique physicochemical properties that help them to be formulated as pharmaceutical carriers are also discussed.

  16. TGC repeat expansion in the TCF4 gene increases the risk of Fuchs' endothelial corneal dystrophy in Australian cases.

    Directory of Open Access Journals (Sweden)

    Abraham Kuot

    Full Text Available Fuchs' endothelial corneal dystrophy (FECD is a progressive, vision impairing disease. Common single nucleotide polymorphisms (SNPs and a trinucleotide repeat polymorphism, thymine-guanine-cytosine (TGC, in the TCF4 gene have been associated with the risk of FECD in some populations. We previously reported association of SNPs in TCF4 with FECD risk in the Australian population. The aim of this study was to determine whether TGC repeat polymorphism in TCF4 is associated with FECD in the Australian population. In 189 unrelated Australian cases with advanced late-onset FECD and 183 matched controls, the TGC repeat polymorphism located in intron 3 of TCF4 was genotyped using a short tandem repeat (STR assay. The repeat length was verified by direct sequencing in selected homozygous carriers. We found significant association between the expanded TGC repeat (≥ 40 repeats in TCF4 and advanced FECD (P = 2.58 × 10-22; OR = 15.66 (95% CI: 7.79-31.49. Genotypic analysis showed that 51% of cases (97 compared to 5% of controls (9 were heterozygous or homozygous for the expanded repeat allele. Furthermore, the repeat expansion showed stronger association than the most significantly associated SNP, rs613872, in TCF4, with the disease in the Australian cohort. This and haplotype analysis of both the polymorphisms suggest that considering both the polymorphisms together rather than either of the two alone would better predict susceptibility to FECD in the Australian population. This is the first study to report association of the TGC trinucleotide repeat expansion in TCF4 with advanced FECD in the Australian population.

  17. The equine herpesvirus-1 IR3 gene that lies antisense to the sole immediate-early (IE) gene is trans-activated by the IE protein, and is poorly expressed to a protein

    International Nuclear Information System (INIS)

    Ahn, Byung Chul; Breitenbach, Jonathan E.; Kim, Seong K.; O'Callaghan, Dennis J.

    2007-01-01

    The unique IR3 gene of equine herpesvirus 1 (EHV-1) is expressed as a late 1.0-kb transcript. Previous studies confirmed the IR3 transcription initiation site and tentatively identified other cis-acting elements specific to IR3 such as a TATA box, a 443 base pair 5'untranslated region (UTR), a 285 base pair open reading frame (ORF), and a poly adenylation (A) signal [Holden, V.R., Harty, R.N., Yalamanchili, R.R., O'Callaghan, D.J., 1992. The IR3 gene of equine herpesvirus type 1: a unique gene regulated by sequences within the intron of the immediate-early gene. DNA Seq. 3, 143-152]. Transient transfection assays revealed that the IR3 promoter is strongly trans-activated by the IE protein (IEP) and that coexpression of the IEP with the early EICP0 and IR4 regulatory proteins results in maximal trans-activation of the IR3 promoter. Gel shift assays revealed that the IEP directly binds to the IR3 promoter region. Western blot analysis showed that the IR3 protein produced in E. coli was detected by antibodies to IR3 synthetic peptides; however, the IR3 protein was not detected in EHV-1 infected cell extracts by these same anti-IR3 antibodies, even though the IR3 transcript was detected by northern blot. These findings suggest that the IR3 may not be expressed to a protein. Expression of an IR3/GFP fusion gene was not observed, but expression of a GFP/IR3 fusion gene was detected by fluorescent microscopy. In further attempts to detect the IR3/GFP fusion protein using anti-GFP antibody, western blot analysis showed that the IR3/GFP fusion protein was not detected in vivo. Interestingly, a truncated form of the GFP/IR3 protein was synthesized from the GFP/IR3 fusion gene. However, GFP/IR3 and IR3/GFP fusion proteins of the predicted sizes were synthesized by in vitro coupled transcription and translation of the fusion genes, suggesting poor expression of the IR3 protein in vivo. The possible role of the IR3 transcript in EHV-1 infection is discussed

  18. Identification of stress responsive genes by studying specific relationships between mRNA and protein abundance.

    Science.gov (United States)

    Morimoto, Shimpei; Yahara, Koji

    2018-03-01

    Protein expression is regulated by the production and degradation of mRNAs and proteins but the specifics of their relationship are controversial. Although technological advances have enabled genome-wide and time-series surveys of mRNA and protein abundance, recent studies have shown paradoxical results, with most statistical analyses being limited to linear correlation, or analysis of variance applied separately to mRNA and protein datasets. Here, using recently analyzed genome-wide time-series data, we have developed a statistical analysis framework for identifying which types of genes or biological gene groups have significant correlation between mRNA and protein abundance after accounting for potential time delays. Our framework stratifies all genes in terms of the extent of time delay, conducts gene clustering in each stratum, and performs a non-parametric statistical test of the correlation between mRNA and protein abundance in a gene cluster. Consequently, we revealed stronger correlations than previously reported between mRNA and protein abundance in two metabolic pathways. Moreover, we identified a pair of stress responsive genes ( ADC17 and KIN1 ) that showed a highly similar time series of mRNA and protein abundance. Furthermore, we confirmed robustness of the analysis framework by applying it to another genome-wide time-series data and identifying a cytoskeleton-related gene cluster (keratin 18, keratin 17, and mitotic spindle positioning) that shows similar correlation. The significant correlation and highly similar changes of mRNA and protein abundance suggests a concerted role of these genes in cellular stress response, which we consider provides an answer to the question of the specific relationships between mRNA and protein in a cell. In addition, our framework for studying the relationship between mRNAs and proteins in a cell will provide a basis for studying specific relationships between mRNA and protein abundance after accounting for potential

  19. Positive selection and propeptide repeats promote rapid interspecific divergence of a gastropod sperm protein.

    Science.gov (United States)

    Hellberg, M E; Moy, G W; Vacquier, V D

    2000-03-01

    Male-specific proteins have increasingly been reported as targets of positive selection and are of special interest because of the role they may play in the evolution of reproductive isolation. We report the rapid interspecific divergence of cDNA encoding a major acrosomal protein of unknown function (TMAP) of sperm from five species of teguline gastropods. A mitochondrial DNA clock (calibrated by congeneric species divided by the Isthmus of Panama) estimates that these five species diverged 2-10 MYA. Inferred amino acid sequences reveal a propeptide that has diverged rapidly between species. The mature protein has diverged faster still due to high nonsynonymous substitution rates (> 25 nonsynonymous substitutions per site per 10(9) years). cDNA encoding the mature protein (89-100 residues) shows evidence of positive selection (Dn/Ds > 1) for 4 of 10 pairwise species comparisons. cDNA and predicted secondary-structure comparisons suggest that TMAP is neither orthologous nor paralogous to abalone lysin, and thus marks a second, phylogenetically independent, protein subject to strong positive selection in free-spawning marine gastropods. In addition, an internal repeat in one species (Tegula aureotincta) produces a duplicated cleavage site which results in two alternatively processed mature proteins differing by nine amino acid residues. Such alternative processing may provide a mechanism for introducing novel amino acid sequence variation at the amino-termini of proteins. Highly divergent TMAP N-termini from two other tegulines (Tegula regina and Norrisia norrisii) may have originated by such a mechanism.

  20. Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

    Directory of Open Access Journals (Sweden)

    Chen Xie

    2012-09-01

    Full Text Available Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis, which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.

  1. Connecting protein and mRNA burst distributions for stochastic models of gene expression

    International Nuclear Information System (INIS)

    Elgart, Vlad; Jia, Tao; Fenley, Andrew T; Kulkarni, Rahul

    2011-01-01

    The intrinsic stochasticity of gene expression can lead to large variability in protein levels for genetically identical cells. Such variability in protein levels can arise from infrequent synthesis of mRNAs which in turn give rise to bursts of protein expression. Protein expression occurring in bursts has indeed been observed experimentally and recent studies have also found evidence for transcriptional bursting, i.e. production of mRNAs in bursts. Given that there are distinct experimental techniques for quantifying the noise at different stages of gene expression, it is of interest to derive analytical results connecting experimental observations at different levels. In this work, we consider stochastic models of gene expression for which mRNA and protein production occurs in independent bursts. For such models, we derive analytical expressions connecting protein and mRNA burst distributions which show how the functional form of the mRNA burst distribution can be inferred from the protein burst distribution. Additionally, if gene expression is repressed such that observed protein bursts arise only from single mRNAs, we show how observations of protein burst distributions (repressed and unrepressed) can be used to completely determine the mRNA burst distribution. Assuming independent contributions from individual bursts, we derive analytical expressions connecting means and variances for burst and steady-state protein distributions. Finally, we validate our general analytical results by considering a specific reaction scheme involving regulation of protein bursts by small RNAs. For a range of parameters, we derive analytical expressions for regulated protein distributions that are validated using stochastic simulations. The analytical results obtained in this work can thus serve as useful inputs for a broad range of studies focusing on stochasticity in gene expression

  2. Protein-Protein Interaction Network and Gene Ontology

    Science.gov (United States)

    Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah

    Evolution of computer technologies makes it possible to access a large amount and various kinds of biological data via internet such as DNA sequences, proteomics data and information discovered about them. It is expected that the combination of various data could help researchers find further knowledge about them. Roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations can be combination of protein-protein interaction (PPI) data and Gene Ontology (GO) which could help enhance the analysis of PPI network. We introduce a simple but comprehensive visualization system that integrates GO and PPI data where GO and PPI graphs are visualized side-by-side and supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and GO directedacyclic- graph such as context-based browsing and common ancestors finding.

  3. Cloning and characterization of carboxyl terminus of heat shock cognate 70-interacting protein gene from the silkworm, Bombyx mori.

    Science.gov (United States)

    Ohsawa, Takeshi; Fujimoto, Shota; Tsunakawa, Akane; Shibano, Yuka; Kawasaki, Hideki; Iwanaga, Masashi

    2016-11-01

    Carboxyl terminus of heat shock cognate 70-interacting protein (CHIP) is an evolutionarily conserved E3 ubiquitin ligase across different eukaryotic species and is known to play a key role in protein quality control. CHIP has two distinct functional domains, an N-terminal tetratricopeptide repeat (TPR) and a C-terminal U-box domain, which are required for the ubiquitination of numerous labile client proteins that are chaperoned by heat shock proteins (HSPs) and heat shock cognate proteins (HSCs). During our screen for CHIP-like proteins in the Bombyx mori databases, we found a novel silkworm gene, Bombyx mori CHIP. Phylogenetic analysis showed that BmCHIP belongs to Lepidopteran lineages. Quantitative reverse transcription-PCR analysis indicated that BmCHIP was relatively highly expressed in the gonad and fat body. A pull-down experiment and auto-ubiquitination assay showed that BmCHIP interacted with BmHSC70 and had E3 ligase activity. Additionally, immunohistochemical analysis revealed that BmCHIP was partially co-localized with ubiquitin in BmN4 cells. These data support that BmCHIP plays an important role in the ubiquitin proteasome system as an E3 ubiquitin ligase in B. mori. Copyright © 2016 Elsevier Inc. All rights reserved.

  4. Parkinson's Disease: Leucine-Rich Repeat Kinase 2 and Autophagy, Intimate Enemies

    Directory of Open Access Journals (Sweden)

    José M. Bravo-San Pedro

    2012-01-01

    Full Text Available Parkinson's disease is the second common neurodegenerative disorder, after Alzheimer's disease. It is a clinical syndrome characterized by loss of dopamine-generating cells in the substancia nigra, a region of the midbrain. The etiology of Parkinson's disease has long been through to involve both genetic and environmental factors. Mutations in the leucine-rich repeat kinase 2 gene cause late-onset Parkinson's disease with a clinical appearance indistinguishable from Parkinson's disease idiopathic. Autophagy is an intracellular catabolic mechanism whereby a cell recycles or degrades damage proteins and cytoplasmic organelles. This degradative process has been associated with cellular dysfunction in neurodegenerative processes including Parkinson's disease. We discuss the role of leucine-rich repeat kinase 2 in autophagy, and how the deregulations of this degradative mechanism in cells can be implicated in the Parkinson's disease etiology.

  5. Prenatal diagnosis of the fragile X syndrome : loss of mutation owing to a double recombinant or gene conversion event at the FMR1 locus

    NARCIS (Netherlands)

    Losekoot, M; Hoogendoorn, E; Olmer, R; Jansen, CCAM; Oosterwijk, JC; vandenOuweland, AMW; Halley, DJJ; Warren, ST; Willemsen, R; Oostra, BA; Bakker, E

    1997-01-01

    The fragile X syndrome, an X linked mental retardation syndrome, is caused by an expanded CGG repeat in the first exon of the FMR1 gene. In patients with an expanded repeat the FMR1 promoter is methylated and, consequently, the gene is silenced and no FMR1 protein (FMRP) is produced, thus leading to

  6. [Progress of genome engineering technology via clustered regularly interspaced short palindromic repeats--a review].

    Science.gov (United States)

    Li, Hao; Qiu, Shaofu; Song, Hongbin

    2013-10-04

    In survival competition with phage, bacteria and archaea gradually evolved the acquired immune system--Clustered regularly interspaced short palindromic repeats (CRISPR), presenting the trait of transcribing the crRNA and the CRISPR-associated protein (Cas) to silence or cleaving the foreign double-stranded DNA specifically. In recent years, strong interest arises in prokaryotes primitive immune system and many in-depth researches are going on. Recently, researchers successfully repurposed CRISPR as an RNA-guided platform for sequence-specific gene expression, which provides a simple approach for selectively perturbing gene expression on a genome-wide scale. It will undoubtedly bring genome engineering into a more convenient and accurate new era.

  7. Testing of disease-resistance of pokeweed antiviral protein gene ...

    African Journals Online (AJOL)

    Transformation of pokeweed antiviral protein gene (PAP) into plants was shown to improve plant resistance to several viruses or fungi pathogens with no much negative effect on plant growth. The non-virulent defective PAP inhibits only the virus but does not interfere with the host. A non-virulent defective PAP gene ...

  8. Evidence that C9ORF72 Dipeptide Repeat Proteins Associate with U2 snRNP to Cause Mis-splicing in ALS/FTD Patients.

    Science.gov (United States)

    Yin, Shanye; Lopez-Gonzalez, Rodrigo; Kunz, Ryan C; Gangopadhyay, Jaya; Borufka, Carl; Gygi, Steven P; Gao, Fen-Biao; Reed, Robin

    2017-06-13

    Hexanucleotide repeat expansion in the C9ORF72 gene results in production of dipeptide repeat (DPR) proteins that may disrupt pre-mRNA splicing in amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) patients. At present, the mechanisms underlying this mis-splicing are not understood. Here, we show that addition of proline-arginine (PR) and glycine-arginine (GR) toxic DPR peptides to nuclear extracts blocks spliceosome assembly and splicing, but not other types of RNA processing. Proteomic and biochemical analyses identified the U2 small nuclear ribonucleoprotein particle (snRNP) as a major interactor of PR and GR peptides. In addition, U2 snRNP, but not other splicing factors, mislocalizes from the nucleus to the cytoplasm both in C9ORF72 patient induced pluripotent stem cell (iPSC)-derived motor neurons and in HeLa cells treated with the toxic peptides. Bioinformatic studies support a specific role for U2-snRNP-dependent mis-splicing in C9ORF72 patient brains. Together, our data indicate that DPR-mediated dysfunction of U2 snRNP could account for as much as ∼44% of the mis-spliced cassette exons in C9ORF72 patient brains. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  9. Evidence that C9ORF72 Dipeptide Repeat Proteins Associate with U2 snRNP to Cause Mis-splicing in ALS/FTD Patients

    Directory of Open Access Journals (Sweden)

    Shanye Yin

    2017-06-01

    Full Text Available Hexanucleotide repeat expansion in the C9ORF72 gene results in production of dipeptide repeat (DPR proteins that may disrupt pre-mRNA splicing in amyotrophic lateral sclerosis (ALS and frontotemporal dementia (FTD patients. At present, the mechanisms underlying this mis-splicing are not understood. Here, we show that addition of proline-arginine (PR and glycine-arginine (GR toxic DPR peptides to nuclear extracts blocks spliceosome assembly and splicing, but not other types of RNA processing. Proteomic and biochemical analyses identified the U2 small nuclear ribonucleoprotein particle (snRNP as a major interactor of PR and GR peptides. In addition, U2 snRNP, but not other splicing factors, mislocalizes from the nucleus to the cytoplasm both in C9ORF72 patient induced pluripotent stem cell (iPSC-derived motor neurons and in HeLa cells treated with the toxic peptides. Bioinformatic studies support a specific role for U2-snRNP-dependent mis-splicing in C9ORF72 patient brains. Together, our data indicate that DPR-mediated dysfunction of U2 snRNP could account for as much as ∼44% of the mis-spliced cassette exons in C9ORF72 patient brains.

  10. Identification and cloning of two insecticidal protein genes from ...

    African Journals Online (AJOL)

    Bacillus thuringiensis (Bt) is the most widely applied type of microbial pesticide due to its high specificity and environmental safety. The activity of Bt is largely attributed to the insecticidal crystal protein encoded by the cry genes. Different insecticidal crystal proteins of Bt have different bioactivity against distinct agricultural ...

  11. Local chromatin structure of heterochromatin regulates repeated DNA stability, nucleolus structure, and genome integrity

    Energy Technology Data Exchange (ETDEWEB)

    Peng, Jamy C. [Univ. of California, Berkeley, CA (United States)

    2007-01-01

    Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) that binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those in

  12. Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias

    Directory of Open Access Journals (Sweden)

    Sailen Barik

    2017-12-01

    Full Text Available A significant number of proteins in all living species contains amino acid repeats (AARs of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(An] and double [(ABn] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1 One class of amino acid repeats (Class I uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2 The second class (Class II disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons; and finally, (3 In all AARs (including Class I, Class II, and the in-betweens, the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.

  13. Amino acid repeats avert mRNA folding through conservative substitutions and synonymous codons, regardless of codon bias.

    Science.gov (United States)

    Barik, Sailen

    2017-12-01

    A significant number of proteins in all living species contains amino acid repeats (AARs) of various lengths and compositions, many of which play important roles in protein structure and function. Here, I have surveyed select homopolymeric single [(A)n] and double [(AB)n] AARs in the human proteome. A close examination of their codon pattern and analysis of RNA structure propensity led to the following set of empirical rules: (1) One class of amino acid repeats (Class I) uses a mixture of synonymous codons, some of which approximate the codon bias ratio in the overall human proteome; (2) The second class (Class II) disregards the codon bias ratio, and appears to have originated by simple repetition of the same codon (or just a few codons); and finally, (3) In all AARs (including Class I, Class II, and the in-betweens), the codons are chosen in a manner that precludes the formation of RNA secondary structure. It appears that the AAR genes have evolved by orchestrating a balance between codon usage and mRNA secondary structure. The insights gained here should provide a better understanding of AAR evolution and may assist in designing synthetic genes.

  14. Upon Infection the Cellular WD Repeat-containing Protein 5 (WDR5) Localizes to Cytoplasmic Inclusion Bodies and Enhances Measles Virus Replication.

    Science.gov (United States)

    Ma, Dzwokai; George, Cyril X; Nomburg, Jason; Pfaller, Christian K; Cattaneo, Roberto; Samuel, Charles E

    2017-12-13

    Replication of negative-strand RNA viruses occurs in association with discrete cytoplasmic foci called inclusion bodies. Whereas inclusion bodies represent a prominent subcellular structure induced by viral infection, our knowledge of the cellular protein components involved in inclusion body formation and function is limited. Using measles virus-infected HeLa cells, we found that the WD repeat-containing protein 5 (WDR5), a subunit of histone H3 lysine 4 methyltransferases, was selectively recruited to virus-induced inclusion bodies. Furthermore, WDR5 was found in complexes containing viral proteins associated with RNA replication. WDR5 was not detected with mitochondria, stress granules, or other known secretory or endocytic compartments of infected cells. WDR5 deficiency decreased both viral protein production and infectious virus yields. Interferon production was modestly increased in WDR5 deficient cells. Thus, our study identifies WDR5 as a novel viral inclusion body-associated cellular protein and suggests a role for WDR5 in promoting viral replication. IMPORTANCE Measles virus is a human pathogen that remains a global concern with more than 100,000 measles-related deaths annually despite the availability of an effective vaccine. As measles continues to cause significant morbidity and mortality, understanding the virus-host interactions at the molecular level that affect virus replication efficiency is important for development and optimization of treatment procedures. Measles virus is an RNA virus that encodes six genes and replicates in the cytoplasm of infected cells in discrete cytoplasmic replication bodies, though little is known of the biochemical nature of these structures. Here we show that the cellular protein WDR5 is enriched in the cytoplasmic viral replication factories and enhances virus growth. WDR5-containing protein complex includes viral proteins responsible for viral RNA replication. Thus, we have identified WDR5 as a host factor that

  15. Tracking the evolution of a cold stress associated gene family in cold tolerant grasses

    DEFF Research Database (Denmark)

    Sandve, Simen R; Rudi, Heidi; Asp, Torben

    2008-01-01

    to the repeat motifs of the IRI-domain in cold tolerant grasses. Finally we show that the LRR-domain of carrot and grass IRI proteins both share homology to an Arabidopsis thaliana LRR-trans membrane protein kinase (LRR-TPK). Conclusion The diverse IRI-like genes identified in this study tell a tale...... of a complex evolutionary history including birth of an ice binding domain, a burst of gene duplication events after cold tolerant grasses radiated from rice, protein domain structure differentiation between paralogs, and sub- and/or neofunctionalisation of IRI-like proteins. From our sequence analysis we...

  16. Comparative transcriptomic analyses of differentially expressed genes in transgenic melatonin biosynthesis ovine HIOMT gene in switchgrass

    Directory of Open Access Journals (Sweden)

    Shan Yuan

    2016-11-01

    Full Text Available Melatonin serves pleiotropic functions in prompting plant growth and resistance to various stresses. The accurate biosynthetic pathway of melatonin remains elusive in plant species, while the N-acetyltransferase and O-methyltransferase were considered to be the last two key enzymes during its biosynthesis. To investigate the biosynthesis and metabolic pathway of melatonin in plants, the RNA-seq profile of overexpression of the ovine HIOMT was analyzed and compared with the previous transcriptome of transgenic oAANAT gene in switchgrass, a model plant for cellulosic ethanol production. A total of 946, 405 and 807 differentially expressed unigenes were observed in AANAT vs. control, HIOMT vs. control, and AANAT vs. HIOMT, respectively. The significantly upregulated (F-box/kelch-repeat protein, zinc finger BED domain-containing protein-3 genes were consistent with enhanced phenotypes of shoot, stem and root growth in transgenic oHIOMT switchgrass. Early flowering in overexpression of oHIOMT switchgrass involved in the regulation of flowering-time genes (APETALA2. Several stress resistant related genes (SPX domain-containing membrane protein, copper transporter 1, late blight resistance protein homolog R1A-6 OS etc. were specifically and significantly upregulated in transgenic oHIOMT only, while metabolism-related genes (phenylalanine-4-hydroxylase, tyrosine decarboxylase 1, protein disulfide-isomerase and galactinol synthase 2 etc. were significantly upregulated in transgenic oAANAT only. These results provide new sights into the biosynthetic and physiological functional networks of melatonin in plants.

  17. Meta-analysis of the effect of overexpression of CBF/DREB family genes on drought stress response

    Science.gov (United States)

    Transcription factors C-repeat/dehydration-responsive element binding proteins (CBF/DREB) play an important role in plant response to abiotic stresses. Over-expression of various CBF/DREB genes in diverse plants have been reported, but inconsistency of gene donor, recipient genus, parameters used i...

  18. Chicken genome analysis reveals novel genes encoding biotin-binding proteins related to avidin family

    Directory of Open Access Journals (Sweden)

    Nordlund Henri R

    2005-03-01

    Full Text Available Abstract Background A chicken egg contains several biotin-binding proteins (BBPs, whose complete DNA and amino acid sequences are not known. In order to identify and characterise these genes and proteins we studied chicken cDNAs and genes available in the NCBI database and chicken genome database using the reported N-terminal amino acid sequences of chicken egg-yolk BBPs as search strings. Results Two separate hits showing significant homology for these N-terminal sequences were discovered. For one of these hits, the chromosomal location in the immediate proximity of the avidin gene family was found. Both of these hits encode proteins having high sequence similarity with avidin suggesting that chicken BBPs are paralogous to avidin family. In particular, almost all residues corresponding to biotin binding in avidin are conserved in these putative BBP proteins. One of the found DNA sequences, however, seems to encode a carboxy-terminal extension not present in avidin. Conclusion We describe here the predicted properties of the putative BBP genes and proteins. Our present observations link BBP genes together with avidin gene family and shed more light on the genetic arrangement and variability of this family. In addition, comparative modelling revealed the potential structural elements important for the functional and structural properties of the putative BBP proteins.

  19. Disease candidate gene identification and prioritization using protein interaction networks

    Directory of Open Access Journals (Sweden)

    Aronow Bruce J

    2009-02-01

    Full Text Available Abstract Background Although most of the current disease candidate gene identification and prioritization methods depend on functional annotations, the coverage of the gene functional annotations is a limiting factor. In the current study, we describe a candidate gene prioritization method that is entirely based on protein-protein interaction network (PPIN analyses. Results For the first time, extended versions of the PageRank and HITS algorithms, and the K-Step Markov method are applied to prioritize disease candidate genes in a training-test schema. Using a list of known disease-related genes from our earlier study as a training set ("seeds", and the rest of the known genes as a test list, we perform large-scale cross validation to rank the candidate genes and also evaluate and compare the performance of our approach. Under appropriate settings – for example, a back probability of 0.3 for PageRank with Priors and HITS with Priors, and step size 6 for K-Step Markov method – the three methods achieved a comparable AUC value, suggesting a similar performance. Conclusion Even though network-based methods are generally not as effective as integrated functional annotation-based methods for disease candidate gene prioritization, in a one-to-one comparison, PPIN-based candidate gene prioritization performs better than all other gene features or annotations. Additionally, we demonstrate that methods used for studying both social and Web networks can be successfully used for disease candidate gene prioritization.

  20. Identification and Expression Profiling of the BTB Domain-Containing Protein Gene Family in the Silkworm, Bombyx mori

    Directory of Open Access Journals (Sweden)

    Daojun Cheng

    2014-01-01

    Full Text Available The BTB domain is a conserved protein-protein interaction motif. In this study, we identified 56 BTB domain-containing protein genes in the silkworm, in addition to 46 in the honey bee, 55 in the red flour beetle, and 53 in the monarch butterfly. Silkworm BTB protein genes were classified into nine subfamilies according to their domain architecture, and most of them could be mapped on the different chromosomes. Phylogenetic analysis suggests that silkworm BTB protein genes may have undergone a duplication event in three subfamilies: BTB-BACK-Kelch, BTB-BACK-PHR, and BTB-FLYWCH. Comparative analysis demonstrated that the orthologs of each of 13 BTB protein genes present a rigorous orthologous relationship in the silkworm and other surveyed insects, indicating conserved functions of these genes during insect evolution. Furthermore, several silkworm BTB protein genes exhibited sex-specific expression in larval tissues or at different stages during metamorphosis. These findings not only contribute to a better understanding of the evolution of insect BTB protein gene families but also provide a basis for further investigation of the functions of BTB protein genes in the silkworm.

  1. Expression and genomic organization of zonadhesin-like genes in three species of fish give insight into the evolutionary history of a mosaic protein

    Directory of Open Access Journals (Sweden)

    Davidson William S

    2005-11-01

    Full Text Available Abstract Background The mosaic sperm protein zonadhesin (ZAN has been characterized in mammals and is implicated in species-specific egg-sperm binding interactions. The genomic structure and testes-specific expression of zonadhesin is known for many mammalian species. All zonadhesin genes characterized to date consist of meprin A5 antigen receptor tyrosine phosphatase mu (MAM domains, mucin tandem repeats, and von Willebrand (VWD adhesion domains. Here we investigate the genomic structure and expression of zonadhesin-like genes in three species of fish. Results The cDNA and corresponding genomic locus of a zonadhesin-like gene (zlg in Atlantic salmon (Salmo salar were sequenced. Zlg is similar in adhesion domain content to mammalian zonadhesin; however, the domain order is altered. Analysis of puffer fish (Takifugu rubripes and zebrafish (Danio rerio sequence data identified zonadhesin (zan genes that share the same domain order, content, and a conserved syntenic relationship with mammalian zonadhesin. A zonadhesin-like gene in D. rerio was also identified. Unlike mammalian zonadhesin, D. rerio zan and S. salar zlg were expressed in the gut and not in the testes. Conclusion We characterized likely orthologs of zonadhesin in both T. rubripes and D. rerio and uncovered zonadhesin-like genes in S. salar and D. rerio. Each of these genes contains MAM, mucin, and VWD domains. While these domains are associated with several proteins that show prominent gut expression, their combination is unique to zonadhesin and zonadhesin-like genes in vertebrates. The expression patterns of fish zonadhesin and zonadhesin-like genes suggest that the reproductive role of zonadhesin evolved later in the mammalian lineage.

  2. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net. Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  3. Identification of stress responsive genes by studying specific relationships between mRNA and protein abundance

    Directory of Open Access Journals (Sweden)

    Shimpei Morimoto

    2018-03-01

    Full Text Available Protein expression is regulated by the production and degradation of mRNAs and proteins but the specifics of their relationship are controversial. Although technological advances have enabled genome-wide and time-series surveys of mRNA and protein abundance, recent studies have shown paradoxical results, with most statistical analyses being limited to linear correlation, or analysis of variance applied separately to mRNA and protein datasets. Here, using recently analyzed genome-wide time-series data, we have developed a statistical analysis framework for identifying which types of genes or biological gene groups have significant correlation between mRNA and protein abundance after accounting for potential time delays. Our framework stratifies all genes in terms of the extent of time delay, conducts gene clustering in each stratum, and performs a non-parametric statistical test of the correlation between mRNA and protein abundance in a gene cluster. Consequently, we revealed stronger correlations than previously reported between mRNA and protein abundance in two metabolic pathways. Moreover, we identified a pair of stress responsive genes (ADC17 and KIN1 that showed a highly similar time series of mRNA and protein abundance. Furthermore, we confirmed robustness of the analysis framework by applying it to another genome-wide time-series data and identifying a cytoskeleton-related gene cluster (keratin 18, keratin 17, and mitotic spindle positioning that shows similar correlation. The significant correlation and highly similar changes of mRNA and protein abundance suggests a concerted role of these genes in cellular stress response, which we consider provides an answer to the question of the specific relationships between mRNA and protein in a cell. In addition, our framework for studying the relationship between mRNAs and proteins in a cell will provide a basis for studying specific relationships between mRNA and protein abundance after

  4. Prediction of human protein function according to Gene Ontology categories

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Stærfeldt, Hans Henrik

    2003-01-01

    developed a method for prediction of protein function for a subset of classes from the Gene Ontology classification scheme. This subset includes several pharmaceutically interesting categories-transcription factors, receptors, ion channels, stress and immune response proteins, hormones and growth factors...

  5. DAZ Family Proteins, Key Players for Germ Cell Development

    Science.gov (United States)

    Fu, Xia-Fei; Cheng, Shun-Feng; Wang, Lin-Qing; Yin, Shen; De Felici, Massimo; Shen, Wei

    2015-01-01

    DAZ family proteins are found almost exclusively in germ cells in distant animal species. Deletion or mutations of their encoding genes usually severely impair either oogenesis or spermatogenesis or both. The family includes Boule (or Boll), Dazl (or Dazla) and DAZ genes. Boule and Dazl are situated on autosomes while DAZ, exclusive of higher primates, is located on the Y chromosome. Deletion of DAZ gene is the most common causes of infertility in humans. These genes, encoding for RNA binding proteins, contain a highly conserved RNA recognition motif and at least one DAZ repeat encoding for a 24 amino acids sequence able to bind other mRNA binding proteins. Basically, Daz family proteins function as adaptors for target mRNA transport and activators of their translation. In some invertebrate species, BOULE protein play a pivotal role in germline specification and a conserved regulatory role in meiosis. Depending on the species, DAZL is expressed in primordial germ cells (PGCs) and/or pre-meiotic and meiotic germ cells of both sexes. Daz is found in fetal gonocytes, spermatogonia and spermatocytes of adult testes. Here we discuss DAZ family genes in a phylogenic perspective, focusing on the common and distinct features of these genes, and their pivotal roles during gametogenesis evolved during evolution. PMID:26327816

  6. Highly sensitive detection of individual HEAT and ARM repeats with HHpred and COACH.

    Science.gov (United States)

    Kippert, Fred; Gerloff, Dietlind L

    2009-09-24

    HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high

  7. Elfin: An algorithm for the computational design of custom three-dimensional structures from modular repeat protein building blocks.

    Science.gov (United States)

    Yeh, Chun-Ting; Brunette, T J; Baker, David; McIntosh-Smith, Simon; Parmeggiani, Fabio

    2018-02-01

    Computational protein design methods have enabled the design of novel protein structures, but they are often still limited to small proteins and symmetric systems. To expand the size of designable proteins while controlling the overall structure, we developed Elfin, a genetic algorithm for the design of novel proteins with custom shapes using structural building blocks derived from experimentally verified repeat proteins. By combining building blocks with compatible interfaces, it is possible to rapidly build non-symmetric large structures (>1000 amino acids) that match three-dimensional geometric descriptions provided by the user. A run time of about 20min on a laptop computer for a 3000 amino acid structure makes Elfin accessible to users with limited computational resources. Protein structures with controlled geometry will allow the systematic study of the effect of spatial arrangement of enzymes and signaling molecules, and provide new scaffolds for functional nanomaterials. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Use of galerina marginata genes and proteins for peptide production

    Science.gov (United States)

    Hallen-Adams, Heather E.; Scott-Craig, John S.; Walton, Jonathan D.; Luo, Hong

    2018-04-03

    The present invention relates to compositions and methods comprising genes and peptides associated with cyclic peptides and cyclic peptide production in mushrooms. In particular, the present invention relates to using genes and proteins from Galerina species encoding peptides specifically relating to amatoxins in addition to proteins involved with processing cyclic peptide toxins. In a preferred embodiment, the present invention also relates to methods for making small peptides and small cyclic peptides including peptides similar to amanitin. Further, the present inventions relate to providing kits for making small peptides.

  9. Use of Galerina marginata genes and proteins for peptide production

    Energy Technology Data Exchange (ETDEWEB)

    Hallen-Adams, Heather E.; Scott-Craig, John S.; Walton, Jonathan D.; Luo, Hong

    2017-03-21

    The present invention relates to compositions and methods comprising genes and peptides associated with cyclic peptides and cyclic peptide production in mushrooms. In particular, the present invention relates to using genes and proteins from Galerina species encoding peptides specifically relating to amatoxins in addition to proteins involved with processing cyclic peptide toxins. In a preferred embodiment, the present invention also relates to methods for making small peptides and small cyclic peptides including peptides similar to amanitin. Further, the present inventions relate to providing kits for making small peptides.

  10. Armadillo Repeat Containing 8α Binds to HRS and Promotes HRS Interaction with Ubiquitinated Proteins

    Science.gov (United States)

    Tomaru, Koji; Ueda, Atsuhisa; Suzuki, Takeyuki; Kobayashi, Nobuaki; Yang, Jun; Yamamoto, Masaki; Takeno, Mitsuhiro; Kaneko, Takeshi; Ishigatsubo, Yoshiaki

    2010-01-01

    Recently, we reported that a complex with an essential role in the degradation of Fructose-1,6-bisphosphatase in yeast is well conserved in mammalian cells; we named this mammalian complex C-terminal to the Lissencephaly type-1-like homology (CTLH) complex. Although the function of the CTLH complex remains unclear, here we used yeast two-hybrid screening to isolate Hepatocyte growth factor-regulated tyrosine kinase substrate (HRS) as a protein binding to a key component of CTLH complex, Armadillo repeat containing 8 (ARMc8) α. The association was confirmed by a yeast two-hybrid assay and a co-immunoprecipitation assay. The proline-rich domain of HRS was essential for the association. As demonstrated through immunofluorescence microscopy, ARMc8α co-localized with HRS. ARMc8α promoted the interaction of HRS with various ubiquitinated proteins through the ubiquitin-interacting motif. These findings suggest that HRS mediates protein endosomal trafficking partly through its interaction with ARMc8α. PMID:20224683

  11. Large-scale studies of the HphI insulin gene variable-number-of-tandem-repeats polymorphism in relation to Type 2 diabetes mellitus and insulin release

    DEFF Research Database (Denmark)

    Hansen, S K; Gjesing, A P; Rasmussen, S K

    2004-01-01

    The class III allele of the variable-number-of-tandem-repeats polymorphism located 5' of the insulin gene (INS-VNTR) has been associated with Type 2 diabetes and altered birthweight. It has also been suggested, although inconsistently, that the class III allele plays a role in glucose-induced ins......The class III allele of the variable-number-of-tandem-repeats polymorphism located 5' of the insulin gene (INS-VNTR) has been associated with Type 2 diabetes and altered birthweight. It has also been suggested, although inconsistently, that the class III allele plays a role in glucose...

  12. Rotavirus nonstructural protein 1 antagonizes innate immune response by interacting with retinoic acid inducible gene I

    Directory of Open Access Journals (Sweden)

    Qin Lan

    2011-12-01

    Full Text Available Abstract Background The nonstructural protein 1 (NSP1 of rotavirus has been reported to block interferon (IFN signaling by mediating proteasome-dependent degradation of IFN-regulatory factors (IRFs and (or the β-transducin repeat containing protein (β-TrCP. However, in addition to these targets, NSP1 may subvert innate immune responses via other mechanisms. Results The NSP1 of rotavirus OSU strain as well as the IRF3 binding domain truncated NSP1 of rotavirus SA11 strain are unable to degrade IRFs, but can still inhibit host IFN response, indicating that NSP1 may target alternative host factor(s other than IRFs. Overexpression of NSP1 can block IFN-β promoter activation induced by the retinoic acid inducible gene I (RIG-I, but does not inhibit IFN-β activation induced by the mitochondrial antiviral-signaling protein (MAVS, indicating that NSP1 may target RIG-I. Immunoprecipitation experiments show that NSP1 interacts with RIG-I independent of IRF3 binding domain. In addition, NSP1 induces down-regulation of RIG-I in a proteasome-independent way. Conclusions Our findings demonstrate that inhibition of RIG-I mediated type I IFN responses by NSP1 may contribute to the immune evasion of rotavirus.

  13. Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT){sub n} {center_dot} (GA){sub n} microsatellite embedded within the U2 repeat unit

    Energy Technology Data Exchange (ETDEWEB)

    Liao, D.; Weiner, A.M. [Yale Univ., New Haven, CT (United States)

    1995-12-10

    The RNU2 locus encoding human U2 small nuclear RNA (snRNA) is organized as a nearly perfect tandem array containing 5 to 22 copies of a 5.8-kb repeat unit. Just downstream of the U2 snRNA gene in each 5.8-kb repeat unit lies a large (CT){sub n}{center_dot}(GA){sub n} dinucleotide repeat (n {approx} 70). This form of genomic organization, in which one repeat is embedded within another, provides an unusual opportunity to study the balance of forces maintaining the homogeneity of both kinds of repeats. Using a combination of field inversion gel electrophoresis and polymerase chain reaction, we have been able to study the CT microsatellites within individual U2 tandem arrays. We find that the CT microsatellites within an RNU2 allele exhibit significant length polymorphism, despite the remarkable homogeneity of the surrounding U2 repeat units. Length polymorphism is due primarily to loss or gain of CT dinucleotide repeats, but other types of deletions, insertions, and substitutions are also frequent. Polymorphism is greatly reduced in regions where pure (CT){sub n} tracts are interrupted by occasional G residues, suggesting that irregularities stabilize both the length and the sequence of the dinucleotide repeat. We further show that the RNU2 loci of other catarrhine primates (gorilla, chimpanzee, ogangutan, and baboon) contain orthologous CT microsatellites; these also exhibit length polymorphism, but are highly divergent from each other. Thus, although the CT microsatellite is evolving far more rapidly than the rest of the U2 repeat unit, it has persisted through multiple speciation events spanning >35 Myr. The persistence of the CT microsatellite, despite polymorphism and rapid evolution, suggests that it might play a functional role in concerted evolution of the RNU2 loci, perhaps as an initiation site for recombination and/or gene conversion. 70 refs., 5 figs.

  14. Sieve element occlusion (SEO) genes encode structural phloem proteins involved in wound sealing of the phloem.

    Science.gov (United States)

    Ernst, Antonia M; Jekat, Stephan B; Zielonka, Sascia; Müller, Boje; Neumann, Ulla; Rüping, Boris; Twyman, Richard M; Krzyzanek, Vladislav; Prüfer, Dirk; Noll, Gundula A

    2012-07-10

    The sieve element occlusion (SEO) gene family originally was delimited to genes encoding structural components of forisomes, which are specialized crystalloid phloem proteins found solely in the Fabaceae. More recently, SEO genes discovered in various non-Fabaceae plants were proposed to encode the common phloem proteins (P-proteins) that plug sieve plates after wounding. We carried out a comprehensive characterization of two tobacco (Nicotiana tabacum) SEO genes (NtSEO). Reporter genes controlled by the NtSEO promoters were expressed specifically in immature sieve elements, and GFP-SEO fusion proteins formed parietal agglomerates in intact sieve elements as well as sieve plate plugs after wounding. NtSEO proteins with and without fluorescent protein tags formed agglomerates similar in structure to native P-protein bodies when transiently coexpressed in Nicotiana benthamiana, and the analysis of these protein complexes by electron microscopy revealed ultrastructural features resembling those of native P-proteins. NtSEO-RNA interference lines were essentially devoid of P-protein structures and lost photoassimilates more rapidly after injury than control plants, thus confirming the role of P-proteins in sieve tube sealing. We therefore provide direct evidence that SEO genes in tobacco encode P-protein subunits that affect translocation. We also found that peptides recently identified in fascicular phloem P-protein plugs from squash (Cucurbita maxima) represent cucurbit members of the SEO family. Our results therefore suggest a common evolutionary origin for P-proteins found in the sieve elements of all dicotyledonous plants and demonstrate the exceptional status of extrafascicular P-proteins in cucurbits.

  15. The Mycobacterium leprae antigen 85 complex gene family: identification of the genes for the 85A, 85C, and related MPT51 proteins

    NARCIS (Netherlands)

    Rinke de Wit, T. F.; Bekelie, S.; Osland, A.; Wieles, B.; Janson, A. A.; Thole, J. E.

    1993-01-01

    The genes for two novel members (designated 85A and 85C) of the Mycobacterium leprae antigen 85 complex family of proteins and the gene for the closely related M. leprae MPT51 protein were isolated. The complete DNA sequence of the M. leprae 85C gene and partial sequences of the 85A and MPT51 genes

  16. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes.

    Science.gov (United States)

    Kumar, Dhirendra; Mondal, Anupam Kumar; Kutum, Rintu; Dash, Debasis

    2016-01-01

    Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are more in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies, their taxonomic distribution, and also urge for targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Protein-Protein Interactions Prediction Based on Iterative Clique Extension with Gene Ontology Filtering

    Directory of Open Access Journals (Sweden)

    Lei Yang

    2014-01-01

    Full Text Available Cliques (maximal complete subnets in protein-protein interaction (PPI network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPI complement the data defection from biological experiments. However, clique-based predicting methods only depend on the topology of network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based method of prediction and gene ontology (GO annotations to overcome the shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.

  18. Canine candidate genes for dilated cardiomyopathy: annotation of and polymorphic markers for 14 genes

    Directory of Open Access Journals (Sweden)

    van Oost Bernard A

    2007-10-01

    Full Text Available Abstract Background Dilated cardiomyopathy is a myocardial disease occurring in humans and domestic animals and is characterized by dilatation of the left ventricle, reduced systolic function and increased sphericity of the left ventricle. Dilated cardiomyopathy has been observed in several, mostly large and giant, dog breeds, such as the Dobermann and the Great Dane. A number of genes have been identified, which are associated with dilated cardiomyopathy in the human, mouse and hamster. These genes mainly encode structural proteins of the cardiac myocyte. Results We present the annotation of, and marker development for, 14 of these genes of the dog genome, i.e. α-cardiac actin, caveolin 1, cysteine-rich protein 3, desmin, lamin A/C, LIM-domain binding factor 3, myosin heavy polypeptide 7, phospholamban, sarcoglycan δ, titin cap, α-tropomyosin, troponin I, troponin T and vinculin. A total of 33 Single Nucleotide Polymorphisms were identified for these canine genes and 11 polymorphic microsatellite repeats were developed. Conclusion The presented polymorphisms provide a tool to investigate the role of the corresponding genes in canine Dilated Cardiomyopathy by linkage analysis or association studies.

  19. Molecular quantification of genes encoding for green-fluorescent proteins

    DEFF Research Database (Denmark)

    Felske, A; Vandieken, V; Pauling, B V

    2003-01-01

    A quantitative PCR approach is presented to analyze the amount of recombinant green fluorescent protein (gfp) genes in environmental DNA samples. The quantification assay is a combination of specific PCR amplification and temperature gradient gel electrophoresis (TGGE). Gene quantification...... PCR strategy is a highly specific and sensitive way to monitor recombinant DNA in environments like the efflux of a biotechnological plant....

  20. Hybrid Sterility in Rice (Oryza sativa L.) Involves the Tetratricopeptide Repeat Domain Containing Protein.

    Science.gov (United States)

    Yu, Yang; Zhao, Zhigang; Shi, Yanrong; Tian, Hua; Liu, Linglong; Bian, Xiaofeng; Xu, Yang; Zheng, Xiaoming; Gan, Lu; Shen, Yumin; Wang, Chaolong; Yu, Xiaowen; Wang, Chunming; Zhang, Xin; Guo, Xiuping; Wang, Jiulin; Ikehashi, Hiroshi; Jiang, Ling; Wan, Jianmin

    2016-07-01

    Intersubspecific hybrid sterility is a common form of reproductive isolation in rice (Oryza sativa L.), which significantly hampers the utilization of heterosis between indica and japonica varieties. Here, we elucidated the mechanism of S7, which specially causes Aus-japonica/indica hybrid female sterility, through cytological and genetic analysis, map-based cloning, and transformation experiments. Abnormal positioning of polar nuclei and smaller embryo sac were observed in F1 compared with male and female parents. Female gametes carrying S7(cp) and S7(i) were aborted in S7(ai)/S7(cp) and S7(ai)/S7(i), respectively, whereas they were normal in both N22 and Dular possessing a neutral allele, S7(n) S7 was fine mapped to a 139-kb region in the centromere region on chromosome 7, where the recombination was remarkably suppressed due to aggregation of retrotransposons. Among 16 putative open reading frames (ORFs) localized in the mapping region, ORF3 encoding a tetratricopeptide repeat domain containing protein was highly expressed in the pistil. Transformation experiments demonstrated that ORF3 is the candidate gene: downregulated expression of ORF3 restored spikelet fertility and eliminated absolutely preferential transmission of S7(ai) in heterozygote S7(ai)/S7(cp); sterility occurred in the transformants Cpslo17-S7(ai) Our results may provide implications for overcoming hybrid embryo sac sterility in intersubspecific hybrid rice and utilization of hybrid heterosis for cultivated rice improvement. Copyright © 2016 by the Genetics Society of America.

  1. Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor

    Directory of Open Access Journals (Sweden)

    Takano Eriko

    2011-09-01

    Full Text Available Abstract Background Streptomyces coelicolor, a model organism of antibiotic producing bacteria, has one of the largest genomes of the bacterial kingdom, including 7825 predicted protein coding genes. A large number of these genes, nearly 34%, are functionally orphan (hypothetical proteins with unknown function. However, in gene expression time course data, many of these functionally orphan genes show interesting expression patterns. Results In this paper, we analyzed all functionally orphan genes of Streptomyces coelicolor and identified a list of "high priority" orphans by combining gene expression analysis and additional phylogenetic information (i.e. the level of evolutionary conservation of each protein. Conclusions The prioritized orphan genes are promising candidates to be examined experimentally in the lab for further characterization of their function.

  2. Gene-specific correlation of RNA and protein levels in human cells and tissues

    DEFF Research Database (Denmark)

    Edfors, Fredrik; Danielsson, Frida; Hallström, Björn M.

    2016-01-01

    An important issue for molecular biology is to establish whether transcript levels of a given gene can be used as proxies for the corresponding protein levels. Here, we have developed a targeted proteomics approach for a set of human non-secreted proteins based on parallel reaction monitoring...... to measure, at steady-state conditions, absolute protein copy numbers across human tissues and cell lines and compared these levels with the corresponding mRNA levels using transcriptomics. The study shows that the transcript and protein levels do not correlate well unless a gene-specific RNA-to-protein (RTP...

  3. Revisiting the missing protein-coding gene catalog of the domestic dog

    Directory of Open Access Journals (Sweden)

    Galibert Francis

    2009-02-01

    Full Text Available Abstract Background Among mammals for which there is a high sequence coverage, the whole genome assembly of the dog is unique in that it predicts a low number of protein-coding genes, ~19,000, compared to the over 20,000 reported for other mammalian species. Of particular interest are the more than 400 of genes annotated in primates and rodent genomes, but missing in dog. Results Using over 14,000 orthologous genes between human, chimpanzee, mouse rat and dog, we built multiple pairwise synteny maps to infer short orthologous intervals that were targeted for characterizing the canine missing genes. Based on gene prediction and a functionality test using the ratio of replacement to silent nucleotide substitution rates (dN/dS, we provide compelling structural and functional evidence for the identification of 232 new protein-coding genes in the canine genome and 69 gene losses, characterized as undetected gene or pseudogenes. Gene loss phyletic pattern analysis using ten species from chicken to human allowed us to characterize 28 canine-specific gene losses that have functional orthologs continuously from chicken or marsupials through human, and 10 genes that arose specifically in the evolutionary lineage leading to rodent and primates. Conclusion This study demonstrates the central role of comparative genomics for refining gene catalogs and exploring the evolutionary history of gene repertoires, particularly as applied for the characterization of species-specific gene gains and losses.

  4. P-Finder: Reconstruction of Signaling Networks from Protein-Protein Interactions and GO Annotations.

    Science.gov (United States)

    Young-Rae Cho; Yanan Xin; Speegle, Greg

    2015-01-01

    Because most complex genetic diseases are caused by defects of cell signaling, illuminating a signaling cascade is essential for understanding their mechanisms. We present three novel computational algorithms to reconstruct signaling networks between a starting protein and an ending protein using genome-wide protein-protein interaction (PPI) networks and gene ontology (GO) annotation data. A signaling network is represented as a directed acyclic graph in a merged form of multiple linear pathways. An advanced semantic similarity metric is applied for weighting PPIs as the preprocessing of all three methods. The first algorithm repeatedly extends the list of nodes based on path frequency towards an ending protein. The second algorithm repeatedly appends edges based on the occurrence of network motifs which indicate the link patterns more frequently appearing in a PPI network than in a random graph. The last algorithm uses the information propagation technique which iteratively updates edge orientations based on the path strength and merges the selected directed edges. Our experimental results demonstrate that the proposed algorithms achieve higher accuracy than previous methods when they are tested on well-studied pathways of S. cerevisiae. Furthermore, we introduce an interactive web application tool, called P-Finder, to visualize reconstructed signaling networks.

  5. Identification and analysis of YELLOW protein family genes in the silkworm, Bombyx mori

    Directory of Open Access Journals (Sweden)

    Yi Yong-Zhu

    2006-08-01

    Full Text Available Abstract Background The major royal jelly proteins/yellow (MRJP/YELLOW family possesses several physiological and chemical functions in the development of Apis mellifera and Drosophila melanogaster. Each protein of the family has a conserved domain named MRJP. However, there is no report of MRJP/YELLOW family proteins in the Lepidoptera. Results Using the YELLOW protein sequence in Drosophila melanogaster to BLAST silkworm EST database, we found a gene family composed of seven members with a conserved MRJP domain each and named it YELLOW protein family of Bombyx mori. We completed the cDNA sequences with RACE method. The protein of each member possesses a MRJP domain and a putative cleavable signal peptide consisting of a hydrophobic sequence. In view of genetic evolution, the whole Bm YELLOW protein family composes a monophyletic group, which is distinctly separate from Drosophila melanogaster and Apis mellifera. We then showed the tissue expression profiles of Bm YELLOW protein family genes by RT-PCR. Conclusion A Bombyx mori YELLOW protein family is found to be composed of at least seven members. The low homogeneity and unique pattern of gene expression by each member among the family ensure us to prophesy that the members of Bm YELLOW protein family would play some important physiological functions in silkworm development.

  6. Variable presence of the inverted repeat and plastome stability in Erodium.

    Science.gov (United States)

    Blazier, John C; Jansen, Robert K; Mower, Jeffrey P; Govindu, Madhu; Zhang, Jin; Weng, Mao-Lun; Ruhlman, Tracey A

    2016-06-01

    Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts. © The Author 2016

  7. Orthogonal Cas9 proteins for RNA-guided gene regulation and editing

    Science.gov (United States)

    Church, George M.; Esvelt, Kevin; Mali, Prashant

    2017-03-07

    Methods of modulating expression of a target nucleic acid in a cell are provided including use of multiple orthogonal Cas9 proteins to simultaneously and independently regulate corresponding genes or simultaneously and independently edit corresponding genes.

  8. Revised Mimivirus major capsid protein sequence reveals intron-containing gene structure and extra domain

    Directory of Open Access Journals (Sweden)

    Suzan-Monti Marie

    2009-05-01

    Full Text Available Abstract Background Acanthamoebae polyphaga Mimivirus (APM is the largest known dsDNA virus. The viral particle has a nearly icosahedral structure with an internal capsid shell surrounded with a dense layer of fibrils. A Capsid protein sequence, D13L, was deduced from the APM L425 coding gene and was shown to be the most abundant protein found within the viral particle. However this protein remained poorly characterised until now. A revised protein sequence deposited in a database suggested an additional N-terminal stretch of 142 amino acids missing from the original deduced sequence. This result led us to investigate the L425 gene structure and the biochemical properties of the complete APM major Capsid protein. Results This study describes the full length 3430 bp Capsid coding gene and characterises the 593 amino acids long corresponding Capsid protein 1. The recombinant full length protein allowed the production of a specific monoclonal antibody able to detect the Capsid protein 1 within the viral particle. This protein appeared to be post-translationnally modified by glycosylation and phosphorylation. We proposed a secondary structure prediction of APM Capsid protein 1 compared to the Capsid protein structure of Paramecium Bursaria Chlorella Virus 1, another member of the Nucleo-Cytoplasmic Large DNA virus family. Conclusion The characterisation of the full length L425 Capsid coding gene of Acanthamoebae polyphaga Mimivirus provides new insights into the structure of the main Capsid protein. The production of a full length recombinant protein will be useful for further structural studies.

  9. HinT proteins and their putative interaction partners in Mollicutes and Chlamydiaceae

    Directory of Open Access Journals (Sweden)

    Hegemann Johannes H

    2005-05-01

    Full Text Available Background HinT proteins are found in prokaryotes and eukaryotes and belong to the superfamily of HIT proteins, which are characterized by an histidine-triad sequence motif. While the eukaryotic variants hydrolyze AMP derivates and modulate transcription, the function of prokaryotic HinT proteins is less clearly defined. In Mycoplasma hominis, HinT is concomitantly expressed with the proteins P60 and P80, two domains of a surface exposed membrane complex, and in addition interacts with the P80 moiety. Results An cluster of hitABL genes, similar to that of M. hominis was found in M. pulmonis, M. mycoides subspecies mycoides SC, M. mobile and Mesoplasma florum. RT-PCR analyses provided evidence that the P80, P60 and HinT homologues of M. pulmonis were polycistronically organized, suggesting a genetic and physical interaction between the proteins encoded by these genes in these species. While the hit loci of M. pneumoniae and M. genitalium encoded, in addition to HinT, a protein with several transmembrane segments, the hit locus of Ureaplasma parvum encoded a pore-forming protein, UU270, a P60 homologue, UU271, HinT, UU272, and a membrane protein of unknown function, UU273. Although a full-length mRNA spanning the four genes was not detected, amplification of all intergenic regions from the center of UU270 to the end of UU273 by RT-PCR may be indicative of a common, but unstable mRNA. In Chlamydiaceae the hit gene is flanked upstream by a gene predicted to encode a metal dependent hydrolase and downstream by a gene putatively encoding a protein with ARM-repeats, which are known to be involved in protein-protein interactions. In RT-PCR analyses of C. pneumoniae, regions comprising only two genes, Cp265/Cp266 and Cp266/Cp267 were able to be amplified. In contrast to this in vivo interaction analysis using the yeast two-hybrid system and in vitro immune co-precipitation revealed an interaction between Cp267, which contains the ARM repeats, Cp265, the

  10. Gene therapy strategy for long-term myocardial protection using adeno-associated virus-mediated delivery of heme oxygenase gene.

    Science.gov (United States)

    Melo, Luis G; Agrawal, Reitu; Zhang, Lunan; Rezvani, Mojgan; Mangi, Abeel A; Ehsan, Afshin; Griese, Daniel P; Dell'Acqua, Giorgio; Mann, Michael J; Oyama, Junichi; Yet, Shaw-Fang; Layne, Matthew D; Perrella, Mark A; Dzau, Victor J

    2002-02-05

    Ischemia and oxidative stress are the leading mechanisms for tissue injury. An ideal strategy for preventive/protective therapy would be to develop an approach that could confer long-term transgene expression and, consequently, tissue protection from repeated ischemia/reperfusion injury with a single administration of a therapeutic gene. In the present study, we used recombinant adeno-associated virus (rAAV) as a vector for direct delivery of the cytoprotective gene heme oxygenase-1 (HO-1) into the rat myocardium, with the purpose of evaluating this strategy as a therapeutic approach for long-term protection from ischemia-induced myocardial injury. Human HO-1 gene (hHO-1) was delivered to normal rat hearts by intramyocardial injection. AAV-mediated transfer of the hHO-1 gene 8 weeks before acute coronary artery ligation and release led to a dramatic reduction (>75%) in left ventricular myocardial infarction. The reduction in infarct size was accompanied by decreases in myocardial lipid peroxidation and in proapoptotic Bax and proinflammatory interleukin-1beta protein abundance, concomitant with an increase in antiapoptotic Bcl-2 protein level. This suggested that the transgene exerts its cardioprotective effects in part by reducing oxidative stress and associated inflammation and apoptotic cell death. This study documents the beneficial therapeutic effect of rAAV-mediated transfer, before myocardial injury, of a cytoprotective gene that confers long-term myocardial protection from ischemia/reperfusion injury. Our data suggest that this novel "pre-event" gene transfer approach may provide sustained tissue protection from future repeated episodes of injury and may be beneficial as preventive therapy for patients with or at risk of developing coronary ischemic events.

  11. Characterisation of silent and active genes for a variable large protein of Borrelia recurrentis

    Directory of Open Access Journals (Sweden)

    Scragg Ian G

    2002-10-01

    Full Text Available Abstract Background We report the characterisation of the variable large protein (vlp gene expressed by clinical isolate A1 of Borrelia recurrentis; the agent of the life-threatening disease louse-borne relapsing fever. Methods The major vlp protein of this isolate was characterised and a DNA probe created. Use of this together with standard molecular methods was used to determine the location of the vlp1B. recurrentis A1 gene in both this and other isolates. Results This isolate was found to carry silent and expressed copies of the vlp1B. recurrentis A1 gene on plasmids of 54 kbp and 24 kbp respectively, whereas a different isolate, A17, had only the silent vlp1B. recurrentis A17 on a 54 kbp plasmid. Silent and expressed vlp1 have identical mature protein coding regions but have different 5' regions, both containing different potential lipoprotein leader sequences. Only one form of vlp1 is transcribed in the A1 isolate of B. recurrentis, yet both 5' upstream sequences of this vlp1 gene possess features of bacterial promoters. Conclusion Taken together these results suggest that antigenic variation in B. recurrentis may result from recombination of variable large and small protein genes at the junction between lipoprotein leader sequence and mature protein coding region. However, this hypothetical model needs to be validated by further identification of expressed and silent variant protein genes in other B. recurrentis isolates.

  12. Human Immunodeficiency Virus Tat-Activated Expression of Poliovirus Protein 2A Inhibits mRNA Translation

    Science.gov (United States)

    Sun, Xiao-Hong; Baltimore, David

    1989-04-01

    To study the effect of poliovirus protein 2A on cellular RNA translation, the tat control system of human immunodeficiency virus (HIV) was used. Protein 2A was expressed from a plasmid construct (pHIV/2A) incorporating the HIV long terminal repeat. Protein synthesis was measured by using chloramphenicol acetyltransferase as a reporter gene driven by the Rous sarcoma virus long terminal repeat. When HIV/2A was contransfected with the reporter, addition of a tat-producing plasmid caused at least a 50-fold drop in chloramphenicol acetyltransferase synthesis. A HeLa cell line carrying HIV/2A was established. In it, tat expression caused more than a 10-fold drop in chloramphenicol acetyltransferase synthesis from the reporter plasmid. Furthermore, 2A induction by tat caused cleavage of the cellular translation factor P220, a part of eukaryotic translation initiation factor 4F. Thus protein 2A can, by itself, carry out the inhibition of cellular protein synthesis characteristic of a poliovirus infection. Also, the HIV tat activation provides a very effective method to control gene expression in mammalian cells.

  13. Association of the polymorphism of the CAG repeat in the mitochondrial DNA polymerase gamma gene (POLG) with testicular germ-cell cancer

    DEFF Research Database (Denmark)

    Blomberg Jensen, M; Leffers, H; Petersen, J H

    2008-01-01

    BACKGROUND: A possible association between the polymorphic CAG repeat in the DNA polymerase gamma (POLG) gene and the risk of testicular germ-cell tumours (TGCT) was investigated in this study. The hypothesis was prompted by an earlier preliminary study proposing an association of the absence...

  14. A strain-variable bacteriocin in Bacillus anthracis and Bacillus cereus with repeated Cys-Xaa-Xaa motifs

    Directory of Open Access Journals (Sweden)

    Haft Daniel H

    2009-04-01

    Full Text Available Abstract Bacteriocins are peptide antibiotics from ribosomally translated precursors, produced by bacteria often through extensive post-translational modification. Minimal sequence conservation, short gene lengths, and low complexity sequence can hinder bacteriocin identification, even during gene calling, so they are often discovered by proximity to accessory genes encoding maturation, immunity, and export functions. This work reports a new subfamily of putative thiazole-containing heterocyclic bacteriocins. It appears universal in all strains of Bacillus anthracis and B. cereus, but has gone unrecognized because it is always encoded far from its maturation protein operon. Patterns of insertions and deletions among twenty-four variants suggest a repeating functional unit of Cys-Xaa-Xaa. Reviewers This article was reviewed by Andrei Osterman and Lakshminarayan Iyer.

  15. RACK1, A Multifaceted Scaffolding Protein: Structure and Function

    LENUS (Irish Health Repository)

    Adams, David R

    2011-10-06

    Abstract The Receptor for Activated C Kinase 1 (RACK1) is a member of the tryptophan-aspartate repeat (WD-repeat) family of proteins and shares significant homology to the β subunit of G-proteins (Gβ). RACK1 adopts a seven-bladed β-propeller structure which facilitates protein binding. RACK1 has a significant role to play in shuttling proteins around the cell, anchoring proteins at particular locations and in stabilising protein activity. It interacts with the ribosomal machinery, with several cell surface receptors and with proteins in the nucleus. As a result, RACK1 is a key mediator of various pathways and contributes to numerous aspects of cellular function. Here, we discuss RACK1 gene and structure and its role in specific signaling pathways, and address how posttranslational modifications facilitate subcellular location and translocation of RACK1. This review condenses several recent studies suggesting a role for RACK1 in physiological processes such as development, cell migration, central nervous system (CN) function and circadian rhythm as well as reviewing the role of RACK1 in disease.

  16. Natural selection in avian protein-coding genes expressed in brain.

    Science.gov (United States)

    Axelsson, Erik; Hultin-Rosenberg, Lina; Brandström, Mikael; Zwahlén, Martin; Clayton, David F; Ellegren, Hans

    2008-06-01

    The evolution of birds from theropod dinosaurs took place approximately 150 million years ago, and was associated with a number of specific adaptations that are still evident among extant birds, including feathers, song and extravagant secondary sexual characteristics. Knowledge about the molecular evolutionary background to such adaptations is lacking. Here, we analyse the evolution of > 5000 protein-coding gene sequences expressed in zebra finch brain by comparison to orthologous sequences in chicken. Mean d(N)/d(S) is 0.085 and genes with their maximal expression in the eye and central nervous system have the lowest mean d(N)/d(S) value, while those expressed in digestive and reproductive tissues exhibit the highest. We find that fast-evolving genes (those which have higher than expected rate of nonsynonymous substitution, indicative of adaptive evolution) are enriched for biological functions such as fertilization, muscle contraction, defence response, response to stress, wounding and endogenous stimulus, and cell death. After alignment to mammalian orthologues, we identify a catalogue of 228 genes that show a significantly higher rate of protein evolution in the two bird lineages than in mammals. These accelerated bird genes, representing candidates for avian-specific adaptations, include genes implicated in vocal learning and other cognitive processes. Moreover, colouration genes evolve faster in birds than in mammals, which may have been driven by sexual selection for extravagant plumage characteristics.

  17. Molecular evolution of the Paramyxoviridae and Rhabdoviridae multiple-protein-encoding P gene.

    Science.gov (United States)

    Jordan, I K; Sutter, B A; McClure, M A

    2000-01-01

    Presented here is an analysis of the molecular evolutionary dynamics of the P gene among 76 representative sequences of the Paramyxoviridae and Rhabdoviridae RNA virus families. In a number of Paramyxoviridae taxa, as well as in vesicular stomatitis viruses of the Rhabdoviridae, the P gene encodes multiple proteins from a single genomic RNA sequence. These products include the phosphoprotein (P), as well as the C and V proteins. The complexity of the P gene makes it an intriguing locus to study from an evolutionary perspective. Amino acid sequence alignments of the proteins encoded at the P and N loci were used in independent phylogenetic reconstructions of the Paramyxoviridae and Rhabdoviridae families. P-gene-coding capacities were mapped onto the Paramyxoviridae phylogeny, and the most parsimonious path of multiple-coding-capacity evolution was determined. Levels of amino acid variation for Paramyxoviridae and Rhabdoviridae P-gene-encoded products were also analyzed. Proteins encoded in overlapping reading frames from the same nucleotides have different levels of amino acid variation. The nucleotide architecture that underlies the amino acid variation was determined in order to evaluate the role of selection in the evolution of the P gene overlapping reading frames. In every case, the evolution of one of the proteins encoded in the overlapping reading frames has been constrained by negative selection while the other has evolved more rapidly. The integrity of the overlapping reading frame that represents a derived state is generally maintained at the expense of the ancestral reading frame encoded by the same nucleotides. The evolution of such multicoding sequences is likely a response by RNA viruses to selective pressure to maximize genomic information content while maintaining small genome size. The ability to evolve such a complex genomic strategy is intimately related to the dynamics of the viral quasispecies, which allow enhanced exploration of the adaptive

  18. Human mismatch repair protein hMutLα is required to repair short slipped-DNAs of trinucleotide repeats.

    Science.gov (United States)

    Panigrahi, Gagan B; Slean, Meghan M; Simard, Jodie P; Pearson, Christopher E

    2012-12-07

    Mismatch repair (MMR) is required for proper maintenance of the genome by protecting against mutations. The mismatch repair system has also been implicated as a driver of certain mutations, including disease-associated trinucleotide repeat instability. We recently revealed a requirement of hMutSβ in the repair of short slip-outs containing a single CTG repeat unit (1). The involvement of other MMR proteins in short trinucleotide repeat slip-out repair is unknown. Here we show that hMutLα is required for the highly efficient in vitro repair of single CTG repeat slip-outs, to the same degree as hMutSβ. HEK293T cell extracts, deficient in hMLH1, are unable to process single-repeat slip-outs, but are functional when complemented with hMutLα. The MMR-deficient hMLH1 mutant, T117M, which has a point mutation proximal to the ATP-binding domain, is defective in slip-out repair, further supporting a requirement for hMLH1 in the processing of short slip-outs and possibly the involvement of hMHL1 ATPase activity. Extracts of hPMS2-deficient HEC-1-A cells, which express hMLH1, hMLH3, and hPMS1, are only functional when complemented with hMutLα, indicating that neither hMutLβ nor hMutLγ is sufficient to repair short slip-outs. The resolution of clustered short slip-outs, which are poorly repaired, was partially dependent upon a functional hMutLα. The joint involvement of hMutSβ and hMutLα suggests that repeat instability may be the result of aberrant outcomes of repair attempts.

  19. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions

    DEFF Research Database (Denmark)

    Luo, Yonglun; Friis, Jenny Blechingberg; Fernandes, Ana Miguel

    2015-01-01

    at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. Conclusions The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes...... involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.......Background FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins...

  20. Repeatability and Reproducibility in Proteomic Identifications by Liquid Chromatography—Tandem Mass Spectrometry

    Science.gov (United States)

    Tabb, David L.; Vega-Montoto, Lorenzo; Rudnick, Paul A.; Variyath, Asokan Mulayath; Ham, Amy-Joan L.; Bunk, David M.; Kilpatrick, Lisa E.; Billheimer, Dean D.; Blackman, Ronald K.; Cardasis, Helene L.; Carr, Steven A.; Clauser, Karl R.; Jaffe, Jacob D.; Kowalski, Kevin A.; Neubert, Thomas A.; Regnier, Fred E.; Schilling, Birgit; Tegeler, Tony J.; Wang, Mu; Wang, Pei; Whiteaker, Jeffrey R.; Zimmerman, Lisa J.; Fisher, Susan J.; Gibson, Bradford W.; Kinsinger, Christopher R.; Mesri, Mehdi; Rodriguez, Henry; Stein, Steven E.; Tempst, Paul; Paulovich, Amanda G.; Liebler, Daniel C.; Spiegelman, Cliff

    2009-01-01

    The complexity of proteomic instrumentation for LC-MS/MS introduces many possible sources of variability. Data-dependent sampling of peptides constitutes a stochastic element at the heart of discovery proteomics. Although this variation impacts the identification of peptides, proteomic identifications are far from completely random. In this study, we analyzed interlaboratory data sets from the NCI Clinical Proteomic Technology Assessment for Cancer to examine repeatability and reproducibility in peptide and protein identifications. Included data spanned 144 LC-MS/MS experiments on four Thermo LTQ and four Orbitrap instruments. Samples included yeast lysate, the NCI-20 defined dynamic range protein mix, and the Sigma UPS 1 defined equimolar protein mix. Some of our findings reinforced conventional wisdom, such as repeatability and reproducibility being higher for proteins than for peptides. Most lessons from the data, however, were more subtle. Orbitraps proved capable of higher repeatability and reproducibility, but aberrant performance occasionally erased these gains. Even the simplest protein digestions yielded more peptide ions than LC-MS/MS could identify during a single experiment. We observed that peptide lists from pairs of technical replicates overlapped by 35–60%, giving a range for peptide-level repeatability in these experiments. Sample complexity did not appear to affect peptide identification repeatability, even as numbers of identified spectra changed by an order of magnitude. Statistical analysis of protein spectral counts revealed greater stability across technical replicates for Orbitraps, making them superior to LTQ instruments for biomarker candidate discovery. The most repeatable peptides were those corresponding to conventional tryptic cleavage sites, those that produced intense MS signals, and those that resulted from proteins generating many distinct peptides. Reproducibility among different instruments of the same type lagged behind

  1. Dissection of Resistance Genes to Pseudomonas syringae pv. phaseolicola in UI3 Common Bean Cultivar.

    Science.gov (United States)

    González, Ana M; Godoy, Luís; Santalla, Marta

    2017-11-23

    Few quantitative trait loci have been mapped for resistance to Pseudomonas syringae pv. phaseolicola in common bean. Two F₂ populations were developed from the host differential UI3 cultivar. The objective of this study was to further characterize the resistance to races 1, 5, 7 and 9 of Psp included in UI3. Using a QTL mapping approach, 16 and 11 main-effect QTLs for pod and primary leaf resistance were located on LG10, explaining up to 90% and 26% of the phenotypic variation, respectively. The homologous genomic region corresponding to primary leaf resistance QTLs detected tested positive for the presence of resistance-associated gene cluster encoding nucleotide-binding and leucine-rich repeat (NL), Natural Resistance Associated Macrophage (NRAMP) and Pentatricopeptide Repeat family (PPR) proteins. It is worth noting that the main effect QTLs for resistance in pod were located inside a 3.5 Mb genomic region that included the Phvul.010G021200 gene, which encodes a protein that has the highest sequence similarity to the RIN4 gene of Arabidopsis, and can be considered an important candidate gene for the organ-specific QTLs identified here. These results support that resistance to Psp from UI3 might result from the immune response activated by combinations of R proteins, and suggest the guard model as an important mechanism in pod resistance to halo blight. The candidate genes identified here warrant functional studies that will help in characterizing the actual defense gene(s) in UI3 genotype.

  2. Detection, characterization and evolution of internal repeats in Chitinases of known 3-D structure.

    Directory of Open Access Journals (Sweden)

    Manigandan Sivaji

    Full Text Available Chitinase proteins have evolved and diversified almost in all organisms ranging from prokaryotes to eukaryotes. During evolution, internal repeats may appear in amino acid sequences of proteins which alter the structural and functional features. Here we deciphered the internal repeats from Chitinase and characterized the structural similarities between them. Out of 24 diverse Chitinase sequences selected, six sequences (2CJL, 2DSK, 2XVP, 2Z37, 3EBV and 3HBE did not contain any internal repeats of amino acid sequences. Ten sequences contained repeats of length <50, and the remaining 8 sequences contained repeat length between 50 and 100 residues. Two Chitinase sequences, 1ITX and 3SIM, were found to be structurally similar when analyzed using secondary structure of Chitinase from secondary and 3-Dimensional structure database of Protein Data Bank. Internal repeats of 3N17 and 1O6I were also involved in the ligand-binding site of those Chitinase proteins, respectively. Our analyses enhance our understanding towards the identification of structural characteristics of internal repeats in Chitinase proteins.

  3. Selection pressure on human STR loci and its relevance in repeat expansion disease

    KAUST Repository

    Shimada, Makoto K.

    2016-06-11

    Short Tandem Repeats (STRs) comprise repeats of one to several base pairs. Because of the high mutability due to strand slippage during DNA synthesis, rapid evolutionary change in the number of repeating units directly shapes the range of repeat-number variation according to selection pressure. However, the remaining questions include: Why are STRs causing repeat expansion diseases maintained in the human population; and why are these limited to neurodegenerative diseases? By evaluating the genome-wide selection pressure on STRs using the database we constructed, we identified two different patterns of relationship in repeat-number polymorphisms between DNA and amino-acid sequences, although both patterns are evolutionary consequences of avoiding the formation of harmful long STRs. First, a mixture of degenerate codons is represented in poly-proline (poly-P) repeats. Second, long poly-glutamine (poly-Q) repeats are favored at the protein level; however, at the DNA level, STRs encoding long poly-Qs are frequently divided by synonymous SNPs. Furthermore, significant enrichments of apoptosis and neurodevelopment were biological processes found specifically in genes encoding poly-Qs with repeat polymorphism. This suggests the existence of a specific molecular function for polymorphic and/or long poly-Q stretches. Given that the poly-Qs causing expansion diseases were longer than other poly-Qs, even in healthy subjects, our results indicate that the evolutionary benefits of long and/or polymorphic poly-Q stretches outweigh the risks of long CAG repeats predisposing to pathological hyper-expansions. Molecular pathways in neurodevelopment requiring long and polymorphic poly-Q stretches may provide a clue to understanding why poly-Q expansion diseases are limited to neurodegenerative diseases. © 2016, Springer-Verlag Berlin Heidelberg.

  4. A positive feedback-based gene circuit to increase the production of a membrane protein

    Directory of Open Access Journals (Sweden)

    Gennis Robert B

    2010-05-01

    Full Text Available Abstract Background Membrane proteins are an important class of proteins, playing a key role in many biological processes, and are a promising target in pharmaceutical development. However, membrane proteins are often difficult to produce in large quantities for the purpose of crystallographic or biochemical analyses. Results In this paper, we demonstrate that synthetic gene circuits designed specifically to overexpress certain genes can be applied to manipulate the expression kinetics of a model membrane protein, cytochrome bd quinol oxidase in E. coli, resulting in increased expression rates. The synthetic circuit involved is an engineered, autoinducer-independent variant of the lux operon activator LuxR from V. fischeri in an autoregulatory, positive feedback configuration. Conclusions Our proof-of-concept experiments indicate a statistically significant increase in the rate of production of the bd oxidase membrane protein. Synthetic gene networks provide a feasible solution for the problem of membrane protein production.

  5. KvLEA, a New Isolated Late Embryogenesis Abundant Protein Gene from Kosteletzkya virginica Responding to Multiabiotic Stresses

    Directory of Open Access Journals (Sweden)

    Xiaoli Tang

    2016-01-01

    Full Text Available The LEA proteins are a kind of hydrophilic proteins, playing main functions in desiccation tolerance. However, their importance as a kind of stress proteins in abiotic stress is being clarified little by little. In this study we isolated, cloned, and identified the first KvLEA gene in Kosteletzkya virginica. Bioinformatic analysis showed that the protein encoded by this gene had common properties of LEA proteins and the multiple sequences alignment and phylogenetic analysis further showed that this protein had high homology with two Arabidopsis LEA proteins. Gene expression analysis revealed that this gene had a higher expression in root and it was induced obviously by salt stress. Moreover, the transcripts of KvLEA were also induced by other abiotic stresses including drought, high temperature, chilling, and ABA treatment. Among these abiotic stresses, ABA treatment brought about the biggest changes to this gene. Collectively, our research discovered a novel LEA gene and uncovered its involvement in multiabiotic stresses in K. virginica. This research not only enriched studies on LEA gene in plant but also would accelerate more studies on K. virginica in the future.

  6. Codon usage and expression level of human mitochondrial 13 protein coding genes across six continents.

    Science.gov (United States)

    Chakraborty, Supriyo; Uddin, Arif; Mazumder, Tarikul Huda; Choudhury, Monisha Nath; Malakar, Arup Kumar; Paul, Prosenjit; Halder, Binata; Deka, Himangshu; Mazumder, Gulshana Akthar; Barbhuiya, Riazul Ahmed; Barbhuiya, Masuk Ahmed; Devi, Warepam Jesmi

    2017-12-02

    The study of codon usage coupled with phylogenetic analysis is an important tool to understand the genetic and evolutionary relationship of a gene. The 13 protein coding genes of human mitochondria are involved in electron transport chain for the generation of energy currency (ATP). However, no work has yet been reported on the codon usage of the mitochondrial protein coding genes across six continents. To understand the patterns of codon usage in mitochondrial genes across six different continents, we used bioinformatic analyses to analyze the protein coding genes. The codon usage bias was low as revealed from high ENC value. Correlation between codon usage and GC3 suggested that all the codons ending with G/C were positively correlated with GC3 but vice versa for A/T ending codons with the exception of ND4L and ND5 genes. Neutrality plot revealed that for the genes ATP6, COI, COIII, CYB, ND4 and ND4L, natural selection might have played a major role while mutation pressure might have played a dominant role in the codon usage bias of ATP8, COII, ND1, ND2, ND3, ND5 and ND6 genes. Phylogenetic analysis indicated that evolutionary relationships in each of 13 protein coding genes of human mitochondria were different across six continents and further suggested that geographical distance was an important factor for the origin and evolution of 13 protein coding genes of human mitochondria. Copyright © 2017 Elsevier B.V. and Mitochondria Research Society. All rights reserved.

  7. Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

    Science.gov (United States)

    Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

    1993-01-01

    Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043

  8. m-Trifluoromethyl-diphenyl Diselenide Regulates Prefrontal Cortical MOR and KOR Protein Levels and Abolishes the Phenotype Induced by Repeated Forced Swim Stress in Mice.

    Science.gov (United States)

    Rosa, Suzan Gonçalves; Pesarico, Ana Paula; Martini, Franciele; Nogueira, Cristina Wayne

    2018-04-05

    The present study aimed to investigate the m-trifluoromethyl-diphenyl diselenide [(m-CF 3 -PhSe) 2 ] effects on prefrontal cortical MOR and KOR protein levels and phenotype induced by repeated forced swim stress (FSS) in mice. Adult Swiss mice were subjected to repeated FSS sessions, and after that, they performed the spontaneous locomotor/exploratory activity, tail suspension, and splash tests. (m-CF 3 -PhSe) 2 (0.1 to 5 mg/kg) was administered to mice 30 min before the first FSS session and 30 min before the subsequent repeated FSS. (m-CF 3 -PhSe) 2 abolished the phenotype induced by repeated FSS in mice. In addition, a single FSS session increased μ but reduced δ-opioid receptor contents, without changing the κ content. Mice subjected to repeated FSS had an increase in the μ content when compared to those of naïve group or subjected to single FSS. Repeated FSS induced an increase of δ-opioid receptor content compared to those mice subjected to single FSS. However, the δ-opioid receptor contents were lower than those found in the naïve group. The mice subjected to repeated FSS showed an increase in the κ-opioid receptor content when compared to that of the naïve mice. (m-CF 3 -PhSe) 2 regulated the protein contents of μ and κ receptors in mice subjected to repeated FSS. These findings demonstrate that (m-CF 3 -PhSe) 2 was effective to abolish the phenotype induced by FSS, which was accompanied by changes in the contents of cortical μ- and κ-opioid receptors.

  9. Emerging strategies for cell and gene therapy of the muscular dystrophies

    OpenAIRE

    Muir, Lindsey A.; Chamberlain, Jeffrey S.

    2009-01-01

    The muscular dystrophies are a heterogeneous group of over 40 disorders that are characterised by muscle weakness and wasting. The most common are Duchenne muscular dystrophy and Becker muscular dystrophy, which result from mutations within the gene encoding dystrophin; myotonic dystrophy type 1, which results from an expanded trinucleotide repeat in the myotonic dystrophy protein kinase gene; and facioscapulohumeral dystrophy, which is associated with contractions in the subtelomeric region ...

  10. Functional modules by relating protein interaction networks and gene expression.

    Science.gov (United States)

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  11. TALEN-Induced Double-Strand Break Repair of CTG Trinucleotide Repeats

    Directory of Open Access Journals (Sweden)

    Valentine Mosbach

    2018-02-01

    Full Text Available Trinucleotide repeat expansions involving CTG/CAG triplets are responsible for several neurodegenerative disorders, including myotonic dystrophy and Huntington’s disease. Because expansions trigger the disease, contracting repeat length could be a possible approach to gene therapy for these disorders. Here, we show that a TALEN-induced double-strand break was very efficient at contracting expanded CTG repeats in yeast. We show that RAD51, POL32, and DNL4 are dispensable for double-strand break repair within CTG repeats, the only required genes being RAD50, SAE2, and RAD52. Resection was totally abolished in the absence of RAD50 on both sides of the break, whereas it was reduced in a sae2Δ mutant on the side of the break containing the longest repeat tract, suggesting that secondary structures at double-strand break ends must be removed by the Mre11-Rad50 complex and Sae2. Following the TALEN double-strand break, single-strand annealing occurred between both sides of the repeat tract, leading to repeat contraction.

  12. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes.

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  13. Exact protein distributions for stochastic models of gene expression using partitioning of Poisson processes

    Science.gov (United States)

    Pendar, Hodjat; Platini, Thierry; Kulkarni, Rahul V.

    2013-04-01

    Stochasticity in gene expression gives rise to fluctuations in protein levels across a population of genetically identical cells. Such fluctuations can lead to phenotypic variation in clonal populations; hence, there is considerable interest in quantifying noise in gene expression using stochastic models. However, obtaining exact analytical results for protein distributions has been an intractable task for all but the simplest models. Here, we invoke the partitioning property of Poisson processes to develop a mapping that significantly simplifies the analysis of stochastic models of gene expression. The mapping leads to exact protein distributions using results for mRNA distributions in models with promoter-based regulation. Using this approach, we derive exact analytical results for steady-state and time-dependent distributions for the basic two-stage model of gene expression. Furthermore, we show how the mapping leads to exact protein distributions for extensions of the basic model that include the effects of posttranscriptional and posttranslational regulation. The approach developed in this work is widely applicable and can contribute to a quantitative understanding of stochasticity in gene expression and its regulation.

  14. Schizosaccharomyces pombe MutSα and MutLα Maintain Stability of Tetra-Nucleotide Repeats and Msh3 of Hepta-Nucleotide Repeats

    Directory of Open Access Journals (Sweden)

    Desirée Villahermosa

    2017-05-01

    Full Text Available Defective mismatch repair (MMR in humans is associated with colon cancer and instability of microsatellites, that is, DNA sequences with one or several nucleotides repeated. Key factors of eukaryotic MMR are the heterodimers MutSα (Msh2-Msh6, which recognizes base-base mismatches and unpaired nucleotides in DNA, and MutLα (Mlh1-Pms1, which facilitates downstream steps. In addition, MutSβ (Msh2-Msh3 recognizes DNA loops of various sizes, although our previous data and the data presented here suggest that Msh3 of Schizosaccharomyces pombe does not play a role in MMR. To test microsatellite stability in S. pombe and hence DNA loop repair, we have inserted tetra-, penta-, and hepta-nucleotide repeats in the ade6 gene and determined their Ade+ reversion rates and spectra in wild type and various mutants. Our data indicate that loops with four unpaired nucleotides in the nascent and the template strand are the upper limit of MutSα- and MutLα-mediated MMR in S. pombe. Stability of hepta-nucleotide repeats requires Msh3 and Exo1 in MMR-independent processes as well as the DNA repair proteins Rad50, Rad51, and Rad2FEN1. Most strikingly, mutation rates in the double mutants msh3 exo1 and msh3 rad51 were decreased when compared to respective single mutants, indicating that Msh3 prevents error prone processes carried out by Exo1 and Rad51. We conclude that Msh3 has no obvious function in MMR in S. pombe, but contributes to DNA repeat stability in MMR-independent processes.

  15. PRMT1 mediated methylation of TAF15 is required for its positive gene regulatory function

    Energy Technology Data Exchange (ETDEWEB)

    Jobert, Laure; Argentini, Manuela [Institut de Genetique et de Biologie Moleculaire et Cellulaire (IGBMC), CNRS UMR 7104, INSERM U 596, Universite Louis Pasteur de Strasbourg, BP 10142 - 67404 Illkirch Cedex, CU de Strasbourg (France); Tora, Laszlo, E-mail: laszlo@igbmc.u-strasbg.fr [Institut de Genetique et de Biologie Moleculaire et Cellulaire (IGBMC), CNRS UMR 7104, INSERM U 596, Universite Louis Pasteur de Strasbourg, BP 10142 - 67404 Illkirch Cedex, CU de Strasbourg (France)

    2009-04-15

    TAF15 (formerly TAF{sub II}68) is a nuclear RNA-binding protein that is associated with a distinct population of TFIID and RNA polymerase II complexes. TAF15 harbours an N-terminal activation domain, an RNA recognition motif (RRM) and many Arg-Gly-Gly (RGG) repeats at its C-terminal end. The N-terminus of TAF15 serves as an essential transforming domain in the fusion oncoprotein created by chromosomal translocation in certain human chondrosarcomas. Post-transcriptional modifications (PTMs) of proteins are known to regulate their activity, however, nothing is known on how PTMs affect TAF15 function. Here we demonstrate that endogenous human TAF15 is methylated in vivo at its numerous RGG repeats. Furthermore, we identify protein arginine N-methyltransferase 1 (PRMT1) as a TAF15 interactor and the major PRMT responsible for its methylation. In addition, the RGG repeat-containing C-terminus of TAF15 is responsible for the shuttling between the nucleus and the cytoplasm and the methylation of RGG repeats affects the subcellular localization of TAF15. The methylation of TAF15 by PRMT1 is required for the ability of TAF15 to positively regulate the expression of the studied endogenous TAF15-target genes. Our findings demonstrate that arginine methylation of TAF15 by PRMT1 is a crucial event determining its proper localization and gene regulatory function.

  16. PRMT1 mediated methylation of TAF15 is required for its positive gene regulatory function

    International Nuclear Information System (INIS)

    Jobert, Laure; Argentini, Manuela; Tora, Laszlo

    2009-01-01

    TAF15 (formerly TAF II 68) is a nuclear RNA-binding protein that is associated with a distinct population of TFIID and RNA polymerase II complexes. TAF15 harbours an N-terminal activation domain, an RNA recognition motif (RRM) and many Arg-Gly-Gly (RGG) repeats at its C-terminal end. The N-terminus of TAF15 serves as an essential transforming domain in the fusion oncoprotein created by chromosomal translocation in certain human chondrosarcomas. Post-transcriptional modifications (PTMs) of proteins are known to regulate their activity, however, nothing is known on how PTMs affect TAF15 function. Here we demonstrate that endogenous human TAF15 is methylated in vivo at its numerous RGG repeats. Furthermore, we identify protein arginine N-methyltransferase 1 (PRMT1) as a TAF15 interactor and the major PRMT responsible for its methylation. In addition, the RGG repeat-containing C-terminus of TAF15 is responsible for the shuttling between the nucleus and the cytoplasm and the methylation of RGG repeats affects the subcellular localization of TAF15. The methylation of TAF15 by PRMT1 is required for the ability of TAF15 to positively regulate the expression of the studied endogenous TAF15-target genes. Our findings demonstrate that arginine methylation of TAF15 by PRMT1 is a crucial event determining its proper localization and gene regulatory function.

  17. GT-repeat polymorphism in the heme oxygenase-1 gene promoter is associated with cardiovascular mortality risk in an arsenic-exposed population in northeastern Taiwan

    International Nuclear Information System (INIS)

    Wu, Meei-Maan; Chiou, Hung-Yi; Chen, Chi-Ling; Wang, Yuan-Hung; Hsieh, Yi-Chen; Lien, Li-Ming; Lee, Te-Chang; Chen, Chien-Jen

    2010-01-01

    Inorganic arsenic has been associated with increased risk of atherosclerotic vascular disease and mortality in humans. A functional GT-repeat polymorphism in the heme oxygenase-1 (HO-1) gene promoter is inversely correlated with the development of coronary artery disease and restenosis after clinical angioplasty. The relationship of HO-1 genotype with arsenic-associated cardiovascular disease has not been studied. In this study, we evaluated the relationship between the HO-1 GT-repeat polymorphism and cardiovascular mortality in an arsenic-exposed population. A total of 504 study participants were followed up for a median of 10.7 years for occurrence of cardiovascular deaths (coronary heart disease, cerebrovascular disease, and peripheral arterial disease). Cardiovascular risk factors and DNA samples for determination of HO-1 GT repeats were obtained at recruitment. GT repeats variants were grouped into the S (< 27 repeats) or L allele (≥ 27 repeats). Relative mortality risk was estimated using Cox regression analysis, adjusted for competing risk of cancer and other causes. For the L/L, L/S, and S/S genotype groups, the crude mortalities for cardiovascular disease were 8.42, 3.10, and 2.85 cases/1000 person-years, respectively. After adjusting for conventional cardiovascular risk factors and competing risk of cancer and other causes, carriers with class S allele (L/S or S/S genotypes) had a significantly reduced risk of cardiovascular mortality compared to non-carriers (L/L genotype) [OR, 0.38; 95% CI, 0.16-0.90]. In contrast, no significant association was observed between HO-1 genotype and cancer mortality or mortality from other causes. Shorter (GT)n repeats in the HO-1 gene promoter may confer protective effects against cardiovascular mortality related to arsenic exposure.

  18. Antibodies to a recombinant glutamate-rich Plasmodium falciparum protein

    DEFF Research Database (Denmark)

    Hogh, B; Petersen, E; Dziegiel, Morten Hanefeld

    1992-01-01

    A Plasmodium falciparum antigen gene coding for a 220-kD glutamate-rich protein (GLURP) has been cloned, and the 783 C-terminal amino acids of this protein (GLURP489-1271) have been expressed as a beta-galactosidase fusion protein in Escherichia coli. The encoded 783 amino acid residues contain two...... areas of repeated amino acid sequences. Antibodies against recombinant GLURP489-1271, as well as against a synthetic peptide corresponding to GLURP899-916, and against a synthetic peptide representing the major glutamate rich repeat sequence from the P. falciparum ring erythrocyte surface antigen (Pf155...... between the anti-GLURP489-1271 and anti-(EENV)6 antibody responses. The data provide indirect evidence for a protective role of antibodies reacting with recombinant GLURP489-1271 as well as with the synthetic peptide (EENV)6 from the Pf155/RESA....

  19. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    Science.gov (United States)

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Sequence analyses reveal that a TPR-DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR-DP domains and prokaryotic GerD proteins.

    Science.gov (United States)

    Hernández Torres, Jorge; Papandreou, Nikolaos; Chomilier, Jacques

    2009-05-01

    The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR-DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR-DP domains.

  1. Expression of genes encoding multi-transmembrane proteins in specific primate taste cell populations.

    Directory of Open Access Journals (Sweden)

    Bryan D Moyer

    Full Text Available BACKGROUND: Using fungiform (FG and circumvallate (CV taste buds isolated by laser capture microdissection and analyzed using gene arrays, we previously constructed a comprehensive database of gene expression in primates, which revealed over 2,300 taste bud-associated genes. Bioinformatics analyses identified hundreds of genes predicted to encode multi-transmembrane domain proteins with no previous association with taste function. A first step in elucidating the roles these gene products play in gustation is to identify the specific taste cell types in which they are expressed. METHODOLOGY/PRINCIPAL FINDINGS: Using double label in situ hybridization analyses, we identified seven new genes expressed in specific taste cell types, including sweet, bitter, and umami cells (TRPM5-positive, sour cells (PKD2L1-positive, as well as other taste cell populations. Transmembrane protein 44 (TMEM44, a protein with seven predicted transmembrane domains with no homology to GPCRs, is expressed in a TRPM5-negative and PKD2L1-negative population that is enriched in the bottom portion of taste buds and may represent developmentally immature taste cells. Calcium homeostasis modulator 1 (CALHM1, a component of a novel calcium channel, along with family members CALHM2 and CALHM3; multiple C2 domains; transmembrane 1 (MCTP1, a calcium-binding transmembrane protein; and anoctamin 7 (ANO7, a member of the recently identified calcium-gated chloride channel family, are all expressed in TRPM5 cells. These proteins may modulate and effect calcium signalling stemming from sweet, bitter, and umami receptor activation. Synaptic vesicle glycoprotein 2B (SV2B, a regulator of synaptic vesicle exocytosis, is expressed in PKD2L1 cells, suggesting that this taste cell population transmits tastant information to gustatory afferent nerve fibers via exocytic neurotransmitter release. CONCLUSIONS/SIGNIFICANCE: Identification of genes encoding multi-transmembrane domain proteins

  2. Impact of Alu repeats on the evolution of human p53 binding sites

    Directory of Open Access Journals (Sweden)

    Sirotin Michael V

    2011-01-01

    Full Text Available Abstract Background The p53 tumor suppressor protein is involved in a complicated regulatory network, mediating expression of ~1000 human genes. Recent studies have shown that many p53 in vivo binding sites (BSs reside in transposable repeats. The relationship between these BSs and functional p53 response elements (REs remains unknown, however. We sought to understand whether the p53 REs also reside in transposable elements and particularly in the most-abundant Alu repeats. Results We have analyzed ~160 functional p53 REs identified so far and found that 24 of them occur in repeats. More than half of these repeat-associated REs reside in Alu elements. In addition, using a position weight matrix approach, we found ~400,000 potential p53 BSs in Alu elements genome-wide. Importantly, these putative BSs are located in the same regions of Alu repeats as the functional p53 REs - namely, in the vicinity of Boxes A/A' and B of the internal RNA polymerase III promoter. Earlier nucleosome-mapping experiments showed that the Boxes A/A' and B have a different chromatin environment, which is critical for the binding of p53 to DNA. Here, we compare the Alu-residing p53 sites with the corresponding Alu consensus sequences and conclude that the p53 sites likely evolved through two different mechanisms - the sites overlapping with the Boxes A/A' were generated by CG → TG mutations; the other sites apparently pre-existed in the progenitors of several Alu subfamilies, such as AluJo and AluSq. The binding affinity of p53 to the Alu-residing sites generally correlates with the age of Alu subfamilies, so that the strongest sites are embedded in the 'relatively young' Alu repeats. Conclusions The primate-specific Alu repeats play an important role in shaping the p53 regulatory network in the context of chromatin. One of the selective factors responsible for the frequent occurrence of Alu repeats in introns may be related to the p53-mediated regulation of Alu

  3. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    Science.gov (United States)

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  4. β-Adrenergic receptor antagonism prevents anxiety-like behavior and microglial reactivity induced by repeated social defeat.

    Science.gov (United States)

    Wohleb, Eric S; Hanke, Mark L; Corona, Angela W; Powell, Nicole D; Stiner, La'Tonia M; Bailey, Michael T; Nelson, Randy J; Godbout, Jonathan P; Sheridan, John F

    2011-04-27

    Psychosocial stress is associated with altered immune function and development of psychological disorders including anxiety and depression. Here we show that repeated social defeat in mice increased c-Fos staining in brain regions associated with fear and threat appraisal and promoted anxiety-like behavior in a β-adrenergic receptor-dependent manner. Repeated social defeat also significantly increased the number of CD11b(+)/CD45(high)/Ly6C(high) macrophages that trafficked to the brain. In addition, several inflammatory markers were increased on the surface of microglia (CD14, CD86, and TLR4) and macrophages (CD14 and CD86) after social defeat. Repeated social defeat also increased the presence of deramified microglia in the medial amygdala, prefrontal cortex, and hippocampus. Moreover, mRNA analysis of microglia indicated that repeated social defeat increased levels of interleukin (IL)-1β and reduced levels of glucocorticoid responsive genes [glucocorticoid-induced leucine zipper (GILZ) and FK506 binding protein-51 (FKBP51)]. The stress-dependent changes in microglia and macrophages were prevented by propranolol, a β-adrenergic receptor antagonist. Microglia isolated from socially defeated mice and cultured ex vivo produced markedly higher levels of IL-6, tumor necrosis factor-α, and monocyte chemoattractant protein-1 after stimulation with lipopolysaccharide compared with microglia from control mice. Last, repeated social defeat increased c-Fos activation in IL-1 receptor type-1-deficient mice, but did not promote anxiety-like behavior or microglia activation in the absence of functional IL-1 receptor type-1. These findings indicate that repeated social defeat-induced anxiety-like behavior and enhanced reactivity of microglia was dependent on activation of β-adrenergic and IL-1 receptors.

  5. EWS and FUS bind a subset of transcribed genes encoding proteins enriched in RNA regulatory functions.

    Science.gov (United States)

    Luo, Yonglun; Blechingberg, Jenny; Fernandes, Ana Miguel; Li, Shengting; Fryland, Tue; Børglum, Anders D; Bolund, Lars; Nielsen, Anders Lade

    2015-11-14

    FUS (TLS) and EWS (EWSR1) belong to the FET-protein family of RNA and DNA binding proteins. FUS and EWS are structurally and functionally related and participate in transcriptional regulation and RNA processing. FUS and EWS are identified in translocation generated cancer fusion proteins and involved in the human neurological diseases amyotrophic lateral sclerosis and fronto-temporal lobar degeneration. To determine the gene regulatory functions of FUS and EWS at the level of chromatin, we have performed chromatin immunoprecipitation followed by next generation sequencing (ChIP-seq). Our results show that FUS and EWS bind to a subset of actively transcribed genes, that binding often is downstream the poly(A)-signal, and that binding overlaps with RNA polymerase II. Functional examinations of selected target genes identified that FUS and EWS can regulate gene expression at different levels. Gene Ontology analyses showed that FUS and EWS target genes preferentially encode proteins involved in regulatory processes at the RNA level. The presented results yield new insights into gene interactions of EWS and FUS and have identified a set of FUS and EWS target genes involved in pathways at the RNA regulatory level with potential to mediate normal and disease-associated functions of the FUS and EWS proteins.

  6. Alterations in gene expression in mutant amyloid precursor protein transgenic mice lacking Niemann-Pick type C1 protein.

    Directory of Open Access Journals (Sweden)

    Mahua Maulik

    Full Text Available Niemann-Pick type C (NPC disease, a rare autosomal recessive disorder caused mostly by mutation in NPC1 gene, is pathologically characterized by the accumulation of free cholesterol in brain and other tissues. This is accompanied by gliosis and loss of neurons in selected brain regions, including the cerebellum. Recent studies have shown that NPC disease exhibits intriguing parallels with Alzheimer's disease, including the presence of neurofibrillary tangles and increased levels of amyloid precursor protein (APP-derived β-amyloid (Aβ peptides in vulnerable brain neurons. To evaluate the role of Aβ in NPC disease, we determined the gene expression profile in selected brain regions of our recently developed bigenic ANPC mice, generated by crossing APP transgenic (Tg mice with heterozygous Npc1-deficient mice. The ANPC mice exhibited exacerbated neuronal and glial pathology compared to other genotypes [i.e., APP-Tg, double heterozygous (Dhet, Npc1-null and wild-type mice]. Analysis of expression profiles of 86 selected genes using real-time RT-PCR arrays showed a wide-spectrum of alterations in the four genotypes compared to wild-type controls. The changes observed in APP-Tg and Dhet mice are limited to only few genes involved mostly in the regulation of cholesterol metabolism, whereas Npc1-null and ANPC mice showed alterations in the expression profiles of a number of genes regulating cholesterol homeostasis, APP metabolism, vesicular trafficking and cell death mechanism in both hippocampus and cerebellum compared to wild-type mice. Intriguingly, ANPC and Npc1-null mice, with some exceptions, exhibited similar changes, although more genes were differentially expressed in the affected cerebellum than the relatively spared hippocampus. The altered gene profiles were found to match with the corresponding protein levels. These results suggest that lack of Npc1 protein can alter the expression profile of selected transcripts as well as proteins, and

  7. The soybean-Phytophthora resistance locus Rps1-k encompasses coiled coil-nucleotide binding-leucine rich repeat-like genes and repetitive sequences

    Directory of Open Access Journals (Sweden)

    Bhattacharyya Madan K

    2008-03-01

    Full Text Available Abstract Background A series of Rps (resistance to Pytophthora sojae genes have been protecting soybean from the root and stem rot disease caused by the Oomycete pathogen, Phytophthora sojae. Five Rps genes were mapped to the Rps1 locus located near the 28 cM map position on molecular linkage group N of the composite genetic soybean map. Among these five genes, Rps1-k was introgressed from the cultivar, Kingwa. Rps1-k has been providing stable and broad-spectrum Phytophthora resistance in the major soybean-producing regions of the United States. Rps1-k has been mapped and isolated. More than one functional Rps1-k gene was identified from the Rps1-k locus. The clustering feature at the Rps1-k locus might have facilitated the expansion of Rps1-k gene numbers and the generation of new recognition specificities. The Rps1-k region was sequenced to understand the possible evolutionary steps that shaped the generation of Phytophthora resistance genes in soybean. Results Here the analyses of sequences of three overlapping BAC clones containing the 184,111 bp Rps1-k region are reported. A shotgun sequencing strategy was applied in sequencing the BAC contig. Sequence analysis predicted a few full-length genes including two Rps1-k genes, Rps1-k-1 and Rps1-k-2. Previously reported Rps1-k-3 from this genomic region 1 was evolved through intramolecular recombination between Rps1-k-1 and Rps1-k-2 in Escherichia coli. The majority of the predicted genes are truncated and therefore most likely they are nonfunctional. A member of a highly abundant retroelement, SIRE1, was identified from the Rps1-k region. The Rps1-k region is primarily composed of repetitive sequences. Sixteen simple repeat and 63 tandem repeat sequences were identified from the locus. Conclusion These data indicate that the Rps1 locus is located in a gene-poor region. The abundance of repetitive sequences in the Rps1-k region suggested that the location of this locus is in or near a

  8. The presence of two S-layer-protein-encoding genes is conserved among species related to Lactobacillus acidophilus

    NARCIS (Netherlands)

    Boot, H.J.; Kolen, C.P.A.M.; Pot, B.; Kersters, K.; Pouwels, P.H.

    1996-01-01

    Previously we have shown that the type strain of Lactobacillus acidophilus possesses two S-protein-encoding genes, one of which is silent, on a chromosomal segment of 6 kb. The S-protein-encoding gene in the expression site can be exchanged for the silent S-protein-encoding gene by inversion of this

  9. Evolution of the duplicated intracellular lipid-binding protein genes of teleost fishes.

    Science.gov (United States)

    Venkatachalam, Ananda B; Parmar, Manoj B; Wright, Jonathan M

    2017-08-01

    Increasing organismal complexity during the evolution of life has been attributed to the duplication of genes and entire genomes. More recently, theoretical models have been proposed that postulate the fate of duplicated genes, among them the duplication-degeneration-complementation (DDC) model. In the DDC model, the common fate of a duplicated gene is lost from the genome owing to nonfunctionalization. Duplicated genes are retained in the genome either by subfunctionalization, where the functions of the ancestral gene are sub-divided between the sister duplicate genes, or by neofunctionalization, where one of the duplicate genes acquires a new function. Both processes occur either by loss or gain of regulatory elements in the promoters of duplicated genes. Here, we review the genomic organization, evolution, and transcriptional regulation of the multigene family of intracellular lipid-binding protein (iLBP) genes from teleost fishes. Teleost fishes possess many copies of iLBP genes owing to a whole genome duplication (WGD) early in the teleost fish radiation. Moreover, the retention of duplicated iLBP genes is substantially higher than the retention of all other genes duplicated in the teleost genome. The fatty acid-binding protein genes, a subfamily of the iLBP multigene family in zebrafish, are differentially regulated by peroxisome proliferator-activated receptor (PPAR) isoforms, which may account for the retention of iLBP genes in the zebrafish genome by the process of subfunctionalization of cis-acting regulatory elements in iLBP gene promoters.

  10. [Variation of CAG repeats in coding region of ATXN2 gene in different ethnic groups].

    Science.gov (United States)

    Chen, Xiao-Chen; Sun, Hao; Mi, Dong-Qing; Huang, Xiao-Qin; Lin, Ke-Qin; Yi, Wen; Yu, Liang; Shi, Lei; Shi, Li; Yang, Zhao-Qing; Chu, Jia-You

    2011-04-01

    Toinvestigate CAG repeats variation of ATXN2 gene coding region in six ethnic groups that live in comparatively different environments, to evaluate whether these variations are under positive selection, and to find factors driving selection effects, 291 unrelated healthy individuals were collected from six ethnic groups and their STR geneotyping was performed. The frequencies of alleles and genotypes were counted and thereby Slatkin's linearized Fst values were calculated. The UPGMA tree against this gene was constructed. The MDS analysis among these groups was carried out as well. The results from the linearized Fst values indicated that there were significant evolutionary differences of the STR in ATXN2 gene between Hui and Yi groups, but not among the other 4 groups. Further analysis was performed by combining our data with published data obtained from other groups. These results indicated that there were significant differences between Japanese and other groups including Hui, Hani, Yunnan Mongolian, and Inner Mongolian. Both Hui and Mongolian from Inner Mongolia were significantly different from Han. In conclusion, the six ethnic groups had their own distribution characterizations of allelic frequencies of ATXN2 STR, and the potential cause of frequency changes in rare alleles could be the consequence of positive selection.

  11. Ribosomal protein S14 transcripts are edited in Oenothera mitochondria.

    Science.gov (United States)

    Schuster, W; Unseld, M; Wissinger, B; Brennicke, A

    1990-01-01

    The gene encoding ribosomal protein S14 (rps14) in Oenothera mitochondria is located upstream of the cytochrome b gene (cob). Sequence analysis of independently derived cDNA clones covering the entire rps14 coding region shows two nucleotides edited from the genomic DNA to the mRNA derived sequences by C to U modifications. A third editing event occurs four nucleotides upstream of the AUG initiation codon and improves a potential ribosome binding site. A CGG codon specifying arginine in a position conserved in evolution between chloroplasts and E. coli as a UGG tryptophan codon is not edited in any of the cDNAs analysed. An inverted repeat 3' of an unidentified open reading frame is located upstream of the rps14 gene. The inverted repeat sequence is highly conserved at analogous regions in other Oenothera mitochondrial loci. Images PMID:2326162

  12. Polymorphisms in the prion protein gene and in the doppel gene increase susceptibility for Creutzfeldt-Jakob disease

    NARCIS (Netherlands)

    E.A. Croes (Esther); B.Z. Alizadeh (Behrooz); A.M. Bertoli Avella (Aida); T.A.M. Rademaker (Tessa); J. Vergeer-Drop (Jeannette); B. Dermaut (Bart); J.J. Houwing-Duistermaat (Jeanine); D.P.W.M. Wientjens (Dorothee); A. Hofman (Albert); C. van Broeckhoven (Christine); C.M. van Duijn (Cornelia)

    2004-01-01

    textabstractThe prion protein gene (PRNP) plays a central role in the origin of Creutzfeldt-Jakob disease (CJD), but there is growing interest in other polymorphisms that may be involved in CJD. Polymorphisms upstream of PRNP that may modulate the prion protein production as well as polymorphisms in

  13. Structural evolution of the 4/1 genes and proteins in non-vascular and lower vascular plants.

    Science.gov (United States)

    Morozov, Sergey Y; Milyutina, Irina A; Bobrova, Vera K; Ryazantsev, Dmitry Y; Erokhina, Tatiana N; Zavriev, Sergey K; Agranovsky, Alexey A; Solovyev, Andrey G; Troitsky, Alexey V

    2015-12-01

    The 4/1 protein of unknown function is encoded by a single-copy gene in most higher plants. The 4/1 protein of Nicotiana tabacum (Nt-4/1 protein) has been shown to be alpha-helical and predominantly expressed in conductive tissues. Here, we report the analysis of 4/1 genes and the encoded proteins of lower land plants. Sequences of a number of 4/1 genes from liverworts, lycophytes, ferns and gymnosperms were determined and analyzed together with sequences available in databases. Most of the vascular plants were found to encode Magnoliophyta-like 4/1 proteins exhibiting previously described gene structure and protein properties. Identification of the 4/1-like proteins in hornworts, liverworts and charophyte algae (sister lineage to all land plants) but not in mosses suggests that 4/1 proteins are likely important for plant development but not required for a primary metabolic function of plant cell. Copyright © 2015 Elsevier B.V. and Société Française de Biochimie et Biologie Moléculaire (SFBBM). All rights reserved.

  14. Topology of transmembrane channel-like gene 1 protein.

    Science.gov (United States)

    Labay, Valentina; Weichert, Rachel M; Makishima, Tomoko; Griffith, Andrew J

    2010-10-05

    Mutations of transmembrane channel-like gene 1 (TMC1) cause hearing loss in humans and mice. TMC1 is the founding member of a family of genes encoding proteins of unknown function that are predicted to contain multiple transmembrane domains. The goal of our study was to define the topology of mouse TMC1 expressed heterologously in tissue culture cells. TMC1 was retained in the endoplasmic reticulum (ER) membrane of five tissue culture cell lines that we tested. We used anti-TMC1 and anti-HA antibodies to probe the topologic orientation of three native epitopes and seven HA epitope tags along full-length TMC1 after selective or complete permeabilization of transfected cells with digitonin or Triton X-100, respectively. TMC1 was present within the ER as an integral membrane protein containing six transmembrane domains and cytosolic N- and C-termini. There is a large cytoplasmic loop, between the fourth and fifth transmembrane domains, with two highly conserved hydrophobic regions that might associate with or penetrate, but do not span, the plasma membrane. Our study is the first to demonstrate that TMC1 is a transmembrane protein. The topologic organization revealed by this study shares some features with that of the shaker-TRP superfamily of ion channels.

  15. DNA-dependent protein kinase inhibits AID-induced antibody gene conversion.

    Directory of Open Access Journals (Sweden)

    Adam J L Cook

    2007-04-01

    Full Text Available Affinity maturation and class switching of antibodies requires activation-induced cytidine deaminase (AID-dependent hypermutation of Ig V(DJ rearrangements and Ig S regions, respectively, in activated B cells. AID deaminates deoxycytidine bases in Ig genes, converting them into deoxyuridines. In V(DJ regions, subsequent excision of the deaminated bases by uracil-DNA glycosylase, or by mismatch repair, leads to further point mutation or gene conversion, depending on the species. In Ig S regions, nicking at the abasic sites produced by AID and uracil-DNA glycosylases results in staggered double-strand breaks, whose repair by nonhomologous end joining mediates Ig class switching. We have tested whether nonhomologous end joining also plays a role in V(DJ hypermutation using chicken DT40 cells deficient for Ku70 or the DNA-dependent protein kinase catalytic subunit (DNA-PKcs. Inactivation of the Ku70 or DNA-PKcs genes in DT40 cells elevated the rate of AID-induced gene conversion as much as 5-fold. Furthermore, DNA-PKcs-deficiency appeared to reduce point mutation. The data provide strong evidence that double-strand DNA ends capable of recruiting the DNA-dependent protein kinase complex are important intermediates in Ig V gene conversion.

  16. Gene Unprediction with Spurio: A tool to identify spurious protein sequences.

    Science.gov (United States)

    Höps, Wolfram; Jeffryes, Matt; Bateman, Alex

    2018-01-01

    We now have access to the sequences of tens of millions of proteins. These protein sequences are essential for modern molecular biology and computational biology. The vast majority of protein sequences are derived from gene prediction tools and have no experimental supporting evidence for their translation.  Despite the increasing accuracy of gene prediction tools there likely exists a large number of spurious protein predictions in the sequence databases.  We have developed the Spurio tool to help identify spurious protein predictions in prokaryotes.  Spurio searches the query protein sequence against a prokaryotic nucleotide database using tblastn and identifies homologous sequences. The tblastn matches are used to score the query sequence's likelihood of being a spurious protein prediction using a Gaussian process model. The most informative feature is the appearance of stop codons within the presumed translation of homologous DNA sequences. Benchmarking shows that the Spurio tool is able to distinguish spurious from true proteins. However, transposon proteins are prone to be predicted as spurious because of the frequency of degraded homologs found in the DNA sequence databases. Our initial experiments suggest that less than 1% of the proteins in the UniProtKB sequence database are likely to be spurious and that Spurio is able to identify over 60 times more spurious proteins than the AntiFam resource. The Spurio software and source code is available under an MIT license at the following URL: https://bitbucket.org/bateman-group/spurio.

  17. Orientia tsutsugamushi ankyrin repeat-containing protein family members are Type 1 secretion system substrates that traffic to the host cell endoplasmic reticulum.

    Science.gov (United States)

    VieBrock, Lauren; Evans, Sean M; Beyer, Andrea R; Larson, Charles L; Beare, Paul A; Ge, Hong; Singh, Smita; Rodino, Kyle G; Heinzen, Robert A; Richards, Allen L; Carlyon, Jason A

    2014-01-01

    Scrub typhus is an understudied, potentially fatal infection that threatens one billion persons in the Asia-Pacific region. How the causative obligate intracellular bacterium, Orientia tsutsugamushi, facilitates its intracellular survival and pathogenesis is poorly understood. Many intracellular bacterial pathogens utilize the Type 1 (T1SS) or Type 4 secretion system (T4SS) to translocate ankyrin repeat-containing proteins (Anks) that traffic to distinct subcellular locations and modulate host cell processes. The O. tsutsugamushi genome encodes one of the largest known bacterial Ank repertoires plus T1SS and T4SS components. Whether these potential virulence factors are expressed during infection, how the Anks are potentially secreted, and to where they localize in the host cell are not known. We determined that O. tsutsugamushi transcriptionally expresses 20 unique ank genes as well as genes for both T1SS and T4SS during infection of mammalian host cells. Examination of the Anks' C-termini revealed that the majority of them resemble T1SS substrates. Escherichia coli expressing a functional T1SS was able to secrete chimeric hemolysin proteins bearing the C-termini of 19 of 20 O. tsutsugamushi Anks in an HlyBD-dependent manner. Thus, O. tsutsugamushi Anks C-termini are T1SS-compatible. Conversely, Coxiella burnetii could not secrete heterologously expressed Anks in a T4SS-dependent manner. Analysis of the subcellular distribution patterns of 20 ectopically expressed Anks revealed that, while 6 remained cytosolic or trafficked to the nucleus, 14 localized to, and in some cases, altered the morphology of the endoplasmic reticulum. This study identifies O. tsutsugamushi Anks as T1SS substrates and indicates that many display a tropism for the host cell secretory pathway.

  18. Transcription factor IID in the Archaea: sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

    Science.gov (United States)

    Marsh, T. L.; Reich, C. I.; Whitelock, R. B.; Olsen, G. J.; Woese, C. R. (Principal Investigator)

    1994-01-01

    The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archael and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archael protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archael protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

  19. Poxviral Ankyrin Proteins

    Directory of Open Access Journals (Sweden)

    Michael H. Herbert

    2015-02-01

    Full Text Available Multiple repeats of the ankyrin motif (ANK are ubiquitous throughout the kingdoms of life but are absent from most viruses. The main exception to this is the poxvirus family, and specifically the chordopoxviruses, with ANK repeat proteins present in all but three species from separate genera. The poxviral ANK repeat proteins belong to distinct orthologue groups spread over different species, and align well with the phylogeny of their genera. This distribution throughout the chordopoxviruses indicates these proteins were present in an ancestral vertebrate poxvirus, and have since undergone numerous duplication events. Most poxviral ANK repeat proteins contain an unusual topology of multiple ANK motifs starting at the N-terminus with a C-terminal poxviral homologue of the cellular F-box enabling interaction with the cellular SCF ubiquitin ligase complex. The subtle variations between ANK repeat proteins of individual poxviruses suggest an array of different substrates may be bound by these protein-protein interaction domains and, via the F-box, potentially directed to cellular ubiquitination pathways and possible degradation. Known interaction partners of several of these proteins indicate that the NF-κB coordinated anti-viral response is a key target, whilst some poxviral ANK repeat domains also have an F-box independent affect on viral host-range.

  20. GREAM: A Web Server to Short-List Potentially Important Genomic Repeat Elements Based on Over-/Under-Representation in Specific Chromosomal Locations, Such as the Gene Neighborhoods, within or across 17 Mammalian Species.

    Directory of Open Access Journals (Sweden)

    Darshan Shimoga Chandrashekar

    Full Text Available Genome-wide repeat sequences, such as LINEs, SINEs and LTRs share a considerable part of the mammalian nuclear genomes. These repeat elements seem to be important for multiple functions including the regulation of transcription initiation, alternative splicing and DNA methylation. But it is not possible to study all repeats and, hence, it would help to short-list before exploring their potential functional significance via experimental studies and/or detailed in silico analyses.We developed the 'Genomic Repeat Element Analyzer for Mammals' (GREAM for analysis, screening and selection of potentially important mammalian genomic repeats. This web-server offers many novel utilities. For example, this is the only tool that can reveal a categorized list of specific types of transposons, retro-transposons and other genome-wide repetitive elements that are statistically over-/under-represented in regions around a set of genes, such as those expressed differentially in a disease condition. The output displays the position and frequency of identified elements within the specified regions. In addition, GREAM offers two other types of analyses of genomic repeat sequences: a enrichment within chromosomal region(s of interest, and b comparative distribution across the neighborhood of orthologous genes. GREAM successfully short-listed a repeat element (MER20 known to contain functional motifs. In other case studies, we could use GREAM to short-list repetitive elements in the azoospermia factor a (AZFa region of the human Y chromosome and those around the genes associated with rat liver injury. GREAM could also identify five over-represented repeats around some of the human and mouse transcription factor coding genes that had conserved expression patterns across the two species.GREAM has been developed to provide an impetus to research on the role of repetitive sequences in mammalian genomes by offering easy selection of more interesting repeats in various

  1. Molecular characterization of a phloem-specific gene encoding the filament protein, phloem protein 1 (PP1), from Cucurbita maxima.

    Science.gov (United States)

    Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A

    1997-07-01

    Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lactin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lactin, further suggesting an interaction between these phloem-specific proteins.

  2. Genome-scale portrait and evolutionary significance of human-specific core promoter tri- and tetranucleotide short tandem repeats.

    Science.gov (United States)

    Nazaripanah, N; Adelirad, F; Delbari, A; Sahaf, R; Abbasi-Asl, T; Ohadi, M

    2018-04-05

    While there is an ongoing trend to identify single nucleotide substitutions (SNSs) that are linked to inter/intra-species differences and disease phenotypes, short tandem repeats (STRs)/microsatellites may be of equal (if not more) importance in the above processes. Genes that contain STRs in their promoters have higher expression divergence compared to genes with fixed or no STRs in the gene promoters. In line with the above, recent reports indicate a role of repetitive sequences in the rise of young transcription start sites (TSSs) in human evolution. Following a comparative genomics study of all human protein-coding genes annotated in the GeneCards database, here we provide a genome-scale portrait of human-specific short- and medium-size (≥ 3-repeats) tri- and tetranucleotide STRs and STR motifs in the critical core promoter region between - 120 and + 1 to the TSS and evidence of skewing of this compartment in reference to the STRs that are not human-specific (Levene's test p human-specific transcripts was detected in the tri and tetra human-specific compartments (mid-p genome-scale skewing of STRs at a specific region of the human genome and a link between a number of these STRs and TSS selection/transcript specificity. The STRs and genes listed here may have a role in the evolution and development of characteristics and phenotypes that are unique to the human species.

  3. Gene Prioritization by Integrated Analysis of Protein Structural and Network Topological Properties for the Protein-Protein Interaction Network of Neurological Disorders

    Directory of Open Access Journals (Sweden)

    Yashna Paul

    2016-01-01

    Full Text Available Neurological disorders are known to show similar phenotypic manifestations like anxiety, depression, and cognitive impairment. There is a need to identify shared genetic markers and molecular pathways in these diseases, which lead to such comorbid conditions. Our study aims to prioritize novel genetic markers that might increase the susceptibility of patients affected with one neurological disorder to other diseases with similar manifestations. Identification of pathways involving common candidate markers will help in the development of improved diagnosis and treatments strategies for patients affected with neurological disorders. This systems biology study for the first time integratively uses 3D-structural protein interface descriptors and network topological properties that characterize proteins in a neurological protein interaction network, to aid the identification of genes that are previously not known to be shared between these diseases. Results of protein prioritization by machine learning have identified known as well as new genetic markers which might have direct or indirect involvement in several neurological disorders. Important gene hubs have also been identified that provide an evidence for shared molecular pathways in the neurological disease network.

  4. Mycobacterium tuberculosis HspX/EsxS Fusion Protein: Gene Cloning, Protein Expression, and Purification in Escherichia coli.

    Science.gov (United States)

    Khademi, Farzad; Yousefi-Avarvand, Arshid; Derakhshan, Mohammad; Meshkat, Zahra; Tafaghodi, Mohsen; Ghazvini, Kiarash; Aryan, Ehsan; Sankian, Mojtaba

    2017-10-01

    The purpose of this study was to clone, express, and purify a novel multidomain fusion protein of Micobacterium tuberculosis (Mtb) in a prokaryotic system. An hspX/esxS gene construct was synthesized and ligated into a pGH plasmid, E. coli TOP10 cells were transformed, and the vector was purified. The vector containing the construct and pET-21b (+) plasmid were digested with the same enzymes and the construct was ligated into pET-21b (+). The accuracy of cloning was confirmed by colony PCR and sequencing. E. coli BL21 cells were transformed with the pET-21b (+)/hspX/esxS expression vector and protein expression was evaluated. Finally, the expressed fusion protein was purified on a Ni-IDA column and verified by SDS-PAGE and western blotting. The hspX/esxS gene construct was inserted into pET-21b (+) and recombinant protein expression was induced with IPTG in E. coli BL21 cells. Various concentrations of IPTG were tested to determine the optimum concentration for expression induction. The recombinant protein was expressed in insoluble inclusion bodies. Three molar guanidine HCl was used to solubilize the insoluble protein. An HspX/EsxS Mtb fusion protein was expressed in E. coli and the recombinant protein was purified. After immunological analysis, the HspX/EsxS fusion protein might be an anti-tuberculosis vaccine candidate in future clinical trial studies.

  5. Recombinant Brucella abortus gene expressing immunogenic protein

    Energy Technology Data Exchange (ETDEWEB)

    Mayfield, J.E.; Tabatabai, L.B.

    1991-06-11

    This patent describes a synthetic recombinant DNA molecule containing a DNA sequence. It comprises a gene of Brucella abortus encoding an immunogenic protein having a molecular weight of approximately 31,000 daltons as determined by sodium dodecyl sulfate polyacrylamide gel electrophoresis under denaturing conditions, the protein having an isoelectric point around 4.9, and containing a twenty-five amino acid sequence from its amino terminal end consisting of Gln-Ala-Pro-Thr-Phe-Phe-Arg-Ile-Gly-Thr-Gly-Gly-Thr-Ala-Gly-Thr-Tyr-Tyr-Pro-Ile-Gly-Gly-Leu-Ile-Ala, wherein Gln, Ala, Pro, Thr, Phe, Arg, Ile, Gly, Tyr, and Leu, respectively, represent glutamine, alanine, proline, threonine, phenylalanine, arginine, isolecuine, glycine, tyrosine, and leucine.

  6. Predicting Essential Genes and Proteins Based on Machine Learning and Network Topological Features: A Comprehensive Review

    Science.gov (United States)

    Zhang, Xue; Acencio, Marcio Luis; Lemke, Ney

    2016-01-01

    Essential proteins/genes are indispensable to the survival or reproduction of an organism, and the deletion of such essential proteins will result in lethality or infertility. The identification of essential genes is very important not only for understanding the minimal requirements for survival of an organism, but also for finding human disease genes and new drug targets. Experimental methods for identifying essential genes are costly, time-consuming, and laborious. With the accumulation of sequenced genomes data and high-throughput experimental data, many computational methods for identifying essential proteins are proposed, which are useful complements to experimental methods. In this review, we show the state-of-the-art methods for identifying essential genes and proteins based on machine learning and network topological features, point out the progress and limitations of current methods, and discuss the challenges and directions for further research. PMID:27014079

  7. Nucleotide sequence of the coat protein gene of Lettuce big-vein virus.

    Science.gov (United States)

    Sasaya, T; Ishikawa, K; Koganezawa, H

    2001-06-01

    A sequence of 1425 nt was established that included the complete coat protein (CP) gene of Lettuce big-vein virus (LBVV). The LBVV CP gene encodes a 397 amino acid protein with a predicted M(r) of 44486. Antisera raised against synthetic peptides corresponding to N-terminal or C-terminal parts of the LBVV CP reacted in Western blot analysis with a protein with an M(r) of about 48000. RNA extracted from purified particles of LBVV by using proteinase K, SDS and phenol migrated in gels as two single-stranded RNA species of approximately 7.3 kb (ss-1) and 6.6 kb (ss-2). After denaturation by heat and annealing at room temperature, the RNA migrated as four species, ss-1, ss-2 and two additional double-stranded RNAs (ds-1 and ds-2). The Northern blot hybridization analysis using riboprobes from a full-length clone of the LBVV CP gene indicated that ss-2 has a negative-sense nature and contains the LBVV CP gene. Moreover, ds-2 is a double-stranded form of ss-2. Database searches showed that the LBVV CP most resembled the nucleocapsid proteins of rhabdoviruses. These results indicate that it would be appropriate to classify LBVV as a negative-sense single-stranded RNA virus rather than as a double-stranded RNA virus.

  8. Demethylation regulation of BDNF gene expression in dorsal root ganglion neurons is implicated in opioid-induced pain hypersensitivity in rats.

    Science.gov (United States)

    Chao, Yu-Chieh; Xie, Fang; Li, Xueyang; Guo, Ruijuan; Yang, Ning; Zhang, Chen; Shi, Rong; Guan, Yun; Yue, Yun; Wang, Yun

    2016-07-01

    Repeated administration of morphine may result in opioid-induced hypersensitivity (OIH), which involves altered expression of numerous genes, including brain-derived neurotrophic factor (BDNF) in dorsal root ganglion (DRG) neurons. Yet, it remains unclear how BDNF expression is increased in DRG neurons after repeated morphine treatment. DNA methylation is an important mechanism of epigenetic control of gene expression. In the current study, we hypothesized that the demethylation regulation of certain BDNF gene promoters in DRG neurons may contribute to the development of OIH. Real-time RT-PCR was used to assess changes in the mRNA transcription levels of major BDNF exons including exon I, II, IV, VI, as well as total BDNF mRNA in DRGs from rats after repeated morphine administration. The levels of exon IV and total BDNF mRNA were significantly upregulated by repeated morphine administration, as compared to that in saline control group. Further, ELISA array and immunocytochemistry study revealed a robust upregulation of BDNF protein expression in DRG neurons after repeated morphine exposure. Correspondingly, the methylation levels of BDNF exon IV promoter showed a significant downregulation by morphine treatment. Importantly, intrathecal administration of a BDNF antibody, but not control IgG, significantly inhibited mechanical hypersensitivity that developed in rats after repeated morphine treatment. Conversely, intrathecal administration of an inhibitor of DNA methylation, 5-aza-2'-deoxycytidine (5-aza-dC) markedly upregulated the BDNF protein expression in DRG neurons and enhanced the mechanical allodynia after repeated morphine exposure. Together, our findings suggest that demethylation regulation of BDNF gene promoter may be implicated in the development of OIH through epigenetic control of BDNF expression in DRG neurons. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Microarray Data Analysis of Space Grown Arabidopsis Leaves for Genes Important in Vascular Patterning. Analysis of Space Grown Arabidopsis with Microarray Data from GeneLab: Identification of Genes Important in Vascular Patterning

    Science.gov (United States)

    Weitzel, A. J.; Wyatt, S. E.; Parsons-Wingerter, P.

    2016-01-01

    Venation patterning in leaves is a major determinant of photosynthesis efficiency because of its dependency on vascular transport of photo-assimilates, water, and minerals. Arabidopsis thaliana grown in microgravity show delayed growth and leaf maturation. Gene expression data from the roots, hypocotyl, and leaves of A. thaliana grown during spaceflight vs. ground control analyzed by Affymetrix microarray are available through NASA's GeneLab (GLDS-7). We analyzed the data for differential expression of genes in leaves resulting from the effects of spaceflight on vascular patterning. Two genes were found by preliminary analysis to be up-regulated during spaceflight that may be related to vascular formation. The genes are responsible for coding an ARGOS (Auxin-Regulated Gene Involved in Organ Size)-like protein (potentially affecting cell elongation in the leaves), and an F-box/kelch-repeat protein (possibly contributing to protoxylem specification). Further analysis that will focus on raw data quality assessment and a moderated t-test may further confirm up-regulation of the two genes and/or identify other gene candidates. Plants defective in these genes will then be assessed for phenotype by the mapping and quantification of leaf vascular patterning by NASA's VESsel GENeration (VESGEN) software to model specific vascular differences of plants grown in spaceflight.

  10. The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

    Science.gov (United States)

    Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

    2013-10-10

    Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei, 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequence repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae

  11. Protein aggregates and novel presenilin gene variants in idiopathic dilated cardiomyopathy.

    Science.gov (United States)

    Gianni, Davide; Li, Airong; Tesco, Giuseppina; McKay, Kenneth M; Moore, John; Raygor, Kunal; Rota, Marcello; Gwathmey, Judith K; Dec, G William; Aretz, Thomas; Leri, Annarosa; Semigran, Marc J; Anversa, Piero; Macgillivray, Thomas E; Tanzi, Rudolph E; del Monte, Federica

    2010-03-16

    Heart failure is a debilitating condition resulting in severe disability and death. In a subset of cases, clustered as idiopathic dilated cardiomyopathy (iDCM), the origin of heart failure is unknown. In the brain of patients with dementia, proteinaceous aggregates and abnormal oligomeric assemblies of beta-amyloid impair cell function and lead to cell death. We have similarly characterized fibrillar and oligomeric assemblies in the hearts of iDCM patients, pointing to abnormal protein aggregation as a determinant of iDCM. We also showed that oligomers alter myocyte Ca(2+) homeostasis. Additionally, we have identified 2 new sequence variants in the presenilin-1 (PSEN1) gene promoter leading to reduced gene and protein expression. We also show that presenilin-1 coimmunoprecipitates with SERCA2a. On the basis of these findings, we propose that 2 mechanisms may link protein aggregation and cardiac function: oligomer-induced changes on Ca(2+) handling and a direct effect of PSEN1 sequence variants on excitation-contraction coupling protein function.

  12. Usher syndrome: animal models, retinal function of Usher proteins, and prospects for gene therapy

    Science.gov (United States)

    Williams, David S.

    2009-01-01

    Usher syndrome is a deafness-blindness disorder. The blindness occurs from a progressive retinal degeneration that begins after deafness and after the retina has developed. Three clinical subtypes of Usher syndrome have been identified, with mutations in any one of six different genes giving rise to type 1, in any one of three different genes to type 2, and in one identified gene causing Usher type 3. Mutant mice for most of the genes have been studied; while they have clear inner ear defects, retinal phenotypes are relatively mild and have been difficult to characterize. The retinal functions of the Usher proteins are still largely unknown. Protein binding studies have suggested many interactions among the proteins, and a model of interaction among all the proteins in the photoreceptor synapse has been proposed. However this model is not supported by localization data from some laboratories, or the indication of any synaptic phenotype in mutant mice. An earlier suggestion, based on patient pathologies, of Usher protein function in the photoreceptor cilium continues to gain support from immunolocalization and mutant mouse studies, which are consistent with Usher protein interaction in the photoreceptor ciliary/periciliary region. So far, the most characterized Usher protein is myosin VIIa. It is present in the apical RPE and photoreceptor ciliary/periciliary region, where it is required for organelle transport and clearance of opsin from the connecting cilium, respectively. Usher syndrome is amenable to gene replacement therapy, but also has some specific challenges. Progress in this treatment approach has been achieved by correction of mutant phenotypes in Myo7a-null mouse retinas, following lentiviral delivery of MYO7A. PMID:17936325

  13. Update of the human secretoglobin (SCGB gene superfamily and an example of 'evolutionary bloom' of androgen-binding protein genes within the mouse Scgb gene superfamily

    Directory of Open Access Journals (Sweden)

    Jackson Brian C

    2011-10-01

    Full Text Available Abstract The secretoglobins (SCGBs comprise a family of small, secreted proteins found in animals exclusively of mammalian lineage. There are 11 human SCGB genes and five pseudogenes. Interestingly, mice have 68 Scgb genes, four of which are highly orthologous to human SCGB genes; the remainder represent an 'evolutionary bloom' and make up a large gene family represented by only six counterparts in humans. SCGBs are found in high concentrations in many mammalian secretions, including fluids of the lung, lacrimal gland, salivary gland, prostate and uterus. Whereas the biological activities of most individual SCGBs have not been fully characterised, what already has been discovered suggests that this family has an important role in the modulation of inflammation, tissue repair and tumorigenesis. In mice, the large Scgb1b and Scgb2b gene families encode the androgen-binding proteins, which have been shown to play a role in mate selection. Although much has been learned about SCGBs in recent years, clearly more research remains to be done to allow a better understanding of the roles of these proteins in human health and disease. Such information is predicted to reveal valuable novel drug targets for the treatment of inflammation, as well as designing biomarkers that might identify tissue damage or cancer.

  14. Elevated expression of protein biosynthesis genes in liver and muscle of hibernating black bears (Ursus americanus).

    Science.gov (United States)

    Fedorov, Vadim B; Goropashnaya, Anna V; Tøien, Øivind; Stewart, Nathan C; Gracey, Andrew Y; Chang, Celia; Qin, Shizhen; Pertea, Geo; Quackenbush, John; Showe, Louise C; Showe, Michael K; Boyer, Bert B; Barnes, Brian M

    2009-04-10

    We conducted a large-scale gene expression screen using the 3,200 cDNA probe microarray developed specifically for Ursus americanus to detect expression differences in liver and skeletal muscle that occur during winter hibernation compared with animals sampled during summer. The expression of 12 genes, including RNA binding protein motif 3 (Rbm3), that are mostly involved in protein biosynthesis, was induced during hibernation in both liver and muscle. The Gene Ontology and Gene Set Enrichment analysis consistently showed a highly significant enrichment of the protein biosynthesis category by overexpressed genes in both liver and skeletal muscle during hibernation. Coordinated induction in transcriptional level of genes involved in protein biosynthesis is a distinctive feature of the transcriptome in hibernating black bears. This finding implies induction of translation and suggests an adaptive mechanism that contributes to a unique ability to reduce muscle atrophy over prolonged periods of immobility during hibernation. Comparing expression profiles in bears to small mammalian hibernators shows a general trend during hibernation of transcriptional changes that include induction of genes involved in lipid metabolism and carbohydrate synthesis as well as depression of genes involved in the urea cycle and detoxification function in liver.

  15. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  16. Enhanced Expression of WD Repeat-Containing Protein 35 via Nuclear Factor-Kappa B Activation in Bupivacaine-Treated Neuro2a Cells

    Science.gov (United States)

    Huang, Lei; Kondo, Fumio; Harato, Misako; Feng, Guo-Gang; Ishikawa, Naoshisa; Fujiwara, Yoshihiro; Okada, Shoshiro

    2014-01-01

    The family of WD repeat proteins comprises a large number of proteins and is involved in a wide variety of cellular processes such as signal transduction, cell growth, proliferation, and apoptosis. Bupivacaine is a sodium channel blocker administered for local infiltration, nerve block, epidural, and intrathecal anesthesia. Recently, we reported that bupivacaine induces reactive oxygen species (ROS) generation and p38 mitogen-activated protein kinase (MAPK) activation, resulting in an increase in the expression of WD repeat-containing protein 35 (WDR35) in mouse neuroblastoma Neuro2a cells. It has been shown that ROS activate MAPK through phosphorylation, followed by activation of nuclear factor-kappa B (NF-κB) and activator protein 1 (AP-1). The present study was undertaken to test whether NF-κB and c-Jun/AP-1 are involved in bupivacaine-induced WDR35 expression in Neuro2a cells. Bupivacaine activated both NF-κB and c-Jun in Neuro2a cells. APDC, an NF-κB inhibitor, attenuated the increase in NF-κB activity and WDR35 protein expression in bupivacaine-treated Neuro2a cells. GW9662, a selective peroxisome proliferator-activated receptor-γ antagonist, enhanced the increase in NF-κB activity and WDR35 protein expression in bupivacaine-treated Neuro2a cells. In contrast, c-Jun siRNA did not inhibit the bupivacaine-induced increase in WDR35 mRNA expression. These results indicate that bupivacaine induces the activation of transcription factors NF-κB and c-Jun/AP-1 in Neuro2a cells, while activation of NF-κB is involved in bupivacaine-induced increases in WDR35 expression. PMID:24466034

  17. Dissection of Resistance Genes to Pseudomonas syringae pv. phaseolicola in UI3 Common Bean Cultivar

    Directory of Open Access Journals (Sweden)

    Ana M. González

    2017-11-01

    Full Text Available Few quantitative trait loci have been mapped for resistance to Pseudomonas syringae pv. phaseolicola in common bean. Two F2 populations were developed from the host differential UI3 cultivar. The objective of this study was to further characterize the resistance to races 1, 5, 7 and 9 of Psp included in UI3. Using a QTL mapping approach, 16 and 11 main-effect QTLs for pod and primary leaf resistance were located on LG10, explaining up to 90% and 26% of the phenotypic variation, respectively. The homologous genomic region corresponding to primary leaf resistance QTLs detected tested positive for the presence of resistance-associated gene cluster encoding nucleotide-binding and leucine-rich repeat (NL, Natural Resistance Associated Macrophage (NRAMP and Pentatricopeptide Repeat family (PPR proteins. It is worth noting that the main effect QTLs for resistance in pod were located inside a 3.5 Mb genomic region that included the Phvul.010G021200 gene, which encodes a protein that has the highest sequence similarity to the RIN4 gene of Arabidopsis, and can be considered an important candidate gene for the organ-specific QTLs identified here. These results support that resistance to Psp from UI3 might result from the immune response activated by combinations of R proteins, and suggest the guard model as an important mechanism in pod resistance to halo blight. The candidate genes identified here warrant functional studies that will help in characterizing the actual defense gene(s in UI3 genotype.

  18. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System.

    Science.gov (United States)

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-07-15

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery it has been developed into numerous applications like genome editing and regulation of transcription in eukaryotes and bacteria. For archaea currently no tools for transcriptional repression exist. Because molecular biology analyses in archaea become more and more widespread such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome, they can be monocistronic or part of a polycistronic operon. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  19. Gene Repression in Haloarchaea Using the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)-Cas I-B System*

    Science.gov (United States)

    Stachler, Aris-Edda; Marchfelder, Anita

    2016-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system is used by bacteria and archaea to fend off foreign genetic elements. Since its discovery it has been developed into numerous applications like genome editing and regulation of transcription in eukaryotes and bacteria. For archaea currently no tools for transcriptional repression exist. Because molecular biology analyses in archaea become more and more widespread such a tool is vital for investigating the biological function of essential genes in archaea. Here we use the model archaeon Haloferax volcanii to demonstrate that its endogenous CRISPR-Cas system I-B can be harnessed to repress gene expression in archaea. Deletion of cas3 and cas6b genes results in efficient repression of transcription. crRNAs targeting the promoter region reduced transcript levels down to 8%. crRNAs targeting the reading frame have only slight impact on transcription. crRNAs that target the coding strand repress expression only down to 88%, whereas crRNAs targeting the template strand repress expression down to 8%. Repression of an essential gene results in reduction of transcription levels down to 22%. Targeting efficiencies can be enhanced by expressing a catalytically inactive Cas3 mutant. Genes can be targeted on plasmids or on the chromosome, they can be monocistronic or part of a polycistronic operon. PMID:27226589

  20. Novel APC gene mutations associated with protein alteration in diffuse type gastric cancer.

    Science.gov (United States)

    Ghatak, Souvik; Chakraborty, Payel; Sarkar, Sandeep Roy; Chowdhury, Biswajit; Bhaumik, Arup; Kumar, Nachimuthu Senthil

    2017-06-02

    The role of adenomatous polyposis coli (APC) gene in mitosis might be critical for regulation of genomic stability and chromosome segregation. APC gene mutations have been associated to have a role in colon cancer and since gastric and colon tumors share some common genetic lesions, it is relevant to investigate the role of APC tumor suppressor gene in gastric cancer. We investigated for somatic mutations in the Exons 14 and 15 of APC gene from 40 diffuse type gastric cancersamples. Rabbit polyclonal anti-APC antibody was used, which detects the wild-type APC protein and was recommended for detection of the respective protein in human tissues. Cell cycle analysis was done from tumor and adjacent normal tissue. APC immunoreactivity showed positive expression of the protein in stages I, II, III and negative expression in Stages III and IV. Two novel deleterious variations (g.127576C > A, g.127583C > T) in exon 14 sequence were found to generate stop codon (Y622* and Q625*)in the tumor samples. Due to the generation of stop codon, the APC protein might be truncated and all the regulatory features could be lost which has led to the down-regulation of protein expression. Our results indicate that aneuploidy might occurdue to the codon 622 and 625 APC-driven gastric tumorigenesis, in agreement with our cell cycle analysis. The APC gene function in mitosis and chromosomal stability might be lost and G1 might be arrested with high quantity of DNA in the S phase. Six missense somatic mutations in tumor samples were detected in exon 15 A-B, twoof which showed pathological and disease causing effects based on SIFT, Polyphen2 and SNPs & GO score and were not previously reported in the literature or the public mutation databases. The two novel pathological somatic mutations (g.127576C > A, g.127583C > T) in exon 14 might be altering the protein expression leading to development of gastric cancer in the study population. Our study showed that mutations in the APC

  1. Abscisic Acid (ABA) Regulation of Arabidopsis SR Protein Gene Expression

    Science.gov (United States)

    Cruz, Tiago M. D.; Carvalho, Raquel F.; Richardson, Dale N.; Duque, Paula

    2014-01-01

    Serine/arginine-rich (SR) proteins are major modulators of alternative splicing, a key generator of proteomic diversity and flexible means of regulating gene expression likely to be crucial in plant environmental responses. Indeed, mounting evidence implicates splicing factors in signal transduction of the abscisic acid (ABA) phytohormone, which plays pivotal roles in the response to various abiotic stresses. Using real-time RT-qPCR, we analyzed total steady-state transcript levels of the 18 SR and two SR-like genes from Arabidopsis thaliana in seedlings treated with ABA and in genetic backgrounds with altered expression of the ABA-biosynthesis ABA2 and the ABA-signaling ABI1 and ABI4 genes. We also searched for ABA-responsive cis elements in the upstream regions of the 20 genes. We found that members of the plant-specific SC35-Like (SCL) Arabidopsis SR protein subfamily are distinctively responsive to exogenous ABA, while the expression of seven SR and SR-related genes is affected by alterations in key components of the ABA pathway. Finally, despite pervasiveness of established ABA-responsive promoter elements in Arabidopsis SR and SR-like genes, their expression is likely governed by additional, yet unidentified cis-acting elements. Overall, this study pinpoints SR34, SR34b, SCL30a, SCL28, SCL33, RS40, SR45 and SR45a as promising candidates for involvement in ABA-mediated stress responses. PMID:25268622

  2. Cloning an artificial gene encoding angiostatic anginex: From designed peptide to functional recombinant protein

    International Nuclear Information System (INIS)

    Brandwijk, Ricardo J.M.G.E.; Nesmelova, Irina; Dings, Ruud P.M.; Mayo, Kevin H.; Thijssen, Victor L.J.L.; Griffioen, Arjan W.

    2005-01-01

    Anginex, a designed peptide 33-mer, is a potent angiogenesis inhibitor and anti-tumor agent in vivo. Anginex functions by inhibiting endothelial cell (EC) proliferation and migration leading to detachment and apoptosis of activated EC's. To better understand tumor endothelium targeting properties of anginex and enable its use in gene therapy, we constructed an artificial gene encoding the biologically exogenous peptide and produced the protein recombinantly in Pichia pastoris. Mass spectrometry shows recombinant anginex to be a dimer and circular dichroism shows the recombinant protein folds with β-strand structure like the synthetic peptide. Moreover, like parent anginex, the recombinant protein is active at inhibiting EC growth and migration, as well as inhibiting angiogenesis in vivo in the chorioallantoic membrane of the chick embryo. This study demonstrated that it is possible to produce a functionally active protein version of a rationally designed peptide, using an artificial gene and the recombinant protein approach

  3. Multiple independent insertions of 5S rRNA genes in the spliced-leader gene family of trypanosome species.

    Science.gov (United States)

    Beauparlant, Marc A; Drouin, Guy

    2014-02-01

    Analyses of the 5S rRNA genes found in the spliced-leader (SL) gene repeat units of numerous trypanosome species suggest that such linkages were not inherited from a common ancestor, but were the result of independent 5S rRNA gene insertions. In trypanosomes, 5S rRNA genes are found either in the tandemly repeated units coding for SL genes or in independent tandemly repeated units. Given that trypanosome species where 5S rRNA genes are within the tandemly repeated units coding for SL genes are phylogenetically related, one might hypothesize that this arrangement is the result of an ancestral insertion of 5S rRNA genes into the tandemly repeated SL gene family of trypanosomes. Here, we use the types of 5S rRNA genes found associated with SL genes, the flanking regions of the inserted 5S rRNA genes and the position of these insertions to show that most of the 5S rRNA genes found within SL gene repeat units of trypanosome species were not acquired from a common ancestor but are the results of independent insertions. These multiple 5S rRNA genes insertion events in trypanosomes are likely the result of frequent founder events in different hosts and/or geographical locations in species having short generation times.

  4. Repeated Gene Transfection Impairs the Engraftment of Transplanted Porcine Neonatal Pancreatic Cells

    Directory of Open Access Journals (Sweden)

    Min Koo Seo

    2011-02-01

    Full Text Available BackgroundPreviously, we reported that neonatal porcine pancreatic cells transfected with hepatocyte growth factor (HGF gene in an Epstein-Barr virus (EBV-based plasmid (pEBVHGF showed improved proliferation and differentiation compared to those of the control. In this study, we examined if pancreatic cells transfected repeatedly with pEBVHGF can be successfully grafted to control blood glucose in a diabetes mouse model.MethodsNeonatal porcine pancreatic cells were cultured as a monolayer and were transfected with pEBVHGF every other day for a total of three transfections. The transfected pancreatic cells were re-aggregated and transplanted into kidney capsules of diabetic nude mice or normal nude mice. Blood glucose level and body weight were measured every other day after transplantation. The engraftment of the transplanted cells and differentiation into beta cells were assessed using immunohistochemistry.ResultsRe-aggregation of the pancreatic cells before transplantation improved engraftment of the cells and facilitated neovascularization of the graft. Right before transplantation, pancreatic cells that were transfected with pEBVHGF and then re-aggregated showed ductal cell marker expression. However, ductal cells disappeared and the cells underwent fibrosis in a diabetes mouse model two to five weeks after transplantation; these mice also did not show controlled blood glucose levels. Furthermore, pancreatic cells transplanted into nude mice with normal blood glucose showed poor graft survival regardless of the type of transfected plasmid (pCEP4, pHGF, or pEBVHGF.ConclusionFor clinical application of transfected neonatal porcine pancreatic cells, further studies are required to develop methods of overcoming the damage for the cells caused by repeated transfection and to re-aggregate them into islet-like structures.

  5. Transcriptional activation of the mouse obese (ob) gene by CCAAT/enhancer binding protein alpha

    DEFF Research Database (Denmark)

    Hwang, C S; Mandrup, S; MacDougald, O A

    1996-01-01

    Like other adipocyte genes that are transcriptionally activated by CCAAT/enhancer binding protein alpha (C/EBP alpha) during preadipocyte differentiation, expression of the mouse obese (ob) gene is immediately preceded by the expression of C/EBP alpha. While the 5' flanking region of the mouse ob...... gene contains several consensus C/EBP binding sites, only one of these sites appears to be functional. DNase I cleavage inhibition patterns (footprinting) of the ob gene promoter revealed that recombinant C/EBP alpha, as well as a nuclear factor present in fully differentiated 3T3-L1 adipocytes...... to a consensus C/EBP binding site at nucleotides -55 to -47 generated a specific protein-oligonucleotide complex that was supershifted by antibody against C/EBP alpha. Probes corresponding to two upstream consensus C/EBP binding sites failed to generate protein-oligonucleotide complexes. Cotransfection of a C...

  6. MBNL142 and MBNL143 gene isoforms, overexpressed in DM1-patient muscle, encode for nuclear proteins interacting with Src family kinases.

    Science.gov (United States)

    Botta, A; Malena, A; Tibaldi, E; Rocchi, L; Loro, E; Pena, E; Cenci, L; Ambrosi, E; Bellocchi, M C; Pagano, M A; Novelli, G; Rossi, G; Monaco, H L; Gianazza, E; Pantic, B; Romeo, V; Marin, O; Brunati, A M; Vergani, L

    2013-08-15

    Myotonic dystrophy type-1 (DM1) is the most prevalent form of muscular dystrophy in adults. This disorder is an RNA-dominant disease, caused by expansion of a CTG repeat in the DMPK gene that leads to a misregulation in the alternative splicing of pre-mRNAs. The longer muscleblind-like-1 (MBNL1) transcripts containing exon 5 and the respective protein isoforms (MBNL142-43) were found to be overexpressed in DM1 muscle and localized exclusively in the nuclei. In vitro assays showed that MBNL142-43 bind the Src-homology 3 domain of Src family kinases (SFKs) via their proline-rich motifs, enhancing the SFK activity. Notably, this association was also confirmed in DM1 muscle and myotubes. The recovery, mediated by an siRNA target to Ex5-MBNL142-43, succeeded in reducing the nuclear localization of both Lyn and MBNL142-43 proteins and in decreasing the level of tyrosine phosphorylated proteins. Our results suggest an additional molecular mechanism in the DM1 pathogenesis, based on an altered phosphotyrosine signalling pathway.

  7. Effect of secretory pathway gene overexpression on secretion of a fluorescent reporter protein in Aspergillus nidulans

    DEFF Research Database (Denmark)

    Schalén, Martin; Anyaogu, Diana Chinyere; Hoof, Jakob Blæsbjerg

    2016-01-01

    roles in the process have been identified through transcriptomics. The assignment of function to these genes has been enabled in combination with gene deletion studies. In this work, 14 genes known to play a role in protein secretion in filamentous fungi were overexpressed in Aspergillus nidulans....... The background strain was a fluorescent reporter secreting mRFP. The overall effect of the overexpressions could thus be easily monitored through fluorescence measurements, while the effects on physiology were determined in batch cultivations and surface growth studies. Results: Fourteen protein secretion...... pathway related genes were overexpressed with a tet-ON promoter in the RFP-secreting reporter strain and macromorphology, physiology and protein secretion were monitored when the secretory genes were induced. Overexpression of several of the chosen genes was shown to cause anomalies on growth, micro...

  8. PTEN gene and phosphorylation of Akt protein expression in the LPS-induced lung fibroblast

    Directory of Open Access Journals (Sweden)

    Mao-lin HUANG

    2014-09-01

    Full Text Available Objective: To investigate PTEN gene expression and the Akt phosphorylation of protein expression in the LPS-induced lung fibroblast, to initially reveal the relation between PTEN gene and the Akt phosphorylated proteins to LPS-induced lung fibroblast proliferation mechanism. Methods: BrdU experiments was performed to evaluate the LPS-induced lung fibroblast proliferation,  RT-PCR and Western Blot analysis were used to analyze the PTEN gene expression and Western blot was performed to analyze Akt phosphorylated protein expression. Results: PTEN mRNA level of the experimental group were significantly lower than the control group (P<0.05 with LPS simulation for 24h and 72h , and there were no significant difference between the experimental group and control group the experimental group and control group (P>0.05 . PTEN protein expression levels of the experimental group were significantly lower than the control group (P<0.05 , at 72h, and PTEN mRNA levels had no significant differences between these of the experimental and control group at 6h,12h and 24h(p>0.05. Phosphorylation Akt protein level (relative to total Akt protein was significantly higer than the control group (P<0.05 at 24h and 72h, and phosphorylation Akt protein levels had no significant differences between these of the experimental and control group at 6h and 12h (P>0.05 .Conclusion: PTEN gene and phosphorylation Akt protein involve in LPS-induced lung fibroblast proliferation signal transduction pathway.

  9. Identification of Genes Encoding the Folate- and Thiamine-Binding Membrane Proteins in Firmicutes

    NARCIS (Netherlands)

    Eudes, Aymerick; Erkens, Guus B.; Slotboom, Dirk J.; Rodionov, Dmitry A.; Naponelli, Valeria; Hanson, Andrew D.

    Genes encoding high-affinity folate- and thiamine-binding proteins (FolT, ThiT) were identified in the Lactobacillus casei genome, expressed in Lactococcus lactis, and functionally characterized. Similar genes occur in many Firmicutes, sometimes next to folate or thiamine salvage genes. Most thiT

  10. Coxiella burnetii Nine Mile II proteins modulate gene expression of monocytic host cells during infection

    Directory of Open Access Journals (Sweden)

    Shaw Edward I

    2010-09-01

    Full Text Available Abstract Background Coxiella burnetii is an intracellular bacterial pathogen that causes acute and chronic disease in humans. Bacterial replication occurs within enlarged parasitophorous vacuoles (PV of eukaryotic cells, the biogenesis and maintenance of which is dependent on C. burnetii protein synthesis. These observations suggest that C. burnetii actively subverts host cell processes, however little is known about the cellular biology mechanisms manipulated by the pathogen during infection. Here, we examined host cell gene expression changes specifically induced by C. burnetii proteins during infection. Results We have identified 36 host cell genes that are specifically regulated when de novo C. burnetii protein synthesis occurs during infection using comparative microarray analysis. Two parallel sets of infected and uninfected THP-1 cells were grown for 48 h followed by the addition of chloramphenicol (CAM to 10 μg/ml in one set. Total RNA was harvested at 72 hpi from all conditions, and microarrays performed using Phalanx Human OneArray™ slides. A total of 784 (mock treated and 901 (CAM treated THP-1 genes were up or down regulated ≥2 fold in the C. burnetii infected vs. uninfected cell sets, respectively. Comparisons between the complementary data sets (using >0 fold, eliminated the common gene expression changes. A stringent comparison (≥2 fold between the separate microarrays revealed 36 host cell genes modulated by C. burnetii protein synthesis. Ontological analysis of these genes identified the innate immune response, cell death and proliferation, vesicle trafficking and development, lipid homeostasis, and cytoskeletal organization as predominant cellular functions modulated by C. burnetii protein synthesis. Conclusions Collectively, these data indicate that C. burnetii proteins actively regulate the expression of specific host cell genes and pathways. This is in addition to host cell genes that respond to the presence of the

  11. Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

    Directory of Open Access Journals (Sweden)

    Tadashi Imanishi

    2004-06-01

    Full Text Available The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/. It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs, identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA

  12. Transcriptome analysis of the Cryptocaryon irritans tomont stage identifies potential genes for the detection and control of cryptocaryonosis

    Directory of Open Access Journals (Sweden)

    Wan Kiew-Lian

    2010-01-01

    Full Text Available Abstract Background Cryptocaryon irritans is a parasitic ciliate that causes cryptocaryonosis (white spot disease in marine fish. Diagnosis of cryptocaryonosis often depends on the appearance of white spots on the surface of the fish, which are usually visible only during later stages of the disease. Identifying suitable biomarkers of this parasite would aid the development of diagnostic tools and control strategies for C. irritans. The C. irritans genome is virtually unexplored; therefore, we generated and analyzed expressed sequence tags (ESTs of the parasite to identify genes that encode for surface proteins, excretory/secretory proteins and repeat-containing proteins. Results ESTs were generated from a cDNA library of C. irritans tomonts isolated from infected Asian sea bass, Lates calcarifer. Clustering of the 5356 ESTs produced 2659 unique transcripts (UTs containing 1989 singletons and 670 consensi. BLAST analysis showed that 74% of the UTs had significant similarity (E-value -5 to sequences that are currently available in the GenBank database, with more than 15% of the significant hits showing unknown function. Forty percent of the UTs had significant similarity to ciliates from the genera Tetrahymena and Paramecium. Comparative gene family analysis with related taxa showed that many protein families are conserved among the protozoans. Based on gene ontology annotation, functional groups were successfully assigned to 790 UTs. Genes encoding excretory/secretory proteins and membrane and membrane-associated proteins were identified because these proteins often function as antigens and are good antibody targets. A total of 481 UTs were classified as encoding membrane proteins, 54 were classified as encoding for membrane-bound proteins, and 155 were found to contain excretory/secretory protein-coding sequences. Amino acid repeat-containing proteins and GPI-anchored proteins were also identified as potential candidates for the development of

  13. Analysis of ribosomal protein gene structures: implications for intron evolution.

    Directory of Open Access Journals (Sweden)

    2006-03-01

    Full Text Available Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs and mitochondrial ribosomal proteins (MRPs, which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be "conserved," i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.

  14. Characterization of the Pichia pastoris protein-O-mannosyltransferase gene family.

    Directory of Open Access Journals (Sweden)

    Juergen H Nett

    Full Text Available The methylotrophic yeast, Pichiapastoris, is an important organism used for the production of therapeutic proteins. However, the presence of fungal-like glycans, either N-linked or O-linked, can elicit an immune response or enable the expressed protein to bind to mannose receptors, thus reducing their efficacy. Previously we have reported the elimination of β-linked glycans in this organism. In the current report we have focused on reducing the O-linked mannose content of proteins produced in P. pastoris, thereby reducing the potential to bind to mannose receptors. The initial step in the synthesis of O-linked glycans in P. pastoris is the transfer of mannose from dolichol-phosphomannose to a target protein in the yeast secretory pathway by members of the protein-O-mannosyltransferase (PMT family. In this report we identify and characterize the members of the P. pastoris PMT family. Like Candida albicans, P. pastoris has five PMT genes. Based on sequence homology, these PMTs can be grouped into three sub-families, with both PMT1 and PMT2 sub-families possessing two members each (PMT1 and PMT5, and PMT2 and PMT6, respectively. The remaining sub-family, PMT4, has only one member (PMT4. Through gene knockouts we show that PMT1 and PMT2 each play a significant role in O-glycosylation. Both, by gene knockouts and the use of Pmt inhibitors we were able to significantly reduce not only the degree of O-mannosylation, but also the chain-length of these glycans. Taken together, this reduction of O-glycosylation represents an important step forward in developing the P. pastoris platform as a suitable system for the production of therapeutic glycoproteins.

  15. CCAAT/enhancer-binding proteins regulate expression of the human steroidogenic acute regulatory protein (StAR) gene.

    Science.gov (United States)

    Christenson, L K; Johnson, P F; McAllister, J M; Strauss, J F

    1999-09-10

    Two putative CCAAT/enhancer-binding protein (C/EBP) response elements were identified in the proximal promoter of the human steroidogenic acute regulatory protein (StAR) gene, which encodes a key protein-regulating steroid hormone synthesis. Expression of C/EBPalpha and -beta increased StAR promoter activity in COS-1 and HepG2 cells. Cotransfection of C/EBPalpha or -beta and steroidogenic factor 1, a transcription factor required for cAMP regulation of StAR expression, into COS-1 augmented 8-bromoadenosine 3':5'-cyclic monophosphate (8-Br-cAMP)-stimulated promoter activity. When the putative C/EBP response elements were mutated, individually or together, a pronounced decline in basal StAR promoter activity in human granulosa-lutein cells resulted, but the fold stimulation of promoter activity by 8-Br-cAMP was unaffected. Recombinant C/EBPalpha and -beta bound to the two identified sequences but not the mutated elements. Human granulosa-lutein cell nuclear extracts also bound these elements but not the mutated sequences. An antibody to C/EBPbeta, but not C/EBPalpha, supershifted the nuclear protein complex associated with the more distal element. The complex formed by nuclear extracts with the proximal element was not supershifted by either antibody. Western blot analysis revealed the presence of C/EBPalpha and C/EBPbeta in human granulosa-lutein cell nuclear extracts. C/EBPbeta levels were up-regulated 3-fold by 8-Br-cAMP treatment. Our studies demonstrate a role for C/EBPbeta as well as yet to be identified proteins, which can bind to C/EBP response elements, in the regulation of StAR gene expression and suggest a mechanism by which C/EBPbeta participates in the cAMP regulation of StAR gene transcription.

  16. Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 with improved proof-reading enhances homology-directed repair.

    Science.gov (United States)

    Kato-Inui, Tomoko; Takahashi, Gou; Hsu, Szuyin; Miyaoka, Yuichiro

    2018-05-18

    Genome editing using clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated protein 9 (Cas9) predominantly induces non-homologous end joining (NHEJ), which generates random insertions or deletions, whereas homology-directed repair (HDR), which generates precise recombination products, is useful for wider applications. However, the factors that determine the ratio of HDR to NHEJ products after CRISPR/Cas9 editing remain unclear, and methods by which the proportion of HDR products can be increased have not yet been fully established. We systematically analyzed the HDR and NHEJ products after genome editing using various modified guide RNAs (gRNAs) and Cas9 variants with an enhanced conformational checkpoint to improve the fidelity at endogenous gene loci in HEK293T cells and HeLa cells. We found that these modified gRNAs and Cas9 variants were able to enhance HDR in both single-nucleotide substitutions and a multi-kb DNA fragment insertion. Our results suggest that the original CRISPR/Cas9 system from the bacterial immune system is not necessarily the best option for the induction of HDR in genome editing and indicate that the modulation of the kinetics of conformational checkpoints of Cas9 can optimize the HDR/NHEJ ratio.

  17. PREFACE: Physics approaches to protein interactions and gene regulation Physics approaches to protein interactions and gene regulation

    Science.gov (United States)

    Nussinov, Ruth; Panchenko, Anna R.; Przytycka, Teresa

    2011-06-01

    networks have been identified, including scale free distribution of the vertex degree, network motifs, and modularity, to name a few. These studies of network organization require the network to be as complete as possible, which given the limitations of experimental techniques is not currently the case. Therefore, experimental procedures for detecting biomolecular interactions should be complemented by computational approaches. The paper by Lees et al provides a review of computational methods, integrating multiple independent sources of data to infer physical and functional protein-protein interaction networks. One of the important aspects of protein interactions that should be accounted for in the prediction of protein interaction networks is that many proteins are composed of distinct domains. Protein domains may mediate protein interactions while proteins and their interaction networks may gain complexity through gene duplication and expansion of existing domain architectures via domain rearrangements. The latter mechanisms have been explored in detail in the paper by Cohen-Gihon et al. Protein-protein interactions are not the only component of the cell's interactome. Regulation of cell activity can be achieved at the level of transcription and involve a transcription factor—DNA binding which typically requires recognition of a specific DNA sequence motif. Chip-Chip and the more recent Chip-Seq technologies allow in vivo identification of DNA binding sites and, together with novel in vitro approaches, provide data necessary for deciphering the corresponding binding motifs. Such information, complemented by structures of protein-DNA complexes and knowledge of the differences in binding sites among homologs, opens the door to constructing predictive binding models. The paper by Persikov and Singh provides an example of such a model in the Cys2His2 zinc finger family. Recent studies have indicated that the presence of such binding motifs is, however, neither necessary

  18. Conserved loci of leaf and stem rust fungi of wheat share synteny interrupted by lineage-specific influx of repeat elements

    Directory of Open Access Journals (Sweden)

    Fellers John P

    2013-01-01

    Full Text Available Abstract Background Wheat leaf rust (Puccinia triticina Eriks; Pt and stem rust fungi (P. graminis f.sp. tritici; Pgt are significant economic pathogens having similar host ranges and life cycles, but different alternate hosts. The Pt genome, currently estimated at 135 Mb, is significantly larger than Pgt, at 88 Mb, but the reason for the expansion is unknown. Three genomic loci of Pt conserved proteins were characterized to gain insight into gene content, genome complexity and expansion. Results A bacterial artificial chromosome (BAC library was made from P. triticina race 1, BBBD and probed with Pt homologs of genes encoding two predicted Pgt secreted effectors and a DNA marker mapping to a region of avirulence. Three BACs, 103 Kb, 112 Kb, and 166 Kb, were sequenced, assembled, and open reading frames were identified. Orthologous genes were identified in Pgt and local conservation of gene order (microsynteny was observed. Pairwise protein identities ranged from 26 to 99%. One Pt BAC, containing a RAD18 ortholog, shares syntenic regions with two Pgt scaffolds, which could represent both haplotypes of Pgt. Gene sequence is diverged between the species as well as within the two haplotypes. In all three BAC clones, gene order is locally conserved, however, gene shuffling has occurred relative to Pgt. These regions are further diverged by differing insertion loci of LTR-retrotransposon, Gypsy, Copia, Mutator, and Harbinger mobile elements. Uncharacterized Pt open reading frames were also found; these proteins are high in lysine and similar to multiple proteins in Pgt. Conclusions The three Pt loci are conserved in gene order, with a range of gene sequence divergence. Conservation of predicted haustoria expressed secreted protein genes between Pt and Pgt is extended to the more distant poplar rust, Melampsora larici-populina. The loci also reveal that genome expansion in Pt is in part due to higher occurrence of repeat-elements in this species.

  19. Cloning and Molecular Characterization of the Schistosoma mansoni Genes RbAp48 and Histone H4

    Directory of Open Access Journals (Sweden)

    Patrícia P Souza

    2002-10-01

    Full Text Available The human nuclear protein RbAp48 is a member of the tryptophan/aspartate (WD repeat family, which binds to the retinoblastoma (Rb protein. It also corresponds to the smallest subunit of the chromatin assembly factor and is able to bind to the helix 1 of histone H4, taking it to the DNA in replication. A cDNA homologous to the human gene RbAp48 was isolated from a Schistosoma mansoni adult worm library and named SmRbAp48. The full length sequence of SmRbAp48 cDNA is 1036 bp long, encoding a protein of 308 amino acids. The transcript of SmRbAp48 was detected in egg, cercariae and schistosomulum stages. The protein shows 84% similarity with the human RbAp48, possessing four WD repeats on its C-terminus. A hypothetical tridimensional structure for the SmRbAp48 C-terminal domain was constructed by computational molecular modeling using the b-subunit of the G protein as a model. To further verify a possible interaction between SmRbAp48 and S. mansoni histone H4, the histone H4 gene was amplified from adult worm genomic DNA using degenerated primers. The gene fragment of SmH4 is 294 bp long, encoding a protein of 98 amino acids which is 100% identical to histone H4 from Drosophila melanogaster.

  20. Unveiling network-based functional features through integration of gene expression into protein networks.

    Science.gov (United States)

    Jalili, Mahdi; Gebhardt, Tom; Wolkenhauer, Olaf; Salehzadeh-Yazdi, Ali

    2018-06-01

    Decoding health and disease phenotypes is one of the fundamental objectives in biomedicine. Whereas high-throughput omics approaches are available, it is evident that any single omics approach might not be adequate to capture the complexity of phenotypes. Therefore, integrated multi-omics approaches have been used to unravel genotype-phenotype relationships such as global regulatory mechanisms and complex metabolic networks in different eukaryotic organisms. Some of the progress and challenges associated with integrated omics studies have been reviewed previously in comprehensive studies. In this work, we highlight and review the progress, challenges and advantages associated with emerging approaches, integrating gene expression and protein-protein interaction networks to unravel network-based functional features. This includes identifying disease related genes, gene prioritization, clustering protein interactions, developing the modules, extract active subnetworks and static protein complexes or dynamic/temporal protein complexes. We also discuss how these approaches contribute to our understanding of the biology of complex traits and diseases. This article is part of a Special Issue entitled: Cardiac adaptations to obesity, diabetes and insulin resistance, edited by Professors Jan F.C. Glatz, Jason R.B. Dyck and Christine Des Rosiers. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. A Novel Prokaryotic Green Fluorescent Protein Expression System for Testing Gene Editing Tools Activity Like Zinc Finger Nuclease.

    Science.gov (United States)

    Sabzehei, Faezeh; Kouhpayeh, Shirin; Dastjerdeh, Mansoureh Shahbazi; Khanahmad, Hossein; Salehi, Rasoul; Naderi, Shamsi; Taghizadeh, Razieh; Rabiei, Parisa; Hejazi, Zahra; Shariati, Laleh

    2017-01-01

    Gene editing technology has created a revolution in the field of genome editing. The three of the most famous tools in gene editing technology are zinc finger nucleases (ZFNs), transcription activator-like effector nucleases, clustered regularly interspaced short palindromic repeats (CRISPR), and CRISPR-associated systems. As their predictable nature, it is necessary to assess their efficiency. There are some methods for this purpose, but most of them are time labor and complicated. Here, we introduce a new prokaryotic reporter system, which makes it possible to evaluate the efficiency of gene editing tools faster, cheaper, and simpler than previous methods. At first, the target sites of a custom ZFN, which is designed against a segment of ampicillin resistance gene, were cloned on both sides of green fluorescent protein (GFP) gene to construct pPRO-GFP. Then pPRO-GFP was transformed into Escherichia coli TOP10F' that contains pZFN (contains expression cassette of a ZFN against ampicillin resistant gene), or p15A-KanaR as a negative control. The transformed bacteria were cultured on three separate media that contained ampicillin, kanamycin, and ampicillin + kanamycin; then the resulted colonies were assessed by flow cytometry. The results of flow cytometry showed a significant difference between the case (bacteria contain pZFN) and control (bacteria contain p15A, KanaR) in MFI (Mean Fluorescence Intensity) ( P < 0.0001). According to ZFN efficiency, it can bind and cut the target sites, the bilateral cutting can affect the intensity of GFP fluorescence. Our flow cytometry results showed that this ZFN could reduce the intensity of GFP color and colony count of bacteria in media containing amp + kana versus control sample.

  2. Molecular identification and characterization of clustered regularly interspaced short palindromic repeat (CRISPR) gene cluster in Taylorella equigenitalis.

    Science.gov (United States)

    Hara, Yasushi; Hayashi, Kyohei; Nakajima, Takuya; Kagawa, Shizuko; Tazumi, Akihiro; Moore, John E; Matsuda, Motoo

    2013-09-01

    Clustered regularly interspaced short palindromic repeats (CRISPRs), of approximately 10,000 base pairs (bp) in length, were shown to occur in the Japanese Taylorella equigenitalis strain, EQ59. The locus was composed of the putative CRISPRs-associated with 5 (cas5), RAMP csd1, csd2, recB, cas1, a leader region, 13 CRISPR consensus sequence repeats (each 32 bp; 5'-TCAGCCACGTTCGCGTGGCTGTGTGTTTAAAG-3'). These were in turn separated by 12 non repetitive unique spacer regions of similar length. In addition, a leader region, a transposase/IS protein, a leader region, and cas3 were also seen. All seven putative open reading frames carry their ribosome binding sites. Promoter consensus sequences at the -35 and -10 regions and putative intrinsic ρ-independent transcription terminator regions also occurred. A possible long overlap of 170 bp in length occurred between the recB and cas1 loci. Positive reverse transcription PCR signals of cas5, RAMP csd1, csd2-recB/cas1, and cas3 were generated. A putative secondary structure of the CRISPR consensus repeats was constructed. Following this, CRISPR results of the T. equigenitalis EQ59 isolate were subsequently compared with those from the Taylorella asinigenitalis MCE3 isolate.

  3. Changes in HSP gene and protein expression in natural scrapie with brain damage

    Science.gov (United States)

    2011-01-01

    Heat shock proteins (Hsp) perform cytoprotective functions such as apoptosis regulation and inflammatory response control. These proteins can also be secreted to the extracellular medium, acting as inflammatory mediators, and their chaperone activity permits correct folding of proteins and avoids the aggregation of anomalous isoforms. Several studies have proposed the implication of Hsp in prion diseases. We analysed the gene expression and protein distribution of different members of the Hsp27, Hsp70, and Hsp90 families in the central nervous system of sheep naturally infected with scrapie. Different expression profiles were observed in the areas analysed. Whereas changes in transcript levels were not observed in the cerebellum or medulla oblongata, a significant decrease in HSP27 and HSP90 was detected in the prefrontal cortex. In contrast, HSP73 was over-expressed in diencephalons of scrapie animals. Western blotting did not reveal significant differences in Hsp90 and Hsp70 protein expression between scrapie and control animals. Expression rates identified by real-time RT-PCR and western blotting were compared with the extent of classical scrapie lesions using stepwise regression. Changes in Hsp gene and protein expression were associated with prion protein deposition, gliosis and spongiosis rather than with apoptosis. Finally, immunohistochemistry revealed intense Hsp70 and Hsp90 immunolabelling in Purkinje cells of scrapie sheep. In contrast, controls displayed little or no staining in these cells. The observed differences in gene expression and protein distribution suggest that the heat shock proteins analysed play a role in the natural form of the disease. PMID:21314976

  4. Changes in HSP gene and protein expression in natural scrapie with brain damage

    Directory of Open Access Journals (Sweden)

    Serrano Carmen

    2011-01-01

    Full Text Available Abstract Heat shock proteins (Hsp perform cytoprotective functions such as apoptosis regulation and inflammatory response control. These proteins can also be secreted to the extracellular medium, acting as inflammatory mediators, and their chaperone activity permits correct folding of proteins and avoids the aggregation of anomalous isoforms. Several studies have proposed the implication of Hsp in prion diseases. We analysed the gene expression and protein distribution of different members of the Hsp27, Hsp70, and Hsp90 families in the central nervous system of sheep naturally infected with scrapie. Different expression profiles were observed in the areas analysed. Whereas changes in transcript levels were not observed in the cerebellum or medulla oblongata, a significant decrease in HSP27 and HSP90 was detected in the prefrontal cortex. In contrast, HSP73 was over-expressed in diencephalons of scrapie animals. Western blotting did not reveal significant differences in Hsp90 and Hsp70 protein expression between scrapie and control animals. Expression rates identified by real-time RT-PCR and western blotting were compared with the extent of classical scrapie lesions using stepwise regression. Changes in Hsp gene and protein expression were associated with prion protein deposition, gliosis and spongiosis rather than with apoptosis. Finally, immunohistochemistry revealed intense Hsp70 and Hsp90 immunolabelling in Purkinje cells of scrapie sheep. In contrast, controls displayed little or no staining in these cells. The observed differences in gene expression and protein distribution suggest that the heat shock proteins analysed play a role in the natural form of the disease.

  5. Characterizing genes with distinct methylation patterns in the context of protein-protein interaction network: application to human brain tissues.

    Science.gov (United States)

    Li, Yongsheng; Xu, Juan; Chen, Hong; Zhao, Zheng; Li, Shengli; Bai, Jing; Wu, Aiwei; Jiang, Chunjie; Wang, Yuan; Su, Bin; Li, Xia

    2013-01-01

    DNA methylation is an essential epigenetic mechanism involved in transcriptional control. However, how genes with different methylation patterns are assembled in the protein-protein interaction network (PPIN) remains a mystery. In the present study, we systematically dissected the characterization of genes with different methylation patterns in the PPIN. A negative association was detected between the methylation levels in the brain tissues and topological centralities. By focusing on two classes of genes with considerably different methylation levels in the brain tissues, namely the low methylated genes (LMGs) and high methylated genes (HMGs), we found that their organizing principles in the PPIN are distinct. The LMGs tend to be the center of the PPIN, and attacking them causes a more deleterious effect on the network integrity. Furthermore, the LMGs express their functions in a modular pattern and substantial differences in functions are observed between the two types of genes. The LMGs are enriched in the basic biological functions, such as binding activity and regulation of transcription. More importantly, cancer genes, especially recessive cancer genes, essential genes, and aging-related genes were all found more often in the LMGs. Additionally, our analysis presented that the intra-classes communications are enhanced, but inter-classes communications are repressed. Finally, a functional complementation was revealed between methylation and miRNA regulation in the human genome. We have elucidated the assembling principles of genes with different methylation levels in the context of the PPIN, providing key insights into the complex epigenetic regulation mechanisms.

  6. From proteins to genes: immunoanalysis in the diagnosis of muscular dystrophies

    Directory of Open Access Journals (Sweden)

    Barresi Rita

    2011-06-01

    Full Text Available Abstract Muscular dystrophies are a large heterogeneous group of inherited diseases that cause progressive muscle weakness and permanent muscle damage. Very few muscular dystrophies show sufficient specific clinical features to allow a definite diagnosis. Because of the currently limited capacity to screen for numerous genes simultaneously, muscle biopsy is a time and cost-effective test for many of these disorders. Protein analysis interpreted in correlation with the clinical phenotype is a useful way of directing genetic testing in many types of muscular dystrophies. Immunohistochemistry and western blot are complementary techniques used to gather quantitative and qualitative information on the expression of proteins involved in this group of diseases. Immunoanalysis has a major diagnostic application mostly in recessive conditions where the absence of labelling for a particular protein is likely to indicate a defect in that gene. However, abnormalities in protein expression can vary from absence to very subtle reduction. It is good practice to test muscle biopsies with antibodies for several proteins simultaneously and to interpret the results in context. Indeed, there is a degree of direct or functional association between many of these proteins that is reflected by the presence of specific secondary abnormalities that are of value, especially when the diagnosis is not straightforward.

  7. Characterization of the ovine ribosomal protein SA gene and its pseudogenes

    Directory of Open Access Journals (Sweden)

    Van Zeveren Alex

    2010-03-01

    Full Text Available Abstract Background The ribosomal protein SA (RPSA, previously named 37-kDa laminin receptor precursor/67-kDa laminin receptor (LRP/LR is a multifunctional protein that plays a role in a number of pathological processes, such as cancer and prion diseases. In all investigated species, RPSA is a member of a multicopy gene family consisting of one full length functional gene and several pseudogenes. Therefore, for studies on RPSA related pathways/pathologies, it is important to characterize the whole family and to address the possible function of the other RPSA family members. The present work aims at deciphering the RPSA family in sheep. Results In addition to the full length functional ovine RPSA gene, 11 other members of this multicopy gene family, all processed pseudogenes, were identified. Comparison between the RPSA transcript and these pseudogenes shows a large variety in sequence identities ranging from 99% to 74%. Only one of the 11 pseudogenes, i.e. RPSAP7, shares the same open reading frame (ORF of 295 amino acids with the RPSA gene, differing in only one amino acid. All members of the RPSA family were annotated by comparative mapping and fluorescence in situ hybridization (FISH localization. Transcription was investigated in the cerebrum, cerebellum, spleen, muscle, lymph node, duodenum and blood, and transcripts were detected for 6 of the 11 pseudogenes in some of these tissues. Conclusions In the present work we have characterized the ovine RPSA family. Our results have revealed the existence of 11 ovine RPSA pseudogenes and provide new data on their structure and sequence. Such information will facilitate molecular studies of the functional RPSA gene taking into account the existence of these pseudogenes in the design of experiments. It remains to be investigated if the transcribed members are functional as regulatory non-coding RNA or as functional proteins.

  8. Variants in the ASB10 Gene Are Associated with Primary Open Angle Glaucoma

    NARCIS (Netherlands)

    Micheal, S.; Ayub, H.; Islam, F.; Siddiqui, S.N.; Khan, W.A.; Akhtar, F.; Qamar, R.; Khan, M.I.; Hollander, A.I. den

    2015-01-01

    BACKGROUND: Recently nonsynonymous coding variants in the ankyrin repeats and suppressor of cytokine signaling box-containing protein 10 (ASB10) gene were found to be associated with primary open angle glaucoma (POAG) in cohorts from Oregon and Germany, but this finding was not confirmed in an

  9. The coat protein complex II, COPII, protein Sec13 directly interacts with presenilin-1

    International Nuclear Information System (INIS)

    Nielsen, Anders Lade

    2009-01-01

    Mutations in the human gene encoding presenilin-1, PS1, account for most cases of early-onset familial Alzheimer's disease. PS1 has nine transmembrane domains and a large loop orientated towards the cytoplasm. PS1 locates to cellular compartments as endoplasmic reticulum (ER), Golgi apparatus, vesicular structures, and plasma membrane, and is an integral member of γ-secretase, a protein protease complex with specificity for intra-membranous cleavage of substrates such as β-amyloid precursor protein. Here, an interaction between PS1 and the Sec13 protein is described. Sec13 takes part in coat protein complex II, COPII, vesicular trafficking, nuclear pore function, and ER directed protein sequestering and degradation control. The interaction maps to the N-terminal part of the large hydrophilic PS1 loop and the first of the six WD40-repeats present in Sec13. The identified Sec13 interaction to PS1 is a new candidate interaction for linking PS1 to secretory and protein degrading vesicular circuits.

  10. The coat protein complex II, COPII, protein Sec13 directly interacts with presenilin-1

    Energy Technology Data Exchange (ETDEWEB)

    Nielsen, Anders Lade, E-mail: aln@humgen.au.dk [Department of Human Genetics, The Bartholin Building, University of Aarhus, DK-8000 Aarhus C (Denmark)

    2009-10-23

    Mutations in the human gene encoding presenilin-1, PS1, account for most cases of early-onset familial Alzheimer's disease. PS1 has nine transmembrane domains and a large loop orientated towards the cytoplasm. PS1 locates to cellular compartments as endoplasmic reticulum (ER), Golgi apparatus, vesicular structures, and plasma membrane, and is an integral member of {gamma}-secretase, a protein protease complex with specificity for intra-membranous cleavage of substrates such as {beta}-amyloid precursor protein. Here, an interaction between PS1 and the Sec13 protein is described. Sec13 takes part in coat protein complex II, COPII, vesicular trafficking, nuclear pore function, and ER directed protein sequestering and degradation control. The interaction maps to the N-terminal part of the large hydrophilic PS1 loop and the first of the six WD40-repeats present in Sec13. The identified Sec13 interaction to PS1 is a new candidate interaction for linking PS1 to secretory and protein degrading vesicular circuits.

  11. Discrepancies in reporting the CAG repeat lengths for Huntington's disease

    DEFF Research Database (Denmark)

    Quarrell, Oliver W; Handley, Olivia; O'Donovan, Kirsty

    2011-01-01

    Huntington's disease results from a CAG repeat expansion within the Huntingtin gene; this is measured routinely in diagnostic laboratories. The European Huntington's Disease Network REGISTRY project centrally measures CAG repeat lengths on fresh samples; these were compared with the original...

  12. cDNA-SRAP and Its Application in Differential Gene Expression Analysis: A Case Study in Erianthus arundinaceum

    Directory of Open Access Journals (Sweden)

    Youxiong Que

    2012-01-01

    Full Text Available Erianthus arundinaceum is a wild relative species of sugarcane. The aim of this research was to demonstrate the feasibility of cDNA-SRAP for differential gene expression and to explore the molecular mechanism of drought resistance in E. arundinaceum. cDNA-SRAP technique, for the first time, was applied in the analysis of differential gene expression in E. arundinaceum under drought stress. In total, eight differentially expressed genes with length of 185–427 bp were successfully isolated (GenBank Accession numbers: EU071770, EU071772, EU071774, EU071776, EU071777, EU071779, EU071780, and EU071781. Based on their homologies with genes in GenBank, these genes were assumed to encode ribonuclease III, vacuolar protein, ethylene insensitive protein, aerobactin biosynthesis protein, photosystem II protein, glucose transporter, leucine-rich repeat protein, and ammonia monooxygenase. Real-time PCR analysis on the expression profiling of gene (EU071774 encoding ethylene-insensitive protein and gene (EU071781 encoding ammonia monooxygenase revealed that the expression of these two genes was upregulated both by PEG and ABA treatments, suggesting that they may involve in the drought resistance of E. arundinaceum. This study constitutes the first report of genes activated in E. arundinaceum by drought stress and opens up the application of cDNA-SRAP in differential gene expression analysis in E. arundinaceum under certain stress conditions.

  13. Complete chloroplast genome of Trachelium caeruleum: extensiverearrangements are associated with repeats and tRNAs

    Energy Technology Data Exchange (ETDEWEB)

    Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-09

    Chloroplast genome structure, gene order and content arehighly conserved in land plants. We sequenced the complete chloroplastgenome sequence of Trachelium caeruleum (Campanulaceae) a member of anangiosperm family known for highly rearranged chloroplast genomes. Thetotal genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and aprotein coding gene (psbJ) duplicated twice, for a total of 137 genes.Four genes (ycf15, rpl23, infA and accD) are truncated and likelynonfunctional; three others (clpP, ycf1 and ycf2) are so highly divergedthat they may now be pseudogenes. The most conspicuous feature of theTrachelium genome is the presence of eighteen internally unrearrangedblocks of genes that have been inverted or relocated within the genome,relative to the typical gene order of most angiosperm chloroplastgenomes. Recombination between repeats or tRNAs has been suggested as twomeans of chloroplast genome rearrangements. We compared the relativenumber of repeats in Trachelium to eight other angiosperm chloroplastgenomes, and evaluated the location of repeats and tRNAs in relation torearrangements. Trachelium has the highest number and largest repeats,which are concentrated near inversion endpoints or other rearrangements.tRNAs occur at many but not all inversion endpoints. There is likely nosingle mechanism responsible for the remarkable number of alterations inthis genome, but both repeats and tRNAs are clearly associated with theserearrangements. Land plant chloroplast genomes are highly conserved instructure, gene order and content. The chloroplast genomes of ferns, thegymnosperm Ginkgo, and most angiosperms are nearly collinear, reflectingthe gene order in lineages that diverged from lycopsids and the ancestralchloroplast gene order over 350 million years ago (Raubeson and Jansen,1992). Although earlier mapping studies

  14. The Heat Shock Protein 26 Gene is Required for Ethanol Tolerance in Drosophila

    Directory of Open Access Journals (Sweden)

    Awoyemi A. Awofala

    2011-01-01

    Full Text Available Stress plays an important role in drug- and addiction-related behaviours. However, the mechanisms underlying these behavioural responses are still poorly understood. In the light of recent reports that show consistent regulation of many genes encoding stress proteins including heat shock proteins following ethanol exposure in Drosophila , it was hypothesised that transition to alcohol dependence may involve the dysregulation of the circuits that mediate behavioural responses to stressors. Thus, behavioural genetic methodologies were used to investigate the role of the Drosophila hsp26 gene, a small heat shock protein coding gene which is induced in response to various stresses, in the development of rapid tolerance to ethanol sedation. Rapid tolerance was quantified as the percentage difference in the mean sedation times between the second and first ethanol exposure. Two independently isolated P-element mutations near the hsp26 gene eliminated the capacity for tolerance. In addition, RNAi-mediated functional knockdown of hsp26 expression in the glial cells and the whole nervous system also caused a defect in tolerance development. The rapid tolerance phenotype of the hsp26 mutants was rescued by the expression of the wild-type hsp26 gene in the nervous system. None of these manipulations of the hsp26 gene caused changes in the rate of ethanol absorption. Hsp26 genes are evolutionary conserved, thus the role of hsp26 in ethanol tolerance may present a new direction for research into alcohol dependency.

  15. Global regulation of gene expression by the MafR protein of Enterococcus faecalis

    Directory of Open Access Journals (Sweden)

    Sofía eRuiz-Cruz

    2016-01-01

    Full Text Available Enterococcus faecalis is a natural inhabitant of the human gastrointestinal tract. However, as an opportunistic pathogen, it is able to colonize other host niches and cause life-threatening infections. Its adaptation to new environments involves global changes in gene expression. The EF3013 gene (here named mafR of E. faecalis strain V583 encodes a protein (MafR, 482 residues that has sequence similarity to global response regulators of the Mga/AtxA family. The enterococcal OG1RF genome also encodes the MafR protein (gene OG1RF_12293. In this work, we have identified the promoter of the mafR gene using several in vivo approaches. Moreover, we show that MafR influences positively the transcription of many genes on a genome-wide scale. The most significant target genes encode components of PTS-type membrane transporters, components of ABC-type membrane transporters, and proteins involved in the metabolism of carbon sources. Some of these genes were previously reported to be up-regulated during the growth of E. faecalis in blood and/or in human urine. Furthermore, we show that a mafR deletion mutant strain induces a significant lower degree of inflammation in the peritoneal cavity of mice, suggesting that enterococcal cells deficient in MafR are less virulent. Our work indicates that MafR is a global transcriptional regulator. It might facilitate the adaptation of E. faecalis to particular host niches and, therefore, contribute to its potential virulence.

  16. MNS16A tandem repeats minisatellite of human telomerase gene: a risk factor for colorectal cancer.

    Science.gov (United States)

    Hofer, Philipp; Baierl, Andreas; Feik, Elisabeth; Führlinger, Gerhard; Leeb, Gernot; Mach, Karl; Holzmann, Klaus; Micksche, Michael; Gsur, Andrea

    2011-06-01

    Telomerase reactivation and expression of human telomerase gene [human telomerase reverse transcriptase (hTERT)] are hallmarks of unlimited proliferation potential of cancer cells. A polymorphic tandem repeats minisatellite of hTERT gene, termed MNS16A was reported to influence hTERT expression. To assess the role of MNS16A as potential biomarker for colorectal cancer (CRC), we investigated for the first time the association of MNS16A genotypes with risk of colorectal polyps and CRC. In the ongoing colorectal cancer study of Austria (CORSA), 3842 Caucasian participants were recruited within a large screening project in the province Burgenland including 90 CRC cases, 308 high-risk polyps, 1022 low-risk polyps and 1822 polyp free controls verified by colonoscopy. MNS16A genotypes were determined by polymerase chain reaction from genomic DNA. Associations of MNS16A genotypes with CRC risk were estimated by logistic regression analysis computing odds ratios (ORs) and 95% confidence intervals (CIs). We identified five different variable number of tandem repeats (VNTRs) of MNS16A including VNTR-364, a newly discovered rare variant. VNTR-274 allele was associated with a 2.7-fold significantly increased risk of CRC compared with the VNTR-302 wild-type (OR = 2.69; 95% CI = 1.11-6.50; P = 0.028). In our CORSA study, the medium length VNTR-274 was identified as risk factor for CRC. Although, this population-based study herewith reports the largest cohort size concerning MNS16A thus far, further large-scale studies in diverse populations are warranted to confirm hTERT MNS16A genotype as potential biomarker for assessment of CRC risk.

  17. Mapping of the Mouse Actin Capping Protein Beta Subunit Gene

    Directory of Open Access Journals (Sweden)

    Cooper John A

    2000-07-01

    Full Text Available Abstract Background Capping protein (CP, a heterodimer of α and β subunits, is found in all eukaryotes. CP binds to the barbed ends of actin filaments in vitro and controls actin assembly and cell motility in vivo. Vertebrates have three isoforms of CPβ produced by alternatively splicing from one gene; lower organisms have one gene and one isoform. Results We isolated genomic clones corresponding to the β subunit of mouse CP and identified its chromosomal location by interspecies backcross mapping. Conclusions The CPβ gene (Cappb1 mapped to Chromosome 4 between Cdc42 and D4Mit312. Three mouse mutations, snubnose, curly tail, and cribriform degeneration, map in the vicinity of the β gene.

  18. Sequence analyses reveal that a TPR–DP module, surrounded by recombinable flanking introns, could be at the origin of eukaryotic Hop and Hip TPR–DP domains and prokaryotic GerD proteins

    Science.gov (United States)

    Papandreou, Nikolaos; Chomilier, Jacques

    2008-01-01

    The co-chaperone Hop [heat shock protein (HSP) organising protein] is known to bind both Hsp70 and Hsp90. Hop comprises three repeats of a tetratricopeptide repeat (TPR) domain, each consisting of three TPR motifs. The first and last TPR domains are followed by a domain containing several dipeptide (DP) repeats called the DP domain. These analyses suggest that the hop genes result from successive recombination events of an ancestral TPR–DP module. From a hydrophobic cluster analysis of homologous Hop protein sequences derived from gene families, we can postulate that shifts in the open reading frames are at the origin of the present sequences. Moreover, these shifts can be related to the presence or absence of biological function. We propose to extend the family of Hop co-chaperons into the kingdom of bacteria, as several structurally related genes have been identified by hydrophobic cluster analysis. We also provide evidence of common structural characteristics between hop and hip genes, suggesting a shared precursor of ancestral TPR–DP domains. Electronic supplementary material The online version of this article (doi:10.1007/s12192-008-0083-8) contains supplementary material, which is available to authorized users. PMID:18987995

  19. Identification of a STOP1-like protein in Eucalyptus that regulates transcription of Al tolerance genes.

    Science.gov (United States)

    Sawaki, Yoshiharu; Kobayashi, Yuriko; Kihara-Doi, Tomonori; Nishikubo, Nobuyuki; Kawazu, Tetsu; Kobayashi, Masatomo; Kobayashi, Yasufumi; Iuchi, Satoshi; Koyama, Hiroyuki; Sato, Shigeru

    2014-06-01

    Tolerance to soil acidity is an important trait for eucalyptus clones that are introduced to commercial forestry plantations in pacific Asian countries, where acidic soil is dominant in many locations. A conserved transcription factor regulating aluminum (Al) and proton (H⁺) tolerance in land-plant species, STOP1 (SENSITIVE TOPROTON RHIZOTOXICITY 1)-like protein, was isolated by polymerase chain reaction-based cloning, and then suppressed by RNA interference in hairy roots produced by Agrobacterium rhizogenes-mediated transformation. Eucalyptus STOP1-like protein complemented proton tolerance in an Arabidopsis thaliana stop1-mutant, and localized to the nucleus in a transient assay of a green fluorescent protein fusion protein expressed in tobacco leaves by Agrobacterium tumefaciens-mediated transformation. Genes encoding a citrate transporting MULTIDRUGS AND TOXIC COMPOUND EXTRUSION protein and an orthologue of ALUMINUM SENSITIVE 3 were suppressed in transgenic hairy roots in which the STOP1 orthologue was knocked down. In summary, we identified a series of genes for Al-tolerance in eucalyptus, including a gene for STOP1-like protein and the Al-tolerance genes it regulates. These genes may be useful for molecular breeding and genomic selection of elite clones to introduce into acid soil regions. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  20. Isolation of nucleotide binding site-leucine rich repeat and kinase resistance gene analogues from sugarcane (Saccharum spp.).

    Science.gov (United States)

    Glynn, Neil C; Comstock, Jack C; Sood, Sushma G; Dang, Phat M; Chaparro, Jose X

    2008-01-01

    Resistance gene analogues (RGAs) have been isolated from many crops and offer potential in breeding for disease resistance through marker-assisted selection, either as closely linked or as perfect markers. Many R-gene sequences contain kinase domains, and indeed kinase genes have been reported as being proximal to R-genes, making kinase analogues an additionally promising target. The first step towards utilizing RGAs as markers for disease resistance is isolation and characterization of the sequences. Sugarcane clone US01-1158 was identified as resistant to yellow leaf caused by the sugarcane yellow leaf virus (SCYLV) and moderately resistant to rust caused by Puccinia melanocephala Sydow & Sydow. Degenerate primers that had previously proved useful for isolating RGAs and kinase analogues in wheat and soybean were used to amplify DNA from sugarcane (Saccharum spp.) clone US-01-1158. Sequences generated from 1512 positive clones were assembled into 134 contigs of between two and 105 sequences. Comparison of the contig consensuses with the NCBI sequence database using BLASTx showed that 20 had sequence homology to nuclear binding site and leucine rich repeat (NBS-LRR) RGAs, and eight to kinase genes. Alignment of the deduced amino acid sequences with similar sequences from the NCBI database allowed the identification of several conserved domains. The alignment and resulting phenetic tree showed that many of the sequences had greater similarity to sequences from other species than to one another. The use of degenerate primers is a useful method for isolating novel sugarcane RGA and kinase gene analogues. Further studies are needed to evaluate the role of these genes in disease resistance.