WorldWideScience

Sample records for expressed sequences tags

  1. Novel expressed sequence tag- simple sequence repeats (EST ...

    African Journals Online (AJOL)

    Using different bioinformatic criteria, the SUCEST database was used to mine for simple sequence repeat (SSR) markers. Among 42,189 clusters, 1,425 expressed sequence tag- simple sequence repeats (EST-SSRs) were identified in silico. Trinucleotide repeats were the most abundant SSRs detected. Of 212 primer pairs ...

  2. Expressed sequence tags (ESTs) and single nucleotide ...

    African Journals Online (AJOL)

    SERVER

    2008-02-19

    Feb 19, 2008 ... the discovery of the DNA, a new area of modern plant biotechnology begun. In plant ... Marker Assisted Breeding and Sequence Tagged Sites. (STS) are all in use in modern ...... and behaviour in the honey bee. Genome Res.

  3. Mining of expressed sequence tag libraries of cacao

    Indian Academy of Sciences (India)

    Expressed sequence tags (ESTs) provide researchers with a quick and inexpensive route for discovering new genes, data on gene expression and regulation, and also provide genic markers that help in constructing genome maps. Cacao is an important perennial crop of humid tropics. Cacao EST sequences, as available ...

  4. Mining microsatellite markers from public expressed sequence tag

    Indian Academy of Sciences (India)

    Home; Journals; Journal of Genetics; Volume 91; Issue 3. Mining microsatellite markers from public expressed sequence tag sequences for genetic diversity analysis in pomegranate. Zai-Hai Jian Xin-She Liu Jian-Bin Hu Yan-Hui Chen Jian-Can Feng. Research Note Volume 91 Issue 3 December 2012 pp 353-358 ...

  5. Survey of transposable elements in sugarcane expressed sequence tags (ESTs

    Directory of Open Access Journals (Sweden)

    Rossi Magdalena

    2001-01-01

    Full Text Available The sugarcane expressed sequence tag (SUCEST project has produced a large number of cDNA sequences from several plant tissues submitted or not to different conditions of stress. In this paper we report the result of a search for transposable elements (TEs revealing a surprising amount of expressed TEs homologues. Of the 260,781 sequences grouped in 81,223 fragment assembly program (Phrap clusters, a total of 276 clones showed homology to previously reported TEs using a stringent cut-off value of e-50 or better. Homologous clones to Copia/Ty1 and Gypsy/Ty3 groups of long terminal repeat (LTR retrotransposons were found but no non-LTR retroelements were identified. All major transposon families were represented in sugarcane including Activator (Ac, Mutator (MuDR, Suppressor-mutator (En/Spm and Mariner. In order to compare the TE diversity in grasses genomes, we carried out a search for TEs described in sugarcane related species O.sativa, Z. mays and S. bicolor. We also present preliminary results showing the potential use of TEs insertion pattern polymorphism as molecular markers for cultivar identification.

  6. Generation and analysis of expressed sequence tags from Botrytis cinerea

    Directory of Open Access Journals (Sweden)

    EVELYN SILVA

    2006-01-01

    Full Text Available Botrytis cinerea is a filamentous plant pathogen of a wide range of plant species, and its infection may cause enormous damage both during plant growth and in the post-harvest phase. We have constructed a cDNA library from an isolate of B. cinerea and have sequenced 11,482 expressed sequence tags that were assembled into 1,003 contigs sequences and 3,032 singletons. Approximately 81% of the unigenes showed significant similarity to genes coding for proteins with known functions: more than 50% of the sequences code for genes involved in cellular metabolism, 12% for transport of metabolites, and approximately 10% for cellular organization. Other functional categories include responses to biotic and abiotic stimuli, cell communication, cell homeostasis, and cell development. We carried out pair-wise comparisons with fungal databases to determine the B. cinerea unisequence set with relevant similarity to genes in other fungal pathogenic counterparts. Among the 4,035 non-redundant B. cinerea unigenes, 1,338 (23% have significant homology with Fusarium verticillioides unigenes. Similar values were obtained for Saccharomyces cerevisiae and Aspergillus nidulans (22% and 24%, respectively. The lower percentages of homology were with Magnaporthe grisae and Neurospora crassa (13% and 19%, respectively. Several genes involved in putative and known fungal virulence and general pathogenicity were identified. The results provide important information for future research on this fungal pathogen

  7. Analysis of expressed sequence tags from the Ulva prolifera (Chlorophyta)

    Science.gov (United States)

    Niu, Jianfeng; Hu, Haiyan; Hu, Songnian; Wang, Guangce; Peng, Guang; Sun, Song

    2010-01-01

    In 2008, a green tide broke out before the sailing competition of the 29th Olympic Games in Qingdao. The causative species was determined to be Enteromorpha prolifera ( Ulva prolifera O. F. Müller), a familiar green macroalga along the coastline of China. Rapid accumulation of a large biomass of floating U. prolifera prompted research on different aspects of this species. In this study, we constructed a nonnormalized cDNA library from the thalli of U. prolifera and acquired 10 072 high-quality expressed sequence tags (ESTs). These ESTs were assembled into 3 519 nonredundant gene groups, including 1 446 clusters and 2 073 singletons. After annotation with the nr database, a large number of genes were found to be related with chloroplast and ribosomal protein, GO functional classification showed 1 418 ESTs participated in photosynthesis and 1 359 ESTs were responsible for the generation of precursor metabolites and energy. In addition, rather comprehensive carbon fixation pathways were found in U. prolifera using KEGG. Some stress-related and signal transduction-related genes were also found in this study. All the evidences displayed that U. prolifera had substance and energy foundation for the intense photosynthesis and the rapid proliferation. Phylogenetic analysis of cytochrome c oxidase subunit I revealed that this green-tide causative species is most closely affiliated to Pseudendoclonium akinetum (Ulvophyceae).

  8. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    Science.gov (United States)

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084

  9. Gene mining a marama bean expressed sequence tags (ESTs ...

    African Journals Online (AJOL)

    The authors reported the identification of genes associated with embryonic development and microsatellite sequences. The future direction will entail characterization of these genes using gene over-expression and mutant assays. Key words: Namibia, simple sequence repeats (SSR), data mining, homology searches, ...

  10. Peanut (Arachis hypogaea Expressed Sequence Tag Project: Progress and Application

    Directory of Open Access Journals (Sweden)

    Suping Feng

    2012-01-01

    Full Text Available Many plant ESTs have been sequenced as an alternative to whole genome sequences, including peanut because of the genome size and complexity. The US peanut research community had the historic 2004 Atlanta Genomics Workshop and named the EST project as a main priority. As of August 2011, the peanut research community had deposited 252,832 ESTs in the public NCBI EST database, and this resource has been providing the community valuable tools and core foundations for various genome-scale experiments before the whole genome sequencing project. These EST resources have been used for marker development, gene cloning, microarray gene expression and genetic map construction. Certainly, the peanut EST sequence resources have been shown to have a wide range of applications and accomplished its essential role at the time of need. Then the EST project contributes to the second historic event, the Peanut Genome Project 2010 Inaugural Meeting also held in Atlanta where it was decided to sequence the entire peanut genome. After the completion of peanut whole genome sequencing, ESTs or transcriptome will continue to play an important role to fill in knowledge gaps, to identify particular genes and to explore gene function.

  11. Comprehensive Genetic Database of Expressed Sequence Tags for Coccolithophorids

    Science.gov (United States)

    Ranji, Mohammad; Hadaegh, Ahmad R.

    Coccolithophorids are unicellular, marine, golden-brown, single-celled algae (Haptophyta) commonly found in near-surface waters in patchy distributions. They belong to the Phytoplankton family that is known to be responsible for much of the earth reproduction. Phytoplankton, just like plants live based on the energy obtained by Photosynthesis which produces oxygen. Substantial amount of oxygen in the earth's atmosphere is produced by Phytoplankton through Photosynthesis. The single-celled Emiliana Huxleyi is the most commonly known specie of Coccolithophorids and is known for extracting bicarbonate (HCO3) from its environment and producing calcium carbonate to form Coccoliths. Coccolithophorids are one of the world's primary producers, contributing about 15% of the average oceanic phytoplankton biomass to the oceans. They produce elaborate, minute calcite platelets (Coccoliths), covering the cell to form a Coccosphere and supplying up to 60% of the bulk pelagic calcite deposited on the sea floors. In order to understand the genetics of Coccolithophorid and the complexities of their biochemical reactions, we decided to build a database to store a complete profile of these organisms' genomes. Although a variety of such databases currently exist, (http://www.geneservice.co.uk/home/) none have yet been developed to comprehensively address the sequencing efforts underway by the Coccolithophorid research community. This database is called CocooExpress and is available to public (http://bioinfo.csusm.edu) for both data queries and sequence contribution.

  12. Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range

    NARCIS (Netherlands)

    Massa, Sonia I.; Pearson, Gareth A.; Aires, Tania; Kube, Michael; Olsen, Jeanine L.; Reinhardt, Richard; Serrao, Ester A.; Arnaud-Haond, Sophie

    Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera midi during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to

  13. PASSIOMA: Exploring Expressed Sequence Tags during Flower Development in Passiflora spp.

    Directory of Open Access Journals (Sweden)

    Lucas Cutri

    2012-01-01

    Full Text Available The genus Passiflora provides a remarkable example of floral complexity and diversity. The extreme variation of Passiflora flower morphologies allowed a wide range of interactions with pollinators to evolve. We used the analysis of expressed sequence tags (ESTs as an approach for the characterization of genes expressed during Passiflora reproductive development. Analyzing the Passiflora floral EST database (named PASSIOMA, we found sequences showing significant sequence similarity to genes known to be involved in reproductive development such as MADS-box genes. Some of these sequences were studied using RT-PCR and in situ hybridization confirming their expression during Passiflora flower development. The detection of these novel sequences can contribute to the development of EST-based markers for important agronomic traits as well as to the establishment of genomic tools to study the naturally occurring floral diversity among Passiflora species.

  14. Expressed sequence tags of differential genes in the radioresistant mice and their parental mice

    International Nuclear Information System (INIS)

    Wang Qin; Yue Jingyin; Li Jin; Song Li; Liu Qiang; Mu Chuanjie; Wu Hongying

    2009-01-01

    Objective: To explore radioresistance correlative genes in IRM-2 inbred mouse. Methods: The total RNA was extracted from spleen cells of IRM-2 and their parent 615 and ICR/JCL mouse. The mRNA differential display technique was used to analyze gene expression differences. Each differential bands were amplified by PCR, cloned and sequenced. Results: There were 75 differential expression bands appearing in IRM-2 mouse but not in 615 and ICR/JCL mouse. Fifty-two pieces of cDNA sequences were got by sequencing. Twenty-one expressed sequence tags (EST) that were not the same as known mice genes were found and registered by comparing with GenBank database. Conclusion: Twenty-one EST denote that radioresistance correlative genes may be in IRM-2 mouse, which have laid a foundation for isolating and identifying radioresistance correlative genes in further study. (authors)

  15. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    DEFF Research Database (Denmark)

    de Souza, S J; Camargo, A A; Briones, M R

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central ...

  16. Analyses of an expressed sequence tag library from Taenia solium, Cysticerca.

    Directory of Open Access Journals (Sweden)

    Jonas Lundström

    Full Text Available BACKGROUND: Neurocysticercosis is a disease caused by the oral ingestion of eggs from the human parasitic worm Taenia solium. Although drugs are available they are controversial because of the side effects and poor efficiency. An expressed sequence tag (EST library is a method used to describe the gene expression profile and sequence of mRNA from a specific organism and stage. Such information can be used in order to find new targets for the development of drugs and to get a better understanding of the parasite biology. METHODS AND FINDINGS: Here an EST library consisting of 5760 sequences from the pig cysticerca stage has been constructed. In the library 1650 unique sequences were found and of these, 845 sequences (52% were novel to T. solium and not identified within other EST libraries. Furthermore, 918 sequences (55% were of unknown function. Amongst the 25 most frequently expressed sequences 6 had no relevant similarity to other sequences found in the Genbank NR DNA database. A prediction of putative signal peptides was also performed and 4 among the 25 were found to be predicted with a signal peptide. Proposed vaccine and diagnostic targets T24, Tsol18/HP6 and Tso31d could also be identified among the 25 most frequently expressed. CONCLUSIONS: An EST library has been produced from pig cysticerca and analyzed. More than half of the different ESTs sequenced contained a sequence with no suggested function and 845 novel EST sequences have been identified. The library increases the knowledge about what genes are expressed and to what level. It can also be used to study different areas of research such as drug and diagnostic development together with parasite fitness via e.g. immune modulation.

  17. Parallel Sequencing of Expressed Sequence Tags from Two Complementary DNA Libraries for High and Low Phosphorus Adaptation in Common Beans

    Directory of Open Access Journals (Sweden)

    Matthew W. Blair

    2011-11-01

    Full Text Available Expressed sequence tags (ESTs have proven useful for gene discovery in many crops. In this work, our objective was to construct complementary DNA (cDNA libraries from root tissues of common beans ( L. grown under low and high P hydroponic conditions and to conduct EST sequencing and comparative analyses of the libraries. Expressed sequence tag analysis of 3648 clones identified 2372 unigenes, of which 1591 were annotated as known genes while a total of 465 unigenes were not associated with any known gene. Unigenes with hits were categorized according to biological processes, molecular function, and cellular compartmentalization. Given the young tissue used to make the root libraries, genes for catalytic activity and binding were highly expressed. Comparisons with previous root EST sequencing and between the two libraries made here resulted in a set of genes to study further for differential gene expression and adaptation to low P, such as a 14 kDa praline-rich protein, a metallopeptidase, tonoplast intrinsic protein, adenosine triphosphate (ATP citrate synthase, and cell proliferation genes expressed in the low P treated plants. Given that common beans are often grown on acid soils of the tropics and subtropics that are usually low in P these genes and the two parallel libraries will be useful for selection for better uptake of this essential macronutrient. The importance of EST generation for common bean root tissues under low P and other abiotic soil stresses is also discussed.

  18. Expressed sequence tags from heat-shocked seagrass Zostera noltii (Hornemann) from its southern distribution range.

    Science.gov (United States)

    Massa, Sónia I; Pearson, Gareth A; Aires, Tânia; Kube, Michael; Olsen, Jeanine L; Reinhardt, Richard; Serrão, Ester A; Arnaud-Haond, Sophie

    2011-09-01

    Predicted global climate change threatens the distributional ranges of species worldwide. We identified genes expressed in the intertidal seagrass Zostera noltii during recovery from a simulated low tide heat-shock exposure. Five Expressed Sequence Tag (EST) libraries were compared, corresponding to four recovery times following sub-lethal temperature stress, and a non-stressed control. We sequenced and analyzed 7009 sequence reads from 30min, 2h, 4h and 24h after the beginning of the heat-shock (AHS), and 1585 from the control library, for a total of 8594 sequence reads. Among 51 Tentative UniGenes (TUGs) exhibiting significantly different expression between libraries, 19 (37.3%) were identified as 'molecular chaperones' and were over-expressed following heat-shock, while 12 (23.5%) were 'photosynthesis TUGs' generally under-expressed in heat-shocked plants. A time course analysis of expression showed a rapid increase in expression of the molecular chaperone class, most of which were heat-shock proteins; which increased from 2 sequence reads in the control library to almost 230 in the 30min AHS library, followed by a slow decrease during further recovery. In contrast, 'photosynthesis TUGs' were under-expressed 30min AHS compared with the control library, and declined progressively with recovery time in the stress libraries, with a total of 29 sequence reads 24h AHS, compared with 125 in the control. A total of 4734 TUGs were screened for EST-Single Sequence Repeats (EST-SSRs) and 86 microsatellites were identified. Copyright © 2011 Elsevier B.V. All rights reserved.

  19. Citrus plastid-related gene profiling based on expressed sequence tag analyses

    Directory of Open Access Journals (Sweden)

    Tercilio Calsa Jr.

    2007-01-01

    Full Text Available Plastid-related sequences, derived from putative nuclear or plastome genes, were searched in a large collection of expressed sequence tags (ESTs and genomic sequences from the Citrus Biotechnology initiative in Brazil. The identified putative Citrus chloroplast gene sequences were compared to those from Arabidopsis, Eucalyptus and Pinus. Differential expression profiling for plastid-directed nuclear-encoded proteins and photosynthesis-related gene expression variation between Citrus sinensis and Citrus reticulata, when inoculated or not with Xylella fastidiosa, were also analyzed. Presumed Citrus plastome regions were more similar to Eucalyptus. Some putative genes appeared to be preferentially expressed in vegetative tissues (leaves and bark or in reproductive organs (flowers and fruits. Genes preferentially expressed in fruit and flower may be associated with hypothetical physiological functions. Expression pattern clustering analysis suggested that photosynthesis- and carbon fixation-related genes appeared to be up- or down-regulated in a resistant or susceptible Citrus species after Xylella inoculation in comparison to non-infected controls, generating novel information which may be helpful to develop novel genetic manipulation strategies to control Citrus variegated chlorosis (CVC.

  20. Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.).

    Science.gov (United States)

    Cloutier, Sylvie; Miranda, Evelyn; Ward, Kerry; Radovanovic, Natasa; Reimer, Elsa; Walichnowski, Andrzej; Datla, Raju; Rowland, Gordon; Duguid, Scott; Ragupathy, Raja

    2012-08-01

    Flax is an important oilseed crop in North America and is mostly grown as a fibre crop in Europe. As a self-pollinated diploid with a small estimated genome size of ~370 Mb, flax is well suited for fast progress in genomics. In the last few years, important genetic resources have been developed for this crop. Here, we describe the assessment and comparative analyses of 1,506 putative simple sequence repeats (SSRs) of which, 1,164 were derived from BAC-end sequences (BESs) and 342 from expressed sequence tags (ESTs). The SSRs were assessed on a panel of 16 flax accessions with 673 (58 %) and 145 (42 %) primer pairs being polymorphic in the BESs and ESTs, respectively. With 818 novel polymorphic SSR primer pairs reported in this study, the repertoire of available SSRs in flax has more than doubled from the combined total of 508 of all previous reports. Among nucleotide motifs, trinucleotides were the most abundant irrespective of the class, but dinucleotides were the most polymorphic. SSR length was also positively correlated with polymorphism. Two dinucleotide (AT/TA and AG/GA) and two trinucleotide (AAT/ATA/TAA and GAA/AGA/AAG) motifs and their iterations, different from those reported in many other crops, accounted for more than half of all the SSRs and were also more polymorphic (63.4 %) than the rest of the markers (42.7 %). This improved resource promises to be useful in genetic, quantitative trait loci (QTL) and association mapping as well as for anchoring the physical/genetic map with the whole genome shotgun reference sequence of flax.

  1. Generation and analysis of expressed sequence tags from the ciliate protozoan parasite Ichthyophthirius multifiliis

    Directory of Open Access Journals (Sweden)

    Arias Covadonga

    2007-06-01

    Full Text Available Abstract Background The ciliate protozoan Ichthyophthirius multifiliis (Ich is an important parasite of freshwater fish that causes 'white spot disease' leading to significant losses. A genomic resource for large-scale studies of this parasite has been lacking. To study gene expression involved in Ich pathogenesis and virulence, our goal was to generate expressed sequence tags (ESTs for the development of a powerful microarray platform for the analysis of global gene expression in this species. Here, we initiated a project to sequence and analyze over 10,000 ESTs. Results We sequenced 10,368 EST clones using a normalized cDNA library made from pooled samples of the trophont, tomont, and theront life-cycle stages, and generated 9,769 sequences (94.2% success rate. Post-sequencing processing led to 8,432 high quality sequences. Clustering analysis of these ESTs allowed identification of 4,706 unique sequences containing 976 contigs and 3,730 singletons. These unique sequences represent over two million base pairs (~10% of Plasmodium falciparum genome, a phylogenetically related protozoan. BLASTX searches produced 2,518 significant (E-value -5 hits and further Gene Ontology (GO analysis annotated 1,008 of these genes. The ESTs were analyzed comparatively against the genomes of the related protozoa Tetrahymena thermophila and P. falciparum, allowing putative identification of additional genes. All the EST sequences were deposited by dbEST in GenBank (GenBank: EG957858–EG966289. Gene discovery and annotations are presented and discussed. Conclusion This set of ESTs represents a significant proportion of the Ich transcriptome, and provides a material basis for the development of microarrays useful for gene expression studies concerning Ich development, pathogenesis, and virulence.

  2. Rapid in silico cloning of genes using expressed sequence tags (ESTs).

    Science.gov (United States)

    Gill, R W; Sanseau, P

    2000-01-01

    Expressed sequence tags (ESTs) are short single-pass DNA sequences obtained from either end of cDNA clones. These ESTs are derived from a vast number of cDNA libraries obtained from different species. Human ESTs are the bulk of the data and have been widely used to identify new members of gene families, as markers on the human chromosomes, to discover polymorphism sites and to compare expression patterns in different tissues or pathologies states. Information strategies have been devised to query EST databases. Since most of the analysis is performed with a computer, the term "in silico" strategy has been coined. In this chapter we will review the current status of EST databases, the pros and cons of EST-type data and describe possible strategies to retrieve meaningful information.

  3. Mining and gene ontology based annotation of SSR markers from expressed sequence tags of Humulus lupulus

    Science.gov (United States)

    Singh, Swati; Gupta, Sanchita; Mani, Ashutosh; Chaturvedi, Anoop

    2012-01-01

    Humulus lupulus is commonly known as hops, a member of the family moraceae. Currently many projects are underway leading to the accumulation of voluminous genomic and expressed sequence tag sequences in public databases. The genetically characterized domains in these databases are limited due to non-availability of reliable molecular markers. The large data of EST sequences are available in hops. The simple sequence repeat markers extracted from EST data are used as molecular markers for genetic characterization, in the present study. 25,495 EST sequences were examined and assembled to get full-length sequences. Maximum frequency distribution was shown by mononucleotide SSR motifs i.e. 60.44% in contig and 62.16% in singleton where as minimum frequency are observed for hexanucleotide SSR in contig (0.09%) and pentanucleotide SSR in singletons (0.12%). Maximum trinucleotide motifs code for Glutamic acid (GAA) while AT/TA were the most frequent repeat of dinucleotide SSRs. Flanking primer pairs were designed in-silico for the SSR containing sequences. Functional categorization of SSRs containing sequences was done through gene ontology terms like biological process, cellular component and molecular function. PMID:22368382

  4. Expressed sequence tags as a tool for phylogenetic analysis of placental mammal evolution.

    Directory of Open Access Journals (Sweden)

    Morgan Kullberg

    Full Text Available BACKGROUND: We investigate the usefulness of expressed sequence tags, ESTs, for establishing divergences within the tree of placental mammals. This is done on the example of the established relationships among primates (human, lagomorphs (rabbit, rodents (rat and mouse, artiodactyls (cow, carnivorans (dog and proboscideans (elephant. METHODOLOGY/PRINCIPAL FINDINGS: We have produced 2000 ESTs (1.2 mega bases from a marsupial mouse and characterized the data for their use in phylogenetic analysis. The sequences were used to identify putative orthologous sequences from whole genome projects. Although most ESTs stem from single sequence reads, the frequency of potential sequencing errors was found to be lower than allelic variation. Most of the sequences represented slowly evolving housekeeping-type genes, with an average amino acid distance of 6.6% between human and mouse. Positive Darwinian selection was identified at only a few single sites. Phylogenetic analyses of the EST data yielded trees that were consistent with those established from whole genome projects. CONCLUSIONS: The general quality of EST sequences and the general absence of positive selection in these sequences make ESTs an attractive tool for phylogenetic analysis. The EST approach allows, at reasonable costs, a fast extension of data sampling from species outside the genome projects.

  5. Expression sequence tag library derived from peripheral blood mononuclear cells of the chlorocebus sabaeus

    Directory of Open Access Journals (Sweden)

    Tchitchek Nicolas

    2012-06-01

    Full Text Available Abstract Background African Green Monkeys (AGM are amongst the most frequently used nonhuman primate models in clinical and biomedical research, nevertheless only few genomic resources exist for this species. Such information would be essential for the development of dedicated new generation technologies in fundamental and pre-clinical research using this model, and would deliver new insights into primate evolution. Results We have exhaustively sequenced an Expression Sequence Tag (EST library made from a pool of Peripheral Blood Mononuclear Cells from sixteen Chlorocebus sabaeus monkeys. Twelve of them were infected with the Simian Immunodeficiency Virus. The mononuclear cells were or not stimulated in vitro with Concanavalin A, with lipopolysacharrides, or through mixed lymphocyte reaction in order to generate a representative and broad library of expressed sequences in immune cells. We report here 37,787 sequences, which were assembled into 14,410 contigs representing an estimated 12% of the C. sabaeus transcriptome. Using data from primate genome databases, 9,029 assembled sequences from C. sabaeus could be annotated. Sequences have been systematically aligned with ten cDNA references of primate species including Homo sapiens, Pan troglodytes, and Macaca mulatta to identify ortholog transcripts. For 506 transcripts, sequences were quasi-complete. In addition, 6,576 transcript fragments are potentially specific to the C. sabaeus or corresponding to not yet described primate genes. Conclusions The EST library we provide here will prove useful in gene annotation efforts for future sequencing of the African Green Monkey genomes. Furthermore, this library, which particularly well represents immunological and hematological gene expression, will be an important resource for the comparative analysis of gene expression in clinically relevant nonhuman primate and human research.

  6. Abiotic Stress-Related Expressed Sequence Tags from the Diploid Strawberry Fragaria vesca f. semperflorens

    Directory of Open Access Journals (Sweden)

    Maximo. Rivarola

    2011-03-01

    Full Text Available Strawberry ( spp. is a eudicotyledonous plant that belongs to the Rosaceae family, which includes other agronomically important plants such as raspberry ( L. and several tree-fruit species. Despite the vital role played by cultivated strawberry in agriculture, few stress-related gene expression characterizations of this crop are available. To increase the diversity of available transcriptome sequence, we produced 41,430 L. expressed sequence tags (ESTs from plants growing under water-, temperature-, and osmotic-stress conditions as well as a combination of heat and osmotic stresses that is often found in irrigated fields. Clustering and assembling of the ESTs resulted in a total of 11,836 contigs and singletons that were annotated using Gene Ontology (GO terms. Furthermore, over 1200 sequences with no match to available Rosaceae ESTs were found, including six that were assigned the “response to stress” GO category. Analysis of EST frequency provided an estimate of steady state transcript levels, with 91 sequences exhibiting at least a 20-fold difference between treatments. This EST collection represents a useful resource to advance our understanding of the abiotic stress-response mechanisms in strawberry. The sequence information may be translated to valuable tree crops in the Rosaceae family, where whole-plant treatments are not as simple or practical.

  7. Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm Eisenia fetida

    Science.gov (United States)

    Pirooznia, Mehdi; Gong, Ping; Guan, Xin; Inouye, Laura S; Yang, Kuan; Perkins, Edward J; Deng, Youping

    2007-01-01

    Background Eisenia fetida, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two E. fetida libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR. Results A total of 3144 good quality ESTs (GenBank dbEST accession number EH669363–EH672369 and EL515444–EL515580) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms Drosophila melanogaster, Mus musculus, Saccharomyces cerevisiae, and Caenorhabditis elegans. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2. Conclusion The ESTMD containing the sequence and annotation information of 4032 E. fetida ESTs is publicly accessible at . PMID:18047730

  8. Analysis and functional annotation of expressed sequence tags from the fall armyworm Spodoptera frugiperda

    Science.gov (United States)

    Deng, Youping; Dong, Yinghua; Thodima, Venkata; Clem, Rollie J; Passarelli, A Lorena

    2006-01-01

    Background Little is known about the genome sequences of lepidopteran insects, although this group of insects has been studied extensively in the fields of endocrinology, development, immunity, and pathogen-host interactions. In addition, cell lines derived from Spodoptera frugiperda and other lepidopteran insects are routinely used for baculovirus foreign gene expression. This study reports the results of an expressed sequence tag (EST) sequencing project in cells from the lepidopteran insect S. frugiperda, the fall armyworm. Results We have constructed an EST database using two cDNA libraries from the S. frugiperda-derived cell line, SF-21. The database consists of 2,367 ESTs which were assembled into 244 contigs and 951 singlets for a total of 1,195 unique sequences. Conclusion S. frugiperda is an agriculturally important pest insect and genomic information will be instrumental for establishing initial transcriptional profiling and gene function studies, and for obtaining information about genes manipulated during infections by insect pathogens such as baculoviruses. PMID:17052344

  9. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

    Directory of Open Access Journals (Sweden)

    Gao Zhihong

    2010-07-01

    Full Text Available Abstract Background Expressed Sequence Tag (EST has been a cost-effective tool in molecular biology and represents an abundant valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 expressed sequence tag (EST sequences with high quality. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons and that have been deposited in NCBI (accession IDs: GW868575 - GW873047, among which 1,294 unique ESTs were with known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65% and low in the peach (46%, and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.

  10. Comparative Genomics in Switchgrass Using 61,585 High-Quality Expressed Sequence Tags

    Directory of Open Access Journals (Sweden)

    Christian M. Tobias

    2008-11-01

    Full Text Available The development of genomic resources for switchgrass ( L., a perennial NAD-malic enzyme type C grass, is required to enable molecular breeding and biotechnological approaches for improving its value as a forage and bioenergy crop. Expressed sequence tag (EST sequencing is one method that can quickly sample gene inventories and produce data suitable for marker development or analysis of tissue-specific patterns of expression. Toward this goal, three cDNA libraries from callus, crown, and seedling tissues of ‘Kanlow’ switchgrass were end-sequenced to generate a total of 61,585 high-quality ESTs from 36,565 separate clones. Seventy-three percent of the assembled consensus sequences could be aligned with the sorghum [ (L. Moench] genome at a -value of <1 × 10, indicating a high degree of similarity. Sixty-five percent of the ESTs matched with gene ontology molecular terms, and 3.3% of the sequences were matched with genes that play potential roles in cell-wall biogenesis. The representation in the three libraries of gene families known to be associated with C photosynthesis, cellulose and β-glucan synthesis, phenylpropanoid biosynthesis, and peroxidase activity indicated likely roles for individual family members. Pairwise comparisons of synonymous codon substitutions were used to assess genome sequence diversity and indicated an overall similarity between the two genome copies present in the tetraploid. Identification of EST–simple sequence repeat markers and amplification on two individual parents of a mapping population yielded an average of 2.18 amplicons per individual, and 35% of the markers produced fragment length polymorphisms.

  11. Expressed sequence tag analysis of functional genes associated with adventitious rooting in Liriodendron hybrids.

    Science.gov (United States)

    Zhong, Y D; Sun, X Y; Liu, E Y; Li, Y Q; Gao, Z; Yu, F X

    2016-06-24

    Liriodendron hybrids (Liriodendron chinense x L. tulipifera) are important landscaping and afforestation hardwood trees. To date, little genomic research on adventitious rooting has been reported in these hybrids, as well as in the genus Liriodendron. In the present study, we used adventitious roots to construct the first cDNA library for Liriodendron hybrids. A total of 5176 expressed sequence tags (ESTs) were generated and clustered into 2921 unigenes. Among these unigenes, 2547 had significant homology to the non-redundant protein database representing a wide variety of putative functions. Homologs of these genes regulated many aspects of adventitious rooting, including those for auxin signal transduction and root hair development. Results of quantitative real-time polymerase chain reaction showed that AUX1, IRE, and FB1 were highly expressed in adventitious roots and the expression of AUX1, ARF1, NAC1, RHD1, and IRE increased during the development of adventitious roots. Additionally, 181 simple sequence repeats were identified from 166 ESTs and more than 91.16% of these were dinucleotide and trinucleotide repeats. To the best of our knowledge, the present study reports the identification of the genes associated with adventitious rooting in the genus Liriodendron for the first time and provides a valuable resource for future genomic studies. Expression analysis of selected genes could allow us to identify regulatory genes that may be essential for adventitious rooting.

  12. Expressed sequence tag-derived microsatellite markers of perennial ryegrass (Lolium perenne L.)

    DEFF Research Database (Denmark)

    Studer, Bruno; Asp, Torben; Frei, Ursula

    2008-01-01

    An expressed sequence tag (EST) library of the key grassland species perennial ryegrass (Lolium perenne L.) has been exploited as a resource for microsatellite marker development. Out of 955 simple sequence repeat (SSR) containing ESTs, 744 were used for primer design. Primer amplification was te...

  13. Genomic analysis of expressed sequence tags in American black bear Ursus americanus.

    Science.gov (United States)

    Zhao, Sen; Shao, Chunxuan; Goropashnaya, Anna V; Stewart, Nathan C; Xu, Yichi; Tøien, Øivind; Barnes, Brian M; Fedorov, Vadim B; Yan, Jun

    2010-03-26

    Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes.

  14. Genomic analysis of expressed sequence tags in American black bear Ursus americanus

    Science.gov (United States)

    2010-01-01

    Background Species of the bear family (Ursidae) are important organisms for research in molecular evolution, comparative physiology and conservation biology, but relatively little genetic sequence information is available for this group. Here we report the development and analyses of the first large scale Expressed Sequence Tag (EST) resource for the American black bear (Ursus americanus). Results Comprehensive analyses of molecular functions, alternative splicing, and tissue-specific expression of 38,757 black bear EST sequences were conducted using the dog genome as a reference. We identified 18 genes, involved in functions such as lipid catabolism, cell cycle, and vesicle-mediated transport, that are showing rapid evolution in the bear lineage Three genes, Phospholamban (PLN), cysteine glycine-rich protein 3 (CSRP3) and Troponin I type 3 (TNNI3), are related to heart contraction, and defects in these genes in humans lead to heart disease. Two genes, biphenyl hydrolase-like (BPHL) and CSRP3, contain positively selected sites in bear. Global analysis of evolution rates of hibernation-related genes in bear showed that they are largely conserved and slowly evolving genes, rather than novel and fast-evolving genes. Conclusion We provide a genomic resource for an important mammalian organism and our study sheds new light on the possible functions and evolution of bear genes. PMID:20338065

  15. An expressed sequence tag (EST) data mining strategy succeeding in the discovery of new G-protein coupled receptors.

    Science.gov (United States)

    Wittenberger, T; Schaller, H C; Hellebrand, S

    2001-03-30

    We have developed a comprehensive expressed sequence tag database search method and used it for the identification of new members of the G-protein coupled receptor superfamily. Our approach proved to be especially useful for the detection of expressed sequence tag sequences that do not encode conserved parts of a protein, making it an ideal tool for the identification of members of divergent protein families or of protein parts without conserved domain structures in the expressed sequence tag database. At least 14 of the expressed sequence tags found with this strategy are promising candidates for new putative G-protein coupled receptors. Here, we describe the sequence and expression analysis of five new members of this receptor superfamily, namely GPR84, GPR86, GPR87, GPR90 and GPR91. We also studied the genomic structure and chromosomal localization of the respective genes applying in silico methods. A cluster of six closely related G-protein coupled receptors was found on the human chromosome 3q24-3q25. It consists of four orphan receptors (GPR86, GPR87, GPR91, and H963), the purinergic receptor P2Y1, and the uridine 5'-diphosphoglucose receptor KIAA0001. It seems likely that these receptors evolved from a common ancestor and therefore might have related ligands. In conclusion, we describe a data mining procedure that proved to be useful for the identification and first characterization of new genes and is well applicable for other gene families. Copyright 2001 Academic Press.

  16. Expressed sequence tags related to nitrogen metabolism in maize inoculated with Azospirillum brasilense.

    Science.gov (United States)

    Pereira-Defilippi, L; Pereira, E M; Silva, F M; Moro, G V

    2017-05-31

    The relative quantitative real-time expression of two expressed sequence tags (ESTs) codifying for key enzymes in nitrogen metabolism in maize, nitrate reductase (ZmNR), and glutamine synthetase (ZmGln1-3) was performed for genotypes inoculated with Azospirillum brasilense. Two commercial single-cross hybrids (AG7098 and 2B707) and two experimental synthetic varieties (V2 and V4) were raised under controlled greenhouse conditions, in six treatment groups corresponding to different forms of inoculation and different levels of nitrogen application by top-dressing. The genotypes presented distinct responses to inoculation with A. brasilense. Increases in the expression of ZmNR were observed for the hybrids, while V4 only displayed a greater level of expression when the plants received nitrogenous fertilization by top-dressing and there was no inoculation. The expression of the ZmGln1-3EST was induced by A. brasilense in the hybrids and the variety V4. In contrast, the variety V2 did not respond to inoculation.

  17. Characterization of genic microsatellite markers derived from expressed sequence tags in Pacific abalone ( Haliotis discus hannai)

    Science.gov (United States)

    Li, Qi; Shu, Jing; Zhao, Cui; Liu, Shikai; Kong, Lingfeng; Zheng, Xiaodong

    2010-01-01

    Simple sequence repeat (SSR) markers were developed from the expressed sequence tags (ESTs) of Pacific abalone ( Haliotis discus hannai). Repeat motifs were found in 4.95% of the ESTs at a frequency of one repeat every 10.04 kb of EST sequences, after redundancy elimination. Seventeen polymorphic EST-SSRs were developed. The number of alleles per locus varied from 2-17, with an average of 6.8 alleles per locus. The expected and observed heterozygosities ranged from 0.159 to 0.928 and from 0.132 to 0.922, respectively. Twelve of the 17 loci (70.6%) were successfully amplified in H. diversicolor. Seventeen loci segregated in three families, with three showing the presence of null alleles (17.6%). The adequate level of variability and low frequency of null alleles observed in H. discus hannai, together with the high rate of transportability across Haliotis species, make this set of EST-SSR markers an important tool for comparative mapping, marker-assisted selection, and evolutionary studies, not only in the Pacific abalone, but also in related species.

  18. Analysis of expressed sequence tags of the cyclically parthenogenetic rotifer Brachionus plicatilis.

    Directory of Open Access Journals (Sweden)

    Koushirou Suga

    Full Text Available BACKGROUND: Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. METHODOLOGY/PRINCIPAL FINDINGS: We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. CONCLUSIONS/SIGNIFICANCE: Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript

  19. Analysis of expressed sequence tags of the cyclically parthenogenetic rotifer Brachionus plicatilis.

    Science.gov (United States)

    Suga, Koushirou; Welch, David Mark; Tanaka, Yukari; Sakakura, Yoshitaka; Hagiwara, Atsushi

    2007-08-01

    Rotifers are among the most common non-arthropod animals and are the most experimentally tractable members of the basal assemblage of metazoan phyla known as Gnathifera. The monogonont rotifer Brachionus plicatilis is a developing model system for ecotoxicology, aquatic ecology, cryptic speciation, and the evolution of sex, and is an important food source for finfish aquaculture. However, basic knowledge of the genome and transcriptome of any rotifer species has been lacking. We generated and partially sequenced a cDNA library from B. plicatilis and constructed a database of over 2300 expressed sequence tags corresponding to more than 450 transcripts. About 20% of the transcripts had no significant similarity to database sequences by BLAST; most of these contained open reading frames of significant length but few had recognized Pfam motifs. Sixteen transcripts accounted for 25% of the ESTs; four of these had no significant similarity to BLAST or Pfam databases. Putative up- and downstream untranslated regions are relatively short and AT rich. In contrast to bdelloid rotifers, there was no evidence of a conserved trans-spliced leader sequence among the transcripts and most genes were single-copy. Despite the small size of this EST project it revealed several important features of the rotifer transcriptome and of individual monogonont genes. Because there is little genomic data for Gnathifera, the transcripts we found with no known function may represent genes that are species-, class-, phylum- or even superphylum-specific; the fact that some are among the most highly expressed indicates their importance. The absence of trans-spliced leader exons in this monogonont species contrasts with their abundance in bdelloid rotifers and indicates that the presence of this phenomenon can vary at the subphylum level. Our EST database provides a relatively large quantity of transcript-level data for B. plicatilis, and more generally of rotifers and other gnathiferan phyla, and

  20. A wing expressed sequence tag resource for Bicyclus anynana butterflies, an evo-devo model

    Directory of Open Access Journals (Sweden)

    Gruber Jonathan D

    2006-05-01

    Full Text Available Abstract Background Butterfly wing color patterns are a key model for integrating evolutionary developmental biology and the study of adaptive morphological evolution. Yet, despite the biological, economical and educational value of butterflies they are still relatively under-represented in terms of available genomic resources. Here, we describe an Expression Sequence Tag (EST project for Bicyclus anynana that has identified the largest available collection to date of expressed genes for any butterfly. Results By targeting cDNAs from developing wings at the stages when pattern is specified, we biased gene discovery towards genes potentially involved in pattern formation. Assembly of 9,903 ESTs from a subtracted library allowed us to identify 4,251 genes of which 2,461 were annotated based on BLAST analyses against relevant gene collections. Gene prediction software identified 2,202 peptides, of which 215 longer than 100 amino acids had no homology to any known proteins and, thus, potentially represent novel or highly diverged butterfly genes. We combined gene and Single Nucleotide Polymorphism (SNP identification by constructing cDNA libraries from pools of outbred individuals, and by sequencing clones from the 3' end to maximize alignment depth. Alignments of multi-member contigs allowed us to identify over 14,000 putative SNPs, with 316 genes having at least one high confidence double-hit SNP. We furthermore identified 320 microsatellites in transcribed genes that can potentially be used as genetic markers. Conclusion Our project was designed to combine gene and sequence polymorphism discovery and has generated the largest gene collection available for any butterfly and many potential markers in expressed genes. These resources will be invaluable for exploring the potential of B. anynana in particular, and butterflies in general, as models in ecological, evolutionary, and developmental genetics.

  1. Identification and validation of Asteraceae miRNAs by the expressed sequence tag analysis.

    Science.gov (United States)

    Monavar Feshani, Aboozar; Mohammadi, Saeed; Frazier, Taylor P; Abbasi, Abbas; Abedini, Raha; Karimi Farsad, Laleh; Ehya, Farveh; Salekdeh, Ghasem Hosseini; Mardi, Mohsen

    2012-02-10

    MicroRNAs (miRNAs) are small non-coding RNA molecules that play a vital role in the regulation of gene expression. Despite their identification in hundreds of plant species, few miRNAs have been identified in the Asteraceae, a large family that comprises approximately one tenth of all flowering plants. In this study, we used the expressed sequence tag (EST) analysis to identify potential conserved miRNAs and their putative target genes in the Asteraceae. We applied quantitative Real-Time PCR (qRT-PCR) to confirm the expression of eight potential miRNAs in Carthamus tinctorius and Helianthus annuus. We also performed qRT-PCR analysis to investigate the differential expression pattern of five newly identified miRNAs during five different cotyledon growth stages in safflower. Using these methods, we successfully identified and characterized 151 potentially conserved miRNAs, belonging to 26 miRNA families, in 11 genus of Asteraceae. EST analysis predicted that the newly identified conserved Asteraceae miRNAs target 130 total protein-coding ESTs in sunflower and safflower, as well as 433 additional target genes in other plant species. We experimentally confirmed the existence of seven predicted miRNAs, (miR156, miR159, miR160, miR162, miR166, miR396, and miR398) in safflower and sunflower seedlings. We also observed that five out of eight miRNAs are differentially expressed during cotyledon development. Our results indicate that miRNAs may be involved in the regulation of gene expression during seed germination and the formation of the cotyledons in the Asteraceae. The findings of this study might ultimately help in the understanding of miRNA-mediated gene regulation in important crop species. Copyright © 2011 Elsevier B.V. All rights reserved.

  2. Population structure of pigs determined by single nucleotide polymorphisms observed in assembled expressed sequence tags.

    Science.gov (United States)

    Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi

    2012-01-01

    We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values(,) the clustering, based on the Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.

  3. Generation, analysis and functional annotation of expressed sequence tags from the ectoparasitic mite Psoroptes ovis

    Directory of Open Access Journals (Sweden)

    Kenyon Fiona

    2011-07-01

    Full Text Available Abstract Background Sheep scab is caused by Psoroptes ovis and is arguably the most important ectoparasitic disease affecting sheep in the UK. The disease is highly contagious and causes and considerable pruritis and irritation and is therefore a major welfare concern. Current methods of treatment are unsustainable and in order to elucidate novel methods of disease control a more comprehensive understanding of the parasite is required. To date, no full genomic DNA sequence or large scale transcript datasets are available and prior to this study only 484 P. ovis expressed sequence tags (ESTs were accessible in public databases. Results In order to further expand upon the transcriptomic coverage of P. ovis thus facilitating novel insights into the mite biology we undertook a larger scale EST approach, incorporating newly generated and previously described P. ovis transcript data and representing the largest collection of P. ovis ESTs to date. We sequenced 1,574 ESTs and assembled these along with 484 previously generated P. ovis ESTs, which resulted in the identification of 1,545 unique P. ovis sequences. BLASTX searches identified 961 ESTs with significant hits (E-value P. ovis ESTs. Gene Ontology (GO analysis allowed the functional annotation of 880 ESTs and included predictions of signal peptide and transmembrane domains; allowing the identification of potential P. ovis excreted/secreted factors, and mapping of metabolic pathways. Conclusions This dataset currently represents the largest collection of P. ovis ESTs, all of which are publicly available in the GenBank EST database (dbEST (accession numbers FR748230 - FR749648. Functional analysis of this dataset identified important homologues, including house dust mite allergens and tick salivary factors. These findings offer new insights into the underlying biology of P. ovis, facilitating further investigations into mite biology and the identification of novel methods of intervention.

  4. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

    Directory of Open Access Journals (Sweden)

    Shahin Arwa

    2012-11-01

    Full Text Available Abstract Background Bulbous flowers such as lily and tulip (Liliaceae family are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups and among the three monocot species: lily, tulip, and rice (6,900 groups were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions

  5. Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

    Science.gov (United States)

    Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

    2012-11-20

    Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable

  6. Expressed sequence tag-derived polymorphic SSR markers for Fucus serratus and amplification in other species of Fucus

    NARCIS (Netherlands)

    Coyer, J. A.; Hoarau, G.; Beszteri, B.; Pearson, G.; Olsen, J. L.

    The seaweed genus Fucus is a dominant component of intertidal shores throughout the North Atlantic and North Pacific and has been the focus of considerable developmental, ecological, and evolutionary research for the past century. Here, we present details of 21 expressed sequence tag-derived simple

  7. Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants.

    Science.gov (United States)

    Rodovalho, Cynara M; Ferro, Milene; Fonseca, Fernando Pp; Antonio, Erik A; Guilherme, Ivan R; Henrique-Silva, Flávio; Bacci, Maurício

    2011-06-17

    Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.

  8. Expressed Sequence Tag-Simple Sequence Repeat (EST-SSR Marker Resources for Diversity Analysis of Mango (Mangifera indica L.

    Directory of Open Access Journals (Sweden)

    Natalie L. Dillon

    2014-01-01

    Full Text Available In this study, a collection of 24,840 expressed sequence tags (ESTs generated from five mango (Mangifera indica L. cDNA libraries was mined for EST-based simple sequence repeat (SSR markers. Over 1,000 ESTs with SSR motifs were detected from more than 24,000 EST sequences with di- and tri-nucleotide repeat motifs the most abundant. Of these, 25 EST-SSRs in genes involved in plant development, stress response, and fruit color and flavor development pathways were selected, developed into PCR markers and characterized in a population of 32 mango selections including M. indica varieties, and related Mangifera species. Twenty-four of the 25 EST-SSR markers exhibited polymorphisms, identifying a total of 86 alleles with an average of 5.38 alleles per locus, and distinguished between all Mangifera selections. Private alleles were identified for Mangifera species. These newly developed EST-SSR markers enhance the current 11 SSR mango genetic identity panel utilized by the Australian Mango Breeding Program. The current panel has been used to identify progeny and parents for selection and the application of this extended panel will further improve and help to design mango hybridization strategies for increased breeding efficiency.

  9. Single nucleotide polymorphism discovery from expressed sequence tags in the waterflea Daphnia magna

    Directory of Open Access Journals (Sweden)

    Souche Erika L

    2011-06-01

    Full Text Available Abstract Background Daphnia (Crustacea: Cladocera plays a central role in standing aquatic ecosystems, has a well known ecology and is widely used in population studies and environmental risk assessments. Daphnia magna is, especially in Europe, intensively used to study stress responses of natural populations to pollutants, climate change, and antagonistic interactions with predators and parasites, which have all been demonstrated to induce micro-evolutionary and adaptive responses. Although its ecology and evolutionary biology is intensively studied, little is known on the functional genomics underpinning of phenotypic responses to environmental stressors. The aim of the present study was to find genes expressed in presence of environmental stressors, and target such genes for single nucleotide polymorphic (SNP marker development. Results We developed three expressed sequence tag (EST libraries using clonal lineages of D. magna exposed to ecological stressors, namely fish predation, parasite infection and pesticide exposure. We used these newly developed ESTs and other Daphnia ESTs retrieved from NCBI GeneBank to mine for SNP markers targeting synonymous as well as non synonymous genetic variation. We validate the developed SNPs in six natural populations of D. magna distributed at regional scale. Conclusions A large proportion (47% of the produced ESTs are Daphnia lineage specific genes, which are potentially involved in responses to environmental stress rather than to general cellular functions and metabolic activities, or reflect the arthropod's aquatic lifestyle. The characterization of genes expressed under stress and the validation of their SNPs for population genetic study is important for identifying ecologically responsive genes in D. magna.

  10. Identification of candidates for cyclotide biosynthesis and cyclisation by expressed sequence tag analysis of Oldenlandia affinis

    Directory of Open Access Journals (Sweden)

    Suda Jan

    2010-02-01

    Full Text Available Abstract Background Cyclotides are a family of circular peptides that exhibit a range of biological activities, including anti-bacterial, cytotoxic, anti-HIV activities, and are proposed to function in plant defence. Their high stability has motivated their development as scaffolds for the stabilisation of peptide drugs. Oldenlandia affinis is a member of the Rubiaceae (coffee family from which 18 cyclotides have been sequenced to date, but the details of their processing from precursor proteins have only begun to be elucidated. To increase the speed at which genes involved in cyclotide biosynthesis and processing are being discovered, an expressed sequence tag (EST project was initiated to survey the transcript profile of O. affinis and to propose some future directions of research on in vivo protein cyclisation. Results Using flow cytometry the holoploid genome size (1C-value of O. affinis was estimated to be 4,210 - 4,284 Mbp, one of the largest genomes of the Rubiaceae family. High-quality ESTs were identified, 1,117 in total, from leaf cDNAs and assembled into 502 contigs, comprising 202 consensus sequences and 300 singletons. ESTs encoding the cyclotide precursors for kalata B1 (Oak1 and kalata B2 (Oak4 were among the 20 most abundant ESTs. In total, 31 ESTs encoded cyclotide precursors, representing a distinct commitment of 2.8% of the O. affinis transcriptome to cyclotide biosynthesis. The high expression levels of cyclotide precursor transcripts are consistent with the abundance of mature cyclic peptides in O. affinis. A new cyclotide precursor named Oak5 was isolated and represents the first cDNA for the bracelet class of cyclotides in O. affinis. Clones encoding enzymes potentially involved in processing cyclotides were also identified and include enzymes involved in oxidative folding and proteolytic processing. Conclusion The EST library generated in this study provides a valuable resource for the study of the cyclisation of plant

  11. Gene discovery and transcript analyses in the corn smut pathogen Ustilago maydis: expressed sequence tag and genome sequence comparison

    Directory of Open Access Journals (Sweden)

    Saville Barry J

    2007-09-01

    Full Text Available Abstract Background Ustilago maydis is the basidiomycete fungus responsible for common smut of corn and is a model organism for the study of fungal phytopathogenesis. To aid in the annotation of the genome sequence of this organism, several expressed sequence tag (EST libraries were generated from a variety of U. maydis cell types. In addition to utility in the context of gene identification and structure annotation, the ESTs were analyzed to identify differentially abundant transcripts and to detect evidence of alternative splicing and anti-sense transcription. Results Four cDNA libraries were constructed using RNA isolated from U. maydis diploid teliospores (U. maydis strains 518 × 521 and haploid cells of strain 521 grown under nutrient rich, carbon starved, and nitrogen starved conditions. Using the genome sequence as a scaffold, the 15,901 ESTs were assembled into 6,101 contiguous expressed sequences (contigs; among these, 5,482 corresponded to predicted genes in the MUMDB (MIPS Ustilago maydis database, while 619 aligned to regions of the genome not yet designated as genes in MUMDB. A comparison of EST abundance identified numerous genes that may be regulated in a cell type or starvation-specific manner. The transcriptional response to nitrogen starvation was assessed using RT-qPCR. The results of this suggest that there may be cross-talk between the nitrogen and carbon signalling pathways in U. maydis. Bioinformatic analysis identified numerous examples of alternative splicing and anti-sense transcription. While intron retention was the predominant form of alternative splicing in U. maydis, other varieties were also evident (e.g. exon skipping. Selected instances of both alternative splicing and anti-sense transcription were independently confirmed using RT-PCR. Conclusion Through this work: 1 substantial sequence information has been provided for U. maydis genome annotation; 2 new genes were identified through the discovery of 619

  12. Sugarcane expressed sequences tags (ESTs encoding enzymes involved in lignin biosynthesis pathways

    Directory of Open Access Journals (Sweden)

    Ramos Rose Lucia Braz

    2001-01-01

    Full Text Available Lignins are phenolic polymers found in the secondary wall of plant conductive systems where they play an important role by reducing the permeability of the cell wall to water. Lignins are also responsible for the rigidity of the cell wall and are involved in mechanisms of resistance to pathogens. The metabolic routes and enzymes involved in synthesis of lignins have been largely characterized and representative genes that encode enzymes involved in these processes have been cloned from several plant species. The synthesis of lignins is liked to the general metabolism of the phenylpropanoids in plants, having enzymes (e.g. phenylalanine ammonia-lyase (PAL, cinnamate 4-hydroxylase (C4H and caffeic acid O-methyltransferase (COMT common to other processes as well as specific enzymes such as cinnamoyl-CoA reductase (CCR and cinnamyl alcohol dehydrogenase (CAD. Some maize and sorghum mutants, shown to have defective in CAD and/or COMT activity, are easier to digest because they have a reduced lignin content, something which has motivated different research groups to alter the lignin content and composition of model plants by genetic engineering try to improve, for example, the efficiency of paper pulping and digestibility. In the work reported in this paper, we have made an inventory of the sugarcane expressed sequence tag (EST coding for enzymes involved in lignin metabolism which are present in the sugarcane EST genome project (SUCEST database. Our analysis focused on the key enzymes ferulate-5-hydroxylase (F5H, caffeic acid O-methyltransferase (COMT, caffeoyl CoA O-methyltransferase (CCoAOMT, hydroxycinnamate CoA ligase (4CL, cinnamoyl-CoA reductase (CCR and cinnamyl alcohol dehydrogenase (CAD. The comparative analysis of these genes with those described in other species could be used as molecular markers for breeding as well as for the manipulation of lignin metabolism in sugarcane.

  13. Large-scale Identification of Expressed Sequence Tags (ESTs from Nicotianatabacum by Normalized cDNA Library Sequencing

    Directory of Open Access Journals (Sweden)

    Alvarez S Perez

    2014-12-01

    Full Text Available An expressed sequence tags (EST resource for tobacco plants (Nicotianatabacum was established using high-throughput sequencing of randomly selected clones from one cDNA library representing a range of plant organs (leaf, stem, root and root base. Over 5000 ESTs were generated from the 3’ ends of 8000 clones, analyzed by BLAST searches and categorized functionally. All annotated ESTs were classified into 18 functional categories, unique transcripts involved in energy were the largest group accounting for 831 (32.32% of the annotated ESTs. After excluding 2450 non-significant tentative unique transcripts (TUTs, 100 unique sequences (1.67% of total TUTs were identified from the N. tabacum database. In the array result two genes strongly related to the tobacco mosaic virus (TMV were obtained, one basic form of pathogenesis-related protein 1 precursor (TBT012G08 and ubiquitin (TBT087G01. Both of them were found in the variety Hongda, some other important genes were classified into two groups, one of these implicated in plant development like those genes related to a photosynthetic process (chlorophyll a-b binding protein, photosystem I, ferredoxin I and III, ATP synthase and a further group including genes related to plant stress response (ubiquitin, ubiquitin-like protein SMT3, glycine-rich RNA binding protein, histones and methallothionein. The interesting finding in this study is that two of these genes have never been reported before in N. tabacum (ubiquitin-like protein SMT3 and methallothionein. The array results were confirmed using quantitative PCR.

  14. Classification, expression pattern and comparative analysis of sugarcane expressed sequences tags (ESTs encoding glycine-rich proteins (GRPs

    Directory of Open Access Journals (Sweden)

    Fusaro Adriana

    2001-01-01

    Full Text Available Since the isolation of the first glycine-rich proteins (GRPs in plants a wealth of new GRPs have been identified. The highly specific but diverse expression pattern of grp genes, taken together with the distinct sub-cellular localization of some GRP groups, clearly indicate that these proteins are involved in several independent physiological processes. Notwithstanding the absence of a clear definition of the role of GRPs in plant cells, studies conducted with these proteins have provided new and interesting insights into the molecular biology and cell biology of plants. Complexly regulated promoters and distinct mechanisms for the regulation of gene expression have been demonstrated and new protein targeting pathways, as well as the exportation of GRPs from different cell types have been discovered. These data show that GRPs can be useful as markers and/or models to understand distinct aspects of plant biology. In this paper, the structural and functional features of these proteins in sugarcane (Saccharum officinarum L. are summarized. Since this is the first description of GRPs in sugarcane, special emphasis has been given to the expression pattern of these GRP genes by studying their abundance and prevalence in the different cDNA-libraries of the Sugarcane Expressed Sequence Tag (SUCEST project . The comparison of sugarcane GRPs with GRPs from other species is also discussed.

  15. DNA repair-related genes in sugarcane expressed sequence tags (ESTs

    Directory of Open Access Journals (Sweden)

    R.M.A. Costa

    2001-12-01

    Full Text Available There is much interest in the identification and characterization of genes involved in DNA repair because of their importance in the maintenance of the genome integrity. The high level of conservation of DNA repair genes means that these genetic elements may be used in phylogenetic studies as a source of information on the genetic origin and evolution of species. The mechanisms by which damaged DNA is repaired are well understood in bacteria, yeast and mammals, but much remains to be learned as regards plants. We identified genes involved in DNA repair mechanisms in sugarcane using a similarity search of the Brazilian Sugarcane Expressed Sequence Tag (SUCEST database against known sequences deposited in other public databases (National Center of Biotechnology Information (NCBI database and the Munich Information Center for Protein Sequences (MIPS Arabidopsis thaliana database. This search revealed that most of the various proteins involved in DNA repair in sugarcane are similar to those found in other eukaryotes. However, we also identified certain intriguing features found only in plants, probably due to the independent evolution of this kingdom. The DNA repair mechanisms investigated include photoreactivation, base excision repair, nucleotide excision repair, mismatch repair, non-homologous end joining, homologous recombination repair and DNA lesion tolerance. We report the main differences found in the DNA repair machinery in plant cells as compared to other organisms. These differences point to potentially different strategies plants employ to deal with DNA damage, that deserve further investigation.A identificação e caracterização de genes envolvidos com reparo de DNA são de grande interesse, dada a sua importância na manutenção da integridade genômica. Além disso, a alta conservação dos genes de reparo de DNA faz com que possam ser utilizados como fonte de informação no que diz respeito à origem e evolução das esp

  16. The First Molecular Identification of an Olive Collection Applying Standard Simple Sequence Repeats and Novel Expressed Sequence Tag Markers.

    Science.gov (United States)

    Mousavi, Soraya; Mariotti, Roberto; Regni, Luca; Nasini, Luigi; Bufacchi, Marina; Pandolfi, Saverio; Baldoni, Luciana; Proietti, Primo

    2017-01-01

    Germplasm collections of tree crop species represent fundamental tools for conservation of diversity and key steps for its characterization and evaluation. For the olive tree, several collections were created all over the world, but only few of them have been fully characterized and molecularly identified. The olive collection of Perugia University (UNIPG), established in the years' 60, represents one of the first attempts to gather and safeguard olive diversity, keeping together cultivars from different countries. In the present study, a set of 370 olive trees previously uncharacterized was screened with 10 standard simple sequence repeats (SSRs) and nine new EST-SSR markers, to correctly and thoroughly identify all genotypes, verify their representativeness of the entire cultivated olive variation, and validate the effectiveness of new markers in comparison to standard genotyping tools. The SSR analysis revealed the presence of 59 genotypes, corresponding to 72 well known cultivars, 13 of them resulting exclusively present in this collection. The new EST-SSRs have shown values of diversity parameters quite similar to those of best standard SSRs. When compared to hundreds of Mediterranean cultivars, the UNIPG olive accessions were splitted into the three main populations (East, Center and West Mediterranean), confirming that the collection has a good representativeness of the entire olive variability. Furthermore, Bayesian analysis, performed on the 59 genotypes of the collection by the use of both sets of markers, have demonstrated their splitting into four clusters, with a well balanced membership obtained by EST respect to standard SSRs. The new OLEST ( Olea expressed sequence tags) SSR markers resulted as effective as the best standard markers. The information obtained from this study represents a high valuable tool for ex situ conservation and management of olive genetic resources, useful to build a common database from worldwide olive cultivar collections

  17. Expressed sequence tags from Atta laevigata and identification of candidate genes for the control of pest leaf-cutting ants

    Directory of Open Access Journals (Sweden)

    Henrique-Silva Flávio

    2011-06-01

    Full Text Available Abstract Background Leafcutters are the highest evolved within Neotropical ants in the tribe Attini and model systems for studying caste formation, labor division and symbiosis with microorganisms. Some species of leafcutters are agricultural pests controlled by chemicals which affect other animals and accumulate in the environment. Aiming to provide genetic basis for the study of leafcutters and for the development of more specific and environmentally friendly methods for the control of pest leafcutters, we generated expressed sequence tag data from Atta laevigata, one of the pest ants with broad geographic distribution in South America. Results The analysis of the expressed sequence tags allowed us to characterize 2,006 unique sequences in Atta laevigata. Sixteen of these genes had a high number of transcripts and are likely positively selected for high level of gene expression, being responsible for three basic biological functions: energy conservation through redox reactions in mitochondria; cytoskeleton and muscle structuring; regulation of gene expression and metabolism. Based on leafcutters lifestyle and reports of genes involved in key processes of other social insects, we identified 146 sequences potential targets for controlling pest leafcutters. The targets are responsible for antixenobiosis, development and longevity, immunity, resistance to pathogens, pheromone function, cell signaling, behavior, polysaccharide metabolism and arginine kynase activity. Conclusion The generation and analysis of expressed sequence tags from Atta laevigata have provided important genetic basis for future studies on the biology of leaf-cutting ants and may contribute to the development of a more specific and environmentally friendly method for the control of agricultural pest leafcutters.

  18. Characteristics of the Lotus japonicus gene repertoire deduced from large-scale expressed sequence tag (EST) analysis.

    Science.gov (United States)

    Asamizu, Erika; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2004-02-01

    To perform a comprehensive analysis of genes expressed in a model legume, Lotus japonicus, a total of 74472 3'-end expressed sequence tags (EST) were generated from cDNA libraries produced from six different organs. Clustering of sequences was performed with an identity criterion of 95% for 50 bases, and a total of 20457 non-redundant sequences, 8503 contigs and 11954 singletons were generated. EST sequence coverage was analyzed by using the annotated L. japonicus genomic sequence and 1093 of the 1889 predicted protein-encoding genes (57.9%) were hit by the EST sequence(s). Gene content was compared to several plant species. Among the 8503 contigs, 471 were identified as sequences conserved only in leguminous species and these included several disease resistance-related genes. This suggested that in legumes, these genes may have evolved specifically to resist pathogen attack. The rate of gene sequence divergence was assessed by comparing similarity level and functional category based on the Gene Ontology (GO) annotation of Arabidopsis genes. This revealed that genes encoding ribosomal proteins, as well as those related to translation, photosynthesis, and cellular structure were more abundantly represented in the highly conserved class, and that genes encoding transcription factors and receptor protein kinases were abundantly represented in the less conserved class. To make the sequence information and the cDNA clones available to the research community, a Web database with useful services was created at http://www.kazusa.or.jp/en/plant/lotus/EST/.

  19. Analyses of expressed sequence tags from the maize foliar pathogen Cercospora zeae-maydis identity novel genes expressed during vegetative infectious, and repoductive growth

    OpenAIRE

    Bluhm, B.H.; Lindquist, E.; Kema, G.H.J.; Goodwin, S.B.; Dunkle, L.D.

    2008-01-01

    The ascomycete fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial losses annually throughout the Western Hemisphere. Despite its impact on maize production, little is known about the regulation of pathogenesis in C. zeae-maydis at the molecular level. The objectives of this study were to generate a collection of expressed sequence tags (ESTs) from C. zeae-maydis and evaluate their expression during vegetative, infectious, and reproductive growth. R...

  20. Analyses of expressed sequence tags from the maize foliar pathogen Cercospora zeae-maydis identify novel genes expressed during vegetative, infectious, and reproductive growth

    OpenAIRE

    Bluhm, Burton H; Dhillon, Braham; Lindquist, Erika A; Kema, Gert HJ; Goodwin, Stephen B; Dunkle, Larry D

    2008-01-01

    Abstract Background The ascomycete fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial losses annually throughout the Western Hemisphere. Despite its impact on maize production, little is known about the regulation of pathogenesis in C. zeae-maydis at the molecular level. The objectives of this study were to generate a collection of expressed sequence tags (ESTs) from C. zeae-maydis and evaluate their expression during vegetative, infectious, and re...

  1. Development, characterization and cross species amplification of polymorphic microsatellite markers from expressed sequence tags of turmeric (Curcuma longa L.).

    Science.gov (United States)

    Siju, S; Dhanya, K; Syamkumar, S; Sasikumar, B; Sheeja, T E; Bhat, A I; Parthasarathy, V A

    2010-02-01

    Expressed sequence tags (ESTs) from turmeric (Curcuma longa L.) were used for the screening of type and frequency of Class I (hypervariable) simple sequence repeats (SSRs). A total of 231 microsatellite repeats were detected from 12,593 EST sequences of turmeric after redundancy elimination. The average density of Class I SSRs accounts to one SSR per 17.96 kb of EST. Mononucleotides were the most abundant class of microsatellite repeat in turmeric ESTs followed by trinucleotides. A robust set of 17 polymorphic EST-SSRs were developed and used for evaluating 20 turmeric accessions. The number of alleles detected ranged from 3 to 8 per loci. The developed markers were also evaluated in 13 related species of C. longa confirming high rate (100%) of cross species transferability. The polymorphic microsatellite markers generated from this study could be used for genetic diversity analysis and resolving the taxonomic confusion prevailing in the genus.

  2. The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

    DEFF Research Database (Denmark)

    Brentani, Helena; Caballero, Otávia L; Camargo, Anamaria A

    2003-01-01

    expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define approximately 23,500 genes, of which only approximately 1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes...... reveals that ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body....... More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants...

  3. Cell-free translational screening of an expression sequence tag library of Clonorchis sinensis for novel antigen discovery.

    Science.gov (United States)

    Kasi, Devi; Catherine, Christy; Lee, Seung-Won; Lee, Kyung-Ho; Kim, Yu Jung; Ro Lee, Myeong; Ju, Jung Won; Kim, Dong-Myung

    2017-05-01

    The rapidly evolving cloning and sequencing technologies have enabled understanding of genomic structure of parasite genomes, opening up new ways of combatting parasite-related diseases. To make the most of the exponentially accumulating genomic data, however, it is crucial to analyze the proteins encoded by these genomic sequences. In this study, we adopted an engineered cell-free protein synthesis system for large-scale expression screening of an expression sequence tag (EST) library of Clonorchis sinensis to identify potential antigens that can be used for diagnosis and treatment of clonorchiasis. To allow high-throughput expression and identification of individual genes comprising the library, a cell-free synthesis reaction was designed such that both the template DNA and the expressed proteins were co-immobilized on the same microbeads, leading to microbead-based linkage of the genotype and phenotype. This reaction configuration allowed streamlined expression, recovery, and analysis of proteins. This approach enabled us to identify 21 antigenic proteins. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:832-837, 2017. © 2017 American Institute of Chemical Engineers.

  4. Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

    Science.gov (United States)

    Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...

  5. Expressed sequences tags of the anther smut fungus, Microbotryum violaceum, identify mating and pathogenicity genes

    Directory of Open Access Journals (Sweden)

    Devier Benjamin

    2007-08-01

    Full Text Available Abstract Background The basidiomycete fungus Microbotryum violaceum is responsible for the anther-smut disease in many plants of the Caryophyllaceae family and is a model in genetics and evolutionary biology. Infection is initiated by dikaryotic hyphae produced after the conjugation of two haploid sporidia of opposite mating type. This study describes M. violaceum ESTs corresponding to nuclear genes expressed during conjugation and early hyphal production. Results A normalized cDNA library generated 24,128 sequences, which were assembled into 7,765 unique genes; 25.2% of them displayed significant similarity to annotated proteins from other organisms, 74.3% a weak similarity to the same set of known proteins, and 0.5% were orphans. We identified putative pheromone receptors and genes that in other fungi are involved in the mating process. We also identified many sequences similar to genes known to be involved in pathogenicity in other fungi. The M. violaceum EST database, MICROBASE, is available on the Web and provides access to the sequences, assembled contigs, annotations and programs to compare similarities against MICROBASE. Conclusion This study provides a basis for cloning the mating type locus, for further investigation of pathogenicity genes in the anther smut fungi, and for comparative genomics.

  6. Analysis of expressed sequence tags from a NaHCO(3)-treated alkali-tolerant plant, Chloris virgata.

    Science.gov (United States)

    Nishiuchi, Shunsaku; Fujihara, Kazumasa; Liu, Shenkui; Takano, Tetsuo

    2010-04-01

    Chloris virgata Swartz (C. virgata) is a gramineous wild plant that can survive in saline-alkali areas in northeast China. To examine the tolerance mechanisms of C. virgata, we constructed a cDNA library from whole plants of C. virgata that had been treated with 100 mM NaHCO(3) for 24 h and sequenced 3168 randomly selected clones. Most (2590) of the expressed sequence tags (ESTs) showed significant similarity to sequences in the NCBI database. Of the 2590 genes, 1893 were unique. Gene Ontology (GO) Slim annotations were obtained for 1081 ESTs by BLAST2GO and it was found that 75 genes of them were annotated with GO terms "response to stress", "response to abiotic stimulus", and "response to biotic stimulus", indicating these genes were likely to function in tolerance mechanism of C. virgata. In a separate experiment, 24 genes that are known from previous studies to be associated with abiotic stress tolerance were further examined by real-time RT-PCR to see how their expressions were affected by NaHCO(3) stress. NaHCO(3) treatment up-regulated the expressions of pathogenesis-related gene (DC998527), Win1 precursor gene (DC998617), catalase gene (DC999385), ribosome inactivating protein 1 (DC999555), Na(+)/H(+) antiporter gene (DC998043), and two-component regulator gene (DC998236). Copyright 2010 Elsevier Masson SAS. All rights reserved.

  7. Comparative analysis of differentially expressed sequence tags of sweet orange and mandarin infected with Xylella fastidiosa

    Directory of Open Access Journals (Sweden)

    Alessandra A. de Souza

    2007-01-01

    Full Text Available The Citrus ESTs Sequencing Project (CitEST conducted at Centro APTA Citros Sylvio Moreira/IAC has identified and catalogued ESTs representing a set of citrus genes expressed under relevant stress responses, including diseases such as citrus variegated chlorosis (CVC, caused by Xylella fastidiosa. All sweet orange (Citrus sinensis L. Osb. varieties are susceptible to X. fastidiosa. On the other hand, mandarins (C. reticulata Blanco are considered tolerant or resistant to the disease, although the bacterium can be sporadically detected within the trees, but no disease symptoms or economic losses are observed. To study their genetic responses to the presence of X. fastidiosa, we have compared EST libraries of leaf tissue of sweet orange Pêra IAC (highly susceptible cultivar to X. fastidiosa and mandarin ‘Ponkan’ (tolerant artificially infected with the bacterium. Using an in silico differential display, 172 genes were found to be significantly differentially expressed in such conditions. Sweet orange presented an increase in expression of photosynthesis related genes that could reveal a strategy to counterbalance a possible lower photosynthetic activity resulting from early effects of the bacterial colonization in affected plants. On the other hand, mandarin showed an active multi-component defense response against the bacterium similar to the non-host resistance pattern.

  8. Identification of Anhydrobiosis-related Genes from an Expressed Sequence Tag Database in the Cryptobiotic Midge Polypedilum vanderplanki (Diptera; Chironomidae)*

    Science.gov (United States)

    Cornette, Richard; Kanamori, Yasushi; Watanabe, Masahiko; Nakahara, Yuichi; Gusev, Oleg; Mitsumasu, Kanako; Kadono-Okuda, Keiko; Shimomura, Michihiko; Mita, Kazuei; Kikawada, Takahiro; Okuda, Takashi

    2010-01-01

    Some organisms are able to survive the loss of almost all their body water content, entering a latent state known as anhydrobiosis. The sleeping chironomid (Polypedilum vanderplanki) lives in the semi-arid regions of Africa, and its larvae can survive desiccation in an anhydrobiotic form during the dry season. To unveil the molecular mechanisms of this resistance to desiccation, an anhydrobiosis-related Expressed Sequence Tag (EST) database was obtained from the sequences of three cDNA libraries constructed from P. vanderplanki larvae after 0, 12, and 36 h of desiccation. The database contained 15,056 ESTs distributed into 4,807 UniGene clusters. ESTs were classified according to gene ontology categories, and putative expression patterns were deduced for all clusters on the basis of the number of clones in each library; expression patterns were confirmed by real-time PCR for selected genes. Among up-regulated genes, antioxidants, late embryogenesis abundant (LEA) proteins, and heat shock proteins (Hsps) were identified as important groups for anhydrobiosis. Genes related to trehalose metabolism and various transporters were also strongly induced by desiccation. Those results suggest that the oxidative stress response plays a central role in successful anhydrobiosis. Similarly, protein denaturation and aggregation may be prevented by marked up-regulation of Hsps and the anhydrobiosis-specific LEA proteins. A third major feature is the predicted increase in trehalose synthesis and in the expression of various transporter proteins allowing the distribution of trehalose and other solutes to all tissues. PMID:20833722

  9. In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

    Science.gov (United States)

    Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

    2011-01-01

    To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533

  10. Flavonoid Biosynthesis Genes Putatively Identified in the Aromatic Plant Polygonum minus via Expressed Sequences Tag (EST Analysis

    Directory of Open Access Journals (Sweden)

    Zamri Zainal

    2012-02-01

    Full Text Available P. minus is an aromatic plant, the leaf of which is widely used as a food additive and in the perfume industry. The leaf also accumulates secondary metabolites that act as active ingredients such as flavonoid. Due to limited genomic and transcriptomic data, the biosynthetic pathway of flavonoids is currently unclear. Identification of candidate genes involved in the flavonoid biosynthetic pathway will significantly contribute to understanding the biosynthesis of active compounds. We have constructed a standard cDNA library from P. minus leaves, and two normalized full-length enriched cDNA libraries were constructed from stem and root organs in order to create a gene resource for the biosynthesis of secondary metabolites, especially flavonoid biosynthesis. Thus, large‑scale sequencing of P. minus cDNA libraries identified 4196 expressed sequences tags (ESTs which were deposited in dbEST in the National Center of Biotechnology Information (NCBI. From the three constructed cDNA libraries, 11 ESTs encoding seven genes were mapped to the flavonoid biosynthetic pathway. Finally, three flavonoid biosynthetic pathway-related ESTs chalcone synthase, CHS (JG745304, flavonol synthase, FLS (JG705819 and leucoanthocyanidin dioxygenase, LDOX (JG745247 were selected for further examination by quantitative RT-PCR (qRT-PCR in different P. minus organs. Expression was detected in leaf, stem and root. Gene expression studies have been initiated in order to better understand the underlying physiological processes.

  11. In silico differential display of defense-related expressed sequence tags from sugarcane tissues infected with diazotrophic endophytes

    Directory of Open Access Journals (Sweden)

    Lambais Marcio R.

    2001-01-01

    Full Text Available The expression patterns of 277 sugarcane expressed sequence tags (EST-contigs encoding putative defense-related (DR proteins were evaluated using the Sugarcane EST database. The DR proteins evaluated included chitinases, beta-1,3-glucanases, phenylalanine ammonia-lyases, chalcone synthases, chalcone isomerases, isoflavone reductases, hydroxyproline-rich glycoproteins, proline-rich glycoproteins, peroxidases, catalases, superoxide dismutases, WRKY-like transcription factors and proteins involved in cell death control. Putative sugarcane WRKY proteins were compared and their phylogenetic relationships determined. A hierarchical clustering approach was used to identify DR ESTs with similar expression profiles in representative cDNA libraries. To identify DR ESTs differentially expressed in sugarcane tissues infected with Gluconacetobacter diazotrophicus or Herbaspirillum rubrisubalbicans, 179 putative DR EST-contigs expressed in non-infected tissues (leaves and roots and/or infected tissues were selected and arrayed by similarity of their expression profiles. Changes in the expression levels of 124 putative DR EST-contigs, expressed in non-infected tissues, were evaluated in infected tissues. Approximately 42% of these EST-contigs showed no expression in infected tissues, whereas 15% and 3% showed more than 2-fold suppression in tissues infected with G. diazotrophicus or H. rubrisubalbicans, respectively. Approximately 14 and 8% of the DR EST-contigs evaluated showed more than 2-fold induction in tissues infected with G. diazotrophicus or H. rubrisubalbicans, respectively. The differential expression of clusters of DR genes may be important in the establishment of a compatible interaction between sugarcane and diazotrophic endophytes. It is suggested that the hierarchical clustering approach can be used on a genome-wide scale to identify genes likely involved in controlling plant-microorganism interactions.

  12. Development of expressed sequence tag-simple sequence repeat markers for genetic characterization and population structure analysis of Praxelis clematidea (Asteraceae).

    Science.gov (United States)

    Wang, Q Z; Huang, M; Downie, S R; Chen, Z X

    2016-05-23

    Invasive plants tend to spread aggressively in new habitats and an understanding of their genetic diversity and population structure is useful for their management. In this study, expressed sequence tag-simple sequence repeat (EST-SSR) markers were developed for the invasive plant species Praxelis clematidea (Asteraceae) from 5548 Stevia rebaudiana (Asteraceae) expressed sequence tags (ESTs). A total of 133 microsatellite-containing ESTs (2.4%) were identified, of which 56 (42.1%) were hexanucleotide repeat motifs and 50 (37.6%) were trinucleotide repeat motifs. Of the 24 primer pairs designed from these 133 ESTs, 7 (29.2%) resulted in significant polymorphisms. The number of alleles per locus ranged from 5 to 9. The relatively high genetic diversity (H = 0.2667, I = 0.4212, and P = 100%) of P. clematidea was related to high gene flow (Nm = 1.4996) among populations. The coefficient of population differentiation (GST = 0.2500) indicated that most genetic variation occurred within populations. A Mantel test suggested that there was significant correlation between genetic distance and geographical distribution (r = 0.3192, P = 0.012). These results further support the transferability of EST-SSR markers between closely related genera of the same family.

  13. Analysis of expressed sequence tags from Citrus sinensis L. Osbeck infected with Xylella fastidiosa

    Directory of Open Access Journals (Sweden)

    Alessandra A. de Souza

    2007-01-01

    Full Text Available In order to understand the genetic responses resulting from physiological changes that occur in plants displaying citrus variegated chlorosis (CVC symptoms, we adopted a strategy of comparing two EST libraries from sweet orange [Citrus sinensis (L. Osbeck]. One of them was prepared with plants showing typical CVC symptoms caused by Xylella fastidiosa and the other with non-inoculated plants. We obtained 15,944 ESTs by sequencing the two cDNA libraries. Using an in silico hybridization strategy, 37 genes were found to have significant variation at the transcriptional level. Within this subset, 21 were up-regulated and 16 were down-regulated in plants with CVC. The main functional categories of the down-regulated transcripts in plants with CVC were associated with metabolism, protein modification, energy and transport facilitation. The majority of the up-regulated transcripts were associated with metabolism and defense response. Some transcripts associated with adaptation to stress conditions were up-regulated in plants with CVC and could explain why plants remain alive even under severe water and nutritional stress. Others of the up-regulated transcripts are related to defense response suggesting that sweet orange plants activate their defense machinery. The genes associated with stress response might be expressed as part of a secondary response related to physiological alterations caused by the infection.

  14. SSH analysis of endosperm transcripts and characterization of heat stress regulated expressed sequence tags in bread wheat

    Directory of Open Access Journals (Sweden)

    Suneha Goswami

    2016-08-01

    Full Text Available Heat stress is one of the major problems in agriculturally important cereal crops, especially wheat. Here, we have constructed a subtracted cDNA library from the endosperm of HS-treated (42°C for 2 h wheat cv. HD2985 by suppression subtractive hybridization (SSH. We identified ~550 recombinant clones ranging from 200 to 500 bp with an average size of 300 bp. Sanger’s sequencing was performed with 205 positive clones to generate the differentially expressed sequence tags (ESTs. Most of the ESTs were observed to be localized on the long arm of chromosome 2A and associated with heat stress tolerance and metabolic pathways. Identified ESTs were BLAST search using Ensemble, TriFLD and TIGR databases and the predicted CDS were translated and aligned with the protein sequences available in pfam and InterProScan 5 databases to predict the differentially expressed proteins (DEPs. We observed eight different types of post-translational modifications (PTMs in the DEPs corresponds to the cloned ESTs—147 sites with phosphorylation, 21 sites with sumoylation, 237 with palmitoylation, 96 sites with S-nitrosylation, 3066 calpain cleavage sites, and 103 tyrosine nitration sites, predicted to sense the heat stress and regulate the expression of stress genes. Twelve DEPs were observed to have transmembrane helixes (TMH in their structure, predicted to play the role of sensors of HS. Quantitative Real-Time PCR of randomly selected ESTs showed very high relative expression of HSP17 under HS; up-regulation was observed more in wheat cv. HD2985 (thermotolerant, as compared to HD2329 (thermosusceptible during grain-filling. The abundance of transcripts was further validated through northern blot analysis. The ESTs and their corresponding DEPs can be used as molecular marker for screening or targeted precision breeding program. PTMs identified in the DEPs can be used to elucidate the thermotolerance mechanism of wheat – a novel step towards the development of

  15. Comparative analysis of expressed sequence tags from three castes and two life stages of the termite Reticulitermes flavipes

    Directory of Open Access Journals (Sweden)

    Steller Matthew M

    2010-08-01

    Full Text Available Abstract Background Termites (Isoptera are eusocial insects whose colonies consist of morphologically and behaviorally specialized castes of sterile workers and soldiers, and reproductive alates. Previous studies on eusocial insects have indicated that caste differentiation and behavior are underlain by differential gene expression. Although much is known about gene expression in the honey bee, Apis mellifera, termites remain relatively understudied in this regard. Therefore, our objective was to assemble an expressed sequence tag (EST data base for the eastern subterranean termite, Reticulitermes flavipes, for future gene expression studies. Results Soldier, worker, and alate caste and two larval cDNA libraries were constructed, and approximately 15,000 randomly chosen clones were sequenced to compile an EST data base. Putative gene functions were assigned based on a BLASTX Swissprot search. Categorical in silico expression patterns for each library were compared using the R-statistic. A significant proportion of the ESTs of each caste and life stages had no significant similarity to those in existing data bases. All cDNA libraries, including those of non-reproductive worker and soldier castes, contained sequences with putative reproductive functions. Genes that showed a potential expression bias among castes included a putative antibacterial humoral response and translation elongation protein in soldiers and a chemosensory protein in alates. Conclusions We have expanded upon the available sequences for R. flavipes and utilized an in silico method to compare gene expression in different castes of an eusocial insect. The in silico analysis allowed us to identify several genes which may be differentially expressed and involved in caste differences. These include a gene overrepresented in the alate cDNA library with a predicted function of neurotransmitter secretion or cholesterol absorption and a gene predicted to be involved in protein

  16. ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs

    Directory of Open Access Journals (Sweden)

    Carter Kikia

    2007-05-01

    Full Text Available Abstract Background With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. Description ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software – WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails, while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available

  17. ConiferEST: an integrated bioinformatics system for data reprocessing and mining of conifer expressed sequence tags (ESTs).

    Science.gov (United States)

    Liang, Chun; Wang, Gang; Liu, Lin; Ji, Guoli; Fang, Lin; Liu, Yuansheng; Carter, Kikia; Webb, Jason S; Dean, Jeffrey F D

    2007-05-29

    With the advent of low-cost, high-throughput sequencing, the amount of public domain Expressed Sequence Tag (EST) sequence data available for both model and non-model organism is growing exponentially. While these data are widely used for characterizing various genomes, they also present a serious challenge for data quality control and validation due to their inherent deficiencies, particularly for species without genome sequences. ConiferEST is an integrated system for data reprocessing, visualization and mining of conifer ESTs. In its current release, Build 1.0, it houses 172,229 loblolly pine EST sequence reads, which were obtained from reprocessing raw DNA sequencer traces using our software--WebTraceMiner. The trace files were downloaded from NCBI Trace Archive. ConiferEST provides biologists unique, easy-to-use data visualization and mining tools for a variety of putative sequence features including cloning vector segments, adapter sequences, restriction endonuclease recognition sites, polyA and polyT runs, and their corresponding Phred quality values. Based on these putative features, verified sequence features such as 3' and/or 5' termini of cDNA inserts in either sense or non-sense strand have been identified in-silico. Interestingly, only 30.03% of the designated 3' ESTs were found to have an authenticated 5' terminus in the non-sense strand (i.e., polyT tails), while fewer than 5.34% of the designated 5' ESTs had a verified 5' terminus in the sense strand. Such previously ignored features provide valuable insight for data quality control and validation of error-prone ESTs, as well as the ability to identify novel functional motifs embedded in large EST datasets. We found that "double-termini adapters" were effective indicators of potential EST chimeras. For all sequences with in-silico verified termini/terminus, we used InterProScan to assign protein domain signatures, results of which are available for in-depth exploration using our biologist

  18. Analysis and functional annotation of expressed sequence tags (ESTs from multiple tissues of oil palm (Elaeis guineensis Jacq.

    Directory of Open Access Journals (Sweden)

    Lee Weng-Wah

    2007-10-01

    Full Text Available Abstract Background Oil palm is the second largest source of edible oil which contributes to approximately 20% of the world's production of oils and fats. In order to understand the molecular biology involved in in vitro propagation, flowering, efficient utilization of nitrogen sources and root diseases, we have initiated an expressed sequence tag (EST analysis on oil palm. Results In this study, six cDNA libraries from oil palm zygotic embryos, suspension cells, shoot apical meristems, young flowers, mature flowers and roots, were constructed. We have generated a total of 14537 expressed sequence tags (ESTs from these libraries, from which 6464 tentative unique contigs (TUCs and 2129 singletons were obtained. Approximately 6008 of these tentative unique genes (TUGs have significant matches to the non-redundant protein database, from which 2361 were assigned to one or more Gene Ontology categories. Predominant transcripts and differentially expressed genes were identified in multiple oil palm tissues. Homologues of genes involved in many aspects of flower development were also identified among the EST collection, such as CONSTANS-like, AGAMOUS-like (AGL2, AGL20, LFY-like, SQUAMOSA, SQUAMOSA binding protein (SBP etc. Majority of them are the first representatives in oil palm, providing opportunities to explore the cause of epigenetic homeotic flowering abnormality in oil palm, given the importance of flowering in fruit production. The transcript levels of two flowering-related genes, EgSBP and EgSEP were analysed in the flower tissues of various developmental stages. Gene homologues for enzymes involved in oil biosynthesis, utilization of nitrogen sources, and scavenging of oxygen radicals, were also uncovered among the oil palm ESTs. Conclusion The EST sequences generated will allow comparative genomic studies between oil palm and other monocotyledonous and dicotyledonous plants, development of gene-targeted markers for the reference genetic map

  19. Identification and characterization of 43 microsatellite markers derived from expressed sequence tags of the sea cucumber ( Apostichopus japonicus)

    Science.gov (United States)

    Jiang, Qun; Li, Qi; Yu, Hong; Kong, Lingfeng

    2011-06-01

    The sea cucumber Apostichopus japonicus is a commercially and ecologically important species in China. A total of 3056 potential unigenes were generated after assembling 7597 A. japonicus expressed sequence tags (ESTs) downloaded from Gen-Bank. Two hundred and fifty microsatellite-containing ESTs (8.18%) and 299 simple sequence repeats (SSRs) were detected. The average density of SSRs was 1 per 7.403 kb of EST after redundancy elimination. Di-nucleotide repeat motifs appeared to be the most abundant type with a percentage of 69.90%. Of the 126 primer pairs designed, 90 amplified the expected products and 43 showed polymorphism in 30 individuals tested. The number of alleles per locus ranged from 2 to 26 with an average of 7.0 alleles, and the observed and expected heterozygosities varied from 0.067 to 1.000 and from 0.066 to 0.959, respectively. These new EST-derived microsatellite markers would provide sufficient polymorphism for population genetic studies and genome mapping of this sea cucumber species.

  20. An Expressed Sequence Tag Analysis of the Intertidal Brown Seaweeds Fucus serratus (L.) and F. vesiculosus (L.) (Heterokontophyta, Phaeophyceae) in Response to Abiotic Stressors

    NARCIS (Netherlands)

    Pearson, Gareth A.; Hoarau, Galice; Lago-Leston, Asuncion; Coyer, James A.; Kube, Michael; Reinhardt, Richard; Henckel, Kolja; Serrao, Ester T. A.; Corre, Erwan; Olsen, Jeanine L.

    In order to aid gene discovery and uncover genes responding to abiotic stressors in stress-tolerant brown algae of the genus Fucus, expressed sequence tags (ESTs) were studied in two species, Fucus serratus and Fucus vesiculosus. Clustering of over 12,000 ESTs from three libraries for heat

  1. Development and evaluation of expressed sequence tag-derived microsatellite markers for hop genotyping

    Czech Academy of Sciences Publication Activity Database

    Patzak, J.; Matoušek, Jaroslav

    2011-01-01

    Roč. 55, č. 4 (2011), s. 761-765 ISSN 0006-3134 R&D Projects: GA ČR GA521/08/0740; GA MZe QH81052 Institutional research plan: CEZ:AV0Z50510513 Keywords : cluster analysis * dendrogram * Humulus lupulus * simple sequence repeat Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 1.974, year: 2011

  2. An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

    Science.gov (United States)

    Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

    2004-01-01

    Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051

  3. Differentially displayed expressed sequence tags in Melipona scutellaris (Hymenoptera, Apidae, Meliponini) development.

    Science.gov (United States)

    Santana, Flávia A; Nunes, Francis M F; Vieira, Carlos U; Machado, Maria Alice M S; Kerr, Warwick E; Silva, Wilson A; Bonetti, Ana Maria

    2006-03-01

    We have compared gene expression, using the Differential Display Reverse Transcriptase-Polymerase Chain Reaction (DDRT-PCR) technique, by means of mRNA profile in Melipona scutellaris during ontogenetic postembryonic development, in adult worker, and in both Natural and Juvenile Hormone III-induced adult queen. Six, out of the nine ESTs described here, presented differentially expressed in the phases L1 or L2, or even in both of them, suggesting that key mechanisms to the development of Melipona scutellaris are regulated in these stages. The combination HT11G-AP05 revealed in L1 and L2 a product which matches to thioredoxin reductase protein domain in the Clostridium sporogenes, an important protein during cellular oxidoreduction processes. This study represents the first molecular evidence of differential gene expression profiles toward a description of the genetic developmental traits in the genus Melipona.

  4. Differentially displayed expressed sequence tags in Melipona scutellaris (Hymenoptera, Apidae, Meliponini development

    Directory of Open Access Journals (Sweden)

    Santana Flávia A.

    2006-01-01

    Full Text Available We have compared gene expression, using the Differential Display Reverse Transcriptase - Polymerase Chain Reaction (DDRT-PCR technique, by means of mRNA profile in Melipona scutellaris during ontogenetic postembryonic development, in adult worker, and in both Natural and Juvenile Hormone III-induced adult queen. Six, out of the nine ESTs described here, presented differentially expressed in the phases L1 or L2, or even in both of them, suggesting that key mechanisms to the development of Melipona scutellaris are regulated in these stages. The combination HT11G-AP05 revealed in L1 and L2 a product which matches to thioredoxin reductase protein domain in the Clostridium sporogenes, an important protein during cellular oxidoreduction processes. This study represents the first molecular evidence of differential gene expression profiles toward a description of the genetic developmental traits in the genus Melipona.

  5. Expressed sequence tag analysis of adult human optic nerve for NEIBank: Identification of cell type and tissue markers

    Directory of Open Access Journals (Sweden)

    Peterson Katherine

    2009-09-01

    Full Text Available Abstract Background The optic nerve is a pure white matter central nervous system (CNS tract with an isolated blood supply, and is widely used in physiological studies of white matter response to various insults. We examined the gene expression profile of human optic nerve (ON and, through the NEIBANK online resource, to provide a resource of sequenced verified cDNA clones. An un-normalized cDNA library was constructed from pooled human ON tissues and was used in expressed sequence tag (EST analysis. Location of an abundant oligodendrocyte marker was examined by immunofluorescence. Quantitative real time polymerase chain reaction (qRT-PCR and Western analysis were used to compare levels of expression for key calcium channel protein genes and protein product in primate and rodent ON. Results Our analyses revealed a profile similar in many respects to other white matter related tissues, but significantly different from previously available ON cDNA libraries. The previous libraries were found to include specific markers for other eye tissues, suggesting contamination. Immune/inflammatory markers were abundant in the new ON library. The oligodendrocyte marker QKI was abundant at the EST level. Immunofluorescence revealed that this protein is a useful oligodendrocyte cell-type marker in rodent and primate ONs. L-type calcium channel EST abundance was found to be particularly low. A qRT-PCR-based comparative mammalian species analysis reveals that L-type calcium channel expression levels are significantly lower in primate than in rodent ON, which may help account for the class-specific difference in responsiveness to calcium channel blocking agents. Several known eye disease genes are abundantly expressed in ON. Many genes associated with normal axonal function, mRNAs associated with axonal transport, inflammation and neuroprotection are observed. Conclusion We conclude that the new cDNA library is a faithful representation of human ON and EST data

  6. An expressed sequence tag (EST) library for Drosophila serrata, a model system for sexual selection and climatic adaptation studies.

    Science.gov (United States)

    Frentiu, Francesca D; Adamski, Marcin; McGraw, Elizabeth A; Blows, Mark W; Chenoweth, Stephen F

    2009-01-21

    The native Australian fly Drosophila serrata belongs to the highly speciose montium subgroup of the melanogaster species group. It has recently emerged as an excellent model system with which to address a number of important questions, including the evolution of traits under sexual selection and traits involved in climatic adaptation along latitudinal gradients. Understanding the molecular genetic basis of such traits has been limited by a lack of genomic resources for this species. Here, we present the first expressed sequence tag (EST) collection for D. serrata that will enable the identification of genes underlying sexually-selected phenotypes and physiological responses to environmental change and may help resolve controversial phylogenetic relationships within the montium subgroup. A normalized cDNA library was constructed from whole fly bodies at several developmental stages, including larvae and adults. Assembly of 11,616 clones sequenced from the 3' end allowed us to identify 6,607 unique contigs, of which at least 90% encoded peptides. Partial transcripts were discovered from a variety of genes of evolutionary interest by BLASTing contigs against the 12 Drosophila genomes currently sequenced. By incorporating into the cDNA library multiple individuals from populations spanning a large portion of the geographical range of D. serrata, we were able to identify 11,057 putative single nucleotide polymorphisms (SNPs), with 278 different contigs having at least one "double hit" SNP that is highly likely to be a real polymorphism. At least 394 EST-associated microsatellite markers, representing 355 different contigs, were also found, providing an additional set of genetic markers. The assembled EST library is available online at http://www.chenowethlab.org/serrata/index.cgi. We have provided the first gene collection and largest set of polymorphic genetic markers, to date, for the fly D. serrata. The EST collection will provide much needed genomic resources for

  7. Rediscovering medicinal plants' potential with OMICS: microsatellite survey in expressed sequence tags of eleven traditional plants with potent antidiabetic properties.

    Science.gov (United States)

    Sahu, Jagajjit; Sen, Priyabrata; Choudhury, Manabendra Dutta; Dehury, Budheswar; Barooah, Madhumita; Modi, Mahendra Kumar; Talukdar, Anupam Das

    2014-05-01

    Herbal medicines and traditionally used medicinal plants present an untapped potential for novel molecular target discovery using systems science and OMICS biotechnology driven strategies. Since up to 40% of the world's poor people have no access to government health services, traditional and folk medicines are often the only therapeutics available to them. In this vein, North East (NE) India is recognized for its rich bioresources. As part of the Indo-Burma hotspot, it is regarded as an epicenter of biodiversity for several plants having myriad traditional uses, including medicinal use. However, the improvement of these valuable bioresources through molecular breeding strategies, for example, using genic microsatellites or Simple Sequence Repeats (SSRs) or Expressed Sequence Tags (ESTs)-derived SSRs has not been fully utilized in large scale to date. In this study, we identified a total of 47,700 microsatellites from 109,609 ESTs of 11 medicinal plants (pineapple, papaya, noyontara, bitter orange, bermuda brass, ratalu, barbados nut, mango, mulberry, lotus, and guduchi) having proven antidiabetic properties. A total of 58,159 primer pairs were designed for the non-redundant 8060 SSR-positive ESTs and putative functions were assigned to 4483 unique contigs. Among the identified microsatellites, excluding mononucleotide repeats, di-/trinucleotides are predominant, among which repeat motifs of AG/CT and AAG/CTT were most abundant. Similarity search of SSR containing ESTs and antidiabetic gene sequences revealed 11 microsatellites linked to antidiabetic genes in five plants. GO term enrichment analysis revealed a total of 80 enriched GO terms widely distributed in 53 biological processes, 17 molecular functions, and 10 cellular components associated with the 11 markers. The present study therefore provides concrete insights into the frequency and distribution of SSRs in important medicinal resources. The microsatellite markers reported here markedly add to the genetic

  8. Construction of cDNA library and preliminary analysis of expressed sequence tags from Siberian tiger

    Science.gov (United States)

    Liu, Chang-Qing; Lu, Tao-Feng; Feng, Bao-Gang; Liu, Dan; Guan, Wei-Jun; Ma, Yue-Hui

    2010-01-01

    In this study we successfully constructed a full-length cDNA library from Siberian tiger, Panthera tigris altaica, the most well-known wild Animal. Total RNA was extracted from cultured Siberian tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.30×106 pfu/ml and 1.62×109 pfu/ml respectively. The proportion of recombinants from unamplified library was 90.5% and average length of exogenous inserts was 1.13 kb. A total of 282 individual ESTs with sizes ranging from 328 to 1,142bps were then analyzed the BLASTX score revealed that 53.9% of the sequences were classified as strong match, 38.6% as nominal and 7.4% as weak match. 28.0% of them were found to be related to enzyme/catalytic protein, 20.9% ESTs to metabolism, 13.1% ESTs to transport, 12.1% ESTs to signal transducer/cell communication, 9.9% ESTs to structure protein, 3.9% ESTs to immunity protein/defense metabolism, 3.2% ESTs to cell cycle, and 8.9 ESTs classified as novel genes. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genomic research of Siberian tigers. PMID:20941376

  9. Chasing migration genes: a brain expressed sequence tag resource for summer and migratory monarch butterflies (Danaus plexippus.

    Directory of Open Access Journals (Sweden)

    Haisun Zhu

    2008-01-01

    Full Text Available North American monarch butterflies (Danaus plexippus undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents approximately 52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our "snap-shot" analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive

  10. Exploring the host parasitism of the migratory plant-parasitic nematode Ditylenchus destuctor by expressed sequence tags analysis.

    Directory of Open Access Journals (Sweden)

    Huan Peng

    Full Text Available The potato rot nematode, Ditylenchus destructor, is a very destructive nematode pest on many agriculturally important crops worldwide, but the molecular characterization of its parasitism of plant has been limited. The effectors involved in nematode parasitism of plant for several sedentary endo-parasitic nematodes such as Heterodera glycines, Globodera rostochiensis and Meloidogyne incognita have been identified and extensively studied over the past two decades. Ditylenchus destructor, as a migratory plant parasitic nematode, has different feeding behavior, life cycle and host response. Comparing the transcriptome and parasitome among different types of plant-parasitic nematodes is the way to understand more fully the parasitic mechanism of plant nematodes. We undertook the approach of sequencing expressed sequence tags (ESTs derived from a mixed stage cDNA library of D. destructor. This is the first study of D. destructor ESTs. A total of 9800 ESTs were grouped into 5008 clusters including 3606 singletons and 1402 multi-member contigs, representing a catalog of D. destructor genes. Implementing a bioinformatics' workflow, we found 1391 clusters have no match in the available gene database; 31 clusters only have similarities to genes identified from D. africanus, the most closely related species to D. destructor; 1991 clusters were annotated using Gene Ontology (GO; 1550 clusters were assigned enzyme commission (EC numbers; and 1211 clusters were mapped to 181 KEGG biochemical pathways. 22 ESTs had similarities to reported nematode effectors. Interestedly, most of the effectors identified in this study are involved in host cell wall degradation or modification, such as 1,4-beta-glucanse, 1,3-beta-glucanse, pectate lyase, chitinases and expansin, or host defense suppression such as calreticulin, annexin and venom allergen-like protein. This result implies that the migratory plant-parasitic nematode D. destructor secrets similar effectors to

  11. Chasing Migration Genes: A Brain Expressed Sequence Tag Resource for Summer and Migratory Monarch Butterflies (Danaus plexippus)

    Science.gov (United States)

    Zhu, Haisun; Casselman, Amy; Reppert, Steven M.

    2008-01-01

    North American monarch butterflies (Danaus plexippus) undergo a spectacular fall migration. In contrast to summer butterflies, migrants are juvenile hormone (JH) deficient, which leads to reproductive diapause and increased longevity. Migrants also utilize time-compensated sun compass orientation to help them navigate to their overwintering grounds. Here, we describe a brain expressed sequence tag (EST) resource to identify genes involved in migratory behaviors. A brain EST library was constructed from summer and migrating butterflies. Of 9,484 unique sequences, 6068 had positive hits with the non-redundant protein database; the EST database likely represents ∼52% of the gene-encoding potential of the monarch genome. The brain transcriptome was cataloged using Gene Ontology and compared to Drosophila. Monarch genes were well represented, including those implicated in behavior. Three genes involved in increased JH activity (allatotropin, juvenile hormone acid methyltransfersase, and takeout) were upregulated in summer butterflies, compared to migrants. The locomotion-relevant turtle gene was marginally upregulated in migrants, while the foraging and single-minded genes were not differentially regulated. Many of the genes important for the monarch circadian clock mechanism (involved in sun compass orientation) were in the EST resource, including the newly identified cryptochrome 2. The EST database also revealed a novel Na+/K+ ATPase allele predicted to be more resistant to the toxic effects of milkweed than that reported previously. Potential genetic markers were identified from 3,486 EST contigs and included 1599 double-hit single nucleotide polymorphisms (SNPs) and 98 microsatellite polymorphisms. These data provide a template of the brain transcriptome for the monarch butterfly. Our “snap-shot” analysis of the differential regulation of candidate genes between summer and migratory butterflies suggests that unbiased, comprehensive transcriptional profiling

  12. Analyses of expressed sequence tags from the maize foliar pathogen Cercospora zeae-maydis identify novel genes expressed during vegetative, infectious, and reproductive growth.

    Science.gov (United States)

    Bluhm, Burton H; Dhillon, Braham; Lindquist, Erika A; Kema, Gert Hj; Goodwin, Stephen B; Dunkle, Larry D

    2008-11-04

    The ascomycete fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial losses annually throughout the Western Hemisphere. Despite its impact on maize production, little is known about the regulation of pathogenesis in C. zeae-maydis at the molecular level. The objectives of this study were to generate a collection of expressed sequence tags (ESTs) from C. zeae-maydis and evaluate their expression during vegetative, infectious, and reproductive growth. A total of 27,551 ESTs was obtained from five cDNA libraries constructed from vegetative and sporulating cultures of C. zeae-maydis. The ESTs, grouped into 4088 clusters and 531 singlets, represented 4619 putative unique genes. Of these, 36% encoded proteins similar (E value zeae-maydis, providing specific targets for characterization by molecular genetics and functional genomics. The EST data establish a foundation for future studies in evolutionary and comparative genomics among species of Cercospora and other groups of plant pathogenic fungi.

  13. Generation and analysis of expressed sequence tags from six developing xylem libraries in Pinus radiata D. Don

    Directory of Open Access Journals (Sweden)

    Dillon Shannon K

    2009-01-01

    Full Text Available Abstract Background Wood is a major renewable natural resource for the timber, fibre and bioenergy industry. Pinus radiata D. Don is the most important commercial plantation tree species in Australia and several other countries; however, genomic resources for this species are very limited in public databases. Our primary objective was to sequence a large number of expressed sequence tags (ESTs from genes involved in wood formation in radiata pine. Results Six developing xylem cDNA libraries were constructed from earlywood and latewood tissues sampled at juvenile (7 yrs, transition (11 yrs and mature (30 yrs ages, respectively. These xylem tissues represent six typical development stages in a rotation period of radiata pine. A total of 6,389 high quality ESTs were collected from 5,952 cDNA clones. Assembly of 5,952 ESTs from 5' end sequences generated 3,304 unigenes including 952 contigs and 2,352 singletons. About 97.0% of the 5,952 ESTs and 96.1% of the unigenes have matches in the UniProt and TIGR databases. Of the 3,174 unigenes with matches, 42.9% were not assigned GO (Gene Ontology terms and their functions are unknown or unclassified. More than half (52.1% of the 5,952 ESTs have matches in the Pfam database and represent 772 known protein families. About 18.0% of the 5,952 ESTs matched cell wall related genes in the MAIZEWALL database, representing all 18 categories, 91 of all 174 families and possibly 557 genes. Fifteen cell wall-related genes are ranked in the 30 most abundant genes, including CesA, tubulin, AGP, SAMS, actin, laccase, CCoAMT, MetE, phytocyanin, pectate lyase, cellulase, SuSy, expansin, chitinase and UDP-glucose dehydrogenase. Based on the PlantTFDB database 41 of the 64 transcription factor families in the poplar genome were identified as being involved in radiata pine wood formation. Comparative analysis of GO term abundance revealed a distinct transcriptome in juvenile earlywood formation compared to other stages of

  14. Genetic diversity analysis in Malaysian giant prawns using expressed sequence tag microsatellite markers for stock improvement program.

    Science.gov (United States)

    Atin, K H; Christianus, A; Fatin, N; Lutas, A C; Shabanimofrad, M; Subha, B

    2017-08-17

    The Malaysian giant prawn is among the most commonly cultured species of the genus Macrobrachium. Stocks of giant prawns from four rivers in Peninsular Malaysia have been used for aquaculture over the past 25 years, which has led to repeated harvesting, restocking, and transplantation between rivers. Consequently, a stock improvement program is now important to avoid the depletion of wild stocks and the loss of genetic diversity. However, the success of such an improvement program depends on our knowledge of the genetic variation of these base populations. The aim of the current study was to estimate genetic variation and differentiation of these riverine sources using novel expressed sequence tag-microsatellite (EST-SSR) markers, which not only are informative on genetic diversity but also provide information on immune and metabolic traits. Our findings indicated that the tested stocks have inbreeding depression due to a significant deficiency in heterozygotes, and F IS was estimated as 0.15538 to 0.31938. An F-statistics analysis suggested that the stocks are composed of one large panmictic population. Among the four locations, stocks from Johor, in the southern region of the peninsular, showed higher allelic and genetic diversity than the other stocks. To overcome inbreeding problems, the Johor population could be used as a base population in a stock improvement program by crossing to the other populations. The study demonstrated that EST-SSR markers can be incorporated in future marker assisted breeding to aid the proper management of the stocks by breeders and stakeholders in Malaysia.

  15. Large-scale identification of odorant-binding proteins and chemosensory proteins from expressed sequence tags in insects

    Science.gov (United States)

    2009-01-01

    Background Insect odorant binding proteins (OBPs) and chemosensory proteins (CSPs) play an important role in chemical communication of insects. Gene discovery of these proteins is a time-consuming task. In recent years, expressed sequence tags (ESTs) of many insect species have accumulated, thus providing a useful resource for gene discovery. Results We have developed a computational pipeline to identify OBP and CSP genes from insect ESTs. In total, 752,841 insect ESTs were examined from 54 species covering eight Orders of Insecta. From these ESTs, 142 OBPs and 177 CSPs were identified, of which 117 OBPs and 129 CSPs are new. The complete open reading frames (ORFs) of 88 OBPs and 123 CSPs were obtained by electronic elongation. We randomly chose 26 OBPs from eight species of insects, and 21 CSPs from four species for RT-PCR validation. Twenty two OBPs and 16 CSPs were confirmed by RT-PCR, proving the efficiency and reliability of the algorithm. Together with all family members obtained from the NCBI (OBPs) or the UniProtKB (CSPs), 850 OBPs and 237 CSPs were analyzed for their structural characteristics and evolutionary relationship. Conclusions A large number of new OBPs and CSPs were found, providing the basis for deeper understanding of these proteins. In addition, the conserved motif and evolutionary analysis provide some new insights into the evolution of insect OBPs and CSPs. Motif pattern fine-tune the functions of OBPs and CSPs, leading to the minor difference in binding sex pheromone or plant volatiles in different insect Orders. PMID:20034407

  16. Generation of expressed sequence tags for discovery of genes responsible for floral traits of Chrysanthemum morifolium by next-generation sequencing technology.

    Science.gov (United States)

    Sasaki, Katsutomo; Mitsuda, Nobutaka; Nashima, Kenji; Kishimoto, Kyutaro; Katayose, Yuichi; Kanamori, Hiroyuki; Ohmiya, Akemi

    2017-09-04

    Chrysanthemum morifolium is one of the most economically valuable ornamental plants worldwide. Chrysanthemum is an allohexaploid plant with a large genome that is commercially propagated by vegetative reproduction. New cultivars with different floral traits, such as color, morphology, and scent, have been generated mainly by classical cross-breeding and mutation breeding. However, only limited genetic resources and their genome information are available for the generation of new floral traits. To obtain useful information about molecular bases for floral traits of chrysanthemums, we read expressed sequence tags (ESTs) of chrysanthemums by high-throughput sequencing using the 454 pyrosequencing technology. We constructed normalized cDNA libraries, consisting of full-length, 3'-UTR, and 5'-UTR cDNAs derived from various tissues of chrysanthemums. These libraries produced a total number of 3,772,677 high-quality reads, which were assembled into 213,204 contigs. By comparing the data obtained with those of full genome-sequenced species, we confirmed that our chrysanthemum contig set contained the majority of all expressed genes, which was sufficient for further molecular analysis in chrysanthemums. We confirmed that our chrysanthemum EST set (contigs) contained a number of contigs that encoded transcription factors and enzymes involved in pigment and aroma compound metabolism that was comparable to that of other species. This information can serve as an informative resource for identifying genes involved in various biological processes in chrysanthemums. Moreover, the findings of our study will contribute to a better understanding of the floral characteristics of chrysanthemums including the myriad cultivars at the molecular level.

  17. Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

    Science.gov (United States)

    Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

    2012-12-01

    Simple sequence repeat (SSR) markers were developed from Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. Genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75 % of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the others taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.

  18. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    Science.gov (United States)

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis.

  19. Expressed sequence tag (EST) analysis of two subspecies of Metarhizium anisopliae reveals a plethora of secreted proteins with potential activity in insect hosts.

    Science.gov (United States)

    Freimoser, Florian M; Screen, Steven; Bagga, Savita; Hu, Gang; St Leger, Raymond J

    2003-01-01

    Expressed sequence tag (EST) libraries for Metarhizium anisopliae, the causative agent of green muscardine disease, were developed from the broad host-range pathogen Metarhizium anisopliae sf. anisopliae and the specific grasshopper pathogen, M. anisopliae sf. acridum. Approximately 1,700 5' end sequences from each subspecies were generated from cDNA libraries representing fungi grown under conditions that maximize secretion of cuticle-degrading enzymes. Both subspecies had ESTs for virtually all pathogenicity-related genes cloned to date from M. anisopliae, but many novel genes encoding potential virulence factors were also tagged. Enzymes with potential targets in the insect host included proteases, chitinases, phospholipases, lipases, esterases, phosphatases and enzymes producing toxic secondary metabolites. A diverse array of proteases composed 36 % of all M. anisopliae sf. anisopliae ESTs. Eighty percent of the ESTs that could be clustered into functional groups had significant matches (Ehistory of this clade.

  20. Analyses of expressed sequence tags from the maize foliar pathogen Cercospora zeae-maydis identify novel genes expressed during vegetative, infectious, and reproductive growth

    Directory of Open Access Journals (Sweden)

    Kema Gert HJ

    2008-11-01

    Full Text Available Abstract Background The ascomycete fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial losses annually throughout the Western Hemisphere. Despite its impact on maize production, little is known about the regulation of pathogenesis in C. zeae-maydis at the molecular level. The objectives of this study were to generate a collection of expressed sequence tags (ESTs from C. zeae-maydis and evaluate their expression during vegetative, infectious, and reproductive growth. Results A total of 27,551 ESTs was obtained from five cDNA libraries constructed from vegetative and sporulating cultures of C. zeae-maydis. The ESTs, grouped into 4088 clusters and 531 singlets, represented 4619 putative unique genes. Of these, 36% encoded proteins similar (E value ≤ 10-05 to characterized or annotated proteins from the NCBI non-redundant database representing diverse molecular functions and biological processes based on Gene Ontology (GO classification. We identified numerous, previously undescribed genes with potential roles in photoreception, pathogenesis, and the regulation of development as well as Zephyr, a novel, actively transcribed transposable element. Differential expression of selected genes was demonstrated by real-time PCR, supporting their proposed roles in vegetative, infectious, and reproductive growth. Conclusion Novel genes that are potentially involved in regulating growth, development, and pathogenesis were identified in C. zeae-maydis, providing specific targets for characterization by molecular genetics and functional genomics. The EST data establish a foundation for future studies in evolutionary and comparative genomics among species of Cercospora and other groups of plant pathogenic fungi.

  1. An analysis of expressed sequence tags of developing castor endosperm using a full-length cDNA library

    Directory of Open Access Journals (Sweden)

    Wallis James G

    2007-07-01

    Full Text Available Abstract Background Castor seeds are a major source for ricinoleate, an important industrial raw material. Genomics studies of castor plant will provide critical information for understanding seed metabolism, for effectively engineering ricinoleate production in transgenic oilseeds, or for genetically improving castor plants by eliminating toxic and allergic proteins in seeds. Results Full-length cDNAs are useful resources in annotating genes and in providing functional analysis of genes and their products. We constructed a full-length cDNA library from developing castor endosperm, and obtained 4,720 ESTs from 5'-ends of the cDNA clones representing 1,908 unique sequences. The most abundant transcripts are genes encoding storage proteins, ricin, agglutinin and oleosins. Several other sequences are also very numerous, including two acidic triacylglycerol lipases, and the oleate hydroxylase (FAH12 gene that is responsible for ricinoleate biosynthesis. The role(s of the lipases in developing castor seeds are not clear, and co-expressing of a lipase and the FAH12 did not result in significant changes in hydroxy fatty acid accumulation in transgenic Arabidopsis seeds. Only one oleate desaturase (FAD2 gene was identified in our cDNA sequences. Sequence and functional analyses of the castor FAD2 were carried out since it had not been characterized previously. Overexpression of castor FAD2 in a FAH12-expressing Arabidopsis line resulted in decreased accumulation of hydroxy fatty acids in transgenic seeds. Conclusion Our results suggest that transcriptional regulation of FAD2 and FAH12 genes maybe one of the mechanisms that contribute to a high level of ricinoleate accumulation in castor endosperm. The full-length cDNA library will be used to search for additional genes that affect ricinoleate accumulation in seed oils. Our EST sequences will also be useful to annotate the castor genome, which whole sequence is being generated by shotgun sequencing at

  2. Preparing and Analyzing Expressed Sequence Tags (ESTs Library for the Mammary Tissue of Local Turkish Kivircik Sheep

    Directory of Open Access Journals (Sweden)

    Nehir Ozdemir Ozgenturk

    2017-01-01

    Full Text Available Kivircik sheep is an important local Turkish sheep according to its meat quality and milk productivity. The aim of this study was to analyze gene expression profiles of both prenatal and postnatal stages for the Kivircik sheep. Therefore, two different cDNA libraries, which were taken from the same Kivircik sheep mammary gland tissue at prenatal and postnatal stages, were constructed. Total 3072 colonies which were randomly selected from the two libraries were sequenced for developing a sheep ESTs collection. We used Phred/Phrap computer programs for analysis of the raw EST and readable EST sequences were assembled with the CAP3 software. Putative functions of all unique sequences and statistical analysis were determined by Geneious software. Total 422 ESTs have over 80% similarity to known sequences of other organisms in NCBI classified by Panther database for the Gene Ontology (GO category. By comparing gene expression profiles, we observed some putative genes that may be relative to reproductive performance or play important roles in milk synthesis and secretion. A total of 2414 ESTs have been deposited to the NCBI GenBank database (GW996847–GW999260. EST data in this study have provided a new source of information to functional genome studies of sheep.

  3. Generation and Analysis of Expressed Sequence Tags (ESTs from Halophyte Atriplex canescens to Explore Salt-Responsive Related Genes

    Directory of Open Access Journals (Sweden)

    Jingtao Li

    2014-06-01

    Full Text Available Little information is available on gene expression profiling of halophyte A. canescens. To elucidate the molecular mechanism for stress tolerance in A. canescens, a full-length complementary DNA library was generated from A. canescens exposed to 400 mM NaCl, and provided 343 high-quality ESTs. In an evaluation of 343 valid EST sequences in the cDNA library, 197 unigenes were assembled, among which 190 unigenes (83.1% ESTs were identified according to their significant similarities with proteins of known functions. All the 343 EST sequences have been deposited in the dbEST GenBank under accession numbers JZ535802 to JZ536144. According to Arabidopsis MIPS functional category and GO classifications, we identified 193 unigenes of the 311 annotations EST, representing 72 non-redundant unigenes sharing similarities with genes related to the defense response. The sets of ESTs obtained provide a rich genetic resource and 17 up-regulated genes related to salt stress resistance were identified by qRT-PCR. Six of these genes may contribute crucially to earlier and later stage salt stress resistance. Additionally, among the 343 unigenes sequences, 22 simple sequence repeats (SSRs were also identified contributing to the study of A. canescens resources.

  4. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    Directory of Open Access Journals (Sweden)

    Bendahmane Abdelhafid

    2011-05-01

    Full Text Available Abstract Background Melon (Cucumis melo, an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Result We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs and 3,073 single nucleotide polymorphisms (SNPs in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends. Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but

  5. Sequence tagging reveals unexpected modifications in toxicoproteomics

    Science.gov (United States)

    Dasari, Surendra; Chambers, Matthew C.; Codreanu, Simona G.; Liebler, Daniel C.; Collins, Ben C.; Pennington, Stephen R.; Gallagher, William M.; Tabb, David L.

    2010-01-01

    Toxicoproteomic samples are rich in posttranslational modifications (PTMs) of proteins. Identifying these modifications via standard database searching can incur significant performance penalties. Here we describe the latest developments in TagRecon, an algorithm that leverages inferred sequence tags to identify modified peptides in toxicoproteomic data sets. TagRecon identifies known modifications more effectively than the MyriMatch database search engine. TagRecon outperformed state of the art software in recognizing unanticipated modifications from LTQ, Orbitrap, and QTOF data sets. We developed user-friendly software for detecting persistent mass shifts from samples. We follow a three-step strategy for detecting unanticipated PTMs in samples. First, we identify the proteins present in the sample with a standard database search. Next, identified proteins are interrogated for unexpected PTMs with a sequence tag-based search. Finally, additional evidence is gathered for the detected mass shifts with a refinement search. Application of this technology on toxicoproteomic data sets revealed unintended cross-reactions between proteins and sample processing reagents. Twenty five proteins in rat liver showed signs of oxidative stress when exposed to potentially toxic drugs. These results demonstrate the value of mining toxicoproteomic data sets for modifications. PMID:21214251

  6. Transcriptional Regulations on the Low-Temperature-Induced Floral Transition in an Orchidaceae Species, Dendrobium nobile: An Expressed Sequence Tags Analysis

    Directory of Open Access Journals (Sweden)

    Shan Liang

    2012-01-01

    Full Text Available Vernalization-induced flowering is a cold-relevant adaptation in many species, but little is known about the genetic basis behind in Orchidaceae species. Here, we reported a collection of 15017 expressed sequence tags (ESTs from the vernalized axillary buds of an Orchidaceae species, Dendrobium nobile, which were assembled for 9616 unique gene clusters. Functional enrichment analysis showed that genes in relation to the responses to stresses, especially in the form of low temperatures, and those involving in protein biosynthesis and chromatin assembly were significantly overrepresented during 40 days of vernalization. Additionally, a total of 59 putative flowering-relevant genes were recognized, including those homologous to known key players in vernalization pathways in temperate cereals or Arabidopsis, such as cereal VRN1, FT/VRN3, and Arabidopsis AGL19. Results from this study suggest that the networks regulating vernalization-induced floral transition are conserved, but just in a part, in D. nobile, temperate cereals, and Arabidopsis.

  7. Generation and analysis of large-scale expressed sequence tags (ESTs from a full-length enriched cDNA library of porcine backfat tissue

    Directory of Open Access Journals (Sweden)

    Lee Hae-Young

    2006-02-01

    Full Text Available Abstract Background Genome research in farm animals will expand our basic knowledge of the genetic control of complex traits, and the results will be applied in the livestock industry to improve meat quality and productivity, as well as to reduce the incidence of disease. A combination of quantitative trait locus mapping and microarray analysis is a useful approach to reduce the overall effort needed to identify genes associated with quantitative traits of interest. Results We constructed a full-length enriched cDNA library from porcine backfat tissue. The estimated average size of the cDNA inserts was 1.7 kb, and the cDNA fullness ratio was 70%. In total, we deposited 16,110 high-quality sequences in the dbEST division of GenBank (accession numbers: DT319652-DT335761. For all the expressed sequence tags (ESTs, approximately 10.9 Mb of porcine sequence were generated with an average length of 674 bp per EST (range: 200–952 bp. Clustering and assembly of these ESTs resulted in a total of 5,008 unique sequences with 1,776 contigs (35.46% and 3,232 singleton (65.54% ESTs. From a total of 5,008 unique sequences, 3,154 (62.98% were similar to other sequences, and 1,854 (37.02% were identified as having no hit or low identity (Sus scrofa. Gene ontology (GO annotation of unique sequences showed that approximately 31.7, 32.3, and 30.8% were assigned molecular function, biological process, and cellular component GO terms, respectively. A total of 1,854 putative novel transcripts resulted after comparison and filtering with the TIGR SsGI; these included a large percentage of singletons (80.64% and a small proportion of contigs (13.36%. Conclusion The sequence data generated in this study will provide valuable information for studying expression profiles using EST-based microarrays and assist in the condensation of current pig TCs into clusters representing longer stretches of cDNA sequences. The isolation of genes expressed in backfat tissue is the

  8. Generation and analysis of expressed sequence tags (ESTs) of Camelina sativa to mine drought stress-responsive genes.

    Science.gov (United States)

    Kanth, Bashistha Kumar; Kumari, Shipra; Choi, Seo Hee; Ha, Hye-Jeong; Lee, Geung-Joo

    2015-11-06

    Camelina sativa is an oil-producing crop belonging to the family of Brassicaceae. Due to exceptionally high content of omega fatty acid, it is commercially grown around the world as edible oil, biofuel, and animal feed. A commonly referred 'false flax' or gold-of-pleasure Camelina sativa has been interested as one of biofuel feedstocks. The species can grow on marginal land due to its superior drought tolerance with low requirement of agricultural inputs. This crop has been unexploited due to very limited transcriptomic and genomic data. Use of gene-specific molecular markers is an important strategy for new cultivar development in breeding program. In this study, Illumina paired-end sequencing technology and bioinformatics tools were used to obtain expression profiling of genes responding to drought stress in Camelina sativa BN14. A total of more than 60,000 loci were assembled, corresponding to approximately 275 K transcripts. When the species was exposed to 10 kPa drought stress, 100 kPa drought stress, and rehydrated conditions, a total of 107, 2,989, and 982 genes, respectively, were up-regulated, while 146, 3,659, and 1189 genes, respectively, were down-regulated compared to control condition. Some unknown genes were found to be highly expressed under drought conditions, together with some already reported gene families such as senescence-associated genes, CAP160, and LEA under 100 kPa soil water condition, cysteine protease, 2OG, Fe(II)-dependent oxygenase, and RAD-like 1 under rehydrated condition. These genes will be further validated and mapped to determine their function and loci. This EST library will be favorably applied to develop gene-specific molecular markers and discover genes responsible for drought tolerance in Camelina species. Copyright © 2015 Elsevier Inc. All rights reserved.

  9. Identification of stress-induced genes from the drought-tolerant plant Prosopis juliflora (Swartz) DC. through analysis of expressed sequence tags.

    Science.gov (United States)

    George, Suja; Venkataraman, Gayatri; Parida, Ajay

    2007-05-01

    Abiotic stresses such as cold, salinity, drought, wounding, and heavy metal contamination adversely affect crop productivity throughout the world. Prosopis juliflora is a phreatophyte that can tolerate severe adverse environmental conditions such as drought, salinity, and heavy metal contamination. As a first step towards the characterization of genes that contribute to combating abiotic stress, construction and analysis of a cDNA library of P. juliflora genes is reported here. Random expressed sequence tag (EST) sequencing of 1750 clones produced 1467 high-quality reads. These clones were classified into functional categories, and BLAST comparisons revealed that 114 clones were homologous to genes implicated in stress response(s) and included heat shock proteins, metallothioneins, lipid transfer proteins, and late embryogenesis abundant proteins. Of the ESTs analyzed, 26% showed homology to previously uncharacterized genes in the databases. Fifty-two clones from this category were selected for reverse Northern analysis: 21 were shown to be upregulated and 16 downregulated. The results obtained by reverse Northern analysis were confirmed by Northern analysis. Clustering of the 1467 ESTs produced a total of 295 contigs encompassing 790 ESTs, resulting in a 54.2% redundancy. Two of the abundant genes coding for a nonspecific lipid transfer protein and late embryogenesis abundant protein were sequenced completely. Northern analysis (after polyethylene glycol stress) of the 2 genes was carried out. The implications of the analyzed genes in abiotic stress tolerance are also discussed.

  10. Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum).

    Science.gov (United States)

    Li, Xuyan; Hou, Yanming; Zhang, Li; Zhang, Wenhao; Quan, Chen; Cui, Yuhai; Bian, Shaomin

    2014-01-01

    MicroRNAs (miRNAs) are a class of endogenous, approximately 21nt in length, non-coding RNA, which mediate the expression of target genes primarily at post-transcriptional levels. miRNAs play critical roles in almost all plant cellular and metabolic processes. Although numerous miRNAs have been identified in the plant kingdom, the miRNAs in blueberry, which is an economically important small fruit crop, still remain totally unknown. In this study, we reported a computational identification of miRNAs and their targets in blueberry. By conducting an EST-based comparative genomics approach, 9 potential vco-miRNAs were discovered from 22,402 blueberry ESTs according to a series of filtering criteria, designated as vco-miR156-5p, vco-miR156-3p, vco-miR1436, vco-miR1522, vco-miR4495, vco-miR5120, vco-miR5658, vco-miR5783, and vco-miR5986. Based on sequence complementarity between miRNA and its target transcript, 34 target ESTs from blueberry and 70 targets from other species were identified for the vco-miRNAs. The targets were found to be involved in transcription, RNA splicing and binding, DNA duplication, signal transduction, transport and trafficking, stress response, as well as synthesis and metabolic process. These findings will greatly contribute to future research in regard to functions and regulatory mechanisms of blueberry miRNAs.

  11. Expressed sequence tags from larval gut of the European corn borer (Ostrinia nubilalis: Exploring candidate genes potentially involved in Bacillus thuringiensis toxicity and resistance

    Directory of Open Access Journals (Sweden)

    Crespo Andre LB

    2009-06-01

    Full Text Available Abstract Background Lepidoptera represents more than 160,000 insect species which include some of the most devastating pests of crops, forests, and stored products. However, the genomic information on lepidopteran insects is very limited. Only a few studies have focused on developing expressed sequence tag (EST libraries from the guts of lepidopteran larvae. Knowledge of the genes that are expressed in the insect gut are crucial for understanding basic physiology of food digestion, their interactions with Bacillus thuringiensis (Bt toxins, and for discovering new targets for novel toxins for use in pest management. This study analyzed the ESTs generated from the larval gut of the European corn borer (ECB, Ostrinia nubilalis, one of the most destructive pests of corn in North America and the western world. Our goals were to establish an ECB larval gut-specific EST database as a genomic resource for future research and to explore candidate genes potentially involved in insect-Bt interactions and Bt resistance in ECB. Results We constructed two cDNA libraries from the guts of the fifth-instar larvae of ECB and sequenced a total of 15,000 ESTs from these libraries. A total of 12,519 ESTs (83.4% appeared to be high quality with an average length of 656 bp. These ESTs represented 2,895 unique sequences, including 1,738 singletons and 1,157 contigs. Among the unique sequences, 62.7% encoded putative proteins that shared significant sequence similarities (E-value ≤ 10-3with the sequences available in GenBank. Our EST analysis revealed 52 candidate genes that potentially have roles in Bt toxicity and resistance. These genes encode 18 trypsin-like proteases, 18 chymotrypsin-like proteases, 13 aminopeptidases, 2 alkaline phosphatases and 1 cadherin-like protein. Comparisons of expression profiles of 41 selected candidate genes between Cry1Ab-susceptible and resistant strains of ECB by RT-PCR showed apparently decreased expressions in 2 trypsin-like and 2

  12. Identification and functional characterization of effectors in expressed sequence tags from various life cycle stages of the potato cyst nematode Globodera pallida.

    Science.gov (United States)

    Jones, John T; Kumar, Amar; Pylypenko, Liliya A; Thirugnanasambandam, Amarnath; Castelli, Lydia; Chapman, Sean; Cock, Peter J A; Grenier, Eric; Lilley, Catherine J; Phillips, Mark S; Blok, Vivian C

    2009-11-01

    In this article, we describe the analysis of over 9000 expressed sequence tags (ESTs) from cDNA libraries obtained from various life cycle stages of Globodera pallida. We have identified over 50 G. pallida effectors from this dataset using bioinformatics analysis, by screening clones in order to identify secreted proteins up-regulated after the onset of parasitism and using in situ hybridization to confirm the expression in pharyngeal gland cells. A substantial gene family encoding G. pallida SPRYSEC proteins has been identified. The expression of these genes is restricted to the dorsal pharyngeal gland cell. Different members of the SPRYSEC family of proteins from G. pallida show different subcellular localization patterns in plants, with some localized to the cytoplasm and others to the nucleus and nucleolus. Differences in subcellular localization may reflect diverse functional roles for each individual protein or, more likely, variety in the compartmentalization of plant proteins targeted by the nematode. Our data are therefore consistent with the suggestion that the SPRYSEC proteins suppress host defences, as suggested previously, and that they achieve this through interaction with a range of host targets.

  13. Analysis and functional annotation of expressed sequence tags from in vitro cell lines of elasmobranchs: Spiny dogfish shark (Squalus acanthias) and little skate (Leucoraja erinacea).

    Science.gov (United States)

    Parton, Angela; Bayne, Christopher J; Barnes, David W

    2010-09-01

    Elasmobranchs are the most commonly used experimental models among the jawed, cartilaginous fish (Chondrichthyes). Previously we developed cell lines from embryos of two elasmobranchs, Squalus acanthias the spiny dogfish shark (SAE line), and Leucoraja erinacea the little skate (LEE-1 line). From these lines cDNA libraries were derived and expressed sequence tags (ESTs) generated. From the SAE cell line 4303 unique transcripts were identified, with 1848 of these representing unknown sequences (showing no BLASTX identification). From the LEE-1 cell line, 3660 unique transcripts were identified, and unknown, unique sequences totaled 1333. Gene Ontology (GO) annotation showed that GO assignments for the two cell lines were in general similar. These results suggest that the procedures used to derive the cell lines led to isolation of cell types of the same general embryonic origin from both species. The LEE-1 transcripts included GO categories "envelope" and "oxidoreductase activity" but the SAE transcripts did not. GO analysis of SAE transcripts identified the category "anatomical structure formation" that was not present in LEE-1 cells. Increased organelle compartments may exist within LEE-1 cells compared to SAE cells, and the higher oxidoreductase activity in LEE-1 cells may indicate a role for these cells in responses associated with innate immunity or in steroidogenesis. These EST libraries from elasmobranch cell lines provide information for assembly of genomic sequences and are useful in revealing gene diversity, new genes and molecular markers, as well as in providing means for elucidation of full-length cDNAs and probes for gene array analyses. This is the first study of this type with members of the Chondrichthyes. Copyright 2010 Elsevier Inc. All rights reserved.

  14. A second generation framework for the analysis of microsatellites in expressed sequence tags and the development of EST-SSR markers for a conifer, Cryptomeria japonica

    Directory of Open Access Journals (Sweden)

    Ueno Saneyoshi

    2012-04-01

    Full Text Available Abstract Background Microsatellites or simple sequence repeats (SSRs in expressed sequence tags (ESTs are useful resources for genome analysis because of their abundance, functionality and polymorphism. The advent of commercial second generation sequencing machines has lead to new strategies for developing EST-SSR markers, necessitating the development of bioinformatic framework that can keep pace with the increasing quality and quantity of sequence data produced. We describe an open scheme for analyzing ESTs and developing EST-SSR markers from reads collected by Sanger sequencing and pyrosequencing of sugi (Cryptomeria japonica. Results We collected 141,097 sequence reads by Sanger sequencing and 1,333,444 by pyrosequencing. After trimming contaminant and low quality sequences, 118,319 Sanger and 1,201,150 pyrosequencing reads were passed to the MIRA assembler, generating 81,284 contigs that were analysed for SSRs. 4,059 SSRs were found in 3,694 (4.54% contigs, giving an SSR frequency lower than that in seven other plant species with gene indices (5.4–21.9%. The average GC content of the SSR-containing contigs was 41.55%, compared to 40.23% for all contigs. Tri-SSRs were the most common SSRs; the most common motif was AT, which was found in 655 (46.3% di-SSRs, followed by the AAG motif, found in 342 (25.9% tri-SSRs. Most (72.8% tri-SSRs were in coding regions, but 55.6% of the di-SSRs were in non-coding regions; the AT motif was most abundant in 3′ untranslated regions. Gene ontology (GO annotations showed that six GO terms were significantly overrepresented within SSR-containing contigs. Forty–four EST-SSR markers were developed from 192 primer pairs using two pipelines: read2Marker and the newly-developed CMiB, which combines several open tools. Markers resulting from both pipelines showed no differences in PCR success rate and polymorphisms, but PCR success and polymorphism were significantly affected by the expected PCR product size

  15. A second generation framework for the analysis of microsatellites in expressed sequence tags and the development of EST-SSR markers for a conifer, Cryptomeria japonica

    Science.gov (United States)

    2012-01-01

    Background Microsatellites or simple sequence repeats (SSRs) in expressed sequence tags (ESTs) are useful resources for genome analysis because of their abundance, functionality and polymorphism. The advent of commercial second generation sequencing machines has lead to new strategies for developing EST-SSR markers, necessitating the development of bioinformatic framework that can keep pace with the increasing quality and quantity of sequence data produced. We describe an open scheme for analyzing ESTs and developing EST-SSR markers from reads collected by Sanger sequencing and pyrosequencing of sugi (Cryptomeria japonica). Results We collected 141,097 sequence reads by Sanger sequencing and 1,333,444 by pyrosequencing. After trimming contaminant and low quality sequences, 118,319 Sanger and 1,201,150 pyrosequencing reads were passed to the MIRA assembler, generating 81,284 contigs that were analysed for SSRs. 4,059 SSRs were found in 3,694 (4.54%) contigs, giving an SSR frequency lower than that in seven other plant species with gene indices (5.4–21.9%). The average GC content of the SSR-containing contigs was 41.55%, compared to 40.23% for all contigs. Tri-SSRs were the most common SSRs; the most common motif was AT, which was found in 655 (46.3%) di-SSRs, followed by the AAG motif, found in 342 (25.9%) tri-SSRs. Most (72.8%) tri-SSRs were in coding regions, but 55.6% of the di-SSRs were in non-coding regions; the AT motif was most abundant in 3′ untranslated regions. Gene ontology (GO) annotations showed that six GO terms were significantly overrepresented within SSR-containing contigs. Forty–four EST-SSR markers were developed from 192 primer pairs using two pipelines: read2Marker and the newly-developed CMiB, which combines several open tools. Markers resulting from both pipelines showed no differences in PCR success rate and polymorphisms, but PCR success and polymorphism were significantly affected by the expected PCR product size and number of SSR

  16. Analysis of a normalised expressed sequence tag (EST) library from a key pollinator, the bumblebee Bombus terrestris.

    Science.gov (United States)

    Sadd, Ben M; Kube, Michael; Klages, Sven; Reinhardt, Richard; Schmid-Hempel, Paul

    2010-02-15

    The bumblebee, Bombus terrestris (Order Hymenoptera), is of widespread importance. This species is extensively used for commercial pollination in Europe, and along with other Bombus spp. is a key member of natural pollinator assemblages. Furthermore, the species is studied in a wide variety of biological fields. The objective of this project was to create a B. terrestris EST resource that will prove to be valuable in obtaining a deeper understanding of this significant social insect. A normalised cDNA library was constructed from the thorax and abdomen of B. terrestris workers in order to enhance the discovery of rare genes. A total of 29'428 ESTs were sequenced. Subsequent clustering resulted in 13'333 unique sequences. Of these, 58.8 percent had significant similarities to known proteins, with 54.5 percent having a "best-hit" to existing Hymenoptera sequences. Comparisons with the honeybee and other insects allowed the identification of potential candidates for gene loss, pseudogene evolution, and possible incomplete annotation in the honeybee genome. Further, given the focus of much basic research and the perceived threat of disease to natural and commercial populations, the immune system of bumblebees is a particularly relevant component. Although the library is derived from unchallenged bees, we still uncover transcription of a number of immune genes spanning the principally described insect immune pathways. Additionally, the EST library provides a resource for the discovery of genetic markers that can be used in population level studies. Indeed, initial screens identified 589 simple sequence repeats and 854 potential single nucleotide polymorphisms. The resource that these B. terrestris ESTs represent is valuable for ongoing work. The ESTs provide direct evidence of transcriptionally active regions, but they will also facilitate further functional genomics, gene discovery and future genome annotation. These are important aspects in obtaining a greater

  17. Development and validation of 89 novel expressed sequence tag-derived microsatellite markers in blood clam, Tegillarca granosa

    Science.gov (United States)

    Teng, Shuangshuang; Fang, Jun; Cai, Yilong; Chai, Xueliang; Xiao, Guoqiang

    2017-06-01

    Blood clam, Tegillarca granosa, is an important shellfish in Chinese mariculture industry. Investigative research in this species, such as genetic linkage mapping, requires a large panel of molecular markers. In present study, a total of 89 polymorphic microsatellite markers were developed in T. granosa using the sequence database of Life Sciences Technology 454 next generation sequencing technology. All 89 loci were characterized in 20 individual clams from a natural population inhabiting Yueqing Gulf, Zhejiang Province, China. The number of alleles per polymorphic locus varied between 2 and 15, while the observed heterozygosity, expected heterozygosity and polymorphic information content varied between 0.000 and 1.000, 0.102 and 0.921, and 0.048 and 0.886, respectively. Of the 89 loci identified, 32 loci deviated significantly from Hardy-Weinberg equilibrium following Bonferroni correction. Thirty nine markers, which were shown to be polymorphic in a full-sibling family, were tested in Mendelian segregations. As expected, 32 loci were co-dominantly segregated in a Mendelian fashion. These novel developed microsatellite markers represent useful research tools for investigation of population genetic structure and genetic diversity in this species.

  18. Identification of salt-induced genes from Salicornia brachiata, an extreme halophyte through expressed sequence tags analysis.

    Science.gov (United States)

    Jha, Bhavanath; Agarwal, Pradeep K; Reddy, Palakolanu Sudhakar; Lal, Sanjay; Sopory, Sudhir K; Reddy, Malireddy K

    2009-04-01

    Salinity severely affects plant growth and development causing crop loss worldwide. We have isolated a large number of salt-induced genes as well as unknown and hypothetical genes from Salicornia brachiata Roxb. (Amaranthaceae). This is the first description of identification of genes in response to salinity stress in this extreme halophyte plant. Salicornia accumulates salt in its pith and survives even at 2 M NaCl under field conditions. For isolating salt responsive genes, cDNA subtractive hybridization was performed between control and 500 mM NaCl treated plants. Out of the 1200 recombinant clones, 930 sequences were submitted to the NCBI database (GenBank accession: EB484528 to EB485289 and EC906125 to EC906292). 789 ESTs showed matching with different genes in NCBI database. 4.8% ESTs belonged to stress-tolerant gene category and approximately 29% ESTs showed no homology with known functional gene sequences, thus classified as unknown or hypothetical. The detection of a large number of ESTs with unknown putative function in this species makes it an interesting contribution. The 90 unknown and hypothetical genes were selected to study their differential regulation by reverse Northern analysis for identifying their role in salinity tolerance. Interestingly, both up and down regulation at 500 mM NaCl were observed (21 and 10 genes, respectively). Northern analysis of two important salt tolerant genes, ASR1 (Abscisic acid stress ripening gene) and plasma membrane H+ATPase, showed the basal level of transcripts in control condition and an increase with NaCl treatment. ASR1 gene is made full length using 5' RACE and its potential role in imparting salt tolerance is being studied.

  19. Construction of a cDNA library and preliminary analysis of expressed sequence tags in Piper hainanense.

    Science.gov (United States)

    Fan, R; Ling, P; Hao, C Y; Li, F P; Huang, L F; Wu, B D; Wu, H S

    2015-10-19

    Black pepper is a perennial climbing vine. It is widely cultivated because its berries can be utilized not only as a spice in food but also for medicinal use. This study aimed to construct a standardized, high-quality cDNA library to facilitated identification of new Piper hainanense transcripts. For this, 262 unigenes were used to generate raw reads. The average length of these 262 unigenes was 774.8 bp. Of these, 94 genes (35.9%) were newly identified, according to the NCBI protein database. Thus, identification of new genes may broaden the molecular knowledge of P. hainanense on the basis of Clusters of Orthologous Groups and Gene Ontology categories. In addition, certain basic genes linked to physiological processes, which can contribute to disease resistance and thereby to the breeding of black pepper. A total of 26 unigenes were found to be SSR markers. Dinucleotide SSR was the main repeat motif, accounting for 61.54%, followed by trinucleotide SSR (23.07%). Eight primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among twenty-one piper germplasm. These results present a novel sequence information of P. hainanense, which can serve as the foundation for further genetic research on this species.

  20. Analysis of expressed sequence tags from Actinidia: applications of a cross species EST database for gene discovery in the areas of flavor, health, color and ripening

    Directory of Open Access Journals (Sweden)

    Richardson Annette C

    2008-07-01

    Full Text Available Abstract Background Kiwifruit (Actinidia spp. are a relatively new, but economically important crop grown in many different parts of the world. Commercial success is driven by the development of new cultivars with novel consumer traits including flavor, appearance, healthful components and convenience. To increase our understanding of the genetic diversity and gene-based control of these key traits in Actinidia, we have produced a collection of 132,577 expressed sequence tags (ESTs. Results The ESTs were derived mainly from four Actinidia species (A. chinensis, A. deliciosa, A. arguta and A. eriantha and fell into 41,858 non redundant clusters (18,070 tentative consensus sequences and 23,788 EST singletons. Analysis of flavor and fragrance-related gene families (acyltransferases and carboxylesterases and pathways (terpenoid biosynthesis is presented in comparison with a chemical analysis of the compounds present in Actinidia including esters, acids, alcohols and terpenes. ESTs are identified for most genes in color pathways controlling chlorophyll degradation and carotenoid biosynthesis. In the health area, data are presented on the ESTs involved in ascorbic acid and quinic acid biosynthesis showing not only that genes for many of the steps in these pathways are represented in the database, but that genes encoding some critical steps are absent. In the convenience area, genes related to different stages of fruit softening are identified. Conclusion This large EST resource will allow researchers to undertake the tremendous challenge of understanding the molecular basis of genetic diversity in the Actinidia genus as well as provide an EST resource for comparative fruit genomics. The various bioinformatics analyses we have undertaken demonstrates the extent of coverage of ESTs for genes encoding different biochemical pathways in Actinidia.

  1. Ginger and turmeric expressed sequence tags identify signature genes for rhizome identity and development and the biosynthesis of curcuminoids, gingerols and terpenoids

    Science.gov (United States)

    2013-01-01

    Background Ginger (Zingiber officinale) and turmeric (Curcuma longa) accumulate important pharmacologically active metabolites at high levels in their rhizomes. Despite their importance, relatively little is known regarding gene expression in the rhizomes of ginger and turmeric. Results In order to identify rhizome-enriched genes and genes encoding specialized metabolism enzymes and pathway regulators, we evaluated an assembled collection of expressed sequence tags (ESTs) from eight different ginger and turmeric tissues. Comparisons to publicly available sorghum rhizome ESTs revealed a total of 777 gene transcripts expressed in ginger/turmeric and sorghum rhizomes but apparently absent from other tissues. The list of rhizome-specific transcripts was enriched for genes associated with regulation of tissue growth, development, and transcription. In particular, transcripts for ethylene response factors and AUX/IAA proteins appeared to accumulate in patterns mirroring results from previous studies regarding rhizome growth responses to exogenous applications of auxin and ethylene. Thus, these genes may play important roles in defining rhizome growth and development. Additional associations were made for ginger and turmeric rhizome-enriched MADS box transcription factors, their putative rhizome-enriched homologs in sorghum, and rhizomatous QTLs in rice. Additionally, analysis of both primary and specialized metabolism genes indicates that ginger and turmeric rhizomes are primarily devoted to the utilization of leaf supplied sucrose for the production and/or storage of specialized metabolites associated with the phenylpropanoid pathway and putative type III polyketide synthase gene products. This finding reinforces earlier hypotheses predicting roles of this enzyme class in the production of curcuminoids and gingerols. Conclusion A significant set of genes were found to be exclusively or preferentially expressed in the rhizome of ginger and turmeric. Specific

  2. An Expressed Sequence Tag collection from the male antennae of the Noctuid moth Spodoptera littoralis: a resource for olfactory and pheromone detection research

    Directory of Open Access Journals (Sweden)

    Maïbèche-Coisné Martine

    2011-01-01

    Full Text Available Abstract Background Nocturnal insects such as moths are ideal models to study the molecular bases of olfaction that they use, among examples, for the detection of mating partners and host plants. Knowing how an odour generates a neuronal signal in insect antennae is crucial for understanding the physiological bases of olfaction, and also could lead to the identification of original targets for the development of olfactory-based control strategies against herbivorous moth pests. Here, we describe an Expressed Sequence Tag (EST project to characterize the antennal transcriptome of the noctuid pest model, Spodoptera littoralis, and to identify candidate genes involved in odour/pheromone detection. Results By targeting cDNAs from male antennae, we biased gene discovery towards genes potentially involved in male olfaction, including pheromone reception. A total of 20760 ESTs were obtained from a normalized library and were assembled in 9033 unigenes. 6530 were annotated based on BLAST analyses and gene prediction software identified 6738 ORFs. The unigenes were compared to the Bombyx mori proteome and to ESTs derived from Lepidoptera transcriptome projects. We identified a large number of candidate genes involved in odour and pheromone detection and turnover, including 31 candidate chemosensory receptor genes, but also genes potentially involved in olfactory modulation. Conclusions Our project has generated a large collection of antennal transcripts from a Lepidoptera. The normalization process, allowing enrichment in low abundant genes, proved to be particularly relevant to identify chemosensory receptors in a species for which no genomic data are available. Our results also suggest that olfactory modulation can take place at the level of the antennae itself. These EST resources will be invaluable for exploring the mechanisms of olfaction and pheromone detection in S. littoralis, and for ultimately identifying original targets to fight against moth

  3. Cloning and Expression of Ontak Immunotoxin Using Intein Tag

    Directory of Open Access Journals (Sweden)

    SA Moosavizadeh

    2016-06-01

    Full Text Available Introduction: Inteins (INT are internal parts of a number of proteins in yeast and some other unicellular eukaryotes, which can be separated from the immature protein during protein splicing process. After identifying the mechanism of intein action, applications of these sequences are be considered in the single- step purification of recombinant proteins and different intein tags were developed. The most important advantage of using intein tags in purification of recombinant proteins than other affinity tags is no requirement of expensive protease enzymes and following additional steps to remove protease that make intein tags economically are considered more important. In the present study, denileukin diftitox immunotoxin (brand name Ontak, be fused with an intein tag and it was inserted in pTXB1 plasmid. Methods: In this study, with respect to multiple cloning sites (MCS of pTXB1, specific primers were designed. Polymerase Chain Reaction (PCR was performed and encoding sequence of ONTAK was cloned using restriction sites of NdeI and SapI. Recombinant vector (PTX-IDZ was transformed into E. coli strain ER2566 and expression of gene was studied. Results: The accuracy of recombinant construct was confirmed by PCR and enzymatic digestion. The produced recombinant proteins were confirmed by SDS-PAGE and Western blotting. Conclusion: Restriction site of SapI guarantees no additional residues incorporate in primary protein sequence. Also, the expression of this construct was analyzed in compare with fused protein to poly-His tag. According to the appropriate expression of fused protein in both constructs it was expected that one step- purification of considered drug protein will be success in the following steps.

  4. Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

    Science.gov (United States)

    Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

    2012-01-01

    Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604

  5. Analysis of expressed sequence tags (ESTs) from avocado seed (Persea americana var. drymifolia) reveals abundant expression of the gene encoding the antimicrobial peptide snakin.

    Science.gov (United States)

    Guzmán-Rodríguez, Jaquelina J; Ibarra-Laclette, Enrique; Herrera-Estrella, Luis; Ochoa-Zarzosa, Alejandra; Suárez-Rodríguez, Luis María; Rodríguez-Zapata, Luis C; Salgado-Garciglia, Rafael; Jimenez-Moraila, Beatriz; López-Meza, Joel E; López-Gómez, Rodolfo

    2013-09-01

    Avocado is one of the most important fruits in the world. Avocado "native mexicano" (Persea americana var. drymifolia) seeds are widely used in the propagation of this plant and are the primary source of rootstocks globally for a variety of avocado cultivars, such as the Hass avocado. Here, we report the isolation of 5005 ESTs from the 5' ends of P. americana var. drymifolia seed cDNA clones representing 1584 possible unigenes. These avocado seed ESTs were compared with the avocado flower EST library, and we detected several genes that are expressed either in both tissues or only in the seed. The snakin gene, which encodes an element of the innate immune response in plants, was one of those most frequently found among the seed ESTs, and this suggests that it is abundantly expressed in the avocado seed. We expressed the snakin gene in a heterologous system, namely the bovine endothelial cell line BVE-E6E7. Conditioned media from transfected BVE-E6E7 cells showed antimicrobial activity against strains of Escherichia coli and Staphylococcus aureus. This is the first study of the function of the snakin gene in plant seed tissue, and our observations suggest that this gene might play a protective role in the avocado seed. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  6. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Science.gov (United States)

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  7. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    Directory of Open Access Journals (Sweden)

    Changqing Liu

    2013-05-01

    Full Text Available In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild Animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 106 pfu/mL and 1.56 × 109 pfu/mL respectively. The percentage of recombinants from unamplified library was 90.2% and average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bps were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coded for the TAT-Ferritin fusion protein with two 6× His-tags in N and C-terminal. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library attained to the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  8. Identification of reproduction-related genes and SSR-markers through expressed sequence tags analysis of a monsoon breeding carp rohu, Labeo rohita (Hamilton).

    Science.gov (United States)

    Sahu, Dinesh K; Panda, Soumya P; Panda, Sujata; Das, Paramananda; Meher, Prem K; Hazra, Rupenangshu K; Peatman, Eric; Liu, Zhanjiang J; Eknath, Ambekar E; Nandi, Samiran

    2013-07-15

    Labeo rohita (Ham.) also called rohu is the most important freshwater aquaculture species on the Indian sub continent. Monsoon dependent breeding restricts its seed production beyond season indicating a strong genetic control about which very limited information is available. Additionally, few genomic resources are publicly available for this species. Here we sought to identify reproduction-relevant genes from normalized cDNA libraries of the brain-pituitary-gonad-liver (BPGL-axis) tissues of adult L. rohita collected during post preparatory phase. 6161 random clones sequenced (Sanger-based) from these libraries produced 4642 (75.34%) high-quality sequences. They were assembled into 3631 (78.22%) unique sequences composed of 709 contigs and 2922 singletons. A total of 182 unique sequences were found to be associated with reproduction-related genes, mainly under the GO term categories of reproduction, neuro-peptide hormone activity, hormone and receptor binding, receptor activity, signal transduction, embryonic development, cell-cell signaling, cell death and anti-apoptosis process. Several important reproduction-related genes reported here for the first time in L. rohita are zona pellucida sperm-binding protein 3, aquaporin-12, spermine oxidase, sperm associated antigen 7, testis expressed 261, progesterone receptor membrane component, Neuropeptide Y and Pro-opiomelanocortin. Quantitative RT-PCR-based analyses of 8 known and 8 unknown transcripts during preparatory and post-spawning phase showed increased expression level of most of the transcripts during preparatory phase (except Neuropeptide Y) in comparison to post-spawning phase indicating possible roles in initiation of gonad maturation. Expression of unknown transcripts was also found in prolific breeder common carp and tilapia, but levels of expression were much higher in seasonal breeder rohu. 3631 unique sequences contained 236 (6.49%) putative microsatellites with the AG (28.16%) repeat as the most

  9. Construction of an Ostrea edulis database from genomic and expressed sequence tags (ESTs) obtained from Bonamia ostreae infected haemocytes: Development of an immune-enriched oligo-microarray.

    Science.gov (United States)

    Pardo, Belén G; Álvarez-Dios, José Antonio; Cao, Asunción; Ramilo, Andrea; Gómez-Tato, Antonio; Planas, Josep V; Villalba, Antonio; Martínez, Paulino

    2016-12-01

    The flat oyster, Ostrea edulis, is one of the main farmed oysters, not only in Europe but also in the United States and Canada. Bonamiosis due to the parasite Bonamia ostreae has been associated with high mortality episodes in this species. This parasite is an intracellular protozoan that infects haemocytes, the main cells involved in oyster defence. Due to the economical and ecological importance of flat oyster, genomic data are badly needed for genetic improvement of the species, but they are still very scarce. The objective of this study is to develop a sequence database, OedulisDB, with new genomic and transcriptomic resources, providing new data and convenient tools to improve our knowledge of the oyster's immune mechanisms. Transcriptomic and genomic sequences were obtained using 454 pyrosequencing and compiled into an O. edulis database, OedulisDB, consisting of two sets of 10,318 and 7159 unique sequences that represent the oyster's genome (WG) and de novo haemocyte transcriptome (HT), respectively. The flat oyster transcriptome was obtained from two strains (naïve and tolerant) challenged with B. ostreae, and from their corresponding non-challenged controls. Approximately 78.5% of 5619 HT unique sequences were successfully annotated by Blast search using public databases. A total of 984 sequences were identified as being related to immune response and several key immune genes were identified for the first time in flat oyster. Additionally, transcriptome information was used to design and validate the first oligo-microarray in flat oyster enriched with immune sequences from haemocytes. Our transcriptomic and genomic sequencing and subsequent annotation have largely increased the scarce resources available for this economically important species and have enabled us to develop an OedulisDB database and accompanying tools for gene expression analysis. This study represents the first attempt to characterize in depth the O. edulis haemocyte transcriptome in

  10. Multiple tag labeling method for DNA sequencing

    Science.gov (United States)

    Mathies, R.A.; Huang, X.C.; Quesada, M.A.

    1995-07-25

    A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.

  11. Assembly of 500,000 inter-specific catfish expressed sequence tags and large scale gene-associated marker development for whole genome association studies

    Energy Technology Data Exchange (ETDEWEB)

    Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric; Abernathy, Jason; Waldbieser, Geoff; Lindquist, Erika; Richardson, Paul; Lucas, Susan; Wang, Mei; Li, Ping; Thimmapuram, Jyothi; Liu, Lei; Vullaganti, Deepika; Kucuktas, Huseyin; Murdock, Christopher; Small, Brian C; Wilson, Melanie; Liu, Hong; Jiang, Yanliang; Lee, Yoona; Chen, Fei; Lu, Jianguo; Wang, Wenqi; Xu, Peng; Somridhivej, Benjaporn; Baoprasertkul, Puttharat; Quilang, Jonas; Sha, Zhenxia; Bao, Baolong; Wang, Yaping; Wang, Qun; Takano, Tomokazu; Nandi, Samiran; Liu, Shikai; Wong, Lilian; Kaltenboeck, Ludmilla; Quiniou, Sylvie; Bengten, Eva; Miller, Norman; Trant, John; Rokhsar, Daniel; Liu, Zhanjiang

    2010-03-23

    Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities to known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.

  12. A large scale analysis of cDNA in Arabidopsis thaliana: generation of 12,028 non-redundant expressed sequence tags from normalized and size-selected cDNA libraries.

    Science.gov (United States)

    Asamizu, E; Nakamura, Y; Sato, S; Tabata, S

    2000-06-30

    For comprehensive analysis of genes expressed in the model dicotyledonous plant, Arabidopsis thaliana, expressed sequence tags (ESTs) were accumulated. Normalized and size-selected cDNA libraries were constructed from aboveground organs, flower buds, roots, green siliques and liquid-cultured seedlings, respectively, and a total of 14,026 5'-end ESTs and 39,207 3'-end ESTs were obtained. The 3'-end ESTs could be clustered into 12,028 non-redundant groups. Similarity search of the non-redundant ESTs against the public non-redundant protein database indicated that 4816 groups show similarity to genes of known function, 1864 to hypothetical genes, and the remaining 5348 are novel sequences. Gene coverage by the non-redundant ESTs was analyzed using the annotated genomic sequences of approximately 10 Mb on chromosomes 3 and 5. A total of 923 regions were hit by at least one EST, among which only 499 regions were hit by the ESTs deposited in the public database. The result indicates that the EST source generated in this project complements the EST data in the public database and facilitates new gene discovery.

  13. Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Cirera, Susanna; Hedegaard, Jacob

    2007-01-01

    public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. RESULTS: Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which...... with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression...

  14. Characterization of expressed sequence tags from developing fibers of Gossypium barbadense and evaluation of insertion-deletion variation in tetraploid cultivated cotton species.

    Science.gov (United States)

    Lv, Yuanda; Zhao, Liang; Xu, Xiaoyang; Wang, Lei; Wang, Cheng; Zhang, Tianzhen; Guo, Wangzhen

    2013-03-13

    Cotton is the leading fiber crop worldwide. Gossypium barbadense is an important species of cotton because of its extra-long staple fibers with superior luster and silkiness. However, a systematic analysis and utilization of cDNA sequences from G. barbadense fiber development remains understudied. A total of 21,079 high quality sequences were generated from two non-normalized cDNA libraries prepared by using a mixture of G. barbadense Hai7124 fibers and ovules. After assembly processing, a set of 8,653 unigenes were obtained. Of those, 7,786 were matched to known proteins and 7,316 were assigned to functional categories. The molecular functions of these unigenes were mostly related to binding and catalytic activity, and carbohydrate, amino acid, and energy metabolisms were major contributors among the subsets of metabolism. Sequences comparison between G. barbadense and G. hirsutum revealed that 8,245 unigenes from G. barbadense were detected the similarity with those released publicly in G. hirsutum, however, the remaining 408 sequences had no hits against G. hirsutum unigenes database. Furthermore, 13,275 putative ESTs InDels loci involved in the orthologous and/or homoeologous differences between/within G. barbadense and G. hirsutum were discovered by in silico analyses, and 2,160 InDel markers were developed by ESTs with more than five insertions or deletions. By gel electrophoresis combined with sequencing verification, 71.11% candidate InDel loci were reconfirmed orthologous and/or homoeologous loci polymorphisms using G. hirsutum acc TM-1 and G. barbadense cv Hai7124. Blastx result showed among 2,160 InDel loci, 81 with significant function similarity with known genes associated with secondary wall synthesis process, indicating the important roles in fiber quality in tetraploid cultivated cotton species. Sequence comparisons and InDel markers development will lay the groundwork for promoting the identification of genes related to superior agronomic traits

  15. Application of an E. coli signal sequence as a versatile inclusion body tag.

    Science.gov (United States)

    Jong, Wouter S P; Vikström, David; Houben, Diane; van den Berg van Saparoea, H Bart; de Gier, Jan-Willem; Luirink, Joen

    2017-03-21

    Heterologous protein production in Escherichia coli often suffers from bottlenecks such as proteolytic degradation, complex purification procedures and toxicity towards the expression host. Production of proteins in an insoluble form in inclusion bodies (IBs) can alleviate these problems. Unfortunately, the propensity of heterologous proteins to form IBs is variable and difficult to predict. Hence, fusing the target protein to an aggregation prone polypeptide or IB-tag is a useful strategy to produce difficult-to-express proteins in an insoluble form. When screening for signal sequences that mediate optimal targeting of heterologous proteins to the periplasmic space of E. coli, we observed that fusion to the 39 amino acid signal sequence of E. coli TorA (ssTorA) did not promote targeting but rather directed high-level expression of the human proteins hEGF, Pla2 and IL-3 in IBs. Further analysis revealed that ssTorA even mediated IB formation of the highly soluble endogenous E. coli proteins TrxA and MBP. The ssTorA also induced aggregation when fused to the C-terminus of target proteins and appeared functional as IB-tag in E. coli K-12 as well as B strains. An additive effect on IB-formation was observed upon fusion of multiple ssTorA sequences in tandem, provoking almost complete aggregation of TrxA and MBP. The ssTorA-moiety was successfully used to produce the intrinsically unstable hEGF and the toxic fusion partner SymE, demonstrating its applicability as an IB-tag for difficult-to-express and toxic proteins. We present proof-of-concept for the use of ssTorA as a small, versatile tag for robust E. coli-based expression of heterologous proteins in IBs.

  16. Expressed sequence tags (ESTs) and single nucleotide ...

    African Journals Online (AJOL)

    SERVER

    2008-02-19

    Feb 19, 2008 ... polymorphisms (SNPs): Emerging molecular marker tools for ... knowledge in plant biology, breeding and biotechnology. The emergence of many ...... phenotypes: past successes for Mendelian disease, future approaches for ...

  17. CREST--classification resources for environmental sequence tags.

    Directory of Open Access Journals (Sweden)

    Anders Lanzén

    Full Text Available Sequencing of taxonomic or phylogenetic markers is becoming a fast and efficient method for studying environmental microbial communities. This has resulted in a steadily growing collection of marker sequences, most notably of the small-subunit (SSU ribosomal RNA gene, and an increased understanding of microbial phylogeny, diversity and community composition patterns. However, to utilize these large datasets together with new sequencing technologies, a reliable and flexible system for taxonomic classification is critical. We developed CREST (Classification Resources for Environmental Sequence Tags, a set of resources and tools for generating and utilizing custom taxonomies and reference datasets for classification of environmental sequences. CREST uses an alignment-based classification method with the lowest common ancestor algorithm. It also uses explicit rank similarity criteria to reduce false positives and identify novel taxa. We implemented this method in a web server, a command line tool and the graphical user interfaced program MEGAN. Further, we provide the SSU rRNA reference database and taxonomy SilvaMod, derived from the publicly available SILVA SSURef, for classification of sequences from bacteria, archaea and eukaryotes. Using cross-validation and environmental datasets, we compared the performance of CREST and SilvaMod to the RDP Classifier. We also utilized Greengenes as a reference database, both with CREST and the RDP Classifier. These analyses indicate that CREST performs better than alignment-free methods with higher recall rate (sensitivity as well as precision, and with the ability to accurately identify most sequences from novel taxa. Classification using SilvaMod performed better than with Greengenes, particularly when applied to environmental sequences. CREST is freely available under a GNU General Public License (v3 from http://apps.cbu.uib.no/crest and http://lcaclassifier.googlecode.com.

  18. trieFinder: an efficient program for annotating Digital Gene Expression (DGE) tags.

    Science.gov (United States)

    Renaud, Gabriel; LaFave, Matthew C; Liang, Jin; Wolfsberg, Tyra G; Burgess, Shawn M

    2014-10-13

    Quantification of a transcriptional profile is a useful way to evaluate the activity of a cell at a given point in time. Although RNA-Seq has revolutionized transcriptional profiling, the costs of RNA-Seq are still significantly higher than microarrays, and often the depth of data delivered from RNA-Seq is in excess of what is needed for simple transcript quantification. Digital Gene Expression (DGE) is a cost-effective, sequence-based approach for simple transcript quantification: by sequencing one read per molecule of RNA, this technique can be used to efficiently count transcripts while obviating the need for transcript-length normalization and reducing the total numbers of reads necessary for accurate quantification. Here, we present trieFinder, a program specifically designed to rapidly map, parse, and annotate DGE tags of various lengths against cDNA and/or genomic sequence databases. The trieFinder algorithm maps DGE tags in a two-step process. First, it scans FASTA files of RefSeq, UniGene, and genomic DNA sequences to create a database of all tags that can be derived from a predefined restriction site. Next, it compares the experimental DGE tags to this tag database, taking advantage of the fact that the tags are stored as a prefix tree, or "trie", which allows for linear-time searches for exact matches. DGE tags with mismatches are analyzed by recursive calls in the data structure. We find that, in terms of alignment speed, the mapping functionality of trieFinder compares favorably with Bowtie. trieFinder can quickly provide the user an annotation of the DGE tags from three sources simultaneously, simplifying transcript quantification and novel transcript detection, delivering the data in a simple parsed format, obviating the need to post-process the alignment results. trieFinder is available at http://research.nhgri.nih.gov/software/trieFinder/.

  19. Isolation, sequence identification and tissue expression profile of a ...

    African Journals Online (AJOL)

    The complete expressed sequence tag (CDS) sequence of Banna mini-pig inbred line (BMI) ribokinase gene (RBKS) was amplified using the reverse transcription-polymerase chain reaction (RT-PCR) based on the conserved sequence information of the cattle or other mammals and known highly homologous swine ESTs.

  20. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    Science.gov (United States)

    Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

    2001-01-01

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  1. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

    Science.gov (United States)

    Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

    2001-10-09

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

  2. Mining of biomarker genes from expressed sequence tags and differential display reverse transcriptase-polymerase chain reaction in the self-fertilizing fish, Kryptolebias marmoratus and their expression patterns in response to exposure to an endocrine-disrupting alkylphenol, bisphenol A.

    Science.gov (United States)

    Lee, Young-Mi; Rhee, Jae-Sung; Hwang, Dae-Sik; Kim, Il-Chan; Raisuddin, Sheikh; Lee, Jae-Seong

    2007-06-30

    Expressed sequence tags (ESTs) and differentially expressed cDNAs from the self-fertilizing fish, Kryptolebias marmoratus were mined to develop alternative biomarkers for endocrine-disrupting chemicals (EDCs). 1,577 K. marmoratus cDNA clones were randomly sequenced from the 5'-end. These clones corresponded to 1,518 and 1,519 genes in medaka dbEST and zebrafish dbEST, respectively. Of the matched genes, 197 and 115 genes obtained Unigene IDs in medaka dbEST and zebrafish dbEST, respectively. Many of the annotated genes are potential biomarkers for environmental stresses. In a differential display reverse transcriptase-polymerase chain reaction (DD RT-PCR) study, 56 differential expressed genes were obtained from fish liver exposed to bisphenol A. Of these, 16 genes were identified after BLAST search to GenBank, and the annotated genes were mainly involved in catalytic activity and binding. The expression patterns of these 16 genes were validated by real-time RT-PCR of liver tissue from fish exposed to bisphenol A. Our findings suggest that expression of these 16 genes is modulated by endocrine disrupting chemicals, and therefore that they are potential biomarkers for environmental stress including EDCs exposure.

  3. A new RF tagging pulse based on the Frank poly-phase perfect sequence

    DEFF Research Database (Denmark)

    Laustsen, Christoffer; Greferath, Marcus; Ringgaard, Steffen

    2014-01-01

    Radio frequency (RF) spectrally selective multiband pulses or tagging pulses, are applicable in a broad range of magnetic resonance methods. We demonstrate through simulations and experiments a new phase-modulation-only RF pulse for RF tagging based on the Frank poly-phase perfect sequence...

  4. Generation and analysis of a large-scale expressed sequence Tag database from a full-length enriched cDNA library of developing leaves of Gossypium hirsutum L.

    Directory of Open Access Journals (Sweden)

    Min Lin

    Full Text Available BACKGROUND: Cotton (Gossypium hirsutum L. is one of the world's most economically-important crops. However, its entire genome has not been sequenced, and limited resources are available in GenBank for understanding the molecular mechanisms underlying leaf development and senescence. METHODOLOGY/PRINCIPAL FINDINGS: In this study, 9,874 high-quality ESTs were generated from a normalized, full-length cDNA library derived from pooled RNA isolated from throughout leaf development during the plant blooming stage. After clustering and assembly of these ESTs, 5,191 unique sequences, representative 1,652 contigs and 3,539 singletons, were obtained. The average unique sequence length was 682 bp. Annotation of these unique sequences revealed that 84.4% showed significant homology to sequences in the NCBI non-redundant protein database, and 57.3% had significant hits to known proteins in the Swiss-Prot database. Comparative analysis indicated that our library added 2,400 ESTs and 991 unique sequences to those known for cotton. The unigenes were functionally characterized by gene ontology annotation. We identified 1,339 and 200 unigenes as potential leaf senescence-related genes and transcription factors, respectively. Moreover, nine genes related to leaf senescence and eleven MYB transcription factors were randomly selected for quantitative real-time PCR (qRT-PCR, which revealed that these genes were regulated differentially during senescence. The qRT-PCR for three GhYLSs revealed that these genes express express preferentially in senescent leaves. CONCLUSIONS/SIGNIFICANCE: These EST resources will provide valuable sequence information for gene expression profiling analyses and functional genomics studies to elucidate their roles, as well as for studying the mechanisms of leaf development and senescence in cotton and discovering candidate genes related to important agronomic traits of cotton. These data will also facilitate future whole-genome sequence

  5. The Biotechnological Applications of Recombinant Single-Domain Antibodies are Optimized by the C-Terminal Fusion to the EPEA Sequence (C Tag

    Directory of Open Access Journals (Sweden)

    Selma Djender

    2014-04-01

    Full Text Available We designed a vector for the bacterial expression of recombinant antibodies fused to a double tag composed of 6xHis and the EPEA amino acid sequence. EPEA sequence (C tag is tightly bound by a commercial antibody when expressed at the C-term end of a polypeptide. The antigen is released in the presence of 2 M MgCl2. Consequently, constructs fused to the 6xHis-C tags can be purified by two successive and orthogonal affinity steps. Single-domain antibodies were produced either in the periplasmic or in the cytoplasmic space of E. coli. Surprisingly, the first affinity purification step performed using the EPEA-binding resin already yielded homogeneous proteins. The presence of the C tag did not interfere with the binding activity of the antibodies, as assessed by FACS and SPR analyses, and the C tag was extremely effective for immunoprecipitating HER2 receptor. Finally, the Alexa488-coupled anti-C tag allowed for simplification of FACS and IF analyses. These results show that a tag of minimal dimensions can be effectively used to improve the applicability of recombinant antibodies as reagents. In our hands, C tag was superior to His-tag in affinity purification and pull-down experiments, and practical in any other standard immune technique.

  6. Organization and differential expression of the GACA/GATA tagged somatic and spermatozoal transcriptomes in Buffalo Bubalus bubalis

    Directory of Open Access Journals (Sweden)

    Srivastava Jyoti

    2008-03-01

    Full Text Available Abstract Background Simple sequence repeats (SSRs of GACA/GATA have been implicated with differentiation of sex-chromosomes and speciation. However, the organization of these repeats within genomes and transcriptomes, even in the best characterized organisms including human, remains unclear. The main objective of this study was to explore the buffalo transcriptome for its association with GACA/GATA repeats, and study the structural organization and differential expression of the GACA/GATA repeat tagged transcripts. Moreover, the distribution of GACA and GATA repeats in the prokaryotic and eukaryotic genomes was studied to highlight their significance in genome evolution. Results We explored several genomes and transcriptomes, and observed total absence of these repeats in the prokaryotes, with their gradual accumulation in higher eukaryotes. Further, employing novel microsatellite associated sequence amplification (MASA approach using varying length oligos based on GACA and GATA repeats; we identified and characterized 44 types of known and novel mRNA transcripts tagged with these repeats from different somatic tissues, gonads and spermatozoa of water buffalo Bubalus bubalis. GACA was found to be associated with higher number of transcripts compared to that with GATA. Exclusive presence of several GACA-tagged transcripts in a tissue or spermatozoa, and absence of the GATA-tagged ones in lung/heart highlights their tissue-specific significance. Of all the GACA/GATA tagged transcripts, ~30% demonstrated inter-tissue and/or tissue-spermatozoal sequence polymorphisms. Significantly, ~60% of the GACA-tagged and all the GATA-tagged transcripts showed highest or unique expression in the testis and/or spermatozoa. Moreover, ~75% GACA-tagged and all the GATA-tagged transcripts were found to be conserved across the species. Conclusion Present study is a pioneer attempt exploring GACA/GATA tagged transcriptome in any mammalian species highlighting their

  7. TagDust2: a generic method to extract reads from sequencing data.

    Science.gov (United States)

    Lassmann, Timo

    2015-01-28

    Arguably the most basic step in the analysis of next generation sequencing data (NGS) involves the extraction of mappable reads from the raw reads produced by sequencing instruments. The presence of barcodes, adaptors and artifacts subject to sequencing errors makes this step non-trivial. Here I present TagDust2, a generic approach utilizing a library of hidden Markov models (HMM) to accurately extract reads from a wide array of possible read architectures. TagDust2 extracts more reads of higher quality compared to other approaches. Processing of multiplexed single, paired end and libraries containing unique molecular identifiers is fully supported. Two additional post processing steps are included to exclude known contaminants and filter out low complexity sequences. Finally, TagDust2 can automatically detect the library type of sequenced data from a predefined selection. Taken together TagDust2 is a feature rich, flexible and adaptive solution to go from raw to mappable NGS reads in a single step. The ability to recognize and record the contents of raw reads will help to automate and demystify the initial, and often poorly documented, steps in NGS data analysis pipelines. TagDust2 is freely available at: http://tagdust.sourceforge.net .

  8. Versatile Gene-Specific Sequence Tags for Arabidopsis Functional Genomics: Transcript Profiling and Reverse Genetics Applications

    Science.gov (United States)

    Hilson, Pierre; Allemeersch, Joke; Altmann, Thomas; Aubourg, Sébastien; Avon, Alexandra; Beynon, Jim; Bhalerao, Rishikesh P.; Bitton, Frédérique; Caboche, Michel; Cannoot, Bernard; Chardakov, Vasil; Cognet-Holliger, Cécile; Colot, Vincent; Crowe, Mark; Darimont, Caroline; Durinck, Steffen; Eickhoff, Holger; de Longevialle, Andéol Falcon; Farmer, Edward E.; Grant, Murray; Kuiper, Martin T.R.; Lehrach, Hans; Léon, Céline; Leyva, Antonio; Lundeberg, Joakim; Lurin, Claire; Moreau, Yves; Nietfeld, Wilfried; Paz-Ares, Javier; Reymond, Philippe; Rouzé, Pierre; Sandberg, Goran; Segura, Maria Dolores; Serizet, Carine; Tabrett, Alexandra; Taconnat, Ludivine; Thareau, Vincent; Van Hummelen, Paul; Vercruysse, Steven; Vuylsteke, Marnik; Weingartner, Magdalena; Weisbeek, Peter J.; Wirta, Valtteri; Wittink, Floyd R.A.; Zabeau, Marc; Small, Ian

    2004-01-01

    Microarray transcript profiling and RNA interference are two new technologies crucial for large-scale gene function studies in multicellular eukaryotes. Both rely on sequence-specific hybridization between complementary nucleic acid strands, inciting us to create a collection of gene-specific sequence tags (GSTs) representing at least 21,500 Arabidopsis genes and which are compatible with both approaches. The GSTs were carefully selected to ensure that each of them shared no significant similarity with any other region in the Arabidopsis genome. They were synthesized by PCR amplification from genomic DNA. Spotted microarrays fabricated from the GSTs show good dynamic range, specificity, and sensitivity in transcript profiling experiments. The GSTs have also been transferred to bacterial plasmid vectors via recombinational cloning protocols. These cloned GSTs constitute the ideal starting point for a variety of functional approaches, including reverse genetics. We have subcloned GSTs on a large scale into vectors designed for gene silencing in plant cells. We show that in planta expression of GST hairpin RNA results in the expected phenotypes in silenced Arabidopsis lines. These versatile GST resources provide novel and powerful tools for functional genomics. PMID:15489341

  9. Intraclade heterogeneity in nitrogen utilization by marine prokaryotes revealed using stable isotope probing coupled with tag sequencing (Tag-SIP

    Directory of Open Access Journals (Sweden)

    Michael Morando

    2016-12-01

    Full Text Available Nitrogen can greatly influence the structure and productivity of microbial communities through its relative availability and form. However, roles of specific organisms in the uptake of different nitrogen species remain poorly characterized. Most studies seeking to identify agents of assimilation have been correlative, indirectly linking activity measurements (e.g., nitrate uptake with the presence or absence of biological markers, particularly functional genes and their transcripts. Evidence is accumulating of previously underappreciated functional diversity in major microbial subpopulations, which may confer physiological advantages under certain environmental conditions leading to ecotype divergence. This microdiversity further complicates our view of genetic variation in environmental samples requiring the development of more targeted approaches. Here, next-generation tag sequencing was successfully coupled with stable isotope probing (Tag-SIP to assess the ability of individual phylotypes to assimilate a particular N source. Our results provide the first direct evidence of nitrate utilization by organisms thought to lack the genes required for this process including the heterotrophic clades SAR11 and the Archaeal Marine Group II (MG-II. We also provide new direct evidence of in situ nitrate utilization by the cyanobacterium Prochlorococcus in support of recent findings. Furthermore, these results revealed widespread functional heterogeneity, i.e. different levels of N assimilation within clades, likely reflecting niche partitioning by ecotypes. The addition of nitrate utilization to ecosystem and ecosystem models by these globally dominant clades will likely improve the mechanistic accuracy of these models.

  10. Expression and purification of recombinant proteins in Escherichia coli tagged with the metal-binding protein CusF.

    Science.gov (United States)

    Cantu-Bustos, J Enrique; Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Galbraith, David W; McEvoy, Megan M; Zarate, Xristo

    2016-05-01

    Production of recombinant proteins in Escherichia coli has been improved considerably through the use of fusion proteins, because they increase protein solubility and facilitate purification via affinity chromatography. In this article, we propose the use of CusF as a new fusion partner for expression and purification of recombinant proteins in E. coli. Using a cell-free protein expression system, based on the E. coli S30 extract, Green Fluorescent Protein (GFP) was expressed with a series of different N-terminal tags, immobilized on self-assembled protein microarrays, and its fluorescence quantified. GFP tagged with CusF showed the highest fluorescence intensity, and this was greater than the intensities from corresponding GFP constructs that contained MBP or GST tags. Analysis of protein production in vivo showed that CusF produces large amounts of soluble protein with low levels of inclusion bodies. Furthermore, fusion proteins can be exported to the cellular periplasm, if CusF contains the signal sequence. Taking advantage of its ability to bind copper ions, recombinant proteins can be purified with readily available IMAC resins charged with this metal ion, producing pure proteins after purification and tag removal. We therefore recommend the use of CusF as a viable alternative to MBP or GST as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2016 Elsevier Inc. All rights reserved.

  11. MR colonography with fecal tagging: comparison between 2D turbo FLASH and 3D FLASH sequences

    International Nuclear Information System (INIS)

    Papanikolaou, Nickolas; Grammatikakis, John; Maris, Thomas; Prassopoulos, Panos; Gourtsoyiannis, Nicholas; Lauenstein, Thomas

    2003-01-01

    The objective of this study was to compare inversion recovery turbo 2D fast low-angle shot (FLASH) and 3D FLASH sequences for fecal-tagged MR colonography studies. Fifteen consecutive patients with indications for colonoscopy underwent MR colonography with fecal tagging. An inversion recovery turbo-FLASH sequence was applied and compared in terms of artifacts presence, efficiency for masking residual stool, and colonic wall conspicuity with a fat-saturated 3D FLASH sequence. Both sequences were acquired following administration of paramagnetic contrast agent. Contrast-to-noise ratio and relative contrast between colonic wall and lumen were calculated and compared for both sequences. Turbo 2D FLASH provided fewer artifacts, higher efficiency for masking the residual stool, and colonic wall conspicuity equivalent to 3D FLASH. An inversion time of 10 ms provided homogeneously low signal intensity of the colonic lumen. Contrast to noise between colonic wall and lumen was significantly higher in the 3D FLASH images, whereas differences in relative contrast were not statistically significant. An optimized inversion-recovery 2D turbo-FLASH sequence provides better fecal tagging results and should be added to the 3D FLASH sequence when designing dark-lumen MR colonography examination protocols. (orig.)

  12. Efficient protein production method for NMR using soluble protein tags with cold shock expression vector

    International Nuclear Information System (INIS)

    Hayashi, Kokoro; Kojima, Chojiro

    2010-01-01

    The E. coli protein expression system is one of the most useful methods employed for NMR sample preparation. However, the production of some recombinant proteins in E. coli is often hampered by difficulties such as low expression level and low solubility. To address these problems, a modified cold-shock expression system containing a glutathione S-transferase (GST) tag, the pCold-GST system, was investigated. The pCold-GST system successfully expressed 9 out of 10 proteins that otherwise could not be expressed using a conventional E. coli expression system. Here, we applied the pCold-GST system to 84 proteins and 78 proteins were successfully expressed in the soluble fraction. Three other cold-shock expression systems containing a maltose binding protein tag (pCold-MBP), protein G B1 domain tag (pCold-GB1) or thioredoxin tag (pCold-Trx) were also developed to improve the yield. Additionally, we show that a C-terminal proline tag, which is invisible in 1 H- 15 N HSQC spectra, inhibits protein degradation and increases the final yield of unstable proteins. The purified proteins were amenable to NMR analyses. These data suggest that pCold expression systems combined with soluble protein tags can be utilized to improve the expression and purification of various proteins for NMR analysis.

  13. Efficient protein production method for NMR using soluble protein tags with cold shock expression vector

    Energy Technology Data Exchange (ETDEWEB)

    Hayashi, Kokoro [Fujifilm Corporation, Analysis Technology Center (Japan); Kojima, Chojiro, E-mail: kojima@protein.osaka-u.ac.j [Nara Institute of Science and Technology (NAIST), Graduate School of Biological Sciences (Japan)

    2010-11-15

    The E. coli protein expression system is one of the most useful methods employed for NMR sample preparation. However, the production of some recombinant proteins in E. coli is often hampered by difficulties such as low expression level and low solubility. To address these problems, a modified cold-shock expression system containing a glutathione S-transferase (GST) tag, the pCold-GST system, was investigated. The pCold-GST system successfully expressed 9 out of 10 proteins that otherwise could not be expressed using a conventional E. coli expression system. Here, we applied the pCold-GST system to 84 proteins and 78 proteins were successfully expressed in the soluble fraction. Three other cold-shock expression systems containing a maltose binding protein tag (pCold-MBP), protein G B1 domain tag (pCold-GB1) or thioredoxin tag (pCold-Trx) were also developed to improve the yield. Additionally, we show that a C-terminal proline tag, which is invisible in {sup 1}H-{sup 15}N HSQC spectra, inhibits protein degradation and increases the final yield of unstable proteins. The purified proteins were amenable to NMR analyses. These data suggest that pCold expression systems combined with soluble protein tags can be utilized to improve the expression and purification of various proteins for NMR analysis.

  14. Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

    Science.gov (United States)

    Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

    2014-10-01

    Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.

  15. Expression and purification of the non-tagged LipL32 of pathogenic Leptospira

    Directory of Open Access Journals (Sweden)

    P. Hauk

    2011-04-01

    Full Text Available Leptospirosis is a reemerging infectious disease and the most disseminated zoonosis worldwide. A leptospiral surface protein, LipL32, only occurs in pathogenic Leptospira, and is the most abundant protein on the bacterial surface, being described as an important factor in host immunogenic response and also in bacterial infection. We describe here an alternative and simple purification protocol for non-tagged recombinant LipL32. The recombinant LipL32(21-272 was expressed in Escherichia coli without His-tag or any other tag used to facilitate recombinant protein purification. The recombinant protein was expressed in the soluble form, and the purification was based on ion exchange (anionic and cationic and hydrophobic interactions. The final purification yielded 3 mg soluble LipL32(21-272 per liter of the induced culture. Antiserum produced against the recombinant protein was effective to detect native LipL32 from cell extracts of several Leptospira serovars. The purified recombinant LipL32(21-272 produced by this protocol can be used for structural, biochemical and functional studies and avoids the risk of possible interactions and interferences of the tags commonly used as well as the time consuming and almost always inefficient methods to cleave these tags when a tag-free LipL32 is needed. Non-tagged LipL32 may represent an alternative antigen for biochemical studies, for serodiagnosis and for the development of a vaccine against leptospirosis.

  16. Expression and purification of the non-tagged LipL32 of pathogenic Leptospira.

    Science.gov (United States)

    Hauk, P; Carvalho, E; Ho, P L

    2011-04-01

    Leptospirosis is a reemerging infectious disease and the most disseminated zoonosis worldwide. A leptospiral surface protein, LipL32, only occurs in pathogenic Leptospira, and is the most abundant protein on the bacterial surface, being described as an important factor in host immunogenic response and also in bacterial infection. We describe here an alternative and simple purification protocol for non-tagged recombinant LipL32. The recombinant LipL32(21-272) was expressed in Escherichia coli without His-tag or any other tag used to facilitate recombinant protein purification. The recombinant protein was expressed in the soluble form, and the purification was based on ion exchange (anionic and cationic) and hydrophobic interactions. The final purification yielded 3 mg soluble LipL32(21-272) per liter of the induced culture. Antiserum produced against the recombinant protein was effective to detect native LipL32 from cell extracts of several Leptospira serovars. The purified recombinant LipL32(21-272) produced by this protocol can be used for structural, biochemical and functional studies and avoids the risk of possible interactions and interferences of the tags commonly used as well as the time consuming and almost always inefficient methods to cleave these tags when a tag-free LipL32 is needed. Non-tagged LipL32 may represent an alternative antigen for biochemical studies, for serodiagnosis and for the development of a vaccine against leptospirosis.

  17. An efficient tag derived from the common epitope of tospoviral NSs proteins for monitoring recombinant proteins expressed in both bacterial and plant systems.

    Science.gov (United States)

    Cheng, Hao-Wen; Chen, Kuan-Chun; Raja, Joseph A J; Li, Jian-Xian; Yeh, Shyi-Dong

    2013-04-15

    NSscon (23 aa), a common epitope in the gene silencing suppressor NSs proteins of the members of the Watermelon silver mottle virus (WSMoV) serogroup, was previously identified. In this investigation, we expressed different green fluorescent protein (GFP)-fused deletions of NSscon in bacteria and reacted with NSscon monoclonal antibody (MAb). Our results indicated that the core 9 amino acids, "(109)KFTMHNQIF(117)", denoted as "nss", retain the reactivity of NSscon. In bacterial pET system, four different recombinant proteins labeled with nss, either at N- or C-extremes, were readily detectable without position effects, with sensitivity superior to that for the polyhistidine-tag. When the nss-tagged Zucchini yellow mosaic virus (ZYMV) helper component-protease (HC-Pro) and WSMoV nucleocapsid protein were transiently expressed by agroinfiltration in tobacco, they were readily detectable and the tag's possible efficacy for gene silencing suppression was not noticed. Co-immunoprecipitation of nss-tagged and non-tagged proteins expressed from bacteria confirmed the interaction of potyviral HC-Pro and coat protein. Thus, we conclude that this novel nss sequence is highly valuable for tagging recombinant proteins in both bacterial and plant expression systems. Copyright © 2013 Elsevier B.V. All rights reserved.

  18. Myocardial tagging by Cardiovascular Magnetic Resonance: evolution of techniques--pulse sequences, analysis algorithms, and applications

    Directory of Open Access Journals (Sweden)

    Ibrahim El-Sayed H

    2011-07-01

    Full Text Available Abstract Cardiovascular magnetic resonance (CMR tagging has been established as an essential technique for measuring regional myocardial function. It allows quantification of local intramyocardial motion measures, e.g. strain and strain rate. The invention of CMR tagging came in the late eighties, where the technique allowed for the first time for visualizing transmural myocardial movement without having to implant physical markers. This new idea opened the door for a series of developments and improvements that continue up to the present time. Different tagging techniques are currently available that are more extensive, improved, and sophisticated than they were twenty years ago. Each of these techniques has different versions for improved resolution, signal-to-noise ratio (SNR, scan time, anatomical coverage, three-dimensional capability, and image quality. The tagging techniques covered in this article can be broadly divided into two main categories: 1 Basic techniques, which include magnetization saturation, spatial modulation of magnetization (SPAMM, delay alternating with nutations for tailored excitation (DANTE, and complementary SPAMM (CSPAMM; and 2 Advanced techniques, which include harmonic phase (HARP, displacement encoding with stimulated echoes (DENSE, and strain encoding (SENC. Although most of these techniques were developed by separate groups and evolved from different backgrounds, they are in fact closely related to each other, and they can be interpreted from more than one perspective. Some of these techniques even followed parallel paths of developments, as illustrated in the article. As each technique has its own advantages, some efforts have been made to combine different techniques together for improved image quality or composite information acquisition. In this review, different developments in pulse sequences and related image processing techniques are described along with the necessities that led to their invention

  19. Mining microsatellite markers from public expressed sequence tag ...

    Indian Academy of Sciences (India)

    2Shangqiu Vocational and Technical College, Shangqiu 476100, People's Republic of China .... Pomegranate EST data retrieval, assembly and SSR primer .... Single-letter abbreviation for the district of collection of the accessions: G, Gansu ...

  20. Characterization of Expressed Sequence Tags (ESTs) from Stylophora pistillata

    KAUST Repository

    Zoccola, Didier

    2011-04-09

    Coral reefs are the most productive marine ecosystems with highest species diversity. During the last decades, reefs have been facing multiples global and anthropogenic stressors leading to bleaching and death of entire reefs.

  1. Analysis of expressed sequence tags derived from inflorescence ...

    African Journals Online (AJOL)

    PRECIOUS

    2009-11-02

    Nov 2, 2009 ... cellular component and molecular function. ... early terminal flowering causes forking of the main axis ... interaction of some important genes involved in the .... different protein signature recognition methods based on the.

  2. Characterization of 108 novel expressed sequence tag-derived ...

    Indian Academy of Sciences (India)

    Shuangshuang Teng

    tribution to the ongoing development of conservation and management programmes. ... synonymous, or from untranslated regions (UTRs). Results and discussion ...... nology Innovation Project of Wenzhou City (N20120017). References.

  3. Expressed Peptide Tags: An additional layer of data for genome annotation

    Energy Technology Data Exchange (ETDEWEB)

    Savidor, Alon [ORNL; Donahoo, Ryan S [ORNL; Hurtado-Gonzales, Oscar [University of Tennessee, Knoxville (UTK); Verberkmoes, Nathan C [ORNL; Shah, Manesh B [ORNL; Lamour, Kurt H [ORNL; McDonald, W Hayes [ORNL

    2006-01-01

    While genome sequencing is becoming ever more routine, genome annotation remains a challenging process. Identification of the coding sequences within the genomic milieu presents a tremendous challenge, especially for eukaryotes with their complex gene architectures. Here we present a method to assist the annotation process through the use of proteomic data and bioinformatics. Mass spectra of digested protein preparations of the organism of interest were acquired and searched against a protein database created by a six frame translation of the genome. The identified peptides were mapped back to the genome, compared to the current annotation, and then categorized as supporting or extending the current genome annotation. We named the classified peptides Expressed Peptide Tags (EPTs). The well annotated bacterium Rhodopseudomonas palustris was used as a control for the method and showed high degree of correlation between EPT mapping and the current annotation, with 86% of the EPTs confirming existing gene calls and less than 1% of the EPTs expanding on the current annotation. The eukaryotic plant pathogens Phytophthora ramorum and Phytophthora sojae, whose genomes have been recently sequenced and are much less well annotated, were also subjected to this method. A series of algorithmic steps were taken to increase the confidence of EPT identification for these organisms, including generation of smaller sub-databases to be searched against, and definition of EPT criteria that accommodates the more complex eukaryotic gene architecture. As expected, the analysis of the Phytophthora species showed less correlation between EPT mapping and their current annotation. While ~77% of Phytophthora EPTs supported the current annotation, a portion of them (7.2% and 12.6% for P. ramorum and P. sojae, respectively) suggested modification to current gene calls or identified novel genes that were missed by the current genome annotation of these organisms.

  4. Poly(A)-tag deep sequencing data processing to extract poly(A) sites.

    Science.gov (United States)

    Wu, Xiaohui; Ji, Guoli; Li, Qingshun Quinn

    2015-01-01

    Polyadenylation [poly(A)] is an essential posttranscriptional processing step in the maturation of eukaryotic mRNA. The advent of next-generation sequencing (NGS) technology has offered feasible means to generate large-scale data and new opportunities for intensive study of polyadenylation, particularly deep sequencing of the transcriptome targeting the junction of 3'-UTR and the poly(A) tail of the transcript. To take advantage of this unprecedented amount of data, we present an automated workflow to identify polyadenylation sites by integrating NGS data cleaning, processing, mapping, normalizing, and clustering. In this pipeline, a series of Perl scripts are seamlessly integrated to iteratively map the single- or paired-end sequences to the reference genome. After mapping, the poly(A) tags (PATs) at the same genome coordinate are grouped into one cleavage site, and the internal priming artifacts removed. Then the ambiguous region is introduced to parse the genome annotation for cleavage site clustering. Finally, cleavage sites within a close range of 24 nucleotides and from different samples can be clustered into poly(A) clusters. This procedure could be used to identify thousands of reliable poly(A) clusters from millions of NGS sequences in different tissues or treatments.

  5. cis sequence effects on gene expression

    Directory of Open Access Journals (Sweden)

    Jacobs Kevin

    2007-08-01

    Full Text Available Abstract Background Sequence and transcriptional variability within and between individuals are typically studied independently. The joint analysis of sequence and gene expression variation (genetical genomics provides insight into the role of linked sequence variation in the regulation of gene expression. We investigated the role of sequence variation in cis on gene expression (cis sequence effects in a group of genes commonly studied in cancer research in lymphoblastoid cell lines. We estimated the proportion of genes exhibiting cis sequence effects and the proportion of gene expression variation explained by cis sequence effects using three different analytical approaches, and compared our results to the literature. Results We generated gene expression profiling data at N = 697 candidate genes from N = 30 lymphoblastoid cell lines for this study and used available candidate gene resequencing data at N = 552 candidate genes to identify N = 30 candidate genes with sufficient variance in both datasets for the investigation of cis sequence effects. We used two additive models and the haplotype phylogeny scanning approach of Templeton (Tree Scanning to evaluate association between individual SNPs, all SNPs at a gene, and diplotypes, with log-transformed gene expression. SNPs and diplotypes at eight candidate genes exhibited statistically significant (p cis sequence effects in our study, respectively. Conclusion Based on analysis of our results and the extant literature, one in four genes exhibits significant cis sequence effects, and for these genes, about 30% of gene expression variation is accounted for by cis sequence variation. Despite diverse experimental approaches, the presence or absence of significant cis sequence effects is largely supported by previously published studies.

  6. Hepatitis C virus expressing reporter tagged NS5A protein

    DEFF Research Database (Denmark)

    2014-01-01

    Hepatitis C reporter viruses containing Core through NS2 of prototype isolates of all major HCV genotypes and the remaining genes of isolate JFH1, by insertion of reporter genes in domain III of HCV NS5A were developed. A deletion upstream of the inserted reporter gene sequence conferred favorable...... growth kinetics in Huh7.5 cells to these viruses. These reporter viruses can be used for high throughput analysis of drug and vaccine candidates as well as patient samples. JFH1-based intergenotypic recombinants with genotype specific homotypic 5'UTR, or heterotypic 5'UTR (either of genotype 1a (strain H...

  7. An expression tag toolbox for microbial production of membrane bound plant cytochromes P450

    DEFF Research Database (Denmark)

    Vazquez Albacete, Dario; Cavaleiro, Mafalda; Christensen, Ulla

    2017-01-01

    of the intermediate and the final product of the pathway. Finally, the effect of a robustly performing expression tag was explored with a library of 49 different P450s from medicinal plants and nearly half of these were improved in expression by more than 2-fold. The developed toolbox serves as platform to tune P450...... tag chimeras of the model plant P450 CYP79A1 in different Escherichia coli strains. Using a high-throughput screening platform based on C-terminal GFP fusions, we identify several highly expressing and robustly performing chimeric designs. Analysis of long-term cultures by flow cytometry showed...... homogeneous populations for some of the conditions. Three chimeric designs were chosen for a more complex combinatorial assembly of a multigene pathway consisting of two P450s and a redox partner. Cells expressing these recombinant enzymes catalysed the conversion of the substrate to highly different ratios...

  8. An evaluation of sequence tagged microsatellite site markers for genetic analysis within Citrus and related species.

    Science.gov (United States)

    Kijas, J M; Fowler, J C; Thomas, M R

    1995-04-01

    Microsatellites, also called sequence tagged microsatellite sites (STMSs), have become important markers for genome analysis but are currently little studied in plants. To assess the value of STMSs for analysis within the Citrus plant species, two example STMSs were isolated from an intergeneric cross between rangpur lime (Citrus x limonia Osbeck) and trifoliate orange (Poncirus trifoliata (L.) Raf.). Unique flanking primers were constructed for polymerase chain reaction amplification both within the test cross and across a broad range of citrus and related species. Both loci showed length variation between test cross parents with alleles segregating in a Mendelian fashion to progeny. Amplification across species showed the STMS flanking primers to be conserved in every genome tested. The traits of polymorphism, inheritance, and conservation across species mean that STMS markers are ideal for genome mapping within Citrus, which contains high levels of genetic variability.

  9. Expression and purification of recombinant proteins in Escherichia coli tagged with a small metal-binding protein from Nitrosomonas europaea.

    Science.gov (United States)

    Vargas-Cortez, Teresa; Morones-Ramirez, Jose Ruben; Balderas-Renteria, Isaias; Zarate, Xristo

    2016-02-01

    Escherichia coli is still the preferred organism for large-scale production of recombinant proteins. The use of fusion proteins has helped considerably in enhancing the solubility of heterologous proteins and their purification with affinity chromatography. Here, the use of a small metal-binding protein (SmbP) from Nitrosomonas europaea is described as a new fusion protein for protein expression and purification in E. coli. Fluorescent proteins tagged at the N-terminal with SmbP showed high levels of solubility, compared with those of maltose-binding protein and glutathione S-transferase, and low formation of inclusion bodies. Using commercially available IMAC resins charged with Ni(II), highly pure recombinant proteins were obtained after just one chromatography step. Proteins may be purified from the periplasm of E. coli if SmbP contains the signal sequence at the N-terminal. After removal of the SmbP tag from the protein of interest, high-yields are obtained since SmbP is a protein of just 9.9 kDa. The results here obtained suggest that SmbP is a good alternative as a fusion protein/affinity tag for the production of soluble recombinant proteins in E. coli. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Analyzing Plasmodium falciparum erythrocyte membrane protein 1 gene expression by a next generation sequencing based method

    DEFF Research Database (Denmark)

    Jespersen, Jakob S.; Petersen, Bent; Seguin-Orlando, Andaine

    2013-01-01

    at identifying PfEMP1 features associated with high virulence. Here we present the first effective method for sequence analysis of var genes expressed in field samples: a sequential PCR and next generation sequencing based technique applied on expressed var sequence tags and subsequently on long range PCR......, encoded by ~60 highly variable 'var' genes per haploid genome. PfEMP1 is exported to the surface of infected erythrocytes and is thought to be fundamental to immune evasion by adhesion to host and parasite factors. The highly variable nature has constituted a roadblock in var expression studies aimed...

  11. A re-assessment of gene-tag classification approaches for describing var gene expression patterns during human Plasmodium falciparum malaria parasite infections.

    Science.gov (United States)

    Githinji, George; Bull, Peter C

    2017-01-01

    PfEMP1 are variant parasite antigens that are inserted on the surface of Plasmodium falciparum infected erythrocytes (IE). Through interactions with various host molecules, PfEMP1 mediate IE sequestration in tissues and play a key role in the pathology of severe malaria. PfEMP1 is encoded by a diverse multi-gene family called var . Previous studies have shown that that expression of specific subsets of var genes are associated with low levels of host immunity and severe malaria. However, in most clinical studies to date, full-length var gene sequences were unavailable and various approaches have been used to make comparisons between var gene expression profiles in different parasite isolates using limited information. Several studies have relied on the classification of a 300 - 500 base-pair "DBLα tag" region in the DBLα domain located at the 5' end of most var genes. We assessed the relationship between various DBLα tag classification methods, and sequence features that are only fully assessable through full-length var gene sequences. We compared these different sequence features in full-length var gene from six fully sequenced laboratory isolates. These comparisons show that despite a long history of recombination,   DBLα sequence tag classification can provide functional information on important features of full-length var genes. Notably, a specific subset of DBLα tags previously defined as "group A-like" is associated with CIDRα1 domains proposed to bind to endothelial protein C receptor. This analysis helps to bring together different sources of data that have been used to assess var gene expression in clinical parasite isolates.

  12. Sequence-indexed mutations in maize using the UniformMu transposon-tagging population

    Directory of Open Access Journals (Sweden)

    Baier John

    2007-05-01

    Full Text Available Abstract Background Gene knockouts are a critical resource for functional genomics. In Arabidopsis, comprehensive knockout collections were generated by amplifying and sequencing genomic DNA flanking insertion mutants. These Flanking Sequence Tags (FSTs map each mutant to a specific locus within the genome. In maize, FSTs have been generated using DNA transposons. Transposable elements can generate unstable insertions that are difficult to analyze for simple knockout phenotypes. Transposons can also generate somatic insertions that fail to segregate in subsequent generations. Results Transposon insertion sites from 106 UniformMu FSTs were tested for inheritance by locus-specific PCR. We confirmed 89% of the FSTs to be germinal transposon insertions. We found no evidence for somatic insertions within the 11% of insertion sites that were not confirmed. Instead, this subset of insertion sites had errors in locus-specific primer design due to incomplete or low-quality genomic sequences. The locus-specific PCR assays identified a knockout of a 6-phosphogluconate dehydrogenase gene that co-segregates with a seed mutant phenotype. The mutant phenotype linked to this knockout generates novel hypotheses about the role for the plastid-localized oxidative pentose phosphate pathway during grain-fill. Conclusion We show that FSTs from the UniformMu population identify stable, germinal insertion sites in maize. Moreover, we show that these sequence-indexed mutations can be readily used for reverse genetic analysis. We conclude from these data that the current collection of 1,882 non-redundant insertion sites from UniformMu provide a genome-wide resource for reverse genetics.

  13. GenEST, a powerful bidirectional link between cDNA sequence data and gene expression profiles generated by cDNA-AFLP

    NARCIS (Netherlands)

    Qin Ling,; Prins, P.; Jones, J.T.; Popeijus, H.; Smant, G.; Bakker, J.; Helder, J.

    2001-01-01

    The release of vast quantities of DNA sequence data by large-scale genome and expressed sequence tag (EST) projects underlines the necessity for the development of efficient and inexpensive ways to link sequence databases with temporal and spatial expression profiles. Here we demonstrate the power

  14. Polarized expression of the GFP-tagged rat V(1a) vasopressin receptor.

    Science.gov (United States)

    Campos, D M; Reyes, C E; Sarmiento, J; Navarro, J; González, C B

    2001-11-30

    We investigated the targeting of the V(1a) receptor fused with the green fluorescence protein (V(1a)R-GFP) in polarized MDCK cells. Cells expressing V(1a)R-GFP displayed binding to vasopressin (AVP) and AVP-induced calcium responses, similar to cells expressing the wild-type V1a receptor. Interestingly, as with the wild-type V(1a)R, V(1a)R-GFP is preferentially distributed in the basolateral side of MDCK cells as monitored by confocal microscopy. Furthermore, AVP induced internalization of GFP-tagged receptors. Therefore, the GFP-tagged V(1a) receptor retains all the sorting signals of the wild-type receptor and offers an excellent system to elucidate the mechanisms of cell trafficking of V(1a) receptors.

  15. Population genomics of parallel adaptation in threespine stickleback using sequenced RAD tags.

    Directory of Open Access Journals (Sweden)

    Paul A Hohenlohe

    2010-02-01

    Full Text Available Next-generation sequencing technology provides novel opportunities for gathering genome-scale sequence data in natural populations, laying the empirical foundation for the evolving field of population genomics. Here we conducted a genome scan of nucleotide diversity and differentiation in natural populations of threespine stickleback (Gasterosteus aculeatus. We used Illumina-sequenced RAD tags to identify and type over 45,000 single nucleotide polymorphisms (SNPs in each of 100 individuals from two oceanic and three freshwater populations. Overall estimates of genetic diversity and differentiation among populations confirm the biogeographic hypothesis that large panmictic oceanic populations have repeatedly given rise to phenotypically divergent freshwater populations. Genomic regions exhibiting signatures of both balancing and divergent selection were remarkably consistent across multiple, independently derived populations, indicating that replicate parallel phenotypic evolution in stickleback may be occurring through extensive, parallel genetic evolution at a genome-wide scale. Some of these genomic regions co-localize with previously identified QTL for stickleback phenotypic variation identified using laboratory mapping crosses. In addition, we have identified several novel regions showing parallel differentiation across independent populations. Annotation of these regions revealed numerous genes that are candidates for stickleback phenotypic evolution and will form the basis of future genetic analyses in this and other organisms. This study represents the first high-density SNP-based genome scan of genetic diversity and differentiation for populations of threespine stickleback in the wild. These data illustrate the complementary nature of laboratory crosses and population genomic scans by confirming the adaptive significance of previously identified genomic regions, elucidating the particular evolutionary and demographic history of such

  16. Expression and purification of the non-tagged LipL32 of pathogenic Leptospira

    OpenAIRE

    Hauk, P.; Carvalho, E.; Ho, P.L.

    2011-01-01

    Leptospirosis is a reemerging infectious disease and the most disseminated zoonosis worldwide. A leptospiral surface protein, LipL32, only occurs in pathogenic Leptospira, and is the most abundant protein on the bacterial surface, being described as an important factor in host immunogenic response and also in bacterial infection. We describe here an alternative and simple purification protocol for non-tagged recombinant LipL32. The recombinant LipL32(21-272) was expressed in Escherichia coli ...

  17. An SSVEP-actuated brain computer interface using phase-tagged flickering sequences: a cursor system.

    Science.gov (United States)

    Lee, Po-Lei; Sie, Jyun-Jie; Liu, Yu-Ju; Wu, Chi-Hsun; Lee, Ming-Huan; Shu, Chih-Hung; Li, Po-Hung; Sun, Chia-Wei; Shyu, Kuo-Kai

    2010-07-01

    This study presents a new steady-state visual evoked potential (SSVEP)-based brain computer interface (BCI). SSVEPs, induced by phase-tagged flashes in eight light emitting diodes (LEDs), were used to control four cursor movements (up, right, down, and left) and four button functions (on, off, right-, and left-clicks) on a screen menu. EEG signals were measured by one EEG electrode placed at Oz position, referring to the international EEG 10-20 system. Since SSVEPs are time-locked and phase-locked to the onsets of SSVEP flashes, EEG signals were bandpass-filtered and segmented into epochs, and then averaged across a number of epochs to sharpen the recorded SSVEPs. Phase lags between the measured SSVEPs and a reference SSVEP were measured, and targets were recognized based on these phase lags. The current design used eight LEDs to flicker at 31.25 Hz with 45 degrees phase margin between any two adjacent SSVEP flickers. The SSVEP responses were filtered within 29.25-33.25 Hz and then averaged over 60 epochs. Owing to the utilization of high-frequency flickers, the induced SSVEPs were away from low-frequency noises, 60 Hz electricity noise, and eye movement artifacts. As a consequence, we achieved a simple architecture that did not require eye movement monitoring or other artifact detection and removal. The high-frequency design also achieved a flicker fusion effect for better visualization. Seven subjects were recruited in this study to sequentially input a command sequence, consisting of a sequence of eight cursor functions, repeated three times. The accuracy and information transfer rate (mean +/- SD) over the seven subjects were 93.14 +/- 5.73% and 28.29 +/- 12.19 bits/min, respectively. The proposed system can provide a reliable channel for severely disabled patients to communicate with external environments.

  18. Sorting of a HaloTag protein that has only a signal peptide sequence into exocrine secretory granules without protein aggregation.

    Science.gov (United States)

    Fujita-Yoshigaki, Junko; Matsuki-Fukushima, Miwako; Yokoyama, Megumi; Katsumata-Kato, Osamu

    2013-11-15

    The mechanism involved in the sorting and accumulation of secretory cargo proteins, such as amylase, into secretory granules of exocrine cells remains to be solved. To clarify that sorting mechanism, we expressed a reporter protein HaloTag fused with partial sequences of salivary amylase protein in primary cultured parotid acinar cells. We found that a HaloTag protein fused with only the signal peptide sequence (Met(1)-Ala(25)) of amylase, termed SS25H, colocalized well with endogenous amylase, which was confirmed by immunofluorescence microscopy. Percoll-density gradient centrifugation of secretory granule fractions shows that the distributions of amylase and SS25H were similar. These results suggest that SS25H is transported to secretory granules and is not discriminated from endogenous amylase by the machinery that functions to remove proteins other than granule cargo from immature granules. Another reporter protein, DsRed2, that has the same signal peptide sequence also colocalized with amylase, suggesting that the sorting to secretory granules is not dependent on a characteristic of the HaloTag protein. Whereas Blue Native PAGE demonstrates that endogenous amylase forms a high-molecular-weight complex, SS25H does not participate in the complex and does not form self-aggregates. Nevertheless, SS25H was released from cells by the addition of a β-adrenergic agonist, isoproterenol, which also induces amylase secretion. These results indicate that addition of the signal peptide sequence, which is necessary for the translocation in the endoplasmic reticulum, is sufficient for the transportation and storage of cargo proteins in secretory granules of exocrine cells.

  19. High-level expression of soluble recombinant proteins in Escherichia coli using an HE-maltotriose-binding protein fusion tag.

    Science.gov (United States)

    Han, Yingqian; Guo, Wanying; Su, Bingqian; Guo, Yujie; Wang, Jiang; Chu, Beibei; Yang, Guoyu

    2018-02-01

    Recombinant proteins are commonly expressed in prokaryotic expression systems for large-scale production. The use of genetically engineered affinity and solubility enhancing fusion proteins has increased greatly in recent years, and there now exists a considerable repertoire of these that can be used to enhance the expression, stability, solubility, folding, and purification of their fusion partner. Here, a modified histidine tag (HE) used as an affinity tag was employed together with a truncated maltotriose-binding protein (MBP; consisting of residues 59-433) from Pyrococcus furiosus as a solubility enhancing tag accompanying a tobacco etch virus protease-recognition site for protein expression and purification in Escherichia coli. Various proteins tagged at the N-terminus with HE-MBP(Pyr) were expressed in E. coli BL21(DE3) cells to determine expression and solubility relative to those tagged with His6-MBP or His6-MBP(Pyr). Furthermore, four HE-MBP(Pyr)-fused proteins were purified by immobilized metal affinity chromatography to assess the affinity of HE with immobilized Ni 2+ . Our results showed that HE-MBP(Pyr) represents an attractive fusion protein allowing high levels of soluble expression and purification of recombinant protein in E. coli. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Tryptophan tags and de novo designed complementary affinity ligands for the expression and purification of recombinant proteins.

    Science.gov (United States)

    Pina, Ana Sofia; Carvalho, Sara; Dias, Ana Margarida G C; Guilherme, Márcia; Pereira, Alice S; Caraça, Luciana T; Coroadinha, Ana Sofia; Lowe, Christopher R; Roque, A Cecília A

    2016-11-11

    A common strategy for the production and purification of recombinant proteins is to fuse a tag to the protein terminal residues and employ a "tag-specific" ligand for fusion protein capture and purification. In this work, we explored the effect of two tryptophan-based tags, NWNWNW and WFWFWF, on the expression and purification of Green Fluorescence Protein (GFP) used as a model fusion protein. The titers obtained with the expression of these fusion proteins in soluble form were 0.11mgml -1 and 0.48mgml -1 for WFWFWF and NWNWNW, respectively. A combinatorial library comprising 64 ligands based on the Ugi reaction was prepared and screened for binding GFP-tagged and non-tagged proteins. Complementary ligands A2C2 and A3C1 were selected for the effective capture of NWNWNW and WFWFWF tagged proteins, respectively, in soluble forms. These affinity pairs displayed 10 6 M -1 affinity constants and Qmax values of 19.11±2.60ugg -1 and 79.39ugg -1 for the systems WFWFWF AND NWNWNW, respectively. GFP fused to the WFWFWF affinity tag was also produced as inclusion bodies, and a refolding-on column strategy was explored using the ligand A4C8, selected from the combinatorial library of ligands but in presence of denaturant agents. Copyright © 2016 Elsevier B.V. All rights reserved.

  1. Distinct cis regulatory elements govern the expression of TAG1 in embryonic sensory ganglia and spinal cord.

    Directory of Open Access Journals (Sweden)

    Yoav Hadas

    Full Text Available Cell fate commitment of spinal progenitor neurons is initiated by long-range, midline-derived, morphogens that regulate an array of transcription factors that, in turn, act sequentially or in parallel to control neuronal differentiation. Included among these are transcription factors that regulate the expression of receptors for guidance cues, thereby determining axonal trajectories. The Ig/FNIII superfamily molecules TAG1/Axonin1/CNTN2 (TAG1 and Neurofascin (Nfasc are co-expressed in numerous neuronal cell types in the CNS and PNS - for example motor, DRG and interneurons - both promote neurite outgrowth and both are required for the architecture and function of nodes of Ranvier. The genes encoding TAG1 and Nfasc are adjacent in the genome, an arrangement which is evolutionarily conserved. To study the transcriptional network that governs TAG1 and Nfasc expression in spinal motor and commissural neurons, we set out to identify cis elements that regulate their expression. Two evolutionarily conserved DNA modules, one located between the Nfasc and TAG1 genes and the second directly 5' to the first exon and encompassing the first intron of TAG1, were identified that direct complementary expression to the CNS and PNS, respectively, of the embryonic hindbrain and spinal cord. Sequential deletions and point mutations of the CNS enhancer element revealed a 130bp element containing three conserved E-boxes required for motor neuron expression. In combination, these two elements appear to recapitulate a major part of the pattern of TAG1 expression in the embryonic nervous system.

  2. Construction of a plasmid coding for green fluorescent protein tagged cathepsin L and data on expression in colorectal carcinoma cells

    Directory of Open Access Journals (Sweden)

    Tripti Tamhane

    2015-12-01

    Full Text Available The endo-lysosomal cysteine cathepsin L has recently been shown to have moonlighting activities in that its unexpected nuclear localization in colorectal carcinoma cells is involved in cell cycle progression (Tamhane et al., 2015 [1]. Here, we show data on the construction and sequence of a plasmid coding for human cathepsin L tagged with an enhanced green fluorescent protein (phCL-EGFP in which the fluorescent protein is covalently attached to the C-terminus of the protease. The plasmid was used for transfection of HCT116 colorectal carcinoma cells, while data from non-transfected and pEGFP-N1-transfected cells is also shown. Immunoblotting data of lysates from non-transfected controls and HCT116 cells transfected with pEGFP-N1 and phCL-EGFP, showed stable expression of cathepsin L-enhanced green fluorescent protein chimeras, while endogenous cathepsin L protein amounts exceed those of hCL-EGFP chimeras. An effect of phCL-EGFP expression on proliferation and metabolic states of HCT116 cells at 24 h post-transfection was observed.

  3. Segmental isotope labeling of proteins for NMR structural study using a protein S tag for higher expression and solubility

    International Nuclear Information System (INIS)

    Kobayashi, Hiroshi; Swapna, G. V. T.; Wu, Kuen-Phon; Afinogenova, Yuliya; Conover, Kenith; Mao, Binchen; Montelione, Gaetano T.; Inouye, Masayori

    2012-01-01

    A common obstacle to NMR studies of proteins is sample preparation. In many cases, proteins targeted for NMR studies are poorly expressed and/or expressed in insoluble forms. Here, we describe a novel approach to overcome these problems. In the protein S tag-intein (PSTI) technology, two tandem 92-residue N-terminal domains of protein S (PrS 2 ) from Myxococcus xanthus is fused at the N-terminal end of a protein to enhance its expression and solubility. Using intein technology, the isotope-labeled PrS 2 -tag is replaced with non-isotope labeled PrS 2 -tag, silencing the NMR signals from PrS 2 -tag in isotope-filtered 1 H-detected NMR experiments. This method was applied to the E. coli ribosome binding factor A (RbfA), which aggregates and precipitates in the absence of a solubilization tag unless the C-terminal 25-residue segment is deleted (RbfAΔ25). Using the PrS 2 -tag, full-length well-behaved RbfA samples could be successfully prepared for NMR studies. PrS 2 (non-labeled)-tagged RbfA (isotope-labeled) was produced with the use of the intein approach. The well-resolved TROSY-HSQC spectrum of full-length PrS 2 -tagged RbfA superimposes with the TROSY-HSQC spectrum of RbfAΔ25, indicating that PrS 2 -tag does not affect the structure of the protein to which it is fused. Using a smaller PrS-tag, consisting of a single N-terminal domain of protein S, triple resonance experiments were performed, and most of the backbone 1 H, 15 N and 13 C resonance assignments for full-length E. coli RbfA were determined. Analysis of these chemical shift data with the Chemical Shift Index and heteronuclear 1 H– 15 N NOE measurements reveal the dynamic nature of the C-terminal segment of the full-length RbfA protein, which could not be inferred using the truncated RbfAΔ25 construct. CS-Rosetta calculations also demonstrate that the core structure of full-length RbfA is similar to that of the RbfAΔ25 construct.

  4. The generation of knock-in mice expressing fluorescently tagged galanin receptors 1 and 2

    Science.gov (United States)

    Kerr, Niall; Holmes, Fiona E.; Hobson, Sally-Ann; Vanderplank, Penny; Leard, Alan; Balthasar, Nina; Wynick, David

    2015-01-01

    The neuropeptide galanin has diverse roles in the central and peripheral nervous systems, by activating the G protein-coupled receptors Gal1, Gal2 and the less studied Gal3 (GalR1–3 gene products). There is a wealth of data on expression of Gal1–3 at the mRNA level, but not at the protein level due to the lack of specificity of currently available antibodies. Here we report the generation of knock-in mice expressing Gal1 or Gal2 receptor fluorescently tagged at the C-terminus with, respectively, mCherry or hrGFP (humanized Renilla green fluorescent protein). In dorsal root ganglia (DRG) neurons expressing the highest levels of Gal1-mCherry, localization to the somatic cell membrane was detected by live-cell fluorescence and immunohistochemistry, and that fluorescence decreased upon addition of galanin. In spinal cord, abundant Gal1-mCherry immunoreactive processes were detected in the superficial layers of the dorsal horn, and highly expressing intrinsic neurons of the lamina III/IV border showed both somatic cell membrane localization and outward transport of receptor from the cell body, detected as puncta within cell processes. In brain, high levels of Gal1-mCherry immunofluorescence were detected within thalamus, hypothalamus and amygdala, with a high density of nerve endings in the external zone of the median eminence, and regions with lesser immunoreactivity included the dorsal raphe nucleus. Gal2-hrGFP mRNA was detected in DRG, but live-cell fluorescence was at the limits of detection, drawing attention to both the much lower mRNA expression than to Gal1 in mice and the previously unrecognized potential for translational control by upstream open reading frames (uORFs). PMID:26292267

  5. Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

    Directory of Open Access Journals (Sweden)

    Maggi Giorgio P

    2008-06-01

    Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.

  6. Novel expressed sequences identified in a model of androgen independent prostate cancer

    Directory of Open Access Journals (Sweden)

    Jones Steven JM

    2007-01-01

    Full Text Available Abstract Background Prostate cancer is the most frequently diagnosed cancer in American men, and few effective treatment options are available to patients who develop hormone-refractory prostate cancer. The molecular changes that occur to allow prostate cells to proliferate in the absence of androgens are not fully understood. Results Subtractive hybridization experiments performed with samples from an in vivo model of hormonal progression identified 25 expressed sequences representing novel human transcripts. Intriguingly, these 25 sequences have small open-reading frames and are not highly conserved through evolution, suggesting many of these novel expressed sequences may be derived from untranslated regions of novel transcripts or from non-coding transcripts. Examination of a large metalibrary of human Serial Analysis of Gene Expression (SAGE tags demonstrated that only three of these novel sequences had been previously detected. RT-PCR experiments confirmed that the 6 sequences tested were expressed in specific human tissues, as well as in clinical samples of prostate cancer. Further RT-PCR experiments for five of these fragments indicated they originated from large untranslated regions of unannotated transcripts. Conclusion This study underlines the value of using complementary techniques in the annotation of the human genome. The tissue-specific expression of 4 of the 6 clones tested indicates the expression of these novel transcripts is tightly regulated, and future work will determine the possible role(s these novel transcripts may play in the progression of prostate cancer.

  7. Skin tags: A link between lesional mast cell count/tryptase expression and obesity and dyslipidemia

    Directory of Open Access Journals (Sweden)

    Samar Abdallah M Salem

    2013-01-01

    Full Text Available Background:The etiology of skin tags (STs is not fully understood. A relation to diabetes mellitus and obesity was suggested. Few studies of possible mast cells (MCs involvement were reported. Tyrptase is a mast cell mediator and a potent fibroblast growth factor. It may provide a molecular link between mast cell activation and fibrosis. Aims: The aim was to assess clinical and laboratory findings in patients with STs, and the possible link between obesity, dyslipidemia, and lesional MC count/tryptase expression. Materials and Methods: A total of 20 patients with STs were subjected to clinical examination, estimation of body mass index (BMI, fasting blood glucose (FBG, postprandial blood glucose (PPBG, serum cholesterol and triglycerides, abdominal ultrasound for fatty liver assessment, in addition to study of MCs through staining for MC tryptase in two skin biopsies; lesional and nonlesional (control. Results:All patients showed abnormally high BMI and hypertriglyceridemia, with abnormal sonographic pattern in 15 patients (75%. STs number positively correlated with the age of patients. STs showed significantly higher MC counts and tryptase expression, compared with control skin ( P < 0.001, with no correlation of the STs number or MC count with BMI, FBG, PPBG or serum cholesterol. Obese patients showed a significantly higher MC count than overweight and there was a positive correlation between MC count and serum triglycerides. Axilla and under breast STs showed a higher MC count compared with other sites. Conclusions:STs seem to be related to obesity and hypertriglyceridemia. MCs with their tryptase are possibly involved in pathogenesis of STs. MC count is related to the associated factors; obesity and serum triglycerides. MC tryptase expression is a reliable method for accurate tissue MC counting.

  8. Cardiac MR tagging: optimization of sequence parameters and comparison at 1.5 T and 3.0 T in a volunteer study

    International Nuclear Information System (INIS)

    Kramer, U.; Fenchel, M.; Klumpp, B.; Claussen, C.D.; Miller, S.; Deshpande, V.; Laub, G.; Finn, J.P.

    2006-01-01

    Purpose: The aim of this study was the optimization of a gradient echo (GRE) MR tagging sequence at 3.0 T in comparison to 1.5 T in order to obtain the best image contrast between the myocardium, tag lines and blood signal. Theoretically expected improvements of signal-to-noise (SNR) and contrast-to-noise ratios (CNR) were also calculated. Materials and methods: 14 healthy volunteers (8 male, 6 female; mean age 43.4±10.3 years) were scanned using a 3.0 T as well as a 1.5 T whole-body system. A GRE flash-2 D tagging sequence was evaluated (midventricular short axis view) by varying the flip angle (8-16 ), slice thickness (4-8 mm; fixed flip angle 1.5/3.0 T: 12 /8 , tag size 8 mm) and tag size (4-8 mm, fixed flip angle 1.5/3.0 T: 12 /8 , slice thickness 6 mm). The field of view, acquisition time and temporal resolution (45 ms) were kept constant. Qualitative and quantitative image analysis was performed by calculating the SNR, CNR tag as well as the relative contrast between the myocardium and tag lines (RCMT). Results: Based on individual comparison, the best imaging protocol was found at a slice thickness of 6 mm, tag size of 8 mm, optimized flip angle of 8 (3.0 T) and 12 (1.5 T), respectively. Compared to 1.5 T, a significantly higher overall image score was determined (mean±sd; 3.2±0.2 vs 2.7±0.4) and a strong correlation between the CNR tag and RCMT for flip angle α and the slice thickness was found. A higher field strength resulted in an 80% increase in the CNR tag compared to 1.5 T (mean 10.7/6.1). Furthermore, the SNR was improved by 35% (mean 20.6/15.3) and the RCMT by 35% (mean 0.47/0.35). Conclusion: Myocardial tagging at 3.0 T has shown superior image quality in comparison to 1.5 T due to a higher baseline SNR and an improved CNR as well as RCMT. The suppressed fading of the tags enables the accessibility to the diastolic phase of the cardiac cycle. (orig.)

  9. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    DEFF Research Database (Denmark)

    Camargo, A A; Samaia, H P; Dias-Neto, E

    2001-01-01

    ,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most...

  10. Random Tagging Genotyping by Sequencing (rtGBS, an Unbiased Approach to Locate Restriction Enzyme Sites across the Target Genome.

    Directory of Open Access Journals (Sweden)

    Elena Hilario

    Full Text Available Genotyping by sequencing (GBS is a restriction enzyme based targeted approach developed to reduce the genome complexity and discover genetic markers when a priori sequence information is unavailable. Sufficient coverage at each locus is essential to distinguish heterozygous from homozygous sites accurately. The number of GBS samples able to be pooled in one sequencing lane is limited by the number of restriction sites present in the genome and the read depth required at each site per sample for accurate calling of single-nucleotide polymorphisms. Loci bias was observed using a slight modification of the Elshire et al.some restriction enzyme sites were represented in higher proportions while others were poorly represented or absent. This bias could be due to the quality of genomic DNA, the endonuclease and ligase reaction efficiency, the distance between restriction sites, the preferential amplification of small library restriction fragments, or bias towards cluster formation of small amplicons during the sequencing process. To overcome these issues, we have developed a GBS method based on randomly tagging genomic DNA (rtGBS. By randomly landing on the genome, we can, with less bias, find restriction sites that are far apart, and undetected by the standard GBS (stdGBS method. The study comprises two types of biological replicates: six different kiwifruit plants and two independent DNA extractions per plant; and three types of technical replicates: four samples of each DNA extraction, stdGBS vs. rtGBS methods, and two independent library amplifications, each sequenced in separate lanes. A statistically significant unbiased distribution of restriction fragment size by rtGBS showed that this method targeted 49% (39,145 of BamH I sites shared with the reference genome, compared to only 14% (11,513 by stdGBS.

  11. Endogenous retrovirus sequences expressed in male mammalian ...

    African Journals Online (AJOL)

    In humans, one ERV family, human endogenous retrovirus- K (HERV-K) is abundantly expressed, and is associated with germ cell tumours, while ERV3 env is expressed in normal human testis. Conclusion: The expression of ERVs in male reproductive tissues suggests a possible role in normal and disease conditions ...

  12. Molecular characterization, sequence analysis and tissue expression of a porcine gene – MOSPD2

    Directory of Open Access Journals (Sweden)

    Yang Jie

    2017-01-01

    Full Text Available The full-length cDNA sequence of a porcine gene, MOSPD2, was amplified using the rapid amplification of cDNA ends method based on a pig expressed sequence tag sequence which was highly homologous to the coding sequence of the human MOSPD2 gene. Sequence prediction analysis revealed that the open reading frame of this gene encodes a protein of 491 amino acids that has high homology with the motile sperm domain-containing protein 2 (MOSPD2 of five species: horse (89%, human (90%, chimpanzee (89%, rhesus monkey (89% and mouse (85%; thus, it could be defined as a porcine MOSPD2 gene. This novel porcine gene was assigned GeneID: 100153601. This gene is structured in 15 exons and 14 introns as revealed by computer-assisted analysis. The phylogenetic analysis revealed that the porcine MOSPD2 gene has a closer genetic relationship with the MOSPD2 gene of horse. Tissue expression analysis indicated that the porcine MOSPD2 gene is generally and differentially expressed in the spleen, muscle, skin, kidney, lung, liver, fat and heart. Our experiment is the first to establish the primary foundation for further research on the porcine MOSPD2 gene.

  13. Soluble expression and purifiation of hepatitis B core antigen (HBcAg subgenotype B3 in Escherichia coli using thioredoxin fusion tag

    Directory of Open Access Journals (Sweden)

    Rahmah Waty

    2017-08-01

    Full Text Available Objective: To express HBcAg protein (hepatitis B virus subgenotype B3 in Escherichia coli in soluble form. Methods: HBcAg sequence of hepatitis B virus subgenotype B3 was cloned into plasmid pET32a and introduced to E. coli BL21 (DE3. The E. coli was grown in Luria-Bertani (LB medium supplemented with ampicillin with agitation. Protein expression was induced by adding isopropyl-β-D-thiogalactopyranoside (IPTG at concentrations of 0.1 mmol/L, 0.3 mmol/L, and 0.5 mmol/L at room temperature (28 °C. The bacteria were dissolved in lysis buffer and lysed by freeze-thawing method then sonication. The fusion protein [thioredoxin A-(His6tag-HBcAg] was purified using immobilized metal affinity chromatography. The protein expression was analyzed by SDS-PAGE, dot blot, and western blot. Results: This research showed that DNA sequence of HBcAg could be propagated in pET32a and soluble protein was successfully expressed in E. coli. Induction with 0.3 mmol/L IPTG and 4-hour incubation was the best condition to express the HBcAg protein. SDS-PAGE and dot blot analysis showed that HBcAg protein could be expressed in E. coli. Western blot analysis showed that molecular weight of HBcAg fusion protein was about 38.5 kDa. Conclusions: This study confirmed that HBcAg protein could be expressed in soluble form in E. coli.

  14. Impact of cell type and epitope tagging on heterologous expression of G protein-coupled receptor: a systematic study on angiotensin type II receptor.

    Directory of Open Access Journals (Sweden)

    Lili Jiang

    Full Text Available Despite heterologous expression of epitope-tagged GPCR is widely adopted for functional characterization, there is lacking of systematic analysis of the impact of expression host and epitope tag on GPCR expression. Angiotensin type II (AT2 receptor displays agonist-dependent and -independent activities, coupling to a spectrum of signaling molecules. However, consensus has not been reached on the subcellular distributions, signaling cascades and receptor-mediated actions. To examine the contributions of host cell and epitope tag on receptor expression and activity, epitope-tagged AT2 receptor variants were transiently or stably expressed in HEK293, CHO-K1 and PC12 cells. The epitope-tagged AT2 receptor variants were detected both on the cell membrane and in the perinuclear region. In transiently transfected HEK293 cells, Myc-AT2 existed predominantly as monomer. Additionally, a ladder of ubiquitinated AT2 receptor proteins was detected. By contrast, stably expressed epitope-tagged AT2 receptor variants existed as both monomer and high molecular weight complexes, and the latter was enriched in cell surface. Glycosylation promoted cell surface expression of Myc-AT2 but had no effect on AT2-GFP in HEK293 cells. In cells that stably expressed Myc-AT2, serum starvation induced apoptosis in CHO-K1 cells but not in HEK293 or PC12 cells. Instead, HEK293 and PC12 cells stably expressing Myc-AT2 exhibited partial cell cycle arrest with cells accumulating at G1 and S phases, respectively. Taken together, these results suggest that expression levels, subcellular distributions and ligand-independent constitutive activities of AT2 receptor were cell type-dependent while posttranslational processing of nascent AT2 receptor protein was modulated by epitope tag and mode of expression.

  15. Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant perioxidases

    DEFF Research Database (Denmark)

    Kjærsgård, I.V.H.; Jespersen, H.M.; Rasmussen, Søren Kjærsgård

    1997-01-01

    cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP la and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid...

  16. Identifying novel genes in C. elegans using SAGE tags

    Directory of Open Access Journals (Sweden)

    Chen Nansheng

    2010-12-01

    Full Text Available Abstract Background Despite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes. This situation is partly explained by the fact that gene prediction programs have been developed based on our incomplete understanding of gene feature information such as splicing and promoter characteristics. Additionally, full-length cDNAs of many genes and their isoforms are hard to obtain due to their low level or rare expression. In order to obtain full-length sequences of all protein-coding genes, alternative approaches are required. Results In this project, we have developed a method of reconstructing full-length cDNA sequences based on short expressed sequence tags which is called sequence tag-based amplification of cDNA ends (STACE. Expressed tags are used as anchors for retrieving full-length transcripts in two rounds of PCR amplification. We have demonstrated the application of STACE in reconstructing full-length cDNA sequences using expressed tags mined in an array of serial analysis of gene expression (SAGE of C. elegans cDNA libraries. We have successfully applied STACE to recover sequence information for 12 genes, for two of which we found isoforms. STACE was used to successfully recover full-length cDNA sequences for seven of these genes. Conclusions The STACE method can be used to effectively reconstruct full-length cDNA sequences of genes that are under-represented in cDNA sequencing projects and have been missed by existing gene prediction methods, but their existence has been suggested by short sequence tags such as SAGE tags.

  17. High-yield secretion of recombinant proteins expressed in tobacco cell culture with a designer glycopeptide tag: Process development.

    Science.gov (United States)

    Zhang, Ningning; Gonzalez, Maria; Savary, Brett; Xu, Jianfeng

    2016-03-01

    Low-yield protein production remains the most significant economic hurdle with plant cell culture technology. Fusions of recombinant proteins with hydroxyproline-O-glycosylated designer glycopeptide tags have consistently boosted secreted protein yields. This prompted us to study the process development of this technology aiming to achieve productivity levels necessary for commercial viability. We used a tobacco BY-2 cell culture expressing EGFP as fusion with a glycopeptide tag comprised of 32 repeat of "Ser-Pro" dipeptide, or (SP)32 , to study cell growth and protein secretion, culture scale-up, and establishment of perfusion cultures for continuous production. The BY-2 cells accumulated low levels of cell biomass (~7.5 g DW/L) in Schenk & Hildebrandt medium, but secreted high yields of (SP)32 -tagged EGFP (125 mg/L). Protein productivity of the cell culture has been stable for 6.0 years. The BY-2 cells cultured in a 5-L bioreactor similarly produced high secreted protein yield at 131 mg/L. Successful operation of a cell perfusion culture for 30 days was achieved under the perfusion rate of 0.25 and 0.5 day(-1) , generating a protein volumetric productivity of 17.6 and 28.9 mg/day/L, respectively. This research demonstrates the great potential of the designer glycopeptide technology for use in commercial production of valuable proteins with plant cell cultures. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Identification of differentially expressed sequences in bud ...

    African Journals Online (AJOL)

    The developmental process of lily flower bud differentiation has been studied in morphology thoroughly, but the mechanism in molecular biology is still ambiguous and few studies on genetic expression have been carried out. Little is known about the physiological responses of flower bud differentiation in Oriental hybrid lily ...

  19. Endogenous retrovirus sequences expressed in male mammalian ...

    African Journals Online (AJOL)

    Objectives: To review the research findings on the expression of endogenous retroviruses and retroviral-related particles in male mammalian reproductive tissues, and to discuss their possible role in normal cellular events and association with disease conditions in male reproductive tissues. Data sources: Published ...

  20. Predicting tissue-specific expressions based on sequence characteristics

    KAUST Repository

    Paik, Hyojung; Ryu, Tae Woo; Heo, Hyoungsam; Seo, Seungwon; Lee, Doheon; Hur, Cheolgoo

    2011-01-01

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  1. Predicting tissue-specific expressions based on sequence characteristics

    KAUST Repository

    Paik, Hyojung

    2011-04-30

    In multicellular organisms, including humans, understanding expression specificity at the tissue level is essential for interpreting protein function, such as tissue differentiation. We developed a prediction approach via generated sequence features from overrepresented patterns in housekeeping (HK) and tissue-specific (TS) genes to classify TS expression in humans. Using TS domains and transcriptional factor binding sites (TFBSs), sequence characteristics were used as indices of expressed tissues in a Random Forest algorithm by scoring exclusive patterns considering the biological intuition; TFBSs regulate gene expression, and the domains reflect the functional specificity of a TS gene. Our proposed approach displayed better performance than previous attempts and was validated using computational and experimental methods.

  2. Billfish Tagging

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The SWFSC's constituent-based Billfish Tagging Program began in 1963 and since that time has provided conventional spaghetti type tags and tagging supplies to...

  3. Simple purification method for a recombinantly expressed native His-tag-free aminopeptidase A from Lactobacillus delbrueckii.

    Science.gov (United States)

    Stressler, Timo; Tanzer, Coralie; Ewert, Jacob; Claaßen, Wolfgang; Fischer, Lutz

    2017-03-01

    The aminopeptidase A (PepA; EC 3.4.11.7) is an intracellular exopeptidase present in lactic acid bacteria. The PepA cleaves glutamyl/aspartyl residues from the N-terminal end of peptides and can, therefore, be applied for the production of protein hydrolysates with an increased amount of these amino acids, which results in a savory taste (umami). The first PepA from a lactobacilli strain was recombinantly expressed in Escherichia coli in a recently published study and harbored a C-terminal His 6 -tag for easier purification. Due to the fact that a His-tag might influence the properties of an enzyme, a simple purification method for the non-His-tagged PepA was required. Surprisingly, the PepA precipitated at a very low ammonium sulfate concentration of 5%. Unusual for a precipitating step, the purity of PepA was over 95% and the obtained activity yield was 110%. The high purity allows biochemical characterization and kinetic investigation. As a result, the optimum pH (6.0-6.5) and temperature (60-65 °C) were comparable to the His 6 -tag harboring PepA; the K M value was at 0.79 mM slightly lower compared to 1.21 mM, respectively. Since PepA is a homo dodecamer, it has a high molecular mass of approximately 480 kDa. Therefore, a subsequent preparative size-exclusion chromatography (SEC) step seemed promising. The PepA after SEC was purified to homogeneity. In summary, the simple two-step purification method presented can be applied to purify high amounts of PepA that will allow the performance of experiments in the future to crystalize PepA for the first time. Copyright © 2016 Elsevier Inc. All rights reserved.

  4. Tag questions Tag questions

    Directory of Open Access Journals (Sweden)

    David Brazil

    2008-04-01

    Full Text Available The so-called 'tag' structures of English have received a lot of attention in language teaching programmes, attention that is not hard to justify when one considers the problems and anxiety they can occasion for many foreign learners. Most teachers one speaks to seem fairly willing to agree, however, that traditional treatments of the topic leave much to be desired. It happens, also, that, when considered collectively, the tags and some related phenomena have a special heoretical interest. For they constitute a field in which it seems essential to bring together insights that derive from the study of several aspects of linguistic organisation, aspects which in some recent work have been held to need distinctive kinds of descriptive category to handle. Traditional treatments have found it necessary to recognise different syntactic types (e.g. 'same polarity' and 'reversed polarity' tags and ifferent intonational treatments ("falling'and 'rising' tag; while the way the communicative significance of the various permutations is described normally requires reference to the expectations they signal regarding the immediately following behaviour of the other party (in the common phrase, 'What kind of answer they expect'. This last consideration places the matter squarely in the arena of recent work on the analysis of interactive discourse. The so-called 'tag' structures of English have received a lot of attention in language teaching programmes, attention that is not hard to justify when one considers the problems and anxiety they can occasion for many foreign learners. Most teachers one speaks to seem fairly willing to agree, however, that traditional treatments of the topic leave much to be desired. It happens, also, that, when considered collectively, the tags and some related phenomena have a special heoretical interest. For they constitute a field in which it seems essential to bring together insights that derive from the study of several aspects

  5. A SEP tag enhances the expression, solubility and yield of recombinant TEV protease without altering its activity.

    Science.gov (United States)

    Nautiyal, Kalpana; Kuroda, Yutaka

    2018-05-25

    Tobacco Etch Virus (TEV) protease is used in the purification of recombinant proteins, but its usage is often hampered by solubility issues. Here, we report a short, 12-residue solubility enhancing peptide (SEP) tag attached at the C-terminus of TEV (TEV-C9R). We assessed the effects of the C9R tag on the biophysical and biochemical characteristics of TEV. The yield of HPLC purified TEV-C9R expressed in E. coli grown in 200 mL LB or TB media was between 10 and 13 mg, which was up to 6.5 times higher than the yield of the untagged TEV (untagged-TEV). TEV-C9R was active over a pH range of 5-8, which was wider than that of the commonly used thrombin, and it remained active upon incubation at 60 °C much longer than the untagged-TEV, which aggregated at this temperature. Static and dynamic light scattering demonstrated the higher solubility of purified TEV-C9R. Furthermore, the thermal unfolding of TEV-C9R, as assessed by circular dichroism at pH 4.7, was almost perfectly reversible, in contrast to that of untagged-TEV, which aggregated at high temperature. These results demonstrate the improved biophysical and biochemical characteristics of TEV-C9R originating from higher solubility and provide another example of how SEP tags can enhance enzyme solubility without altering its activity. Copyright © 2018 Elsevier B.V. All rights reserved.

  6. Birbeck granule-like "organized smooth endoplasmic reticulum" resulting from the expression of a cytoplasmic YFP-tagged langerin.

    Directory of Open Access Journals (Sweden)

    Cédric Lenormand

    Full Text Available Langerin is required for the biogenesis of Birbeck granules (BGs, the characteristic organelles of Langerhans cells. We previously used a Langerin-YFP fusion protein having a C-terminal luminal YFP tag to dynamically decipher the molecular and cellular processes which accompany the traffic of Langerin. In order to elucidate the interactions of Langerin with its trafficking effectors and their structural impact on the biogenesis of BGs, we generated a YFP-Langerin chimera with an N-terminal, cytosolic YFP tag. This latter fusion protein induced the formation of YFP-positive large puncta. Live cell imaging coupled to a fluorescence recovery after photobleaching approach showed that this coalescence of proteins in newly formed compartments was static. In contrast, the YFP-positive structures present in the pericentriolar region of cells expressing Langerin-YFP chimera, displayed fluorescent recovery characteristics compatible with active membrane exchanges. Using correlative light-electron microscopy we showed that the coalescent structures represented highly organized stacks of membranes with a pentalaminar architecture typical of BGs. Continuities between these organelles and the rough endoplasmic reticulum allowed us to identify the stacks of membranes as a form of "Organized Smooth Endoplasmic Reticulum" (OSER, with distinct molecular and physiological properties. The involvement of homotypic interactions between cytoplasmic YFP molecules was demonstrated using an A206K variant of YFP, which restored most of the Langerin traffic and BG characteristics observed in Langerhans cells. Mutation of the carbohydrate recognition domain also blocked the formation of OSER. Hence, a "double-lock" mechanism governs the behavior of YFP-Langerin, where asymmetric homodimerization of the YFP tag and homotypic interactions between the lectin domains of Langerin molecules participate in its retention and the subsequent formation of BG-like OSER. These

  7. Expression profiling and comparative sequence derived insights into lipid metabolism

    Energy Technology Data Exchange (ETDEWEB)

    Callow, Matthew J.; Rubin, Edward M.

    2001-12-19

    Expression profiling and genomic DNA sequence comparisons are increasingly being applied to the identification and analysis of the genes involved in lipid metabolism. Not only has genome-wide expression profiling aided in the identification of novel genes involved in important processes in lipid metabolism such as sterol efflux, but the utilization of information from these studies has added to our understanding of the regulation of pathways participating in the process. Coupled with these gene expression studies, cross species comparison, searching for sequences conserved through evolution, has proven to be a powerful tool to identify important non-coding regulatory sequences as well as the discovery of novel genes relevant to lipid biology. An example of the value of this approach was the recent chance discovery of a new apolipoprotein gene (apo AV) that has dramatic effects upon triglyceride metabolism in mice and humans.

  8. PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET sequence data

    Directory of Open Access Journals (Sweden)

    Wei Chia-Lin

    2006-08-01

    Full Text Available Abstract Background We recently developed the Paired End diTag (PET strategy for efficient characterization of mammalian transcriptomes and genomes. The paired end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads, and correctly yet efficiently map PETs to reference genome sequences. To accommodate and streamline data analysis of the large volume PET sequences generated from each PET experiment, an automated PET data process pipeline is desirable. Results We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the ProjectManager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large volume dataset. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. With a 2.4 GHz LINUX machine, it takes approximately six hours to process one million PETs from extraction to mapping. Conclusion The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality check and system for multi-layer data management.

  9. Method to produce acetyldiacylglycerols (ac-TAGs) by expression of an acetyltransferase gene isolated from Euonymus alatus (burning bush)

    Energy Technology Data Exchange (ETDEWEB)

    Durrett, Timothy; Ohlrogge, John; Pollard, Michael

    2016-05-03

    The present invention relates to novel diacylglycerol acyltransferase genes and proteins, and methods of their use. In particular, the invention describes genes encoding proteins having diacylglycerol acetyltransferase activity, specifically for transferring an acetyl group to a diacylglycerol substrate to form acetyl-Triacylglycerols (ac-TAGS), for example, a 3-acetyl-1,2-diacyl-sn-glycerol. The present invention encompasses both native and recombinant wild-type forms of the transferase, as well as mutants and variant forms. The present invention also relates to methods of using novel diacylglycerol acyltransferase genes and proteins, including their expression in transgenic organisms at commercially viable levels, for increasing production of 3-acetyl-1,2-diacyl-sn-glycerols in plant oils and altering the composition of oils produced by microorganisms, such as yeast, by increasing ac-TAG production. Additionally, oils produced by methods of the present inventions comprising genes and proteins are contemplated for use as biodiesel fuel, in polymer production and as naturally produced food oils with reduced calories.

  10. PGC TagSNP and its interaction with H. pylori and relation with gene expression in susceptibility to gastric carcinogenesis.

    Directory of Open Access Journals (Sweden)

    Cai-Yun He

    Full Text Available BACKGROUND: Pepsinogen C (PGC plays an important role in sustaining the cellular differentiation during the process of gastric carcinogenesis. This study aimed to assess the role of PGC tagSNPs and their interactions with Helicobacter pylori (H. pylori in the development of gastric cancer and its precursor, atrophic gastritis. METHODS: Four PGC tagSNPs (rs6941539, rs6912200, rs3789210 and rs6939861 were genotyped by Sequenom MassARRAY platform in a total of 2311 subjects consisting of 642 gastric cancer, 774 atrophic gastritis, and 895 healthy control subjects. The mRNA and protein expression levels of PGC in gastric tissues and in serum were respectively measured by quantitative reverse transcriptase-polymerase chain reaction (qRT-PCR, immunohistochemistry, and Eenzyme-linked immunoabsorbent assay (ELISA. RESULTS: We found associations between PGC rs3789210 CG/GG genotypes and reduced gastric cancer risk and between PGC rs6939861 A variant allele and increased risks of both gastric cancer and atrophic gastritis. As for the haplotypes of PGC rs6941539-rs6912200-rs3789210-rs6939861 loci, the TTCA and TTGG haplotypes were respectively associated with increased and reduced risks of both gastric cancer and atrophic gastritis; additionally, the CTCA haplotype was associated with increased atrophic gastritis risk. Very interestingly, rs6912200 CT/TT genotypes had a positive interaction with H. pylori, synergistically elevating the gastric cancer risk. Moreover, healthy subjects who carried rs6912200 CT, TT and CT/TT variant genotypes had lower histological and serum expression levels of PGC protein. CONCLUSIONS: Our findings highlight an important role of PGC rs3789210 and rs6939861 in altering susceptibility to atrophic gastritis and/or gastric cancer. Moreover, people who carry rs6912200 variant genotypes exhibit higher gastric cancer risk in case of getting H. pylori infection, which strongly suggest a necessity of preventing and/or eliminating H

  11. DNA sequence and prokaryotic expression analysis of vitellogenin ...

    African Journals Online (AJOL)

    In this study, the DNA sequence of vitellogenin from Antheraea pernyi (Ap-Vg) was identified and its functional domain (30-740 aa, Ap-Vg-1) was expressed in Escherichia coli BL21 (DE3) cells. The recombinant Ap-Vg-1 proteins were purified and used for antibody preparation. The results showed that the intact DNA ...

  12. Sequence and expression analysis of gaps in human chromosome 20

    DEFF Research Database (Denmark)

    Minocherhomji, Sheroy; Seemann, Stefan; Mang, Yuan

    2012-01-01

    /or overlap disease-associated loci, including the DLGAP4 locus. In this study, we sequenced ~99% of all three unfinished gaps on human chr 20, determined their complete genomic sizes and assessed epigenetic profiles using a combination of Sanger sequencing, mate pair paired-end high-throughput sequencing......The finished human genome-assemblies comprise several hundred un-sequenced euchromatic gaps, which may be rich in long polypurine/polypyrimidine stretches. Human chromosome 20 (chr 20) currently has three unfinished gaps remaining on its q-arm. All three gaps are within gene-dense regions and...... and chromatin, methylation and expression analyses. We found histone 3 trimethylated at Lysine 27 to be distributed across all three gaps in immortalized B-lymphocytes. In one gap, five novel CpG islands were predominantly hypermethylated in genomic DNA from peripheral blood lymphocytes and human cerebellum...

  13. Characterization of soybean genomic features by analysis of its expressed sequence tags

    DEFF Research Database (Denmark)

    Tian, Ai-Guo; Wang, Jun; Cui, Peng

    2004-01-01

    to be fast-evolving. Soybean unigenes with no match to genes within the Arabidopsis genome were identified as soybean-specific genes. These genes were mainly involved in nodule development and the synthesis of seed storage proteins. In addition, we also identified 61 genes regulated by salicylic acid, 1...

  14. Androgenesis in chickpea: Anther culture and expressed sequence tags derived annotation

    DEFF Research Database (Denmark)

    Panchangam, Sameera Sastry; Mallikarjuna, Nalini; Gaur, Pooran M.

    2014-01-01

    Double haploid technique is not routinely used in legume breeding programs, though recent publications report haploid plants via anther culture in chickpea (Cicer arietinum L.). The focus of this study was to develop an efficient and reproducible protocol for the production of double haploids wit...

  15. Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

    Science.gov (United States)

    Hoang, Van L T; Innes, David J; Shaw, P Nicholas; Monteith, Gregory R; Gidley, Michael J; Dietzgen, Ralf G

    2015-07-30

    Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango varieties Mangifera indica L., a member of the family Anacardiaceae: Kensington Pride (KP), Irwin (IW) and Nam Doc Mai (NDM) and to determine associations with gene expression and mango flavonoid profiles. A close evolutionary relationship between mango genes and those from the woody species poplar of the Salicaceae family (Populus trichocarpa) and grape of the Vitaceae family (Vitis vinifera), was revealed through phylogenetic analysis of PF pathway genes. We discovered 145 SNPs in total within coding sequences with an average frequency of one SNP every 316 bp. Variety IW had the highest SNP frequency (one SNP every 258 bp) while KP and NDM had similar frequencies (one SNP every 369 bp and 360 bp, respectively). The position in the PF pathway appeared to influence the extent of genetic diversity of the encoded enzymes. The entry point enzymes phenylalanine lyase (PAL), cinnamate 4-mono-oxygenase (C4H) and chalcone synthase (CHS) had low levels of SNP diversity in their coding sequences, whereas anthocyanidin reductase (ANR) showed the highest SNP frequency followed by flavonoid 3'-hydroxylase (F3'H). Quantitative PCR revealed characteristic patterns of gene expression that differed between mango peel and flesh, and between varieties. The combination of mango expressed sequence tags and availability of well-established reference PF biosynthetic genes from other plant species allowed the identification of coding sequences of genes that may lead to the formation of important flavonoid compounds in mango fruits and facilitated characterisation of single nucleotide polymorphisms between varieties. We discovered an association between the extent of sequence variation and

  16. Sequence and expression pattern of a novel human orphan G-protein-coupled receptor, GPRC5B, a family C receptor with a short amino-terminal domain

    DEFF Research Database (Denmark)

    Bräuner-Osborne, Hans; Krogsgaard-Larsen, P

    2000-01-01

    Query of GenBank with the amino acid sequence of human metabotropic glutamate receptor subtype 2 (mGluR2) identified a predicted gene product of unknown function on BAC clone CIT987SK-A-69G12 (located on chromosome band 16p12) as a homologous protein. The transcript, entitled GPRC5B, was cloned f...... from an expressed sequence tag clone that contained the entire open reading frame of the transcript encoding a protein of 395 amino acids. Analysis of the protein sequence reveal that GPRC5B contains a signal peptide and seven transmembrane alpha-helices, which is a hallmark of G...

  17. Haplotypes of the TaGS5-A1 gene are associated with thousand-kernel weight in Chinese bread wheat

    Directory of Open Access Journals (Sweden)

    Wang Sha Sha

    2016-06-01

    Full Text Available In previous work, we cloned TaGS5 gene and found the association of TaGS5-A1 alleles with agronomic traits. In this study, the promoter sequence of the TaGS5-A1 gene was isolated from bread wheat. Sequencing results revealed that a G insertion was found in position -1925 bp of the TaGS5-A1 gene (Reference to ATG, which occurred in the Sp1 domain of the promoter sequence. Combined with previous single nucleotide polymorphism (SNP in the TaGS5-A1 exon sequence, four genotypes were formed at the TaGS5-A1 locus and were designated as TaGS5-A1a-a, TaGS5-A1a-b, TaGS5-A1b-a, and TaGS5-A1b-b, respectively. Analysis of the association of TaGS5-A1 alleles with agronomic traits indicated that cultivars with the TaGS5-A1a-b allele possessed significantly higher thousand-kernel weight (TKW and lower plant height than cultivars with the TaGS5-A1a-a allele, and cultivars with the TaGS5-A1b-b allele showed higher TKW than cultivars with the TaGS5-A1b-a allele. The differences of these traits between the TaGS5-A1a-a and TaGS5-A1a-b alleles were larger than those of the TaGS5-A1b-a and TaGS5-A1b-b alleles, suggesting that the -1925G insertion plays the more important role in TaGS5-A1a genotypes than in TaGS5-A1b genotypes. qRT-PCR indicated that TaGS5-A1b-b possessed the significantly highest expression level among four TaGS5-A1 haplotypes in mature seeds and further showed a significantly higher expression level than TaGS5-A1b-a at five different developmental stages of the seeds, suggesting that high expression of TaGS5-A1 was positively associated with high TKW in bread wheat. This study could provide a relatively superior genotype in view of TKW in wheat breeding programs and could also provide important information for dissection of the regulatory mechanism of the yield-related traits.

  18. Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles.

    Science.gov (United States)

    Guo, Shaogui; Liu, Jingan; Zheng, Yi; Huang, Mingyun; Zhang, Haiying; Gong, Guoyi; He, Hongju; Ren, Yi; Zhong, Silin; Fei, Zhangjun; Xu, Yong

    2011-09-21

    Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and molecular basis of these changes and significantly expand the watermelon transcript catalog, we have selected four critical stages of watermelon fruit development and used Roche/454 next-generation sequencing technology to generate a large expressed sequence tag (EST) dataset and a comprehensive transcriptome profile for watermelon fruit flesh tissues. We performed half Roche/454 GS-FLX run for each of the four watermelon fruit developmental stages (immature white, white-pink flesh, red flesh and over-ripe) and obtained 577,023 high quality ESTs with an average length of 302.8 bp. De novo assembly of these ESTs together with 11,786 watermelon ESTs collected from GenBank produced 75,068 unigenes with a total length of approximately 31.8 Mb. Overall 54.9% of the unigenes showed significant similarities to known sequences in GenBank non-redundant (nr) protein database and around two-thirds of them matched proteins of cucumber, the most closely-related species with a sequenced genome. The unigenes were further assigned with gene ontology (GO) terms and mapped to biochemical pathways. More than 5,000 SSRs were identified from the EST collection. Furthermore we carried out digital gene expression analysis of these ESTs and identified 3,023 genes that were differentially expressed during watermelon fruit development and ripening, which provided novel insights into watermelon fruit biology and a comprehensive resource of candidate genes for future functional analysis. We then generated profiles of several interesting metabolites that are important to fruit quality including pigmentation and sweetness. Integrative analysis of metabolite and digital gene expression

  19. Expression and purification of short hydrophobic elastin-like polypeptides with maltose-binding protein as a solubility tag.

    Science.gov (United States)

    Bataille, Laure; Dieryck, Wilfrid; Hocquellet, Agnès; Cabanne, Charlotte; Bathany, Katell; Lecommandoux, Sébastien; Garbay, Bertrand; Garanger, Elisabeth

    2015-06-01

    Elastin-like polypeptides (ELPs) are biodegradable polymers with interesting physico-chemical properties for biomedical and biotechnological applications. The recombinant expression of hydrophobic elastin-like polypeptides is often difficult because they possess low transition temperatures, and therefore form aggregates at sub-ambient temperatures. To circumvent this difficulty, we expressed in Escherichia coli three hydrophobic ELPs (VPGIG)n with variable lengths (n=20, 40, and 60) in fusion with the maltose-binding protein (MBP). Fusion proteins were soluble and yields of purified MBP-ELP ranged between 66 and 127mg/L culture. After digestion of the fusion proteins by enterokinase, the ELP moiety was purified by using inverse transition cycling. The purified fraction containing ELP40 was slightly contaminated by traces of undigested fusion protein. Purification of ELP60 was impaired because of co-purification of the MBP tag during inverse transition cycling. ELP20 was successfully purified to homogeneity, as assessed by gel electrophoresis and mass spectrometry analyses. The transition temperature of ELP20 was measured at 15.4°C in low salt buffer. In conclusion, this method can be used to produce hydrophobic ELP of low molecular mass. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Optimized expression in Pichia pastoris eliminates common protein contaminants from subsequent His-tag purification.

    Science.gov (United States)

    Chen, Yong; Li, Yang; Liu, Peng; Sun, Qun; Liu, Zhu

    2014-04-01

    A weakness of using immobilized metal affinity chromatography (IMAC) to purify recombinant proteins expressed in Pichia pastoris is the co-purification of native proteins that exhibit high affinities for Ni-IMAC. We have determined the elution profiles of P. pastoris proteins and have examined the native proteins that co-purify when eluting with 100 mM imidazole. Four major contaminants were identified: mitochondrial alcohol dehydrogenase isozyme III (mADH), nucleotide excision repair endonuclease, and the hypothetical proteins TPHA_0L01390 and TDEL_0B02190 which are homologous proteins derived from Tetrapisispora phaffii and Torulaspora delbrueckii, respectively. A new P. pastoris expression strain was engineered that eliminated the predominant contaminant, mADH, by gene disruption. The total amount of protein contaminants was reduced by 55 % without effecting cell growth. The present study demonstrates the feasibility of using a proteomic approach to facilitate bioprocess optimization.

  1. Long span DNA paired-end-tag (DNA-PET sequencing strategy for the interrogation of genomic structural mutations and fusion-point-guided reconstruction of amplicons.

    Directory of Open Access Journals (Sweden)

    Fei Yao

    Full Text Available Structural variations (SVs contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. While genomic DNA paired-end-tag (DNA-PET sequencing is an attractive approach to identify genomic SVs, the current application of PET sequencing with short insert size DNA can be insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We employed a recently developed procedure to generate PET sequencing data using large DNA inserts of 10-20 kb and compared their characteristics with short insert (1 kb libraries for their ability to identify SVs. Our results suggest that although short insert libraries bear an advantage in identifying small deletions, they do not provide significantly better breakpoint resolution. In contrast, large inserts are superior to short inserts in providing higher physical genome coverage for the same sequencing cost and achieve greater sensitivity, in practice, for the identification of several classes of SVs, such as copy number neutral and complex events. Furthermore, our results confirm that large insert libraries allow for the identification of SVs within repetitive sequences, which cannot be spanned by short inserts. This provides a key advantage in studying rearrangements in cancer, and we show how it can be used in a fusion-point-guided-concatenation algorithm to study focally amplified regions in cancer.

  2. Development and application of sequence-tagged microsatellite site (STMS) markers in chickpea (Cicer arietinum), banana (Musa spp.) and their major pathogens, Ascochyta rabiei and Mycosphaerella fijiensis

    International Nuclear Information System (INIS)

    Winter, P.; Kaemmer, D.; Paff, T.; Geistlinger, J.; Neu, C.; Kahl, G.

    2001-01-01

    DNA markers of various kinds have found widespread application in many facets of plant breeding and plant pathogen control. Yet another marker type, sequence-tagged microsatellite (STMS) markers, provides the markers of choice for nearly every crop because of their co-dominant nature, reliability, ease of application and high polymorphic information content. We report here on the development of a whole set of STMS markers and the respective, selected primer sequences for two important crops, chickpea (Cicer arietinum L.) and banana (Musa acuminata), and for their most devastating fungal pathogens, Ascochyta rabiei and Mycosphaerella fijiensis, respectively. These markers were generated either by direct screening of size-selected genomic libraries with microsatellite-complementary oligonucleotides, or by enrichment of DNA fragments containing microsatellite sequences. A total of 69 markers for chickpea, 15 markers for M. acuminata, 19 markers for A rabiei and 11 markers for M. fijiensis, selected on the basis of their high information content and ease of use are presented here. These can be applied for mapping of the respective genomes, for various population studies, and cultivar and isolate identification. We further demonstrate that several of these markers can potentially be applied across species boundaries and thus could increase the marker repertoire also for other species of the genus Cicer, Musa and for Ascochyta-type pathogens of bean, and potentially also of lentil and pea. (author)

  3. Planarian homeobox genes: cloning, sequence analysis, and expression.

    Science.gov (United States)

    Garcia-Fernàndez, J; Baguñà, J; Saló, E

    1991-01-01

    Freshwater planarians (Platyhelminthes, Turbellaria, and Tricladida) are acoelomate, triploblastic, unsegmented, and bilaterally symmetrical organisms that are mainly known for their ample power to regenerate a complete organism from a small piece of their body. To identify potential pattern-control genes in planarian regeneration, we have isolated two homeobox-containing genes, Dth-1 and Dth-2 [Dugesia (Girardia) tigrina homeobox], by using degenerate oligonucleotides corresponding to the most conserved amino acid sequence from helix-3 of the homeodomain. Dth-1 and Dth-2 homeodomains are closely related (68% at the nucleotide level and 78% at the protein level) and show the conserved residues characteristic of the homeodomains identified to data. Similarity with most homeobox sequences is low (30-50%), except with Drosophila NK homeodomains (80-82% with NK-2) and the rodent TTF-1 homeodomain (77-87%). Some unusual amino acid residues specific to NK-2, TTF-1, Dth-1, and Dth-2 can be observed in the recognition helix (helix-3) and may define a family of homeodomains. The deduced amino acid sequences from the cDNAs contain, in addition to the homeodomain, other domains also present in various homeobox-containing genes. The expression of both genes, detected by Northern blot analysis, appear slightly higher in cephalic regions than in the rest of the intact organism, while a slight increase is detected in the central period (5 days) or regeneration. Images PMID:1714599

  4. Computational exploration of microRNAs from expressed sequence tags of Humulus lupulus, target predictions and expression analysis

    Czech Academy of Sciences Publication Activity Database

    Mishra, Ajay Kumar; Duraisamy, Ganesh Selvaraj; Týcová, Anna; Matoušek, Jaroslav

    2015-01-01

    Roč. 59, December 2015 (2015), s. 131-141 ISSN 1476-9271 R&D Projects: GA ČR GA13-03037S Institutional support: RVO:60077344 Keywords : MicroRNA * Humulus lupulus * Comparative genomics * Posttranscriptional gene regulation Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 1.014, year: 2015

  5. Expression and Purification of Recombinant Proteins in Escherichia coli with a His6 or Dual His6-MBP Tag.

    Science.gov (United States)

    Raran-Kurussi, Sreejith; Waugh, David S

    2017-01-01

    Rapid advances in bioengineering and biotechnology over the past three decades have greatly facilitated the production of recombinant proteins in Escherichia coli. Affinity-based methods that employ protein or peptide based tags for protein purification have been instrumental in this progress. Yet insolubility of recombinant proteins in E. coli remains a persistent problem. One way around this problem is to fuse an aggregation-prone protein to a highly soluble partner. E. coli maltose-binding protein (MBP) is widely acknowledged as a highly effective solubilizing agent. In this chapter, we describe how to construct either a His 6 - or a dual His 6 -MBP tagged fusion protein by Gateway ® recombinational cloning and how to evaluate their yield and solubility. We also describe a simple and rapid procedure to test the solubility of proteins after removing their N-terminal fusion tags by tobacco etch virus (TEV) protease digestion. The choice of whether to use a His 6 tag or a His 6 -MBP tag can be made on the basis of this solubility test.

  6. Comparison of direct boiling method with commercial kits for extracting fecal microbiome DNA by Illumina sequencing of 16S rRNA tags.

    Science.gov (United States)

    Peng, Xin; Yu, Ke-Qiang; Deng, Guan-Hua; Jiang, Yun-Xia; Wang, Yu; Zhang, Guo-Xia; Zhou, Hong-Wei

    2013-12-01

    Low cost and high throughput capacity are major advantages of using next generation sequencing (NGS) techniques to determine metagenomic 16S rRNA tag sequences. These methods have significantly changed our view of microorganisms in the fields of human health and environmental science. However, DNA extraction using commercial kits has shortcomings of high cost and time constraint. In the present study, we evaluated the determination of fecal microbiomes using a direct boiling method compared with 5 different commercial extraction methods, e.g., Qiagen and MO BIO kits. Principal coordinate analysis (PCoA) using UniFrac distances and clustering showed that direct boiling of a wide range of feces concentrations gave a similar pattern of bacterial communities as those obtained from most of the commercial kits, with the exception of the MO BIO method. Fecal concentration by boiling method affected the estimation of α-diversity indices, otherwise results were generally comparable between boiling and commercial methods. The operational taxonomic units (OTUs) determined through direct boiling showed highly consistent frequencies with those determined through most of the commercial methods. Even those for the MO BIO kit were also obtained by the direct boiling method with high confidence. The present study suggested that direct boiling could be used to determine the fecal microbiome and using this method would significantly reduce the cost and improve the efficiency of the sample preparation for studying gut microbiome diversity. © 2013 Elsevier B.V. All rights reserved.

  7. Monomorphism in humans and sequence differences among higher primates for a sequence tagged site (STS) in homeo box cluster 2 as assayed by denaturing gradient electrophoresis

    Energy Technology Data Exchange (ETDEWEB)

    Ruano, G.; Ruddle, F.H.; Kidd, K.K. (Yale Univ., New Haven, CT (United States)); Gray, M.R. (Tufts Univ., Boston, MA (United States)); Miki, Tetsuro (Osaka Univ. (Japan)); Ferguson-Smith, A.C. (Inst. of Animal Physiology and Genetics Research, Cambridge (United Kingdom))

    1990-03-11

    The human homeo box cluster 2 (HOX2) contains genes coding for DNA binding proteins involved in developmental control and is highly conserved between mouse and man. The authors have applied in concert the Polymerase Chain Reaction (PCR) and Denaturing Gradient Electrophoresis (DGE) to amplify defined primate HOX2 segments and to detect sequence differences among them. They have sequenced a PstI fragment 4 kb upstream from HOX 2.2 and synthesized primers delimiting both halves of 630 bp segment within it PCR on various unrelated humans and SC-PCR on chimpanzee, gorilla, orangutan and gibbon yielded products of the same length for each primer pair.

  8. Expression dynamics and ultrastructural localization of epitope-tagged Abutilon mosaic virus nuclear shuttle and movement proteins in Nicotiana benthamiana cells

    International Nuclear Information System (INIS)

    Kleinow, Tatjana; Tanwir, Fariha; Kocher, Cornelia; Krenz, Bjoern; Wege, Christina; Jeske, Holger

    2009-01-01

    The geminivirus Abutilon mosaic virus (AbMV) encodes two proteins which are essential for viral spread within plants. The nuclear shuttle protein (NSP) transfers viral DNA between the nucleus and cytoplasm, whereas the movement protein (MP) facilitates transport between cells through plasmodesmata and long-distance via phloem. An inducible overexpression system for epitope-tagged NSP and MP in plants yielded unprecedented amounts of both proteins. Western blots revealed extensive posttranslational modification and truncation for MP, but not for NSP. Ultrastructural examination of Nicotiana benthamiana tissues showed characteristic nucleopathic alterations, including fibrillar rings, when epitope-tagged NSP and MP were simultaneously expressed in leaves locally infected with an AbMV DNA A in which the coat protein gene was replaced by a green fluorescent protein encoding gene. Immunogold labelling localized NSP in the nucleoplasm and in the fibrillar rings. MP appeared at the cell periphery, probably the plasma membrane, and plasmodesmata.

  9. Anti-DNA antibodies: Sequencing, cloning, and expression

    Energy Technology Data Exchange (ETDEWEB)

    Barry, M.M.

    1992-01-01

    To gain some insight into the mechanism of systemic lupus erythematosus, and the interactions involved in proteins binding to DNA four anti-DNA antibodies have been investigated. Two of the antibodies, Hed 10 and Jel 242, have previously been prepared from female NZB/NZW mice which develop an autoimmune disease resembling human SLE. The remaining two antibodies, Jel 72 and Jel 318, have previously been produced via immunization of C57BL/6 mice. The isotypes of the four antibodies investigated in this thesis were determined by an enzyme-linked-immunosorbent assay. All four antibodies contained [kappa] light chains and [gamma]2a heavy chains except Jel 318 which contains a [gamma]2b heavy chain. The complete variable regions of the heavy and light chains of these four antibodies were sequenced from their respective mRNAs. The gene segments and variable gene families expressed in each antibody were identified. Analysis of the genes used in the autoimmune anti-DNA antibodies and those produced by immunization indicated no obvious differences to account for their different origins. Examination of the amino acid residues present in the complementary-determining regions of these four antibodies indicates a preference for aromatic amino acids. Jel 72 and Jel 242 contain three arginine residues in the third complementary-determining region. A single-chain Fv and the variable region of the heavy chain of Hed 10 were expressed in Escherichia coli. Expression resulted in the production of a 26,000 M[sub r] protein and a 15,000 M[sub r] protein. An immunoblot indicated that the 26,000 M[sub r] protein was the Fv for Hed 10, while the 15,000 M[sub r] protein was shown to bind poly (dT). The contribution of the heavy chain to DNA binding was assessed.

  10. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang

    2010-06-10

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  11. LOX: Inferring level of expression from diverse methods of census sequencing

    KAUST Repository

    Zhang, Zhang; Ló pez-Girá ldez, Francesc Francisco; Townsend, Jeffrey P.

    2010-01-01

    Summary: We present LOX (Level Of eXpression) that estimates the Level Of gene eXpression from high-throughput-expressed sequence datasets with multiple treatments or samples. Unlike most analyses, LOX incorporates a gene bias model that facilitates integration of diverse transcriptomic sequencing data that arises when transcriptomic data have been produced using diverse experimental methodologies. LOX integrates overall sequence count tallies normalized by total expressed sequence count to provide expression levels for each gene relative to all treatments as well as Bayesian credible intervals. © The Author 2010. Published by Oxford University Press. All rights reserved.

  12. Isolation and characterization of gene sequences expressed in cotton fiber

    Directory of Open Access Journals (Sweden)

    Taciana de Carvalho Coutinho

    2016-06-01

    Full Text Available ABSTRACT Cotton fiber are tubular cells which develop from the differentiation of ovule epidermis. In addition to being one of the most important natural fiber of the textile group, cotton fiber afford an excellent experimental system for studying the cell wall. The aim of this work was to isolate and characterise the genes expressed in cotton fiber (Gossypium hirsutum L. to be used in future work in cotton breeding. Fiber of the cotton cultivar CNPA ITA 90 II were used to extract RNA for the subsequent generation of a cDNA library. Seventeen sequences were obtained, of which 14 were already described in the NCBI database (National Centre for Biotechnology Information, such as those encoding the lipid transfer proteins (LTPs and arabinogalactans (AGP. However, other cDNAs such as the B05 clone, which displays homology with the glycosyltransferases, have still not been described for this crop. Nevertheless, results showed that several clones obtained in this study are associated with cell wall proteins, wall-modifying enzymes and lipid transfer proteins directly involved in fiber development.

  13. Baculovirus-mediated expression and isolation of human ribosomal phosphoprotein P0 carrying a GST-tag in a functional state

    International Nuclear Information System (INIS)

    Abo, Yohichi; Hagiya, Akiko; Naganuma, Takao; Tohkairin, Yukiko; Shiomi, Kunihiro; Kajiura, Zenta; Hachimori, Akira; Uchiumi, Toshio; Nakagaki, Masao

    2004-01-01

    We constructed an overexpression system for human ribosomal phosphoprotein P0, together with P1 and P2, which is crucially important for translation. Genes for these proteins, fused with the glutathione S-transferase (GST)-tag at the N-terminus, were inserted into baculovirus and introduced to insect cells. The fusion proteins, but not the proteins without the tag, were efficiently expressed into cells as soluble forms. The fusion protein GST.P0 as well as GST.P1/GST.P2 was phosphorylated in cells as detected by incorporation of 32 P and reactivity with monoclonal anti-phosphoserine antibody. GST.P0 expressed in insect cells, but not the protein obtained in Escherichia coli, had the ability to form a complex with P1 and P2 proteins and to bind to 28S rRNA. Moreover, the GST.P0-P1-P2 complex participated in high eEF-2-dependent GTPase activity. Baculovirus expression systems appear to provide recombinant human P0 samples that can be used for studies on the structure and function

  14. Genes differentially expressed in medulloblastoma and fetal brain

    NARCIS (Netherlands)

    Michiels, E. M.; Oussoren, E.; van Groenigen, M.; Pauws, E.; Bossuyt, P. M.; Voûte, P. A.; Baas, F.

    1999-01-01

    Serial analysis of gene expression (SAGE) was used to identify genes that might be involved in the development or growth of medulloblastoma, a childhood brain tumor. Sequence tags from medulloblastoma (10229) and fetal brain (10692) were determined. The distributions of sequence tags in each

  15. Establishment of a tagged variant of Lgr4 receptor suitable for functional and expression studies in the mouse

    Czech Academy of Sciences Publication Activity Database

    Kříž, Vítězslav; Krausová, Michaela; Burešová, Petra; Dobeš, Jan; Hrčkulák, Dušan; Babošová, Oľga; Švec, Jiří; Kořínek, Vladimír

    2017-01-01

    Roč. 26, č. 5 (2017), s. 689-701 ISSN 0962-8819 R&D Projects: GA ČR(CZ) GA14-33952S; GA MŠk LO1419; GA MŠk(CZ) LM2015040; GA MŠk ED2.1.00/19.0395 Keywords : Genome editing * Hemagglutinin tag * Knock-in * R-spondin * TALENs * Wnt signaling Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Biochemistry and molecular biology Impact factor: 2.341, year: 2016

  16. Unexpected observations after mapping LongSAGE tags to the human genome

    Directory of Open Access Journals (Sweden)

    Duret Laurent

    2007-05-01

    Full Text Available Abstract Background SAGE has been used widely to study the expression of known transcripts, but much less to annotate new transcribed regions. LongSAGE produces tags that are sufficiently long to be reliably mapped to a whole-genome sequence. Here we used this property to study the position of human LongSAGE tags obtained from all public libraries. We focused mainly on tags that do not map to known transcripts. Results Using a published error rate in SAGE libraries, we first removed the tags likely to result from sequencing errors. We then observed that an unexpectedly large number of the remaining tags still did not match the genome sequence. Some of these correspond to parts of human mRNAs, such as polyA tails, junctions between two exons and polymorphic regions of transcripts. Another non-negligible proportion can be attributed to contamination by murine transcripts and to residual sequencing errors. After filtering out our data with these screens to ensure that our dataset is highly reliable, we studied the tags that map once to the genome. 31% of these tags correspond to unannotated transcripts. The others map to known transcribed regions, but many of them (nearly half are located either in antisense or in new variants of these known transcripts. Conclusion We performed a comprehensive study of all publicly available human LongSAGE tags, and carefully verified the reliability of these data. We found the potential origin of many tags that did not match the human genome sequence. The properties of the remaining tags imply that the level of sequencing error may have been under-estimated. The frequency of tags matching once the genome sequence but not in an annotated exon suggests that the human transcriptome is much more complex than shown by the current human genome annotations, with many new splicing variants and antisense transcripts. SAGE data is appropriate to map new transcripts to the genome, as demonstrated by the high rate of cross

  17. Cloning, sequencing and expression of cDNA encoding growth ...

    Indian Academy of Sciences (India)

    Unknown

    of medicine, animal husbandry, fish farming and animal ..... northern pike (Esox lucius) growth hormone; Mol. Mar. Biol. ... prolactin 1-luciferase fusion gene in African catfish and ... 1988 Cloning and sequencing of cDNA that encodes goat.

  18. Reliable and inexpensive expression of large, tagged, exogenous proteins in murine bone marrow-derived macrophages using a second generation lentiviral system

    Directory of Open Access Journals (Sweden)

    Matthew R. Miller

    2015-08-01

    Full Text Available Over the past two decades, researchers have struggled to efficiently express foreign DNA in primary macrophages, impeding research progress. The applications of lipofection, electroporation, microinjection, and viral-mediated transfer typically result in disruptions in macrophage differentiation and function, low expression levels of exogenous proteins, limited efficiency and high cell mortality. In this report, after extensive optimization, we present a method of expressing large tagged proteins at high efficiency, consistency, and low cost using lentiviral infection. This method utilizes laboratory-propagated second generation plasmids to produce efficient virus that can be stored for later use. The expression of proteins up to 150 kDa in size is achieved in 30–70% of cells while maintaining normal macrophage differentiation and morphology as determined by fluorescence microscopy and Western blot analysis. This manuscript delineates the reagents and methods used to produce lentivirus to express exogenous DNA in murine bone marrow-derived macrophages sufficient for single cell microscopy as well as functional assays requiring large numbers of murine bone marrow-derived macrophages.

  19. Sequence and expression analyses of ethylene response factors highly expressed in latex cells from Hevea brasiliensis.

    Directory of Open Access Journals (Sweden)

    Piyanuch Piyatrakul

    Full Text Available The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors.

  20. Efficient expression of a soluble lipid transfer protein (LTP) of Platanus orientalis using short peptide tags and structural comparison with the natural form.

    Science.gov (United States)

    Salari, Farhad; Vahedi, Fatemeh; Chamani, Jamshidkhan; Varasteh, Abdolreza; Ketabdar, Hanieh; Sankian, Mojtaba

    2015-01-01

    Successful recombinant allergen-based immunotherapy has drawn a great deal of attention to use recombinant allergens for new therapeutic and/or diagnostic strategies. The Escherichia coli expression system is frequently used to produce recombinant allergens; however, protein expression in E. coli often results in inclusion bodies. Here, we focused on the expression of two recombinant soluble forms of Pla or 3 using solubility-enhancing peptide tags, human immune deficiency virus type 1 transactivator of transcription core domain and poly-arginine-lysine: rTAT-Pla or 3 and rPoly-Arg-Lys-Pla or 3. Structural characteristics and IgE reactivity of purified recombinant proteins were compared with natural Pla or 3 (nPla or 3) isolated from Platanus orientalis using circular dichroism spectra, fluorescence spectroscopy, and immunoblotting. Likewise, intrinsic viscosity and Stokes radius of the natural and recombinant Pla or 3 allergens were determined to analyze structural compactness in aqueous media. The results indicate high-level solubility and efficient expression of the fusion proteins (rTAT-Pla or 3 and rPoly-Arg-Lys-Pla or 3) compared with the wild-type recombinant. Furthermore, the similar structural characteristics and IgE-binding activities of the fusion proteins to nPla or 3 provide a promising tool for allergy diagnosis and treatment. © 2014 International Union of Biochemistry and Molecular Biology, Inc.

  1. Molecular cloning, expression analysis and sequence prediction of ...

    African Journals Online (AJOL)

    CCAAT/enhancer-binding protein beta as an essential transcriptional factor, regulates the differentiation of adipocytes and the deposition of fat. Herein, we cloned the whole open reading frame (ORF) of bovine C/EBPβ gene and analyzed its putative protein structures via DNA cloning and sequence analysis. Then, the ...

  2. Transcriptome profiling and digital gene expression by deep sequencing in early somatic embryogenesis of endangered medicinal Eleutherococcus senticosus Maxim.

    Science.gov (United States)

    Tao, Lei; Zhao, Yue; Wu, Ying; Wang, Qiuyu; Yuan, Hongmei; Zhao, Lijuan; Guo, Wendong; You, Xiangling

    2016-03-01

    Somatic embryogenesis (SE) has been studied as a model system to understand molecular events in physiology, biochemistry, and cytology during plant embryo development. In particular, it is exceedingly difficult to access the morphological and early regulatory events in zygotic embryos. To understand the molecular mechanisms regulating early SE in Eleutherococcus senticosus Maxim., we used high-throughput RNA-Seq technology to investigate its transcriptome. We obtained 58,327,688 reads, which were assembled into 75,803 unique unigenes. To better understand their functions, the unigenes were annotated using the Clusters of Orthologous Groups, Gene Ontology, and Kyoto Encyclopedia of Genes and Genomes databases. Digital gene expression libraries revealed differences in gene expression profiles at different developmental stages (embryogenic callus, yellow embryogenic callus, global embryo). We obtained a sequencing depth of >5.6 million tags per sample and identified many differentially expressed genes at various stages of SE. The initiation of SE affected gene expression in many KEGG pathways, but predominantly that in metabolic pathways, biosynthesis of secondary metabolites, and plant hormone signal transduction. This information on the changes in the multiple pathways related to SE induction in E. senticosus Maxim. embryogenic tissue will contribute to a more comprehensive understanding of the mechanisms involved in early SE. Additionally, the differentially expressed genes may act as molecular markers and could play very important roles in the early stage of SE. The results are a comprehensive molecular biology resource for investigating SE of E. senticosus Maxim. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. Intragenic HIV-1 env sequences that enhance gag expression

    International Nuclear Information System (INIS)

    Suptawiwat, Ornpreya; Sutthent, Ruengpung; Lee, T.-H.; Auewarakul, Prasert

    2003-01-01

    Expression of HIV-1 genes is regulated at multiple levels including the complex RNA splicing and transport mechanisms. Multiple cis-acting elements involved in these regulations have been previously identified in various regions of HIV-1 genome. Here we show that another cis-acting element was present in HIV-1 env region. This element enhanced the expression of Gag when inserted together with Rev response element (RRE) into a truncated HIV-1 genome in the presence of Rev. The enhancing activity was mapped to a 263-bp fragment in the gp41 region downstream to RRE. RNA analysis showed that it might function by promoting RNA stability and Rev-dependent RNA export. The enhancement was specific to Rev-dependent expression, since it did not enhance Gag expression driven by Sam68, a cellular protein that has been shown to be able to substitute for Rev in RNA export function

  4. Isolation, sequencing and expression of RED, a novel human gene encoding an acidic-basic dipeptide repeat.

    Science.gov (United States)

    Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T

    1999-04-16

    A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.

  5. The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia Formosa

    Science.gov (United States)

    Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

    2014-01-01

    Numerous microsatellite markers were developed for Aquilegia formosafrom sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...

  6. Yellowtail Tagging Data (MRDBS)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Yellowtail Flounder Tagging Program began in 2003 and works with commercial fishermen to tag and release yellowtaiI flounder with pink and yellow disc tags or...

  7. Quantitative Proteomics Analysis of Altered Protein Expression in the Placental Villous Tissue of Early Pregnancy Loss Using Isobaric Tandem Mass Tags

    Directory of Open Access Journals (Sweden)

    Xiaobei Ni

    2014-01-01

    Full Text Available Many pregnant women suffer miscarriages during early gestation, but the description of these early pregnancy losses (EPL can be somewhat confusing because of the complexities of early development. Thus, the identification of proteins with different expression profiles related to early pregnancy loss is essential for understanding the comprehensive pathophysiological mechanism. In this study, we report a gel-free tandem mass tags- (TMT- labeling based proteomic analysis of five placental villous tissues from patients with early pregnancy loss and five from normal pregnant women. The application of this method resulted in the identification of 3423 proteins and 19647 peptides among the patient group and the matched normal control group. Qualitative and quantitative proteomic analysis revealed 51 proteins to be differentially abundant between the two groups (≥1.2-fold, Student's t-test, P<0.05. To obtain an overview of the biological functions of the proteins whose expression levels altered significantly in EPL group, gene ontology analysis was performed. We also investigated the twelve proteins with a difference over 1.5-fold using pathways analysis. Our results demonstrate that the gel-free TMT-based proteomic approach allows the quantification of differences in protein expression levels, which is useful for obtaining molecular insights into early pregnancy loss.

  8. Serial analysis of gene expression (SAGE) in normal human trabecular meshwork.

    Science.gov (United States)

    Liu, Yutao; Munro, Drew; Layfield, David; Dellinger, Andrew; Walter, Jeffrey; Peterson, Katherine; Rickman, Catherine Bowes; Allingham, R Rand; Hauser, Michael A

    2011-04-08

    To identify the genes expressed in normal human trabecular meshwork tissue, a tissue critical to the pathogenesis of glaucoma. Total RNA was extracted from human trabecular meshwork (HTM) harvested from 3 different donors. Extracted RNA was used to synthesize individual SAGE (serial analysis of gene expression) libraries using the I-SAGE Long kit from Invitrogen. Libraries were analyzed using SAGE 2000 software to extract the 17 base pair sequence tags. The extracted sequence tags were mapped to the genome using SAGE Genie map. A total of 298,834 SAGE tags were identified from all HTM libraries (96,842, 88,126, and 113,866 tags, respectively). Collectively, there were 107,325 unique tags. There were 10,329 unique tags with a minimum of 2 counts from a single library. These tags were mapped to known unique Unigene clusters. Approximately 29% of the tags (orphan tags) did not map to a known Unigene cluster. Thirteen percent of the tags mapped to at least 2 Unigene clusters. Sequence tags from many glaucoma-related genes, including myocilin, optineurin, and WD repeat domain 36, were identified. This is the first time SAGE analysis has been used to characterize the gene expression profile in normal HTM. SAGE analysis provides an unbiased sampling of gene expression of the target tissue. These data will provide new and valuable information to improve understanding of the biology of human aqueous outflow.

  9. Molecular cloning, sequencing and recombinant expression of a ...

    African Journals Online (AJOL)

    The 4D8 gene was recently discovered in Ixodes scapularis and identified as a tick protective antigen. Vaccination using recombinant 4D8 from I. scapularis showed a significant reduction against I. scapularis tick infestation in a sheep model. This protein is expressed in both salivary gland and gut tissues, and is thought to ...

  10. Investigation of ANGPTL3 expression, exon sequence and promotor ...

    African Journals Online (AJOL)

    like proteins, has been demonstrated to affect lipid metabolism by inhibiting the activity of lipoprotein lipase (LPL). Objective: To compare the ANGPTL3 mRNA and protein expression, exon mutation and promoter district CpG island methylation ...

  11. Molecular cloning, sequence analysis and tissue expression of ...

    African Journals Online (AJOL)

    Proofreader

    2017-10-01

    Oct 1, 2017 ... p-distance model for amino acid substitutions. A bootstrap .... These were a thymine/cytosine (T/C) SNP and a thymine/adenine (T/A) SNP. ..... Two rat homologues of Drosophila achaete–scute specifically expressed in ...

  12. Tissue Distribution of Kir7.1 Inwardly Rectifying K+ Channel Probed in a Knock-in Mouse Expressing a Haemagglutinin-Tagged Protein

    Directory of Open Access Journals (Sweden)

    Isabel Cornejo

    2018-04-01

    Full Text Available Kir7.1 encoded by the Kcnj13 gene in the mouse is an inwardly rectifying K+ channel present in epithelia where it shares membrane localization with the Na+/K+-pump. Further investigations of the localisation and function of Kir7.1 would benefit from the availability of a knockout mouse, but perinatal mortality attributed to cleft palate in the neonate has thwarted this research. To facilitate localisation studies we now use CRISPR/Cas9 technology to generate a knock-in mouse, the Kir7.1-HA that expresses the channel tagged with a haemagglutinin (HA epitope. The availability of antibodies for the HA epitope allows for application of western blot and immunolocalisation methods using widely available anti-HA antibodies with WT tissues providing unambiguous negative control. We demonstrate that Kir7.1-HA cloned from the choroid plexus of the knock-in mouse has the electrophysiological properties of the native channel, including characteristically large Rb+ currents. These large Kir7.1-mediated currents are accompanied by abundant apical membrane Kir7.1-HA immunoreactivity. WT-controlled western blots demonstrate the presence of Kir7.1-HA in the eye and the choroid plexus, trachea and lung, and intestinal epithelium but exclusively in the ileum. In the kidney, and at variance with previous reports in the rat and guinea-pig, Kir7.1-HA is expressed in the inner medulla but not in the cortex or outer medulla. In isolated tubules immunoreactivity was associated with inner medulla collecting ducts but not thin limbs of the loop of Henle. Kir7.1-HA shows basolateral expression in the respiratory tract epithelium from trachea to bronchioli. The channel also appears basolateral in the epithelium of the nasal cavity and nasopharynx in newborn animals. We show that HA-tagged Kir7.1 channel introduced in the mouse by a knock-in procedure has functional properties similar to the native protein and the animal thus generated has clear advantages in localisation

  13. Tissue Distribution of Kir7.1 Inwardly Rectifying K+ Channel Probed in a Knock-in Mouse Expressing a Haemagglutinin-Tagged Protein.

    Science.gov (United States)

    Cornejo, Isabel; Villanueva, Sandra; Burgos, Johanna; López-Cayuqueo, Karen I; Chambrey, Régine; Julio-Kalajzić, Francisca; Buelvas, Neudo; Niemeyer, María I; Figueiras-Fierro, Dulce; Brown, Peter D; Sepúlveda, Francisco V; Cid, L P

    2018-01-01

    Kir7.1 encoded by the Kcnj13 gene in the mouse is an inwardly rectifying K + channel present in epithelia where it shares membrane localization with the Na + /K + -pump. Further investigations of the localisation and function of Kir7.1 would benefit from the availability of a knockout mouse, but perinatal mortality attributed to cleft palate in the neonate has thwarted this research. To facilitate localisation studies we now use CRISPR/Cas9 technology to generate a knock-in mouse, the Kir7.1-HA that expresses the channel tagged with a haemagglutinin (HA) epitope. The availability of antibodies for the HA epitope allows for application of western blot and immunolocalisation methods using widely available anti-HA antibodies with WT tissues providing unambiguous negative control. We demonstrate that Kir7.1-HA cloned from the choroid plexus of the knock-in mouse has the electrophysiological properties of the native channel, including characteristically large Rb + currents. These large Kir7.1-mediated currents are accompanied by abundant apical membrane Kir7.1-HA immunoreactivity. WT-controlled western blots demonstrate the presence of Kir7.1-HA in the eye and the choroid plexus, trachea and lung, and intestinal epithelium but exclusively in the ileum. In the kidney, and at variance with previous reports in the rat and guinea-pig, Kir7.1-HA is expressed in the inner medulla but not in the cortex or outer medulla. In isolated tubules immunoreactivity was associated with inner medulla collecting ducts but not thin limbs of the loop of Henle. Kir7.1-HA shows basolateral expression in the respiratory tract epithelium from trachea to bronchioli. The channel also appears basolateral in the epithelium of the nasal cavity and nasopharynx in newborn animals. We show that HA-tagged Kir7.1 channel introduced in the mouse by a knock-in procedure has functional properties similar to the native protein and the animal thus generated has clear advantages in localisation studies. It

  14. Behavioral tagging of extinction learning.

    Science.gov (United States)

    de Carvalho Myskiw, Jociane; Benetti, Fernando; Izquierdo, Iván

    2013-01-15

    Extinction of contextual fear in rats is enhanced by exposure to a novel environment at 1-2 h before or 1 h after extinction training. This effect is antagonized by administration of protein synthesis inhibitors anisomycin and rapamycin into the hippocampus, but not into the amygdala, immediately after either novelty or extinction training, as well as by the gene expression blocker 5,6-dichloro-1-beta-D-ribofuranosylbenzimidazole administered after novelty training, but not after extinction training. Thus, this effect can be attributed to a mechanism similar to synaptic tagging, through which long-term potentiation can be enhanced by other long-term potentiations or by exposure to a novel environment in a protein synthesis-dependent fashion. Extinction learning produces a tag at the appropriate synapses, whereas novelty learning causes the synthesis of plasticity-related proteins that are captured by the tag, strengthening the synapses that generated this tag.

  15. An expressed sequence tag (EST) library for Drosophila serrata, a model system for sexual selection and climatic adaptation studies

    OpenAIRE

    Frentiu, Francesca D; Adamski, Marcin; McGraw, Elizabeth A; Blows, Mark W; Chenoweth, Stephen F

    2009-01-01

    Abstract Background The native Australian fly Drosophila serrata belongs to the highly speciose montium subgroup of the melanogaster species group. It has recently emerged as an excellent model system with which to address a number of important questions, including the evolution of traits under sexual selection and traits involved in climatic adaptation along latitudinal gradients. Understanding the molecular genetic basis of such traits has been limited by a lack of genomic resources for thi...

  16. Use of expressed sequence tags as an alternative approach for the identification of Taenia solium metacestode excretion/secretion proteins.

    NARCIS (Netherlands)

    Victor, B.; Dorny, P.; Kanobana, K.; Polman, K.; Lindh, J.; Deelder, A.M.; Palmblad, M.; Gabriel, S.

    2013-01-01

    Background: Taenia solium taeniasis/cysticercosis is a zoonotic helminth infection mainly found in rural regions of Africa, Asia and Latin America. In endemic areas, diagnosis of cysticercosis largely depends on serology, but these methods have their drawbacks and require improvement. This implies

  17. Computational Prediction, Target Identification and Experimental Validation of miRNAs from Expressed Sequence Tags in Cannabis sativa L

    Czech Academy of Sciences Publication Activity Database

    Duraisamy, Ganesh Selvaraj; Mishra, Ajay Kumar; Jakše, J.; Matoušek, Jaroslav

    2015-01-01

    Roč. 4, č. 2 (2015), s. 32-42 ISSN 2320-0189 R&D Projects: GA ČR GA13-03037S Institutional support: RVO:60077344 Keywords : Cannabis sativa * microRNAs * Cis-regulating elements * Computational approach Subject RIV: EB - Genetics ; Molecular Biology

  18. Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

    Science.gov (United States)

    Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

    2011-01-01

    Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...

  19. Profiling of wheat class III peroxidase genes derived from powdery mildew-attacked epidermis reveals distinct sequence-associated expression patterns.

    Science.gov (United States)

    Liu, Guosheng; Sheng, Xiaoyan; Greenshields, David L; Ogieglo, Adam; Kaminskyj, Susan; Selvaraj, Gopalan; Wei, Yangdou

    2005-07-01

    A cDNA library was constructed from leaf epidermis of diploid wheat (Triticum monococcum) infected with the powdery mildew fungus (Blumeria graminis f. sp. tritici) and was screened for genes encoding peroxidases. From 2,500 expressed sequence tags (ESTs), 36 cDNAs representing 10 peroxidase genes (designated TmPRX1 to TmPRX10) were isolated and further characterized. Alignment of the deduced amino acid sequences and phylogenetic clustering with peroxidases from other plant species demonstrated that these peroxidases fall into four distinct groups. Differential expression and tissue-specific localization among the members were observed during the B. graminis f. sp. tritici attack using Northern blots and reverse-transcriptase polymerase chain reaction analyses. Consistent with its abundance in the EST collection, TmPRX1 expression showed the highest induction during pathogen attack and fluctuated in response to the fungal parasitic stages. TmPRX1 to TmPRX6 were expressed predominantly in mesophyll cells, whereas TmPRX7 to TmPRX10, which feature a putative C-terminal propeptide, were detectable mainly in epidermal cells. Using TmPRX8 as a representative, we demonstrated that its C-terminal propeptide was sufficient to target a green fluorescent protein fusion protein to the vacuoles in onion cells. Finally, differential expression profiles of the TmPRXs after abiotic stresses and signal molecule treatments were used to dissect the potential role of these peroxidases in multiple stress and defense pathways.

  20. Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

    Science.gov (United States)

    Amirhaeri, S; Wohlrab, F; Wells, R D

    1995-02-17

    The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.

  1. A review of recommendations for sequencing receptive and expressive language instruction.

    Science.gov (United States)

    Petursdottir, Anna Ingeborg; Carr, James E

    2011-01-01

    We review recommendations for sequencing instruction in receptive and expressive language objectives in early and intensive behavioral intervention (EIBI) programs. Several books recommend completing receptive protocols before introducing corresponding expressive protocols. However, this recommendation has little empirical support, and some evidence exists that the reverse sequence may be more efficient. Alternative recommendations include teaching receptive and expressive skills simultaneously (M. L. Sundberg & Partington, 1998) and building learning histories that lead to acquisition of receptive and expressive skills without direct instruction (Greer & Ross, 2008). Empirical support for these recommendations also is limited. Future research should assess the relative efficiency of receptive-before-expressive, expressive-before-receptive, and simultaneous training with children who have diagnoses of autism spectrum disorders. In addition, further evaluation is needed of the potential benefits of multiple-exemplar training and other variables that may influence the efficiency of receptive and expressive instruction.

  2. A Lactococcus lactis expression vector set with multiple affinity tags to facilitate isolation and direct labeling of heterologous secreted proteins

    NARCIS (Netherlands)

    Pastrana, Francisco Romero; Neef, Jolanda; van Dijl, Jan Maarten; Buist, Girbe

    The gram-positive bacterium Lactococcus lactis is a useful host for extracellular protein production. A main advantage of L. lactis over other bacterial expression systems is that lactococcal cells display low levels of autolysis and proteolysis. Previously, we developed a set of vectors for

  3. Generation and annotation of lodgepole pine and oleoresin-induced expressed sequences from the blue-stain fungus Ophiostoma clavigerum, a Mountain Pine Beetle-associated pathogen.

    Science.gov (United States)

    DiGuistini, Scott; Ralph, Steven G; Lim, Young W; Holt, Robert; Jones, Steven; Bohlmann, Jörg; Breuil, Colette

    2007-02-01

    Ophiostoma clavigerum is a destructive pathogen of lodgepole pine (Pinus contorta) forests in western North America. It is therefore a relevant system for a genomics analysis of fungi vectored by bark beetles. To begin characterizing molecular interactions between the pathogen and its conifer host, we created an expressed sequence tag (EST) collection for O. clavigerum. Lodgepole pine sawdust and oleoresin media were selected to stimulate gene expression that would be specific to this host interaction. Over 6500 cDNA clones, derived from four normalized cDNA libraries, were single-pass sequenced from the 3' end. After quality screening, we identified 5975 high-quality reads with an average PHRED 20 of greater than 750 bp. Clustering and assembly of this high-quality EST set resulted in the identification of 2620 unique putative transcripts. BLASTX analysis revealed that only 67% of these unique transcripts could be matched to known or predicted protein sequences in public databases. Functional classification of these sequences provided initial insights into the transcriptome of O. clavigerum. Of particular interest, our ESTs represent an extensive collection of cytochrome P450 s, ATP-binding-cassette-type transporters and genes involved in 1,8-dihydroxynaphthalene-melanin biosynthesis. These results are discussed in the context of detoxification of conifer oleoresins and fungal pathogenesis.

  4. Gene gun bombardment-mediated expression and translocation of EGFP-tagged GLUT4 in skeletal muscle fibres in vivo

    DEFF Research Database (Denmark)

    Lauritzen, Hans P M M; Reynet, Christine; Schjerling, Peter

    2002-01-01

    the enhanced green fluorescent protein (EGFP) labelling technique with physical transfection methods in vivo: intramuscular plasmid injection or gene gun bombardment. During optimisation experiments with plasmid coding for the EGFP reporter alone EGFP-positive muscle fibres were counted after collagenase...... treatment of in vivo transfected flexor digitorum brevis (FDB) muscles. In contrast to gene gun bombardment, intramuscular injection produced EGFP expression in only a few fibres. Regardless of the transfection technique, EGFP expression was higher in muscles from 2-week-old rats than in those from 6-week......Cellular protein trafficking has been studied to date only in vitro or with techniques that are invasive and have a low time resolution. To establish a gentle method for analysis of glucose transporter-4 (GLUT4) trafficking in vivo in fully differentiated rat skeletal muscle fibres we combined...

  5. Effect of 5'-flanking sequence deletions on expression of the human insulin gene in transgenic mice

    DEFF Research Database (Denmark)

    Fromont-Racine, M; Bucchini, D; Madsen, O

    1990-01-01

    Expression of the human insulin gene was examined in transgenic mouse lines carrying the gene with various lengths of DNA sequences 5' to the transcription start site (+1). Expression of the transgene was demonstrated by 1) the presence of human C-peptide in urine, 2) the presence of specific...... of the transgene was observed in cell types other than beta-islet cells....

  6. Expression of Kirsten murine sarcoma virus sequences in Beagle dog tissues

    Energy Technology Data Exchange (ETDEWEB)

    Kerkof, P R; Kelly, G

    1988-12-01

    Labeled cDNA synthesized from RNA extracted from {sup 238}PuO{sub 2}-, {sup 239}PuO{sub 2}-, and {sup 90}Sr-induced lung tumors in Beagle dogs, from nontumor tissue from {sup 239}PuO{sub 2}-exposed dogs, and from unexposed dog lung and liver tissue produces strong hybridization signals with a plasmid (pKSma) that contains Kirsten murine sarcoma virus (KMSV) sequences. At least 90 percent of the KMSV sequences are expressed in these dog tissues, including sequences corresponding to p21 K-ras, qp70 envelope glycoprotein, and at least one other proviral sequence. The expression of Kirsten ras and other sarcoma virus sequences may have important implications for the interpretation of carcinogenesis studies in these dogs. (author)

  7. Expression of Kirsten murine sarcoma virus sequences in Beagle dog tissues

    International Nuclear Information System (INIS)

    Kerkof, P.R.; Kelly, G.

    1988-01-01

    Labeled cDNA synthesized from RNA extracted from 238 PuO 2 -, 239 PuO 2 -, and 90 Sr-induced lung tumors in Beagle dogs, from nontumor tissue from 239 PuO 2 -exposed dogs, and from unexposed dog lung and liver tissue produces strong hybridization signals with a plasmid (pKSma) that contains Kirsten murine sarcoma virus (KMSV) sequences. At least 90 percent of the KMSV sequences are expressed in these dog tissues, including sequences corresponding to p21 K-ras, qp70 envelope glycoprotein, and at least one other proviral sequence. The expression of Kirsten ras and other sarcoma virus sequences may have important implications for the interpretation of carcinogenesis studies in these dogs. (author)

  8. Sequence, 'subtle' alternative splicing and expression of the CYYR1 (cysteine/tyrosine-rich 1) mRNA in human neuroendocrine tumors

    International Nuclear Information System (INIS)

    Vitale, Lorenza; Coppola, Domenico; Strippoli, Pierluigi; Frabetti, Flavia; Huntsman, Shane A; Canaider, Silvia; Casadei, Raffaella; Lenzi, Luca; Facchin, Federica; Carinci, Paolo; Zannotti, Maria

    2007-01-01

    CYYR1 is a recently identified gene located on human chromosome 21 whose product has no similarity to any known protein and is of unknown function. Analysis of expressed sequence tags (ESTs) have revealed high human CYYR1 expression in cells belonging to the diffuse neuroendocrine system (DNES). These cells may be the origin of neuroendocrine (NE) tumors. The aim of this study was to conduct an initial analysis of sequence, splicing and expression of the CYYR1 mRNA in human NE tumors. The CYYR1 mRNA coding sequence (CDS) was studied in 32 NE tumors by RT-PCR and sequence analysis. A subtle alternative splicing was identified generating two isoforms of CYYR1 mRNA differing in terms of the absence (CAG - isoform, the first described mRNA for CYYR1 locus) or the presence (CAG + isoform) of a CAG codon. When present, this specific codon determines the presence of an alanine residue, at the exon 3/exon 4 junction of the CYYR1 mRNA. The two mRNA isoform amounts were determined by quantitative relative RT-PCR in 29 NE tumors, 2 non-neuroendocrine tumors and 10 normal tissues. A bioinformatic analysis was performed to search for the existence of the two CYYR1 isoforms in other species. The CYYR1 CDS did not show differences compared to the reference sequence in any of the samples, with the exception of an NE tumor arising in the neck region. Sequence analysis of this tumor identified a change in the CDS 333 position (T instead of C), leading to the amino acid mutation P111S. NE tumor samples showed no significant difference in either CYYR1 CAG - or CAG + isoform expression compared to control tissues. CYYR1 CAG - isoform was significantly more expressed than CAG + isoform in NE tumors as well as in control samples investigated. Bioinformatic analysis revealed that only the genomic sequence of Pan troglodytes CYYR1 is consistent with the possible existence of the two described mRNA isoforms. A new 'subtle' splicing isoform (CAG + ) of CYYR1 mRNA, the sequence and

  9. Strep-Tagged Protein Purification.

    Science.gov (United States)

    Maertens, Barbara; Spriestersbach, Anne; Kubicek, Jan; Schäfer, Frank

    2015-01-01

    The Strep-tag system can be used to purify recombinant proteins from any expression system. Here, protocols for lysis and affinity purification of Strep-tagged proteins from E. coli, baculovirus-infected insect cells, and transfected mammalian cells are given. Depending on the amount of Strep-tagged protein in the lysate, a protocol for batch binding and subsequent washing and eluting by gravity flow can be used. Agarose-based matrices with the coupled Strep-Tactin ligand are the resins of choice, with a binding capacity of up to 9 mg ml(-1). For purification of lower amounts of Strep-tagged proteins, the use of Strep-Tactin magnetic beads is suitable. In addition, Strep-tagged protein purification can also be automated using prepacked columns for FPLC or other liquid-handling chromatography instrumentation, but automated purification is not discussed in this protocol. The protocols described here can be regarded as an update of the Strep-Tag Protein Handbook (Qiagen, 2009). © 2015 Elsevier Inc. All rights reserved.

  10. Quantitative miRNA expression analysis: comparing microarrays with next-generation sequencing

    DEFF Research Database (Denmark)

    Willenbrock, Hanni; Salomon, Jesper; Søkilde, Rolf

    2009-01-01

    Recently, next-generation sequencing has been introduced as a promising, new platform for assessing the copy number of transcripts, while the existing microarray technology is considered less reliable for absolute, quantitative expression measurements. Nonetheless, so far, results from the two...... technologies have only been compared based on biological data, leading to the conclusion that, although they are somewhat correlated, expression values differ significantly. Here, we use synthetic RNA samples, resembling human microRNA samples, to find that microarray expression measures actually correlate...... better with sample RNA content than expression measures obtained from sequencing data. In addition, microarrays appear highly sensitive and perform equivalently to next-generation sequencing in terms of reproducibility and relative ratio quantification....

  11. mESAdb: microRNA expression and sequence analysis database.

    Science.gov (United States)

    Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

    2011-01-01

    microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.

  12. Single-step affinity and cost-effective purification of recombinant proteins using the Sepharose-binding lectin-tag from the mushroom Laetiporus sulphureus as fusion partner.

    Science.gov (United States)

    Li, Xiao-Jing; Liu, Jin-Ling; Gao, Dong-Sheng; Wan, Wen-Yan; Yang, Xia; Li, Yong-Tao; Chang, Hong-Tao; Chen, Lu; Wang, Chuan-Qing; Zhao, Jun

    2016-03-01

    Previous research showed that a lectin from the mushroom Laetiporus sulphureus, designed LSL, bound to Sepharose and could be eluted by lactose. In this study, by taking advantage of the strong affinity of LSL-tag for Sepharose, we developed a single-step purification method for LSL-tagged fusion proteins. We utilized unmodified Sepharose-4B as a specific adsorbent and 0.2 M lactose solution as an elution buffer. Fusion proteins of LSL-tag and porcine circovirus capsid protein, designated LSL-Cap was recovered with purity of 90 ± 4%, and yield of 87 ± 3% from crude extract of recombinant Escherichia coli. To enable the remove of LSL-tag, tobacco etch virus (TEV) protease recognition sequence was placed downstream of LSL-tag in the expression vector, and LSL-tagged TEV protease, designated LSL-TEV, was also expressed in E. coli., and was recovered with purity of 82 ± 5%, and yield of 85 ± 2% from crude extract of recombinant E. coli. After digestion of LSL-tagged recombinant proteins with LSL-TEV, the LSL tag and LSL-TEV can be easily removed by passing the digested products through the Sepharose column. It is of worthy noting that the Sepharose can be reused after washing with PBS. The LSL affinity purification method enables rapid and inexpensive purification of LSL-tagged fusion proteins and scale-up production of native proteins. Copyright © 2015 Elsevier Inc. All rights reserved.

  13. Presence and Expression of Microbial Genes Regulating Soil Nitrogen Dynamics Along the Tanana River Successional Sequence

    Science.gov (United States)

    Boone, R. D.; Rogers, S. L.

    2004-12-01

    We report on work to assess the functional gene sequences for soil microbiota that control nitrogen cycle pathways along the successional sequence (willow, alder, poplar, white spruce, black spruce) on the Tanana River floodplain, Interior Alaska. Microbial DNA and mRNA were extracted from soils (0-10 cm depth) for amoA (ammonium monooxygenase), nifH (nitrogenase reductase), napA (nitrate reductase), and nirS and nirK (nitrite reductase) genes. Gene presence was determined by amplification of a conserved sequence of each gene employing sequence specific oligonucleotide primers and Polymerase Chain Reaction (PCR). Expression of the genes was measured via nested reverse transcriptase PCR amplification of the extracted mRNA. Amplified PCR products were visualized on agarose electrophoresis gels. All five successional stages show evidence for the presence and expression of microbial genes that regulate N fixation (free-living), nitrification, and nitrate reduction. We detected (1) nifH, napA, and nirK presence and amoA expression (mRNA production) for all five successional stages and (2) nirS and amoA presence and nifH, nirK, and napA expression for early successional stages (willow, alder, poplar). The results highlight that the existing body of previous process-level work has not sufficiently considered the microbial potential for a nitrate economy and free-living N fixation along the complete floodplain successional sequence.

  14. Identification and expression of the tig gene coding for trigger factor from psychrophilic bacteria with no information of genome sequence available.

    Science.gov (United States)

    Lee, Kyunghee; Choi, Hyojung; Im, Hana

    2009-08-01

    Trigger factor (TF) plays a key role as a molecular chaperone with a peptidyl-prolyl cis-trans isomerase (PPIase) activity by which cells promote folding of newly synthesized proteins coming out of ribosomes. Since psychrophilic bacteria grow at a quite low temperature, between 4 and 15 degrees C, TF from such bacteria was investigated and compared with that of mesophilic bacteria E. coli in order to offer an explanation of cold-adaptation at a molecular level. Using a combination of gradient PCRs with homologous primers and LA PCR in vitro cloning technology, the tig gene was fully identified from Psychromonas arctica, whose genome sequence is not yet available. The resulting amino acid sequence of the TF was compared with other homologous TFs using sequence alignments to search for common domains. In addition, we have developed a protein expression system, by which TF proteins from P. arctica (PaTF) were produced by IPTG induction upon cloning the tig gene on expression vectors, such as pAED4. We have further examined the role of expressed psychrophilic PaTF on survival against cold treatment at 4 degrees C. Finally, we have attempted the in vitro biochemical characterization of TF proteins with His-tags expressed in a pET system, such as the PPIase activity of PaTF protein. Our results demonstrate that the expressed PaTF proteins helped cells survive against cold environments in vivo and the purified PaTF in vitro display the functional PPIase activity in a concentration dependent manner.

  15. ESPRIT: A Method for Defining Soluble Expression Constructs in Poorly Understood Gene Sequences.

    Science.gov (United States)

    Mas, Philippe J; Hart, Darren J

    2017-01-01

    Production of soluble, purifiable domains or multi-domain fragments of proteins is a prerequisite for structural biology and other applications. When target sequences are poorly annotated, or when there are few similar sequences available for alignments, identification of domains can be problematic. A method called expression of soluble proteins by random incremental truncation (ESPRIT) addresses this problem by high-throughput automated screening of tens of thousands of enzymatically truncated gene fragments. Rare soluble constructs are identified by experimental screening, and the boundaries revealed by DNA sequencing.

  16. Molecular cloning, sequence characterization and expression pattern of Rab18 gene from watermelon (Citrullus lanatus).

    Science.gov (United States)

    Xinli, Xiao; Lei, Peng

    2015-03-04

    The complete mRNA sequence of watermelon Rab18 gene was amplified through the rapid amplification of cDNA ends (RACE) method. The full-length mRNA was 1010 bp containing a 645 bp open reading frame, which encodes a protein of 214 amino acids. Sequence analysis revealed that watermelon Rab18 protein shares high homology with the Rab18 of cucumber (99%), muskmelon (98%), Morus notabilis (90%), tomato (89%), wine grape (89%) and potato (88%). Phylogenetic analysis revealed that watermelon Rab18 gene has a closer genetic relationship with Rab18 gene of cucumber and muskmelon. Tissue expression profile analysis indicated that watermelon Rab18 gene was highly expressed in root, stem and leaf, moderately expressed in flower and weakly expressed in fruit.

  17. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-01-01

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH 2 -terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length

  18. Development of simple sequence repeat (SSR) markers that are ...

    African Journals Online (AJOL)

    Simple sequence repeats (SSRs) markers were developed through data mining of 3,803 expressed sequence tags (ESTs) previously published. A total of 144 di- to penta-type SSRs were identified and they were screened for polymorphism between two turnip cultivars, 'Tsuda' and 'Yurugi Akamaru'. Out of 90 EST-SSRs for ...

  19. Gillnet Tag Program

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Certain fishery management programs require vessels to obtain gillnet tags to be used with their gillnet gear. Gillnet tag data is a collection of requests and...

  20. Cooperative Tagging Center (CTC)

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — The Cooperative Tagging Center (CTC) began as the Cooperative Game Fish Tagging Program (GTP) at Woods Hole Oceanographic Institute (WHOI) in 1954. The GTP was...

  1. North Pacific Albacore Tagging

    Data.gov (United States)

    National Oceanic and Atmospheric Administration, Department of Commerce — Conventional tagging data are available from 1971 to 1996. Electronic tagging data are available from 2000 to present. The data are managed by SWFSC in Access...

  2. De novo transcriptome sequencing of axolotl blastema for identification of differentially expressed genes during limb regeneration

    Science.gov (United States)

    2013-01-01

    Background Salamanders are unique among vertebrates in their ability to completely regenerate amputated limbs through the mediation of blastema cells located at the stump ends. This regeneration is nerve-dependent because blastema formation and regeneration does not occur after limb denervation. To obtain the genomic information of blastema tissues, de novo transcriptomes from both blastema tissues and denervated stump ends of Ambystoma mexicanum (axolotls) 14 days post-amputation were sequenced and compared using Solexa DNA sequencing. Results The sequencing done for this study produced 40,688,892 reads that were assembled into 307,345 transcribed sequences. The N50 of transcribed sequence length was 562 bases. A similarity search with known proteins identified 39,200 different genes to be expressed during limb regeneration with a cut-off E-value exceeding 10-5. We annotated assembled sequences by using gene descriptions, gene ontology, and clusters of orthologous group terms. Targeted searches using these annotations showed that the majority of the genes were in the categories of essential metabolic pathways, transcription factors and conserved signaling pathways, and novel candidate genes for regenerative processes. We discovered and confirmed numerous sequences of the candidate genes by using quantitative polymerase chain reaction and in situ hybridization. Conclusion The results of this study demonstrate that de novo transcriptome sequencing allows gene expression analysis in a species lacking genome information and provides the most comprehensive mRNA sequence resources for axolotls. The characterization of the axolotl transcriptome can help elucidate the molecular mechanisms underlying blastema formation during limb regeneration. PMID:23815514

  3. Tagging vs. Controlled Vocabulary

    DEFF Research Database (Denmark)

    Bogers, Toine; Petras, Vivien

    2015-01-01

    The popularity of social tagging has sparked a great deal of debate on whether tags could replace or improve upon professional metadata as descriptors of books and other information objects. In this paper we present a large-scale empirical comparison of the contributions of individual information...... that tags and controlled vocabulary terms do not actually outperform each other consistently, but seem to provide complementary contributions: some information needs are best addressed using controlled vocabulary terms whereas other are best addressed using tags....

  4. Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

    International Nuclear Information System (INIS)

    Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie; Weinberg, Marc S.; Arbuthnot, Patrick

    2009-01-01

    RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.

  5. Inhibition of hepatitis B virus replication with linear DNA sequences expressing antiviral micro-RNA shuttles

    Energy Technology Data Exchange (ETDEWEB)

    Chattopadhyay, Saket; Ely, Abdullah; Bloom, Kristie; Weinberg, Marc S. [Antiviral Gene Therapy Research Unit, University of the Witwatersrand (South Africa); Arbuthnot, Patrick, E-mail: Patrick.Arbuthnot@wits.ac.za [Antiviral Gene Therapy Research Unit, University of the Witwatersrand (South Africa)

    2009-11-20

    RNA interference (RNAi) may be harnessed to inhibit viral gene expression and this approach is being developed to counter chronic infection with hepatitis B virus (HBV). Compared to synthetic RNAi activators, DNA expression cassettes that generate silencing sequences have advantages of sustained efficacy and ease of propagation in plasmid DNA (pDNA). However, the large size of pDNAs and inclusion of sequences conferring antibiotic resistance and immunostimulation limit delivery efficiency and safety. To develop use of alternative DNA templates that may be applied for therapeutic gene silencing, we assessed the usefulness of PCR-generated linear expression cassettes that produce anti-HBV micro-RNA (miR) shuttles. We found that silencing of HBV markers of replication was efficient (>75%) in cell culture and in vivo. miR shuttles were processed to form anti-HBV guide strands and there was no evidence of induction of the interferon response. Modification of terminal sequences to include flanking human adenoviral type-5 inverted terminal repeats was easily achieved and did not compromise silencing efficacy. These linear DNA sequences should have utility in the development of gene silencing applications where modifications of terminal elements with elimination of potentially harmful and non-essential sequences are required.

  6. Cutaneous skin tag

    Science.gov (United States)

    Skin tag; Acrochordon; Fibroepithelial polyp ... have diabetes. They are thought to occur from skin rubbing against skin. ... The tag sticks out of the skin and may have a short, narrow stalk connecting it to the surface of the skin. Some skin tags are as long as ...

  7. Towards Universal Semantic Tagging

    NARCIS (Netherlands)

    Abzianidze, Lasha; Bos, Johan

    2017-01-01

    The paper proposes the task of universal semantic tagging---tagging word tokens with language-neutral, semantically informative tags. We argue that the task, with its independent nature, contributes to better semantic analysis for wide-coverage multilingual text. We present the initial version of

  8. A series of N-terminal epitope tagged Hdh knock-in alleles expressing normal and mutant huntingtin: their application to understanding the effect of increasing the length of normal huntingtin’s polyglutamine stretch on CAG140 mouse model pathogenesis

    Directory of Open Access Journals (Sweden)

    Zheng Shuqiu

    2012-08-01

    Full Text Available Abstract Background Huntington’s disease (HD is an autosomal dominant neurodegenerative disease that is caused by the expansion of a polyglutamine (polyQ stretch within Huntingtin (htt, the protein product of the HD gene. Although studies in vitro have suggested that the mutant htt can act in a potentially dominant negative fashion by sequestering wild-type htt into insoluble protein aggregates, the role of the length of the normal htt polyQ stretch, and the adjacent proline-rich region (PRR in modulating HD mouse model pathogenesis is currently unknown. Results We describe the generation and characterization of a series of knock-in HD mouse models that express versions of the mouse HD gene (Hdh encoding N-terminal hemaglutinin (HA or 3xFlag epitope tagged full-length htt with different polyQ lengths (HA7Q-, 3xFlag7Q-, 3xFlag20Q-, and 3xFlag140Q-htt and substitution of the adjacent mouse PRR with the human PRR (3xFlag20Q- and 3xFlag140Q-htt. Using co-immunoprecipitation and immunohistochemistry analyses, we detect no significant interaction between soluble full-length normal 7Q- htt and mutant (140Q htt, but we do observe N-terminal fragments of epitope-tagged normal htt in mutant htt aggregates. When the sequences encoding normal mouse htt’s polyQ stretch and PRR are replaced with non-pathogenic human sequence in mice also expressing 140Q-htt, aggregation foci within the striatum, and the mean size of htt inclusions are increased, along with an increase in striatal lipofuscin and gliosis. Conclusion In mice, soluble full-length normal and mutant htt are predominantly monomeric. In heterozygous knock-in HD mouse models, substituting the normal mouse polyQ and PRR with normal human sequence can exacerbate some neuropathological phenotypes.

  9. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel

    DEFF Research Database (Denmark)

    Pujolar, J.M.; Jacobsen, M.W.; Frydenberg, J.

    2013-01-01

    Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the Eu...... 425 loci and 376 918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome...

  10. Identification and isolation of gene differentially expressed on scrotal ...

    African Journals Online (AJOL)

    Results of BLAST with GenBank show that three genes or expressed sequence tag (ESTs) were unknown, and there were eight sequences highly identified to be Bos taurus mRNA for proline-rich protein P-B and other sequences were B. taurus ebd-P2 pseudogene, B. taurus similar to F-box only protein 21 isoform 2, ...

  11. A multiplexed miRNA and transgene expression platform for simultaneous repression and expression of protein coding sequences.

    Science.gov (United States)

    Seyhan, Attila A

    2016-01-01

    Knockdown of single or multiple gene targets by RNA interference (RNAi) is necessary to overcome escape mutants or isoform redundancy. It is also necessary to use multiple RNAi reagents to knockdown multiple targets. It is also desirable to express a transgene or positive regulatory elements and inhibit a target gene in a coordinated fashion. This study reports a flexible multiplexed RNAi and transgene platform using endogenous intronic primary microRNAs (pri-miRNAs) as a scaffold located in the green fluorescent protein (GFP) as a model for any functional transgene. The multiplexed intronic miRNA - GFP transgene platform was designed to co-express multiple small RNAs within the polycistronic cluster from a Pol II promoter at more moderate levels to reduce potential vector toxicity. The native intronic miRNAs are co-transcribed with a precursor GFP mRNA as a single transcript and presumably cleaved out of the precursor-(pre) mRNA by the RNA splicing machinery, spliceosome. The spliced intron with miRNA hairpins will be further processed into mature miRNAs or small interfering RNAs (siRNAs) capable of triggering RNAi effects, while the ligated exons become a mature messenger RNA for the translation of the functional GFP protein. Data show that this approach led to robust RNAi-mediated silencing of multiple Renilla Luciferase (R-Luc)-tagged target genes and coordinated expression of functional GFP from a single transcript in transiently transfected HeLa cells. The results demonstrated that this design facilitates the coordinated expression of all mature miRNAs either as individual miRNAs or as multiple miRNAs and the associated protein. The data suggest that, it is possible to simultaneously deliver multiple negative (miRNA or shRNA) and positive (transgene) regulatory elements. Because many cellular processes require simultaneous repression and activation of downstream pathways, this approach offers a platform technology to achieve that dual manipulation efficiently

  12. Extracting Tag Hierarchies

    Science.gov (United States)

    Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

    2013-01-01

    Tagging items with descriptive annotations or keywords is a very natural way to compress and highlight information about the properties of the given entity. Over the years several methods have been proposed for extracting a hierarchy between the tags for systems with a "flat", egalitarian organization of the tags, which is very common when the tags correspond to free words given by numerous independent people. Here we present a complete framework for automated tag hierarchy extraction based on tag occurrence statistics. Along with proposing new algorithms, we are also introducing different quality measures enabling the detailed comparison of competing approaches from different aspects. Furthermore, we set up a synthetic, computer generated benchmark providing a versatile tool for testing, with a couple of tunable parameters capable of generating a wide range of test beds. Beside the computer generated input we also use real data in our studies, including a biological example with a pre-defined hierarchy between the tags. The encouraging similarity between the pre-defined and reconstructed hierarchy, as well as the seemingly meaningful hierarchies obtained for other real systems indicate that tag hierarchy extraction is a very promising direction for further research with a great potential for practical applications. Tags have become very prevalent nowadays in various online platforms ranging from blogs through scientific publications to protein databases. Furthermore, tagging systems dedicated for voluntary tagging of photos, films, books, etc. with free words are also becoming popular. The emerging large collections of tags associated with different objects are often referred to as folksonomies, highlighting their collaborative origin and the “flat” organization of the tags opposed to traditional hierarchical categorization. Adding a tag hierarchy corresponding to a given folksonomy can very effectively help narrowing or broadening the scope of search

  13. Extracting tag hierarchies.

    Directory of Open Access Journals (Sweden)

    Gergely Tibély

    Full Text Available Tagging items with descriptive annotations or keywords is a very natural way to compress and highlight information about the properties of the given entity. Over the years several methods have been proposed for extracting a hierarchy between the tags for systems with a "flat", egalitarian organization of the tags, which is very common when the tags correspond to free words given by numerous independent people. Here we present a complete framework for automated tag hierarchy extraction based on tag occurrence statistics. Along with proposing new algorithms, we are also introducing different quality measures enabling the detailed comparison of competing approaches from different aspects. Furthermore, we set up a synthetic, computer generated benchmark providing a versatile tool for testing, with a couple of tunable parameters capable of generating a wide range of test beds. Beside the computer generated input we also use real data in our studies, including a biological example with a pre-defined hierarchy between the tags. The encouraging similarity between the pre-defined and reconstructed hierarchy, as well as the seemingly meaningful hierarchies obtained for other real systems indicate that tag hierarchy extraction is a very promising direction for further research with a great potential for practical applications. Tags have become very prevalent nowadays in various online platforms ranging from blogs through scientific publications to protein databases. Furthermore, tagging systems dedicated for voluntary tagging of photos, films, books, etc. with free words are also becoming popular. The emerging large collections of tags associated with different objects are often referred to as folksonomies, highlighting their collaborative origin and the "flat" organization of the tags opposed to traditional hierarchical categorization. Adding a tag hierarchy corresponding to a given folksonomy can very effectively help narrowing or broadening the scope of

  14. Extracting tag hierarchies.

    Science.gov (United States)

    Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

    2013-01-01

    Tagging items with descriptive annotations or keywords is a very natural way to compress and highlight information about the properties of the given entity. Over the years several methods have been proposed for extracting a hierarchy between the tags for systems with a "flat", egalitarian organization of the tags, which is very common when the tags correspond to free words given by numerous independent people. Here we present a complete framework for automated tag hierarchy extraction based on tag occurrence statistics. Along with proposing new algorithms, we are also introducing different quality measures enabling the detailed comparison of competing approaches from different aspects. Furthermore, we set up a synthetic, computer generated benchmark providing a versatile tool for testing, with a couple of tunable parameters capable of generating a wide range of test beds. Beside the computer generated input we also use real data in our studies, including a biological example with a pre-defined hierarchy between the tags. The encouraging similarity between the pre-defined and reconstructed hierarchy, as well as the seemingly meaningful hierarchies obtained for other real systems indicate that tag hierarchy extraction is a very promising direction for further research with a great potential for practical applications. Tags have become very prevalent nowadays in various online platforms ranging from blogs through scientific publications to protein databases. Furthermore, tagging systems dedicated for voluntary tagging of photos, films, books, etc. with free words are also becoming popular. The emerging large collections of tags associated with different objects are often referred to as folksonomies, highlighting their collaborative origin and the "flat" organization of the tags opposed to traditional hierarchical categorization. Adding a tag hierarchy corresponding to a given folksonomy can very effectively help narrowing or broadening the scope of search. Moreover

  15. Sets of RNA repeated tags and hybridization-sensitive fluorescent probes for distinct images of RNA in a living cell.

    Directory of Open Access Journals (Sweden)

    Takeshi Kubota

    Full Text Available BACKGROUND: Imaging the behavior of RNA in a living cell is a powerful means for understanding RNA functions and acquiring spatiotemporal information in a single cell. For more distinct RNA imaging in a living cell, a more effective chemical method to fluorescently label RNA is now required. In addition, development of the technology labeling with different colors for different RNA would make it easier to analyze plural RNA strands expressing in a cell. METHODOLOGY/PRINCIPAL FINDINGS: Tag technology for RNA imaging in a living cell has been developed based on the unique chemical functions of exciton-controlled hybridization-sensitive oligonucleotide (ECHO probes. Repetitions of selected 18-nucleotide RNA tags were incorporated into the mRNA 3'-UTR. Pairs with complementary ECHO probes exhibited hybridization-sensitive fluorescence emission for the mRNA expressed in a living cell. The mRNA in a nucleus was detected clearly as fluorescent puncta, and the images of the expression of two mRNAs were obtained independently and simultaneously with two orthogonal tag-probe pairs. CONCLUSIONS/SIGNIFICANCE: A compact and repeated label has been developed for RNA imaging in a living cell, based on the photochemistry of ECHO probes. The pairs of an 18-nt RNA tag and the complementary ECHO probes are highly thermostable, sequence-specifically emissive, and orthogonal to each other. The nucleotide length necessary for one tag sequence is much shorter compared with conventional tag technologies, resulting in easy preparation of the tag sequences with a larger number of repeats for more distinct RNA imaging.

  16. [Clone, construct, expression and verification of lactoferricin B gene and several sequence mutations in yeast].

    Science.gov (United States)

    Feng, Yong-qian; Zha, Xiao-jun; Zhai, Chao-yang

    2007-07-01

    To construct the eucaryotic recombinant plasmid of pYES2/LactoferricinB expressing in yeast of S. cerevisiae, of which the expressed protein antibacterial activity was verified in preliminary. By self-template PCR method, the gene of Lactoferricin B and its several sequence mutations were amplified with the parts of the pre-synthesized single chains. And then Lactoferricin B gene and its mutants were cloned into the vector of pYES2 to construct the recombined expression plasmid pYES2/Lactoferricin B etc. extracted and used to transform the yeast S. cerevisiae. The expressions of proteins were determined after induced by galactose. The expression proteins were collected and purified by hydronium-exchange column, and the bacterial inhibited test was applied to identify the protein antibacterial activities. The PCR amplifying and DNA sequencing tests indicated that the purpose plasmid contained the Lactoferricin B gene and several mutations. The induced target proteins were confirmed by SDS-PAGE electrophoresis and mass spectrum test. The protein antibacterial activities of mutations were verified in preliminary. The recombined plasmid pYES2/Lactoferricin B etc. are successfully constructed and induced to express in yeast cell of S. cerevisiae; the obtained recombined protein of Lactoferricin B provides a basis for further research work on the biological function and antibacterial activity.

  17. microRNA expression profiling in fetal single ventricle malformation identified by deep sequencing.

    Science.gov (United States)

    Yu, Zhang-Bin; Han, Shu-Ping; Bai, Yun-Fei; Zhu, Chun; Pan, Ya; Guo, Xi-Rong

    2012-01-01

    microRNAs (miRNAs) have emerged as key regulators in many biological processes, particularly cardiac growth and development, although the specific miRNA expression profile associated with this process remains to be elucidated. This study aimed to characterize the cellular microRNA profile involved in the development of congenital heart malformation, through the investigation of single ventricle (SV) defects. Comprehensive miRNA profiling in human fetal SV cardiac tissue was performed by deep sequencing. Differential expression of 48 miRNAs was revealed by sequencing by oligonucleotide ligation and detection (SOLiD) analysis. Of these, 38 were down-regulated and 10 were up-regulated in differentiated SV cardiac tissue, compared to control cardiac tissue. This was confirmed by real-time quantitative reverse transcription-polymerase chain reaction (qRT-PCR) analysis. Predicted target genes of the 48 differentially expressed miRNAs were analyzed by gene ontology and categorized according to cellular process, regulation of biological process and metabolic process. Pathway-Express analysis identified the WNT and mTOR signaling pathways as the most significant processes putatively affected by the differential expression of these miRNAs. The candidate genes involved in cardiac development were identified as potential targets for these differentially expressed microRNAs and the collaborative network of microRNAs and cardiac development related-mRNAs was constructed. These data provide the basis for future investigation of the mechanism of the occurrence and development of fetal SV malformations.

  18. Sequence and expression pattern of the germ line marker vasa in honey bees and stingless bees

    Science.gov (United States)

    2009-01-01

    Queens and workers of social insects differ in the rates of egg laying. Using genomic information we determined the sequence of vasa, a highly conserved gene specific to the germ line of metazoans, for the honey bee and four stingless bees. The vasa sequence of social bees differed from that of other insects in two motifs. By RT-PCR we confirmed the germ line specificity of Amvasa expression in honey bees. In situ hybridization on ovarioles showed that Amvasa is expressed throughout the germarium, except for the transition zone beneath the terminal filament. A diffuse vasa signal was also seen in terminal filaments suggesting the presence of germ line cells. Oocytes showed elevated levels of Amvasa transcripts in the lower germarium and after follicles became segregated. In previtellogenic follicles, Amvasa transcription was detected in the trophocytes, which appear to supply its mRNA to the growing oocyte. A similar picture was obtained for ovarioles of the stingless bee Melipona quadrifasciata, except that Amvasa expression was higher in the oocytes of previtellogenic follicles. The social bees differ in this respect from Drosophila, the model system for insect oogenesis, suggesting that changes in the sequence and expression pattern of vasa may have occurred during social evolution. PMID:21637523

  19. Sequence and expression pattern of the germ line marker vasa in honey bees and stingless bees

    Directory of Open Access Journals (Sweden)

    Érica Donato Tanaka

    2009-01-01

    Full Text Available Queens and workers of social insects differ in the rates of egg laying. Using genomic information we determined the sequence of vasa, a highly conserved gene specific to the germ line of metazoans, for the honey bee and four stingless bees. The vasa sequence of social bees differed from that of other insects in two motifs. By RT-PCR we confirmed the germ line specificity of Amvasa expression in honey bees. In situ hybridization on ovarioles showed that Amvasa is expressed throughout the germarium, except for the transition zone beneath the terminal filament. A diffuse vasa signal was also seen in terminal filaments suggesting the presence of germ line cells. Oocytes showed elevated levels of Amvasa transcripts in the lower germarium and after follicles became segregated. In previtellogenic follicles, Amvasa transcription was detected in the trophocytes, which appear to supply its mRNA to the growing oocyte. A similar picture was obtained for ovarioles of the stingless bee Melipona quadrifasciata, except that Amvasa expression was higher in the oocytes of previtellogenic follicles. The social bees differ in this respect from Drosophila, the model system for insect oogenesis, suggesting that changes in the sequence and expression pattern of vasa may have occurred during social evolution.

  20. An abundance of ubiquitously expressed genes revealed by tissue transcriptome sequence data.

    Directory of Open Access Journals (Sweden)

    Daniel Ramsköld

    2009-12-01

    Full Text Available The parts of the genome transcribed by a cell or tissue reflect the biological processes and functions it carries out. We characterized the features of mammalian tissue transcriptomes at the gene level through analysis of RNA deep sequencing (RNA-Seq data across human and mouse tissues and cell lines. We observed that roughly 8,000 protein-coding genes were ubiquitously expressed, contributing to around 75% of all mRNAs by message copy number in most tissues. These mRNAs encoded proteins that were often intracellular, and tended to be involved in metabolism, transcription, RNA processing or translation. In contrast, genes for secreted or plasma membrane proteins were generally expressed in only a subset of tissues. The distribution of expression levels was broad but fairly continuous: no support was found for the concept of distinct expression classes of genes. Expression estimates that included reads mapping to coding exons only correlated better with qRT-PCR data than estimates which also included 3' untranslated regions (UTRs. Muscle and liver had the least complex transcriptomes, in that they expressed predominantly ubiquitous genes and a large fraction of the transcripts came from a few highly expressed genes, whereas brain, kidney and testis expressed more complex transcriptomes with the vast majority of genes expressed and relatively small contributions from the most expressed genes. mRNAs expressed in brain had unusually long 3'UTRs, and mean 3'UTR length was higher for genes involved in development, morphogenesis and signal transduction, suggesting added complexity of UTR-based regulation for these genes. Our results support a model in which variable exterior components feed into a large, densely connected core composed of ubiquitously expressed intracellular proteins.

  1. Assessment of adaptive evolution between wheat and rice as deduced from full-length common wheat cDNA sequence data and expression patterns

    Directory of Open Access Journals (Sweden)

    Hayashizaki Yoshihide

    2009-06-01

    Full Text Available Abstract Background Wheat is an allopolyploid plant that harbors a huge, complex genome. Therefore, accumulation of expressed sequence tags (ESTs for wheat is becoming particularly important for functional genomics and molecular breeding. We prepared a comprehensive collection of ESTs from the various tissues that develop during the wheat life cycle and from tissues subjected to stress. We also examined their expression profiles in silico. As full-length cDNAs are indispensable to certify the collected ESTs and annotate the genes in the wheat genome, we performed a systematic survey and sequencing of the full-length cDNA clones. This sequence information is a valuable genetic resource for functional genomics and will enable carrying out comparative genomics in cereals. Results As part of the functional genomics and development of genomic wheat resources, we have generated a collection of full-length cDNAs from common wheat. By grouping the ESTs of recombinant clones randomly selected from the full-length cDNA library, we were able to sequence 6,162 independent clones with high accuracy. About 10% of the clones were wheat-unique genes, without any counterparts within the DNA database. Wheat clones that showed high homology to those of rice were selected in order to investigate their expression patterns in various tissues throughout the wheat life cycle and in response to abiotic-stress treatments. To assess the variability of genes that have evolved differently in wheat and rice, we calculated the substitution rate (Ka/Ks of the counterparts in wheat and rice. Genes that were preferentially expressed in certain tissues or treatments had higher Ka/Ks values than those in other tissues and treatments, which suggests that the genes with the higher variability expressed in these tissues is under adaptive selection. Conclusion We have generated a high-quality full-length cDNA resource for common wheat, which is essential for continuation of the

  2. A novel approach to sequence validating protein expression clones with automated decision making

    Directory of Open Access Journals (Sweden)

    Mohr Stephanie E

    2007-06-01

    Full Text Available Abstract Background Whereas the molecular assembly of protein expression clones is readily automated and routinely accomplished in high throughput, sequence verification of these clones is still largely performed manually, an arduous and time consuming process. The ultimate goal of validation is to determine if a given plasmid clone matches its reference sequence sufficiently to be "acceptable" for use in protein expression experiments. Given the accelerating increase in availability of tens of thousands of unverified clones, there is a strong demand for rapid, efficient and accurate software that automates clone validation. Results We have developed an Automated Clone Evaluation (ACE system – the first comprehensive, multi-platform, web-based plasmid sequence verification software package. ACE automates the clone verification process by defining each clone sequence as a list of multidimensional discrepancy objects, each describing a difference between the clone and its expected sequence including the resulting polypeptide consequences. To evaluate clones automatically, this list can be compared against user acceptance criteria that specify the allowable number of discrepancies of each type. This strategy allows users to re-evaluate the same set of clones against different acceptance criteria as needed for use in other experiments. ACE manages the entire sequence validation process including contig management, identifying and annotating discrepancies, determining if discrepancies correspond to polymorphisms and clone finishing. Designed to manage thousands of clones simultaneously, ACE maintains a relational database to store information about clones at various completion stages, project processing parameters and acceptance criteria. In a direct comparison, the automated analysis by ACE took less time and was more accurate than a manual analysis of a 93 gene clone set. Conclusion ACE was designed to facilitate high throughput clone sequence

  3. Analyses of expressed sequence tags from the maize foliar pathogen Cercospora zeae-maydis identity novel genes expressed during vegetative infectious, and repoductive growth

    NARCIS (Netherlands)

    Bluhm, B.H.; Lindquist, E.; Kema, G.H.J.; Goodwin, S.B.; Dunkle, L.D.

    2008-01-01

    The ascomycete fungus Cercospora zeae-maydis is an aggressive foliar pathogen of maize that causes substantial losses annually throughout the Western Hemisphere. Despite its impact on maize production, little is known about the regulation of pathogenesis in C. zeae-maydis at the molecular level. The

  4. Differential cellulolytic activity of native-form and C-terminal tagged-form cellulase derived from coptotermes formosanus and expressed in E. coli

    Science.gov (United States)

    The endogenous cellulase gene (CfEG3a) of Coptotermes formosanus, an economically important pest termite, was cloned and overexpressed in both native form (nCfEG) and C-terminal His-tagged form (tCfEG) in E.coli. Both forms of recombinant cellulases showed hydrolytic activity on cellulosic substrate...

  5. The biological activity of a recombinantly expressed (His)(6)-tagged peanut allergen (rAra h 1) is unaffected by endotoxin removal

    DEFF Research Database (Denmark)

    Jensen, Louise Bjerremann; Torp, Anna Maria; Andersen, Sven Bode

    2008-01-01

    The application of recombinant (His)(6)-tagged proteins in cell culture assays is associated with problems due to lipopolysaccharide (LPS) contamination. LPS stimulates cells of the immune system, thereby masking antigen-specific activation of T cells. Due to the affinity of LPS for histidine it ...

  6. Hepatitis C virus expressing flag-tagged envelope protein 2 has unaltered infectivity and density, is specifically neutralized by flag antibodies and can be purified by affinity chromatography

    DEFF Research Database (Denmark)

    Prentø, Jannick Cornelius; Bukh, Jens

    2011-01-01

    to the original virus. Flag-tagged virus was susceptible to flag-specific antibody neutralization, and infected cells could be immuno-stained by anti-flag antibodies. Using affinity chromatography with anti-flag resin we repeatedly obtained ~30% recovery of infectious particles. The full viability and unaltered...

  7. Notes on SAW Tag Interrogation Techniques

    Science.gov (United States)

    Barton, Richard J.

    2010-01-01

    We consider the problem of interrogating a single SAW RFID tag with a known ID and known range in the presence of multiple interfering tags under the following assumptions: (1) The RF propagation environment is well approximated as a simple delay channel with geometric power-decay constant alpha >/= 2. (2) The interfering tag IDs are unknown but well approximated as independent, identically distributed random samples from a probability distribution of tag ID waveforms with known second-order properties, and the tag of interest is drawn independently from the same distribution. (3) The ranges of the interfering tags are unknown but well approximated as independent, identically distributed realizations of a random variable rho with a known probability distribution f(sub rho) , and the tag ranges are independent of the tag ID waveforms. In particular, we model the tag waveforms as random impulse responses from a wide-sense-stationary, uncorrelated-scattering (WSSUS) fading channel with known bandwidth and scattering function. A brief discussion of the properties of such channels and the notation used to describe them in this document is given in the Appendix. Under these assumptions, we derive the expression for the output signal-to-noise ratio (SNR) for an arbitrary combination of transmitted interrogation signal and linear receiver filter. Based on this expression, we derive the optimal interrogator configuration (i.e., transmitted signal/receiver filter combination) in the two extreme noise/interference regimes, i.e., noise-limited and interference-limited, under the additional assumption that the coherence bandwidth of the tags is much smaller than the total tag bandwidth. Finally, we evaluate the performance of both optimal interrogators over a broad range of operating scenarios using both numerical simulation based on the assumed model and Monte Carlo simulation based on a small sample of measured tag waveforms. The performance evaluation results not only

  8. Replication error deficient and proficient colorectal cancer gene expression differences caused by 3'UTR polyT sequence deletions

    DEFF Research Database (Denmark)

    Wilding, Jennifer L; McGowan, Simon; Liu, Ying

    2010-01-01

    , and have distinct pathologies. Regulatory sequences controlling all aspects of mRNA processing, especially including message stability, are found in the 3'UTR sequence of most genes. The relevant sequences are typically A/U-rich elements or U repeats. Microarray analysis of 14 RER+ (deficient) and 16 RER......- (proficient) colorectal cancer cell lines confirms a striking difference in expression profiles. Analysis of the incidence of mononucleotide repeat sequences in the 3'UTRs, 5'UTRs, and coding sequences of those genes most differentially expressed in RER+ versus RER- cell lines has shown that much...... of this differential expression can be explained by the occurrence of a massive enrichment of genes with 3'UTR T repeats longer than 11 base pairs in the most differentially expressed genes. This enrichment was confirmed by analysis of two published consensus sets of RER differentially expressed probesets for a large...

  9. Molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer myostatin gene

    Directory of Open Access Journals (Sweden)

    Smith-Keune Carolyn

    2008-02-01

    Full Text Available Abstract Background Myostatin (MSTN is a member of the transforming growth factor-β superfamily that negatively regulates growth of skeletal muscle tissue. The gene encoding for the MSTN peptide is a consolidate candidate for the enhancement of productivity in terrestrial livestock. This gene potentially represents an important target for growth improvement of cultured finfish. Results Here we report molecular characterization, tissue expression and sequence variability of the barramundi (Lates calcarifer MSTN-1 gene. The barramundi MSTN-1 was encoded by three exons 379, 371 and 381 bp in length and translated into a 376-amino acid peptide. Intron 1 and 2 were 412 and 819 bp in length and presented typical GT...AG splicing sites. The upstream region contained cis-regulatory elements such as TATA-box and E-boxes. A first assessment of sequence variability suggested that higher mutation rates are found in the 5' flanking region with several SNP's present in this species. A putative micro RNA target site has also been observed in the 3'UTR (untranslated region and is highly conserved across teleost fish. The deduced amino acid sequence was conserved across vertebrates and exhibited characteristic conserved putative functional residues including a cleavage motif of proteolysis (RXXR, nine cysteines and two glycosilation sites. A qualitative analysis of the barramundi MSTN-1 expression pattern revealed that, in adult fish, transcripts are differentially expressed in various tissues other than skeletal muscles including gill, heart, kidney, intestine, liver, spleen, eye, gonad and brain. Conclusion Our findings provide valuable insights such as sequence variation and genomic information which will aid the further investigation of the barramundi MSTN-1 gene in association with growth. The finding for the first time in finfish MSTN of a miRNA target site in the 3'UTR provides an opportunity for the identification of regulatory mutations on the

  10. The Role of Heterologous Chloroplast Sequence Elements in Transgene Integration and Expression1[W][OA

    Science.gov (United States)

    Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

    2010-01-01

    Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5′ untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5′ UTR and 3′ UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5′ UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5′ UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation. PMID:20130101

  11. The role of heterologous chloroplast sequence elements in transgene integration and expression.

    Science.gov (United States)

    Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

    2010-04-01

    Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5' untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5' UTR and 3' UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5' UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5' UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation.

  12. Tags on healthcare information websites

    DEFF Research Database (Denmark)

    Lykke, Marianne; Ådland, Marit Kristine

    2018-01-01

    This paper explores tags and tagging behaviour on health information websites using an empirical, user-oriented, exploratory case study. Taggers and editors were interviewed about tags and tagging, while taggers solved tasks that included applying tags to a website. This qualitative data...... articles, request information, and value article content. Some of these show that tags are not only not only topical descriptions, but communicative by intent. This result can potentially inform the design of tagging features....

  13. Tumor transcriptome sequencing reveals allelic expression imbalances associated with copy number alterations.

    Directory of Open Access Journals (Sweden)

    Brian B Tuch

    Full Text Available Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately measure allele-specific expression. To investigate these advantages we have applied a novel, strand-specific RNA-Seq method to tumors and matched normal tissue from three patients with oral squamous cell carcinomas. Additionally, to better understand the genomic determinants of the gene expression changes observed, we have sequenced the tumor and normal genomes of one of these patients. We demonstrate here that our RNA-Seq method accurately measures allelic imbalance and that measurement on the genome-wide scale yields novel insights into cancer etiology. As expected, the set of genes differentially expressed in the tumors is enriched for cell adhesion and differentiation functions, but, unexpectedly, the set of allelically imbalanced genes is also enriched for these same cancer-related functions. By comparing the transcriptomic perturbations observed in one patient to his underlying normal and tumor genomes, we find that allelic imbalance in the tumor is associated with copy number mutations and that copy number mutations are, in turn, strongly associated with changes in transcript abundance. These results support a model in which allele-specific deletions and duplications drive allele-specific changes in gene expression in the developing tumor.

  14. [Sequences and expression pattern of mce gene in Leptospira interrogans of different serogroups].

    Science.gov (United States)

    Zhang, Lei; Xue, Feng; Yan, Jie; Mao, Ya-fei; Li, Li-wei

    2008-11-01

    To determine the frequency of mce gene in Leptospira interrogans, and to investigate the gene transcription levels of L. interrogans before and after infecting cells. The segments of entire mce genes from 13 L.interrogans strains and 1 L.biflexa strain were amplified by PCR and then sequenced after T-A cloning. A prokaryotic expression system of mce gene was constructed; the expression and output of the target recombinant protein rMce were examined by SDS-PAGE and Western Blot assay. Rabbits were intradermally immunized with rMce to prepare the antiserum, the titer of antiserum was measured by immunodiffusion test. The transcription levels of mce gene in L.interrogans serogroup Icterohaemorrhagiae serovar lai strain 56601 before and after infecting J774A.1 cells were monitored by real-time fluorescence quantitative RT-PCR. mce gene was carried in all tested L.interrogans strains, but not in L.biflexa serogroup Semaranga serovar patoc strain Patoc I. The similarities of nucleotide and putative amino acid sequences of the cloned mce genes to the reported sequences (GenBank accession No: NP712236) were 99.02%-100% and 97.91%-100%, respectively. The constructed prokaryotic expression system of mce gene expressed rMce and the output of rMce was about 5% of the total bacterial proteins. The antiserum against whole cell of L.interrogans strain 56601 efficiently recognized rMce. After infecting J774A.1 cells, transcription levels of the mce gene in L.interrogans strain 56601 were remarkably up-regulated. The constructed prokaryotic expression system of mce gene and the prepared antiserum against rMce provide useful tools for further study of the gene function.

  15. P22 Arc repressor: enhanced expression of unstable mutants by addition of polar C-terminal sequences.

    OpenAIRE

    Milla, M. E.; Brown, B. M.; Sauer, R. T.

    1993-01-01

    Many mutant variants of the P22 Arc repressor are subject to intracellular proteolysis in Escherichia coli, which precludes their expression at levels sufficient for purification and subsequent biochemical characterization. Here we examine the effects of several different C-terminal extension sequences on the expression and activity of a set of Arc mutants. We show that two tail sequences, KNQHE (st5) and H6KNQHE (st11), increase the expression levels of most mutants from 10- to 20-fold and, ...

  16. Impacts of Neanderthal-Introgressed Sequences on the Landscape of Human Gene Expression.

    Science.gov (United States)

    McCoy, Rajiv C; Wakefield, Jon; Akey, Joshua M

    2017-02-23

    Regulatory variation influencing gene expression is a key contributor to phenotypic diversity, both within and between species. Unfortunately, RNA degrades too rapidly to be recovered from fossil remains, limiting functional genomic insights about our extinct hominin relatives. Many Neanderthal sequences survive in modern humans due to ancient hybridization, providing an opportunity to assess their contributions to transcriptional variation and to test hypotheses about regulatory evolution. We developed a flexible Bayesian statistical approach to quantify allele-specific expression (ASE) in complex RNA-seq datasets. We identified widespread expression differences between Neanderthal and modern human alleles, indicating pervasive cis-regulatory impacts of introgression. Brain regions and testes exhibited significant downregulation of Neanderthal alleles relative to other tissues, consistent with natural selection influencing the tissue-specific regulatory landscape. Our study demonstrates that Neanderthal-inherited sequences are not silent remnants of ancient interbreeding but have measurable impacts on gene expression that contribute to variation in modern human phenotypes. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Sarcocystis neurona merozoites express a family of immunogenic surface antigens that are orthologues of the Toxoplasma gondii surface antigens (SAGs) and SAG-related sequences.

    Science.gov (United States)

    Howe, Daniel K; Gaji, Rajshekhar Y; Mroz-Barrett, Meaghan; Gubbels, Marc-Jan; Striepen, Boris; Stamper, Shelby

    2005-02-01

    Sarcocystis neurona is a member of the Apicomplexa that causes myelitis and encephalitis in horses but normally cycles between the opossum and small mammals. Analysis of an S. neurona expressed sequence tag (EST) database revealed four paralogous proteins that exhibit clear homology to the family of surface antigens (SAGs) and SAG-related sequences of Toxoplasma gondii. The primary peptide sequences of the S. neurona proteins are consistent with the two-domain structure that has been described for the T. gondii SAGs, and each was predicted to have an amino-terminal signal peptide and a carboxyl-terminal glycolipid anchor addition site, suggesting surface localization. All four proteins were confirmed to be membrane associated and displayed on the surface of S. neurona merozoites. Due to their surface localization and homology to T. gondii surface antigens, these S. neurona proteins were designated SnSAG1, SnSAG2, SnSAG3, and SnSAG4. Consistent with their homology, the SnSAGs elicited a robust immune response in infected and immunized animals, and their conserved structure further suggests that the SnSAGs similarly serve as adhesins for attachment to host cells. Whether the S. neurona SAG family is as extensive as the T. gondii SAG family remains unresolved, but it is probable that additional SnSAGs will be revealed as more S. neurona ESTs are generated. The existence of an SnSAG family in S. neurona indicates that expression of multiple related surface antigens is not unique to the ubiquitous organism T. gondii. Instead, the SAG gene family is a common trait that presumably has an essential, conserved function(s).

  18. High-throughput genotyping-by-sequencing facilitates molecular tagging of a novel rust resistance gene, R 15 , in sunflower (Helianthus annuus L.).

    Science.gov (United States)

    Ma, G J; Song, Q J; Markell, S G; Qi, L L

    2018-03-21

    A novel rust resistance gene, R 15 , derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R 15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop necessitating the development of resistant sunflower hybrids providing a more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F 2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R 15 ) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R 15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP makers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.

  19. Three new shRNA expression vectors targeting the CYP3A4 coding sequence to inhibit its expression

    Directory of Open Access Journals (Sweden)

    Siyun Xu

    2014-10-01

    Full Text Available RNA interference (RNAi is useful for selective gene silencing. Cytochrome P450 3A4 (CYP3A4, which metabolizes approximately 50% of drugs in clinical use, plays an important role in drug metabolism. In this study, we aimed to develop a short hairpin RNA (shRNA to modulate CYP3A4 expression. Three new shRNAs (S1, S2 and S3 were designed to target the coding sequence (CDS of CYP3A4, cloned into a shRNA expression vector, and tested in different cells. The mixture of three shRNAs produced optimal reduction (55% in CYP3A4 CDS-luciferase activity in both CHL and HEK293 cells. Endogenous CYP3A4 expression in HepG2 cells was decreased about 50% at both mRNA and protein level after transfection of the mixture of three shRNAs. In contrast, CYP3A5 gene expression was not altered by the shRNAs, supporting the selectivity of CYP3A4 shRNAs. In addition, HepG2 cells transfected with CYP3A4 shRNAs were less sensitive to Ginkgolic acids, whose toxic metabolites are produced by CYP3A4. These results demonstrate that vector-based shRNAs could modulate CYP3A4 expression in cells through their actions on CYP3A4 CDS, and CYP3A4 shRNAs may be utilized to define the role of CYP3A4 in drug metabolism and toxicity.

  20. Sequence and expression analyses of porcine ISG15 and ISG43 genes.

    Science.gov (United States)

    Huang, Jiangnan; Zhao, Shuhong; Zhu, Mengjin; Wu, Zhenfang; Yu, Mei

    2009-08-01

    The coding sequences of porcine interferon-stimulated gene 15 (ISG15) and the interferon-stimulated gene (ISG43) were cloned from swine spleen mRNA. The amino acid sequences deduced from porcine ISG15 and ISG43 genes coding sequence shared 24-75% and 29-83% similarity with ISG15s and ISG43s from other vertebrates, respectively. Structural analyses revealed that porcine ISG15 comprises two ubiquitin homologues motifs (UBQ) domain and a conserved C-terminal LRLRGG conjugating motif. Porcine ISG43 contains an ubiquitin-processing proteases-like domain. Phylogenetic analyses showed that porcine ISG15 and ISG43 were mostly related to rat ISG15 and cattle ISG43, respectively. Using quantitative real-time PCR assay, significant increased expression levels of porcine ISG15 and ISG43 genes were detected in porcine kidney endothelial cells (PK15) cells treated with poly I:C. We also observed the enhanced mRNA expression of three members of dsRNA pattern-recognition receptors (PRR), TLR3, DDX58 and IFIH1, which have been reported to act as critical receptors in inducing the mRNA expression of ISG15 and ISG43 genes. However, we did not detect any induced mRNA expression of IFNalpha and IFNbeta, suggesting that transcriptional activations of ISG15 and ISG43 were mediated through IFN-independent signaling pathway in the poly I:C treated PK15 cells. Association analyses in a Landrace pig population revealed that ISG15 c.347T>C (BstUI) polymorphism and the ISG43 c.953T>G (BccI) polymorphism were significantly associated with hematological parameters and immune-related traits.

  1. Heparin-binding peptide as a novel affinity tag for purification of recombinant proteins.

    Science.gov (United States)

    Morris, Jacqueline; Jayanthi, Srinivas; Langston, Rebekah; Daily, Anna; Kight, Alicia; McNabb, David S; Henry, Ralph; Kumar, Thallapuranam Krishnaswamy Suresh

    2016-10-01

    Purification of recombinant proteins constitutes a significant part of the downstream processing in biopharmaceutical industries. Major costs involved in the production of bio-therapeutics mainly depend on the number of purification steps used during the downstream process. Affinity chromatography is a widely used method for the purification of recombinant proteins expressed in different expression host platforms. Recombinant protein purification is achieved by fusing appropriate affinity tags to either N- or C- terminus of the target recombinant proteins. Currently available protein/peptide affinity tags have proved quite useful in the purification of recombinant proteins. However, these affinity tags suffer from specific limitations in their use under different conditions of purification. In this study, we have designed a novel 34-amino acid heparin-binding affinity tag (HB-tag) for the purification of recombinant proteins expressed in Escherichia coli (E. coli) cells. HB-tag fused recombinant proteins were overexpressed in E. coli in high yields. A one-step heparin-Sepharose-based affinity chromatography protocol was developed to purify HB-fused recombinant proteins to homogeneity using a simple sodium chloride step gradient elution. The HB-tag has also been shown to facilitate the purification of target recombinant proteins from their 8 M urea denatured state(s). The HB-tag has been demonstrated to be successfully released from the fusion protein by an appropriate protease treatment to obtain the recombinant target protein(s) in high yields. Results of the two-dimensional NMR spectroscopy experiments indicate that the purified recombinant target protein(s) exist in the native conformation. Polyclonal antibodies raised against the HB-peptide sequence, exhibited high binding specificity and sensitivity to the HB-fused recombinant proteins (∼10 ng) in different crude cell extracts obtained from diverse expression hosts. In our opinion, the HB-tag provides a

  2. Calling genotypes from public RNA-sequencing data enables identification of genetic variants that affect gene-expression levels

    NARCIS (Netherlands)

    Deelen, Patrick; Zhernakova, Daria V.; de Haan, Mark; van der Sijde, Marijke; Bonder, Marc Jan; Karjalainen, Juha; van der Velde, K. Joeri; Abbott, Kristin M.; Fu, Jingyuan; Wijmenga, Cisca; Sinke, Richard J.; Swertz, Morris A.; Franke, Lude

    2015-01-01

    Background: RNA-sequencing (RNA-seq) is a powerful technique for the identification of genetic variants that affect gene-expression levels, either through expression quantitative trait locus (eQTL) mapping or through allele-specific expression (ASE) analysis. Given increasing numbers of RNA-seq

  3. Intercellular signalling in Vibrio harveyi: sequence and function of genes regulating expression of luminescence.

    Science.gov (United States)

    Bassler, B L; Wright, M; Showalter, R E; Silverman, M R

    1993-08-01

    Density-dependent expression of luminescence in Vibrio harveyi is regulated by the concentration of an extracellular signal molecule (autoinducer) in the culture medium. A recombinant clone that restored function to one class of spontaneous dim mutants was found to encode functions necessary for the synthesis of, and response to, a signal molecule. Sequence analysis of the region encoding these functions revealed three open reading frames, two (luxL and luxM) that are required for production of an autoinducer substance and a third (luxN) that is required for response to this signal substance. The LuxL and LuxM proteins are not similar in amino acid sequence to other proteins in the database, but the LuxN protein contains regions of sequence resembling both the histidine protein kinase and the response regulator domains of the family of two-component, signal transduction proteins. The phenotypes of mutants with luxL, luxM and luxN defects indicated that an additional signal-response system controlling density-dependent expression of luminescence remains to be identified.

  4. Transcriptome sequencing and whole genome expression profiling of chrysanthemum under dehydration stress

    Science.gov (United States)

    2013-01-01

    Background Chrysanthemum is one of the most important ornamental crops in the world and drought stress seriously limits its production and distribution. In order to generate a functional genomics resource and obtain a deeper understanding of the molecular mechanisms regarding chrysanthemum responses to dehydration stress, we performed large-scale transcriptome sequencing of chrysanthemum plants under dehydration stress using the Illumina sequencing technology. Results Two cDNA libraries constructed from mRNAs of control and dehydration-treated seedlings were sequenced by Illumina technology. A total of more than 100 million reads were generated and de novo assembled into 98,180 unique transcripts which were further extensively annotated by comparing their sequencing to different protein databases. Biochemical pathways were predicted from these transcript sequences. Furthermore, we performed gene expression profiling analysis upon dehydration treatment in chrysanthemum and identified 8,558 dehydration-responsive unique transcripts, including 307 transcription factors and 229 protein kinases and many well-known stress responsive genes. Gene ontology (GO) term enrichment and biochemical pathway analyses showed that dehydration stress caused changes in hormone response, secondary and amino acid metabolism, and light and photoperiod response. These findings suggest that drought tolerance of chrysanthemum plants may be related to the regulation of hormone biosynthesis and signaling, reduction of oxidative damage, stabilization of cell proteins and structures, and maintenance of energy and carbon supply. Conclusions Our transcriptome sequences can provide a valuable resource for chrysanthemum breeding and research and novel insights into chrysanthemum responses to dehydration stress and offer candidate genes or markers that can be used to guide future studies attempting to breed drought tolerant chrysanthemum cultivars. PMID:24074255

  5. PIT Tagging Anurans

    Science.gov (United States)

    McCreary, Brome

    2008-01-01

    The following video demonstrates a procedure to insert a passive integrated transponder (PIT) tag under the skin of an anuran (frog or toad) for research and monitoring purposes. Typically, a 12.5 mm tag (0.5 in.) is used to uniquely identify individual anurans as smal as 40 mm (1.6 in.) in length from snout to vent. Smaller tags are also available and allow smaller anurans to be tagged. The procedure does not differ for other sizes of tages or other sizes of anurans. Anyone using this procedure should ensure that the tag is small enough to fit easily behind the sacral hump of the anuran, as shown in this video.

  6. Cloning, sequencing, and expression of dnaK-operon proteins from the thermophilic bacterium Thermus thermophilus.

    Science.gov (United States)

    Osipiuk, J; Joachimiak, A

    1997-09-12

    We propose that the dnaK operon of Thermus thermophilus HB8 is composed of three functionally linked genes: dnaK, grpE, and dnaJ. The dnaK and dnaJ gene products are most closely related to their cyanobacterial homologs. The DnaK protein sequence places T. thermophilus in the plastid Hsp70 subfamily. In contrast, the grpE translated sequence is most similar to GrpE from Clostridium acetobutylicum, a Gram-positive anaerobic bacterium. A single promoter region, with homology to the Escherichia coli consensus promoter sequences recognized by the sigma70 and sigma32 transcription factors, precedes the postulated operon. This promoter is heat-shock inducible. The dnaK mRNA level increased more than 30 times upon 10 min of heat shock (from 70 degrees C to 85 degrees C). A strong transcription terminating sequence was found between the dnaK and grpE genes. The individual genes were cloned into pET expression vectors and the thermophilic proteins were overproduced at high levels in E. coli and purified to homogeneity. The recombinant T. thermophilus DnaK protein was shown to have a weak ATP-hydrolytic activity, with an optimum at 90 degrees C. The ATPase was stimulated by the presence of GrpE and DnaJ. Another open reading frame, coding for ClpB heat-shock protein, was found downstream of the dnaK operon.

  7. Regulatory sequences driving expression of the sea urchin Otp homeobox gene in oral ectoderm cells.

    Science.gov (United States)

    Cavalieri, Vincenzo; Bernardo, Maria Di; Spinelli, Giovanni

    2007-01-01

    PlOtp (Orthopedia), a homeodomain-containing transcription factor, has been recently characterized as a key regulator of the morphogenesis of the skeletal system in the embryo of the sea urchin Paracentrotus lividus. Otp acts as a positive regulator in a subset of oral ectodermal cells which transmit short-range signals to the underlying primary mesenchyme cells where skeletal synthesis is initiated. To shed some light on the molecular mechanisms involved in such a process, we begun a functional analysis of the cis-regulatory sequences of the Otp gene. Congruent with the spatial expression profile of the endogenous Otp gene, we found that while a DNA region from -494 to +358 is shown to drive in vivo GFP reporter expression in the oral ectoderm, but also in the foregut, a larger region spanning from -2044 to +358 is needed to give firmly established tissue specificity. Microinjection of PCR-amplified DNA constructs, truncated in the 5' regulatory region, and determination of GFP mRNA level in injected embryos allowed the identification of a 5'-flanking fragment of 184bp in length, essential for expression of the transgene in the oral ectoderm of pluteus stage embryos. Finally, we conducted DNAse I-footprinting assays in nuclear extracts for the 184bp region and detected two protected sequences. Data bank search indicates that these sites contain consensus binding sites for transcription factors.

  8. Approximation properties of haplotype tagging

    Directory of Open Access Journals (Sweden)

    Dreiseitl Stephan

    2006-01-01

    Full Text Available Abstract Background Single nucleotide polymorphisms (SNPs are locations at which the genomic sequences of population members differ. Since these differences are known to follow patterns, disease association studies are facilitated by identifying SNPs that allow the unique identification of such patterns. This process, known as haplotype tagging, is formulated as a combinatorial optimization problem and analyzed in terms of complexity and approximation properties. Results It is shown that the tagging problem is NP-hard but approximable within 1 + ln((n2 - n/2 for n haplotypes but not approximable within (1 - ε ln(n/2 for any ε > 0 unless NP ⊂ DTIME(nlog log n. A simple, very easily implementable algorithm that exhibits the above upper bound on solution quality is presented. This algorithm has running time O((2m - p + 1 ≤ O(m(n2 - n/2 where p ≤ min(n, m for n haplotypes of size m. As we show that the approximation bound is asymptotically tight, the algorithm presented is optimal with respect to this asymptotic bound. Conclusion The haplotype tagging problem is hard, but approachable with a fast, practical, and surprisingly simple algorithm that cannot be significantly improved upon on a single processor machine. Hence, significant improvement in computatational efforts expended can only be expected if the computational effort is distributed and done in parallel.

  9. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat.

    KAUST Repository

    Leach, Lindsey J; Belfield, Eric J; Jiang, Caifu; Brown, Carly; Mithani, Aziz; Harberd, Nicholas P

    2014-01-01

    BACKGROUND: Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution 'nullisomic-tetrasomic' lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. RESULTS: We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. CONCLUSIONS: We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution.

  10. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat.

    KAUST Repository

    Leach, Lindsey J

    2014-04-11

    BACKGROUND: Bread wheat (Triticum aestivum) has a large, complex and hexaploid genome consisting of A, B and D homoeologous chromosome sets. Therefore each wheat gene potentially exists as a trio of A, B and D homoeoloci, each of which may contribute differentially to wheat phenotypes. We describe a novel approach combining wheat cytogenetic resources (chromosome substitution \\'nullisomic-tetrasomic\\' lines) with next generation deep sequencing of gene transcripts (RNA-Seq), to directly and accurately identify homoeologue-specific single nucleotide variants and quantify the relative contribution of individual homoeoloci to gene expression. RESULTS: We discover, based on a sample comprising ~5-10% of the total wheat gene content, that at least 45% of wheat genes are expressed from all three distinct homoeoloci. Most of these genes show strikingly biased expression patterns in which expression is dominated by a single homoeolocus. The remaining ~55% of wheat genes are expressed from either one or two homoeoloci only, through a combination of extensive transcriptional silencing and homoeolocus loss. CONCLUSIONS: We conclude that wheat is tending towards functional diploidy, through a variety of mechanisms causing single homoeoloci to become the predominant source of gene transcripts. This discovery has profound consequences for wheat breeding and our understanding of wheat evolution.

  11. Metatranscriptome Sequencing Reveals Insights into the Gene Expression and Functional Potential of Rumen Wall Bacteria

    Directory of Open Access Journals (Sweden)

    Evelyne Mann

    2018-01-01

    Full Text Available Microbiota of the rumen wall constitute an important niche of rumen microbial ecology and their composition has been elucidated in different ruminants during the last years. However, the knowledge about the function of rumen wall microbes is still limited. Rumen wall biopsies were taken from three fistulated dairy cows under a standard forage-based diet and after 4 weeks of high concentrate feeding inducing a subacute rumen acidosis (SARA. Extracted RNA was used for metatranscriptome sequencing using Illumina HiSeq sequencing technology. The gene expression of the rumen wall microbial community was analyzed by mapping 35 million sequences against the Kyoto Encyclopedia for Genes and Genomes (KEGG database and determining differentially expressed genes. A total of 1,607 functional features were assigned with high expression of genes involved in central metabolism, galactose, starch and sucrose metabolism. The glycogen phosphorylase (EC:2.4.1.1 which degrades (1->4-alpha-D-glucans was among the highest expressed genes being transcribed by 115 bacterial genera. Energy metabolism genes were also highly expressed, including the pyruvate orthophosphate dikinase (EC:2.7.9.1 involved in pyruvate metabolism, which was covered by 177 genera. Nitrogen metabolism genes, in particular glutamate dehydrogenase (EC:1.4.1.4, glutamine synthetase (EC:6.3.1.2 and glutamate synthase (EC:1.4.1.13, EC:1.4.1.14 were also found to be highly expressed and prove rumen wall microbiota to be actively involved in providing host-relevant metabolites for exchange across the rumen wall. In addition, we found all four urease subunits (EC:3.5.1.5 transcribed by members of the genera Flavobacterium, Corynebacterium, Helicobacter, Clostridium, and Bacillus, and the dissimilatory sulfate reductase (EC 1.8.99.5 dsrABC, which is responsible for the reduction of sulfite to sulfide. We also provide in situ evidence for cellulose and cellobiose degradation, a key step in fiber-rich feed

  12. G-stack modulated probe intensities on expression arrays - sequence corrections and signal calibration

    Directory of Open Access Journals (Sweden)

    Fasold Mario

    2010-04-01

    Full Text Available Abstract Background The brightness of the probe spots on expression microarrays intends to measure the abundance of specific mRNA targets. Probes with runs of at least three guanines (G in their sequence show abnormal high intensities which reflect rather probe effects than target concentrations. This G-bias requires correction prior to downstream expression analysis. Results Longer runs of three or more consecutive G along the probe sequence and in particular triple degenerated G at its solution end ((GGG1-effect are associated with exceptionally large probe intensities on GeneChip expression arrays. This intensity bias is related to non-specific hybridization and affects both perfect match and mismatch probes. The (GGG1-effect tends to increase gradually for microarrays of later GeneChip generations. It was found for DNA/RNA as well as for DNA/DNA probe/target-hybridization chemistries. Amplification of sample RNA using T7-primers is associated with strong positive amplitudes of the G-bias whereas alternative amplification protocols using random primers give rise to much smaller and partly even negative amplitudes. We applied positional dependent sensitivity models to analyze the specifics of probe intensities in the context of all possible short sequence motifs of one to four adjacent nucleotides along the 25meric probe sequence. Most of the longer motifs are adequately described using a nearest-neighbor (NN model. In contrast, runs of degenerated guanines require explicit consideration of next nearest neighbors (GGG terms. Preprocessing methods such as vsn, RMA, dChip, MAS5 and gcRMA only insufficiently remove the G-bias from data. Conclusions Positional and motif dependent sensitivity models accounts for sequence effects of oligonucleotide probe intensities. We propose a positional dependent NN+GGG hybrid model to correct the intensity bias associated with probes containing poly-G motifs. It is implemented as a single-chip based calibration

  13. Identification of differentially expressed genes in cucumber (Cucumis sativus L.) root under waterlogging stress by digital gene expression profile.

    Science.gov (United States)

    Qi, Xiao-Hua; Xu, Xue-Wen; Lin, Xiao-Jian; Zhang, Wen-Jie; Chen, Xue-Hao

    2012-03-01

    High-throughput tag-sequencing (Tag-seq) analysis based on the Solexa Genome Analyzer platform was applied to analyze the gene expression profiling of cucumber plant at 5 time points over a 24h period of waterlogging treatment. Approximately 5.8 million total clean sequence tags per library were obtained with 143013 distinct clean tag sequences. Approximately 23.69%-29.61% of the distinct clean tags were mapped unambiguously to the unigene database, and 53.78%-60.66% of the distinct clean tags were mapped to the cucumber genome database. Analysis of the differentially expressed genes revealed that most of the genes were down-regulated in the waterlogging stages, and the differentially expressed genes mainly linked to carbon metabolism, photosynthesis, reactive oxygen species generation/scavenging, and hormone synthesis/signaling. Finally, quantitative real-time polymerase chain reaction using nine genes independently verified the tag-mapped results. This present study reveals the comprehensive mechanisms of waterlogging-responsive transcription in cucumber. Copyright © 2011 Elsevier Inc. All rights reserved.

  14. DMBT1 expression and glycosylation during the adenoma-carcinoma sequence in colorectal cancer

    DEFF Research Database (Denmark)

    Robbe, C; Paraskeva, C; Mollenhauer, J

    2005-01-01

    , location and its mode of secretion during malignant transformation in colorectal cancer. Using human colorectal PC/AA cell lines and tissue sections from individual patients, we have examined the expression of DMBT1 and its glycosylation in the adenoma-carcinoma sequence leading to the adenocarcinoma......The gene DMBT1 (deleted in malignant brain tumour-1) has been proposed to play a role in brain and epithelial cancer, but shows unusual features for a classical tumour-suppressor gene. On the one hand, DMBT1 has been linked to mucosal protection, whereas, on the other, it potentially plays a role...... in epithelial differentiation. Thus its function in a particular tissue is of mechanistic importance for its role in cancer. Because the former function requires secretion to the lumen and the latter function may depend on its presence in the extracellular matrix, we decided to investigate DMBT1 expression...

  15. Sequence homology and expression profile of genes associated with dna repair pathways in Mycobacterium leprae

    Directory of Open Access Journals (Sweden)

    Mukul Sharma

    2017-01-01

    Full Text Available Background: Survival of Mycobacterium leprae, the causative bacteria for leprosy, in the human host is dependent to an extent on the ways in which its genome integrity is retained. DNA repair mechanisms protect bacterial DNA from damage induced by various stress factors. The current study is aimed at understanding the sequence and functional annotation of DNA repair genes in M. leprae. Methods: T he genome of M. leprae was annotated using sequence alignment tools to identify DNA repair genes that have homologs in Mycobacterium tuberculosis and Escherichia coli. A set of 96 genes known to be involved in DNA repair mechanisms in E. coli and Mycobacteriaceae were chosen as a reference. Among these, 61 were identified in M. leprae based on sequence similarity and domain architecture. The 61 were classified into 36 characterized gene products (59%, 11 hypothetical proteins (18%, and 14 pseudogenes (23%. All these genes have homologs in M. tuberculosis and 49 (80.32% in E. coli. A set of 12 genes which are absent in E. coli were present in M. leprae and in Mycobacteriaceae. These 61 genes were further investigated for their expression profiles in the whole transcriptome microarray data of M. leprae which was obtained from the signal intensities of 60bp probes, tiling the entire genome with 10bp overlaps. Results: It was noted that transcripts corresponding to all the 61 genes were identified in the transcriptome data with varying expression levels ranging from 0.18 to 2.47 fold (normalized with 16SrRNA. The mRNA expression levels of a representative set of seven genes ( four annotated and three hypothetical protein coding genes were analyzed using quantitative Polymerase Chain Reaction (qPCR assays with RNA extracted from skin biopsies of 10 newly diagnosed, untreated leprosy cases. It was noted that RNA expression levels were higher for genes involved in homologous recombination whereas the genes with a low level of expression are involved in the

  16. Sequence homology and expression profile of genes associated with DNA repair pathways in Mycobacterium leprae.

    Science.gov (United States)

    Sharma, Mukul; Vedithi, Sundeep Chaitanya; Das, Madhusmita; Roy, Anindya; Ebenezer, Mannam

    2017-01-01

    Survival of Mycobacterium leprae, the causative bacteria for leprosy, in the human host is dependent to an extent on the ways in which its genome integrity is retained. DNA repair mechanisms protect bacterial DNA from damage induced by various stress factors. The current study is aimed at understanding the sequence and functional annotation of DNA repair genes in M. leprae. T he genome of M. leprae was annotated using sequence alignment tools to identify DNA repair genes that have homologs in Mycobacterium tuberculosis and Escherichia coli. A set of 96 genes known to be involved in DNA repair mechanisms in E. coli and Mycobacteriaceae were chosen as a reference. Among these, 61 were identified in M. leprae based on sequence similarity and domain architecture. The 61 were classified into 36 characterized gene products (59%), 11 hypothetical proteins (18%), and 14 pseudogenes (23%). All these genes have homologs in M. tuberculosis and 49 (80.32%) in E. coli. A set of 12 genes which are absent in E. coli were present in M. leprae and in Mycobacteriaceae. These 61 genes were further investigated for their expression profiles in the whole transcriptome microarray data of M. leprae which was obtained from the signal intensities of 60bp probes, tiling the entire genome with 10bp overlaps. It was noted that transcripts corresponding to all the 61 genes were identified in the transcriptome data with varying expression levels ranging from 0.18 to 2.47 fold (normalized with 16SrRNA). The mRNA expression levels of a representative set of seven genes ( four annotated and three hypothetical protein coding genes) were analyzed using quantitative Polymerase Chain Reaction (qPCR) assays with RNA extracted from skin biopsies of 10 newly diagnosed, untreated leprosy cases. It was noted that RNA expression levels were higher for genes involved in homologous recombination whereas the genes with a low level of expression are involved in the direct repair pathway. This study provided

  17. Flavour Tagging at LHCb

    CERN Multimedia

    Grabalosa Gandara, M

    2009-01-01

    To do precise CP violation measurements, the most possible accurate knowledge of the flavour at production of the reconstructed B meson is required. This poster summarizes the flavour tagging performances for the LHCb experiment. We use same side an opposite side algorithms to establish wheter the meson contained a b or a b\\bar quark. The final decision is obtained through a combination of several methods. The use of control channels, decays to a flavour specific final state, will allow to determine the wrong tag fraction \\omega (the probability of a tag to be wrong), which can be used as input for the determination of CKM unitary triangle angles.

  18. RNA Sequencing Reveals that Kaposi Sarcoma-Associated Herpesvirus Infection Mimics Hypoxia Gene Expression Signature

    Science.gov (United States)

    Viollet, Coralie; Davis, David A.; Tekeste, Shewit S.; Reczko, Martin; Pezzella, Francesco; Ragoussis, Jiannis

    2017-01-01

    Kaposi sarcoma-associated herpesvirus (KSHV) causes several tumors and hyperproliferative disorders. Hypoxia and hypoxia-inducible factors (HIFs) activate latent and lytic KSHV genes, and several KSHV proteins increase the cellular levels of HIF. Here, we used RNA sequencing, qRT-PCR, Taqman assays, and pathway analysis to explore the miRNA and mRNA response of uninfected and KSHV-infected cells to hypoxia, to compare this with the genetic changes seen in chronic latent KSHV infection, and to explore the degree to which hypoxia and KSHV infection interact in modulating mRNA and miRNA expression. We found that the gene expression signatures for KSHV infection and hypoxia have a 34% overlap. Moreover, there were considerable similarities between the genes up-regulated by hypoxia in uninfected (SLK) and in KSHV-infected (SLKK) cells. hsa-miR-210, a HIF-target known to have pro-angiogenic and anti-apoptotic properties, was significantly up-regulated by both KSHV infection and hypoxia using Taqman assays. Interestingly, expression of KSHV-encoded miRNAs was not affected by hypoxia. These results demonstrate that KSHV harnesses a part of the hypoxic cellular response and that a substantial portion of hypoxia-induced changes in cellular gene expression are induced by KSHV infection. Therefore, targeting hypoxic pathways may be a useful way to develop therapeutic strategies for KSHV-related diseases. PMID:28046107

  19. Accounting for technical noise in differential expression analysis of single-cell RNA sequencing data.

    Science.gov (United States)

    Jia, Cheng; Hu, Yu; Kelly, Derek; Kim, Junhyong; Li, Mingyao; Zhang, Nancy R

    2017-11-02

    Recent technological breakthroughs have made it possible to measure RNA expression at the single-cell level, thus paving the way for exploring expression heterogeneity among individual cells. Current single-cell RNA sequencing (scRNA-seq) protocols are complex and introduce technical biases that vary across cells, which can bias downstream analysis without proper adjustment. To account for cell-to-cell technical differences, we propose a statistical framework, TASC (Toolkit for Analysis of Single Cell RNA-seq), an empirical Bayes approach to reliably model the cell-specific dropout rates and amplification bias by use of external RNA spike-ins. TASC incorporates the technical parameters, which reflect cell-to-cell batch effects, into a hierarchical mixture model to estimate the biological variance of a gene and detect differentially expressed genes. More importantly, TASC is able to adjust for covariates to further eliminate confounding that may originate from cell size and cell cycle differences. In simulation and real scRNA-seq data, TASC achieves accurate Type I error control and displays competitive sensitivity and improved robustness to batch effects in differential expression analysis, compared to existing methods. TASC is programmed to be computationally efficient, taking advantage of multi-threaded parallelization. We believe that TASC will provide a robust platform for researchers to leverage the power of scRNA-seq. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence

    OpenAIRE

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    Background: There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. Methods: All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinform...

  1. MOLECULAR CLONING, SEQUENCING, EXPRESSION AND BIOLOGICAL ACTIVITY OF GIANT PANDA (AILUROPODA MELANOLEUCA) INTERFERON-GAMMA.

    Science.gov (United States)

    Zhu, Hui; Wang, Wen-Xiu; Wang, Bao-Qin; Zhu, Xiao-Fu; Wu, Xu-Jin; Ma, Qing-Yi; Chen, De-Kun

    2012-06-29

    The giant panda (Ailuropoda melanoleuca) is an endangered species and indigenous to China. Interferon-gamma (IFN-γ) is the only member of type □ IFN and is vital for the regulation of host adapted immunity and inflammatory response. Little is known aboutthe FN-γ gene and its roles in giant panda.In this study, IFN-γ gene of Qinling giant panda was amplified from total blood RNA by RT-CPR, cloned, sequenced and analysed. The open reading frame (ORF) of Qinling giant panda IFN-γ encodes 152 amino acidsand is highly similar to Sichuan giant panda with an identity of 99.3% in cDNA sequence. The IFN-γ cDNA sequence was ligated to the pET32a vector and transformed into E. coli BL21 competent cells. Expression of recombinant IFN-γ protein of Qinling giant panda in E. coli was confirmed by SDS-PAGE and Western blot analysis. Biological activity assay indicated that the recombinant IFN-γ protein at the concentration of 4-10 µg/ml activated the giant panda peripheral blood lymphocytes,while at 12 µg/mlinhibited. the activation of the lymphocytes.These findings provide insights into the evolution of giant panda IFN-γ and information regarding amino acid residues essential for their biological activity.

  2. Cloning, sequencing, and expression of interferon-γ from elk in North America

    Science.gov (United States)

    Sweeney, Steven J.; Emerson, Carlene; Eriks, Inge S.

    2001-01-01

    Eradication of Mycobacterium bovis relies on accurate detection of infected animals, including potential domestic and wildlife reservoirs. Available diagnostic tests lack the sensitivity and specificity necessary for accurate detection, particularly in infected wildlife populations. Recently, an in vitro diagnostic test for cattle which measures plasma interferon-gamma (IFN-γ) levels in blood following in vitro incubation with M. bovis purified protein derivative has been enveloped. This test appears to have increased sensitivity over traditional testing. Unfortunately, it does not detect IFN-γ from Cervidae. To begin to address this problem, the IFN-γ gene from elk (Cervus elaphus) was cloned, sequenced, expressed, and characterized. cDNA was cloned from mitogen stimulated peripheral blood mononuclear cells. The predicted amino acid (aa) sequence was compared to known sequences from cattle, sheep, goats, red deer (Cervus elaphus), humans, and mice. Biological activity of the recombinant elk IFN-γ (rElkIFN-γ) was confirmed in a vesicular stomatitis virus cytopathic effect reduction assay. Production of monoclonal antibodies to IFN-γ epitopes conserved between ruminant species could provide an important tool for the development of reliable, practical diagnostic assays for detection of a delayed type hypersensitivity response to a variety of persistent infectious agents in ruminants, including M. bovis and Brucella abortus. Moreover, development of these reagents will aid investigators in studies to explore immunological responses of elk that are associated with resistance to infectious diseases.

  3. Molecular cloning, nucleotide sequence, and expression of the gene encoding human eosinophil differentiation factor (interleukin 5)

    International Nuclear Information System (INIS)

    Campbell, H.D.; Tucker, W.Q.J.; Hort, Y.; Martinson, M.E.; Mayo, G.; Clutterbuck, E.J.; Sanderson, C.J.; Young, I.G.

    1987-01-01

    The human eosinophil differentiation factor (EDF) gene was cloned from a genomic library in λ phage EMBL3A by using a murine EDF cDNA clone as a probe. The DNA sequence of a 3.2-kilobase BamHI fragment spanning the gene was determined. The gene contains three introns. The predicted amino acid sequence of 134 amino acids is identical with that recently reported for human interleukin 5 but shows no significant homology with other known hemopoietic growth regulators. The amino acid sequence shows strong homology (∼ 70% identity) with that of murine EDF. Recombinant human EDF, expressed from the human EDF gene after transfection into monkey COS cells, stimulated the production of eosinophils and eosinophil colonies from normal human bone marrow but had no effect on the production of neutrophils or mononuclear cells (monocytes and lymphoid cells). The apparent specificity of human EDF for the eosinophil lineage in myeloid hemopoiesis contrasts with the properties of human interleukin 3 and granulocyte/macrophage and granulocyte colony-stimulating factors but is directly analogous to the biological properties of murine EDF. Human EDF therefore represents a distinct hemopoietic growth factor that could play a central role in the regulation of eosinophilia

  4. Gene expression profiling of liver cancer stem cells by RNA-sequencing.

    Directory of Open Access Journals (Sweden)

    David W Y Ho

    Full Text Available BACKGROUND: Accumulating evidence supports that tumor growth and cancer relapse are driven by cancer stem cells. Our previous work has demonstrated the existence of CD90(+ liver cancer stem cells (CSCs in hepatocellular carcinoma (HCC. Nevertheless, the characteristics of these cells are still poorly understood. In this study, we employed a more sensitive RNA-sequencing (RNA-Seq to compare the gene expression profiling of CD90(+ cells sorted from tumor (CD90(+CSCs with parallel non-tumorous liver tissues (CD90(+NTSCs and elucidate the roles of putative target genes in hepatocarcinogenesis. METHODOLOGY/PRINCIPAL FINDINGS: CD90(+ cells were sorted respectively from tumor and adjacent non-tumorous human liver tissues using fluorescence-activated cell sorting. The amplified RNAs of CD90(+ cells from 3 HCC patients were subjected to RNA-Seq analysis. A differential gene expression profile was established between CD90(+CSCs and CD90(+NTSCs, and validated by quantitative real-time PCR (qRT-PCR on the same set of amplified RNAs, and further confirmed in an independent cohort of 12 HCC patients. Five hundred genes were differentially expressed (119 up-regulated and 381 down-regulated genes between CD90(+CSCs and CD90(+NTSCs. Gene ontology analysis indicated that the over-expressed genes in CD90(+CSCs were associated with inflammation, drug resistance and lipid metabolism. Among the differentially expressed genes, glypican-3 (GPC3, a member of glypican family, was markedly elevated in CD90(+CSCs compared to CD90(+NTSCs. Immunohistochemistry demonstrated that GPC3 was highly expressed in forty-two human liver tumor tissues but absent in adjacent non-tumorous liver tissues. Flow cytometry indicated that GPC3 was highly expressed in liver CD90(+CSCs and mature cancer cells in liver cancer cell lines and human liver tumor tissues. Furthermore, GPC3 expression was positively correlated with the number of CD90(+CSCs in liver tumor tissues. CONCLUSIONS

  5. Gene Expression Profiling of Liver Cancer Stem Cells by RNA-Sequencing

    Science.gov (United States)

    Lam, Chi Tat; Ng, Michael N. P.; Yu, Wan Ching; Lau, Joyce; Wan, Timothy; Wang, Xiaoqi; Yan, Zhixiang; Liu, Hang; Fan, Sheung Tat

    2012-01-01

    Background Accumulating evidence supports that tumor growth and cancer relapse are driven by cancer stem cells. Our previous work has demonstrated the existence of CD90+ liver cancer stem cells (CSCs) in hepatocellular carcinoma (HCC). Nevertheless, the characteristics of these cells are still poorly understood. In this study, we employed a more sensitive RNA-sequencing (RNA-Seq) to compare the gene expression profiling of CD90+ cells sorted from tumor (CD90+CSCs) with parallel non-tumorous liver tissues (CD90+NTSCs) and elucidate the roles of putative target genes in hepatocarcinogenesis. Methodology/Principal Findings CD90+ cells were sorted respectively from tumor and adjacent non-tumorous human liver tissues using fluorescence-activated cell sorting. The amplified RNAs of CD90+ cells from 3 HCC patients were subjected to RNA-Seq analysis. A differential gene expression profile was established between CD90+CSCs and CD90+NTSCs, and validated by quantitative real-time PCR (qRT-PCR) on the same set of amplified RNAs, and further confirmed in an independent cohort of 12 HCC patients. Five hundred genes were differentially expressed (119 up-regulated and 381 down-regulated genes) between CD90+CSCs and CD90+NTSCs. Gene ontology analysis indicated that the over-expressed genes in CD90+CSCs were associated with inflammation, drug resistance and lipid metabolism. Among the differentially expressed genes, glypican-3 (GPC3), a member of glypican family, was markedly elevated in CD90+CSCs compared to CD90+NTSCs. Immunohistochemistry demonstrated that GPC3 was highly expressed in forty-two human liver tumor tissues but absent in adjacent non-tumorous liver tissues. Flow cytometry indicated that GPC3 was highly expressed in liver CD90+CSCs and mature cancer cells in liver cancer cell lines and human liver tumor tissues. Furthermore, GPC3 expression was positively correlated with the number of CD90+CSCs in liver tumor tissues. Conclusions/Significance The identified genes

  6. Sequence elements controlling expression of Barley stripe mosaic virus subgenomic RNAs in vivo

    International Nuclear Information System (INIS)

    Johnson, Jennifer A.; Bragg, Jennifer N.; Lawrence, Diane M.; Jackson, Andrew O.

    2003-01-01

    Barley stripe mosaic virus (BSMV) contains three positive-sense, single-stranded genomic RNAs, designated α, β, and γ, that encode seven major proteins and one minor translational readthrough protein. Three proteins (αa, βa, and γa) are translated directly from the genomic RNAs and the remaining proteins encoded on RNAβ and RNAγ are expressed via three subgenomic messenger RNAs (sgRNAs). sgRNAβ1 directs synthesis of the triple gene block 1 (TGB1) protein. The TGB2 protein, the TGB2' minor translational readthrough protein, and the TGB3 protein are expressed from sgRNAβ2, which is present in considerably lower abundance than sgRNAβ1. A third sgRNA, sgRNAγ, is required for expression of the γb protein. We have used deletion analyses and site-specific mutations to define the boundaries of promoter regions that are critical for expression of the BSMV sgRNAs in infected protoplasts. The results reveal that the sgRNAβ1 promoter encompasses positions -29 to -2 relative to its transcription start site and is adjacent to a cis-acting element required for RNAβ replication that maps from -107 to -74 relative to the sgRNAβ1 start site. The core sgRNAβ2 promoter includes residues -32 to -17 relative to the sgRNAβ2 transcriptional start site, although maximal activity requires an upstream hexanucleotide sequence residing from positions -64 to -59. The sgRNAγ promoter maps from -21 to +2 relative to its transcription start site and therefore partially overlaps the γa gene. The sgRNAβ1, β2, and γ promoters also differ substantially in sequence, but have similarities to the putative homologous promoters of other Hordeiviruses. These differences are postulated to affect competition for the viral polymerase, coordination of the temporal expression and abundance of the TGB proteins, and constitutive expression of the γb protein

  7. Tagged Vector Contour (TVC)

    Data.gov (United States)

    Kansas Data Access and Support Center — The Kansas Tagged Vector Contour (TVC) dataset consists of digitized contours from the 7.5 minute topographic quadrangle maps. Coverage for the state is incomplete....

  8. Cloning, sequence and expression of the pel gene from an Amycolata sp.

    Science.gov (United States)

    Brühlmann, F; Keen, N T

    1997-11-20

    The pel gene from an Amycolata sp. encoding a pectate lyase (EC 4.2.2.2) was isolated by activity screening a genomic DNA library in Streptomyces lividans TK24. Subsequent subcloning and sequencing of a 2.3 kb BamHI BglII fragment revealed an open reading frame of 930 nt corresponding to a protein of 29,660 Da. The overall G + C content for the coding region was 65%, with a strong G + C preference in the third (wobble) codon position (93%). A putative ribosome-binding site 5'-GGGAG-3' preceded the translational start codon by 7 base pairs. The Amycolata pectate lyase contains a signal peptide of 26 amino acids, that is cleaved after the sequence Ala-Thr-Ala. The size of the deduced protein as well as its N-terminal amino-acid sequence match the wild-type pectate lyase from the Amycolata sp. Expression of the pel gene in S. lividans TK24 resulted in high pectate lyase activity in the culture supernatant, concomitant with the appearance of a dominant protein band on a sodium dodecyl polyacrylamide gel at 30 kDa. No pectate lyase activity was detected in E. coli BL21 with the pel gene under the strong T7 promotor. The deduced amino-acid sequence showed 40% identity with PelE from Erwinia chrysanthemi and the pectate lyase from Glomerella cingulata. The Amycolata pectate lyase clearly belongs to the pectate lyase superfamily, sharing all functional amino acids and likely has a similar structural topology as Pels from Erwinia chrysanthemi and Bacillus subtilis.

  9. LncRNA Expression Profile of Human Thoracic Aortic Dissection by High-Throughput Sequencing.

    Science.gov (United States)

    Sun, Jie; Chen, Guojun; Jing, Yuanwen; He, Xiang; Dong, Jianting; Zheng, Junmeng; Zou, Meisheng; Li, Hairui; Wang, Shifei; Sun, Yili; Liao, Wangjun; Liao, Yulin; Feng, Li; Bin, Jianping

    2018-01-01

    In this study, the long non-coding RNA (lncRNA) expression profile in human thoracic aortic dissection (TAD), a highly lethal cardiovascular disease, was investigated. Human TAD (n=3) and normal aortic tissues (NA) (n=3) were examined by high-throughput sequencing. Bioinformatics analyses were performed to predict the roles of aberrantly expressed lncRNAs. Quantitative real-time polymerase chain reaction (qRT-PCR) was applied to validate the results. A total of 269 lncRNAs (159 up-regulated and 110 down-regulated) and 2, 255 mRNAs (1 294 up-regulated and 961 down-regulated) were aberrantly expressed in human TAD (fold-change> 1.5, PTAD than in NA. The predicted binding motifs of three up-regulated lncRNAs (ENSG00000248508, ENSG00000226530, and EG00000259719) were correlated with up-regulated RUNX1 (R=0.982, PTAD. These findings suggest that lncRNAs are novel potential therapeutic targets for human TAD. © 2018 The Author(s). Published by S. Karger AG, Basel.

  10. Sequencing, physical organization and kinetic expression of the patulin biosynthetic gene cluster from Penicillium expansum

    International Nuclear Information System (INIS)

    Tannous, J.; El Khoury, R.; El Khoury, A.; Lteif, R.; Snini, S.; Lippi, Y.; Oswald, I.; Olivier, P.; Atoui, A.

    2014-01-01

    Patulin is a polyketide-derived mycotoxin produced by numerous filamentous fungi. Among them, Penicillium expansum is by far the most problematic species. This fungus is a destructive phytopathogen capable of growing on fruit, provoking the blue mold decay of apples and producing significant amounts of patulin. The biosynthetic pathway of this mycotoxin is chemically well-characterized, but its genetic bases remain largely unknown with only few characterized genes in less economic relevant species. The present study consisted of the identification and positional organization of the patulin gene cluster in P. expansum strain NRRL 35695. Several amplification reactions were performed with degenerative primers that were designed based on sequences from the orthologous genes available in other species. An improved genome Walking approach was used in order to sequence the remaining adjacent genes of the cluster. RACE-PCR was also carried out from mRNAs to determine the start and stop codons of the coding sequences. The patulin gene cluster in P. expansum consists of 15 genes in the following order: patH, patG, patF, patE, patD, patC, patB, patA, patM, patN, patO, patL, patI, patJ, and patK. These genes share 60–70% of identity with orthologous genes grouped differently, within a putative patulin cluster described in a non-producing strain of Aspergillus clavatus. The kinetics of patulin cluster genes expression was studied under patulin-permissive conditions (natural apple-based medium) and patulin-restrictive conditions (Eagle's minimal essential medium), and demonstrated a significant association between gene expression and patulin production. In conclusion, the sequence of the patulin cluster in P. expansum constitutes a key step for a better understanding of themechanisms leading to patulin production in this fungus. It will allow the role of each gene to be elucidated, and help to define strategies to reduce patulin production in apple-based products

  11. Top tagging with deep neural networks [Vidyo

    CERN Multimedia

    CERN. Geneva

    2017-01-01

    Recent literature on deep neural networks for top tagging has focussed on image based techniques or multivariate approaches using high level jet substructure variables. Here, we take a sequential approach to this task by using anordered sequence of energy deposits as training inputs. Unlike previous approaches, this strategy does not result in a loss of information during pixelization or the calculation of high level features. We also propose new preprocessing methods that do not alter key physical quantities such as jet mass. We compare the performance of this approach to standard tagging techniques and present results evaluating the robustness of the neural network to pileup.

  12. MicroRNA Expression Profile in Penile Cancer Revealed by Next-Generation Small RNA Sequencing.

    Directory of Open Access Journals (Sweden)

    Li Zhang

    Full Text Available Penile cancer (PeCa is a relatively rare tumor entity but possesses higher morbidity and mortality rates especially in developing countries. To date, the concrete pathogenic signaling pathways and core machineries involved in tumorigenesis and progression of PeCa remain to be elucidated. Several studies suggested miRNAs, which modulate gene expression at posttranscriptional level, were frequently mis-regulated and aberrantly expressed in human cancers. However, the miRNA profile in human PeCa has not been reported before. In this present study, the miRNA profile was obtained from 10 fresh penile cancerous tissues and matched adjacent non-cancerous tissues via next-generation sequencing. As a result, a total of 751 and 806 annotated miRNAs were identified in normal and cancerous penile tissues, respectively. Among which, 56 miRNAs with significantly different expression levels between paired tissues were identified. Subsequently, several annotated miRNAs were selected randomly and validated using quantitative real-time PCR. Compared with the previous publications regarding to the altered miRNAs expression in various cancers and especially genitourinary (prostate, bladder, kidney, testis cancers, the most majority of deregulated miRNAs showed the similar expression pattern in penile cancer. Moreover, the bioinformatics analyses suggested that the putative target genes of differentially expressed miRNAs between cancerous and matched normal penile tissues were tightly associated with cell junction, proliferation, growth as well as genomic instability and so on, by modulating Wnt, MAPK, p53, PI3K-Akt, Notch and TGF-β signaling pathways, which were all well-established to participate in cancer initiation and progression. Our work presents a global view of the differentially expressed miRNAs and potentially regulatory networks of their target genes for clarifying the pathogenic transformation of normal penis to PeCa, which research resource also

  13. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

    Science.gov (United States)

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.

  14. Expression of Sme efflux pumps and multilocus sequence typing in clinical isolates of Stenotrophomonas maltophilia.

    Science.gov (United States)

    Cho, Hye Hyun; Sung, Ji Youn; Kwon, Kye Chul; Koo, Sun Hoe

    2012-01-01

    Stenotrophomonas maltophilia has emerged as an important opportunistic pathogen, which causes infections that are often difficult to manage because of the inherent resistance of the pathogen to a variety of antimicrobial agents. In this study, we analyzed the expressions of smeABC and smeDEF and their correlation with antimicrobial susceptibility. We also evaluated the genetic relatedness and epidemiological links among 33 isolates of S. maltophilia. In total, 33 S. maltophilia strains were isolated from patients in a tertiary hospital in Daejeon. Minimum inhibitory concentrations (MICs) of 11 antimicrobial agents were determined by using agar dilution method and E-test (BioMérieux, France). Real-time PCR analysis was performed to evaluate the expression of the Sme efflux systems in the S. maltophilia isolates. Additionally, an epidemiological investigation was performed using multilocus sequence typing (MLST) assays. The findings of susceptibility testing showed that the majority of the S. maltophilia isolates were resistant to β-lactams and aminoglycosides. Twenty-one clinical isolates overexpressed smeABC and showed high resistance to ciprofloxacin. Moreover, a high degree of genetic diversity was observed among the S. maltophilia isolates; 3 sequence types (STs) and 23 allelic profiles were observed. The smeABC efflux pump was associated with multidrug resistance in clinical isolates of S. maltophilia. In particular, smeABC efflux pumps appear to perform an important role in ciprofloxacin resistance of S. maltophilia. The MLST scheme for S. maltophilia represents a discriminatory typing method with stable markers and is appropriate for studying population structures.

  15. Cloning, Expression, Sequence Analysis and Homology Modeling of the Prolyl Endoprotease from Eurygaster integriceps Puton

    Directory of Open Access Journals (Sweden)

    Ravi Chandra Yandamuri

    2014-10-01

    Full Text Available eurygaster integriceps Puton, commonly known as sunn pest, is a major pest of wheat in Northern Africa, the Middle East and Eastern Europe. This insect injects a prolyl endoprotease into the wheat, destroying the gluten. The purpose of this study was to clone the full length cDNA of the sunn pest prolyl endoprotease (spPEP for expression in E. coli and to compare the amino acid sequence of the enzyme to other known PEPs in both phylogeny and potential tertiary structure. Sequence analysis shows that the 5ꞌ UTR contains several putative transcription factor binding sites for transcription factors known to be expressed in Drosophila that might be useful targets for inhibition of the enzyme. The spPEP was first identified as a prolyl endoprotease by Darkoh et al., 2010. The enzyme is a unique serine protease of the S9A family by way of its substrate recognition of the gluten proteins, which are greater than 30 kD in size. At 51% maximum identity to known PEPs, homology modeling using SWISS-MODEL, the porcine brain PEP (PDB: 2XWD was selected in the database of known PEP structures, resulting in a predicted tertiary structure 99% identical to the porcine brain PEP structure. A Km for the recombinant spPEP was determined to be 210 ± 53 µM for the zGly-Pro-pNA substrate in 0.025 M ethanolamine, pH 8.5, containing 0.1 M NaCl at 37 °C with a turnover rate of 172 ± 47 µM Gly-Pro-pNA/s/µM of enzyme.

  16. Expressed sequence enrichment for candidate gene analysis of citrus tristeza virus resistance.

    Science.gov (United States)

    Bernet, G P; Bretó, M P; Asins, M J

    2004-02-01

    Several studies have reported markers linked to a putative resistance gene from Poncirus trifoliata ( Ctv-R) located at linkage group 4 that confers resistance against one of the most important citrus pathogens, citrus tristeza virus (CTV). To be successful in both marker-assisted selection and transformation experiments, its accurate mapping is needed. Several factors may affect its localization, among them two are considered here: the definition of resistance and the genetic background of progeny. Two progenies derived from P. trifoliata, by self-pollination and by crossing with sour orange ( Citrus aurantium), a citrus rootstock well-adapted to arid and semi-arid areas, were used for linkage group-4 marker enrichment. Two new methodologies were used to enrich this region with expressed sequences. The enrichment of group 4 resulted in the fusion of several C. aurantium linkage groups. The new one A(7+3+4) is now saturated with 48 markers including expressed sequences. Surprisingly, sour orange was as resistant to the CTV isolate tested as was P. trifoliata, and three hybrids that carry Ctv-R, as deduced from its flanking markers, are susceptible to CTV. The new linkage maps were used to map Ctv-R under the hypothesis of monogenic inheritance. Its position on linkage group 4 of P. trifoliata differs from the location previously reported in other progenies. The genetic analysis of virus-plant interaction in the family derived from C. aurantium after a CTV chronic infection showed the segregation of five types of interaction, which is not compatible with the hypothesis of a single gene controlling resistance. Two major issues are discussed: another type of genetic analysis of CTV resistance is needed to avoid the assumption of monogenic inheritance, and transferring Ctv-R from P. trifoliata to sour orange might not avoid the CTV decline of sweet orange trees.

  17. Optimizing a massive parallel sequencing workflow for quantitative miRNA expression analysis.

    Directory of Open Access Journals (Sweden)

    Francesca Cordero

    Full Text Available BACKGROUND: Massive Parallel Sequencing methods (MPS can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e.g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing with the vast amounts of data generated from short read sequencing. Although the number of computational tools for MPS data analysis is constantly growing, their strengths and weaknesses as part of a complex analytical pipe-line have not yet been well investigated. PRIMARY FINDINGS: A benchmark MPS miRNA dataset, resembling a situation in which miRNAs are spiked in biological replication experiments was assembled by merging a publicly available MPS spike-in miRNAs data set with MPS data derived from healthy donor peripheral blood mononuclear cells. Using this data set we observed that short reads counts estimation is strongly under estimated in case of duplicates miRNAs, if whole genome is used as reference. Furthermore, the sensitivity of miRNAs detection is strongly dependent by the primary tool used in the analysis. Within the six aligners tested, specifically devoted to miRNA detection, SHRiMP and MicroRazerS show the highest sensitivity. Differential expression estimation is quite efficient. Within the five tools investigated, two of them (DESseq, baySeq show a very good specificity and sensitivity in the detection of differential expression. CONCLUSIONS: The results provided by our analysis allow the definition of a clear and simple analytical optimized workflow for miRNAs digital quantitative analysis.

  18. Optimizing a massive parallel sequencing workflow for quantitative miRNA expression analysis.

    Science.gov (United States)

    Cordero, Francesca; Beccuti, Marco; Arigoni, Maddalena; Donatelli, Susanna; Calogero, Raffaele A

    2012-01-01

    Massive Parallel Sequencing methods (MPS) can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e.g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing with the vast amounts of data generated from short read sequencing. Although the number of computational tools for MPS data analysis is constantly growing, their strengths and weaknesses as part of a complex analytical pipe-line have not yet been well investigated. A benchmark MPS miRNA dataset, resembling a situation in which miRNAs are spiked in biological replication experiments was assembled by merging a publicly available MPS spike-in miRNAs data set with MPS data derived from healthy donor peripheral blood mononuclear cells. Using this data set we observed that short reads counts estimation is strongly under estimated in case of duplicates miRNAs, if whole genome is used as reference. Furthermore, the sensitivity of miRNAs detection is strongly dependent by the primary tool used in the analysis. Within the six aligners tested, specifically devoted to miRNA detection, SHRiMP and MicroRazerS show the highest sensitivity. Differential expression estimation is quite efficient. Within the five tools investigated, two of them (DESseq, baySeq) show a very good specificity and sensitivity in the detection of differential expression. The results provided by our analysis allow the definition of a clear and simple analytical optimized workflow for miRNAs digital quantitative analysis.

  19. Cloning, DNA sequence, and expression of the Rhodobacter sphaeroides cytochrome c/sub 2/ gene

    Energy Technology Data Exchange (ETDEWEB)

    Donohue, T.J.; McEwan, A.G.; Kaplan, S.

    1986-11-01

    The Rhodobacter sphaeroides cytochrome c/sub 2/ functions as a mobile electron carrier in both aerobic and photosynthetic electron transport chains. Synthetic deoxyoligonucleotide probes, based on the known amino acid sequence of this protein (M/sub r/ 14,000), were used to identify and clone the cytochrome c/sub 2/ structural gene (cycA). DNA sequence analysis of the cycA gene indicated the presence of a typical procaryotic 21-residue signal sequence, suggesting that this periplasmic protein is synthesized in vivo as a precursor. Synthesis of an immunoreactive cytochrome c/sub 2/ precursor protein (M/sub r/ 15,500) was observed in vitro when plasmids containing the cycA gene were used as templates in an R. sphaeroides coupled transcription-translation system. Approximately 500 base pairs of DNA upstream of the cycA gene was sufficient to allow expression of this gene product in vitro. Northern blot analysis with an internal cycA-specific probe identified at least two possibly monocistronic transcripts present in both different cellular levels and relative stoichiometries in steady-state cells grown under different physiological conditions. The ratio of the small (740-mucleotide) and large (920-nucleotide) cycA-specific mRNA species was dependent on cultural conditions but was not affected by light intensity under photosynthetic conditions. These results suggest that the increase in the cellular level of the cytochrome c/sub 2/ protein found in photosynthetic cells was due, in part, to increased transcription of the single-copy cyc operon.

  20. Analysis of tag-position bias in MPSS technology

    Directory of Open Access Journals (Sweden)

    Rattray Magnus

    2006-04-01

    Full Text Available Abstract Background Massively Parallel Signature Sequencing (MPSS technology was recently developed as a high-throughput technology for measuring the concentration of mRNA transcripts in a sample. It has previously been observed that the position of the signature tag in a transcript (distance from 3' end can affect the measurement, but this effect has not been studied in detail. Results We quantify the effect of tag-position bias in Classic and Signature MPSS technology using published data from Arabidopsis, rice and human. We investigate the relationship between measured concentration and tag-position using nonlinear regression methods. The observed relationship is shown to be broadly consistent across different data sets. We find that there exist different and significant biases in both Classic and Signature MPSS data. For Classic MPSS data, genes with tag-position in the middle-range have highest measured abundance on average while genes with tag-position in the high-range, far from the 3' end, show a significant decrease. For Signature MPSS data, high-range tag-position genes tend to have a flatter relationship between tag-position and measured abundance. Thus, our results confirm that the Signature MPSS method fixes a substantial problem with the Classic MPSS method. For both Classic and Signature MPSS data there is a positive correlation between measured abundance and tag-position for low-range tag-position genes. Compared with the effects of mRNA length and number of exons, tag-position bias seems to be more significant in Arabadopsis. The tag-position bias is reflected both in the measured abundance of genes with a significant tag count and in the proportion of unexpressed genes identified. Conclusion Tag-position bias should be taken into consideration when measuring mRNA transcript abundance using MPSS technology, both in Classic and Signature MPSS methods.

  1. Construction and expression of eukaryotic expression vectors of full-length, amino-terminus and carboxyl-terminus Raf gene

    Directory of Open Access Journals (Sweden)

    Zhuomin WANG

    2008-06-01

    Full Text Available Background and objective Raf is a key molecule in the Ras-Raf-MEK-ERK signal transduction pathway and is highly activated in different human carcinomas. However, its biological functions and regulation mechanisms are still unclear. The aims of this study were to construct eukaryotic expression vectors with Raf full encoding region, truncated amino-terminus and carboxyl-terminus, respectively. Methods Eukaryotic expression vectors of pCMV-Tag2b-Raf-1, pCMV-Tag2b-N-Raf and pCMV-Tag2b-C-Raf were constructed by gene recombination technique and confirmed by restriction enzyme analysis and DNA sequencing. Furthermore, the expression of these fusion proteins was detected by western blot in transient transfected 293T cells. Results The sequences and open reading frames of these three vectors were completely consistent with experimental design. All target proteins can be detected in 293T cells. Conclusion Eukaryotic expression vectors of pCMV-Tag2b-Raf-1, pCMV-Tag2b-N-Raf and pCMV-Tag2b-C-Raf were successfully constructed and can be expressed in 293T cells.

  2. Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

    Directory of Open Access Journals (Sweden)

    Rachel Caldwell

    2015-01-01

    Full Text Available There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length.

  3. Inclusive Flavour Tagging Algorithm

    International Nuclear Information System (INIS)

    Likhomanenko, Tatiana; Derkach, Denis; Rogozhnikov, Alex

    2016-01-01

    Identifying the flavour of neutral B mesons production is one of the most important components needed in the study of time-dependent CP violation. The harsh environment of the Large Hadron Collider makes it particularly hard to succeed in this task. We present an inclusive flavour-tagging algorithm as an upgrade of the algorithms currently used by the LHCb experiment. Specifically, a probabilistic model which efficiently combines information from reconstructed vertices and tracks using machine learning is proposed. The algorithm does not use information about underlying physics process. It reduces the dependence on the performance of lower level identification capacities and thus increases the overall performance. The proposed inclusive flavour-tagging algorithm is applicable to tag the flavour of B mesons in any proton-proton experiment. (paper)

  4. Negative effect of the 5'-untranslated leader sequence on Ac transposon promoter expression.

    Science.gov (United States)

    Scortecci, K C; Raina, R; Fedoroff, N V; Van Sluys, M A

    1999-08-01

    Transposable elements are used in heterologous plant hosts to clone genes by insertional mutagenesis. The Activator (Ac) transposable element has been cloned from maize, and introduced into a variety of plants. However, differences in regulation and transposition frequency have been observed between different host plants. The cause of this variability is still unknown. To better understand the activity of the Ac element, we analyzed the Ac promoter region and its 5'-untranslated leader sequence (5' UTL). Transient assays in tobacco NT1 suspension cells showed that the Ac promoter is a weak promoter and its activity was localized by deletion analyses. The data presented here indicate that the core of the Ac promoter is contained within 153 bp fragment upstream to transcription start sites. An important inhibitory effect (80%) due to the presence of the 5' UTL was found on the expression of LUC reporter gene. Here we demonstrate that the presence of the 5' UTL in the constructs reduces the expression driven by either strong or weak promoters.

  5. dictyExpress: a web-based platform for sequence data management and analytics in Dictyostelium and beyond.

    Science.gov (United States)

    Stajdohar, Miha; Rosengarten, Rafael D; Kokosar, Janez; Jeran, Luka; Blenkus, Domen; Shaulsky, Gad; Zupan, Blaz

    2017-06-02

    Dictyostelium discoideum, a soil-dwelling social amoeba, is a model for the study of numerous biological processes. Research in the field has benefited mightily from the adoption of next-generation sequencing for genomics and transcriptomics. Dictyostelium biologists now face the widespread challenges of analyzing and exploring high dimensional data sets to generate hypotheses and discovering novel insights. We present dictyExpress (2.0), a web application designed for exploratory analysis of gene expression data, as well as data from related experiments such as Chromatin Immunoprecipitation sequencing (ChIP-Seq). The application features visualization modules that include time course expression profiles, clustering, gene ontology enrichment analysis, differential expression analysis and comparison of experiments. All visualizations are interactive and interconnected, such that the selection of genes in one module propagates instantly to visualizations in other modules. dictyExpress currently stores the data from over 800 Dictyostelium experiments and is embedded within a general-purpose software framework for management of next-generation sequencing data. dictyExpress allows users to explore their data in a broader context by reciprocal linking with dictyBase-a repository of Dictyostelium genomic data. In addition, we introduce a companion application called GenBoard, an intuitive graphic user interface for data management and bioinformatics analysis. dictyExpress and GenBoard enable broad adoption of next generation sequencing based inquiries by the Dictyostelium research community. Labs without the means to undertake deep sequencing projects can mine the data available to the public. The entire information flow, from raw sequence data to hypothesis testing, can be accomplished in an efficient workspace. The software framework is generalizable and represents a useful approach for any research community. To encourage more wide usage, the backend is open

  6. Policy administration in tag-based authorization

    NARCIS (Netherlands)

    Etalle, Sandro; Hinrichs, Timothy L.; Lee, Adam J.; Trivellato, Daniel; Zannone, Nicola

    2013-01-01

    Tag-Based Authorization (TBA) is a hybrid access control model that combines the ease of use of extensional access control models with the expressivity of logic-based formalisms. The main limitation of TBA is that it lacks support for policy administration. More precisely, it does not allow

  7. Learner Corpora without Error Tagging

    Directory of Open Access Journals (Sweden)

    Rastelli, Stefano

    2009-01-01

    Full Text Available The article explores the possibility of adopting a form-to-function perspective when annotating learner corpora in order to get deeper insights about systematic features of interlanguage. A split between forms and functions (or categories is desirable in order to avoid the "comparative fallacy" and because – especially in basic varieties – forms may precede functions (e.g., what resembles to a "noun" might have a different function or a function may show up in unexpected forms. In the computer-aided error analysis tradition, all items produced by learners are traced to a grid of error tags which is based on the categories of the target language. Differently, we believe it is possible to record and make retrievable both words and sequence of characters independently from their functional-grammatical label in the target language. For this purpose at the University of Pavia we adapted a probabilistic POS tagger designed for L1 on L2 data. Despite the criticism that this operation can raise, we found that it is better to work with "virtual categories" rather than with errors. The article outlines the theoretical background of the project and shows some examples in which some potential of SLA-oriented (non error-based tagging will be possibly made clearer.

  8. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    Science.gov (United States)

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  9. Methyl-CpG island-associated genome signature tags

    Science.gov (United States)

    Dunn, John J

    2014-05-20

    Disclosed is a method for analyzing the organismic complexity of a sample through analysis of the nucleic acid in the sample. In the disclosed method, through a series of steps, including digestion with a type II restriction enzyme, ligation of capture adapters and linkers and digestion with a type IIS restriction enzyme, genome signature tags are produced. The sequences of a statistically significant number of the signature tags are determined and the sequences are used to identify and quantify the organisms in the sample. Various embodiments of the invention described herein include methods for using single point genome signature tags to analyze the related families present in a sample, methods for analyzing sequences associated with hyper- and hypo-methylated CpG islands, methods for visualizing organismic complexity change in a sampling location over time and methods for generating the genome signature tag profile of a sample of fragmented DNA.

  10. Co-transcriptomic Analysis by RNA Sequencing to Simultaneously Measure Regulated Gene Expression in Host and Bacterial Pathogen

    KAUST Repository

    Ravasi, Timothy; Mavromatis, Charalampos Harris; Bokil, Nilesh J.; Schembri, Mark A.; Sweet, Matthew J.

    2016-01-01

    Intramacrophage pathogens subvert antimicrobial defence pathways using various mechanisms, including the targeting of host TLR-mediated transcriptional responses. Conversely, TLR-inducible host defence mechanisms subject intramacrophage pathogens to stress, thus altering pathogen gene expression programs. Important biological insights can thus be gained through the analysis of gene expression changes in both the host and the pathogen during an infection. Traditionally, research methods have involved the use of qPCR, microarrays and/or RNA sequencing to identify transcriptional changes in either the host or the pathogen. Here we describe the application of RNA sequencing using samples obtained from in vitro infection assays to simultaneously quantify both host and bacterial pathogen gene expression changes, as well as general approaches that can be undertaken to interpret the RNA sequencing data that is generated. These methods can be used to provide insights into host TLR-regulated transcriptional responses to microbial challenge, as well as pathogen subversion mechanisms against such responses.

  11. Co-transcriptomic Analysis by RNA Sequencing to Simultaneously Measure Regulated Gene Expression in Host and Bacterial Pathogen

    KAUST Repository

    Ravasi, Timothy

    2016-01-24

    Intramacrophage pathogens subvert antimicrobial defence pathways using various mechanisms, including the targeting of host TLR-mediated transcriptional responses. Conversely, TLR-inducible host defence mechanisms subject intramacrophage pathogens to stress, thus altering pathogen gene expression programs. Important biological insights can thus be gained through the analysis of gene expression changes in both the host and the pathogen during an infection. Traditionally, research methods have involved the use of qPCR, microarrays and/or RNA sequencing to identify transcriptional changes in either the host or the pathogen. Here we describe the application of RNA sequencing using samples obtained from in vitro infection assays to simultaneously quantify both host and bacterial pathogen gene expression changes, as well as general approaches that can be undertaken to interpret the RNA sequencing data that is generated. These methods can be used to provide insights into host TLR-regulated transcriptional responses to microbial challenge, as well as pathogen subversion mechanisms against such responses.

  12. Effect of ATRX and G-Quadruplex Formation by the VNTR Sequence on α-Globin Gene Expression.

    Science.gov (United States)

    Li, Yue; Syed, Junetha; Suzuki, Yuki; Asamitsu, Sefan; Shioda, Norifumi; Wada, Takahito; Sugiyama, Hiroshi

    2016-05-17

    ATR-X (α-thalassemia/mental retardation X-linked) syndrome is caused by mutations in chromatin remodeler ATRX. ATRX can bind the variable number of tandem repeats (VNTR) sequence in the promoter region of the α-globin gene cluster. The VNTR sequence, which contains the potential G-quadruplex-forming sequence CGC(GGGGCGGGG)n , is involved in the downregulation of α-globin expression. We investigated G-quadruplex and i-motif formation in single-stranded DNA and long double-stranded DNA. The promoter region without the VNTR sequence showed approximately twofold higher luciferase activity than the promoter region harboring the VNTR sequence. G-quadruplex stabilizers hemin and TMPyP4 reduced the luciferase activity, whereas expression of ATRX led to a recovery in reporter activity. Our results demonstrate that stable G-quadruplex formation by the VNTR sequence downregulates the expression of α-globin genes and that ATRX might bind to and resolve the G-quadruplex. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant peroxidases.

    Science.gov (United States)

    Kjaersgård, I V; Jespersen, H M; Rasmussen, S K; Welinder, K G

    1997-03-01

    cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP 1a and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid sequence to the well characterized group of basic plant peroxidases represented by the horseradish (Armoracia rusticana) isoperoxidases HRP C, HRP E5 and the similar Arabidopsis isoperoxidases ATP Ca, ATP Cb, and ATP Ea. However ATP 1a is 87% identical in amino acid sequence to a peroxidase encoded by an mRNA isolated from cotton (Gossypium hirsutum). As cotton and Arabidopsis belong to rather diverse families (Malvaceae and Crucifereae, respectively), in contrast with Arabidopsis and horseradish (both Crucifereae), the high degree of sequence identity indicates that this novel type of peroxidase, albeit of unknown function, is likely to be widespread in plant species. The atp 1 and atp 2 types of cDNA sequences were the most redundant among the 28 different isoperoxidases identified among about 200 peroxidase encoding ESTs. Interestingly, 8 out of totally 38 EST sequences coding for ATP 1 showed three identical nucleotide substitutions. This variant form is designated ATP 1b. Similarly, six out of totally 16 EST sequences coding for ATP 2 showed a number of deletions and nucleotide changes. This variant form is designated ATP 2b. The selected EST clones are full-length and contain coding regions of 993 nucleotides for atp 1a, and 984 nucleotides for atp 2a. These regions show 61% DNA sequence identity. The predicted mature proteins ATP 1a, and ATP 2a are 57% identical in sequence and contain the structurally and functionally important residues, characteristic of the plant peroxidase superfamily. However, they do show two differences of importance to peroxidase catalysis: (1) the asparagine residue linked with the active site distal histidine via hydrogen bonding is absent

  14. SATB1 regulates SPARC expression in K562 cell line through binding to a specific sequence in the third intron

    International Nuclear Information System (INIS)

    Li, K.; Cai, R.; Dai, B.B.; Zhang, X.Q.; Wang, H.J.; Ge, S.F.; Xu, W.R.; Lu, J.

    2007-01-01

    Special AT-rich binding protein 1 (SATB1), a cell type-specific nuclear matrix attachment region (MAR) DNA-binding protein, tethers to a specific DNA sequence and regulates gene expression through chromatin remodeling and HDAC (histone deacetylase complex) recruitment. In this study, a SATB1 eukaryotic expression plasmid was transfected into the human erythroleukemia K562 cell line and individual clones that stably over-expressed the SATB1 protein were isolated. Microarray analysis revealed that hundreds of genes were either up- or down-regulated in the SATB1 over-expressing K562 cell lines. One of these was the extra-cellular matrix glycoprotein, SPARC (human secreted protein acidic and rich in cysteine). siRNA knock-down of SATB1 also reduced SPARC expression, which was consistent with elevated SPARC levels in the SATB1 over-expressing cell line. Bioinformatics software Mat-inspector showed that a 17 bp DNA sequence in the third intron of SPARC possessed a high potential for SATB1 binding; a finding confirmed by Chromatin immunoprecipitation (ChIP) with anti-SATB1 antibody. Our results show for the first time that forced-expression of SATB1 in K562 cells triggers SPARC up-regulation by binding to a 17 bp DNA sequence in the third intron

  15. Increased frequency of single base substitutions in a population of transcripts expressed in cancer cells

    Directory of Open Access Journals (Sweden)

    Bianchetti Laurent

    2012-11-01

    Full Text Available Abstract Background Single Base Substitutions (SBS that alter transcripts expressed in cancer originate from somatic mutations. However, recent studies report SBS in transcripts that are not supported by the genomic DNA of tumor cells. Methods We used sequence based whole genome expression profiling, namely Long-SAGE (L-SAGE and Tag-seq (a combination of L-SAGE and deep sequencing, and computational methods to identify transcripts with greater SBS frequencies in cancer. Millions of tags produced by 40 healthy and 47 cancer L-SAGE experiments were compared to 1,959 Reference Tags (RT, i.e. tags matching the human genome exactly once. Similarly, tens of millions of tags produced by 7 healthy and 8 cancer Tag-seq experiments were compared to 8,572 RT. For each transcript, SBS frequencies in healthy and cancer cells were statistically tested for equality. Results In the L-SAGE and Tag-seq experiments, 372 and 4,289 transcripts respectively, showed greater SBS frequencies in cancer. Increased SBS frequencies could not be attributed to known Single Nucleotide Polymorphisms (SNP, catalogued somatic mutations or RNA-editing enzymes. Hypothesizing that Single Tags (ST, i.e. tags sequenced only once, were indicators of SBS, we observed that ST proportions were heterogeneously distributed across Embryonic Stem Cells (ESC, healthy differentiated and cancer cells. ESC had the lowest ST proportions, whereas cancer cells had the greatest. Finally, in a series of experiments carried out on a single patient at 1 healthy and 3 consecutive tumor stages, we could show that SBS frequencies increased during cancer progression. Conclusion If the mechanisms generating the base substitutions could be known, increased SBS frequency in transcripts would be a new useful biomarker of cancer. With the reduction of sequencing cost, sequence based whole genome expression profiling could be used to characterize increased SBS frequency in patient’s tumor and aid diagnostic.

  16. Increased frequency of single base substitutions in a population of transcripts expressed in cancer cells

    International Nuclear Information System (INIS)

    Bianchetti, Laurent; Kieffer, David; Féderkeil, Rémi; Poch, Olivier

    2012-01-01

    Single Base Substitutions (SBS) that alter transcripts expressed in cancer originate from somatic mutations. However, recent studies report SBS in transcripts that are not supported by the genomic DNA of tumor cells. We used sequence based whole genome expression profiling, namely Long-SAGE (L-SAGE) and Tag-seq (a combination of L-SAGE and deep sequencing), and computational methods to identify transcripts with greater SBS frequencies in cancer. Millions of tags produced by 40 healthy and 47 cancer L-SAGE experiments were compared to 1,959 Reference Tags (RT), i.e. tags matching the human genome exactly once. Similarly, tens of millions of tags produced by 7 healthy and 8 cancer Tag-seq experiments were compared to 8,572 RT. For each transcript, SBS frequencies in healthy and cancer cells were statistically tested for equality. In the L-SAGE and Tag-seq experiments, 372 and 4,289 transcripts respectively, showed greater SBS frequencies in cancer. Increased SBS frequencies could not be attributed to known Single Nucleotide Polymorphisms (SNP), catalogued somatic mutations or RNA-editing enzymes. Hypothesizing that Single Tags (ST), i.e. tags sequenced only once, were indicators of SBS, we observed that ST proportions were heterogeneously distributed across Embryonic Stem Cells (ESC), healthy differentiated and cancer cells. ESC had the lowest ST proportions, whereas cancer cells had the greatest. Finally, in a series of experiments carried out on a single patient at 1 healthy and 3 consecutive tumor stages, we could show that SBS frequencies increased during cancer progression. If the mechanisms generating the base substitutions could be known, increased SBS frequency in transcripts would be a new useful biomarker of cancer. With the reduction of sequencing cost, sequence based whole genome expression profiling could be used to characterize increased SBS frequency in patient’s tumor and aid diagnostic

  17. Personalization of tagging systems

    NARCIS (Netherlands)

    J. Wang (Jun); M. Clements (Maarten); J. Yang; A.P. de Vries (Arjen); M.J.T. Reinders

    2010-01-01

    htmlabstractSocial media systems have encouraged end user participation in the Internet, for the purpose of storing and distributing Internet content, sharing opinions and maintaining relationships. Collaborative tagging allows users to annotate the resulting user-generated content, and enables

  18. TagPad

    DEFF Research Database (Denmark)

    Bornoe, Nis; Barkhuus, Louise

    2013-01-01

    The area of cyberinfrastructures has looked extensively at research within the natural sciences, however, the social sciences have been largely overlooked in terms of novel data collection and analysis systems. We developed a probe tool, TagPad, to look at the process for social science data...

  19. Gene expression promoted by the SV40 DNA targeting sequence and the hypoxia-responsive element under normoxia and hypoxia

    Directory of Open Access Journals (Sweden)

    C.B. Sacramento

    2010-08-01

    Full Text Available The main objective of the present study was to find suitable DNA-targeting sequences (DTS for the construction of plasmid vectors to be used to treat ischemic diseases. The well-known Simian virus 40 nuclear DTS (SV40-DTS and hypoxia-responsive element (HRE sequences were used to construct plasmid vectors to express the human vascular endothelial growth factor gene (hVEGF. The rate of plasmid nuclear transport and consequent gene expression under normoxia (20% O2 and hypoxia (less than 5% O2 were determined. Plasmids containing the SV40-DTS or HRE sequences were constructed and used to transfect the A293T cell line (a human embryonic kidney cell line in vitro and mouse skeletal muscle cells in vivo. Plasmid transport to the nucleus was monitored by real-time PCR, and the expression level of the hVEGF gene was measured by ELISA. The in vitro nuclear transport efficiency of the SV40-DTS plasmid was about 50% lower under hypoxia, while the HRE plasmid was about 50% higher under hypoxia. Quantitation of reporter gene expression in vitro and in vivo, under hypoxia and normoxia, confirmed that the SV40-DTS plasmid functioned better under normoxia, while the HRE plasmid was superior under hypoxia. These results indicate that the efficiency of gene expression by plasmids containing DNA binding sequences is affected by the concentration of oxygen in the medium.

  20. Ontologies and tag-statistics

    Science.gov (United States)

    Tibély, Gergely; Pollner, Péter; Vicsek, Tamás; Palla, Gergely

    2012-05-01

    Due to the increasing popularity of collaborative tagging systems, the research on tagged networks, hypergraphs, ontologies, folksonomies and other related concepts is becoming an important interdisciplinary area with great potential and relevance for practical applications. In most collaborative tagging systems the tagging by the users is completely ‘flat’, while in some cases they are allowed to define a shallow hierarchy for their own tags. However, usually no overall hierarchical organization of the tags is given, and one of the interesting challenges of this area is to provide an algorithm generating the ontology of the tags from the available data. In contrast, there are also other types of tagged networks available for research, where the tags are already organized into a directed acyclic graph (DAG), encapsulating the ‘is a sub-category of’ type of hierarchy between each other. In this paper, we study how this DAG affects the statistical distribution of tags on the nodes marked by the tags in various real networks. The motivation for this research was the fact that understanding the tagging based on a known hierarchy can help in revealing the hidden hierarchy of tags in collaborative tagging systems. We analyse the relation between the tag-frequency and the position of the tag in the DAG in two large sub-networks of the English Wikipedia and a protein-protein interaction network. We also study the tag co-occurrence statistics by introducing a two-dimensional (2D) tag-distance distribution preserving both the difference in the levels and the absolute distance in the DAG for the co-occurring pairs of tags. Our most interesting finding is that the local relevance of tags in the DAG (i.e. their rank or significance as characterized by, e.g., the length of the branches starting from them) is much more important than their global distance from the root. Furthermore, we also introduce a simple tagging model based on random walks on the DAG, capable of

  1. Ontologies and tag-statistics

    International Nuclear Information System (INIS)

    Tibély, Gergely; Vicsek, Tamás; Pollner, Péter; Palla, Gergely

    2012-01-01

    Due to the increasing popularity of collaborative tagging systems, the research on tagged networks, hypergraphs, ontologies, folksonomies and other related concepts is becoming an important interdisciplinary area with great potential and relevance for practical applications. In most collaborative tagging systems the tagging by the users is completely ‘flat’, while in some cases they are allowed to define a shallow hierarchy for their own tags. However, usually no overall hierarchical organization of the tags is given, and one of the interesting challenges of this area is to provide an algorithm generating the ontology of the tags from the available data. In contrast, there are also other types of tagged networks available for research, where the tags are already organized into a directed acyclic graph (DAG), encapsulating the ‘is a sub-category of’ type of hierarchy between each other. In this paper, we study how this DAG affects the statistical distribution of tags on the nodes marked by the tags in various real networks. The motivation for this research was the fact that understanding the tagging based on a known hierarchy can help in revealing the hidden hierarchy of tags in collaborative tagging systems. We analyse the relation between the tag-frequency and the position of the tag in the DAG in two large sub-networks of the English Wikipedia and a protein-protein interaction network. We also study the tag co-occurrence statistics by introducing a two-dimensional (2D) tag-distance distribution preserving both the difference in the levels and the absolute distance in the DAG for the co-occurring pairs of tags. Our most interesting finding is that the local relevance of tags in the DAG (i.e. their rank or significance as characterized by, e.g., the length of the branches starting from them) is much more important than their global distance from the root. Furthermore, we also introduce a simple tagging model based on random walks on the DAG, capable of

  2. Cloning, sequence analysis, expression of Cyathus bulleri laccase in Pichia pastoris and characterization of recombinant laccase

    Directory of Open Access Journals (Sweden)

    Garg Neha

    2012-10-01

    Full Text Available Abstract Background Laccases are blue multi-copper oxidases and catalyze the oxidation of phenolic and non-phenolic compounds. There is considerable interest in using these enzymes for dye degradation as well as for synthesis of aromatic compounds. Laccases are produced at relatively low levels and, sometimes, as isozymes in the native fungi. The investigation of properties of individual enzymes therefore becomes difficult. The goal of this study was to over-produce a previously reported laccase from Cyathus bulleri using the well-established expression system of Pichia pastoris and examine and compare the properties of the recombinant enzyme with that of the native laccase. Results In this study, complete cDNA encoding laccase (Lac from white rot fungus Cyathus bulleri was amplified by RACE-PCR, cloned and expressed in the culture supernatant of Pichia pastoris under the control of the alcohol oxidase (AOX1 promoter. The coding region consisted of 1,542 bp and encodes a protein of 513 amino acids with a signal peptide of 16 amino acids. The deduced amino acid sequence of the matured protein displayed high homology with laccases from Trametes versicolor and Coprinus cinereus. The sequence analysis indicated the presence of Glu 460 and Ser 113 and LEL tripeptide at the position known to influence redox potential of laccases placing this enzyme as a high redox enzyme. Addition of copper sulfate to the production medium enhanced the level of laccase by about 12-fold to a final activity of 7200 U L-1. The recombinant laccase (rLac was purified by ~4-fold to a specific activity of ~85 U mg-1 protein. A detailed study of thermostability, chloride and solvent tolerance of the rLac indicated improvement in the first two properties when compared to the native laccase (nLac. Altered glycosylation pattern, identified by peptide mass finger printing, was proposed to contribute to altered properties of the rLac. Conclusion Laccase of C. bulleri was

  3. Cloning, sequence analysis, expression of Cyathus bulleri laccase in Pichia pastoris and characterization of recombinant laccase.

    Science.gov (United States)

    Garg, Neha; Bieler, Nora; Kenzom, Tenzin; Chhabra, Meenu; Ansorge-Schumacher, Marion; Mishra, Saroj

    2012-10-23

    Laccases are blue multi-copper oxidases and catalyze the oxidation of phenolic and non-phenolic compounds. There is considerable interest in using these enzymes for dye degradation as well as for synthesis of aromatic compounds. Laccases are produced at relatively low levels and, sometimes, as isozymes in the native fungi. The investigation of properties of individual enzymes therefore becomes difficult. The goal of this study was to over-produce a previously reported laccase from Cyathus bulleri using the well-established expression system of Pichia pastoris and examine and compare the properties of the recombinant enzyme with that of the native laccase. In this study, complete cDNA encoding laccase (Lac) from white rot fungus Cyathus bulleri was amplified by RACE-PCR, cloned and expressed in the culture supernatant of Pichia pastoris under the control of the alcohol oxidase (AOX)1 promoter. The coding region consisted of 1,542 bp and encodes a protein of 513 amino acids with a signal peptide of 16 amino acids. The deduced amino acid sequence of the matured protein displayed high homology with laccases from Trametes versicolor and Coprinus cinereus. The sequence analysis indicated the presence of Glu 460 and Ser 113 and LEL tripeptide at the position known to influence redox potential of laccases placing this enzyme as a high redox enzyme. Addition of copper sulfate to the production medium enhanced the level of laccase by about 12-fold to a final activity of 7200 U L-1. The recombinant laccase (rLac) was purified by ~4-fold to a specific activity of ~85 U mg(-1) protein. A detailed study of thermostability, chloride and solvent tolerance of the rLac indicated improvement in the first two properties when compared to the native laccase (nLac). Altered glycosylation pattern, identified by peptide mass finger printing, was proposed to contribute to altered properties of the rLac. Laccase of C. bulleri was successfully produced extra-cellularly to a high level of 7200

  4. Transcriptome Sequencing, De Novo Assembly and Differential Gene Expression Analysis of the Early Development of Acipenser baeri.

    Directory of Open Access Journals (Sweden)

    Wei Song

    Full Text Available The molecular mechanisms that drive the development of the endangered fossil fish species Acipenser baeri are difficult to study due to the lack of genomic data. Recent advances in sequencing technologies and the reducing cost of sequencing offer exclusive opportunities for exploring important molecular mechanisms underlying specific biological processes. This manuscript describes the large scale sequencing and analyses of mRNA from Acipenser baeri collected at five development time points using the Illumina Hiseq2000 platform. The sequencing reads were de novo assembled and clustered into 278167 unigenes, of which 57346 (20.62% had 45837 known homologues proteins in Uniprot protein databases while 11509 proteins matched with at least one sequence of assembled unigenes. The remaining 79.38% of unigenes could stand for non-coding unigenes or unigenes specific to A. baeri. A number of 43062 unigenes were annotated into functional categories via Gene Ontology (GO annotation whereas 29526 unigenes were associated with 329 pathways by mapping to KEGG database. Subsequently, 3479 differentially expressed genes were scanned within developmental stages and clustered into 50 gene expression profiles. Genes preferentially expressed at each stage were also identified. Through GO and KEGG pathway enrichment analysis, relevant physiological variations during the early development of A. baeri could be better cognized. Accordingly, the present study gives insights into the transcriptome profile of the early development of A. baeri, and the information contained in this large scale transcriptome will provide substantial references for A. baeri developmental biology and promote its aquaculture research.

  5. Sequences of emotional distress expressed by clients and acknowledged by therapists: are they associated more with some therapists than others?

    Science.gov (United States)

    Viney, L L

    1994-11-01

    When clients come to psychotherapy they are distressed, this distress usually being expressed in the form of anxiety, hostility, depression and helplessness. This study explored the sequences of emotional distress expressed by clients and acknowledged by therapists, and examined their associations with other factors. The transcripts of five therapists (two single sessions each) were content-analysed: they used personal construct, client centered, rational-emotive, Gestalt and transactional analysis therapy. Log-linear analyses of appropriate contingency table cell frequencies were conducted to test associations between identified sequences and the two variables of therapist and timing of completion of the sequence. Therapist-client sequences of Anxiety-Anxiety, Anxiety-Hostility and Helplessness-Hostility were found to be associated more with the personal construct and client centred therapists than with the rational-emotive therapist. Client-therapist sequences of Anxiety-Anxiety, Helplessness-Anxiety and Helplessness-Helplessness were more often found with the client centred therapist than the other therapists. For most of these sequences timing had an effect, yet timing rarely interacted with the therapist variable. The findings are discussed in terms of their relevance to the theoretical positions represented, the shortcomings of the research and the value of this methodology in studies linking therapy process with outcome.

  6. Natural LacI from E. coli Yields Faster Response and Higher Level of Expression than the LVA-Tagged LacI

    DEFF Research Database (Denmark)

    Andreassen, Patrick Rosendahl; Fredberg, Sofie; Horan, Mattias

    2014-01-01

    The lac promoter is one of the most commonly used promoters for expression control of recombinant genes in E. coli. In the absence of galactosides, the lac promoter is repressed by its repressor protein LacI. Since the lac promoter is regulated by a repressor, overexpression of LacI is necessary...

  7. Establishment of a root proteome reference map for the model legume Medicago truncatula using the expressed sequence tag database for peptide mass fingerprinting

    DEFF Research Database (Denmark)

    Mathesius, U; Keijzers, Guido; Natera, S H

    2001-01-01

    legume for the study of nodulation-related genes and proteins. Over 2,500 root proteins could be displayed reproducibly across an isoelectric focussing range of 4-7. We analysed 485 proteins by peptide mass fingerprinting, and 179 of those were identified by matching against the current M. truncatula....... This proteome map will be updated continuously (http://semele.anu.edu.au/2d/2d.html) and will be a powerful tool for investigating the molecular mechanisms of root symbioses in legumes....

  8. Association analysis of bacterial leaf spot resistance and SNP markers derived from expressed sequence tags (ESTs) in lettuce (Lactuca sativa L.)

    Science.gov (United States)

    Bacterial leaf spot of lettuce, caused by Xanthomonas campestris pv. vitians, is a devastating disease of lettuce worldwide. Since there are no chemicals available for effective control of the disease, host-plant resistance is highly desirable to protect lettuce production. A total of 179 lettuce ge...

  9. Tub-Tag Labeling; Chemoenzymatic Incorporation of Unnatural Amino Acids.

    Science.gov (United States)

    Helma, Jonas; Leonhardt, Heinrich; Hackenberger, Christian P R; Schumacher, Dominik

    2018-01-01

    Tub-tag labeling is a chemoenzymatic method that enables the site-specific labeling of proteins. Here, the natural enzyme tubulin tyrosine ligase incorporates noncanonical tyrosine derivatives to the terminal carboxylic acid of proteins containing a 14-amino acid recognition sequence called Tub-tag. The tyrosine derivative carries a unique chemical reporter allowing for a subsequent bioorthogonal modification of proteins with a great variety of probes. Here, we describe the Tub-tag protein modification protocol in detail and explain its utilization to generate labeled proteins for advanced applications in cell biology, imaging, and diagnostics.

  10. The role of tag suggestions in folksonomies

    NARCIS (Netherlands)

    Bollen, D.G.F.M.; Halpin, H.

    2009-01-01

    Most tagging systems support the user in the tag selection process by providing tag suggestions, or recommendations, based on a popularity measurement of tags other users provided when tagging the same resource. The majority of theories and mathematical models of tagging found in the literature

  11. Tag-elese or The Language of Tags

    Directory of Open Access Journals (Sweden)

    Jan Simons

    2008-01-01

    Full Text Available The core "meme" of Web 2.0 from which almost all other memes radiated was: 'You control your own data' (O'Reilly, 2005, 3. Key instruments for this user control are tagging systems that allow users to freely assign keywords of their own choosing to Internet resources of their own making as well as to documents produced by others. Of course, freely chosen keywords tags do not necessarily follow prefixed taxonomies or classification systems. But going by the maxim that interaction creates similarity and similarity creates interaction, the idea - or hope - is, however, that the tagging practices of individual users will eventually converge into an emergent common vocabulary or folksonomy (Merholz, 2004; Shirky, 2005; Vander Wal, 2005b; Mika, 2007. It is far from clear, however, that free tagging systems will eventually yield controlled vocabularies, and there are many incentives for idiosyncratic, ambiguous, and inconsistent uses of tags. Left to themselves, free tagging systems seem to be too wild and too chaotic for any order to emerge. But are these free tagging systems really as "feral" as they seem to be, or do they only look uncontrolled because one has been looking for order in the wrong place? I have done a quick-and-dirty" analysis of Flickr's tag cloud. The concept was: if folksonomies encourage users to tap on their own vernacular, everyday natural language must somehow "guide" the tagging practices of users of tagging systems. Flickr's tag cloud has been choosen because it may teach us something about tagging systems and folksonomies, and not - or not primarily - because of what tags may tell us about pictures.

  12. Beef quality with different intramuscular fat content and proteomic analysis using isobaric tag for relative and absolute quantitation of differentially expressed proteins.

    Science.gov (United States)

    Mao, Yanwei; Hopkins, David L; Zhang, Yimin; Li, Peng; Zhu, Lixian; Dong, Pengcheng; Liang, Rongrong; Dai, Jin; Wang, Xiaoyun; Luo, Xin

    2016-08-01

    Intramuscular fat (IMF) is an important trait for beef eating quality. The mechanism of how IMF is deposited in beef cattle muscle is not clear at the molecular level. The muscle (M. longissimus lumborum: LL) of a group of Xiangxi yellow×Angus cattle with high fat levels (HF), was compared to the muscle of a low fat group (LF). The meat quality and the expressed protein patterns were compared. It was shown that LL from the HF animals had a greater fat content (P<0.05) and lower moisture content (P<0.05) than LL from LF animals. Forty seven sarcoplasmic proteins were differentially expressed and identified between the two groups. These proteins are involved in 6 molecular functions and 16 biological processes, and affect the Mitogen-activated protein kinases pathway, insulin pathway and c-Jun N-terminal kinases leading to greater IMF deposition. Cattle in the HF group had greater oxidative capacity and lower glycolytic levels suggesting a greater energetic efficiency. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Digital gene expression profiling by 5'-end sequencing of cDNAs during reprogramming in the moss Physcomitrella patens.

    Directory of Open Access Journals (Sweden)

    Tomoaki Nishiyama

    Full Text Available Stem cells self-renew and repeatedly produce differentiated cells during development and growth. The differentiated cells can be converted into stem cells in some metazoans and land plants with appropriate treatments. After leaves of the moss Physcomitrella patens are excised, leaf cells reenter the cell cycle and commence tip growth, which is characteristic of stem cells called chloronema apical cells. To understand the underlying molecular mechanisms, a digital gene expression profiling method using mRNA 5'-end tags (5'-DGE was established. The 5'-DGE method produced reproducible data with a dynamic range of four orders that correlated well with qRT-PCR measurements. After the excision of leaves, the expression levels of 11% of the transcripts changed significantly within 6 h. Genes involved in stress responses and proteolysis were induced and those involved in metabolism, including photosynthesis, were reduced. The later processes of reprogramming involved photosynthesis recovery and higher macromolecule biosynthesis, including of RNA and proteins. Auxin and cytokinin signaling pathways, which are activated during stem cell formation via callus in flowering plants, are also activated during reprogramming in P. patens, although no exogenous phytohormone is applied in the moss system, suggesting that an intrinsic phytohormone regulatory system may be used in the moss.

  14. Aberrant splicing in transgenes containing introns, exons, and V5 epitopes: lessons from developing an FSHD mouse model expressing a D4Z4 repeat with flanking genomic sequences.

    Directory of Open Access Journals (Sweden)

    Eugénie Ansseau

    Full Text Available The DUX4 gene, encoded within D4Z4 repeats on human chromosome 4q35, has recently emerged as a key factor in the pathogenic mechanisms underlying Facioscapulohumeral muscular dystrophy (FSHD. This recognition prompted development of animal models expressing the DUX4 open reading frame (ORF alone or embedded within D4Z4 repeats. In the first published model, we used adeno-associated viral vectors (AAV and strong viral control elements (CMV promoter, SV40 poly A to demonstrate that the DUX4 cDNA caused dose-dependent toxicity in mouse muscles. As a follow-up, we designed a second generation of DUX4-expressing AAV vectors to more faithfully genocopy the FSHD-permissive D4Z4 repeat region located at 4q35. This new vector (called AAV.D4Z4.V5.pLAM contained the D4Z4/DUX4 promoter region, a V5 epitope-tagged DUX4 ORF, and the natural 3' untranslated region (pLAM harboring two small introns, DUX4 exons 2 and 3, and the non-canonical poly A signal required for stabilizing DUX4 mRNA in FSHD. AAV.D4Z4.V5.pLAM failed to recapitulate the robust pathology of our first generation vectors following delivery to mouse muscle. We found that the DUX4.V5 junction sequence created an unexpected splice donor in the pre-mRNA that was preferentially utilized to remove the V5 coding sequence and DUX4 stop codon, yielding non-functional DUX4 protein with 55 additional residues on its carboxyl-terminus. Importantly, we further found that aberrant splicing could occur in any expression construct containing a functional splice acceptor and sequences resembling minimal splice donors. Our findings represent an interesting case study with respect to AAV.D4Z4.V5.pLAM, but more broadly serve as a note of caution for designing constructs containing V5 epitope tags and/or transgenes with downstream introns and exons.

  15. The use of external electronic tags on fish: an evaluation of tag retention and tagging effects

    DEFF Research Database (Denmark)

    Jepsen, Niels; Thorstad, Eva B.; Havn, Torgeir

    2015-01-01

    External tagging of fish with electronic tags has been used for decades for a wide range of marine and freshwater species. In the early years of fish telemetry research, it was the most commonly used attachment method, but later internal implants became preferred. Recently, the number of telemetry...... unsuitable for surgical implantation, or when using tags with sensors recording the external environment. The most commonly reported problems with external tags are tissue damage, premature tag loss, and decreased swimming capacity, but the effects are highly context dependent and species specific. Reduced......, but particularly there are few studies on predation risk, social interactions, and studies distinguishing capture and handling effects from tagging effects. For PSATs, especially those that are large relative to fish size, there are particular problems with a high proportion of premature tag losses, reduced...

  16. Sequence analysis and over-expression of ribosomal protein S28 ...

    African Journals Online (AJOL)

    RPS28 is a component of the 40S small ribosomal subunit encoded by RPS28 gene, which is specific to eukaryotes. The cDNA and the genomic sequence of RPS28 were cloned successfully from the Giant Panda using RT-PCR technology and Touchdown-PCR, respectively. Both sequences were analyzed preliminarily ...

  17. Cloning, sequencing and expression of a novel xylanase cDNA from ...

    African Journals Online (AJOL)

    A strain SH 2016, capable of producing xylanase, was isolated and identified as Aspergillus awamori, based on its physiological and biochemical characteristics as well as its ITS rDNA gene sequence analysis. A xylanase gene of 591 bp was cloned from this newly isolated A. awamori and the ORF sequence predicted a ...

  18. Cloning, sequencing and expression of a xylanase gene from the maize pathogen Helminthosporium turcicum

    DEFF Research Database (Denmark)

    Degefu, Y.; Paulin, L.; Lübeck, Peter Stephensen

    2001-01-01

    A gene encoding an endoxylanase from the phytopathogenic fungus Helminthosporium turcicum Pass. was cloned and sequenced. The entire nucleotide sequence of a 1991 bp genomic fragment containing an endoxylanase gene was determined. The xylanase gene of 795 bp, interrupted by two introns of 52 and ...

  19. Study of mast cell count in skin tags

    Directory of Open Access Journals (Sweden)

    Zaher Hesham

    2007-01-01

    Full Text Available Background: Skin tags or acrochordons are common tumors of middle-aged and elderly subjects. They consist of loose fibrous tissue and occur mainly on the neck and major flexures as small, soft, pedunculated protrusions. Objectives: The aim was to compare the mast cells count in skin tags to adjacent normal skin in diabetic and nondiabetic participants in an attempt to elucidate the possible role of mast cells in the pathogenesis of skin tags. Participants and Methods: Thirty participants with skin tags were divided into group I (15 nondiabetic participants and group II (15 diabetic participants. Three biopsies were obtained from each participant: a large skin tag, a small skin tag and adjacent normal skin. Mast cell count from all the obtained sections was carried out, and the mast cell density was expressed as the average mast cell count/high power field (HPF. Results: A statistically significant increase in mast cells count in skin tags in comparison to normal skin was detected in group I and group II. There was no statistically significant difference between mast cell counts in skin tags of both the groups. Conclusion: Both the mast cell mediators and hyperinsulinemia are capable of inducing fibroblast proliferation and epidermal hyperplasia that are the main pathologic abnormalities seen in all types of skin tags. However, the presence of mast cells in all examined skin tags regardless of diabetes and obesity may point to the possible crucial role of mast cells in the etiogenesis of skin tags through its interaction with fibroblasts and keratinocytes.

  20. Social Tagging of Mission Data

    Science.gov (United States)

    Norris, Jeffrey S.; Wallick, Michael N.; Joswig, Joseph C.; Powell, Mark W.; Torres, Recaredo J.; Mittman, David S.; Abramyan, Lucy; Crockett, Thomas M.; Shams, Khawaja S.; Fox, Jason M.; hide

    2010-01-01

    Mars missions will generate a large amount of data in various forms, such as daily plans, images, and scientific information. Often, there is a semantic linkage between images that cannot be captured automatically. Software is needed that will provide a method for creating arbitrary tags for this mission data so that items with a similar tag can be related to each other. The tags should be visible and searchable for all users. A new routine was written to offer a new and more flexible search option over previous applications. This software allows users of the MSLICE program to apply any number of arbitrary tags to a piece of mission data through a MSLICE search interface. The application of tags creates relationships between data that did not previously exist. These tags can be easily removed and changed, and contain enough flexibility to be specifically configured for any mission. This gives users the ability to quickly recall or draw attention to particular pieces of mission data, for example: Give a semantic and meaningful description to mission data; for example, tag all images with a rock in them with the tag "rock." Rapidly recall specific and useful pieces of data; for example, tag a plan as"driving template." Call specific data to a user s attention; for example, tag a plan as "for:User." This software is part of the MSLICE release, which was written in Java. It will run on any current Windows, Macintosh, or Linux system.

  1. Human testis-expressed sequence 101 is limitedly distributed in germinal epithelium of testis and disappears in seminoma

    Directory of Open Access Journals (Sweden)

    Cong-Cong Shen

    2014-01-01

    Full Text Available BACKGROUND: Testis-expressed sequence 101 (TEX101 was found to be highly expressed in testis and involved in acrosome reaction in previous studies. Recently, the metastasis suppressor function of TEX101 in cancer was disclosed, but the comprehensive investigation of its expression has rarely been reported. In this study, the expression features of TEX101 in normal human organs and seminoma were systematically analyzed. RESULTS: Immunohistochemistry demonstrated intense staining of TEX101 in human testis tissues; however, its expression in 27 other types of normal human organs, including the ovary, was negligible. Higher expression of TEX101 was observed in the spermatocytes and spermatids of the testis, but relatively lower staining was detected in spermatogonia. Western blotting showed a single TEX101 band of 38 kDa in human testis, but it did not correspond to the predicted molecular weight of its mature form at 21 KDa. Furthermore, we examined seminoma tissues by immunohistochemistry and found that none of the 36 samples expressed TEX101. CONCLUSIONS: Our data confirmed TEX101 to be a testis protein that could be related to the maturation process of male germ cells. The lack of TEX101 in seminoma indicated its potential role in tumor progression. This characteristic expression of TEX101 could provide a valuable reference for understanding its biological functions.

  2. Facial Expression Recognition from Video Sequences Based on Spatial-Temporal Motion Local Binary Pattern and Gabor Multiorientation Fusion Histogram

    Directory of Open Access Journals (Sweden)

    Lei Zhao

    2017-01-01

    Full Text Available This paper proposes novel framework for facial expressions analysis using dynamic and static information in video sequences. First, based on incremental formulation, discriminative deformable face alignment method is adapted to locate facial points to correct in-plane head rotation and break up facial region from background. Then, spatial-temporal motion local binary pattern (LBP feature is extracted and integrated with Gabor multiorientation fusion histogram to give descriptors, which reflect static and dynamic texture information of facial expressions. Finally, a one-versus-one strategy based multiclass support vector machine (SVM classifier is applied to classify facial expressions. Experiments on Cohn-Kanade (CK + facial expression dataset illustrate that integrated framework outperforms methods using single descriptors. Compared with other state-of-the-art methods on CK+, MMI, and Oulu-CASIA VIS datasets, our proposed framework performs better.

  3. Mapping the transcription start points of the Staphylococcus aureus eap, emp, and vwb promoters reveals a conserved octanucleotide sequence that is essential for expression of these genes.

    Science.gov (United States)

    Harraghy, Niamh; Homerova, Dagmar; Herrmann, Mathias; Kormanec, Jan

    2008-01-01

    Mapping the transcription start points of the eap, emp, and vwb promoters revealed a conserved octanucleotide sequence (COS). Deleting this sequence abolished the expression of eap, emp, and vwb. However, electrophoretic mobility shift assays gave no evidence that this sequence was a binding site for SarA or SaeR, known regulators of eap and emp.

  4. Quantitative protein expression analysis of CLL B cells from mutated and unmutated IgV(H) subgroups using acid-cleavable isotope-coded affinity tag reagents.

    Science.gov (United States)

    Barnidge, David R; Jelinek, Diane F; Muddiman, David C; Kay, Neil E

    2005-01-01

    Relative protein expression levels were compared in leukemic B cells from two patients with chronic lymphocytic leukemia (CLL) having either mutated (M-CLL) or unmutated (UM-CLL) immunoglobulin variable heavy chain genes (IgV(H)). Cells were separated into cytosol and membrane protein fractions then labeled with acid-cleavable ICAT reagents (cICAT). Labeled proteins were digested with trypsin then subjected to SCX and affinity chromatography followed by LC-ESI-MS/MS analysis on a linear ion trap mass spectrometer. A total of 9 proteins from the cytosol fraction and 4 from the membrane fraction showed a 3-fold or greater difference between M-CLL and UM-CLL and a subset of these were examined by Western blot where results concurred with cICAT abundance ratios. The abundance of one of the proteins in particular, the mitochondrial membrane protein cytochrome c oxidase subunit COX G was examined in 6 M-CLL and 6 UM-CLL patients using western blot and results showed significantly greater levels (P < 0.001) in M-CLL patients vs UM-CLL patients. These results demonstrate that stable isotope labeling and mass spectrometry can complement 2D gel electrophoresis and gene microarray technologies for identifying putative and perhaps unique prognostic markers in CLL.

  5. DSAP: deep-sequencing small RNA analysis pipeline.

    Science.gov (United States)

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

  6. High Resolution X-ray Diffraction Dataset for Bacillus licheniformis Gamma Glutamyl Transpeptidase-acivicin complex: SUMO-Tag Renders High Expression and Solubility.

    Science.gov (United States)

    Kumari, Shobha; Pal, Ravi Kant; Gupta, Rani; Goel, Manisha

    2017-02-01

    Gamma glutamyl transpeptidase, (GGT) is a ubiquitous protein which plays a central role in glutathione metabolism and has myriad clinical implications. It has been shown to be a virulence factor for pathogenic bacteria, inhibition of which results in reduced colonization potential. However, existing inhibitors are effective but toxic and therefore search is on for novel inhibitors, which makes it imperative to understand the interactions of various inhibitors with the protein in substantial detail. High resolution structures of protein bound to different inhibitors can serve this purpose. Gamma glutamyl transpeptidase from Bacillus licheniformis is one of the model systems that have been used to understand the structure-function correlation of the protein. The structures of the native protein (PDB code 4OTT), of its complex with glutamate (PDB code 4OTU) and that of its precursor mimic (PDB code 4Y23) are available, although at moderate/low resolution. In the present study, we are reporting the preliminary analysis of, high resolution X-ray diffraction data collected for the co-crystals of B. licheniformis, Gamma glutamyl transpeptidase, with its inhibitor, Acivicin. Crystals belong to the orthorhombic space group P2 1 2 1 2 1 and diffract X-ray to 1.45 Å resolution. This is the highest resolution data reported for all GGT structures available till now. The use of SUMO fused expression system enhanced yield of the target protein in the soluble fraction, facilitating recovery of protein with high purity. The preliminary analysis of this data set shows clear density for the inhibitor, acivicin, in the protein active site.

  7. Tissue-specific tagging of endogenous loci in Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Kate Koles

    2016-01-01

    Full Text Available Fluorescent protein tags have revolutionized cell and developmental biology, and in combination with binary expression systems they enable diverse tissue-specific studies of protein function. However these binary expression systems often do not recapitulate endogenous protein expression levels, localization, binding partners and/or developmental windows of gene expression. To address these limitations, we have developed a method called T-STEP (tissue-specific tagging of endogenous proteins that allows endogenous loci to be tagged in a tissue specific manner. T-STEP uses a combination of efficient CRISPR/Cas9-enhanced gene targeting and tissue-specific recombinase-mediated tag swapping to temporally and spatially label endogenous proteins. We have employed this method to GFP tag OCRL (a phosphoinositide-5-phosphatase in the endocytic pathway and Vps35 (a Parkinson's disease-implicated component of the endosomal retromer complex in diverse Drosophila tissues including neurons, glia, muscles and hemocytes. Selective tagging of endogenous proteins allows, for the first time, cell type-specific live imaging and proteomics in complex tissues.

  8. Antenna for passive RFID tags

    Science.gov (United States)

    Schiopu, Paul; Manea, Adrian; Cristea, Ionica; Grosu, Neculai; Vladescu, Marian; Craciun, Anca-Ileana; Craciun, Alexandru

    2015-02-01

    Minuscule devices, called RFID tags are attached to objects and persons and emit information which positioned readers may capture wirelessly. Many methods of identification have been used, but that of most common is to use a unique serial number for identification of person or object. RFID tags can be characterized as either active or passive [1,2]. Traditional passive tags are typically in "sleep" state until awakened by the reader's emitted field. In passive tags, the reader's field acts to charge the capacitor that powers the badge and this can be a combination of antenna and barcodes obtained with SAW( Surface Acoustic Wave) devices [1,2,3] . The antenna in an RFID tag is a conductive element that permits the tag to exchange data with the reader. The paper contribution are targeted to antenna for passive RFID tags. The electromagnetic field generated by the reader is somehow oriented by the reader antenna and power is induced in the tag only if the orientation of the tag antenna is appropriate. A tag placed orthogonal to the reader yield field will not be read. This is the reason that guided manufacturers to build circular polarized antenna capable of propagating a field that is alternatively polarized on all planes passing on the diffusion axis. Passive RFID tags are operated at the UHF frequencies of 868MHz (Europe) and 915MHz (USA) and at the microwave frequencies of 2,45 GHz and 5,8 GHz . Because the tags are small dimensions, in paper, we present the possibility to use circular polarization microstrip antenna with fractal edge [2].

  9. Nuclear studies with tagged photons

    International Nuclear Information System (INIS)

    Axel, P.

    1979-01-01

    First, the photon tagging technique will be described schematically, and a brief history of photon tagging will be given, including the 20 year development of this technique at Illinois. In the second part some typical operating conditions will be indicated for our tagged photon facility. The final section of this paper will illustrate some types of experiments by showing data obtained recently. (KBE) 891 KBE/KBE 892 ARA

  10. Mouse tetranectin: cDNA sequence, tissue-specific expression, and chromosomal mapping

    DEFF Research Database (Denmark)

    Ibaraki, K; Kozak, C A; Wewer, U M

    1995-01-01

    regulation, mouse tetranectin cDNA was cloned from a 16-day-old mouse embryo library. Sequence analysis revealed a 992-bp cDNA with an open reading frame of 606 bp, which is identical in length to the human tetranectin cDNA. The deduced amino acid sequence showed high homology to the human cDNA with 76......(s) of tetranectin. The sequence analysis revealed a difference in both sequence and size of the noncoding regions between mouse and human cDNAs. Northern analysis of the various tissues from mouse, rat, and cow showed the major transcript(s) to be approximately 1 kb, which is similar in size to that observed...

  11. Recoding method that removes inhibitory sequences and improves HIV gene expression

    Energy Technology Data Exchange (ETDEWEB)

    Rabadan, Raul; Krasnitz, Michael; Robins, Harlan; Witten, Daniela; Levine, Arnold

    2016-08-23

    The invention relates to inhibitory nucleotide signal sequences or "INS" sequences in the genomes of lentiviruses. In particular the invention relates to the AGG motif present in all viral genomes. The AGG motif may have an inhibitory effect on a virus, for example by reducing the levels of, or maintaining low steady-state levels of, viral RNAs in host cells, and inducing and/or maintaining in viral latency. In one aspect, the invention provides vaccines that contain, or are produced from, viral nucleic acids in which the AGG sequences have been mutated. In another aspect, the invention provides methods and compositions for affecting the function of the AGG motif, and methods for identifying other INS sequences in viral genomes.

  12. Cloning, sequence analysis, and expression of the large subunit of the human lymphocyte activation antigen 4F2

    International Nuclear Information System (INIS)

    Lumadue, J.A.; Glick, A.B.; Ruddle, F.H.

    1987-01-01

    Among the earliest expressed antigens on the surface of activated human lymphocytes is the surface antigen 4F2. The authors have used DNA-mediated gene transfer and fluorescence-activated cell sorting to obtain cell lines that contain the gene encoding the large subunit of the human 4F2 antigen in a mouse L-cell background. Human DNAs cloned from these cell lines were subsequently used as hybridization probes to isolate a full-length cDNA clone expressing 4F2. Sequence analysis of the coding region has revealed an amino acid sequence of 529 residues. Hydrophobicity plotting has predicted a probable structure for the protein that includes an external carboxyl terminus, an internal leader sequence, a single hydrophobic transmembrane domain, and two possible membrane-associated domains. The 4F2 cDNA detects a single 1.8-kilobase mRNA in T-cell and B-cell lines. RNA gel blot analysis of RNA derived from quiescent and serum-stimulated Swiss 3T3 fibroblasts reveals a cell-cycle modulation of 4F2 gene expression: the mRNA is present in quiescent fibroblasts but increases 8-fold 24-36 hr after stimulation, at the time of maximal DNA synthesis

  13. Cloning, sequence analysis, and expression of the large subunit of the human lymphocyte activation antigen 4F2

    Energy Technology Data Exchange (ETDEWEB)

    Lumadue, J.A.; Glick, A.B.; Ruddle, F.H.

    1987-12-01

    Among the earliest expressed antigens on the surface of activated human lymphocytes is the surface antigen 4F2. The authors have used DNA-mediated gene transfer and fluorescence-activated cell sorting to obtain cell lines that contain the gene encoding the large subunit of the human 4F2 antigen in a mouse L-cell background. Human DNAs cloned from these cell lines were subsequently used as hybridization probes to isolate a full-length cDNA clone expressing 4F2. Sequence analysis of the coding region has revealed an amino acid sequence of 529 residues. Hydrophobicity plotting has predicted a probable structure for the protein that includes an external carboxyl terminus, an internal leader sequence, a single hydrophobic transmembrane domain, and two possible membrane-associated domains. The 4F2 cDNA detects a single 1.8-kilobase mRNA in T-cell and B-cell lines. RNA gel blot analysis of RNA derived from quiescent and serum-stimulated Swiss 3T3 fibroblasts reveals a cell-cycle modulation of 4F2 gene expression: the mRNA is present in quiescent fibroblasts but increases 8-fold 24-36 hr after stimulation, at the time of maximal DNA synthesis.

  14. Using Next-Generation Sequencing to Detect Differential Expression Genes in Bradysia odoriphaga after Exposure to Insecticides

    Directory of Open Access Journals (Sweden)

    Haoliang Chen

    2017-11-01

    Full Text Available Bradysia odoriphaga (Diptera: Sciaridae is the most important pest of Chinese chive. Insecticides are used widely and frequently to control B. odoriphaga in China. However, the performance of the insecticides chlorpyrifos and clothianidin in controlling the Chinese chive maggot is quite different. Using next generation sequencing technology, different expression unigenes (DEUs in B. odoriphaga were detected after treatment with chlorpyrifos and clothianidin for 6 and 48 h in comparison with control. The number of DEUs ranged between 703 and 1161 after insecticide treatment. In these DEUs, 370–863 unigenes can be classified into 41–46 categories of gene ontology (GO, and 354–658 DEUs can be mapped into 987–1623 Kyoto Encyclopedia of Genes and Genomes (KEGG pathways. The expressions of DEUs related to insecticide-metabolism-related genes were analyzed. The cytochrome P450-like unigene group was the largest group in DEUs. Most glutathione S-transferase-like unigenes were down-regulated and most sodium channel-like unigenes were up-regulated after insecticide treatment. Finally, 14 insecticide-metabolism-related unigenes were chosen to confirm the relative expression in each treatment by quantitative Real Time Polymerase Chain Reaction (qRT-PCR. The results of qRT-PCR and RNA Sequencing (RNA-Seq are fairly well-established. Our results demonstrate that a next-generation sequencing tool facilitates the identification of insecticide-metabolism-related genes and the illustration of the insecticide mechanisms of chlorpyrifos and clothianidin.

  15. Buddy Tag CONOPS and Requirements.

    Energy Technology Data Exchange (ETDEWEB)

    Brotz, Jay Kristoffer [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Deland, Sharon M. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

    2015-12-01

    This document defines the concept of operations (CONOPS) and the requirements for the Buddy Tag, which is conceived and designed in collaboration between Sandia National Laboratories and Princeton University under the Department of State Key VerificationAssets Fund. The CONOPS describe how the tags are used to support verification of treaty limitations and is only defined to the extent necessary to support a tag design. The requirements define the necessary functions and desired non-functional features of the Buddy Tag at a high level

  16. Flavour tagging performance in LHCb

    International Nuclear Information System (INIS)

    Grabalosa Gandara, Marc

    2009-01-01

    To do precise CP violation measurements, the best possible determination of the flavour of the B-meson is necessary. This report summarizes the flavour tagging performances for the LHCb experiment. The flavour tagging is obtained through a combination of several methods, based on different signatures. The use of control channels, which are decays to flavour-specific final states, will allow to determine the wrong tag fraction ω (the probability of a tag to be wrong), which can be used as an input for the determination of CKM unitarity triangle angles.

  17. High-resolution analysis of the 5'-end transcriptome using a next generation DNA sequencer.

    Directory of Open Access Journals (Sweden)

    Shin-ichi Hashimoto

    Full Text Available Massively parallel, tag-based sequencing systems, such as the SOLiD system, hold the promise of revolutionizing the study of whole genome gene expression due to the number of data points that can be generated in a simple and cost-effective manner. We describe the development of a 5'-end transcriptome workflow for the SOLiD system and demonstrate the advantages in sensitivity and dynamic range offered by this tag-based application over traditional approaches for the study of whole genome gene expression. 5'-end transcriptome analysis was used to study whole genome gene expression within a colon cancer cell line, HT-29, treated with the DNA methyltransferase inhibitor, 5-aza-2'-deoxycytidine (5Aza. More than 20 million 25-base 5'-end tags were obtained from untreated and 5Aza-treated cells and matched to sequences within the human genome. Seventy three percent of the mapped unique tags were associated with RefSeq cDNA sequences, corresponding to approximately 14,000 different protein-coding genes in this single cell type. The level of expression of these genes ranged from 0.02 to 4,704 transcripts per cell. The sensitivity of a single sequence run of the SOLiD platform was 100-1,000 fold greater than that observed from 5'end SAGE data generated from the analysis of 70,000 tags obtained by Sanger sequencing. The high-resolution 5'end gene expression profiling presented in this study will not only provide novel insight into the transcriptional machinery but should also serve as a basis for a better understanding of cell biology.

  18. The Sequence Characteristics and Expression Models Reveal Superoxide Dismutase Involved in Cold Response and Fruiting Body Development in Volvariella volvacea

    Directory of Open Access Journals (Sweden)

    Jun-Jie Yan

    2016-01-01

    Full Text Available As the first defence for cells to counteract the toxicity of active oxygen, superoxide dismutase (SOD plays an important role in the response of living organisms to stress and cell differentiation. One extracellular Cu-ZnSOD (ecCu-ZnSOD, and two MnSODs, were identified based on the Volvariella volvacea genome sequence. All three genes have complicated alternative splicing modes during transcription; only when the fourth intron is retained can the Vv_Cu-Znsod1 gene be translated into a protein sequence with SOD functional domains. The expression levels of the three sod genes in the pilei are higher than in the stipe. The Vv_Cu-Znsod1 and the Vv_Mnsod2 are co-expressed in different developmental stages of the fruiting body, with the highest level of expression in the pilei of the egg stage, and they show a significant, positive correlation with the efficiency of karyogamy, indicating the potential role of these two genes during karyogamy. The expression of the ecCu-Znsod and two Vv_Mnsod genes showed a significant up-regulated when treated by cold stress for one hour; however, the lack of the intracellular Cu-ZnSOD encoding gene (icCu-Znsod and the special locus of the ecCu-Znsod gene initiation codon suggested a possible reason for the autolysis phenomenon of V. volvacea in cold conditions.

  19. SigEMD: A powerful method for differential gene expression analysis in single-cell RNA sequencing data.

    Science.gov (United States)

    Wang, Tianyu; Nabavi, Sheida

    2018-04-24

    Differential gene expression analysis is one of the significant efforts in single cell RNA sequencing (scRNAseq) analysis to discover the specific changes in expression levels of individual cell types. Since scRNAseq exhibits multimodality, large amounts of zero counts, and sparsity, it is different from the traditional bulk RNA sequencing (RNAseq) data. The new challenges of scRNAseq data promote the development of new methods for identifying differentially expressed (DE) genes. In this study, we proposed a new method, SigEMD, that combines a data imputation approach, a logistic regression model and a nonparametric method based on the Earth Mover's Distance, to precisely and efficiently identify DE genes in scRNAseq data. The regression model and data imputation are used to reduce the impact of large amounts of zero counts, and the nonparametric method is used to improve the sensitivity of detecting DE genes from multimodal scRNAseq data. By additionally employing gene interaction network information to adjust the final states of DE genes, we further reduce the false positives of calling DE genes. We used simulated datasets and real datasets to evaluate the detection accuracy of the proposed method and to compare its performance with those of other differential expression analysis methods. Results indicate that the proposed method has an overall powerful performance in terms of precision in detection, sensitivity, and specificity. Copyright © 2018 Elsevier Inc. All rights reserved.

  20. Methods for small RNA preparation for digital gene expression profiling by next-generation sequencing

    NARCIS (Netherlands)

    Linsen, S.E.V.; Cuppen, E.

    2012-01-01

    Digital gene expression (DGE) profiling techniques are playing an eminent role in the detection, localization, and differential expression quantification of many small RNA species, including microRNAs (1-3). Procedures in small RNA library preparation techniques typically include adapter ligation by

  1. Characterizing embryonic gene expression patterns in the mouse using nonredundant sequence-based selection

    DEFF Research Database (Denmark)

    Sousa-Nunes, Rita; Rana, Amer Ahmed; Kettleborough, Ross

    2003-01-01

    This article investigates the expression patterns of 160 genes that are expressed during early mouse development. The cDNAs were isolated from 7.5 d postcoitum (dpc) endoderm, a region that comprises visceral endoderm (VE), definitive endoderm, and the node-tissues that are required for the initi...

  2. LHCb Tag Collector

    International Nuclear Information System (INIS)

    Fernández, Paloma Fuente; Clemencic, Marco; Cousin, Nicolas

    2011-01-01

    The LHCb physics software consists of hundreds of packages, each of which is developed by one or more physicists. When the developers have some code changes that they would like released, they commit them to the version control system, and enter the revision number into a database. These changes have to be integrated into a new release of each of the physics analysis applications. Tests are then performed by a nightly build system, which rebuilds various configurations of the whole software stack and executes a suite of run-time functionality tests. A Tag Collector system has been developed using solid standard technologies to cover both the use cases of developers and integration managers. A simple Web interface, based on an AJAX-like technology, is available. Integration with SVN and Nightly Build System, is possible via a Python API. Data are stored in a relational database with the help of an ORM (Object-Relational Mapping) library.

  3. Internally readable identifying tag

    International Nuclear Information System (INIS)

    Jefferts, K.B.; Jefferts, E.R.

    1980-01-01

    A method of identifying non-metallic objects by means of X-ray equipment is described in detail. A small metal pin with a number of grooves cut in a pre-determined equi-spaced pattern is implanted into the non-metallic object and by decoding the groove patterns using X-ray equipment, the object is uniquely identified. A specific example of such an application is in studying the migratory habits of fish. The pin inserted into the snout of the fish is 0.010 inch in diameter, 0.040 inch in length with 8 possible positions for grooves if spaced 0.005 inch apart. With 6 of the groove positions available for data, the capacity is 2 6 or 64 combinations; clearly longer pins would increase the data capacity. This method of identification is a major advance over previous techniques which necessitated destruction of the fish in order to recover the identification tag. (UK)

  4. Unique gene expression profile of the proliferating Xenopus tadpole tail blastema cells deciphered by RNA-sequencing analysis.

    Directory of Open Access Journals (Sweden)

    Hiroshi Tsujioka

    Full Text Available Organ regenerative ability depends on the animal species and the developmental stage. The molecular bases for variable organ regenerative ability, however, remain unknown. Previous studies have identified genes preferentially expressed in the blastema tissues in various animals, but transcriptome analysis of the isolated proliferating blastema cells has not yet been reported. In the present study, we used RNA-sequencing analysis to analyze the gene expression profile of isolated proliferating blastema cells of regenerating Xenopus laevis tadpole tails. We used flow cytometry to isolate proliferating cells, and non-proliferating blastema cells, from regenerating tadpole tails as well as proliferating tail bud cells from tail bud embryos, the latter two of which were used as control cells, based on their DNA content. Among the 28 candidate genes identified by RNA-sequencing analysis, quantitative reverse transcription-polymerase chain reaction identified 10 genes whose expression was enriched in regenerating tadpole tails compared with non-regenerating tadpole tails or tails from the tail bud embryos. Among them, whole mount in situ hybridization revealed that chromosome segregation 1-like and interleukin 11 were expressed in the broad area of the tail blastema, while brevican, lysyl oxidase, and keratin 18 were mainly expressed in the notochord bud in regenerating tails. We further combined whole mount in situ hybridization with immunohistochemistry for the incorporated 5-bromo-2-deoxyuridine to confirm that keratin 18 and interleukin 11 were expressed in the proliferating tail blastema cells. Based on the proposed functions of their homologs in other animal species, these genes might have roles in the extracellular matrix formation in the notochord bud (brevican and lysyl oxidase, cell proliferation (chromosome segregation 1-like and keratin 18, and in the maintenance of the differentiation ability of proliferating blastema cells (interleukin 11

  5. Non PCR-amplified Transcripts and AFLP fragments as reduced representations of the quail genome for 454 Titanium sequencing

    Directory of Open Access Journals (Sweden)

    Leterrier Christine

    2010-07-01

    Full Text Available Abstract Background SNP (Single Nucleotide Polymorphism discovery is now routinely performed using high-throughput sequencing of reduced representation libraries. Our objective was to adapt 454 GS FLX based sequencing methodologies in order to obtain the largest possible dataset from two reduced representations libraries, produced by AFLP (Amplified Fragment Length Polymorphism for genomic DNA, and EST (Expressed Sequence Tag for the transcribed fraction of the genome. Findings The expressed fraction was obtained by preparing cDNA libraries without PCR amplification from quail embryo and brain. To optimize the information content for SNP analyses, libraries were prepared from individuals selected in three quail lines and each individual in the AFLP library was tagged. Sequencing runs produced 399,189 sequence reads from cDNA and 373,484 from genomic fragments, covering close to 250 Mb of sequence in total. Conclusions Both methods used to obtain reduced representations for high-throughput sequencing were successful after several improvements. The protocols may be used for several sequencing applications, such as de novo sequencing, tagged PCR fragments or long fragment sequencing of cDNA.

  6. Identification of a Flavivirus Sequence in a Marine Arthropod.

    Directory of Open Access Journals (Sweden)

    Michael J Conway

    Full Text Available Phylogenetic analysis has yet to uncover the early origins of flaviviruses. In this study, I mined a database of expressed sequence tags in order to discover novel flavivirus sequences. Flavivirus sequences were identified in a pool of mRNA extracted from the sea spider Endeis spinosa (Pycnogonida, Pantopoda. Reconstruction of the translated sequences and BLAST analysis matched the sequence to the flavivirus NS5 gene. Additional sequences corresponding to envelope and the NS5 MTase domain were also identified. Phylogenetic analysis of homologous NS5 sequences revealed that Endeis spinosa NS5 (ESNS5 is likely related to classical insect-specific flaviviruses. It is unclear if ESNS5 represents genetic material from an active viral infection or an integrated viral genome. These data raise the possibility that classical insect-specific flaviviruses and perhaps medically relevant flaviviruses, evolved from progenitors that infected marine arthropods.

  7. Molecular Cloning and Expression of Sequence Variants of Manganese Superoxide Dismutase Genes from Wheat

    Science.gov (United States)

    Reactive oxygen species (ROS) are very harmful to living organisms due to the potential oxidation of membrane lipids, DNA, proteins, and carbohydrates. Transformed E.coli strain QC 871, superoxide dismutase (SOD) double-mutant, with three sequence variant MnSOD1, MnSOD2, and MnSOD3 manganese supero...

  8. Variation of clinical expression in patients with Stargardt dystrophy and sequence variations in the ABCR gene.

    Science.gov (United States)

    Fishman, G A; Stone, E M; Grover, S; Derlacki, D J; Haines, H L; Hockey, R R

    1999-04-01

    To report the spectrum of ophthalmic findings in patients with Stargardt dystrophy or fundus flavimaculatus who have a specific sequence variation in the ABCR gene. Twenty-nine patients with Stargardt dystrophy or fundus flavimaculatus from different pedigrees were identified with possible disease-causing sequence variations in the ABCR gene from a group of 66 patients who were screened for sequence variations in this gene. Patients underwent a routine ocular examination, including slitlamp biomicroscopy and a dilated fundus examination. Fluorescein angiography was performed on 22 patients, and electroretinographic measurements were obtained on 24 of 29 patients. Kinetic visual fields were measured with a Goldmann perimeter in 26 patients. Single-strand conformation polymorphism analysis and DNA sequencing were used to identify variations in coding sequences of the ABCR gene. Three clinical phenotypes were observed among these 29 patients. In phenotype I, 9 of 12 patients had a sequence change in exon 42 of the ABCR gene in which the amino acid glutamic acid was substituted for glycine (Gly1961Glu). In only 4 of these 9 patients was a second possible disease-causing mutation found on the other ABCR allele. In addition to an atrophic-appearing macular lesion, phenotype I was characterized by localized perifoveal yellowish white flecks, the absence of a dark choroid, and normal electroretinographic amplitudes. Phenotype II consisted of 10 patients who showed a dark choroid and more diffuse yellowish white flecks in the fundus. None exhibited the Gly1961Glu change. Phenotype III consisted of 7 patients who showed extensive atrophic-appearing changes of the retinal pigment epithelium. Electroretinographic cone and rod amplitudes were reduced. One patient showed the Gly1961Glu change. A wide variation in clinical phenotype can occur in patients with sequence changes in the ABCR gene. In individual patients, a certain phenotype seems to be associated with the presence of

  9. Engineering the ATLAS TAG Browser

    CERN Document Server

    Zhang, Q; The ATLAS collaboration

    2011-01-01

    ELSSI is a web-based event metadata (TAG) browser and event-level selection service for ATLAS. TAGs from all ATLAS physics and Monte Carlo data sets are routinely loaded into Oracle databases as an integral part of event processing. As data volumes increase, more and more sites are joining the distributed TAG data hosting topology[1]. Meanwhile, TAG content and database schemata continue to evolve as new user requirements and additional sources of metadata emerge. All of this has posed many challenges to the development of ELSSI, which must support vast amounts of TAG data while source, content, geographic locations, and user query patterns may change over time. In this paper, we describe some of the challenges encountered in the process of developing ELSSI, and the software engineering strategies adopted to address those challenges. Approaches to management of access to data, browsing, data rendering, query building, query validation, execution, connection management, and communication with auxiliary service...

  10. Engineering the ATLAS TAG Browser

    CERN Document Server

    Zhang, Q; The ATLAS collaboration

    2011-01-01

    ELSSI is a web-based event metadata (TAG) browser and event-level selection service for ATLAS. TAGs from all ATLAS physics and Monte Carlo data sets are routinely loaded into Oracle databases as an integral part of event processing. As data volumes increase, more and more sites are joining the distributed TAG data hosting topology. Meanwhile, TAG content and database schemata continue to evolve as new user requirements and additional sources of metadata emerge. All of this has posed many challenges to the development of ELSSI, which must support vast amounts of TAG data while source, content, geographic locations, and user query patterns may change over time. In this paper, we describe some of the challenges encountered in the process of developing ELSSI, and the software engineering strategies adopted to address those challenges. Approaches to management of access to data, browsing, data rendering, query building, query validation, execution, connection management, and communication with auxiliary services a...

  11. Identification of the full-length β-actin sequence and expression profiles in the tree shrew (Tupaia belangeri).

    Science.gov (United States)

    Zheng, Yu; Yun, Chenxia; Wang, Qihui; Smith, Wanli W; Leng, Jing

    2015-02-01

    The tree shrew (Tupaia belangeri) diverges from the primate order (Primates) and is classified as a separate taxonomic group of mammals - Scandentia. It has been suggested that the tree shrew can be used as an animal model for studying human diseases; however, the genomic sequence of the tree shrew is largely unidentified. In the present study, we reported the full-length cDNA sequence of the housekeeping gene, β-actin, in the tree shrew. The amino acid sequence of β-actin in the tree shrew was compared to that of humans and other species; a simple phylogenetic relationship was discovered. Quantitative polymerase chain reaction (qPCR) and western blot analysis further demonstrated that the expression profiles of β-actin, as a general conservative housekeeping gene, in the tree shrew were similar to those in humans, although the expression levels varied among different types of tissue in the tree shrew. Our data provide evidence that the tree shrew has a close phylogenetic association with humans. These findings further enhance the potential that the tree shrew, as a species, may be used as an animal model for studying human disorders.

  12. Transcriptome sequencing and differential gene expression analysis in Viola yedoensis Makino (Fam. Violaceae) responsive to cadmium (Cd) pollution

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Jian [Key Laboratory of Biology and Genetic Improvement of Maize in Southwest Region, Ministry of Agriculture, Maize Research Institute of Sichuan Agricultural University, Wenjiang, Sichuan (China); Luo, Mao [Drug Discovery Research Center of Luzhou Medical College, Luzhou, Sichuan (China); Zhu, Ye; He, Ying; Wang, Qin [Department of Pharmacy of Luzhou Medical College, Luzhou, Sichuan (China); Zhang, Chun, E-mail: zc83good@126.com [Department of Pharmacy of Luzhou Medical College, Luzhou, Sichuan (China)

    2015-03-27

    Viola yedoensis Makino is an important Chinese traditional medicine plant adapted to cadmium (Cd) pollution regions. Illumina sequencing technology was used to sequence the transcriptome of V. yedoensis Makino. We sequenced Cd-treated (VIYCd) and untreated (VIYCK) samples of V. yedoensis, and obtained 100,410,834 and 83,587,676 high quality reads, respectively. After de novo assembly and quantitative assessment, 109,800 unigenes were finally generated with an average length of 661 bp. We then obtained functional annotations by aligning unigenes with public protein databases including NR, NT, SwissProt, KEGG and COG. In addition, 892 differentially expressed genes (DEGs) were investigated between the two libraries of untreated (VIYCK) and Cd-treated (VIYCd) plants. Moreover, 15 randomly selected DEGs were further validated with qRT-PCR and the results were highly accordant with the Solexa analysis. This study firstly generated a successful global analysis of the V. yedoensis transcriptome and it will provide for further studies on gene expression, genomics, and functional genomics in Violaceae. - Highlights: • A de novo assembly generated 109,800 unigenes and 5,4479 of them were annotated. • 31,285 could be classified into 26 COG categories. • 263 biosynthesis pathways were predicted and classified into five categories. • 892 DEGs were detected and 15 of them were validated by qRT-PCR.

  13. Sequence analysis and gene expression of putative exo- and endo-glucanases from oil palm (Elaeis guineensis) during fungal infection.

    Science.gov (United States)

    Yeoh, Keat-Ai; Othman, Abrizah; Meon, Sariah; Abdullah, Faridah; Ho, Chai-Ling

    2012-10-15

    Glucanases are enzymes that hydrolyze a variety β-d-glucosidic linkages. Plant β-1,3-glucanases are able to degrade fungal cell walls; and promote the release of cell-wall derived fungal elicitors. In this study, three full-length cDNA sequences encoding oil palm (Elaeis guineensis) glucanases were analyzed. Sequence analyses of the cDNA sequences suggested that EgGlc1-1 is a putative β-d-glucan exohydolase belonging to glycosyl hydrolase (GH) family 3 while EgGlc5-1 and EgGlc5-2 are putative glucan endo-1,3-β-glucosidases belonging to GH family 17. The transcript abundance of these genes in the roots and leaves of oil palm seedlings treated with Ganoderma boninense and Trichoderma harzianum was profiled to investigate the involvement of these glucanases in oil palm during fungal infection. The gene expression of EgGlc1-1 in the root of oil palm seedlings was increased by T. harzianum but suppressed by G. boninense; while the gene expression of both EgGlc5-1 and EgGlc5-2 in the roots of oil palm seedlings was suppressed by G. boninense or/and T. harzianum. Copyright © 2012 Elsevier GmbH. All rights reserved.

  14. Transcriptome sequencing and differential gene expression analysis in Viola yedoensis Makino (Fam. Violaceae) responsive to cadmium (Cd) pollution

    International Nuclear Information System (INIS)

    Gao, Jian; Luo, Mao; Zhu, Ye; He, Ying; Wang, Qin; Zhang, Chun

    2015-01-01

    Viola yedoensis Makino is an important Chinese traditional medicine plant adapted to cadmium (Cd) pollution regions. Illumina sequencing technology was used to sequence the transcriptome of V. yedoensis Makino. We sequenced Cd-treated (VIYCd) and untreated (VIYCK) samples of V. yedoensis, and obtained 100,410,834 and 83,587,676 high quality reads, respectively. After de novo assembly and quantitative assessment, 109,800 unigenes were finally generated with an average length of 661 bp. We then obtained functional annotations by aligning unigenes with public protein databases including NR, NT, SwissProt, KEGG and COG. In addition, 892 differentially expressed genes (DEGs) were investigated between the two libraries of untreated (VIYCK) and Cd-treated (VIYCd) plants. Moreover, 15 randomly selected DEGs were further validated with qRT-PCR and the results were highly accordant with the Solexa analysis. This study firstly generated a successful global analysis of the V. yedoensis transcriptome and it will provide for further studies on gene expression, genomics, and functional genomics in Violaceae. - Highlights: • A de novo assembly generated 109,800 unigenes and 5,4479 of them were annotated. • 31,285 could be classified into 26 COG categories. • 263 biosynthesis pathways were predicted and classified into five categories. • 892 DEGs were detected and 15 of them were validated by qRT-PCR

  15. OSIRIS-REx Touch-And-Go (TAG) Navigation Performance

    Science.gov (United States)

    Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

    2015-01-01

    The Origins Spectral Interpretation Resource identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIES-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper will summarize the Monte-Carlo simulation of the TAG sequence and present analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and +-2 cms of the targeted contact velocity. The paper will describe some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.

  16. OSIRI-REx Touch and Go (TAG) Navigation Performance

    Science.gov (United States)

    Berry, Kevin; Antreasian, Peter; Moreau, Michael C.; May, Alex; Sutter, Brian

    2015-01-01

    The Origins Spectral Interpretation Resource Identification Security Regolith Explorer (OSIRIS-REx) mission is a NASA New Frontiers mission launching in 2016 to rendezvous with the near-Earth asteroid (101955) Bennu in late 2018. Following an extensive campaign of proximity operations activities to characterize the properties of Bennu and select a suitable sample site, OSIRIS-REx will fly a Touch-And-Go (TAG) trajectory to the asteroid's surface to obtain a regolith sample. The paper summarizes the mission design of the TAG sequence, the propulsive maneuvers required to achieve the trajectory, and the sequence of events leading up to the TAG event. The paper also summarizes the Monte-Carlo simulation of the TAG sequence and presents analysis results that demonstrate the ability to conduct the TAG within 25 meters of the selected sample site and 2 cm/s of the targeted contact velocity. The paper describes some of the challenges associated with conducting precision navigation operations and ultimately contacting a very small asteroid.

  17. Serial analysis of gene expression in the silkworm, Bombyx mori.

    Science.gov (United States)

    Huang, Jianhua; Miao, Xuexia; Jin, Weirong; Couble, Pierre; Mita, Kasuei; Zhang, Yong; Liu, Wenbin; Zhuang, Leijun; Shen, Yan; Keime, Celine; Gandrillon, Olivier; Brouilly, Patrick; Briolay, Jerome; Zhao, Guoping; Huang, Yongping

    2005-08-01

    The silkworm Bombyx mori is one of the most economically important insects and serves as a model for Lepidoptera insects. We used serial analysis of gene expression (SAGE) to derive profiles of expressed genes during the developmental life cycle of the silkworm and to create a reference for understanding silkworm metamorphosis. We generated four SAGE libraries, one from each of the four developmental stages of the silkworm. In total we obtained 257,964 SAGE tags, of which 39,485 were unique tags. Sorted by copy number, 14.1% of the unique tags were detected at a median to high level (five or more copies), 24.2% at lower levels (two to four copies), and 61.7% as single copies. Using a basic local alignment search tool on the EST database, 35% of the tags matched known silkworm expressed sequence tags. SAGE demonstrated that a number of the genes were up- or down-regulated during the four developmental phases of the egg, larva, pupa, and adult. Furthermore, we found that the generation of longer cDNA fragments from SAGE tags constituted the most efficient method of gene identification, which facilitated the analysis of a large number of unknown genes.

  18. Expression

    Directory of Open Access Journals (Sweden)

    Wang-Xia Wang

    2014-02-01

    Full Text Available The miR-15/107 family comprises a group of 10 paralogous microRNAs (miRNAs, sharing a 5′ AGCAGC sequence. These miRNAs have overlapping targets. In order to characterize the expression of miR-15/107 family miRNAs, we employed customized TaqMan Low-Density micro-fluid PCR-array to investigate the expression of miR-15/107 family members, and other selected miRNAs, in 11 human tissues obtained at autopsy including the cerebral cortex, frontal cortex, primary visual cortex, thalamus, heart, lung, liver, kidney, spleen, stomach and skeletal muscle. miR-103, miR-195 and miR-497 were expressed at similar levels across various tissues, whereas miR-107 is enriched in brain samples. We also examined the expression patterns of evolutionarily conserved miR-15/107 miRNAs in three distinct primary rat brain cell preparations (enriched for cortical neurons, astrocytes and microglia, respectively. In primary cultures of rat brain cells, several members of the miR-15/107 family are enriched in neurons compared to other cell types in the central nervous system (CNS. In addition to mature miRNAs, we also examined the expression of precursors (pri-miRNAs. Our data suggested a generally poor correlation between the expression of mature miRNAs and their precursors. In summary, we provide a detailed study of the tissue and cell type-specific expression profile of this highly expressed and phylogenetically conserved family of miRNA genes.

  19. Identification of differentially expressed genes in sunflower (Helianthus annuus) leaves and roots under drought stress by RNA sequencing.

    Science.gov (United States)

    Liang, Chunbo; Wang, Wenjun; Wang, Jing; Ma, Jun; Li, Cen; Zhou, Fei; Zhang, Shuquan; Yu, Ying; Zhang, Liguo; Li, Weizhong; Huang, Xutang

    2017-10-25

    Sunflower is recognized as one of the most important oil plants with strong tolerance to drought in the world. In order to study the response mechanisms of sunflower plants to drought stress, gene expression profiling using high throughput sequencing was performed for seedling leaves and roots (sunflower inbred line R5) after 24 h of drought stress (15% PEG 6000). The transcriptome assembled using sequences of 12 samples was used as a reference. 805 and 198 genes were identified that were differentially expressed in leaves and roots, respectively. Another 71 genes were differentially expressed in both organs, in which more genes were up-regulated than down-regulated. In agreement with results obtained for other crops or from previous sunflower studies, we also observed that nine genes may be associated with the response of sunflower to drought. The results of this study may provide new information regarding the sunflower drought response, as well as add to the number of known genes associated with drought tolerance.

  20. Gene Expression Profiles in Paired Gingival Biopsies from Periodontitis-Affected and Healthy Tissues Revealed by Massively Parallel Sequencing

    Science.gov (United States)

    Båge, Tove; Lagervall, Maria; Jansson, Leif; Lundeberg, Joakim; Yucel-Lindberg, Tülay

    2012-01-01

    Periodontitis is a chronic inflammatory disease affecting the soft tissue and bone that surrounds the teeth. Despite extensive research, distinctive genes responsible for the disease have not been identified. The objective of this study was to elucidate transcriptome changes in periodontitis, by investigating gene expression profiles in gingival tissue obtained from periodontitis-affected and healthy gingiva from the same patient, using RNA-sequencing. Gingival biopsies were obtained from a disease-affected and a healthy site from each of 10 individuals diagnosed with periodontitis. Enrichment analysis performed among uniquely expressed genes for the periodontitis-affected and healthy tissues revealed several regulated pathways indicative of inflammation for the periodontitis-affected condition. Hierarchical clustering of the sequenced biopsies demonstrated clustering according to the degree of inflammation, as observed histologically in the biopsies, rather than clustering at the individual level. Among the top 50 upregulated genes in periodontitis-affected tissues, we investigated two genes which have not previously been demonstrated to be involved in periodontitis. These included interferon regulatory factor 4 and chemokine (C-C motif) ligand 18, which were also expressed at the protein level in gingival biopsies from patients with periodontitis. In conclusion, this study provides a first step towards a quantitative comprehensive insight into the transcriptome changes in periodontitis. We demonstrate for the first time site-specific local variation in gene expression profiles of periodontitis-affected and healthy tissues obtained from patients with periodontitis, using RNA-seq. Further, we have identified novel genes expressed in periodontitis tissues, which may constitute potential therapeutic targets for future treatment strategies of periodontitis. PMID:23029519

  1. Structure, sequence and expression of the hepatitis delta (δ) viral genome

    Science.gov (United States)

    Wang, Kang-Sheng; Choo, Qui-Lim; Weiner, Amy J.; Ou, Jing-Hsiung; Najarian, Richard C.; Thayer, Richard M.; Mullenbach, Guy T.; Denniston, Katherine J.; Gerin, John L.; Houghton, Michael

    1986-10-01

    Biochemical and electron microscopic data indicate that the human hepatitis δ viral agent contains a covalently closed circular and single-stranded RNA genome that has certain similarities with viroid-like agents from plants. The sequence of the viral genome (1,678 nucleotides) has been determined and an open reading frame within the complementary strand has been shown to encode an antigen that binds specifically to antisera from patients with chronic hepatitis δ viral infections.

  2. De novo transcriptome sequencing of Isaria cateniannulata and comparative analysis of gene expression in response to heat and cold stresses.

    Directory of Open Access Journals (Sweden)

    Dingfeng Wang

    Full Text Available Isaria cateniannulata is a very important and virulent entomopathogenic fungus that infects many insect pest species. Although I. cateniannulata is commonly exposed to extreme environmental temperature conditions, little is known about its molecular response mechanism to temperature stress. Here, we sequenced and de novo assembled the transcriptome of I. cateniannulata in response to high and low temperature stresses using Illumina RNA-Seq technology. Our assembly encompassed 17,514 unigenes (mean length = 1,197 bp, in which 11,445 unigenes (65.34% showed significant similarities to known sequences in NCBI non-redundant protein sequences (Nr database. Using digital gene expression analysis, 4,483 differentially expressed genes (DEGs were identified after heat treatment, including 2,905 up-regulated genes and 1,578 down-regulated genes. Under cold stress, 1,927 DEGs were identified, including 1,245 up-regulated genes and 682 down-regulated genes. The expression patterns of 18 randomly selected candidate DEGs resulting from quantitative real-time PCR (qRT-PCR were consistent with their transcriptome analysis results. Although DEGs were involved in many pathways, we focused on the genes that were involved in endocytosis: In heat stress, the pathway of clathrin-dependent endocytosis (CDE was active; however at low temperature stresses, the pathway of clathrin-independent endocytosis (CIE was active. Besides, four categories of DEGs acting as temperature sensors were observed, including cell-wall-major-components-metabolism-related (CWMCMR genes, heat shock protein (Hsp genes, intracellular-compatible-solutes-metabolism-related (ICSMR genes and glutathione S-transferase (GST. These results enhance our understanding of the molecular mechanisms of I. cateniannulata in response to temperature stresses and provide a valuable resource for the future investigations.

  3. Candidate genes revealed by a genome scan for mosquito resistance to a bacterial insecticide: sequence and gene expression variations

    Directory of Open Access Journals (Sweden)

    David Jean-Philippe

    2009-11-01

    Full Text Available Abstract Background Genome scans are becoming an increasingly popular approach to study the genetic basis of adaptation and speciation, but on their own, they are often helpless at identifying the specific gene(s or mutation(s targeted by selection. This shortcoming is hopefully bound to disappear in the near future, thanks to the wealth of new genomic resources that are currently being developed for many species. In this article, we provide a foretaste of this exciting new era by conducting a genome scan in the mosquito Aedes aegypti with the aim to look for candidate genes involved in resistance to Bacillus thuringiensis subsp. israelensis (Bti insecticidal toxins. Results The genome of a Bti-resistant and a Bti-susceptible strains was surveyed using about 500 MITE-based molecular markers, and the loci showing the highest inter-strain genetic differentiation were sequenced and mapped on the Aedes aegypti genome sequence. Several good candidate genes for Bti-resistance were identified in the vicinity of these highly differentiated markers. Two of them, coding for a cadherin and a leucine aminopeptidase, were further examined at the sequence and gene expression levels. In the resistant strain, the cadherin gene displayed patterns of nucleotide polymorphisms consistent with the action of positive selection (e.g. an excess of high compared to intermediate frequency mutations, as well as a significant under-expression compared to the susceptible strain. Conclusion Both sequence and gene expression analyses agree to suggest a role for positive selection in the evolution of this cadherin gene in the resistant strain. However, it is unlikely that resistance to Bti is conferred by this gene alone, and further investigation will be needed to characterize other genes significantly associated with Bti resistance in Ae. aegypti. Beyond these results, this article illustrates how genome scans can build on the body of new genomic information (here, full

  4. RNA-sequence analysis of gene expression from honeybees (Apis mellifera) infected with Nosema ceranae

    Science.gov (United States)

    Fougeroux, André; Petit, Fabien; Anselmo, Anna; Gorni, Chiara; Cucurachi, Marco; Cersini, Antonella; Granato, Anna; Cardeti, Giusy; Formato, Giovanni; Mutinelli, Franco; Giuffra, Elisabetta; Williams, John L.; Botti, Sara

    2017-01-01

    Honeybees (Apis mellifera) are constantly subjected to many biotic stressors including parasites. This study examined honeybees infected with Nosema ceranae (N. ceranae). N. ceranae infection increases the bees energy requirements and may contribute to their decreased survival. RNA-seq was used to investigate gene expression at days 5, 10 and 15 Post Infection (P.I) with N. ceranae. The expression levels of genes, isoforms, alternative transcription start sites (TSS) and differential promoter usage revealed a complex pattern of transcriptional and post-transcriptional gene regulation suggesting that bees use a range of tactics to cope with the stress of N. ceranae infection. N. ceranae infection may cause reduced immune function in the bees by: (i)disturbing the host amino acids metabolism (ii) down-regulating expression of antimicrobial peptides (iii) down-regulation of cuticle coatings and (iv) down-regulation of odorant binding proteins. PMID:28350872

  5. Identification of Molecular Tumor Markers in Renal Cell Carcinomas with TFE3 Protein Expression by RNA Sequencing

    Directory of Open Access Journals (Sweden)

    Dorothee Pflueger

    2013-11-01

    Full Text Available TFE3 translocation renal cell carcinoma (tRCC is defined by chromosomal translocations involving the TFE3 transcription factor at chromosome Xp11.2. Genetically proven TFE3 tRCCs have a broad histologic spectrum with overlapping features to other renal tumor subtypes. In this study,we aimed for characterizing RCC with TFE3 protein expression. Using next-generation whole transcriptome sequencing (RNA-Seq as a discovery tool, we analyzed fusion transcripts, gene expression profile, and somatic mutations in frozen tissue of one TFE3 tRCC. By applying a computational analysis developed to call chimeric RNA molecules from paired-end RNA-Seq data, we confirmed the known TFE3 translocation. Its fusion partner SFPQ has already been described as fusion partner in tRCCs. In addition, an RNAread-through chimera between TMED6 and COG8 as well as MET and KDR (VEGFR2 point mutations were identified. An EGFR mutation, but no chromosomal rearrangements, was identified in a control group of five clear cell RCCs (ccRCCs. The TFE3 tRCC could be clearly distinguished from the ccRCCs by RNA-Seq gene expression measurements using a previously reported tRCC gene signature. In validation experiments using reverse transcription-PCR, TMED6-COG8 chimera expression was significantly higher in nine TFE3 translocated and six TFE3-expressing/non-translocated RCCs than in 24 ccRCCs (P<.001 and 22 papillaryRCCs (P<.05-.07. Immunohistochemical analysis of selected genes from the tRCC gene signature showed significantly higher eukaryotic translation elongation factor 1 alpha 2 (EEF1A2 and Contactin 3 (CNTN3 expression in 16 TFE3 translocated and six TFE3-expressing/non-translocated RCCs than in over 200 ccRCCs (P < .0001, both.

  6. NHE-1 sequence and expression in toad, snake and fish red blood cells

    DEFF Research Database (Denmark)

    Thomsen, Steffen Nyegaard; Wang, Tobias; Kristensen, Torsten

    Red blood cells (RBC) from reptiles appear not to express regulatory volume increase (RVI) upon shrinkage (Kristensen et al., 2008). In other vertebrates, the RVI response is primarily mediated by activation of the Na+/H+ exchanger (NHE-1) and we, therefore decided to investigate whether red cells...... of reptiles express a different NHE-1 that responds less to volume activation compared to other vertebrates or simply lack the Na+/H+ exchanger. Using various tissues from the ball python (Python regius), Cane toad (Bufo marinus) and European perch (Perca fluviatilis), cDNA libraries were created...

  7. Polarized tagged photons

    International Nuclear Information System (INIS)

    Maximon, L.C.; Ganz, Eric; Aniel, Thierry; Miniac, Arlette de.

    1982-03-01

    We consider in detail the differential cross section for polarized bremsstrahlung for angles and energies in the range of interest for a tagging system and derive a high energy, small angle approximation for this cross section. We use these approximations to determine the maxima and minima of the cross sections for these two polarization states, dσperpendicular and dσparallel, and to evaluate these cross sections at the extrema. It is shown that both dσperpendicular and dσparallel have a very sharp dip in the region of small momentum transfers. However, their behavior in the region of the dip, as a function of the azimuthal angle phi, is quite different over most of the photon spectrum. The cross section dσperpendicular behaves similarly to the cross section for unpolarized photons in that as phi increases, the sharp dip vanishes, the minimum fuses with the second maximum, and the cross section then has only a single maximum. In contrast, the sharp dip in the cross section dσparallel remains as phi increases. Coulomb corrections to the Born approximation are considered, and do not fill in these dips

  8. Sequencing and transcriptional analysis of the Streptococcus thermophilus histamine biosynthesis gene cluster: factors that affect differential hdcA expression

    DEFF Research Database (Denmark)

    Calles-Enríquez, Marina; Hjort, Benjamin Benn; Andersen, Pia Skov

    2010-01-01

    to produce histamine. The hdc clusters of S. thermophilus CHCC1524 and CHCC6483 were sequenced, and the factors that affect histamine biosynthesis and histidine-decarboxylating gene (hdcA) expression were studied. The hdc cluster began with the hdcA gene, was followed by a transporter (hdcP), and ended...... with the hdcB gene, which is of unknown function. The three genes were orientated in the same direction. The genetic organization of the hdc cluster showed a unique organization among the lactic acid bacterial group and resembled those of Staphylococcus and Clostridium species, thus indicating possible...... acquisition through a horizontal transfer mechanism. Transcriptional analysis of the hdc cluster revealed the existence of a polycistronic mRNA covering the three genes. The histidine-decarboxylating gene (hdcA) of S. thermophilus demonstrated maximum expression during the stationary growth phase, with high...

  9. Heme regulates the expression in Saccharomyces cerevisiae of chimaeric genes containing 5'-flanking soybean leghemoglobin sequences

    DEFF Research Database (Denmark)

    Jensen, E O; Marcker, K A; Villadsen, IS

    1986-01-01

    The TM1 yeast mutant was transformed with a 2 micron-derived plasmid (YEp24) which carries a chimaeric gene containing the Escherichia coli chloramphenicol acetyl transferase (CAT) gene fused to the 5'- and 3'-flanking regions of the soybean leghemoglobin (Lb) c3 gene. Expression of the chimaeric...

  10. Sequence genomic organization and expression of two channel catfish Ictalurus punctatus Ghrelin receptors

    Science.gov (United States)

    Two ghrelin receptor (GHS-R) genes were isolated from channel catfish tissue and a bacterial artificial chromosome (BAC) library. The two receptors were characterized by determining tissue distribution, ontogeny of receptor mRNA expression, and effects of exogenous homologous ghrelin administration ...

  11. Laccase Gene Family in Cerrena sp. HYB07: Sequences, Heterologous Expression and Transcriptional Analysis

    Directory of Open Access Journals (Sweden)

    Jie Yang

    2016-08-01

    Full Text Available Laccases are a class of multi-copper oxidases with industrial potential. In this study, eight laccases (Lac1–8 from Cerrena sp. strain HYB07, a white-rot fungus with high laccase yields, were analyzed. The laccases showed moderate identities to each other as well as with other fungal laccases and were predicted to have high redox potentials except for Lac6. Selected laccase isozymes were heterologously expressed in the yeast Pichia pastoris, and different enzymatic properties were observed. Transcription of the eight laccase genes was differentially regulated during submerged and solid state fermentation, as shown by quantitative real-time polymerase chain reaction and validated reference genes. During 6-day submerged fermentation, Lac7 and 2 were successively the predominantly expressed laccase gene, accounting for over 95% of all laccase transcripts. Interestingly, accompanying Lac7 downregulation, Lac2 transcription was drastically upregulated on days 3 and 5 to 9958-fold of the level on day 1. Consistent with high mRNA abundance, Lac2 and 7, but not other laccases, were identified in the fermentation broth by LC-MS/MS. In solid state fermentation, less dramatic differences in transcript abundance were observed, and Lac3, 7 and 8 were more highly expressed than other laccase genes. Elucidating the properties and expression profiles of the laccase gene family will facilitate understanding, production and commercialization of the fungal strain and its laccases.

  12. cDNA sequence and tissue expression analysis of glucokinase from ...

    African Journals Online (AJOL)

    Yomi

    2012-01-10

    Jan 10, 2012 ... distribution of GK mRNA in brain, mesenteric adipose tissue, spleen, white muscle and liver of grass ... expression profile of GK mRNA in liver normalized with β-actin level was 31, 454 and 649-fold compared .... Primers and expected products used for GK gene cDNA RT-PCR, RACE and real-time PCR.

  13. Quantum tagging for tags containing secret classical data

    International Nuclear Information System (INIS)

    Kent, Adrian

    2011-01-01

    Various authors have considered schemes for quantum tagging, that is, authenticating the classical location of a classical tagging device by sending and receiving quantum signals from suitably located distant sites, in an environment controlled by an adversary whose quantum information processing and transmitting power is potentially unbounded. All of the schemes proposed elsewhere in the literature assume that the adversary is able to inspect the interior of the tagging device. All of these schemes have been shown to be breakable if the adversary has unbounded predistributed entanglement. We consider here the case in which the tagging device contains a finite key string shared with distant sites but kept secret from the adversary, and show this allows the location of the tagging device to be authenticated securely and indefinitely. Our protocol relies on quantum key distribution between the tagging device and at least one distant site, and demonstrates a new practical application of quantum key distribution. It also illustrates that the attainable security in position-based cryptography can depend crucially on apparently subtle details in the security scenario considered.

  14. Improved determination of left ventricular volume with myocardial tagging

    International Nuclear Information System (INIS)

    Peshock, R.M.; Takai, H.; Baker, K.V.; Clarke, G.D.; McDonald, G.G.; Parkey, R.W.

    1991-01-01

    Cine MR imaging can be used to determine ventricular volume and ejection fraction. However, definition of the endocardial surface can be difficult, leading some investigators to suggest that black-blood studies are preferable. Grid tagging with use of spatial modulation of magnetization has been used to improve assessments of wall motion. The purpose of this paper, is to determine if grid tagging would also facilitate definition of the endocardial border for volume and ejection fraction calculations. Grid tagging based on spatial modulation of magnetization was implemented on a Toshiba 0.5-T MR imaging device. Standard RAO images were obtained in 10 normal volunteers with use of standard cine MR imaging sequences (33/22) with and without grid tagging. Images were analyzed to determine ventricular volume, cardiac output and wall motion. Images obtained without tagging generally showed good contrast at end diastole, but definition of the endocardial border was frequently more difficult in middle to late systole. Images with tagging provided significantly better definition of endocardial borders, particularly during systole

  15. Modified intracellular-associated phenotypes in a recombinant Salmonella Typhi expressing S. Typhimurium SPI-3 sequences.

    Directory of Open Access Journals (Sweden)

    Patricio Retamal

    Full Text Available A bioinformatics comparison of Salmonella Pathogenicity Island 3 sequences from S. Typhi and S. Typhimurium serovars showed that ten genes are highly conserved. However three of them are pseudogenes in S. Typhi. Our aim was to understand what functions are lost in S. Typhi due to pseudogenes by constructing a S. Typhi genetic hybrid carrying the SPI-3 region of S. Typhimurium instead of its own SPI-3. We observed that under stressful conditions the hybrid strain showed a clear impairment in resistance to hydrogen peroxide and decreased survival within U937 culture monocytes. We hypothesized that the marT-fidL operon, encoded in SPI-3, was responsible for the new phenotypes because marT is a pseudogen in S. Typhi and has a demonstrated role as a transcriptional regulator in S. Typhimurium. Therefore we cloned and transferred the S. Typhimurium marT-fidL operon into S. Typhi and confirmed that invasion of monocytes was dramatically decreased. Finally, our findings suggest that the genomic and functional differences between SPI-3 sequences have implications in the host specificity of Typhi and Typhimurium serovars.

  16. RNA Sequencing Reveals the Alteration of the Expression of Novel Genes in Ethanol-Treated Embryoid Bodies.

    Science.gov (United States)

    Mandal, Chanchal; Kim, Sun Hwa; Chai, Jin Choul; Oh, Seon Mi; Lee, Young Seek; Jung, Kyoung Hwa; Chai, Young Gyu

    2016-01-01

    Fetal alcohol spectrum disorder is a collective term representing fetal abnormalities associated with maternal alcohol consumption. Prenatal alcohol exposure and related anomalies are well characterized, but the molecular mechanism behind this phenomenon is not well characterized. In this present study, our aim is to profile important genes that regulate cellular development during fetal development. Human embryonic carcinoma cells (NCCIT) are cultured to form embryoid bodies and then treated in the presence and absence of ethanol (50 mM). We employed RNA sequencing to profile differentially expressed genes in the ethanol-treated embryoid bodies from NCCIT vs. EB, NCCIT vs. EB+EtOH and EB vs. EB+EtOH data sets. A total of 632, 205 and 517 differentially expressed genes were identified from NCCIT vs. EB, NCCIT vs. EB+EtOH and EB vs. EB+EtOH, respectively. Functional annotation using bioinformatics tools reveal significant enrichment of differential cellular development and developmental disorders. Furthermore, a group of 42, 15 and 35 transcription factor-encoding genes are screened from all of the differentially expressed genes obtained from NCCIT vs. EB, NCCIT vs. EB+EtOH and EB vs. EB+EtOH, respectively. We validated relative gene expression levels of several transcription factors from these lists by quantitative real-time PCR. We hope that our study substantially contributes to the understanding of the molecular mechanism underlying the pathology of alcohol-mediated anomalies and ease further research.

  17. Myocardial tagging with steady state free precession techniques and semi-automatic postprocessing--impact on diagnostic value

    DEFF Research Database (Denmark)

    Johnson, Thorsten R C; Bayrhof, Nicole; Huber, Armin

    2007-01-01

    Our aim was to determine the diagnostic value of myocardial tagging sequences with regard to the evaluable share of the cardiac cycle. Thirty-three patients were examined at 1.5 T using tagging sequences with gradient-echo (GRE) readout, 18 patients at 1.5 T with steady-state free precession (SSF...

  18. Regulation of gene expression in Mycoplasmas: contribution from Mycoplasma hyopneumoniae and Mycoplasma synoviae genome sequences

    Directory of Open Access Journals (Sweden)

    Humberto Maciel França Madeira

    2007-01-01

    Full Text Available This report describes the transcription apparatus of Mycoplasma hyopneumoniae (strains J and 7448 and Mycoplasma synoviae, using a comparative genomics approach to summarize the main features related to transcription and control of gene expression in mycoplasmas. Most of the transcription-related genes present in the three strains are well conserved among mycoplasmas. Some unique aspects of transcription in mycoplasmas and the scarcity of regulatory proteins in mycoplasma genomes are discussed.

  19. Roles of Solvent Accessibility and Gene Expression in Modeling Protein Sequence Evolution

    OpenAIRE

    Kuangyu Wang; Shuhui Yu; Xiang Ji; Clemens Lakner; Alexander Griffing; Jeffrey L. Thorne

    2015-01-01

    Models of protein evolution tend to ignore functional constraints, although structural constraints are sometimes incorporated. Here we propose a probabilistic framework for codon substitution that evaluates joint effects of relative solvent accessibility (RSA), a structural constraint; and gene expression, a functional constraint. First, we explore the relationship between RSA and codon usage at the genomic scale as well as at the individual gene scale. Motivated by these results, we construc...

  20. Specific DNA-binding proteins and DNA sequences involved in steroid hormone regulation of gene expression

    International Nuclear Information System (INIS)

    Spelsberg, T.; Hora, J.; Horton, M.; Goldberger, A.; Littlefield, B.; Seelke, R.; Toyoda, H.

    1987-01-01

    Steroid hormones circulate in the blood and are taken by target cells via complexes with intracellular binding proteins termed receptors, that are hormone and tissue specific. Each receptor binds it specific steroid with very high affinity, having an equilibrium dissociation constant (K/sub d/) in the range of 10 -9 to 10 -10 M. Once bound by their specific steroid hormones, the steroid receptors undergo a conformational change which allows them to bind with high affinity to sites on chromatin, termed nuclear acceptor sites. There are estimated 5,000 to 10,000 of these sites expressed with an equal number not expressed (''masked'') in intact chromatin. The result of the binding to nuclear acceptor sites is an alteration of gene transcription or, in some cases, gene expression as measured by the changing levels of specific RNAs and proteins in that target tissue. Each steroid regulates specific effects on the RNA and protein profiles. The chronology of the above mechanism of action after injection of radiolabelled steroid as is follows: Steroid-receptor complex formation (1 minute), nuclear acceptor sites (2 minutes), effects on RNA synthesis (10 to 30 minutes), and finally the changing protein profiles via changes in protein synthesis and protein turnover (1 to 6 hours). Thus steroid receptors represent one of the first identified intracellular gene regulation proteins. The receptor molecules themselves are regulated by the presence or absence of the steroid molecule

  1. Molecular cloning, sequence identification and expression profile of domestic guinea pig (Cavia porcellus UGT1A1 gene

    Directory of Open Access Journals (Sweden)

    Yang Deming

    2016-01-01

    Full Text Available Domestic guinea pig is a model animal for human disease research. Uridine diphosphate glucuronosyltransferase 1 family, polypeptide A1 (UGT1A1 is an important human disease-related gene. In this study, the complete coding sequence of domestic guinea pig gene UGT1A1 was amplified by reverse transcription-polymerase chain reaction. The open reading frame of the domestic guinea pig UGT1A1 gene is 1602 bp in length and was found to encode a protein of 533 amino acids. Sequence analysis revealed that the UGT1A1 protein of domestic guinea pig shared high homology with the UGT1A1 proteins of degu (84%, damara mole-rat (84%, human (80%, northern white-cheeked gibbon (80%, Colobus angolensis palliatus (80% and golden snub-nosed monkey (79%. This gene contains five exons and four introns, as revealed by the computer-assisted analysis. The results also showed that the domestic guinea pig UGT1A1 gene had a close genetic relationship with the UGT1A1 gene of degu. The prediction of transmembrane helices showed that domestic guinea pig UGT1A1 might be a transmembrane protein. Expression profile analysis indicated that the domestic guinea pig UGT1A1 gene was differentially expressed in detected domestic guinea pig tissues. Our experiment laid a primary foundation for using the domestic guinea pig as a model animal to study the UGT1A1-related human diseases.

  2. De novo transcriptome sequencing and comparative analysis of differentially expressed genes in dryoperis fragrans under temperature stress

    International Nuclear Information System (INIS)

    Wang, W.Z.; Tong, W.S.; Gao, R.

    2016-01-01

    Dryopteris fragrans is a species of fern and contains flavonoids compounds with medicinal value. This study explain the temperature stress impact flavonoids synthesis in D. fragrans tissue culture seedlings under the low temperature at 4 degree C, high temperature at 35 degree C and moderate temperature at 25 degree C. By using Illumina HiSeq 2000 sequencing, 80.9 million raw sequence reads were de novo assembled into 66,716 non-redundant unigenes. 38,486 unigenes (57.7%) were annotated for their function. 13,973 unigenes and 29,598 unigenes were allocated to gene ontology (GO) and clusters of orthologous group (COG), respectively. 18,989 sequences mapped to 118 Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), 204 genes were involved in flavonoid biosynthesis, regulation and transport. 25,292 and 16,817 unigenes exhibited marked differential expression in response to temperature shifts of 25 degree C to 4 degree C and 25 degree C to 35 degree C, respectively. 4CL and CHS genes involved in flavonoid biosynthesis were tested and suggested that they were responsible for biosynthesis of flavonoids. This study provides the first published data to describe the D. fragrans transcriptome and should accelerate understanding of flavonoids biosynthesis, regulation and transport mechanisms. Since most unigenes described here were successfully annotated, these results should facilitate future functional genomic understanding and research of D. fragrans. (author)

  3. Construction and characterization of 3A-epitope-tagged foot-and-mouth disease virus.

    Science.gov (United States)

    Ma, Xueqing; Li, Pinghua; Sun, Pu; Bai, Xingwen; Bao, Huifang; Lu, Zengjun; Fu, Yuanfang; Cao, Yimei; Li, Dong; Chen, Yingli; Qiao, Zilin; Liu, Zaixin

    2015-04-01

    Nonstructural protein 3A of foot-and-mouth disease virus (FMDV) is a partially conserved protein of 153 amino acids (aa) in most FMDVs examined to date. Specific deletion in the FMDV 3A protein has been associated with the inability of FMDV to grow in primary bovine cells and cause disease in cattle. However, the aa residues playing key roles in these processes are poorly understood. In this study, we constructed epitope-tagged FMDVs containing an 8 aa FLAG epitope, a 9 aa haemagglutinin (HA) epitope, and a 10 aa c-Myc epitope to substitute residues 94-101, 93-101, and 93-102 of 3A protein, respectively, using a recently developed O/SEA/Mya-98 FMDV infectious cDNA clone. Immunofluorescence assay (IFA), Western blot and sequence analysis showed that the epitope-tagged viruses stably maintained and expressed the foreign epitopes even after 10 serial passages in BHK-21 cells. The epitope-tagged viruses displayed growth properties and plaque phenotypes similar to those of the parental virus in BHK-21 cells. However, the epitope-tagged viruses exhibited lower growth rates and smaller plaque size phenotypes than those of the parental virus in primary fetal bovine kidney (FBK) cells, but similar growth properties and plaque phenotypes to those of the recombinant viruses harboring 93-102 deletion in 3A. These results demonstrate that the decreased ability of FMDV to replicate in primary bovine cells was not associated with the length of 3A, and the genetic determinant thought to play key role in decreased ability to replicate in primary bovine cells could be reduced from 93-102 residues to 8 aa residues at positions 94-101 in 3A protein. Copyright © 2015 Elsevier B.V. All rights reserved.

  4. Dissection of two soybean QTL conferring partial resistance to Phytophthora sojae through sequence and gene expression analysis

    Directory of Open Access Journals (Sweden)

    Wang Hehe

    2012-08-01

    Full Text Available Abstract Background Phytophthora sojae is the primary pathogen of soybeans that are grown on poorly drained soils. Race-specific resistance to P. sojae in soybean is gene-for-gene, although in many areas of the US and worldwide there are populations that have adapted to the most commonly deployed resistance to P. sojae ( Rps genes. Hence, this system has received increased attention towards identifying mechanisms and molecular markers associated with partial resistance to this pathogen. Several quantitative trait loci (QTL have been identified in the soybean cultivar ‘Conrad’ that contributes to the expression of partial resistance to multiple P. sojae isolates. Results In this study, two of the Conrad QTL on chromosome 19 were dissected through sequence and expression analysis of genes in both resistant (Conrad and susceptible (‘Sloan’ genotypes. There were 1025 single nucleotide polymorphisms (SNPs in 87 of 153 genes sequenced from Conrad and Sloan. There were 304 SNPs in 54 genes sequenced from Conrad compared to those from both Sloan and Williams 82, of which 11 genes had SNPs unique to Conrad. Eleven of 19 genes in these regions analyzed with qRT-PCR had significant differences in fold change of transcript abundance in response to infection with P. sojae in lines with QTL haplotype from the resistant parent compared to those with the susceptible parent haplotype. From these, 8 of the 11 genes had SNPs in the upstream, untranslated region, exon, intron, and/or downstream region. These 11 candidate genes encode proteins potentially involved in signal transduction, hormone-mediated pathways, plant cell structural modification, ubiquitination, and basal resistance. Conclusions These findings may indicate a complex defense network with multiple mechanisms underlying these two soybean QTL conferring resistance to P. sojae. SNP markers derived from these candidate genes can contribute to fine mapping of QTL and marker assisted breeding for